4 results found

Intel, Foxconn, and SambaNova Systems have partnered to build rackscale AI infrastructure, unveiled at Computex 2026. This collaboration targets the shift from AI training to inference, aiming to re-establish Intel Xeon CPUs at the core of data centers by pairing them with SambaNova's SN-50 RDUs for efficient, cost-effective performance. Foxconn will handle system integration and develop CPU-dense variants.
This article explores how a 10-year-old Intel Xeon E5-2620 v4 server with 128 GB DDR3 RAM and no GPU can run a modern LLM like Gemma 4 26B-A4B at reading speed. It highlights that LLM inference is often memory-bound and showcases deep optimization techniques using `ik_llama.cpp`, including speculative decoding, CPU-aware MoE routing, advanced memory management, and specialized attention kernels. The success demonstrates that granular software control can unlock significant performance on older, abundant-RAM hardware.

Intel's Xeon 6+ 'Clearwater Forest' pushes data center compute density with up to 288 E-cores on 18A. While claiming significant per-thread gains over AMD and generational uplifts, its focused benchmarks and higher TDP warrant careful consideration.

Intel and SambaNova's new heterogeneous AI inference platform combines GPUs/AI accelerators, SambaNova RDUs, and Intel Xeon 6 processors. Targeting a broad range of agentic workloads for H2 2026, it promises easy data center integration and competitive performance, aiming to challenge market leaders.