Friday, 7 March 2025
26.7 C
Singapore
29 C
Thailand
20.7 C
Indonesia
27.4 C
Philippines

Microsoft supports new startups as it launches an AI processor that skips GPUs and expensive memory

Discover how d-Matrix's new AI processor, Corsair, backed by Microsoft, redefines AI inference with a GPU-free and cost-effective solution.

D-Matrix Inc., a hardware startup based in Santa Clara, California, has unveiled its first AI processor, Corsair, which aims to redefine AI inference. This innovative product does away with traditional GPUs and costly high-bandwidth memory (HBM), offering a more efficient and cost-effective solution.

Supported by Microsoft and embracing cutting-edge advancements, Corsair is available to early-access customers and is expected to reach broader availability by the second quarter of 2025.

What makes Corsair stand out?

Corsair is purpose-built to tackle demanding AI inference tasks, particularly those involving generative AI models. The processor achieves remarkable speeds, handling 60,000 tokens per second at 1 millisecond per token when running models like Llama3 8B on a single server.

For larger-scale applications, such as Llama3 70B, Corsair achieves 30,000 tokens per second at 2 milliseconds per token on a single rack. This performance significantly lowers energy consumption and operational costs compared to traditional GPU-based systems.

Built on d-Matrixโ€™s Nighthawk and Jayhawk II tiles, Corsair utilises a 6nm manufacturing process. Each Nighthawk tile has four neural cores and a RISC-V CPU, optimised for large-model inference. It incorporates digital in-memory computation (DIMC) and supports versatile datatype processing, including block floating point (BFP).

Corsairโ€™s chiplet packaging integrates memory and computation to boost efficiency. It adheres to the PCIe Gen5 full-height, full-length card form factor, making it compatible with DMX Bridge cards for scalable performance. The card is powered by 2400 TFLOPs of 8-bit peak computing power, 2GB of integrated performance memory, and 256GB of off-chip memory capacity.

A partnership with Nvidiaโ€™s key ally

Micron Technology, a significant partner of Nvidia, is collaborating with d-Matrix to support Corsairโ€™s development and growth.

Although initially slated for release in late 2023, d-Matrix restructured its architecture to align with the growing demand for generative AI. This shift allowed Corsair to incorporate advanced features tailored for transformer models, agentic AI, and emerging applications like interactive video generation.

โ€œOur vision for d-Matrix was to address the massive computing challenges of generative AI and transformers,โ€ said Sid Sheth, co-founder and CEO of d-Matrix. โ€œCorsair is a groundbreaking platform, delivering blazing-fast token generation for interactive AI applications, making generative AI commercially viable.โ€

Corsairโ€™s focus on cost efficiency, scalability, and high-performance positions it as a promising solution in the rapidly evolving AI landscape.

Hot this week

BenQ launches GP520 4K HDR home entertainment projector with Google TV and Netflix

BenQ launches GP520 4K HDR projector with built-in Google TV and Netflix, designed for modern homes with stunning visuals and sound.

Da Nang investment forum 2025 highlights Vietnamโ€™s push to become a financial hub

Da Nang Investment Forum 2025 highlights plans to make Da Nang a financial hub, strengthening economic ties with Singapore and global investors.

Apple unveils MacBook Air with M4 chip, new Sky Blue colour, and lower prices

Apple unveils the MacBook Air with the M4 chip, a Sky Blue colour, and lower prices. Pre-orders are open now, and retail availability will be on March 12.

OpenAI plans to integrate Sora into ChatGPT

OpenAI plans to integrate its AI video tool, Sora, into the ChatGPT app alongside new features like GPT-4.5 and the Operator tool.

Smart Communications reveals 5 key trends shaping customer conversations in 2025

Smart Communicationsโ€™ 2025 Trends Report highlights key trends in AI, personalisation, and modernisation, shaping the future of customer conversations.

ROG Astral GeForce RTX 5090 OC Edition sets six overclocking records

ASUS ROG Astral GeForce RTX 5090 OC Edition breaks six overclocking records in global benchmark tests with advanced cooling and boosted clock speeds.

Crunchyroll announces cinema release dates for Demon Slayer: Kimetsu no Yaiba Infinity Castle

Crunchyroll announces cinema release dates for Demon Slayer: Kimetsu no Yaiba Infinity Castle, starting 14 August 2025 in Singapore and Malaysia.

Trump proposes US crypto reserve, raising concerns over economic impact

Trump proposes a US Crypto Reserve, raising concerns about its impact on the economy and the dollar. Could this move benefit crypto donors over taxpayers?

Apple unveils MacBook Air with M4 chip, new Sky Blue colour, and lower prices

Apple unveils the MacBook Air with the M4 chip, a Sky Blue colour, and lower prices. Pre-orders are open now, and retail availability will be on March 12.

Related Articles