Monday, 3 February 2025
25.6 C
Singapore
21.3 C
Thailand
21.4 C
Indonesia
26 C
Philippines

OpenAI and Broadcom team up to create AI chip for faster, smarter inference

OpenAI partners with Broadcom to create a custom AI inference chip to reduce reliance on Nvidia and expand its AI infrastructure.

OpenAI is collaborating with Broadcom to develop a custom chip to run artificial intelligence (AI) models efficiently after their training phase. According to sources close to the matter, the partnership aims to create a chip specialised for “inference”โ€”a process that allows AI to respond to user inputs based on its pre-trained knowledge. This move comes as the need for chips suited for inference is set to grow in parallel with AI adoption as companies turn to AI to handle more sophisticated tasks.

OpenAI and Broadcom are reportedly consulting with Taiwan Semiconductor Manufacturing Co. (TSMC), the world’s leading chip manufacturer, to aid in chip production. Although sources indicate that OpenAI has considered developing its chip for the past year, the discussions remain preliminary, focusing on faster solutions through collaborations rather than solo production.

A custom path to AI innovation

OpenAI intends to depart from the established graphics processing unit (GPU) market by shifting towards inference-based chips. The companyโ€™s focus on inference-specific chips marks a difference from Nvidia, which dominates the GPU space and has traditionally powered the initial AI model training and development phases. These GPUs are instrumental in building large, generative AI models but are not as efficient in inference, where a lightweight, specialised chip is better suited.

Industry sources reveal that the chip production journeyโ€”spanning design, prototyping, and large-scale productionโ€”can be time-consuming and costly. OpenAI has strategically collaborated with established players to navigate these hurdles rather than pursue chip manufacturing independently. While OpenAI had previously contemplated creating its manufacturing network, the immediate need for high-performance chips prompted the decision to leverage Broadcomโ€™s resources and TSMCโ€™s facilities.

OpenAI has not responded to requests for comment, while Broadcomโ€™s representatives remained silent on the partnership, and TSMC declined to address rumours. However, OpenAIโ€™s strategy mirrors similar moves by other tech giants as they seek alternatives to Nvidia. The rising demand for diverse AI chips has led companies to explore collaborations and invest in different types of processors, including those from Advanced Micro Devices.

Broadcomโ€™s expertise and the AI future

OpenAI and Broadcom team up to create AI chip for faster, smarter inference
Image credit: SDxCentral

Broadcom brings extensive experience as the most prominent designer of application-specific integrated circuits (ASICs). These chips are custom-built for specific tasks, and Broadcomโ€™s client list includes some of the tech industry’s biggest players, such as Google, Meta, and ByteDance. CEO Hock Tan previously commented on Broadcomโ€™s approach, saying the company is cautious about adding new clients and only commits to full-scale production for projects that meet strict requirements. This business model could work well with OpenAIโ€™s needs, as the start-up seeks a specialised chip that can handle AI inference tasks without requiring massive investments in manufacturing infrastructure.

Despite OpenAIโ€™s primary reliance on Nvidia GPUs to develop and train its models, the search for a more sustainable and efficient solution has become critical as AI adoption grows. OpenAIโ€™s service requirementsโ€”particularly in data centres, where vast amounts of computing power are needed to process AI workloadsโ€”are fuelling the pursuit of custom-built chips to meet demand at scale. To fund this expansion, OpenAI CEO Sam Altman has contacted US government agencies and global investors, including some in the Middle East, emphasising the need for enhanced data infrastructure to support future growth.

Preparing data centres and future partnerships

The shift towards custom chip solutions marks another important step in OpenAIโ€™s broader vision to enhance its AI infrastructure. The company invests in data centre partnerships to provide a robust home for these new AI chips. With the rise of generative AI and large-scale language models, the demand for specialised chips capable of efficiently processing inference requests is surging.

As the industry looks for alternatives to Nvidia, OpenAIโ€™s strategic collaboration with Broadcom and consultations with TSMC could pave the way for an innovative solution to handle the vast demands of next-generation AI. By exploring custom chip solutions and building alliances, OpenAI is preparing itself to meet future AI demands with both speed and efficiency while keeping a close eye on developing data centres that can host these advanced processors.

This effort marks OpenAIโ€™s continued commitment to creating faster, smarter, and more accessible AI technology that meets users’ ever-growing expectations worldwide.

Hot this week

OPPO claims Find N5 is thinner than Appleโ€™s iPad Pro (M4)

OPPO is teasing its Find N5 foldable phone, claiming itโ€™s thinner than Appleโ€™s iPad Pro (M4). It is expected to launch globally in February 2025.

DeepSeekโ€™s app disappears from Apple and Google stores in Italy

After regulators raised concerns over its data privacy practices, DeepSeekโ€™s app is no longer available in Apple and Google stores in Italy.

Nothing announces the March 4 event, with Phone (3) expected to debut

Nothing's next event, on March 4, 2025, is set to unveil the anticipated Phone (3), which promises innovation with its "Power in Perspective" tagline.

Pentagon moves to block DeepSeek after staff access Chinese servers

The Pentagon is blocking DeepSeek after employees unknowingly connected work computers to Chinese servers, raising national security concerns.

Former Intel CEO Pat Gelsinger embraces DeepSeekโ€™s AI model for his startup, Gloo

DeepSeekโ€™s open-source AI model, R1, impressed former Intel CEO Pat Gelsinger. It is reshaping the AI industry with affordability and innovation.

X widens legal battle over alleged advertiser boycott

X expands its lawsuit over an alleged advertiser boycott, adding Lego, Nestlรฉ, and Pinterest to the case, claiming significant losses in ad revenue.

Newgen named a leader in IDC MarketScape reports for intelligent CCM and automated document generation

Newgen Software has been recognised as a leader in two IDC MarketScape reports for its AI-driven customer communications and document generation solutions.

YouTube expands its Discord-like Communities to more creators

YouTube is expanding its Communities feature, giving more creators a dedicated space to engage with fans directly on the platform.

OpenAI unveils o3-mini reasoning model with free ChatGPT access

OpenAI launches o3-mini, a faster AI reasoning model for free ChatGPT users with rate limits. It is expanding its features for paid users and developers.

Related Articles