Monday, 10 March 2025
25.7 C
Singapore
28.4 C
Thailand
20.4 C
Indonesia
26.5 C
Philippines

You can now rent Google’s advanced AI chip: Trillium TPU powers Gemini 2.0 and challenges AMD and Nvidia

Google’s Trillium TPU is now available for rent. It offers unmatched AI training efficiency, energy savings, and powerful computing performance.

Google officially announced that its sixth-generation Tensor Processing Unit (TPU), Trillium, is now available for rent. After months of being offered in preview, this powerful AI chip is ready for general use. Designed to revolutionise AI infrastructure, Trillium has already proven its capabilities by training Gemini 2.0, Google’s cutting-edge AI model.

This news is big if you’re in the AI or tech industry. The chip boasts impressive features, including double the High Bandwidth Memory (HBM) capacity and double the Interchip Interconnect bandwidth compared to its predecessors. According to Google, Trillium offers up to a 2.5x improvement in training performance per dollar, making it an efficient choice for businesses aiming to optimise their AI operations.

Revolutionary performance with Trillium TPU

Trillium isn’t just an upgrade—it’s a leap forward. It delivers more than four times the training performance of its predecessor, while energy efficiency has increased by 67%. Regarding raw computing power, each chip’s peak performance is 4.7 times greater than earlier.

Google’s benchmarks reveal that Trillium also significantly enhances inference tasks. For image generation models like Stable Diffusion XL, throughput has increased thrice. Large language models, crucial in today’s AI landscape, see nearly double the throughput.

The chip’s architecture is also optimised for embedding-intensive models. Its third-generation SparseCore improves dynamic and data-dependent operations efficiency, ensuring smooth performance even under complex workloads.

Powering Google Cloud’s AI Hypercomputer

One of Trillium’s standout achievements is its role in Google Cloud’s AI Hypercomputer. This advanced system integrates over 100,000 Trillium chips, all connected through a Jupiter network fabric with an astounding 13 Petabits/sec bandwidth. The system combines this cutting-edge hardware with open-source software and well-known machine-learning frameworks like JAX, PyTorch, and TensorFlow.

What does this mean for you? Google Cloud customers can now use the same state-of-the-art hardware that trained the Gemini 2.0 AI model. With Trillium’s general availability, high-performance AI technology is no longer reserved for a select few. From image generation to complex language models, Trillium opens the door to countless applications, making it a valuable asset for businesses aiming to stay ahead in the AI race.

Hot this week

Smart Communications reveals 5 key trends shaping customer conversations in 2025

Smart Communications’ 2025 Trends Report highlights key trends in AI, personalisation, and modernisation, shaping the future of customer conversations.

WeChat mini-game advertising sees 113% increase, creating new opportunities for developers

WeChat mini-game ads grew 113% in 2024, opening major growth chances for developers aiming to scale in China’s fast-moving mobile game market.

Apple’s fully modernised Siri might not arrive until 2027

Apple may not release a thoroughly modern version of Siri until 2027, with a major AI-powered upgrade expected to roll out in phases.

ASUS unveils new Intel Xeon 6 server range to boost AI, cloud, and enterprise performance

ASUS launches new Intel Xeon 6 servers, delivering high performance, flexibility, and energy efficiency for AI, cloud, and enterprise computing.

Salesforce launches Agentforce 2dx to embed proactive AI into business workflows

Salesforce launches Agentforce 2dx, letting businesses add proactive AI agents into workflows to boost automation and efficiency.

Jim Jordan subpoenas YouTube over alleged censorship ties to the Biden administration

Jim Jordan subpoenas Alphabet, seeking documents on YouTube’s alleged censorship ties to Biden. Google defends its content policies amid scrutiny.

Dell and Alienware unveil new monitors in Singapore

Dell launches new monitors in Singapore, including the Pro 14 Plus, Pro 34 Plus, and a 75-inch touch monitor for professional use.

Microsoft intensifies AI race to rival OpenAI

Microsoft is increasing its AI efforts, developing its models and testing alternatives to OpenAI technology for products like Copilot.

Google co-founder Larry Page reportedly launching AI-driven manufacturing startup

Google co-founder Larry Page is reportedly launching Dynatomics, an AI-driven manufacturing startup that will optimise product design and production.

Related Articles