Thursday, 2 January 2025
30.9 C
Singapore

You can now rent Google’s advanced AI chip: Trillium TPU powers Gemini 2.0 and challenges AMD and Nvidia

Google’s Trillium TPU is now available for rent. It offers unmatched AI training efficiency, energy savings, and powerful computing performance.

officially announced that its sixth-generation Tensor Processing Unit (TPU), Trillium, is now available for rent. After months of being offered in preview, this powerful AI chip is ready for general use. Designed to revolutionise AI infrastructure, Trillium has already proven its capabilities by Gemini 2.0, Google’s cutting-edge AI model.

This news is big if you’re in the AI or tech industry. The chip boasts impressive features, including double the High Bandwidth Memory (HBM) capacity and double the Interchip Interconnect bandwidth compared to its predecessors. According to Google, Trillium offers up to a 2.5x improvement in training performance per dollar, making it an efficient choice for businesses aiming to optimise their AI operations.

Revolutionary performance with Trillium TPU

Trillium isn’t just an upgrade—it’s a leap forward. It delivers more than four times the training performance of its predecessor, while energy efficiency has increased by 67%. Regarding raw computing power, each chip’s peak performance is 4.7 times greater than earlier.

Google’s benchmarks reveal that Trillium also significantly enhances inference tasks. For image generation models like Stable Diffusion XL, throughput has increased thrice. Large language models, crucial in today’s AI landscape, see nearly double the throughput.

The chip’s architecture is also optimised for embedding-intensive models. Its third-generation SparseCore improves dynamic and data-dependent operations efficiency, ensuring smooth performance even under complex workloads.

Powering Google Cloud’s AI Hypercomputer

One of Trillium’s standout achievements is its role in Google Cloud’s AI Hypercomputer. This advanced system integrates over 100,000 Trillium chips, all connected through a Jupiter network fabric with an astounding 13 Petabits/sec bandwidth. The system combines this cutting-edge hardware with open-source software and well-known machine-learning frameworks like JAX, PyTorch, and TensorFlow.

What does this mean for you? Google Cloud customers can now use the same state-of-the-art hardware that trained the Gemini 2.0 AI model. With Trillium’s general availability, high-performance AI technology is no longer reserved for a select few. From image generation to complex language models, Trillium opens the door to countless applications, making it a valuable asset for businesses aiming to stay ahead in the AI race.

Hot this week

Twilio: How customer engagement will be evolving in APJ by 2025

Explore how Twilio's insights will shape customer engagement strategies in APJ by 2025, focusing on trust, AI integration, data management, and innovative loyalty programmes.

Huawei slashes smartphone prices to compete for high-end market share in China

Huawei slashes prices on flagship smartphones, including the Mate X5 and Pura 70 Ultra, as it fights for market share in China's premium segment.

US Treasury Department faces major security breach

The US Treasury Department faced a cyberattack linked to a China-state-sponsored hacker exploiting third-party software. No ongoing access was found.

Asus unveils new ROG Strix, and ProArt monitors ahead of CES 2025

Asus unveils new monitors, including ROG Strix XG32UCG for gamers, ProArt 6K and 4K displays for creators, and updated ZenScreen models for portability.

LG’s Gram laptops upgraded with AI tools and advanced Intel chips

LG updates Gram laptops with AI tools, Intel chips, and enhanced features, offering lightweight designs and improved connectivity.

LG unveils AI-powered mobility solutions at CES 2025

LG introduces its MX platform at CES 2025, bringing AI-powered smart home technology to mobility spaces for a seamless lifestyle experience.

Business optimism grows for 2025, but manpower costs and demand uncertainty remain key concerns

Singapore businesses remain optimistic for 2025 despite rising costs and uncertain demand, with calls for continued government support and workforce integration.

OPPO’s Reno13 series launches in Singapore on January 4

Explore the OPPO Reno13 series, launching January 4 in Singapore, with powerful performance, underwater photography, and exclusive launch deals.

Twilio: How customer engagement will be evolving in APJ by 2025

Explore how Twilio's insights will shape customer engagement strategies in APJ by 2025, focusing on trust, AI integration, data management, and innovative loyalty programmes.

Related Articles