Sunday, 2 February 2025
30.7 C
Singapore
36.6 C
Thailand
27.7 C
Indonesia
26.7 C
Philippines

You can now rent Google’s advanced AI chip: Trillium TPU powers Gemini 2.0 and challenges AMD and Nvidia

Google’s Trillium TPU is now available for rent. It offers unmatched AI training efficiency, energy savings, and powerful computing performance.

Google officially announced that its sixth-generation Tensor Processing Unit (TPU), Trillium, is now available for rent. After months of being offered in preview, this powerful AI chip is ready for general use. Designed to revolutionise AI infrastructure, Trillium has already proven its capabilities by training Gemini 2.0, Google’s cutting-edge AI model.

This news is big if you’re in the AI or tech industry. The chip boasts impressive features, including double the High Bandwidth Memory (HBM) capacity and double the Interchip Interconnect bandwidth compared to its predecessors. According to Google, Trillium offers up to a 2.5x improvement in training performance per dollar, making it an efficient choice for businesses aiming to optimise their AI operations.

Revolutionary performance with Trillium TPU

Trillium isn’t just an upgrade—it’s a leap forward. It delivers more than four times the training performance of its predecessor, while energy efficiency has increased by 67%. Regarding raw computing power, each chip’s peak performance is 4.7 times greater than earlier.

Google’s benchmarks reveal that Trillium also significantly enhances inference tasks. For image generation models like Stable Diffusion XL, throughput has increased thrice. Large language models, crucial in today’s AI landscape, see nearly double the throughput.

The chip’s architecture is also optimised for embedding-intensive models. Its third-generation SparseCore improves dynamic and data-dependent operations efficiency, ensuring smooth performance even under complex workloads.

Powering Google Cloud’s AI Hypercomputer

One of Trillium’s standout achievements is its role in Google Cloud’s AI Hypercomputer. This advanced system integrates over 100,000 Trillium chips, all connected through a Jupiter network fabric with an astounding 13 Petabits/sec bandwidth. The system combines this cutting-edge hardware with open-source software and well-known machine-learning frameworks like JAX, PyTorch, and TensorFlow.

What does this mean for you? Google Cloud customers can now use the same state-of-the-art hardware that trained the Gemini 2.0 AI model. With Trillium’s general availability, high-performance AI technology is no longer reserved for a select few. From image generation to complex language models, Trillium opens the door to countless applications, making it a valuable asset for businesses aiming to stay ahead in the AI race.

Hot this week

Apple’s revenue rises despite an 11% drop in China sales

Apple’s Q1 2025 revenue rose 4% to US$124.3B, despite an 11% decline in China iPhone sales. Strong growth in services and Mac sales helped offset losses.

Microsoft in talks to acquire TikTok as Trump pushes for a bidding war

Microsoft is in talks to acquire TikTok after Trump’s executive order delayed the app’s U.S. ban. A bidding war could be on the horizon.

XIAOVV Smart Baby Monitor review: 2K wide-angle lens and real-time alerts

An evaluative look at the XIAOVV Smart Baby Monitor, focusing on its design aesthetics, performance capabilities, and app functionality for modern caregiving.

OPPO claims Find N5 is thinner than Apple’s iPad Pro (M4)

OPPO is teasing its Find N5 foldable phone, claiming it’s thinner than Apple’s iPad Pro (M4). It is expected to launch globally in February 2025.

Resolution Games unveils Battlemarked, a new VR Dungeons & Dragons game

Resolution Games and Wizards of the Coast unveil Battlemarked, a new VR D&D game. Featuring turn-based combat and story-driven campaigns, it launches soon.

Newgen named a leader in IDC MarketScape reports for intelligent CCM and automated document generation

Newgen Software has been recognised as a leader in two IDC MarketScape reports for its AI-driven customer communications and document generation solutions.

YouTube expands its Discord-like Communities to more creators

YouTube is expanding its Communities feature, giving more creators a dedicated space to engage with fans directly on the platform.

OpenAI unveils o3-mini reasoning model with free ChatGPT access

OpenAI launches o3-mini, a faster AI reasoning model for free ChatGPT users with rate limits. It is expanding its features for paid users and developers.

Samsung Galaxy S25 Ultra dominates pre-orders in South Korea

The Samsung Galaxy S25 Ultra leads pre-orders in South Korea, making up 60-70% of sales. Find out which colours are trending and how to pre-order yours.

Related Articles