Friday, 4 April 2025
29.3 C
Singapore
34.1 C
Thailand
27 C
Indonesia
28.6 C
Philippines

OpenAI launches GPT-4o: Fast, free, and versatile

OpenAI's GPT-4o offers near-human response times and versatile multimodal capabilities, challenging existing tools and enhancing developer access.

During its Spring update, OpenAI announced the release of GPT-4o (โ€œoโ€ for โ€œomniโ€), its latest flagship generative AI model. This new model offers near-human response times, with audio inputs processed in just 232 milliseconds, similar to human conversational speeds.

GPT-4o is designed to handle queries in various formats, including text, audio, and images, and it generates responses in the same formats. In its demonstration, the model was showcased in voice mode, allowing users to speak directly to ChatGPT. The blog page also features six preset voicesโ€”three male and three femaleโ€”that can read the page aloud.

Advanced recognition and summarisation

The model’s demos highlighted GPT-4o’s ability to recognise and respond to diverse inputs such as screenshots, videos, photos, documents, charts, facial expressions, and handwritten notes. Notably, GPT-4o can create detailed summaries of video presentations and meetings with multiple attendees from voice recordings, posing a potential challenge to transcription tools like Otter.ai.

Developer access and new applications

Developers can now access GPT-4o through the API as a text and vision model. This access is available at half the cost and with five times the rate limits compared to GPT-4 Turbo. Additionally, a new desktop app for Mac is available, with a Windows version on the way.

OpenAIโ€™s release of GPT-4o underscores its dedication to advancing AI technology and making it more accessible. With its impressive speed and versatility, GPT-4o offers vast potential applications for both individuals and businesses.

Hot this week

NVIDIA Blackwell platform sets new performance benchmark in MLPerf Inference v5.0

NVIDIAโ€™s GB200 NVL72 sets a new benchmark in MLPerf Inference v5.0 with 30x token throughput, leading AI factory performance.

Nothing Phone (3a) Pro review: A mid-range marvel with standout zoom

Nothing Phone (3a) Pro blends standout design, powerful zoom camera, and smart features, making it a top choice in the mid-range segment.

Misconceptions about STEM careers continue to deter young women in Singapore

New research shows stereotypes and lack of support are deterring young women from STEM careers, posing a risk to Singaporeโ€™s innovation goals.

Huawei reports 38% revenue surge as smartphone sales soar

Despite US sanctions, Huaweiโ€™s consumer business revenue surged 38% in 2024, driven by strong smartphone sales and home-grown chip production.

Exabeam introduces Nova, an agentic AI that boosts cybersecurity operations

Exabeam unveils Nova, a proactive AI agent that boosts security team productivity and reduces incident investigation time by over 50%.

Amazon introduces AI shopping assistant to buy from third-party sites

Amazon is testing "Buy for Me," an AI shopping tool that buys from third-party sites. Please find out how it works and what it means for online shopping.

How ByteDance’s AI investment is reshaping the future of technology

ByteDance is investing US$12 billion in AI infrastructure for 2025 to enhance platforms like TikTok and drive innovation across industries, with a focus on acquiring AI chips globally.

Spotify introduces AI-powered ads and programmatic ad buying

Spotify unveils AI-powered ads and the Spotify Ad Exchange, making it easier for advertisers to reach Gen Z listeners with real-time bidding.

YouTube expands shopping affiliate programme in Singapore through Shopee partnership

YouTube teams up with Shopee to launch its Shopping affiliate programme in Singapore, giving creators new ways to monetise their content.

Related Articles