Sunday, 23 February 2025
25.3 C
Singapore
36.8 C
Thailand
22.1 C
Indonesia
27.2 C
Philippines

Anthropic’s newest Claude chatbot outperforms GPT-4o in benchmarks

Explore Claude 3.5 Sonnet, Anthropic's latest AI model, now available. It excels at understanding nuance and visual input, outpacing GPT-4o benchmarks.

On Thursday, Anthropic introduced its latest AI language model, Claude 3.5 Sonnet. This new version surpasses the companyโ€™s previous top-tier model, the Claude 3 Opus, while operating at twice the speed. You can now explore this enhanced chatbot, even with a free account.

Key features and performance

Claude 3.5 Sonnet is the first in the Claude 3.5 series and is considered Anthropicโ€™s most balanced model. Future releases in this series will include Claude 3.5 Haiku, the fastest model, and Claude 3.5 Opus, the most powerful. These updates will roll out later this year while the current versions remain on Claude 3. The quick release of Sonnet, just months after the Claude 3 family, highlights the rapid pace at which AI companies are developing their technologies.

Anthropic claims that Claude 3.5 Sonnet significantly improves understanding of nuance, humour, and complex prompts, enabling it to write in a more natural tone. Benchmark tests indicate that the new model sets industry records for graduate-level reasoning, undergraduate-level knowledge, and coding proficiency. It surpasses OpenAIโ€™s GPT-4o in many of these benchmarks. However, it is worth noting that the latest models from Claude, ChatGPT, Gemini, and Llama are all closely matched, often scoring within a few percentage points of each other, reflecting the intense competition in the AI field.

Enhanced visual interpretation and a new workspace

The company asserts that Claude 3.5 Sonnet excels at interpreting visual input better than its predecessor, Claude 3.0 Opus. The new model can accurately transcribe text from imperfect images, a feature expected to attract retail, logistics, and financial services customers who require precise data interpretation from charts, graphs, and other visual cues.

Claudeโ€™s latest update also includes a new workspace feature called Artifacts. When you prompt the chatbot to generate content such as code, text documents, or web designs, a dedicated window appears next to the chat interface. This Artefacts window allows you to request changes, and it will update with the chatbotโ€™s latest output. Anthropic sees artefacts as a step towards making Claude a hub for broader team collaboration. The company envisions a future where teams and entire organisations can securely centralise their knowledge, documents, and ongoing projects in one shared space, with Claude acting as an on-demand team member.

Availability and pricing

Claude 3.5 Sonnet is now available for anyone with an account to try on Anthropicโ€™s website and through the Claude iOS app. Pro and team subscribers on these platforms will benefit from higher token counts. Additionally, you can access it via the Anthropic API, Amazon Bedrock, and Google Cloudโ€™s Vertex AI. The cost remains the same as the previous model, at US$3 per million input tokens and US$15 per million output tokens.

Hot this week

Hitachi Vantara: Building AI success without falling into financial traps

Discover how Hitachi Vantara guides Southeast Asia firms to maximise AI's ROI through strategic planning, scalable infrastructure, and targeted use cases.

How SMBs can stay connected affordably and efficiently

Discover how SMBs can stay connected affordably with 5G solutions and managed services, ensuring seamless operations without high costs.

Duolingoโ€™s Cybertruck stunt โ€˜killsโ€™ mascot Duo, and users canโ€™t get enough

Duolingoโ€™s marketing stunt claims its mascot, Duo the Owl, was hit by a Cybertruckโ€”boosting app engagement and sparking a viral campaign.

OpenAI moves to loosen ChatGPT restrictions

OpenAI updates ChatGPTโ€™s policies to promote intellectual freedom, allowing for more perspectives on controversial topics while maintaining neutrality.

88% of top Asia Pacific companies still vulnerable to email fraud amid rising cyber threats

88% of top Asia Pacific companies lack strong email security, exposing customers to cyber threats as phishing attacks surge. Experts urge action.

BT and Equinix expand partnership to enhance global interconnectivity

BT and Equinix expand their partnership to boost interconnectivity for multinational businesses, deploying BTโ€™s Global Fabric NaaS in 40+ Equinix data centres worldwide.

LG unveils new SKS branding for luxury kitchen suite at KBIS 2025

LG rebrands Signature Kitchen Suite to SKS at KBIS 2025, introducing new luxury appliances like a free-zone induction range and an advanced island system.

LG unveils advanced laundry solutions at KBIS 2025

LG unveils its latest heat pump washer and dryer lineup at KBIS 2025, featuring AI-driven efficiency, ventless design, and smart connectivity.

The Vision Pro is now easier to share, and getting a new iPhone app

Appleโ€™s Vision 2.4 update makes sharing the Vision Pro easier, introduces a new iPhone app for content discovery, and adds the Spatial Gallery app.

Related Articles