Saturday, 23 November 2024
25 C
Singapore

Anthropic’s newest Claude chatbot outperforms GPT-4o in benchmarks

Explore Claude 3.5 Sonnet, Anthropic's latest AI model, now available. It excels at understanding nuance and visual input, outpacing GPT-4o benchmarks.

On Thursday, Anthropic introduced its latest AI language model, Claude 3.5 Sonnet. This new version surpasses the company’s previous top-tier model, the Claude 3 Opus, while operating at twice the speed. You can now explore this enhanced chatbot, even with a free account.

Key features and performance

Claude 3.5 Sonnet is the first in the Claude 3.5 series and is considered Anthropic’s most balanced model. Future releases in this series will include Claude 3.5 Haiku, the fastest model, and Claude 3.5 Opus, the most powerful. These updates will roll out later this year while the current versions remain on Claude 3. The quick release of Sonnet, just months after the Claude 3 family, highlights the rapid pace at which AI companies are developing their technologies.

Anthropic claims that Claude 3.5 Sonnet significantly improves understanding of nuance, humour, and complex prompts, enabling it to write in a more natural tone. Benchmark tests indicate that the new model sets industry records for graduate-level reasoning, undergraduate-level knowledge, and coding proficiency. It surpasses OpenAI’s GPT-4o in many of these benchmarks. However, it is worth noting that the latest models from Claude, ChatGPT, Gemini, and Llama are all closely matched, often scoring within a few percentage points of each other, reflecting the intense competition in the AI field.

Enhanced visual interpretation and a new workspace

The company asserts that Claude 3.5 Sonnet excels at interpreting visual input better than its predecessor, Claude 3.0 Opus. The new model can accurately transcribe text from imperfect images, a feature expected to attract retail, logistics, and financial services customers who require precise data interpretation from charts, graphs, and other visual cues.

Claude’s latest update also includes a new workspace feature called Artifacts. When you prompt the chatbot to generate content such as code, text documents, or web designs, a dedicated window appears next to the chat interface. This Artefacts window allows you to request changes, and it will update with the chatbot’s latest output. Anthropic sees artefacts as a step towards making Claude a hub for broader team collaboration. The company envisions a future where teams and entire organisations can securely centralise their knowledge, documents, and ongoing projects in one shared space, with Claude acting as an on-demand team member.

Availability and pricing

Claude 3.5 Sonnet is now available for anyone with an account to try on Anthropic’s and through the Claude iOS app. Pro and team subscribers on these platforms will benefit from higher token counts. Additionally, you can access it via the Anthropic API, Amazon Bedrock, and Cloud’s Vertex AI. The cost remains the same as the previous model, at US$3 per million input tokens and US$15 per million output tokens.

Hot this week

UGREEN Surge Protector Power Strip review: Fast charging meets smart safety

The UGREEN Surge Protector Power Strip offers fast charging, 10-device support, and surge protection but faces durability concerns.

Hong Kong’s PC Partner moves HQ to Singapore amidst shifting supply chains

PC Partner moves to Singapore and opens an Indonesian factory, diversifying amid US-China tensions and rising global demand.

Apple may have upgraded M4 MacBook Pro with quantum dot display technology

Apple may have added quantum dot technology to the M4 MacBook Pro display, enhancing its colour accuracy and performance while staying eco-friendly.

ASUS unveils next-generation infrastructure solutions at SC24 with NVIDIA and Ubitus collaboration

ASUS unveils next-gen AI infrastructure solutions at SC24, featuring AI servers, advanced cooling, and green-energy data centres.

Roblox tightens chat rules for children under 13

Roblox introduces safety updates limiting communication for users under 13, adds parental tools, and changes content access for younger players.

Anglo-Chinese School students win top prize in Samsung Solve for Tomorrow 2024

Anglo-Chinese School students win Samsung Solve for Tomorrow 2024 with innovative smart glasses for the hearing impaired. Other projects celebrated.

DXC Technology and ServiceNow partner to accelerate generative AI adoption for businesses

DXC Technology partners with ServiceNow to fast-track generative AI adoption through a new Centre of Excellence, combining industry expertise and AI solutions.

Avenir CRYPTO unveils US$500 million initiative to lead global crypto innovation

Avenir CRYPTO’s US$500M initiative tackles market fragmentation and boosts crypto trading innovation at its flagship event in Singapore.

New STEM foundation launched at Expand Space to inspire youth in underserved communities

Expand Space 2024 launches a new STEM Foundation to empower underserved youth with hands-on opportunities in Deep Tech, robotics, and AI.

Related Articles

Popular Categories