Sunday, 9 November 2025
29.2 C
Singapore
26.9 C
Thailand
21 C
Indonesia
28.2 C
Philippines

Elon Musk’s xAI set to launch Grok-1.5, rivalling GPT-4’s prowess

Elon Musk's xAI is launching Grok-1.5, a cutting-edge AI model with enhanced capabilities, set to rival top models like GPT-4 and Claude 3.

In a groundbreaking announcement, Elon Musk’s xAI is poised to unveil Grok-1.5 next week, an upgrade to its proprietary large language model (LLM), Grok-1. This new version boasts enhanced reasoning and problem-solving abilities, edging closer to the performance benchmarks set by leading LLMs like OpenAI’s GPT-4 and Anthropic’s Claude 3.

Bridging the gap with Grok-1.5

Just weeks after making Grok-1 open-source, xAI is taking significant strides with Grok-1.5, which promises substantial improvements in AI benchmarks across coding, math, and language understanding tasks. According to xAI, Grok-1.5 has demonstrated a 50.6% score on the MATH benchmark and an impressive 90% on the GSM8K benchmark, indicative of its superior math problem-solving skills. Moreover, it has achieved a 74.1% score on the HumanEval benchmark, underscoring its advanced code generation and problem-solving capabilities.

Notably, Grok-1.5’s score of 81.3% on the MMLU benchmark, which assesses language understanding across various tasks, represents a significant leap from Grok-1’s 73%. This achievement signifies Grok-1.5’s enhanced comprehension skills and its capacity to process up to 128,000 tokens, allowing it to handle complex prompts and analyse long documents more effectively than its predecessor.

Nearing the zenith of AI performance

Grok-1.5’s advancements place it in close competition with other significant LLMs, though it still trails behind some on benchmarks like the MMLU and GSM8K. Despite these gaps, Grok-1.5 leads in the HumanEval benchmark, excluding its performance against Claude 3 Opus. With continuous improvements, xAI anticipates that the forthcoming Grok-2 will surpass current AI models on all metrics, according to Elon Musk.

Unveiling to the world

Set for deployment next week, Grok-1.5 will initially be available to early testers and Grok chatbot users on the X platform. This phased rollout aims to enhance the model further and introduce new features, catering to a broader user base over time. Musk’s strategy to integrate Grok into the X platform, coupled with subscription benefits for specific users, underscores his vision for widespread adoption of both the AI model and the platform.

In conclusion, Grok-1.5 represents a significant leap forward in AI, bringing us closer to models that can understand and solve complex problems with near-human proficiency. With its deployment, users can look forward to engaging with an AI that pushes the boundaries of what’s possible, marking a new era in the evolution of artificial intelligence.

Hot this week

WhatsApp reportedly testing companion app for Apple Watch

WhatsApp is testing a companion app for Apple Watch, allowing users to view and reply to messages directly from their wrist.

Sharp launches AQUOS sense10 with AI-powered features for photography and communication

Sharp unveils the AQUOS sense10 with AI-powered photo and voice features, Snapdragon 7s Gen 3 performance, and long battery life.

AI-powered water quality sensor and smart keyboard for Parkinson’s named global winners of the James Dyson Award

WaterSense and OnCue win the 2025 James Dyson Award for tackling water pollution and improving life for people with Parkinson’s.

New Relic launches AI monitoring and MCP server to drive enterprise observability

New Relic launches Agentic AI Monitoring and MCP Server to boost enterprise observability and accelerate AI adoption across workflows.

Apple delays OLED screen for MacBook Air until 2028

Apple delays OLED screen for MacBook Air until 2028, prioritising other devices in its display upgrade roadmap.

Workato launches AI Lab in Singapore to drive applied AI innovation and workforce development

Workato opens its AI Lab in Singapore to accelerate applied AI innovation, create skilled jobs, and strengthen industry-academia collaboration.

Synology marks 25 years with launch of next-generation enterprise solutions

Synology celebrates its 25th anniversary with new AI-powered enterprise storage and cybersecurity solutions for digital transformation.

Meta introduces a quick connect shortcut for smart glasses

Meta’s new quick connect feature lets smart glasses users call or text with one touch, reducing reliance on “hey Meta” voice commands.

Square Enix cuts UK and US jobs as it shifts focus back to Japan

Square Enix lays off UK and US developers as it consolidates operations in Japan and expands its use of AI in game development.

Related Articles

Popular Categories