Monday, 24 February 2025
25.8 C
Singapore
29.3 C
Thailand
20 C
Indonesia
25.9 C
Philippines

Elon Musk’s AI company, xAI, enhances Grok with multimodal inputs

xAI, Elon Musk's AI company, adds image capabilities to Grok, offering enhanced features for users and closing the gap with competitors.

As revealed in public developer documents, Elon Musk’s artificial intelligence (AI) company, xAI, is working on integrating multimodal inputs into its Grok chatbot. This development implies that users will soon be able to upload images to Grok and receive text-based responses.

In a recent blog post by xAI, a teaser indicated that the upcoming Grok-1.5V version will introduce “multimodal models across various domains.” The latest updates in the developer documents suggest advancements towards the implementation of a new model.

The developer documents showcase a sample Python script illustrating how developers can leverage the xAI software development kit library to generate responses based on both text and images. By reading an image file, setting up a text prompt, and utilising the xAI SDK, developers can create responses efficiently.

Enhancements for Grok users

Grok, initially launched by xAI in November 2023, is accessible to users subscribed to the X Premium Plus service. The most recent update, Grok 1.5, introduced enhanced reasoning capabilities to the platform in March.

The model is trained on various textual data from publicly available sources up to Q3 2023 and datasets meticulously reviewed by human evaluators. While Grok-1 was not trained on xAI data, it possesses real-time knowledge of the world, including information from x posts.

Founded by Elon Musk in March 2023, xAI is a newcomer to the AI industry, lagging behind competitors like OpenAI’s ChatGPT. However, xAI’s blog post highlights that their Grok 1.5 model is narrowing the gap with GPT-4 across different benchmarks, covering a broad spectrum of academic problems from grade school to high school.

Challenges in benchmarking Large Language Models

Benchmarking large language models can be contentious. Models may excel in benchmarks if the data is part of their training set, akin to memorising answers rather than understanding the content. Despite these challenges, xAI is making significant strides with Grok’s development.

The landscape of AI is evolving towards multimodal conversational chatbots, with notable advancements announced at events like Google I/O and OpenAI’s release of GPT-4o. Grok’s integration of multimodal capabilities signifies a step forward in keeping pace with industry trends and enhancing the user experience.

Hot this week

Addressing growing cyber threats with advanced security solutions

Commvault’s SHIFT 2025 roadshow in Kuala Lumpur will equip Malaysian enterprises with strategies to strengthen cyber resilience and ensure business continuity.

BT and Equinix expand partnership to enhance global interconnectivity

BT and Equinix expand their partnership to boost interconnectivity for multinational businesses, deploying BT’s Global Fabric NaaS in 40+ Equinix data centres worldwide.

Google Play Books introduces direct purchases on iOS

Google Play Books now allows direct purchases on iOS, bypassing Apple’s fees. A new “Get book” button links users to Google Play for payments.

Federal agency to deactivate charging stations and offload electric vehicles

The GSA is shutting down its EV chargers nationwide, calling them “not mission critical,” and plans to offload newly purchased electric vehicles.

Apple’s first foldable iPhone might not look like a Galaxy Z Fold

Apple’s foldable iPhone may not resemble Samsung’s Z Fold. A wider design and later launch are expected.

BT and Equinix expand partnership to enhance global interconnectivity

BT and Equinix expand their partnership to boost interconnectivity for multinational businesses, deploying BT’s Global Fabric NaaS in 40+ Equinix data centres worldwide.

LG unveils new SKS branding for luxury kitchen suite at KBIS 2025

LG rebrands Signature Kitchen Suite to SKS at KBIS 2025, introducing new luxury appliances like a free-zone induction range and an advanced island system.

LG unveils advanced laundry solutions at KBIS 2025

LG unveils its latest heat pump washer and dryer lineup at KBIS 2025, featuring AI-driven efficiency, ventless design, and smart connectivity.

The Vision Pro is now easier to share, and getting a new iPhone app

Apple’s Vision 2.4 update makes sharing the Vision Pro easier, introduces a new iPhone app for content discovery, and adds the Spatial Gallery app.

Related Articles