Tuesday, 21 January 2025
25.7 C
Singapore
33.7 C
Thailand
21.6 C
Indonesia
27.4 C
Philippines

ChatGPT could soon gain the ability to see

ChatGPTโ€™s Advanced Voice Mode might soon include a live camera feature, enabling AI to identify objects and interact visually.

ChatGPTโ€™s Advanced Voice Mode, known for enabling real-time conversations with a chatbot, might soon include visual capabilities. Code uncovered in the latest beta version of the app hints at introducing a “live camera” feature. This discovery in ChatGPT v1.2024.317, as reported by Android Authority, suggests that the rollout of this exciting feature could be just around the corner. However, OpenAI has yet to confirm an official release date.

A glimpse into the featureโ€™s early tests

The idea of ChatGPT having a visual edge has been introduced previously. During the initial alpha testing phase of Advanced Voice Mode in May, OpenAI demonstrated its potential visual capabilities. In one example, the chatbot used a phone’s camera to identify a dog, recognise its ball, and associate the two in the context of playing fetch. This ability to observe, understand, and link objects to real-world scenarios was widely praised by early testers.

Alpha testers were quick to explore the featureโ€™s uses. A notable example came from a user on X (formerly Twitter), Manuel Sainsily, who utilised the camera to ask questions about his new kitten. This interactive capability showcased how the feature could provide fun and practical benefits.

When Advanced Voice Mode entered beta testing in September for ChatGPT Plus and Enterprise users, its visual functionality was notably absent. Despite this, the voice feature gained immense popularity for enabling natural, dynamic conversations. According to OpenAI, users could interrupt the chatbot at any moment, and it could even pick up on the speaker’s emotional tone.

What sets it apart from competitors?

ChatGPT could have a unique edge over rivals like Google and Meta if the live camera feature is introduced. Google’s conversational AI, Gemini Live, may speak over 40 languages but lacks visual processing capabilities. Similarly, Meta’s Natural Voice Interactions, showcased at the Connect 2024 event in September, cannot use camera inputs. While these systems are competent in their ways, OpenAIโ€™s visual feature could redefine how AI assistants interact with the world.

Desktop users can now enjoy Advanced Voice Mode

In a related update, OpenAI announced that Advanced Voice Mode is now available to paid ChatGPT Plus users on desktop. Previously limited to mobile devices, this update means users can now access this feature directly on their laptops or PCs.

The introduction of the live camera could mark a significant leap forward, combining the ability to see and hear into one seamless AI experience. While the exact timing remains uncertain, the potential impact of this development is already generating excitement among users and industry experts alike.

Hot this week

Apple reveals apps removed from U.S. App Store alongside TikTok

Apple lists all apps removed in the U.S. alongside TikTok, including CapCut and Lemon8, citing legal obligations under U.S. law.

TikTok services were restored in the US after a brief shutdown

TikTok restored its service in the US after a brief outage following former President Trumpโ€™s executive action to delay a looming nationwide ban.

Nintendo Switch 2 reveal: Everything you need to know

Nintendo Switch 2, confirmed for 2025, will have a larger design, improved Joy-Con, backward compatibility, and a new Mario Kart game.

Google partners with Indian startup for the worldโ€™s largest biochar carbon removal deal

Google partners with Indian startup Varaha in a deal for 100,000 tons of biochar carbon removal credits, promoting sustainable climate solutions.

Samsung Galaxy S25 colours leak ahead of launch

Samsung Galaxy S25 leaks reveal eight stunning colours and confirm Galaxy Unpacked on January 22. Full details on models, shades, and more.

Apple set to launch iPhone SE 4 with Dynamic Island and iPad Air featuring M3 chip

The iPhone SE 4 with Dynamic Island and iPad Air with M3 chip are expected to launch soon. They will offer modern design and performance upgrades.

President Trump signs executive order delaying TikTok ban for 75 days

Trump delayed the TikTok ban with a 75-day executive order, allowing time to address national security concerns and find a resolution.

President Trump repeals Bidenโ€™s AI executive order on first day in office

President Trump repeals Biden's 2023 AI executive order on day one, sparking debate over AI regulation, innovation, and national security risks.

RedNote, Flip, Clapper, and Likee dominate app charts as TikTok returns online

TikTokโ€™s brief ban boosted rivals RedNote, Flip, Clapper, and Likee, which are now leading U.S. app charts and reshaping video-sharing app trends.

Related Articles