OpenAI has significantly upgraded ChatGPT’s Advanced Voice Mode by adding vision capabilities. The new feature lets users interact with ChatGPT over live video, enabling the AI to analyze what they see in real time. Since the introduction of GPT-4o, Advanced Voice Mode has supported only audio; now users can point their smartphone cameras at their surroundings and ChatGPT will respond to what it sees.

To use the feature, ChatGPT Plus, Team, and Pro subscribers tap the voice icon in the app and then select the video option. From there, they can point their phones at objects or share their screens, and ChatGPT will offer explanations or suggestions based on what it observes. The rollout began recently and will continue over the next week, though some users will not gain access until January.

During a recent demonstration, OpenAI’s Chief Product Officer Kevin Weil and his team showed how ChatGPT can assist with making pour-over coffee. By pointing their camera at the brewing process, they demonstrated that ChatGPT understands how the coffee maker works and can offer guidance on brewing technique. The team also showcased ChatGPT’s new screen-sharing support: the assistant recognized an open message on a smartphone and noted that Weil was sporting a Santa beard.

The update follows Google’s launch of its Gemini 2.0 model, which also handles both visual and audio inputs. Gemini 2.0 is designed for complex, agentic tasks and powers three key prototypes: Project Astra, a universal assistant; Project Mariner, an agent that navigates the web in a browser; and Jules, an AI coding assistant for developers. OpenAI’s latest demonstration, for its part, highlights ChatGPT’s new vision capabilities, which let the AI recognize objects and sustain smooth, real-time interactions.