OpenAI has upgraded ChatGPT’s voice mode to deliver emotionally expressive speech and real-time translation, bringing human-like interactions—and a few odd bugs—to the forefront of AI conversation.

Advanced Voice Mode Now Offers Improved Intonation, Pausing and Emotional Range

OpenAI has significantly enhanced ChatGPT’s voice capabilities for paying subscribers with a major update to its “Advanced Voice Mode.” The improvements make the AI’s speech sound more natural, adding lifelike elements such as empathy, sarcasm and emotional nuance, and refining intonation, timing and pausing so that conversations feel smoother and more human-like.

One of the key features in this update is real-time translation. Users can now hold live, bilingual conversations with ChatGPT acting as an interpreter. As the-decoder.com notes, the function is designed for practical, real-world use cases such as ordering food in a foreign language or navigating multilingual workplace discussions. Once started, translation runs continuously until the user stops it.
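
OpenAI has not published how the interpreter mode works internally, but a rough sense of such a loop can be sketched with the company’s public audio endpoints: Whisper’s translation endpoint converts foreign-language speech to English text, and the text-to-speech endpoint speaks the result back. The file names and the one-directional, English-only flow below are illustrative assumptions, not ChatGPT’s actual pipeline.

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def interpret_clip(path: str) -> bytes:
    """Translate one recorded audio clip into spoken English (illustrative only)."""
    # Whisper's /audio/translations endpoint always outputs English text.
    with open(path, "rb") as audio_file:
        english_text = client.audio.translations.create(
            model="whisper-1",
            file=audio_file,
        ).text

    # Speak the translated text with the text-to-speech endpoint.
    speech = client.audio.speech.create(
        model="tts-1",
        voice="alloy",
        input=english_text,
    )
    return speech.content  # MP3 bytes by default

# Looping over successive clips until the user quits mirrors the
# "continuous until stopped" behavior described above.
if __name__ == "__main__":
    audio = interpret_clip("order_in_spanish.mp3")  # hypothetical file name
    with open("spoken_translation.mp3", "wb") as out:
        out.write(audio)
```

ChatGPT’s own interpreter is bidirectional and streams in real time, which this batch-style sketch does not attempt to reproduce.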

The new voice features are now accessible to all paying users across devices by clicking the language icon in the app. OpenAI has steadily expanded the rollout since introducing Advanced Voice Mode in May 2024, bringing it to the EU by October 2024. With this update, ChatGPT also supports live visual analysis when the user’s camera is on—similar to features offered by Google’s Gemini app.

However, limitations remain. Some users may still experience glitches, including abrupt shifts in pitch or volume and occasional lapses in audio quality, especially when using certain AI voices. OpenAI also acknowledges the persistence of “hallucinated” sounds, where ChatGPT unexpectedly generates random audio elements such as background music, strange noises, or even snippets resembling advertisements—despite the platform not serving any ads. One notable report involved ChatGPT suddenly playing what sounded like a commercial mid-conversation, raising questions about the root cause of such anomalies. While these bugs don’t affect core functionality, they highlight the ongoing challenge of perfecting realistic AI-generated speech at scale.

Read more at the-decoder.com