MIT Algorithm Views Videos, Extrapolates Speakers’ Likeness
A study posted to arXiv.org, the preprint repository hosted by Cornell University, describes how the “Speech2Face” algorithm creates images of people based solely on the sound of their voices. Developed at the Massachusetts Institute of Technology, the AI draws on what it learned from millions of internet videos, largely from YouTube, to arrive at its images.
The researchers provided details of the Speech2Face program and its development on GitHub, and presented the paper at the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) this year. The authors write:
“Note that some of the features in our predicted faces may not even be physically connected to speech, for example hair color or style. However, if many speakers in the training set who speak in a similar way (e.g., in the same language) also share some common visual traits (e.g., a common hair color or style), then those visual traits may show up in the predictions.”
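For readers curious how a voice-to-face mapping of this kind might be set up, the sketch below is a minimal, hypothetical illustration in PyTorch of the general idea the quote describes: a voice encoder is trained so that its output for a speech clip matches the face features that a separate, frozen face-recognition network extracts from a video frame of the same speaker, so visual traits correlated with the voice can surface in the prediction. The layer sizes, dimensions, and training loop here are assumptions for illustration only, not the paper’s actual implementation.

```python
import torch
import torch.nn as nn

# Hypothetical voice encoder: maps a speech spectrogram to a face-feature
# vector of the kind a pretrained face-recognition network would produce.
# Layer sizes are illustrative, not the paper's actual architecture.
class VoiceEncoder(nn.Module):
    def __init__(self, feature_dim: int = 4096):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(64, feature_dim)

    def forward(self, spectrogram: torch.Tensor) -> torch.Tensor:
        x = self.conv(spectrogram)       # (batch, 64, 1, 1)
        return self.fc(x.flatten(1))     # (batch, feature_dim)

# Dummy batch: spectrograms of speech clips, plus the face features a frozen
# face model would extract from a frame of the same speakers (random here).
spectrograms = torch.randn(8, 1, 257, 598)    # (batch, channel, freq, time)
target_face_features = torch.randn(8, 4096)   # stand-in for real face features

model = VoiceEncoder()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

# One training step: push the voice embedding toward the face embedding, so
# traits that co-occur with vocal patterns in the data shape the prediction.
pred = model(spectrograms)
loss = nn.functional.mse_loss(pred, target_face_features)
loss.backward()
optimizer.step()
print(f"loss: {loss.item():.4f}")
```

In this framing, the model can only predict traits that correlate with voice in its training videos, which is exactly why common hair colors or styles among similar-sounding speakers can leak into the output.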
A story on Futurism.com marveled at the new technology, stating: “In practice, the Speech2Face algorithm seems to have an uncanny knack for spitting out rough likenesses of people based on nothing but their speaking voices.” It noted the authors’ caution that the work is a “purely academic exploration” and that the researchers are sensitive to the potential misuse of the technology.
Almost as fascinating as the accurate depictions are the inaccurate ones in the graphic below:
The paper’s authors attribute these mistakes to voices that deviate from the patterns the model learned. For instance, a high-pitched male voice or a child’s voice may lead the algorithm to predict a face with female features; a speaker’s language may not match his or her ethnicity; and age mismatches can occur when a voice sounds “younger” than the speaker actually is.