These video images were used to train the ML model on abstract concepts. (Source: MIT CSAIL)

Video Training Lets AI Recognize, Connect Abstract Concepts

While it may not sound like a monumental step forward, the success of MIT's AI lab in teaching a machine-learning vision model to recognize seemingly unrelated videos (say, a dog barking and a man howling) as conceptually similar is a breakthrough for neural networks. The model also rejected videos that didn't fit the shared concept.

To put it into perspective, it's among the closest an AI has come to human-level abstract reasoning, even though it's still at the level of a human toddler. As MIT News explained:

“Organizing the world into abstract categories does not come easily to computers, but in recent years researchers have inched closer by training machine learning models on words and images infused with structural information about the world, and how objects, animals, and actions relate. In a new study at the European Conference on Computer Vision this month, researchers unveiled a hybrid language-vision model that can compare and contrast a set of dynamic events captured on video to tease out the high-level concepts connecting them.”

The ML model performed well at identifying videos that belong together conceptually: presented with five videos, for example, it picked out the related set of three, such as a dog barking, a man howling beside his dog, and a crying baby. Researchers at MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) trained the model on two action-recognition datasets: MIT's Multi-Moments in Time and DeepMind's Kinetics. A sketch of that selection step follows below.
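To make the selection step concrete, here is a minimal sketch, assuming each of the five videos has already been reduced to an embedding vector: score every three-video subset by its average pairwise cosine similarity and keep the most coherent one. The synthetic embeddings and the scoring rule are illustrative assumptions, not the study's actual method.

```python
# A minimal sketch of the "pick the related set" task, under the
# assumption that each of the five videos is already an embedding
# vector. The vectors below are synthetic stand-ins; the real model
# learns its representations jointly from video and language.
from itertools import combinations
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 8-d embeddings: the first three are clustered on purpose
# to play the role of the conceptually related videos.
base = rng.normal(size=8)
videos = np.stack(
    [base + 0.1 * rng.normal(size=8) for _ in range(3)]
    + [rng.normal(size=8) for _ in range(2)]
)
videos /= np.linalg.norm(videos, axis=1, keepdims=True)  # unit-normalize

def coherence(subset):
    """Mean cosine similarity over all pairs of videos in the subset."""
    return np.mean([videos[i] @ videos[j] for i, j in combinations(subset, 2)])

# Enumerate all three-video subsets and keep the most coherent one.
best = max(combinations(range(5), 3), key=coherence)
print("most conceptually coherent triple:", best)  # expected: (0, 1, 2)
```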

“We show that you can build abstraction into an AI system to perform ordinary visual reasoning tasks close to a human level,” says the study’s senior author Aude Oliva, a senior research scientist at MIT, co-director of the MIT Quest for Intelligence, and MIT director of the MIT-IBM Watson AI Lab. “A model that can recognize abstract events will give more accurate, logical predictions and be more useful for decision-making.”

A story on edgy.app explained how the researchers used WordNet, a database of word meanings, to map the relationships among the action-class labels in their datasets.

“For example, they linked words like ‘sculpting,’ ‘carving,’ and ‘cutting’ to higher-level concepts such as ‘crafting,’ ‘cooking,’ and ‘making art.’ So, when the model recognizes sculpting activity, it can pick out conceptually similar activities.”
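That label-to-concept mapping can be sketched with off-the-shelf tools. The snippet below uses NLTK's WordNet interface; the label list, the choice of each word's first verb sense, and the one-step hypernym walk are simplifying assumptions for illustration, not the researchers' actual pipeline.

```python
# Illustrative use of WordNet to lift action labels to higher-level
# concepts. Requires: pip install nltk, then nltk.download('wordnet').
from nltk.corpus import wordnet as wn

labels = ["sculpt", "carve", "cut"]  # hypothetical action-class labels

# Take each label's first verb sense (a simplifying assumption; a real
# pipeline would disambiguate senses against the dataset).
synsets = {lab: wn.synsets(lab, pos=wn.VERB)[0] for lab in labels}

# One step up the hypernym hierarchy gives a more abstract parent concept.
for lab, syn in synsets.items():
    print(lab, "->", [p.name() for p in syn.hypernyms()])

# The lowest common hypernym of two labels approximates the abstract
# concept linking them (e.g. a shared 'create'/'make' ancestor).
common = synsets["sculpt"].lowest_common_hypernyms(synsets["carve"])
print("shared concept:", [c.name() for c in common])
```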

These basic abilities are bringing AI closer to the human capacity to compare and contrast, according to Oliva.

“It’s a rich and efficient way to learn that could eventually lead to machine learning models that can understand analogies and are that much closer to communicating intelligently with us.”

read more at news.mit.edu