Google Employs AVA Video Dataset to Train AI on Human Actions

AVA in action. Image via Google

Google Seeks to Help Computers Understand ‘Atoms’ of Human Activity

In October, Google’s Machine Perception Research organization announced the release of a large dataset of video segments designed to help neural networks better train abilities to recognize and interpret human behavior. Google –which owns the YouTube video service as well as the eminent AI research group DeepMind— named the project AVA, or “atomic visual actions,” and seeks as per the name to produce a diverse data set of short clips highlighting basic–hence, “atomic”–human activities.

An example of AVA clips showing specific “atomic” actions indicated by red boxes. Images via Google, featuring video from this source.

In a process further described in a corresponding research paper on the project, AVA is derived from long-from video content sourced directly from films and television series featured on YouTube, chosen to reflect a wide variety of human actions as well as a diversity in the race and genders of the humans present in the clips. Google then further split this collection of video content into the more than 57,000 3-second long, non-overlapping clips that comprise the dataset. Each 3-second clip was then precisely analyzed and annotated to highlight the actions being performed by the individual(s) in the scene from actions across 80 categories. In total, AVA’s clip database features 96,000 labeled individuals performing a grand total of 210,000 actions.

An small subset showing the most common of AVA’s specific atomic action labels. Image via Google.

While computer vision networks–such as Google’s own famous Inception–have become adept at identifying objects in images and videos thanks to vast benchmark datasets for object recognition, AVA seeks to lay the groundwork for neural networks that can be better trained to recognize human actions, which “are, by nature, less well-defined than objects in videos, making it difficult to construct a finely labeled action video dataset,” according to Google.

AVA’s potential academic and commercial applications are limited only to the imagination, and such a diverse and large amount of data on real-time human activities will prove undoubtedly vital in any number of applied projects, from better automated content analysis and summary of videos, to improved computer vision systems in robots that are better able to judge human actions, reactions, and sentiments. According to the AVA team, the long-term goal of the project is “to enable machines to achieve human-level intelligence in sensory perception” and ultimately “imbu[e] computers with ‘social visual intelligence’–the ability to perceive what humans are doing, what might they do next, and what they are trying to achieve.”

By Jordan Castinado|2018-01-15T16:35:30-07:00October 24th, 2017|AI News, Deep Learning, Technology|0 Comments

About the Author: Jordan Castinado

Jordan is a proud cat dad, published poet, former Infantry Marine, and an Arizona-based technology reporter focusing on ethical AI development and the impacts of the tech industry on the future of society.

Leave A Comment Cancel reply

Our Company Mission

Seeflection.AI / Seeflection.com is focused in two areas, which provide synergies to each other. First, Seeflection.com provides AI news, information and e- learning and associated development resources. Second, we provide AI-based development and support services to companies focused in AI, quantum-AI and AI-enabled blockchain development. We have a rapidly growing set of affiliations with a range of corporate and non-profit Artificial Intelligence laboratories and research centers-- as well as individuals in various AI specialties. We are active in both primary and applied AI research and development programs, as well as AI applied to medicine, robotics, media and related markets.

Our Philosophy

Create synergy through applying technology to address long-term problems and create lasting opportunities for people.

M	T	W	T	F	S	S
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30	31

Google Seeks to Help Computers Understand ‘Atoms’ of Human Activity

About the Author: Jordan Castinado

AI Robot Performs Gallbladder Surgery

Nvidia Hits $4 Trillion Valuation

Google’s AI Assistant Is Now at Hand

OpenAI Targets Chrome’s Crown

Grok’s ‘Horrific Behavior’ Sparks Outrage

Leave A Comment Cancel reply

Our Company Mission

Our Philosophy

Google Employs AVA Video Dataset to Train AI on Human Actions