Researchers: Uber’s AI Masters Complex Video Games

Uber Tops Other Research Groups in Pitfall!, Montezuma’s Revenge

It may seem insignificant in the grand scheme of AI advances, but the machine-learning algorithms developed by Uber’s AI team are far more important than they appear at first glance.

For the first time, machine learning has advanced through ’80s-era video games that offer few rewards or clues but hinge on memory, according to a story in Technology Review. The advances have enabled the AI to reach high scores in Montezuma’s Revenge and Pitfall! after two years of failing and scoring zero. The group’s blog explained how it worked:

“The team’s new family of reinforcement-learning algorithms, dubbed Go-Explore, remember where they have been before, and will return to a particular area or task later on to see if it might help provide better overall results. The researchers also found that adding a little bit of domain knowledge, by having human players highlight interesting or important areas, sped up the algorithms’ learning and progress by a remarkable amount. This is significant because there may be many real-world situations where you would want an algorithm and a person to work together to solve a hard task.”

Instead of being purely rewards-oriented, the AI is “motivation”-oriented, so reinforcement learning can still take place when rewards are scarce, which is harder than it sounds. Other AI researchers who have been attempting to crack the games are also finally making headway. OpenAI, a nonprofit in San Francisco, created an algorithm that is making progress in Montezuma’s Revenge. A research group at Stanford made “modest” progress on Pitfall! using an approach similar to Uber’s.

A VentureBeat.com story on the Uber group’s report describes the advance as a “two-phase solution” involving exploration and “robustification.”

“In the exploration phase, Go-Explore builds an archive of different game states — cells — and the various trajectories, or scores, that lead to them. It chooses a cell, returns to that cell, explores the cell, and, for all cells it visits, swaps it in as the trajectory if a given new trajectory is better (i.e., the score is higher).”
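The exploration loop described in that quote can be sketched in a few lines of Python. This is only an illustrative toy, not Uber’s implementation: the one-dimensional corridor, the names `step`, `explore` and `replay`, the uniform choice of cell, and the tie-break in favor of shorter trajectories are all assumptions made for the example. It also assumes a deterministic game whose state can be restored directly, and it leaves out the “robustification” phase entirely.

```python
import random

# Hypothetical toy game: a corridor of cells 0..GOAL with a single reward
# for reaching the far end. Nothing here comes from Uber's code.
GOAL = 10
ACTIONS = (-1, +1)


def step(position, action):
    """Deterministic move left or right, clamped to the corridor."""
    new_position = max(0, min(GOAL, position + action))
    reward = 1 if new_position == GOAL else 0
    return new_position, reward


def explore(iterations=2000, steps_per_visit=5, seed=0):
    """Build an archive mapping each cell to its best (score, trajectory)."""
    rng = random.Random(seed)
    archive = {0: (0, [])}  # start cell: score 0, empty action list
    for _ in range(iterations):
        # 1. Choose a cell from the archive (uniformly, as a simplification).
        cell = rng.choice(list(archive))
        if cell == GOAL:
            continue  # the goal ends the game, so there is nothing to explore
        # 2. Return to it. In this toy the cell is the full game state,
        #    so restoring it is just copying the saved values.
        score, saved_trajectory = archive[cell]
        position, trajectory = cell, list(saved_trajectory)
        # 3. Explore from it with a few random actions.
        for _ in range(steps_per_visit):
            action = rng.choice(ACTIONS)
            position, reward = step(position, action)
            score += reward
            trajectory.append(action)
            # 4. For every cell visited, keep the new trajectory if it is
            #    better: higher score, or (assumed tie-break) same score
            #    reached in fewer steps.
            best = archive.get(position)
            if (best is None or score > best[0]
                    or (score == best[0] and len(trajectory) < len(best[1]))):
                archive[position] = (score, list(trajectory))
            if position == GOAL:
                break
    return archive


def replay(trajectory):
    """Re-run a stored action list from the start; return (cell, score)."""
    position, score = 0, 0
    for action in trajectory:
        position, reward = step(position, action)
        score += reward
    return position, score


if __name__ == "__main__":
    archive = explore()
    goal_score, goal_trajectory = archive[GOAL]
    print("score at goal:", goal_score, "steps:", len(goal_trajectory))
```

Because every archive entry stores the actions that led to it, any cell can be revisited later by replaying its trajectory, which is the “remembering where it has been” behavior the article describes.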

The original paper on Uber’s advances can be read at the company’s engineering blog page.

About the Author: Toni Denis

Toni Denis is the Publisher & Editor of Seeflection.com. She previously launched two other websites, including the first B2B vertical for the hotel industry. Her writing experience includes working for daily newspapers, magazines and websites.

Our Company Mission

Seeflection.AI / Seeflection.com focuses on two areas that complement each other. First, Seeflection.com provides AI news, information, e-learning and associated development resources. Second, we provide AI-based development and support services to companies focused on AI, quantum-AI and AI-enabled blockchain development. We have a rapidly growing set of affiliations with a range of corporate and non-profit artificial intelligence laboratories and research centers, as well as with individuals in various AI specialties. We are active in both primary and applied AI research and development programs, as well as AI applied to medicine, robotics, media and related markets.

Our Philosophy

Create synergy through applying technology to address long-term problems and create lasting opportunities for people.

