AI Paired with Another AI Advances Self-Teaching and Self-Designed Curricula
As anyone in the field of education can tell you, children learn at different rates: some catch on quickly, and others take more time to absorb the same information. Sciencemag.com has an article this week reporting that programmers have finally caught up to that old teacher’s maxim, “each one teach one.” In other words, they paired two AIs and let them teach each other.
Unlike human students, computers don’t seem to get bored or frustrated when a lesson is too easy or too hard. But just like humans, they do better when a lesson plan is “just right” for their level of skill. Coming up with the right curricula isn’t easy, though, so computer scientists wondered: What if they could get the machines to design their own?
That’s what researchers have done in several new studies, creating artificial intelligence (AI) that can figure out how best to teach itself. The work could speed learning in self-driving cars and household robots, and it might even help crack previously unsolvable math problems.
To help an AI agent learn to navigate increasingly complex maze-like worlds, the researchers—led by University of California (UC), Berkeley, graduate student Michael Dennis and Natasha Jaques, a research scientist at Google—considered two ways in which they could draw the maps. One method randomly distributed blocks; with it, the AI didn’t learn much. Another method remembered what the AI had struggled with in the past, and maximized difficulty accordingly. But that made the worlds too hard—and sometimes even impossible—to complete.
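The contrast between those two baseline strategies can be sketched in a few lines of Python. The toy difficulty model and every name here are illustrative assumptions for exposition, not code from the studies:

```python
import random

# Toy model: an "environment" is just a difficulty level, and the
# curriculum designer must pick which level to train on next.

def random_curriculum(levels, rng):
    """First baseline: scatter blocks at random -- i.e., sample a
    level uniformly, with no regard for the agent's skill."""
    return rng.choice(levels)

def maximize_difficulty(levels, failed_levels):
    """Second baseline: remember what the agent struggled with and
    push difficulty up -- which can propose unsolvable worlds."""
    hardest_failure = max(failed_levels, default=min(levels))
    return max(l for l in levels if l >= hardest_failure)

rng = random.Random(0)
levels = [1, 2, 3, 4, 5, 6, 7]
print(random_curriculum(levels, rng))       # any level, easy or hard
print(maximize_difficulty(levels, {4, 5}))  # always the hardest: 7
```

The first strategy wastes training time on trivially easy or hopeless maps; the second ratchets difficulty upward with no brake, which is why the article says it produced impossible worlds.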
So the scientists created a setting that was just right, using a new approach they call PAIRED. First, they coupled their AI with a nearly identical one, albeit with a slightly different set of strengths, which they called the antagonist. Then, they had a third AI design worlds that were easy for the antagonist—but hard for the original protagonist. That kept the tasks just at the edge of the protagonist’s ability to solve. The designer, like the two agents, used a neural network—a program inspired by the brain’s architecture—to learn its task over many trials.
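The core idea is a regret objective: the designer is rewarded when the antagonist succeeds where the protagonist fails, which steers it toward solvable-but-hard worlds. A minimal sketch of that mechanism might look like the following; the skill/difficulty model and all names are my own illustrative assumptions, not the paper’s code:

```python
# Toy sketch of PAIRED's regret objective. An "environment" is just a
# difficulty level, and an agent "solves" it when its skill is at
# least that difficulty (a deliberately crude stand-in for a maze run).

def episode_return(skill: float, difficulty: float) -> float:
    """1.0 if the agent solves the world, else 0.0 (toy model)."""
    return 1.0 if skill >= difficulty else 0.0

def regret(antagonist_skill, protagonist_skill, difficulty):
    """Designer's reward: antagonist return minus protagonist return."""
    return (episode_return(antagonist_skill, difficulty)
            - episode_return(protagonist_skill, difficulty))

def best_difficulty(antagonist_skill, protagonist_skill, candidates):
    """The designer proposes the maximal-regret environment: easy for
    the antagonist, hard for the protagonist."""
    return max(candidates,
               key=lambda d: regret(antagonist_skill, protagonist_skill, d))

# Protagonist skill 3, antagonist skill 5: regret is positive only for
# difficulties the antagonist solves but the protagonist cannot.
levels = [1, 2, 3, 4, 5, 6, 7]
chosen = best_difficulty(5.0, 3.0, levels)
print(chosen)  # -> 4, just past the protagonist's ability
```

Because regret is zero both for worlds everyone solves and for impossible worlds, maximizing it keeps the curriculum pinned to the frontier of the protagonist’s ability, exactly the “just right” zone the article describes.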
After training, the protagonist attempted a set of difficult mazes. If it trained using the two older methods, it solved none of the new mazes. But after training with PAIRED, it solved one in five, the team reported last month at the Conference on Neural Information Processing Systems (NeurIPS).
“We were excited by how PAIRED started working pretty much out of the gate,” Dennis says.
In another study, presented at a NeurIPS workshop, Jaques and colleagues at Google used a version of PAIRED to teach an AI agent to fill out web forms and book a flight. Whereas a simpler teaching method led it to fail nearly every time, an AI trained with the PAIRED method succeeded about 50% of the time.
In a third study, a team including Cornell University computer scientist Bart Selman applied a similar self-designed curriculum to Boolean satisfiability problems—logic puzzles that underpin many hard computing tasks. The researchers tested their trained agent on 225 problems that no computer had ever solved. It cracked 80% of them, with about one-third of its success coming strictly from the novel training method. “That was just fun to see,” Selman says. He says he now receives messages of astonishment from AI researchers who’ve been working on the problems for decades. He hopes to apply the method next to unsolved math proofs.
Pieter Abbeel, a computer scientist at UC Berkeley, also showed at the meeting that “autocurricula” can help robots learn to manipulate objects. He says the approach could even be used for human students.
“As an instructor, I think, ‘Hey, not every student needs the same homework exercise,’” Abbeel says, noting that AI could help tailor harder or easier material to a student’s needs. As for AI autocurricula, he says, “I think it’s going to be at the core of pretty much all reinforcement learning.”
Matthew Hutson’s article, linked below, details the efforts.
read more at www.sciencemag.org