UIC News - 2006 University Scholar Derong Liu

Issue: 09/27/06

2006 University Scholar Derong Liu

Can a computer learn from its mistakes?

[print version] [email article]

09/27/06

Paul Francuch

Derong Liu: creating a computer that learns and remembers.

Photo: Roberta Dupuis-Devlin

The University Scholars Program, now in its 22nd year, honors faculty members for superior research and teaching, along with great promise for future achievement. The award provides $10,000 a year for three years.

Can a computer be programmed to learn from its mistakes, just like a person?

Derong Liu thinks so.

"When I was a child, I imagined machines doing things automatically," said Liu, professor of electrical and computer engineering.

His curiosity led him to study mechanical engineering as an undergraduate in China. By the time he entered a master's program at Beijing's Institute of Automation of the Chinese Academy of Sciences, he had switched to a field of electrical engineering called automatic control.

"That required additional study in electronics, circuits and automatic control theory," he recalled. "I discovered this was something that I'd really like to do as a career."

After finishing his Ph.D. at Notre Dame, Liu worked at the General Motors Tech Center in suburban Detroit and as an assistant professor at the Stevens Institute of Technology in Hoboken, N.J., before joining the UIC faculty in 1999.

As part of his ongoing research, Liu is interested in developing what are called artificial neural networks, which mimic animal nervous systems.

"A neural network is a nonlinear dynamical system. The human brain is also a nonlinear system,” he explained.

Liu’s research focuses on adaptive dynamic programming, or approximate dynamic programming.

“This scheme mimics the human brain's learning capabilities,” he said, adding that the field is in such early stages of study that only a few researchers around the world are working on it.

He and his students are studying new ways to control such devices as computers, car engines and other machines that traditionally work in a straight "a-to-z" manner, considering the multitude of simultaneous factors needed to make an intelligent decision.

"When you learn something, you don't usually use it right away. You wait until the next time you need it," he said. "The approach we're studying is very similar to this concept.

“What we eventually can have is a machine which will do something and if it does it good, remembers. The next time, it will do the same thing.

“But if it does not do a good job, it also remembers that and next time tries to correct it. So it basically has a kind of reward or penalty whenever the machine does something."

And how do you reward a computer?

"It's just numbers," Liu said.

"You tell the computer that it gets a positive number when it does a good job or a negative number when it does a bad job."

Browse Back Issues

What is RSS?