Using Stories To Teach Robots Right From Wrong
Robots, much like children, can learn a lot from a good bedtime story. Even robots need to understand the Grimm tales of life.
There is a growing sense that, as the capabilities of artificial intelligence have expanded rapidly over the past few years, we need to step back and consider the philosophical and ethical side of AI.
This is especially so given our patchy understanding of how an AI might carry out seemingly straightforward goals. For instance, asking an AI to eradicate cancer could prompt it to kill all humans, thus achieving its ultimate goal, but probably not in the way we’d desire.
Researchers from the Georgia Institute of Technology believe that robots can learn sufficient ethics, even if it is not hardwired into them, by using an approach they’re calling Quixote.
The approach, which was documented in a recent paper, uses value alignment, with the robots trained using stories to understand right from wrong.
“The collected stories of different cultures teach children how to behave in socially acceptable ways with examples of proper and improper behavior in fables, novels and other literature,” the authors say. “We believe story comprehension in robots can eliminate psychotic-appearing behavior and reinforce choices that won’t harm humans and still achieve the intended purpose.”
Morality Via Stories
The approach used by Quixote is designed to align the goals of the AI with human values by attaching rewards to certain behaviors. It builds on previous work by the researchers that showed how an AI can infer appropriate actions from crowdsourced story plots harvested from the web.
That earlier system learns the correct behavior and passes it on, as a basic data structure, to Quixote, which converts it into a reward signal designed to reinforce certain behaviors and punish others.
So, for instance, if the robot is asked to pick up a prescription, the system is given options such as robbing the chemist, waiting in line, or interacting politely with the staff.
Without value alignment, the AI might determine that the best way to achieve its goal is to rob the chemist. When values are built into its rewards, it is more likely to wait in line and pay for the prescription.
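The prescription example can be sketched in a few lines of Python. This is only an illustration of the general idea of adding a story-derived reward to a task reward. The action names follow the example above, but the reward tables, the numbers, and the `best_action` function are all assumptions made for this sketch and are not taken from the Quixote paper.

```python
# Hypothetical illustration only: every name and number here is an
# assumption for this sketch, not Quixote's actual implementation.

# Reward for completing the errand quickly, ignoring social norms.
TASK_REWARD = {
    "rob the chemist": 10.0,  # fastest route to the prescription
    "wait in line and pay": 6.0,
    "interact politely with staff": 5.0,
}

# Stand-in for a reward signal derived from stories:
# positive for behavior the stories endorse, negative otherwise.
STORY_REWARD = {
    "rob the chemist": -20.0,
    "wait in line and pay": 5.0,
    "interact politely with staff": 5.0,
}


def best_action(use_story_reward: bool) -> str:
    """Return the action with the highest total reward."""

    def total(action: str) -> float:
        bonus = STORY_REWARD[action] if use_story_reward else 0.0
        return TASK_REWARD[action] + bonus

    return max(TASK_REWARD, key=total)


if __name__ == "__main__":
    print("Without value alignment:", best_action(use_story_reward=False))
    print("With value alignment:", best_action(use_story_reward=True))
```

With only the task reward, the shortcut of robbing the chemist scores highest. Once the story-derived penalty is added, waiting in line and paying becomes the preferred choice.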
Thinking About Thought
The researchers put the system through its paces and believe it has made crucial progress in uncovering the various steps possible in a particular scenario.
They have developed a plot trajectory tree, which the AI uses to make choices in much the same way as readers do in a Choose Your Own Adventure novel.
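To make the Choose Your Own Adventure analogy concrete, here is a minimal Python sketch of a tree of story events in which each branch is a possible next step and the agent follows the branch with the highest total reward. The `PlotNode` class, the `best_path` function, the events, and the reward values are all hypothetical and chosen for illustration. The researchers' actual data structure may look quite different.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class PlotNode:
    """One story event; children are the events that may follow it."""

    event: str
    reward: float = 0.0  # hypothetical reward for taking this step
    children: list[PlotNode] = field(default_factory=list)


def best_path(node: PlotNode) -> tuple[float, list[str]]:
    """Return the highest total reward from this node and the events on that path."""
    if not node.children:
        return node.reward, [node.event]
    # Evaluate every branch and keep the one with the largest total reward.
    child_total, child_path = max(
        (best_path(child) for child in node.children),
        key=lambda result: result[0],
    )
    return node.reward + child_total, [node.event] + child_path


# A tiny, made-up tree for the prescription errand.
tree = PlotNode("enter the pharmacy", children=[
    PlotNode("grab the medicine and run", reward=-10.0, children=[
        PlotNode("leave with the prescription", reward=5.0),
    ]),
    PlotNode("wait in line", reward=2.0, children=[
        PlotNode("pay the pharmacist", reward=3.0, children=[
            PlotNode("leave with the prescription", reward=5.0),
        ]),
    ]),
])

if __name__ == "__main__":
    total, path = best_path(tree)
    print(total, " -> ".join(path))
```

In this toy tree both branches end with the robot holding the prescription, but the branch that waits in line and pays accumulates a higher total reward, so that is the path the sketch selects.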
At the moment, the method is effective for robots that have a relatively limited purpose but nonetheless need to interact with human beings to achieve their goal. However, the team believes it is an important step toward giving machines a degree of moral reasoning.
“We believe that AI has to be enculturated to adopt the values of a particular society, and in doing so, it will strive to avoid unacceptable behavior,” they say. “Giving robots the ability to read and understand our stories may be the most expedient means in the absence of a human user manual.”
Published at DZone with permission of Adi Gaskell, DZone MVB. See the original article here.