Incorporating advice into agents that learn from reinforcements

Richard Maclin; Jude W. Shavlik

Computer Science (Duluth)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

26 Scopus citations

Abstract

Learning from reinforcements is a promising approach for creating intelligent agents. However, reinforcement learning usually requires a large number of training episodes. We present an approach that addresses this shortcoming by allowing a connectionist Q-learner to accept advice given, at any time and in a natural manner, by an external observer. In our approach, the advice-giver watches the learner and occasionally makes suggestions, expressed as instructions in a simple programming language. Based on techniques from knowledge-based neural networks, these programs are inserted directly into the agent's utility function. Subsequent reinforcement learning further integrates and refines the advice. We present empirical evidence that shows our approach leads to statistically-significant gains in expected reward. Importantly, the advice improves the expected reward regardless of the stage of training at which it is given.

Original language	English (US)
Title of host publication	Proceedings of the National Conference on Artificial Intelligence
Publisher	AAAI
Pages	694-699
Number of pages	6
Volume	1
State	Published - Dec 1 1994
Event	Proceedings of the 12th National Conference on Artificial Intelligence. Part 1 (of 2) - Seattle, WA, USA
Duration	Jul 31 1994 → Aug 4 1994

Cite this

@inproceedings{41e090c3ace24f63a4834fdf93c1362c,
title = "Incorporating advice into agents that learn from reinforcements",
abstract = "Learning from reinforcements is a promising approach for creating intelligent agents. However, reinforcement learning usually requires a large number of training episodes. We present an approach that addresses this shortcoming by allowing a connectionist Q-learner to accept advice given, at any time and in a natural manner, by an external observer. In our approach, the advice-giver watches the learner and occasionally makes suggestions, expressed as instructions in a simple programming language. Based on techniques from knowledge-based neural networks, these programs are inserted directly into the agent's utility function. Subsequent reinforcement learning further integrates and refines the advice. We present empirical evidence that shows our approach leads to statistically-significant gains in expected reward. Importantly, the advice improves the expected reward regardless of the stage of training at which it is given.",
author = "Maclin, Richard and Shavlik, {Jude W.}",
year = "1994",
month = dec,
day = "1",
language = "English (US)",
volume = "1",
pages = "694--699",
booktitle = "Proceedings of the National Conference on Artificial Intelligence",
publisher = "AAAI",
note = "Proceedings of the 12th National Conference on Artificial Intelligence. Part 1 (of 2) ; Conference date: 31-07-1994 Through 04-08-1994",
}

TY - GEN
T1 - Incorporating advice into agents that learn from reinforcements
AU - Maclin, Richard
AU - Shavlik, Jude W.
PY - 1994/12/1
Y1 - 1994/12/1
AB - Learning from reinforcements is a promising approach for creating intelligent agents. However, reinforcement learning usually requires a large number of training episodes. We present an approach that addresses this shortcoming by allowing a connectionist Q-learner to accept advice given, at any time and in a natural manner, by an external observer. In our approach, the advice-giver watches the learner and occasionally makes suggestions, expressed as instructions in a simple programming language. Based on techniques from knowledge-based neural networks, these programs are inserted directly into the agent's utility function. Subsequent reinforcement learning further integrates and refines the advice. We present empirical evidence that shows our approach leads to statistically-significant gains in expected reward. Importantly, the advice improves the expected reward regardless of the stage of training at which it is given.

UR - http://www.scopus.com/inward/record.url?scp=0028566290&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0028566290&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:0028566290
VL - 1
SP - 694
EP - 699
BT - Proceedings of the National Conference on Artificial Intelligence
PB - AAAI
T2 - Proceedings of the 12th National Conference on Artificial Intelligence. Part 1 (of 2)
Y2 - 31 July 1994 through 4 August 1994
ER -
