On environment difficulty and discriminating power

Verfasser / Beitragende:
[José Hernández-Orallo]
Ort, Verlag, Jahr:
2015
Enthalten in:
Autonomous Agents and Multi-Agent Systems, 29/3(2015-05-01), 402-454
Format:
Artikel (online)
ID: 605514739
LEADER caa a22 4500
001 605514739
003 CHVBK
005 20210128100706.0
007 cr unu---uuuuu
008 210128e20150501xx s 000 0 eng
024 7 0 |a 10.1007/s10458-014-9257-1  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s10458-014-9257-1 
100 1 |a Hernández-Orallo  |D José  |u DSIC, Universitat Politècnica de València, Valencia, Spain  |4 aut 
245 1 0 |a On environment difficulty and discriminating power  |h [Elektronische Daten]  |c [José Hernández-Orallo] 
520 3 |a This paper presents a way to estimate the difficulty and discriminating power of any task instance. We focus on a very general setting for tasks: interactive (possibly multi-agent) environments where an agent acts upon observations and rewards. Instead of analysing the complexity of the environment, the state space or the actions that are performed by the agent, we analyse the performance of a population of agent policies against the task, leading to a distribution that is examined in terms of policy complexity. This distribution is then sliced by the algorithmic complexity of the policy and analysed through several diagrams and indicators. The notion of environment response curve is also introduced, by inverting the performance results into an ability scale. We apply all these concepts, diagrams and indicators to two illustrative problems: a class of agent-populated elementary cellular automata, showing how the difficulty and discriminating power may vary for several environments, and a multi-agent system, where agents can become predators or preys, and may need to coordinate. Finally, we discuss how these tools can be applied to characterise (interactive) tasks and (multi-agent) environments. These characterisations can then be used to get more insight about agent performance and to facilitate the development of adaptive tests for the evaluation of agent abilities. 
540 |a The Author(s), 2014 
690 7 |a Environment difficulty  |2 nationallicence 
690 7 |a Agent evaluation  |2 nationallicence 
690 7 |a Discriminating power  |2 nationallicence 
690 7 |a Agent policy  |2 nationallicence 
690 7 |a Algorithmic information theory  |2 nationallicence 
690 7 |a Universal psychometrics  |2 nationallicence 
690 7 |a Reinforcement learning  |2 nationallicence 
690 7 |a Elementary cellular automata  |2 nationallicence 
773 0 |t Autonomous Agents and Multi-Agent Systems  |d Springer US; http://www.springer-ny.com  |g 29/3(2015-05-01), 402-454  |x 1387-2532  |q 29:3<402  |1 2015  |2 29  |o 10458 
856 4 0 |u https://doi.org/10.1007/s10458-014-9257-1  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s10458-014-9257-1  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 100  |E 1-  |a Hernández-Orallo  |D José  |u DSIC, Universitat Politècnica de València, Valencia, Spain  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Autonomous Agents and Multi-Agent Systems  |d Springer US; http://www.springer-ny.com  |g 29/3(2015-05-01), 402-454  |x 1387-2532  |q 29:3<402  |1 2015  |2 29  |o 10458