On environment difficulty and discriminating power
Gespeichert in:
Verfasser / Beitragende:
[José Hernández-Orallo]
Ort, Verlag, Jahr:
2015
Enthalten in:
Autonomous Agents and Multi-Agent Systems, 29/3(2015-05-01), 402-454
Format:
Artikel (online)
Online Zugang:
| LEADER | caa a22 4500 | ||
|---|---|---|---|
| 001 | 605514739 | ||
| 003 | CHVBK | ||
| 005 | 20210128100706.0 | ||
| 007 | cr unu---uuuuu | ||
| 008 | 210128e20150501xx s 000 0 eng | ||
| 024 | 7 | 0 | |a 10.1007/s10458-014-9257-1 |2 doi |
| 035 | |a (NATIONALLICENCE)springer-10.1007/s10458-014-9257-1 | ||
| 100 | 1 | |a Hernández-Orallo |D José |u DSIC, Universitat Politècnica de València, Valencia, Spain |4 aut | |
| 245 | 1 | 0 | |a On environment difficulty and discriminating power |h [Elektronische Daten] |c [José Hernández-Orallo] |
| 520 | 3 | |a This paper presents a way to estimate the difficulty and discriminating power of any task instance. We focus on a very general setting for tasks: interactive (possibly multi-agent) environments where an agent acts upon observations and rewards. Instead of analysing the complexity of the environment, the state space or the actions that are performed by the agent, we analyse the performance of a population of agent policies against the task, leading to a distribution that is examined in terms of policy complexity. This distribution is then sliced by the algorithmic complexity of the policy and analysed through several diagrams and indicators. The notion of environment response curve is also introduced, by inverting the performance results into an ability scale. We apply all these concepts, diagrams and indicators to two illustrative problems: a class of agent-populated elementary cellular automata, showing how the difficulty and discriminating power may vary for several environments, and a multi-agent system, where agents can become predators or preys, and may need to coordinate. Finally, we discuss how these tools can be applied to characterise (interactive) tasks and (multi-agent) environments. These characterisations can then be used to get more insight about agent performance and to facilitate the development of adaptive tests for the evaluation of agent abilities. | |
| 540 | |a The Author(s), 2014 | ||
| 690 | 7 | |a Environment difficulty |2 nationallicence | |
| 690 | 7 | |a Agent evaluation |2 nationallicence | |
| 690 | 7 | |a Discriminating power |2 nationallicence | |
| 690 | 7 | |a Agent policy |2 nationallicence | |
| 690 | 7 | |a Algorithmic information theory |2 nationallicence | |
| 690 | 7 | |a Universal psychometrics |2 nationallicence | |
| 690 | 7 | |a Reinforcement learning |2 nationallicence | |
| 690 | 7 | |a Elementary cellular automata |2 nationallicence | |
| 773 | 0 | |t Autonomous Agents and Multi-Agent Systems |d Springer US; http://www.springer-ny.com |g 29/3(2015-05-01), 402-454 |x 1387-2532 |q 29:3<402 |1 2015 |2 29 |o 10458 | |
| 856 | 4 | 0 | |u https://doi.org/10.1007/s10458-014-9257-1 |q text/html |z Onlinezugriff via DOI |
| 898 | |a BK010053 |b XK010053 |c XK010000 | ||
| 900 | 7 | |a Metadata rights reserved |b Springer special CC-BY-NC licence |2 nationallicence | |
| 908 | |D 1 |a research-article |2 jats | ||
| 949 | |B NATIONALLICENCE |F NATIONALLICENCE |b NL-springer | ||
| 950 | |B NATIONALLICENCE |P 856 |E 40 |u https://doi.org/10.1007/s10458-014-9257-1 |q text/html |z Onlinezugriff via DOI | ||
| 950 | |B NATIONALLICENCE |P 100 |E 1- |a Hernández-Orallo |D José |u DSIC, Universitat Politècnica de València, Valencia, Spain |4 aut | ||
| 950 | |B NATIONALLICENCE |P 773 |E 0- |t Autonomous Agents and Multi-Agent Systems |d Springer US; http://www.springer-ny.com |g 29/3(2015-05-01), 402-454 |x 1387-2532 |q 29:3<402 |1 2015 |2 29 |o 10458 | ||