Learning classifier systems with memory condition to solve non-Markov problems

Verfasser / Beitragende:
[Zhaoxiang Zang, Dehua Li, Junying Wang]
Ort, Verlag, Jahr:
2015
Enthalten in:
Soft Computing, 19/6(2015-06-01), 1679-1699
Format:
Artikel (online)
ID: 605468664
LEADER caa a22 4500
001 605468664
003 CHVBK
005 20210128100318.0
007 cr unu---uuuuu
008 210128e20150601xx s 000 0 eng
024 7 0 |a 10.1007/s00500-014-1357-y  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s00500-014-1357-y 
245 0 0 |a Learning classifier systems with memory condition to solve non-Markov problems  |h [Elektronische Daten]  |c [Zhaoxiang Zang, Dehua Li, Junying Wang] 
520 3 |a In the family of learning classifier systems, the classifier system XCS has been successfully used for many applications. However, the standard XCS has no memory mechanism and can only learn optimal policy in Markov environments, but fails in non-Markov ones. In this work, we aim to develop a new classifier system based on XCS to tackle this problem. It adds a memory list with numbered slots to XCS to record input sensation history, and extends only a small number of classifiers with memory conditions. The classifier's memory condition, as a foothold to disambiguate non-Markov states, is used to sense a specified element in the memory list, which makes our system can "jump over” irrelevant or confusing states to get decisive prior information that may be far back in time. Besides, a detection method is employed to recognize non-Markov states in environments, to avoid these states controlling over classifiers' memory conditions. Furthermore, four sets of different complex maze environments have been tested by the proposed method. Experimental results show that our system can overcome the overhead problem often encountered in history-window approaches, and is an effective technique to solve non-Markov environments. 
540 |a Springer-Verlag Berlin Heidelberg, 2014 
690 7 |a Learning classifier system  |2 nationallicence 
690 7 |a XCS  |2 nationallicence 
690 7 |a Memory condition  |2 nationallicence 
690 7 |a Aliasing state detection  |2 nationallicence 
690 7 |a Partially observable environments  |2 nationallicence 
690 7 |a Non-Markov problems  |2 nationallicence 
700 1 |a Zang  |D Zhaoxiang  |u College of Computer and Information Technology, China Three Gorges University, 443002, Yichang, Hubei, China  |4 aut 
700 1 |a Li  |D Dehua  |u Institute for Pattern Recognition and Artificial Intelligence, Huazhong University of Science and Technology, 430074, Wuhan, Hubei, China  |4 aut 
700 1 |a Wang  |D Junying  |u College of Computer and Information Technology, China Three Gorges University, 443002, Yichang, Hubei, China  |4 aut 
773 0 |t Soft Computing  |d Springer Berlin Heidelberg  |g 19/6(2015-06-01), 1679-1699  |x 1432-7643  |q 19:6<1679  |1 2015  |2 19  |o 500 
856 4 0 |u https://doi.org/10.1007/s00500-014-1357-y  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s00500-014-1357-y  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Zang  |D Zhaoxiang  |u College of Computer and Information Technology, China Three Gorges University, 443002, Yichang, Hubei, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Li  |D Dehua  |u Institute for Pattern Recognition and Artificial Intelligence, Huazhong University of Science and Technology, 430074, Wuhan, Hubei, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Wang  |D Junying  |u College of Computer and Information Technology, China Three Gorges University, 443002, Yichang, Hubei, China  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Soft Computing  |d Springer Berlin Heidelberg  |g 19/6(2015-06-01), 1679-1699  |x 1432-7643  |q 19:6<1679  |1 2015  |2 19  |o 500