UMP Institutional Repository

Policy abstraction for transfer learning using learning vector quantization in reinforcement learning framework

Ahmad Afif, Mohd Faudzi (2015) Policy abstraction for transfer learning using learning vector quantization in reinforcement learning framework. PhD thesis, Kyushu University.

[img]
Preview
PDF
FKEE - AHMAD AFIF MOHD FAUDZI.PDF

Download (1MB) | Preview
[img]
Preview
PDF
FKEE - AHMAD AFIF MOHD FAUDZI - CHAP 1.PDF

Download (200kB) | Preview
[img]
Preview
PDF
FKEE - AHMAD AFIF MOHD FAUDZI - CHAP 3.PDF

Download (182kB) | Preview

Abstract

People grow up every day exposed to the infinite state space environment interacting with active biological subjects and machines. There are routines that are always expected and unpredicted events that are not completely known beforehand as well. When people interact with the future routines, they do not require the same effort as they do during the first time. Based on experience, irrelevant information that does not affect the achievement is ignored. For example, a new worker in his/her first day will carefully recognize the road to his/her office, including the road's name, signboards, and buildings as well as focusing on the traffic. After several months he/she, possibly, will focus only on buildings and traffic. Furthermore, when people interact with an unpredicted event, they will usually try to cope with the situation using their knowledge that is acquired from their past experience. For example, an accident happened and the worker's daily route was jammed, here, he/she will try to find the alternate route based on the distance and the location of his/her office. This shows that people have an ability to benefit from their previous experience and knowledge for the future. Furthermore, the knowledge is not stored in a concrete or very detailed form, but in an abstract form that is ready to be used for routine events and also to be used for assisting in unknown events. Such abilities are obviously acquired through the most significant ability of a human being, which is learning ability from its successes and failures.

Item Type: Thesis (PhD)
Additional Information: Thesis (Doctor of Engineering) -- Kyushu University - 2015
Uncontrolled Keywords: Policy; transfer learning; vector quantization
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering
Faculty/Division: Faculty of Electrical & Electronic Engineering
Depositing User: Ms. Nurezzatul Akmal Salleh
Date Deposited: 24 Jun 2016 00:42
Last Modified: 24 Jun 2016 00:42
URI: http://umpir.ump.edu.my/id/eprint/13521
Download Statistic: View Download Statistics

Actions (login required)

View Item View Item