Librería: California Books, Miami, FL, Estados Unidos de America
EUR 25,73
Cantidad disponible: Más de 20 disponibles
Añadir al carritoCondición: New.
Librería: AHA-BUCH GmbH, Einbeck, Alemania
EUR 17,95
Cantidad disponible: 1 disponibles
Añadir al carritoTaschenbuch. Condición: Neu. Druck auf Anfrage Neuware - Printed after ordering - Bachelor Thesis from the year 2021 in the subject Engineering - Computer Engineering, grade: 1,3, Hamburg University of Technology (Embedded Systems), language: English, abstract: This bachelor thesis aims to illustrate the idea behind Markov Decision Processes (MDP) and to present a few basic methods of Reinforcement Learning (RL) namely Monte Carlo Learning and Q-Learning, which are the solutions for decision problems modelled by MDPs. For the last section we apply these methods on an application and in the end discuss the results. Let us imagine the scenario where we put a hamster inside a maze, we expect the hamster to go through the maze till it reaches some point we considered as the goal. Well, it may randomly work but most of the time it won¿t. At this place, the hamster does not know how important this particular point remains namely the goal. But how will it be, when we remunerate the hamster once the goal is reached, he receives a reward for example a piece of cheese. The hamster will start to remember the route, which leads to the cheese and he maybe will learn to go the easy and quick way to achieve this goal. What we did, is that we reinforce the good behavior of the hamster by giving it some reward.
Librería: preigu, Osnabrück, Alemania
EUR 17,95
Cantidad disponible: 5 disponibles
Añadir al carritoTaschenbuch. Condición: Neu. Methods of Machine Learning and their Application. The Basics of Markov Decision Processes and Reinforcement Learning | Omar Baiazid | Taschenbuch | Englisch | 2021 | GRIN Verlag | EAN 9783346518194 | Verantwortliche Person für die EU: preigu GmbH & Co. KG, Lengericher Landstr. 19, 49078 Osnabrück, mail[at]preigu[dot]de | Anbieter: preigu.
Librería: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Alemania
EUR 17,95
Cantidad disponible: 2 disponibles
Añadir al carritoTaschenbuch. Condición: Neu. This item is printed on demand - it takes 3-4 days longer - Neuware -Bachelor Thesis from the year 2021 in the subject Engineering - Computer Engineering, grade: 1,3, Hamburg University of Technology (Embedded Systems), language: English, abstract: This bachelor thesis aims to illustrate the idea behind Markov Decision Processes (MDP) and to present a few basic methods of Reinforcement Learning (RL) namely Monte Carlo Learning and Q-Learning, which are the solutions for decision problems modelled by MDPs. For the last section we apply these methods on an application and in the end discuss the results. Let us imagine the scenario where we put a hamster inside a maze, we expect the hamster to go through the maze till it reaches some point we considered as the goal. Well, it may randomly work but most of the time it won¿t. At this place, the hamster does not know how important this particular point remains namely the goal. But how will it be, when we remunerate the hamster once the goal is reached, he receives a reward for example a piece of cheese. The hamster will start to remember the route, which leads to the cheese and he maybe will learn to go the easy and quick way to achieve this goal. What we did, is that we reinforce the good behavior of the hamster by giving it some reward. 32 pp. Englisch.
Idioma: Inglés
Publicado por GRIN Verlag, GRIN Verlag Nov 2021, 2021
ISBN 10: 3346518191 ISBN 13: 9783346518194
Librería: buchversandmimpf2000, Emtmannsberg, BAYE, Alemania
EUR 17,95
Cantidad disponible: 1 disponibles
Añadir al carritoTaschenbuch. Condición: Neu. This item is printed on demand - Print on Demand Titel. Neuware -Bachelor Thesis from the year 2021 in the subject Engineering - Computer Engineering, grade: 1,3, Hamburg University of Technology (Embedded Systems), language: English, abstract: This bachelor thesis aims to illustrate the idea behind Markov Decision Processes (MDP) and to present a few basic methods of Reinforcement Learning (RL) namely Monte Carlo Learning and Q-Learning, which are the solutions for decision problems modelled by MDPs. For the last section we apply these methods on an application and in the end discuss the results. Let us imagine the scenario where we put a hamster inside a maze, we expect the hamster to go through the maze till it reaches some point we considered as the goal. Well, it may randomly work but most of the time it won¿t. At this place, the hamster does not know how important this particular point remains namely the goal. But how will it be, when we remunerate the hamster once the goal is reached, he receives a reward for example a piece of cheese. The hamster will start to remember the route, which leads to the cheese and he maybe will learn to go the easy and quick way to achieve this goal. What we did, is that we reinforce the good behavior of the hamster by giving it some reward. 32 pp. Englisch.