Verification of Reinforcement Learning Models:

A, Patrick Jeeva

Verification of Reinforcement Learning Models:

dc.contributor.author	A, Patrick Jeeva
dc.date.accessioned	2025-02-05T10:59:13Z
dc.date.available	2025-02-05T10:59:13Z
dc.date.issued	2024-06
dc.description	Dissertation under the supervision of Dr. Swarup Mohalik and Dr. Ansuman Banerjee.	en_US
dc.description.abstract	In recent years of advancements in reinforcement learning (RL), utilizing neural network based models to make decisions in dynamic and complex environments has emerged as a powerful paradigm. In particular, model based reinforcement learning has been widely used for its ability to increase learning efficiency and performance. By constructing an environment model beforehand, the agent attains a prior knowledge of the dynamics of the model to take informed decisions and converge fast to optimal policies. Real-world environments are often intricate and subject to external disturbances, posing substantial challenges for accurate modeling. Addressing these challenges requires the application of sophisticated neural network-based models that can effectively approximate the underlying environment dynamics. In this work, we develop and evaluate extensive neural network models, specifically focusing on Gaussian Ensemble models, Bayesian neural networks, and Monte Carlo Dropout techniques, to approximate various standard gym environments. These models are trained on different numbers of samples to understand their efficiency and accuracy in capturing environment dynamics. Once trained, the neural network models are used to construct Markov Decision Processes (MDPs) with various discretization strategies. The constructed MDPs are then analyzed and compared to evaluate the performance of each neural network approach. The purpose of this thesis is to present a comprehensive study on the construction of environment models using advanced neural network techniques. We aim to approximate the standard environments in the reinforcement learning setup, utilizing a variety of neural networks and compare the efficiency based on the reconstruction of MDPs.	en_US
dc.identifier.citation	65p.	en_US
dc.identifier.uri	http://hdl.handle.net/10263/7506
dc.language.iso	en	en_US
dc.publisher	Indian Statistical Institute, Kolkata	en_US
dc.relation.ispartofseries	MTech(CS) Dissertation;22-20
dc.subject	Gaussian Ensemble Model	en_US
dc.subject	Bayesian Neural Network	en_US
dc.subject	Monte Carlo Dropout Model	en_US
dc.subject	Markov Decision Processes	en_US
dc.title	Verification of Reinforcement Learning Models:	en_US
dc.title.alternative	Comparing Construction of Environment Models	en_US
dc.type	Other	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Patrick_jeeva-Cs2220-Mtech-CS-2024.pdf
Size:: 2.63 MB
Format:: Adobe Portable Document Format
Description:: Dissertations - M Tech (CS)

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Dissertations - M Tech (CS)