anacarsi / ss24_ReinforcementLearning_BomberMan

Repository for the implementation of a reinforcement learning agent for the game of Bomber Man.

0 stars 0 forks source link

Future Experiments for Report #3

Closed anacarsi closed 1 month ago

anacarsi commented 2 months ago

Experiments to Justify Neural Network Usage

We will try to justify the hyperparameters and architecture used based on different experiments.

Ana -> Performance and Training experiments (state representation, reward shaping, exploration strategies, time metrics)

Experiment	Key	Purpose
Baseline Comparison	Compare NN with non-NN strategies (e.g., rule-based, tabular Q-learning)	show improvements over simpler or heuristic methods
Ablation Study	remove/alter neural network components (e.g., layers, inputs)	find the most important features of the neural network and how changes affect performance
Hyperparameter Tuning	vary learning rate, hidden layers, neurons, etc.	find the optimal hyperparameters and justify the neural network's configuration
Generalization Test (Robustness)	train in one environment, test in different/altered environments	demonstrate that the neural network can generalize to unseen scenarios and is robust to environmental changes
Training time vs. Performance	track performance over time and compare with simpler methods	show the efficiency of the neural network in terms of how quickly it converges compared to other approaches
Exploration vs. Exploitation	test different exploration strategies (fixed/decaying epsilon)	impact of exploration strategies on learning and performance of the agent
Reward Shaping	modify reward structure for different objectives	show how different rewards affect the agent's behavior, e.g. focus on survival or objectives
State Representation Analysis	use different state representations (e.g., raw pixels vs. features)	validate the feature representation and show how different state inputs affect performance and learning speed
Policy Visualization	visualize agent decisions, action distributions	try to show that the neural network is making intelligent decisions

anacarsi commented 1 month ago

We successfullly developed the experiments and took insights from them. They are published in our final report.