bomberman-rl-keras's People
bomberman-rl-keras's Issues
Add transformed coords of bombs to input feed
Resulting is a 5x5 matrix as input adding the transformed ๐ฃ, so we can train to avoid them
Plus reducing the number of displays of own position to one
Problem with perspective and correct order
Confirm whether left/right/top/bottom orientation of gym and real game are the same.
processor load not fully used in A3C
We should port Threads to Processors in order to gain more computational power.
Implement plugin to bombermanRLe
Don't forget to rebase!
DDPG experiment
A3C not working stable with few cores
We observed that A3C is not performing well, when run with 2 threads compared with 16. We assume that is caused by the design of A3C which requires multiple actors to work on global model. Mitigated by 0e73661 for a3c_coinman
Make crate field sparse
Currently number of crates is dense, should be switched to more sparse approach --> more different experience
Add history of events to input
Test on empty area
Similar to what has been done here one could test a trained agent on an empty gaming area and see how it behaves. As there are neither coins nor crates one would expect that bomberman runs around randomly, perhaps sometimes dropping a bomb.
Cythonize gym
Explore whether it is feasible to use Cython to accelerate our gym_bomberman.
Start in the middle of the field as fifth option
Bomberman needs to learn more extreme situation
DQN variants
Evaluate variants of DQN: double DQN and dueling DQN. By default the parameters in the DQNAgent constructor are like this enable_double_dqn=True, enable_dueling_network=False
.
tensorboard integration for dqn and a3c
Integration #2
Reintegrate and merge all branches into master
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.