Comments (4)
Well, I find the issue.
The RLInterface
dependency is in prototype.jl
file that is not part of tests.
I wanted to take a look at how to use DeepQLearning with POMDPs. There is an example with SubHunt.jl. Unfortunately not up to date.
from deepqlearning.jl.
Ok, so you are saying that there actually is not any problem related to RLInterface, right now? @MaximeBouton , could you leave a short comment explaining what prototype.jl and flux_test.jl are in those files?
I wanted to take a look at how to use DeepQLearning with POMDPs. There is an example with SubHunt.jl. Unfortunately not up to date.
Where is the example? After loading POMDPModelTools, you should be able to do convert(CommonRLInterface.AbstractEnv, SubHuntPOMDP())
or something like that. Deep Q learning will probably not work too well on that problem though - it is quite hard to plan without a belief updater in that environment.
from deepqlearning.jl.
I was looking for an example on how to use POMDPs with DeepQLearning.jl, found the prototype,jl where @MaximeBouton probably tested out if solver correctly works with SubHunt.jl. Then I somehow found out that RLInterface.jl is deprecated.
I think it may be beneficial to have an example POMDPs usage. SubHunt could serve as that.
However, the example should use CommonRLInterface. Also, the SubHunt.jl would need to have initialobs(::SubHuntPOMDP, ::SubState)
defined.
from deepqlearning.jl.
@Omastto1 thank you for reporting this, I should remove this prototype.jl file.
There is an example with POMDPs in the tests though, check https://github.com/JuliaPOMDP/DeepQLearning.jl/blob/master/test/runtests.jl#L150 DDRQN on tigerPOMDP
(let's close once I remove the file)
from deepqlearning.jl.
Related Issues (20)
- Support last Flux version with Float32 HOT 1
- Logging Training Information HOT 3
- Fix avgR in terminal while training HOT 1
- TensorBoardLogger.jl New Version Compatability HOT 2
- Compilation Error HOT 1
- Type of the discount factor. HOT 1
- Use only RLInterface.jl interface HOT 3
- DQExperience should support AbstractArrays HOT 1
- Dimension Mismatch HOT 6
- Support of AbtractEnvironment HOT 2
- Problem with reading log files HOT 1
- Exploration Policy requires a (PO)MDP
- Automatically convert to Float32
- TagBot trigger issue HOT 10
- Error: Can't differentiate loopinfo expression HOT 6
- Question: How would you make a decay schedule for prioritized replay alpha/beta? HOT 1
- Deprecation of `loadparams!`
- Action masking feature (legal actions) HOT 1
- GPU support HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deepqlearning.jl.