Comments (2)
It should be in the same folder inside of data as the pngs.
Note that you need to wait until the RL part of the algorithm is running. If you see output about "train/epoch", then it is still pretraining the VAE. Once you see the log outputting information about the "Policy Loss", then that means RL is running.
Lastly, save_video_period=100 means that videos will only be logged once every 100 epochs. If you want to save videos more frequently, reduce this number.
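As a rough illustration of the two points above, here is a minimal sketch. It is not rlkit code: `training_phase` and `epochs_with_video` are hypothetical helpers written for this comment. It assumes the log strings quoted above appear verbatim in the output, and that videos are written on epochs divisible by save_video_period (whether epoch 0 counts is also an assumption).

```python
def training_phase(log_text):
    """Guess the current phase from a chunk of log output.

    Hypothetical helper: it only looks for the two strings mentioned in
    the comment above, "Policy Loss" (RL) and "train/epoch" (VAE pretraining).
    """
    if "Policy Loss" in log_text:
        return "rl"
    if "train/epoch" in log_text:
        return "vae_pretraining"
    return "unknown"


def epochs_with_video(num_epochs, save_video_period):
    """List the epochs on which a video would be logged.

    Assumption: a video is saved whenever the epoch number is divisible
    by save_video_period, starting from epoch 0.
    """
    if save_video_period <= 0:
        raise ValueError("save_video_period must be a positive integer")
    return [epoch for epoch in range(num_epochs) if epoch % save_video_period == 0]


if __name__ == "__main__":
    print(training_phase("train/epoch 12"))
    print(training_phase("Policy Loss 0.031"))
    print(epochs_with_video(301, 100))
    print(epochs_with_video(301, 25))
```

Under these assumptions, save_video_period=100 over 301 epochs gives videos at epochs 0, 100, 200, and 300, which is why nothing new shows up for a long time on a slow GPU. Reducing the value to 25 would log a video four times as often.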
from rlkit.
Thank you, sir. I can see the videos for epochs 100, 200, and 300 now (my GPU is slow, so I need more time).
In reality, I have two Dobot robotic arms and some cameras. Where can I change the robotic arms and cameras in the sample experiment, and how can I test in the real world?
Sorry if my question seems a little stupid. I have not learned much about reinforcement learning or PyTorch (or even machine learning and deep learning); in other words, I am a rookie. I read the 'RIG' paper, and I think I mostly get the idea. Could you give me some advice on understanding the algorithm, or DRL, better?