Comments (2)
Hello,
The original implementation/paper uses [400, 300] neural net for DDPG and TD3.
It was only changed after the paper release here: sfujim/TD3@a8d53f7
We are using the original paper hyperparameters as they have shown to be working quite well,
see https://arxiv.org/abs/2005.05719 .
from stable-baselines3.
I see. Thanks for clearing that up
from stable-baselines3.
Related Issues (20)
- [Feature Request] Resume trained model with set_parameters without reset_num_timesteps HOT 4
- How does stable-baselines work with a multi-agent pettingzoo environment? HOT 1
- [Question] [Multiprocessing] RolloutBuffer groups environment transitions on a per-environment basis. HOT 1
- How to elegantly modify an algorithm by adding new architectures trained with custom losses? HOT 2
- Why does VecFrameStack clear the prior frames in the stack for the step when "terminated=True"? HOT 2
- [Question] influence of buffer size when using vecenv and save customized replay buffer HOT 2
- Training of PPO freezes after number of iterations HOT 8
- [Question] Discretize continuous actions/observations ? HOT 1
- Why does the Logger only return the train/ metrics, and not eval/, time/, and rollout/? HOT 1
- [Question] How to pass a varying gamma to DQN or PPO during training? HOT 5
- [Bug]: EOFError after running for some steps HOT 1
- [Question] Saving PPO rollout buffer on GPU HOT 2
- Issue(HER with in SAC algorithm) HOT 2
- [Question] CheckpointCallback keep last K HOT 2
- [Bug]: Potential Bug in PPO? Clarification requested HOT 2
- Off policy algorithm policy_kwargs HOT 2
- [Feature Request] Enable predict to take tensor as input HOT 3
- [Question] policy gradient loss and explained variance very small (almost zero) from the training start? HOT 2
- [Question] Discontinuous reward training curve HOT 4
- [Bug]: if learning_rate function uses special types, they can cause torch.load to fail when weights_only=True HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from stable-baselines3.