Comments (10)
Here is an update on the ghost robot that the trained robot will need to follow:
It will be used to train the RL agent following the Deep Mimic approach (at least, that's the hope for now).
- Some of the data is a bit choppy because the ghost uses IK to follow the mocap from the humanoid (which is hidden).
- The motion speed of each joint and of the robot base, as well as the joint calibration offsets, are adjustable (they need tweaking; some of them are shown).
- There are currently issues with joint retargeting (the robot's right foot), but this will be tweaked later.
ghost_robot_slerp_fn-2021-02-19_12.34.54.mp4
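The video file name suggests that a slerp function is used to smooth the ghost's motion between mocap frames. A minimal sketch of quaternion slerp is shown here for reference; the function name `slerp` and the (x, y, z, w) component order (the order pybullet uses) are assumptions, not the actual code behind the video.

```python
import math


def slerp(q0, q1, t):
    """Spherically interpolate between unit quaternions q0 and q1 (x, y, z, w)."""

    def normalize(q):
        norm = math.sqrt(sum(c * c for c in q))
        return [c / norm for c in q]

    q0, q1 = normalize(q0), normalize(q1)
    dot = sum(a * b for a, b in zip(q0, q1))

    # q and -q encode the same rotation; flip one to take the shorter arc.
    if dot < 0.0:
        q1 = [-c for c in q1]
        dot = -dot

    # Nearly identical rotations: fall back to normalized linear
    # interpolation to avoid dividing by a very small sine.
    if dot > 0.9995:
        return normalize([a + t * (b - a) for a, b in zip(q0, q1)])

    theta = math.acos(dot)
    sin_theta = math.sin(theta)
    w0 = math.sin((1.0 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(q0, q1)]


# Halfway between no rotation and a 90 degree turn about z (a 45 degree turn).
identity = [0.0, 0.0, 0.0, 1.0]
quarter_turn_z = [0.0, 0.0, math.sin(math.pi / 4), math.cos(math.pi / 4)]
halfway = slerp(identity, quarter_turn_z, 0.5)
```

Interpolating with a finer step in `t` between two mocap keyframes is one way to reduce choppiness, at the cost of slower playback.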
Warm Regards,
Rohit Kumar J
from deepworlds.
@tsampazk, Thank you for adding me as a contributor!
I am currently working on the Deep Mimic example in pybullet. I am testing methods for using the stock humanoid as a parent, so that any other similarly structured robot can be used for training without much initial setup. Here are some results:
In this video, the stock humanoid is driving the custom-designed robot using inverse kinematics (I'm hoping that this will be the reward function for the robot during training):
full_body_ik-2021-02-17.mp4
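As a rough illustration of how IK tracking could feed a reward, here is a minimal sketch of an end-effector tracking term: the reward decays exponentially with the squared distance between the robot's end effectors and the ghost's. The function name, the argument layout, and the `scale` default are all assumptions for illustration and would need tuning; this is not the reward actually used in the video.

```python
import math


def end_effector_reward(robot_positions, ghost_positions, scale=40.0):
    """Return a reward in (0, 1] that is 1.0 when every end effector matches.

    robot_positions, ghost_positions: lists of (x, y, z) world positions in
    metres, one entry per end effector (e.g. hands and feet), same order.
    scale: hypothetical sensitivity; larger values punish errors harder.
    """
    squared_error = 0.0
    for robot_p, ghost_p in zip(robot_positions, ghost_positions):
        squared_error += sum((r - g) ** 2 for r, g in zip(robot_p, ghost_p))
    return math.exp(-scale * squared_error)


# One foot is 10 cm away from its target, the other matches exactly.
ghost = [(0.0, 0.1, 0.0), (0.0, -0.1, 0.0)]
robot = [(0.1, 0.1, 0.0), (0.0, -0.1, 0.0)]
reward = end_effector_reward(robot, ghost)
```

An exponential of the negative error keeps the reward bounded and gives a smooth gradient towards the reference pose, which is the general shape of the imitation terms described in the Deep Mimic paper.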
Once complete, we can hopefully port this example to Webots.
Warm Regards,
Rohit
from deepworlds.
We are at the preliminary training stage using IK. The code structure this was initially built upon was... inelegant. However, the agents are stand-alone, so we may be able to rebuild the environment files while reusing the agents (or perhaps use the agents in deepworlds 🤔 💭). This may take some time, though.
Initial_training-2021-04-22_00.23.09.mp4
The checkpoint file (agent) here is at 18 million samples. According to the Deep Mimic paper, it takes about 61 million samples for the stock humanoid to achieve a perfect walking gait and 48 million samples for Atlas. They also mention that it takes 2 days to train the humanoid. Reaching the 18 million sample mark seen in this video took me 24 hours of training with 12 (or was it 6? 🤔) cores (actually on my friend's PC). I think it needs some tuning to improve the results.
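For a rough sense of scale, the figures above can be extrapolated under the assumption that sample throughput stays constant at 18 million samples per 24 hours on the same machine. The helper name below is made up for illustration, and the estimate says nothing about whether this robot needs the same number of samples as the humanoid or Atlas.

```python
def estimated_hours(target_samples, samples_so_far=18e6, hours_so_far=24.0):
    """Extrapolate total training time, assuming constant samples per hour."""
    samples_per_hour = samples_so_far / hours_so_far
    return target_samples / samples_per_hour


humanoid_hours = estimated_hours(61e6)  # sample count quoted for the stock humanoid
atlas_hours = estimated_hours(48e6)     # sample count quoted for Atlas

print(f"humanoid: {humanoid_hours:.1f} h, atlas: {atlas_hours:.1f} h")
```

At this rate, 61 million samples would take about 81 hours (roughly 3.4 days) and 48 million samples about 64 hours, so the current setup is somewhat slower than the 2 days mentioned for the humanoid.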
Hopefully, I can begin developing this example on Webots once it is fully trained.
Warm Regards,
Rohit Kumar J
PS: The robot's sudden jump at 00:12 was me dragging it with the mouse :D
from deepworlds.
@all-contributors please add @rohit-kumar-j for ideas
from deepworlds.
I've put up a pull request to add @rohit-kumar-j! 🎉
from deepworlds.
@all-contributors please add @rohit-kumar-j for ideas
from deepworlds.
I've put up a pull request to add @rohit-kumar-j! 🎉
from deepworlds.
@rohit-kumar-j This looks really promising Rohit! I'm looking forward to seeing the complete example, so as to start working on porting it to Webots. I think it would make for an impressive example to be added in the deepworlds repository.
from deepworlds.
@tsampazk, I agree. Unfortunately, I do not know the Webots codebase and methods, so I can help out with the logic and implementation while learning Webots at the same time. I hope it is okay if I post updates on the example in this thread itself.
Warm Regards,
Rohit Kumar J
from deepworlds.
> I hope it is okay if I post updates on the example in this thread itself.
@rohit-kumar-j Yep, sounds fine, go ahead. 😀
from deepworlds.