aws-deepracer-community / deepracer-simapp
Extract from AWS DeepRacer Robomaker Bundle
License: MIT License
The kinesis_video_streamer package is missing, so it is impossible to stream to KVS.
During training, in the phase where Sagemaker does policy training and Robomaker does evaluation runs, the system occasionally fails to transition back to training a new iteration once the policy training is done and the final evaluation run is completed.
Last lines of Sagemaker logs:
Policy training> Surrogate loss=-0.018119553104043007, KL divergence=0.0014306496595963836, Entropy=0.3204973340034485, training epoch=8, learning_rate=1e-06
Policy training> Surrogate loss=-0.015547456219792366, KL divergence=0.0013924918603152037, Entropy=0.3194952607154846, training epoch=9, learning_rate=1e-06
Checkpoint> Saving in path=['./checkpoint/411_Step-87087.ckpt']
Uploaded 3 files for checkpoint 411 in 0.95 seconds
saved intermediate frozen graph: Champs-May-12/model/model_411.pb
Best checkpoint number: 398, Last checkpoint number: 409
Copying the frozen checkpoint from ./frozen_models/agent/model_398.pb to /opt/ml/model/agent/model.pb.
Deleting the frozen models in s3 for the iterations: {'408'}
Last lines of Robomaker logs:
DEBUG: s: 167.0, wp_p: 144, wp_n: 145, wp_f: 5, rew: 0.0, prog: 0.94, saf: 172.0, eff: 1.66, d1: 23.04, d2: 69.02
DEBUG: s: 168.0, wp_p: 145, wp_n: 146, wp_f: 6, rew: 0.0, prog: 0.94, saf: 173.0, eff: 1.63, d1: 30.23, d2: 72.07
Testing> Name=main_level/agent, Worker=0, Episode=760, Total reward=338.25, Steps=88095, Training iteration=0
## agent: Finished evaluation phase. Success rate = 0.0, Avg Total Reward = 338.25
The reward function in this case outputs one line per step, so the absence of further output makes it clear that no evaluation is ongoing. It seems as if the two systems are waiting on each other to progress.
In the video stream one sees the car standing in a corner (against a wall) or driving in circles, depending on the last action that was sent. The environment was not reset.
The issue is seen in maybe 1% of transitions between evaluation and training.
I have tried running most of the worlds (particularly Las Vegas, Spain, Canada), but the AWS Track does not seem to load correctly. It hangs on the find file function and then terminates.
When will the Rogue Circuit track be uploaded? It is a track for AWS Student.
@jamesjennens - this one is for you; the addition you did to rollout_worker.py is needed in evaluation_worker.py as well.
[ERROR] [1625995045.682140900, 12.518000000]: [KinesisVideoStreamSetup] Skipping stream id 0 due to failure initializing stream. Error code: 4100 Failed to setup the kinesis video streamer
[ERROR] [1625995045.682206300, 12.518000000]: [InitializeStreamSubscriptions] KinesisVideoStreamerSetup failed with error code : 4096. Exiting
In the Robomaker logs one can read: Could not find encoder for codec id 27: Encoder not found
When cloning a model, the "best_checkpoint" is used rather than the "last_checkpoint". This can cause a significant 'backwards' jump in training, especially as the best checkpoint is selected using completion as the only metric.
See: 48c2065/bundle/sagemaker_rl_agent/lib/python3.5/site-packages/markov/training_worker.py#L319
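The behaviour requested above can be sketched as a small selection helper. This is only an illustration under stated assumptions: the function name `checkpoint_for_clone`, the `prefer` parameter, and the dict layout are hypothetical and are not taken from the markov package; the numbers 398 and 409 are the best and last checkpoint numbers from the Sagemaker log quoted earlier.

```python
def checkpoint_for_clone(checkpoints, prefer="last"):
    """Pick the checkpoint a cloned model should start from.

    `checkpoints` is a hypothetical dict with the keys
    "best_checkpoint" and "last_checkpoint" (either may be missing).
    """
    if prefer not in ("last", "best"):
        raise ValueError("prefer must be 'last' or 'best'")
    other = "best" if prefer == "last" else "last"
    # Try the preferred entry first, then fall back to the other one.
    for kind in (prefer, other):
        value = checkpoints.get(kind + "_checkpoint")
        if value is not None:
            return value
    raise KeyError("no checkpoint recorded")


# Values from the log excerpt: best = 398, last = 409.
recorded = {"best_checkpoint": 398, "last_checkpoint": 409}
print(checkpoint_for_clone(recorded))
print(checkpoint_for_clone(recorded, prefer="best"))
```

Defaulting to the last checkpoint would let a clone continue where training stopped, while an explicit `prefer="best"` would keep the current behaviour available.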
The code has moved into ./bundle/sagemaker_rl_agent/lib/python3.5/site-packages/markov/metrics/s3_metrics.py; it still relies heavily on time.time(). It needs to be improved to use rospy.get_time(); see larsll/deepracer-for-cloud@69a43bb
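One way to approach the change is to route all timing through an injectable clock, so that metrics code no longer calls time.time() directly. The sketch below is not the code in s3_metrics.py or in the referenced commit; `make_clock` and `EpisodeTimer` are hypothetical names, and the fallback to wall-clock time outside ROS is an assumption made so the example runs anywhere.

```python
import time


def make_clock(use_ros_time=True):
    """Return a zero-argument function giving the current time in seconds."""
    if use_ros_time:
        try:
            import rospy  # only importable inside a ROS environment
            # rospy.get_time() follows simulation time when use_sim_time
            # is set; it needs an initialized node before it is called.
            return rospy.get_time
        except ImportError:
            pass
    # Assumed fallback: wall-clock time when rospy is unavailable.
    return time.time


class EpisodeTimer:
    """Measures elapsed time with whichever clock it is given."""

    def __init__(self, clock):
        self._clock = clock
        self._start = None

    def start(self):
        self._start = self._clock()

    def elapsed_ms(self):
        if self._start is None:
            raise RuntimeError("start() has not been called")
        return (self._clock() - self._start) * 1000.0


timer = EpisodeTimer(make_clock())
timer.start()
print(timer.elapsed_ms() >= 0.0)
```

Passing the clock in also makes the timing logic testable with a fake clock, independent of both ROS and the wall clock.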