ahn-kj / dqn-tensorflow
This project forked from devsisters/dqn-tensorflow
Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning
License: MIT License
akjhhui-MacBook-Pro:DQN-tensorflow akjhh$ python main.py --env_name=Breakout-v0 --is_train=True --display=True
Traceback (most recent call last):
  File "main.py", line 4, in <module>
    from dqn.agent import Agent
  File "/Users/akjhh/git/javacafe/DQN-tensorflow/dqn/agent.py", line 11, in <module>
    from .replay_memory import ReplayMemory
  File "/Users/akjhh/git/javacafe/DQN-tensorflow/dqn/replay_memory.py", line 8, in <module>
    from .utils import save_npy, load_npy
  File "/Users/akjhh/git/javacafe/DQN-tensorflow/dqn/utils.py", line 2, in <module>
    import _pickle as cPickle
ImportError: No module named _pickle
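The first traceback is what Python 2 prints when asked for `_pickle`, which is the Python 3 name of the C pickle implementation (the later paths in this log point at `python2.7`). A minimal sketch of a version-tolerant import, assuming `dqn/utils.py` only needs the name `cPickle` for dumping and loading. The sample dictionary and round-trip are illustrative and not code from the repository:

```python
# Hypothetical replacement for the failing import line in dqn/utils.py.
try:
    import cPickle  # Python 2: C-accelerated pickle module
except ImportError:
    import pickle as cPickle  # Python 3: pickle uses the C implementation when available

# Illustrative round-trip showing the alias behaves the same on both versions.
sample = {"env_name": "Breakout-v0", "history_length": 4}
restored = cPickle.loads(cPickle.dumps(sample))
```

With this in place the same file imports under either interpreter, so the choice of Python version no longer decides whether `main.py` starts.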
akjhhui-MacBook-Pro:DQN-tensorflow akjhh$ python main.py --env_name=Breakout-v0 --is_train=True --display=True
[*] GPU : 1.0000
[2016-12-05 13:55:49,125] Making new env: Breakout-v0
{'_save_step': 500000,
'_test_step': 50000,
'action_repeat': 4,
'backend': 'tf',
'batch_size': 32,
'cnn_format': 'NCHW',
'discount': 0.99,
'display': True,
'double_q': False,
'dueling': False,
'env_name': 'Breakout-v0',
'env_type': 'detail',
'ep_end': 0.1,
'ep_end_t': 1000000,
'ep_start': 1.0,
'history_length': 4,
'learn_start': 50000.0,
'learning_rate': 0.00025,
'learning_rate_decay': 0.96,
'learning_rate_decay_step': 50000,
'learning_rate_minimum': 0.00025,
'max_delta': 1,
'max_reward': 1.0,
'max_step': 50000000,
'memory_size': 1000000,
'min_delta': -1,
'min_reward': -1.0,
'model': 'm1',
'random_start': 30,
'scale': 10000,
'screen_height': 84,
'screen_width': 84,
'target_q_update_step': 10000,
'train_frequency': 4}
[*] Loading checkpoints...
[!] Load FAILED: checkpoints/Breakout-v0/min_delta--1/max_delta-1/history_length-4/train_frequency-4/target_q_update_step-10000/double_q-False/memory_size-1000000/action_repeat-4/ep_end_t-1000000/dueling-False/min_reward--1.0/backend-tf/random_start-30/scale-10000/env_type-detail/learning_rate_decay_step-50000/ep_start-1.0/screen_width-84/learn_start-50000.0/cnn_format-NCHW/learning_rate-0.00025/batch_size-32/discount-0.99/max_step-50000000/max_reward-1.0/learning_rate_decay-0.96/learning_rate_minimum-0.00025/env_name-Breakout-v0/ep_end-0.1/model-m1/screen_height-84/
2016-12-05 13:55:52.135 Python[64291:368607] ApplePersistenceIgnoreState: Existing state will not be touched. New state will be written to /var/folders/qt/2s1ztgn16w91j4rb66y4f_q00000gn/T/org.python.python.savedState
0%| | 49999/50000000 [10:32<258:27:47, 53.68it/s]
E tensorflow/core/common_runtime/executor.cc:334] Executor failed to create kernel. Invalid argument: CPU BiasOp only supports NHWC.
[[Node: target/target_l1/BiasAdd = BiasAdd[T=DT_FLOAT, data_format="NCHW", _device="/job:localhost/replica:0/task:0/cpu:0"](target/target_l1/Conv2D, target/target_l1/biases/read)]]
Traceback (most recent call last):
  File "main.py", line 66, in <module>
    tf.app.run()
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 30, in run
    sys.exit(main(sys.argv[:1] + flags_passthrough))
  File "main.py", line 61, in main
    agent.train()
  File "/Users/akjhh/git/javacafe/DQN-tensorflow/dqn/agent.py", line 57, in train
    self.observe(screen, reward, action, terminal)
  File "/Users/akjhh/git/javacafe/DQN-tensorflow/dqn/agent.py", line 136, in observe
    self.q_learning_mini_batch()
  File "/Users/akjhh/git/javacafe/DQN-tensorflow/dqn/agent.py", line 158, in q_learning_mini_batch
    q_t_plus_1 = self.target_q.eval({self.target_s_t: s_t_plus_1})
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/framework/ops.py", line 559, in eval
    return _eval_using_default_session(self, feed_dict, self.graph, session)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/framework/ops.py", line 3761, in _eval_using_default_session
    return session.run(tensors, feed_dict)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 717, in run
    run_metadata_ptr)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 915, in _run
    feed_dict_string, options, run_metadata)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 965, in _do_run
    target_list, options, run_metadata)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/client/session.py", line 985, in _do_call
    raise type(e)(node_def, op, message)
tensorflow.python.framework.errors.InvalidArgumentError: CPU BiasOp only supports NHWC.
[[Node: target/target_l1/BiasAdd = BiasAdd[T=DT_FLOAT, data_format="NCHW", _device="/job:localhost/replica:0/task:0/cpu:0"](target/target_l1/Conv2D, target/target_l1/biases/read)]]
Caused by op u'target/target_l1/BiasAdd', defined at:
  File "main.py", line 66, in <module>
    tf.app.run()
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/platform/app.py", line 30, in run
    sys.exit(main(sys.argv[:1] + flags_passthrough))
  File "main.py", line 58, in main
    agent = Agent(config, env, sess)
  File "/Users/akjhh/git/javacafe/DQN-tensorflow/dqn/agent.py", line 30, in __init__
    self.build_dqn()
  File "/Users/akjhh/git/javacafe/DQN-tensorflow/dqn/agent.py", line 241, in build_dqn
    32, [8, 8], [4, 4], initializer, activation_fn, self.cnn_format, name='target_l1')
  File "/Users/akjhh/git/javacafe/DQN-tensorflow/dqn/ops.py", line 26, in conv2d
    out = tf.nn.bias_add(conv, b, data_format)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/ops/nn_ops.py", line 393, in bias_add
    return gen_nn_ops._bias_add(value, bias, data_format=data_format, name=name)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/ops/gen_nn_ops.py", line 279, in _bias_add
    data_format=data_format, name=name)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/framework/op_def_library.py", line 749, in apply_op
    op_def=op_def)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/framework/ops.py", line 2380, in create_op
    original_op=self._default_original_op, op_def=op_def)
  File "/usr/local/lib/python2.7/site-packages/tensorflow/python/framework/ops.py", line 1298, in __init__
    self._traceback = _extract_stack()
InvalidArgumentError (see above for traceback): CPU BiasOp only supports NHWC.
[[Node: target/target_l1/BiasAdd = BiasAdd[T=DT_FLOAT, data_format="NCHW", _device="/job:localhost/replica:0/task:0/cpu:0"](target/target_l1/Conv2D, target/target_l1/biases/read)]]
akjhhui-MacBook-Pro:DQN-tensorflow akjhh$
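The second failure appears at step 49999 of the progress bar, right around `learn_start` (50000), when the first mini-batch is evaluated. The config has `'cnn_format': 'NCHW'`, but the `BiasAdd` node is placed on `cpu:0`, and the CPU kernel in this TensorFlow build only accepts NHWC. A minimal sketch of one possible workaround, assuming the layout is chosen from whether a GPU is actually used. `pick_cnn_format` and `to_nhwc` are hypothetical helpers, not functions from this repository, and NumPy arrays stand in for the TensorFlow tensors:

```python
import numpy as np

def pick_cnn_format(use_gpu):
    """Return the layout string: NCHW on GPU, NHWC on CPU (assumed policy)."""
    return "NCHW" if use_gpu else "NHWC"

def to_nhwc(batch_nchw):
    """Move the channel axis last: (N, C, H, W) -> (N, H, W, C)."""
    return np.transpose(batch_nchw, (0, 2, 3, 1))

# Shapes follow the printed config: batch_size=32, history_length=4, 84x84 screens.
states_nchw = np.zeros((32, 4, 84, 84), dtype=np.float32)
cnn_format = pick_cnn_format(use_gpu=False)
states = to_nhwc(states_nchw) if cnn_format == "NHWC" else states_nchw
```

For the run logged here, that would mean building the network with `cnn_format` set to `NHWC` whenever the ops end up on the CPU.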