dqn-hfo

This is an continuous action deep reinforcement learning agent for the RoboCup 2D domain. The domain can be found and downloaded from https://github.com/mhauskn/HFO.

This repo is designed to work with a specific version of Caffe (commit 2ef584785c8ade90260eb117f189146364494183) with the following minor changes:

--- a/include/caffe/solver.hpp
+++ b/include/caffe/solver.hpp
@@ -67,6 +67,7 @@ class Solver {
     return test_nets_;
   }
   int iter() { return iter_; }
+  void set_iter(int new_iter) { iter_ = new_iter; }

   // Invoked at specific points during an iteration
   class Callback {
@@ -84,7 +85,6 @@ class Solver {

   void CheckSnapshotWritePermissions();

- protected:
   // Make and apply the update value for the current iteration.
   virtual void ApplyUpdate() = 0;
   // The Solver::Snapshot function implements the basic snapshotting utility
@@ -95,6 +95,7 @@ class Solver {
   string SnapshotFilename(const string extension);
   string SnapshotToBinaryProto();
   string SnapshotToHDF5();
+ protected:
   // The test routine
   void TestAll();
   void Test(const int test_net_id = 0);

Installation

First install the correct version of Caffe:
git clone https://github.com/BVLC/caffe.git
cd caffe && git checkout 2ef584785c8ade90260eb117f189146364494183
Apply the changes to solver listed above
Follow installation instructions at https://github.com/BVLC/caffe
Next install HFO:
git clone https://github.com/LARG/HFO.git
Follow installation instructions at https://github.com/LARG/HFO
Now we are ready to install dqn-hfo:
git clone https://github.com/mhauskn/dqn-hfo.git
cd dqn-hfo && mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release -DCAFFE_ROOT_DIR=/u/mhauskn/projects/caffe/ -DHFO_ROOT_DIR=/u/mhauskn/projects/HFO/ .. You will have to change the paths to point to your installation of caffe and HFO
make -j4
Run a test job: mkdir state && ./bin/dqn -save state/test -alsologtostderr

Errors

Cannot find cublas_v2.h:

device_alternate.hpp:34:23: fatal error: cublas_v2.h: No such file or directory
 #include <cublas_v2.h>
                       ^
compilation terminated.

Solution: Include your Cuda path in the installation:

locate cublas_v2.h -- this should give you the path to your cuda installation
export CPLUS_INCLUDE_PATH=/your/cuda/path:$CPLUS_INCLUDE_PATH

Cannot find caffe.pb.h:

caffe/include/caffe/blob.hpp:9:34: fatal error: caffe/proto/caffe.pb.h:
No such file or directory
 #include "caffe/proto/caffe.pb.h"
                                  ^
compilation terminated.

Solution: Symlink the built proto files.

cd your_caffe_dir/include/caffe
ln -s ../../.build_release/src/caffe/proto/ .

Citing

If this repository has helped your research, please cite the following:

@InProceedings{ICLR16-hausknecht,
  author = {Matthew Hausknecht and Peter Stone},
  title = {Deep Reinforcement Learning in Parameterized Action Space},
  booktitle = {Proceedings of the International Conference on Learning Representations (ICLR)},
  location = {San Juan, Puerto Rico},
  month = {May},
  year = {2016},
}

Name		Name	Last commit message	Last commit date
Latest commit History 145 Commits
cmake/Modules		cmake/Modules
scripts		scripts
src		src
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
matplotlibrc		matplotlibrc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dqn-hfo

Installation

Errors

Cannot find cublas_v2.h:

Cannot find caffe.pb.h:

Citing

About

Releases

Packages

Languages

License

mhauskn/dqn-hfo

Folders and files

Latest commit

History

Repository files navigation

dqn-hfo

Installation

Errors

Cannot find cublas_v2.h:

Cannot find caffe.pb.h:

Citing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages