Giter Club home page Giter Club logo

rl-reading-group-fall18-'s Introduction

Reinforcement Learning Reading Group (Fall-Spring '18,'19)

Our weekly reading group where we discuss Reinforcement Learning approaches in the domain of Natural Language Processing.

Please cite this github repository if you are using our slides.

Last updated: 04/04/2019

Meetings

  • Reinforcement Learning for Sequence Learning in Neural Models

    Presented by Yanjun Gao.

    Readings:

    Find the presentation here.

  • Deep Q-Learning

    Presented by Kyriaki Zafeiroudi.

    Reading: Playing Atari with Deep Reinforcement Learning by Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin A. Riedmiller. [3]

    Find the presentation here.

  • Background of Reinforcement Learnng, Imitation Learning and Human-level control through deep reinforcement learning

    Presented by Saptarashmi Bandyopadhyay.

    Readings: Reinforcement Learning An Introduction by Richard S. Sutton and Andrew G. Barto [4] Chapters 1, 2 and 3
    A Course in Machine Learning by Haul Daumé III [5] Chapter 18
    Human-level control through deep reinforcement learning by Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg & Demis Hassabis [6]
    Atari 2600 game console in the paper The arcade learning environment: An evaluation platform for general agents by Marc G. Bellemare, Yavar Naddaf, Joel Veness amd Michael Bowling [7]

    Find the presentation here.

  • Overview of Deep Reinforcement Learning in a Natural Language Action Space

    Presented by Saptarashmi Bandyopadhyay.

    Readings: Deep Reinforcement Learning with a Natural Language Action Space by Ji He, Jianshu Chen, Xiaodong He, Jianfeng Gao, Lihong Li, Li Deng and Mari Ostendorf, University of Washington, Seattle, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 1621–1630, Berlin, Germany, August 7-12, 2016 [10] Language Understanding for Text-based Games using Deep Reinforcement Learning by Karthik Narasimhan, Tejas D Kulkarni and Regina Barzilay, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp 1–11, Lisbon, Portugal, 17-21 September 2015 [11]

    Find the presentation here.

  • (D)DPG:(Deep) Deterministic Policy Gradients Network for Continuous Action Space

    Presented by Yanjun Gao.

    Readings: Silver, David, et al. "Deterministic policy gradient algorithms." ICML. 2014. [19] Lillicrap, Timothy P., et al. "Continuous Control with Deep Reinforcement Learning."  (2015). [20]

    Find the presentation here.

  • DRL in Parameterized Action Space

    Presented by Yanjun Gao.

    Readings: Hausknecht, Matthew, and Peter Stone. "Deep reinforcement learning in parameterized action space." ICLR 2016. [21]

    Find the presentation here.

  • Gaussian process Reinforcment learning for Dialog Management.

    Presented by Maryam Zare.

    Readings: M. Gasic, N. Mrksic, L. Rojas-Barahona, P.-H. Su, S. Ultes, D. Vandyke, T.-H. Wen and S. Young . "Dialogue manager domain adaptation using Gaussian process reinforcement learning" Computer Speech and Language. (2017). [18]

    Find the presentation here

  • Double Q-Learning

    Presented by Saptarashmi Bandyopadhyay

    Readings: Deep Reinforcement Learning with Double Q-Learning by Hado van Hasselt , Arthur Guez, and David Silver, Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16),2016 [13].

    Find the presentation here.

  • Gaussian process Reinforcment learning for Dialog Management in More Detail.

    Presented by Maryam Zare.

    Readings: Gašić, Milica, and Steve Young. "Gaussian processes for pomdp-based dialogue manager optimization." IEEE/ACM Transactions on Audio, Speech, and Language Processing(2014) [17]

    Find the presentation here

  • Dueling Network Architectures for Deep Reinforcement Learning

    Presented by Saptarashmi Bandyopadhyay

    Readings: Dueling Network Architectures for Deep Reinforcement Learning by Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, and Nando de Freitas, Proceedings of the 33rd International Conference on Machine Learning New York, NY, USA, 2016. JMLR: W&CP volume 48. [14].

    Find the presentation here.

  • (Asynchronous) Actor Advantage Critics Reinforcement Learning: A2C and A3C

    Presented by Yanjun Gao.

    Readings: Asynchronous Methods for Deep Reinforcement Learning Mnih, Volodymyr, et al. International conference on machine learning. 2016. [15]

    Find the presentation here

  • Toward Diverse Text Generation with Inverse Reinforcement Learning

    Presented by Yanjun Gao.

    Readings: Toward Diverse Text Generation with Inverse Reinforcement Learning Shi, Zhan, et al. Proceedings of the 27th International Joint Conference on Artificial Intelligence. AAAI Press, 2018.[16]

    Find the presentation here.


More Reinforcement Learning Recourses


References

  1. Ronald J. Williams. 1992. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning. Mach. Learn. 8, 3-4 (May 1992), 229-256. DOI: https://doi.org/10.1007/BF00992696
  2. Ranzato, M. Chopra, S. Auli, M. and Zaremba, W. 2015. Sequence level training with recurrent neural networks. CoRR abs/1511.06732.
  3. Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D. & Riedmiller, M. (2013). Playing Atari with Deep Reinforcement Learning (cite arxiv:1312.5602Comment: NIPS Deep Learning Workshop 2013)
  4. Reinforcement Learning An Introduction by Richard S. Sutton and Andrew G. Barto, Second Edition, 2018, MIT Press
  5. ciml.info/
  6. Human-level control through deep reinforcement learning by V Mnih, K Kavukcuoglu, D Silver, A Rusu, J Veness, M G Bellemare, A Graves, M Riedmiller, A Fidjeland, G Ostrovski, S Petersen, C Beattie, A Sadik, I Antonoglou, H King, D Kumaran, D Wierstra, S Legg, D Hassabis, Nature journal, Volume 545, pp. 529-533, February 2015, DOI:10.1038/nature14236
  7. The arcade learning environment: An evaluation platform for general agents by Marc G. Bellemare, Yavar Naddaf, Joel Veness, Michael Bowling, Journal of Artificial Intelligence Research 47, pages 253-279, DOI: 10.1613/jair.3912 arXiv:1207.4708
  8. Silver, David, et al. "Deterministic policy gradient algorithms." ICML. 2014.
  9. Lillicrap, Timothy P., et al. "Continuous Control with Deep Reinforcement Learning."  (2015).
  10. Hausknecht, Matthew, and Peter Stone. "Deep reinforcement learning in parameterized action space." ICLR 2016.
  11. Deep Reinforcement Learning with a Natural Language Action Space by Ji He, Jianshu Chen, Xiaodong He, Jianfeng Gao, Lihong Li, Li Deng and Mari Ostendorf, University of Washington, Seattle, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 1621–1630, Berlin, Germany, August 7-12, 2016.
  12. Language Understanding for Text-based Games using Deep Reinforcement Learning by Karthik Narasimhan, Tejas D Kulkarni and Regina Barzilay, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp 1–11, Lisbon, Portugal, 17-21 September 2015
  13. Deep Reinforcement Learning with Double Q-Learning by Hado van Hasselt , Arthur Guez, and David Silver, Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16),2016.
  14. Dueling Network Architectures for Deep Reinforcement Learning by Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, and Nando de Freitas, Proceedings of the 33rd International Conference on Machine Learning New York, NY, USA, 2016. JMLR: W&CP volume 48.
  15. Mnih, Volodymyr, et al. "Asynchronous methods for deep reinforcement learning." International conference on machine learning. 2016.
  16. Shi, Zhan, et al. "Toward diverse text generation with inverse reinforcement learning." Proceedings of the 27th International Joint Conference on Artificial Intelligence. AAAI Press, 2018.
  17. Gašić, Milica, and Steve Young. (http://mi.eng.cam.ac.uk/~sjy/papers/gayo14.pdf) IEEE/ACM Transactions on Audio, Speech, and Language Processing(2014) 18.M. Gasic, N. Mrksic, L. Rojas-Barahona, P.-H. Su, S. Ultes, D. Vandyke, T.-H. Wen and S. Young . "[Dialogue manager domain adaptation using Gaussian process reinforcement learning]" Computer Speech and Language. (2017).
  18. Silver, David, et al. "Deterministic policy gradient algorithms." ICML. 2014.
  19. Lillicrap, Timothy P., et al. "Continuous Control with Deep Reinforcement Learning."  (2015).
  20. Hausknecht, Matthew, and Peter Stone. "Deep reinforcement learning in parameterized action space." ICLR 2016.

rl-reading-group-fall18-'s People

Contributors

serenayj avatar saptab avatar mry94 avatar kzafeiroudi avatar

Stargazers

 avatar

Watchers

James Cloos avatar  avatar  avatar paper2code - bot avatar

Forkers

serenayj

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.