Journal Press India®

A New Model of Reinforcement Learning, Algorithms

Vol 8, Issue 4, October-December 2020 | Pages: 48-51 | Research Paper

https://doi.org/10.51976/ijari.842009



Author Details (* denotes corresponding author)

1. * Vijay Bhandari, Department of Computer Science, SIRTS BHOPAL, India (vijayhomee@gmail.com)
2. Arpana Bhandari, Department of Computer Science, SIRTS BHOPAL, India (arpanabhandari08@gmail.com)
3. Ritu Srivastava, Dean, SIRTS BHOPAL, India
4. Kapil Chaturvedi, Professor, SIRTS BHOPAL, India

Reinforcement learning (RL) does not need prior knowledge; it can autonomously obtain an optimal policy from knowledge gained by trial and error and by continuous interaction with a dynamic environment. Its self-improving and online-learning characteristics make reinforcement learning one of the core technologies of intelligent agents. In this article, we first review the model and theory of reinforcement learning. Then we present the main reinforcement learning algorithms, including Sarsa, temporal difference, Q-learning, and function approximation. Finally, we briefly introduce some applications of reinforcement learning and point out some future research directions.

Keywords

Reinforcement Learning; SARSA; temporal difference; Q-learning; function approximation
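
As an illustration of two of the algorithms named in the abstract, the following is a minimal sketch of the standard tabular Sarsa and Q-learning temporal-difference updates. It is not taken from the paper: the function names, the list-of-lists Q-table, and the default step size `alpha` and discount factor `gamma` are assumptions chosen for the example.

```python
from typing import List

# Hypothetical representation: q[state][action] holds the current value estimate.
QTable = List[List[float]]


def make_q(n_states: int, n_actions: int) -> QTable:
    """Create a Q-table initialised to zero (an assumed starting point)."""
    return [[0.0] * n_actions for _ in range(n_states)]


def sarsa_update(q: QTable, s: int, a: int, r: float, s_next: int, a_next: int,
                 alpha: float = 0.1, gamma: float = 0.9) -> None:
    """On-policy TD update: bootstrap from the action actually chosen next."""
    td_target = r + gamma * q[s_next][a_next]
    q[s][a] += alpha * (td_target - q[s][a])


def q_learning_update(q: QTable, s: int, a: int, r: float, s_next: int,
                      alpha: float = 0.1, gamma: float = 0.9) -> None:
    """Off-policy TD update: bootstrap from the greedy action in the next state."""
    td_target = r + gamma * max(q[s_next])
    q[s][a] += alpha * (td_target - q[s][a])
```

The only difference between the two updates is the bootstrap term: Sarsa uses the value of the next action the agent actually takes, while Q-learning uses the maximum value over next actions regardless of what the agent does.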


