W2_L2: Markov decision process (MDP)

Welcome to Week 2 Lecture 2 of the course "Special topics in ML (Reinforcement Learning)" by Prof. Balaraman Ravindran. Full Course: https://study.iitm.ac.in/ds/course_pa... Video Overview This lecture formalises the core components of reinforcement learning using the Markov Decision Process (MDP) framework. It explains the agent–environment interaction, defines states, actions, and rewards, introduces transition probabilities and expected rewards, and clarifies the Markov property. The session also discusses the concept of a policy, which governs how an agent behaves in different states. About IIT Madras' online Bachelor of Science programme IIT Madras offers four-year BS programmes that aim to provide quality education to all, irrespective of age, educational background, or location. The BS programme has multiple levels, which provide flexibility to students to exit at any of these levels. Depending on the courses completed and credits earned, the learner can receive a Foundation Certificate from IITM CODE (Centre for Outreach and Digital Education), Diploma(s) from IIT Madras, or BSc/BS Degrees from IIT Madras. For more details, Visit: https://www.iitm.ac.in/academics/stud... #reinforcementlearning #mdp #markovdecisionprocess #markovproperty #statesactionsrewards #rlformulation #machinelearning #iitmadrasbs