Seminar Topics & Project Ideas On Computer Science Electronics Electrical Mechanical Engineering Civil MBA Medicine Nursing Science Physics Mathematics Chemistry ppt pdf doc presentation downloads and Abstract

Full Version: Markov Systems with Rewards, Markov Decision Processes
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Markov Systems with Rewards, Markov Decision Processes

[attachment=30469]

Where We Are and Outline

• Planning
– Deterministic state, preconditions, effects
– Uncertainty
• Conditional planning, conformant planning, nondeterministic
• Probabilistic modeling of systems with
uncertainty and rewards
• Modeling probabilistic systems with control, i.e.,
action selection
• Reinforcement learning

Markov Systems with Rewards

• Finite set of n states, si
• Probabilistic state matrix, P, pij
• “Goal achievement” - Reward for each state, ri
• Discount factor - γ
• Process/observation:
– Assume start state si
– Receive immediate reward ri
– Move, or observe a move, randomly to a new state
according to the probability transition matrix
– Future rewards (of next state) are discounted by γ

Summary

• Markov Models with Reward
• Value iteration
• Markov Decision Process
• Value Iteration
• Policy Iteration
• Reinforcement Learning