Jump to content

Markov decision process

From Simple English Wikipedia, the free encyclopedia
Revision as of 04:28, 10 November 2021 by MathXplore (talk | changes) (added Category:Probability theory using HotCat)

A Markov decision process is a method for optimizing decision making over time in a step-by-step manner in situations where the outcomes of the decisions are partially random and partially determined by the decisions.