Jump to content

Markov decision process

From Simple English Wikipedia, the free encyclopedia
Revision as of 04:28, 10 November 2021 by MathXplore (talk | changes) (Added {{math-stub}} tag to article (TW))
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

A Markov decision process is a method for optimizing decision making over time in a step-by-step manner in situations where the outcomes of the decisions are partially random and partially determined by the decisions.