Jump to content

User:TTencoder/ReinforcementLearning

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by TTencoder (talk | contribs) at 02:52, 17 June 2025 (Created page with ' == Tools == * Cursor * GPT-4o * OpenAI o3 * Claude (language model) * Ableton Live * Microsoft * Nvidia * AMD * Intel == Research Interests == === Artificial Intelligence === * Machine learning ** Reinforcement learning *** Q-learning *** Temporal difference learning *** Monte Carlo method *** Monte Carlo tree search *** Markov decision process ** Supervised learning...'). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)