User:TTencoder/ReinforcementLearning
From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by TTencoder at 02:52, 17 June 2025, in the edit that created the page. This revision may differ significantly from the current revision.
== Tools ==
* Cursor
* GPT-4o
* OpenAI o3
* Claude (language model)
* Ableton Live
* Microsoft
* Nvidia
* AMD
* Intel
== Research Interests ==
=== Artificial Intelligence ===
* Machine learning
** Reinforcement learning
*** Q-learning
*** Temporal difference learning
*** Monte Carlo method
*** Monte Carlo tree search
*** Markov decision process
** Supervised learning
Unsupervised learning
Neural networks
Deep learning
Convolutional neural networks
Optimal control
Kalman filter
Model predictive control
Bellman equation
Bellman pseudospectral method
Stochastic control
Proportional–integral–derivative controller
Brachistochrone curve
Generalized filtering