Talk:Deep reinforcement learning

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 92.233.167.63 (talk) at 13:48, 28 January 2024. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

make an orange character

Give him legs, arms and a head. Name him John. Make John learn to walk.

"Training" Section

The current "training" section is a mixture of a lot of different but very specific topics. It would make more sense to have it be an overview of deep RL algorithms, and then have a separate section on broad research directions that are being investigated: off-policy RL, inverse RL, meta-RL, goal-conditioned RL. Happy to do this myself if there is agreement. Anair13 (talk) 20:36, 24 November 2020 (UTC)