Talk:Deep reinforcement learning

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Cewbot (talk | contribs) at 13:17, 13 February 2024 (Maintain {{WPBS}}: 4 WikiProject templates. Keep majority rating "Stub" in {{WPBS}}. Remove 2 same ratings as {{WPBS}} in {{WikiProject Articles for creation}}, {{WikiProject Computer science}}. Keep 2 different ratings in {{WikiProject Science}}, {{WikiProject Engineering}}.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

make an orange character

give him legs, arms and a head; name him John; make John learn to walk

"Training" Section

The current "training" section is a mixture of a lot of different but very specific topics. It would make more sense to have it be an overview of deep RL algorithms, and then have a separate section on broad research directions that are being investigated: off-policy RL, inverse RL, meta-RL, goal-conditioned RL. Happy to do this myself if there is agreement. Anair13 (talk) 20:36, 24 November 2020 (UTC)