Talk:Reflection (artificial intelligence)

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Hplotter (talk | contribs) at 16:48, 5 February 2025. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
WikiProject Artificial Intelligence
This article is within the scope of WikiProject Artificial Intelligence, a collaborative effort to improve the coverage of artificial intelligence on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.

How you can contribute

  • Should the material transferred from Prompt Engineering be shortened or supplemented?
  • The history section could be expanded, ideally incorporating the achievements of external teams (both foundational developments and agent-based systems like DeepClaude).
  • A list of benchmarks and their results should be included.

TheTeslak (talk) 11:04, 5 February 2025 (UTC)

Thanks for leading this. As of now, it needs more references (e.g. the Coconut and R1 papers) and a mention of reinforcement learning applied to reasoning steps. Hplotter (talk) 16:34, 5 February 2025 (UTC)