Display title | Draft:Group Relative Policy Optimization |
Default sort key | Group Relative Policy Optimization |
Page length (in bytes) | 4,227 |
Namespace ID | 118 |
Namespace | Draft |
Page ID | 80409603 |
Page content language | en - English |
Page content model | wikitext |
Indexing by robots | Disallowed |
Number of page watchers | Fewer than 30 watchers |
Number of redirects to this page | 1 |
Number of subpages of this page | 0 (0 redirects; 0 non-redirects) |
Local description | Reinforcement learning algorithm that eliminates the need for a critic network |
Page views in the past 30 days | |
Edit | Allow all users (no expiry set) |
Move | Allow all users (no expiry set) |
Page creator | Flynyeguy (talk | contribs) |
Date of page creation | 16:54, 10 July 2025 |
Latest editor | Gheus (talk | contribs) |
Date of latest edit | 02:18, 13 July 2025 |
Total number of edits | 9 |
Recent number of edits (within past 30 days) | 9 |
Recent number of distinct authors | 6 |
Magic words (2) | - __NOINDEX__
- __NONEWSECTIONLINK__
|
Hidden categories (3) | This page is a member of 3 hidden categories (help):
|
Transcluded templates (76) | Pages transcluded onto the current version of this page (help):
|