Jump to content

Wikipedia:WikiProject AI Cleanup

From Wikipedia, the free encyclopedia
(Redirected from Wikipedia:AICLEANUP)
Main pageDiscussionNoticeboardGuideResourcesPoliciesResearch
WikiProject
AI Cleanup

Welcome to WikiProject AI Cleanup, a collaboration to combat the increasing problem of unsourced, poorly written AI-generated content on Wikipedia. If you would like to help, add yourself as a participant in the project, inquire on the talk page, and see the to-do list.

Goals

[edit]

Since 2022, large language models (LLMs) like GPTs have become a convenient tool for writing at scale. Unfortunately, these models virtually always fail to properly source claims and often introduce errors. Essays like WP:LLM strongly encourage care in using them for editing articles. These are the project's goals:

  • To identify text written by AI, and proofread such text to make sure they follow Wikipedia's policies. Any unsourced or likely inaccurate claims should be removed.
  • To identify AI-generated images and ensure appropriate usage.
  • To help and keep track of AI-using editors who may not realize the deficiencies of AI as a writing tool.

The purpose of this project is not to restrict or ban the use of AI in articles, but to verify that its output is acceptable and constructive, and to fix or remove it otherwise.

Editing advice

[edit]
  • Tag articles with appropriate templates, remove unsourced information and warn users who add unsourced AI-generated content to articles.
  • Articles that are clearly entirely LLM-generated pages without human review can be nominated for speedy deletion under WP:G15.
  • Identifying AI-assisted edits is difficult in most cases since the generated text is often indistinguishable from human text. The signs of AI writing page provides a list of characteristics that are associated with text generated by AI chatbots.
    • If the text contains phrases like "as an AI model" or "as of my last knowledge update", or if the editor copy-pasted the prompt used to generate the text together with the AI response, the text is almost certainly AI-generated.
    • Other indications include the presence of fake references or other obvious AI hallucinations. AI content sometimes takes a promotional tone, reading like a tourism website. Other times, the AI gets confused and will write about a hotel instead of a nearby village.
    • AI content detection tools like GPTZero are unreliable and should not be used as the sole means of determining whether text is AI-generated. Given the high rate of false positives, deleting or tagging content purely because it was flagged by an automatic AI detector is not acceptable.
  • When missing more precise information, AI will often describe in detail very generic and common features, praising a village for its fertile farmlands, livestock and scenic countryside despite it being in an arid mountain range.
  • AI content is not always "unsourced"—sometimes it has real sources that are unrelated to the article's topic, sometimes it creates its own fake sources, and sometimes it uses legitimate sources to create the AI content. Be careful when removing bad AI content not to remove legitimate sources, and always check the cited sources for legitimacy.
    • Example: the article Leninist historiography was entirely written by AI and previously included a list of completely fake sources in Russian and Hungarian at the bottom of the page. Google turned up no results for these sources.
    • Other example: the article Estola albosignata, about a beetle species, had paragraphs written by AI sourced to actual German and French sources. While the sourced articles were real, they were completely off-topic, with the French one discussing a completely unrelated lifeform.
  • Sometimes entire articles are AI-generated, and in such a case, make sure to check that the topic is legitimate and notable. Occasionally, WP:HOAXes have made it onto Wikipedia because AI tools can create fake citations that may appear legitimate.
    • Example: the article Amberlihisar was created in January 2023, passed articles for creation, and was not discovered to be entirely fictional until December 2023. It has since now been deleted.
  • Text that was present in an article before November 30, 2022 (the release date of ChatGPT) is very unlikely to be AI-generated.

Open tasks

[edit]

See Category:Articles containing suspected AI-generated texts for all articles that have been tagged as possibly {{AI-generated}}. The tasks page recommends ways to handle articles, talk page discussions, and sources that use AI-generated content.

Participants

[edit]

Primary contacts: Chaotıċ Enby (talk · contribs) • 3df (talk)  • Queen of Heartstalk

Feel free to add yourself here!

Resources

[edit]

Essays

[edit]

Information

[edit]

Relevant discussions

[edit]

These threads may be useful for editors seeking information about how AI has previously been handled on Wikipedia.

Want to update this table? Try using the visual editor to edit this page.

Date Type Page Discussion Conclusion/Notes
Dec 2022 Wikipedia:Village pump (policy) Wikipedia response to chatbot-generated content
Feb 2023 Wikipedia:Village pump (idea lab) OpenAI and ChatGPT Disclosure suggested
Mar 2023 Wikipedia:Village pump (idea lab) Adding LLM edit tag Impractical with current technology
Jun 2023 Wikipedia:Village pump (miscellaneous) GPT-4 user-created template at top of page
Oct 2023 RfC Wikipedia talk:Large language models RfC: Is this proposal ready to be promoted? Overwhelming consensus to not promote.
Oct 2023 Wikipedia:Village pump (idea lab) Project Res-Up About using AI to increase resolution on old photos
Nov 2023 Wikipedia:Village pump (proposals) Scoring for Wikipedia type Articles Generated by LLM External research project hoping to recruit Wikipedia editors for off-wiki feedback (not editing here)
Jan 2024 RfC Wikipedia talk:Large language model policy RFC No consensus to adopt any wording as either a policy or guideline at this time.
Jan 2024 Wikipedia:Village pump (idea lab) Can Wikipedia Provide An AI Tool To Evaluate News and Information on the Internet
Jan 2024 Wikipedia:Village pump (idea lab) Use of ChatGPT and other LLMs specifically for medical and scientific content For text, not photos
Feb 2024 Wikipedia:Village pump (idea lab) Have a way to prevent "hallucinated" AI-generated citations in articles Goal supported in theory
Mar 2024 Wikipedia:Village pump (technical) AI helper Tool idea for creating articles
Mar 2024 Wikipedia:Village pump (technical) What if we had an AI to suggest edits along the lines of edits typically made by good editors? Tool idea for smaller edits
Mar 2024 Wikipedia:Village pump (proposals) AI for WP guidelines/ policies AI-based search of Wikipedia's ruleset
May 2024 Wikipedia:Village pump (idea lab) Another job aid proposal, this time with AI
Aug 2024 Wikipedia:Village pump (proposals) Proposal: Create quizzes on Wikipedia AI not seen as integral to the idea
Oct 2024 Wikipedia:Village pump (miscellaneous) Feedback on chatbots as valid sources, or identifiers of them
Oct 2024 Module talk:Find sources Chatbots as valid sources or identifiers of them Not supported at this time
Nov 2024 Wikipedia:Village pump (proposals) Add AI translation option for translating from English to non-English article. Off topic, as we don't decide what happens to other Wikipedias
Nov 2024 Wikipedia:Village pump (idea lab) Wiki AI? Request for a chatbot
Dec 2024 RfC Wikipedia:Village pump (policy) LLM/chatbot comments in discussions Consensus that "it is within admins' and closers' discretion to discount, strike, or collapse obvious use of generative LLMs" (Now in guideline: WP:AITALK)
Jan 2025 Wikipedia:Village pump (proposals) The use of AI-generated content Proposed rule accepting LLMs for translation and grammar but not on talk pages; not accepted
Jan 2025 RfC Wikipedia:Requests for comment/AI images BLPs Clear consensus against using AI-generated imagery to depict BLP subjects. (Now in policy: WP:AIIMGBLP)
Jan 2025 Wikipedia:Village pump (policy) Adding the undisclosed use of AI to post a wall of text into discussions as disruptive editing Not inherently disruptive, but can be disruptive
Feb 2025 Wikipedia:Village pump (policy) The real use case for AI on Wikipedia Ideas for copyediting and grammar fixes
Mar 2025 Wikipedia:Village pump (policy) URLs with utm_source=chatgpt.com codes
Apr 2025 RfC Wikipedia:Requests for comment/AI images Relist with broader question: Ban all AI images? "Most images wholly generated by AI should not be used." "Obvious exceptions include articles about AI, and articles about notable AI-generated images. The community objects particularly strongly to AI-generated images (1) of named people, and (2) in technical or scientific subjects such as anatomy and chemistry." (Now in policy: WP:AIIMAGES)
Jun 2025 RfC Wikipedia:Village pump (WMF) RfC: Adopting a community position on WMF AI development Pending closure
Jun 2025 Wikipedia:Village pump (technical) Simple summaries: editor survey and 2-week mobile study The WMF announced that machine-generated summaries of articles would be presented to readers, but then put the project on hold in response to negative community feedback.
Jul 2025 RfC Wikipedia talk:Speedy deletion RFC: New CSD for unreviewed LLM content Overwhelming consensus to adopt new speedy deletion criterion (Now in policy: WP:G15)
Aug 2025 RfC Wikipedia talk:Speedy deletion RfC: Including Markdown in G15 Consensus against including Markdown in G15 as it is not consistently an indicator of LLM-generated content.
Aug 2025 RfC Wikipedia talk:Speedy deletion RfC: Including emojis in G15 There was no consensus to adopt this criterion.
Aug 2025 Wikipedia:Village_pump_(policy) LLM/AI generated proposals? Discussion in progress
Sep 2025 Wikipedia:Village pump (idea lab) AI Moderator proposal Idea to augment existing edit filters/recent changes patrolling with an LLM, inspired by a Reddit extension
Sep 2025 Wikipedia:Village_pump_(policy) What is Wikipedia’s official stance on Ai-generated content Discussion archived
Oct 2025 Wikipedia:Village_pump_(idea lab) Add a bot/policy that bans AI edits from non-extended confirmed users Discussion in progress
Oct 2025 RfC Wikipedia:Writing articles with large language models Wikipedia talk:Writing articles with large language models § RfC Discussion in progress

Project resources

[edit]