Wikipedia:Articles for deletion/Waluigi effect (2nd nomination)

The following discussion is an archived debate of the proposed deletion of the article below. Please do not modify it. Subsequent comments should be made on the appropriate discussion page (such as the article's talk page or in a deletion review). No further edits should be made to this page.‎

The result was merge to AI alignment. $Salvio giuliano$ 12:14, 24 October 2025 (UTC)[reply]

Waluigi effect

AfDs for this article:

[Hide this box] New to Articles for deletion (AfD)? Read these primers!

(Find sources: Google (books · news · scholar · free images · WP refs) · FENS · JSTOR · TWL)

Re-nomination. The topic appears to be a neologism with limited uptake in reliable secondary sources, fails WP:NEO and WP:N. Coverage is primarily primary/derivative commentary and blogs rather than significant independent sources, so it does not meet WP:GNG or WP:RS. If independent, in-depth sourcing exists, it has not been demonstrated in the article.

There is no indication of notability for the term "Waluigi effect" per WP:GNG or WP:FRINGE. The term is not recognized in reliable sources as a phenomenon in large language models (LLMs) or artificial intelligence (AI). Per WP:RS, it appears to have originated from a personal blog post by a crypto blogger on Substack in 2023^[1] and was subsequently discussed on LessWrong and in some online forums. While a conference paper cites LessWrong as a source, there is no significant coverage in independent, reliable, secondary sources. The term is not established within the AI or machine learning field. Research on related topics, such as "sleeper agents" in AI, has been published by frontier labs under different terminology^[2]. The "Waluigi effect" does not meet Wikipedia's notability or reliable sourcing standards and is not an accepted term of art in the field. 0xReflektor (talk) 06:28, 10 October 2025 (UTC)[reply]

Comment: Sources do not sufficiently support the name of the effect. If the article is merged, AI alignment makes sense as a target. Sushidude21! (talk) 06:04, 16 October 2025 (UTC)[reply]

References

Note: This discussion has been included in the deletion sorting lists for the following topics: Computing and Internet. 0xReflektor (talk) 06:28, 10 October 2025 (UTC)[reply]
Merge to Waluigi per WP:NOPAGE. It's not up to Wikipedia to say that citing a blog post means something is invalid if it's done by RS. On the other hand, this doesn't seem to pass the bar for requiring its own page entirely. ᴢxᴄᴠʙɴᴍ (ᴛ) 11:51, 10 October 2025 (UTC)[reply]

Waluigi is the Mario (franchise) character this effect gets its name from. While the AI Waluigi effect has an etymological connection to the character, it could be tricky to merge such disparate content. —A. B. ^{(talk • contribs • global count)} 15:10, 10 October 2025 (UTC)[reply]

Well it's the only target that makes sense given the conscious decision to name it after Waluigi. ᴢxᴄᴠʙɴᴍ (ᴛ) 18:20, 10 October 2025 (UTC)[reply]

No, the article is not about Waluigi, at all... Merko (talk) 17:57, 20 October 2025 (UTC)[reply]

Comment Waluigi comes from "warui" which is equivalent to "bad" + "Luigi". The effect name seems too forced. – The Grid (talk) 12:30, 10 October 2025 (UTC)[reply]

Query - @ 0xReflektor, what about Google Scholar results? I see several hits that look peer-reviewed. Also, if it’s talked about on LessWrong by the experts there, that’s a tipoff this is a real thing (even if LessWrong does not meet our reliable sources requirements). —A. B. ^{(talk • contribs • global count)} 14:56, 10 October 2025 (UTC)[reply]
Also, what about the Fortune (magazine) article that our article cites? A. B. ^{(talk • contribs • global count)} 15:02, 10 October 2025 (UTC)[reply]

@A. B. Scholar results cite LessWrong, LessWrong cites the Substack article. There is a recent Arxiv preprint that attempts a framework formalization, however the authors call it an empirical phenomena then cite LessWrong, citing Substack. 0xReflektor (talk) 05:19, 11 October 2025 (UTC)[reply]

https://scholar.google.com/scholar?q=%22Trinity+Institute+of+Professional+Studies%22

https://www.researchgate.net/profile/Manikyamba-Aliveli/publication/355368095_TRINITY_INSTITUTE_OF_PROFESSIONAL_STUDIES_th/links/616d077d951b3574c65dcd30/TRINITY-INSTITUTE-OF-PROFESSIONAL-STUDIES-th.pdf#page=246 Musykhandelhi (talk) 09:20, 17 October 2025 (UTC)[reply]
Comment not sure yet. There are sources, but they don't look great so far. If merged, AI alignment makes much, much more sense as a target than Waluigi. Grayfell (talk) 20:10, 10 October 2025 (UTC)[reply]
Comment I'm not entirely certain what should be done with the article, but I agree that "AI alignment" makes more sense as a merge target than "Waluigi" - Cukie Gherkin (talk) 03:53, 11 October 2025 (UTC)[reply]

Relisted to generate a more thorough discussion and clearer consensus.
Please add new comments below this notice. Thanks,

Salvio giuliano

06:35, 17 October 2025 (UTC)[reply]

Merge, either to AI alignment per Grayfell or to Waluigi. Either is OK with me. I agree that it fails WP:NEO and possibly WP:N. Gommeh 📖 🎮 16:40, 17 October 2025 (UTC)[reply]
Merge to AI alignment per Grayfell as there does not seem RS coverage on this specific topic for now. Not opposed to recreating this article if more RSs are published about this topic. I don't think the merge target for Waluigi makes as much sense given this topic is more to do with the AI than the namesake. Spy-cicle💥 ^Talk? 17:58, 20 October 2025 (UTC)[reply]
Weak keep, sources barely support its existence. If it won't be kept, it should be merged to AI alignment, not Waluigi. This article is not about the video game character. Merko (talk) 18:06, 20 October 2025 (UTC)[reply]
keep or merge I’ve seen it mentioned on these sources(https://www.wired.com/story/waluigi-effect-generative-artificial-intelligence-morality/

https://fortune.com/2023/05/27/what-is-waluigi-effect-artificial-intelligence-personal-assistant-bill-gates/

https://www.nytimes.com/2023/03/10/podcasts/hard-fork-don-beyer-tiktok.html) and it even has a Know Your Meme article now, so probs should be kept. If not should me merged Zulresso! :D (talk) 23:55, 20 October 2025 (UTC)[reply]

Keep or Merge to AI alignment. There does seem to be some mention of this in RS; I'd be more inclined to just request additional citations. Still, the phenomenon in question has everything to do with AI and nothing to do with Waluigi, so if it is to be merged, the place to put it is obvious. Endovior (talk) 22:36, 22 October 2025 (UTC)[reply]

Merged or Deleted, The concept discussed here is not a legitimate fringe belief, but rather a buzzword that originated from a blog post. It has only gained academic attention because some papers cited LessWrong as a source creating circular attribution pointing back to the blog post. By hosting this article, Wikipedia inadvertently legitimizes this concept when people search to verify its authenticity. We take greater lengths to flag fringe beliefs that are minority positions in scholarship, yet somehow seem willing to maintain this article that documents an off-hand Substack theory. 0xReflektor (talk) 05:23, 24 October 2025 (UTC)[reply]

Merge to AI alignment#Specification gaming and side effects to be specific. NenadWeber (talk) 11:07, 24 October 2025 (UTC)[reply]

The above discussion is preserved as an archive of the debate. Please do not modify it. Subsequent comments should be made on the appropriate discussion page (such as the article's talk page or in a deletion review). No further edits should be made to this page.

[coryeth-1] ttps://coryeth.substack.com/p/the-waluigi-effect

[sleeperagents-2] ttps://arxiv.org/abs/2401.05566

[1]

[2]