
Talk:Artificial intelligence in Wikimedia projects

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Bluerasberry (talk | contribs) at 21:28, 3 December 2018 (Bias study: new section). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
WikiProject Computing: Start-class, Low-importance
This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article has been rated as Start-class on Wikipedia's content assessment scale.
This article has been rated as Low-importance on the project's importance scale.
WikiProject Wikipedia: Start-class, Mid-importance
This article is within the scope of WikiProject Wikipedia, a collaborative effort to improve Wikipedia's encyclopedic coverage of itself. If you would like to participate, please visit the project page. Please remember to avoid self-references and maintain a neutral point of view, even on topics relating to Wikipedia.
This article has been rated as Start-class on Wikipedia's content assessment scale.
This article has been rated as Mid-importance on the project's importance scale.

Undue weight

I am publishing this draft with two sections: AI for Wiki and Wiki for AI. I gave more weight to the AI for Wiki section only because that is the concept which has the attention of the popular press and popular science discussion.

The weight of the sources, however, is not in the popular press but in the academic literature. There are many more sources discussing the use of Wikimedia projects for off-wiki AI projects than there are sources and projects applying AI to develop Wikimedia projects. I wanted this article to be accessible to the first wave of readers, whom I expect will want to read this content and sort out their thoughts about it.

Wiki for AI is much bigger now, and for the foreseeable future either Wikipedia content will be the basis of AI research, or AI research will be based on Wikimedia content branded and further developed as some next-generation dataset.

I would like to see this article rewritten to identify whatever review articles exist and to give fair weight to the topics which papers examine most deeply. Blue Rasberry (talk) 17:04, 24 August 2018 (UTC)

Wikipedia as a data set

  • Mehdi, Mohamad; Okoli, Chitu; Mesgari, Mostafa; Nielsen, Finn Årup; Lanamäki, Arto (March 2017). "Excavating the mother lode of human-generated text: A systematic review of research that uses the wikipedia corpus". Information Processing & Management. 53 (2): 505–529. doi:10.1016/j.ipm.2016.07.003.

This article identified 132 other academic articles which describe how they used Wikipedia as a data set at the base of other research. As I start this article, there is a subsection on how artificial intelligence projects use Wikimedia content in their development. That subsection could be split off into its own article, and that split article could itself be split into other articles. One such possible article could be something like "Wikipedia as a data set", because this article and the 132 it identifies are reliable source material for developing this as an independent concept. Blue Rasberry (talk) 16:58, 24 August 2018 (UTC)

"Artificial intelligence" as a buzzword

"Artificial intelligence" is a term with many meanings in various fields. Perhaps most of the sources which talk about artificial intelligence in Wikimedia are talking about machine learning, which is often used as another name for artificial intelligence. Blue Rasberry (talk) 17:12, 24 August 2018 (UTC)

Bias study

No research has been reported from this yet, but here is a 2018 research project analyzing bias in Wikimedia data structuring.

The PIs on this, Brent Hecht and Loren Terveen, seem to comment on Wikidata.

Blue Rasberry (talk) 21:28, 3 December 2018 (UTC)