This article is within the scope of WikiProject Wikipedia, a collaborative effort to improve Wikipedia's encyclopedic coverage of itself. If you would like to participate, please visit the project page. Please remember to avoid self-references and maintain a neutral point of view, even on topics relating to Wikipedia.WikipediaWikipedia:WikiProject WikipediaTemplate:WikiProject WikipediaWikipedia
This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.ComputingWikipedia:WikiProject ComputingTemplate:WikiProject ComputingComputing
Mehdi, Mohamad; Okoli, Chitu; Mesgari, Mostafa; Nielsen, Finn Årup; Lanamäki, Arto (March 2017). "Excavating the mother lode of human-generated text: A systematic review of research that uses the wikipedia corpus". Information Processing & Management. 53 (2): 505–529. doi:10.1016/j.ipm.2016.07.003.
This article identified 132 other academic articles which describe how they used Wikipedia as a data set at the base of other research. As I start this article, there is a subsection on how artificial intelligence projects uses Wikimedia content in their development. That subsection could be split off into its own article, and that split article could itself be split into other articles. One such possible article could be something like "Wikipedia as a data set", because there is this article and the 132 it identifies as reliable source material for developing this as an independent concept. Blue Rasberry (talk)16:58, 24 August 2018 (UTC)[reply]