This is a (rather incomplete) history ofWikipediabots that have contributed substantial amounts of encyclopedic material in an automated fashion.
Several hundred articles fromFederal Standard 1037C, a glossary of telecommunications terms, were imported in February 2002. The articles were pre-filtered to remove short entries and entries that were from non-public-domain sources. They were then auto-wikified. Only articles with titles that did not already exist on Wikipedia were imported.
In August–October 2001, and again in August 2002, contributors imported quite a few entries from the 100-year-oldEaston's Bible Dictionary (EBD).IP 216.99.203.xxx operated byAmillar uploaded the EDB text of 125 articles, starting withAbner.IP 213.253.39.188, operated byNeil Harris, created 130 articles on 7 August 2002, starting withÆnon.IP 217.158.106.111 created 52 articles on 7 August 2002.
These imports caused some discussion because entries werebiased, written in a pedanticVictorian prose, out of date, and some were incorrectly wikified (self links or multiple links). The issues wereworked out with the bot's creator on theWikipedia mailing list (see[1] and search for Easton's) and varioustalk pages, and new contributor guidelines were also revised accordingly. SeeWikipedia:Easton's Dictionary topics for the ongoing project, andthis list of articles which were not imported.
The same bot that uploaded theEaston Bible Dictionary material was later used to upload the first major wave of selected articles fromProject Gutenberg's copy of the1911Encyclopedia Britannica volume 1.[when?]
IP 62.30.112.2 started uploading on 18 August 2002, starting withHippolytus.
Because thedecade entries are highly structured, a bot was used to load templates for many of them.
(October 18,2002 –October 26,2002):rambot, operated byRam-Man, created approximately 30,000 U.S. city articles based onU.S. Census tables, at a rate of thousands of articles per day.[2] One error in the data set being used caused a corruption of around 2,000 new articles. Another incident caused theRecent Changes to become more difficult to use for a time and prompted much discussion and the creation of a Wikipedia bot policy.[1]
Theanomebot was written in 2003 for the purpose of uploading approximately 5,000 map images to illustrate the U.S. counties articles.
Rob Hooft's bot was developed on the Dutch Wikipedia in Summer 2003. Although other applications were developed first (year templates), its main use became doing interlanguage links, first at the Dutch Wikipedia, later also in other languages (Danish from August, French from November and many other languages as well). In early October, it was first used on the English Wikipedia, using usernameRobbot and helping in resolving links to disambiguation pages.
In 2008, a bot (ClueBot II) created thousands of tiny articles about asteroids usingNASA data.[2][3] A large percentage of these have subsequently been deleted asnon-notable and redirected to a table.[4]
Concurrently with the Rambot, some 800 stub articles on Dutch (500) and Swedish (300) municipalities were created by a bot owned byJeronimo. These appear under IP 131.155.230.xxx.
On the Italian Wikipedia, city articles have been uploaded byGacbot. This bot has started been used in 2004 and it was used to upload municipalities of Italy, Poland, Botswana, United States, Germany, Finland, Portugal and partially France; the municipalities of France have been edited in a semi-automatic way byUser:Gac and uploaded byUser:Gac andUser:Paginazero.[5]
The municipalities of China have been uploaded on Wikidata; data is accessible on several Wikipedias by searching for 郭坑镇 or it can be accessed throughReasonator.
In 2013,user:Lsjbot had created about 454,000 articles on Swedish Wikipedia (about half of the project), using data collected from different sources.[3] The bot first wrote articles about species of animals and plants. It has also been used onCebuano Wikipedia (ceb:Gumagamit:Lsjbot),Dutch Wikipedia (nl:Gebruiker:Lsjbot), andWaray Wikipedia (war:Gumaramit:Lsjbot).
Qbugbot, run by user Edibobb, has created a large number of insect-related articles.
The above two are a truly fascinating read if you're into bot history and social norms related to bots on Wikipedia. They mostly cover 2009–2014ish. The thesis is in French, but the appendix also contains a free copy of the English summary/prior research published inComputers in Human Behavior.
In 2008, another bot created thousands of tiny articles about asteroids, pulling a few items of data for each one from an online Nasa database.
In 2008—almost prehistory, by Wikipedia bot standards – an algorithm called ClueBot II 'wrote' 15,000 articles on asteroids, by parsing and rewriting public data from NASA's database. Those articles sat there, being edited by other bots—one changed the tags, another linked to the Japanese version, a third corrected a style guide issue—untilan actual human realised that having 'an out of date, broken, copy of the NASA website' wasn't the best way to run an encyclopaedia. In 2012, the creation was finally undone, and today, all of Cluebot's work lives in one 'list of minor planets'.