Fill inwikidata:Property:P13273, Letterboxd Studio ID. Letterboxd's film data is apparently imported from TMDB; however, Letterboxd has content of its own that makes it a useful resource worth linking here. Letterboxd studio IDs are mostly generated from the company name, punctuation- and accent-stripped, lowercased, with words separated by hyphens; but some are different, having, for example, "-1" or "-2" at the end, for reasons yet to be determined.
See query onwikidata:User:The Anome. So far 222 of a possible 5000+ film production companies have Letterboxd studio IDs assigned. — 28 March 2025
Special case:{{infobox mapframe}} that contains a{{WikidataCoord}} template with a non-blank 'display=' parameter results in a page with no geocoordinate link. Hunt these down and fix them.
Disambiguate references toLaw and order, distinguishing the preservation of civil order and the rule of law from the political ideology of "laura norder". Now down to less than 100.
the Stiletto Fads, successor to the Stilletoshttps://bedfordandbowery.com/2012/07/nightclubbing-stiletto-fads/ -- Quote: "Elda transformed the Stillettos into the Stilletto Fads, with Tish and Snooky Bellomo as back up singers." merge to The Stilettos, as a lineup change?
Can we add some of the images of Elda Gentile to Commons, per theCommons:Hirtle chart, as PD-US workseither published without a copyright notice in the period 1928 through 1977,or published without notice, and without subsequent registration within 5 years, in the period 1978 through 28 February 1989?
or, perThe Rialto Report, would it be possible to persuade one of the photographers (Meryl Meisler,Allan Tannenbaum,Bob Gruen, andRoberta Bayley) to relase one of their images under a Commons-compatible license?
ModifyTemplate:Taxobox/core to detect if images are completely missing (none of image, image2, image_upright, image2_upright), and to add a category "Taxobox without images", or perhaps "$ORDER taxobox without images", where $ORDER is taken from the taxonomic hierarchy.
Articles onGold Star Publications, wrapping all their magazines up within the same article, since they were highly notable for their large porn magazine stable, but the individual magazines seemingly not. This would thus complete at least the 60's-2010's paper magazine era of the British porn series of articles. Done
Rewrite the Anomebot's internal API access library to use Python's "requests" HTTP library under the hood, to allow the use of persistent HTTPS connections
Fix bot exclusion stuff: complex spec, painful to implement -- do a simplified version that just detects the{{bot}} tag
Remove lines from visited.txt that have "no such page" or "a type article" in logfile.txt; currently there are 4000+ of these; the first is caused by draftification, the second by a now-fixed bug in the bot code finding false positives
Possible low-hanging fruit for geocoding: the following categories have thousands of non-geocoded articles that are not getting matched by my current software, and may benefit from special-purpose matching heuristics:
It looks like a lot of this might be repetition of the same location in multiple places: the bot's code gets 7000+ multi-matches for Iran
See also this paper: "Cross linguistic name matching in English and Arabic: a "one to many mapping" extension of the Levenshtein edit distance algorithm" inFreeman, A. T.; Condon, S. L.; Ackerman, C. M. (2006). "Cross linguistic name matching in English and Arabic".Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics: 471.doi:10.3115/1220835.1220895.
Note: insignificant number (< 100) of multimatches
This may be a matter of transliteration:McCune–Reischauer vs. Revised Romanization
Category:Turkey articles missing geocoordinate data (5000+ articles) -- lots of places with the same names but in different regions (eg. 17 villages all called "Akpınar"), same problem as was found with Polish placenames (also: why isAkçakoca failing to be caught?) The bot code finds 3000+ multi-matches for Turkey.
Also, this is due to non-standard naming conventions for the hierarchy of Turkish article categories: see, for exampleCategory:Ankara Province.
I've now used spatial disambiguation to resolve some 2000+ of these.
Total is over 27,000 possibles: even doing a fraction of these would make a big dent in the backlog.
-- looks like the Kelvin symbol got normalized to a letter "K", so removed it, ditto the Angstrom sign which also got normalized, and now the ohm sign, which gets normalized to a capital omega
Here is a further reduced subset of those, limited further to letter-like shapes: