This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "GeoNames" – news ·newspapers ·books ·scholar ·JSTOR(March 2013) (Learn how and when to remove this message) |
GeoNames (orGeoNames.org) is a user-editablegeographical database available and accessible through variousweb services, under aCreative Commons attribution license. The project was founded in late 2005.[1]
The GeoNames dataset differs from, but includes data from,[2] the US Government's similarly namedGEOnet Names Server.
The GeoNames database contains over 25,000,000geographical names corresponding to over 11,800,000 unique features.[3] All features are categorized into one of nine feature classes and further subcategorized into one of 645 feature codes. Beyond names of places in various languages, data stored includelatitude,longitude,elevation, population, administrative subdivision andpostal codes. Allcoordinates use the WorldGeodetic System 1984 (WGS84).
Those data are accessible free of charge through a number of Web services and a daily database export.[4]
The core of the GeoNames database is derived from official public sources, whose quality may vary. Through awiki interface, users can manually edit and enhance the database by correcting names, updating locations, adding new features, and refining existing entries[5]
Each GeoNames feature is represented as aweb resource identified by a stableURI. This URI provides access, throughcontent negotiation, either to the HTML wiki page, or to aRDF description of the feature, using elements of the GeoNamesontology.[6] This ontology describes the GeoNames features properties using theWeb Ontology Language, the feature classes and codes being described in theSKOS language.Through Wikipedia articles URL linked in the RDF descriptions, GeoNames data are linked toDBpedia data and other RDFLinked Data.
As in othercrowdsourcing schemes, GeoNames edit interface allows everyone to sign in and edit the database, hence false information can be entered and such information can remain undetected especially for places that are not accessed frequently.Ahlers (2013) studies these inaccuracies and classifies them into loss in the granularity of coordinates (e.g., due to truncation and low-resolution geocoding in some cases), wrong feature codes, near-identical places, and the placement of places outside their designated countries.Manually correcting these inaccuracies is both tedious and error-prone (due to the database size) and may require experts.
The literature provides very few works on automatically resolving them.Singh & Rafiei (2018) study the problem of automatically detecting the scope of locations in a geographical database and its applications in identifying inconsistencies and improving the quality of the database.Computing the boundary information can help detect inconsistencies such as near-identical places and the placement of locations such as cities under wrong parents such as provinces or countries. Singh and Rafiei show that the boundary information derived in their work can move more than 20% of locations in GeoNames to better positions in the spatial hierarchy and the accuracy of those moves is over 90%.