WikiCite: creating a shared bibliographic repository for all Wikimedia projectsBibliographic data form a big and well-connected part of WikidataExample query based on such metadata:Gallery of authors of scientific articles that have been published on this day of the yearA representation of how different aspects of the WikiCite ecosystem fit togetherThe World needs citations, and citations need source metadata.The English word "citation" sung by a soprano singerMetadata on the way from a physical copy of a book intoInventaire, a Wikidata front-endVarious tools exist to assist with the entry and curation of metadata, e.g. theWikidata Edit Framework depicted here.Topics co-occurring withZika virus(Q202864)Usage history of some key Wikidata properties around bibliographic and citation data as of January 2019. Live resultshere
The aim ofWikiCite is:
to act as a hub for work in Wikidata involving citation data and bibliographic data as part of the broaderWikiCite initiative.
to define a set of properties that can be used by citations,infoboxes, andWikisource.
to map and import all relevant metadata that currently is spread across Commons, Wikipedia, and Wikisource.
establish methods to interact with this metadata from different projects.
to create a large open bibliographic database within Wikidata.
to reveal, build, and maintain community stakeholdership for the inclusion and management of source metadata in Wikidata.
There have been various proposals over the years for similar projects (seemeta:WikiCite for details). Now that Wikidata is here, we can make it happen.
User:Research Bot has imported metadata for millions of academic works. Some data is stored as strings (such as author names)
Go toScholia, search on a name for an author, go to that page, append "/missing" to the URL (e.g.,https://scholia.toolforge.org/author/Q6371926/missing) and then start resolving author strings. There are "/missing" pages for topics as well, e.g. as documentedhere, and venues have them too (example).
AWikiCite Roadmap of the future of bibliographic data in Wikidata has been discussed
so9q worked on apython bot to follow the Wikimedia eventstream and list DOIs and ISBNs found in Wikipedia articles mentioned in the stream. Next step it to import them if thebot request is approved..
Daniel Mietchen is addingmain subject(P921) to publications (this is not really an import, since we do not have appropriate tooling, e.g. as discussedhere);
The GWMAB is working, mostly manually, on the import of items of researchers affiliated to Italian universities (seeproject IRIS) which is the main focused driving force of the import strategy in Italy
How to create a scholarly profile for a researcher in Wikidata
Consider the platform
Visit Wikidata
Wikidata is the database which anyone can edit
The Wikidata community curates this data
Consider Wikicite
Wikicite is the community project within Wikidata which curates source metadata
The Wikicite community is a subset of the Wikidata community
Consider how anyone accesses data
Scholia is the specialized Wikidata tool for viewing academic profiles of people, topics, universities, etc
If a profile looks good in Scholia, then the data is correctly formatted to be maximally open and accessible in Wikidata and the Semantic Web
Making a profile look good in Scholia is the quickest and easiest way to format data once and for all
the Wikidata Query Service is the general Wikidata tool for viewing groups of Wikidata content
Everyone else, including big tech, big publishing, big government, etc scrape Wikidata and reuse this content, so what is in Wikidata goes everywhere else
Identify or create the Wikidata item for the researcher to profile
use basic Wikidata search by the person's name
if the item for the person exists, then use it
if the item does not exist, then create it
follow the instructions for creating a profile for a human in Wikidata:WikiProject Biographies
add enough information to uniquely identify this person by name and a few other characteristics
If there is ambiguity because multiple people have the same name and characteristics, then create a new item. Items can be merged, and merging duplicates is easier to fix than separating mixed items.
Try to add the ORCID, which is a unique scientific identifier
ContentMine presentation, Wikimania 2014. Wikiwish: "An Open Bibliography of science, updated daily" (the first bulletpoint at 27:30)Citing as a public service: presentation byUser:DarTar at the 2015 Wikipedia Science Conference pitching Wikidata as an open bibliographic and citation data repository
Some possibleData Collaborators have expressed interest on working on source metadata in Wikidata: others might usefully be approached.
ContentMine has some excellentopen software tools, which we could use to let Wikidata answer queries like "List all the review papers ever written on malaria vaccines", "List all the articles that mentionLygodactylus williamsii", "List every paper ever written by John Tuzo Wilson" and "List all the papers cited in Wikipedia articles that have been retracted". They listed "An Open Bibliography of science, updated daily" as a "wikiwish" at Wikimania 2014, apparently unaware that this project has been started at a slightly earlier workshop.
PLOS has anAPI for RichCitations, which contains metadata on all PLOS papers up through late 2014. Rich Citations is a novel structured format to express each citation as a data element, and it includes a set of useful, additional terms specific to scholarly literature that enable research about the knowledge web citations create. It also includes a display feature much likeReference Tooltips, but linked to a database (which is open licensed), so it can update metainformation. They presented at Wikimania 2014 and are keen to collaborate and share their results with us.
Zotero is interested in the idea of a proofread metadata source. Some Zotero users currently upload to cloud storage; we might build tools to let them upload here, instead.CiteseerX has a large open-licensed database of article metadata, and might want to set up an exchange, but have not responded to e-mails.
The Cochrane Collaboration is developing anAPI to its metadata (they were contacted about this project in July 2014, so this use case may have helped shape the API). They produce large amounts of non-conventional metadata on works they review, and on works they produce, both of which Wikimedians quote.
Institutional repositories are also increasingly interested in open APIs and linked databases, and seem generally receptive to this project. The university-run academic search engineBASE aggregates and normalizes these repositories and makes its data collection available for non-commercial purposes.
The french ministry of research and teaching developped an open science barometer by harvesting various source of bibliometric data.