| Type of site | Australian library database aggregator |
|---|---|
| Available in | English |
| Owner | National Library of Australia |
| URL | trove |
| Commercial | no |
| Registration | Optional |
| Launched | 2009 |
| Current status | Online |
Trove is an Australian online library database owned by the National Library of Australia, which maintains it in partnership with source providers including National and State Libraries Australia. It is an aggregator and service which includes full-text documents, digital images, bibliographic and holdings data for items that are not available digitally, and a free faceted-search engine as a discovery tool.
The database includes archives, images, newspapers, official documents, archived websites, manuscripts and other types of data. It is one of the most well-respected and accessed GLAM services in Australia, with over 70,000 daily users.
Based on antecedents dating back to 1996, the first version of Trove was released for public use in late 2009. It includes content from libraries, museums, archives, repositories and other organisations with a focus on Australia. It allows searching of catalogue entries of books in Australian libraries (some fully available online), academic and other journals, and full-text searching of digitised archived newspapers, government gazettes and archived websites. It provides access to digitised images, maps, aggregated information about people and organisations, archived diaries and letters, and all born-digital content which has been deposited via National edeposit (NED). Searchable content also includes music, sound and videos, and transcripts of radio programs. With the exception of the digitised newspapers, none of the content is hosted by Trove itself; instead, Trove indexes its partners' collection metadata, formats and manages it, and displays the aggregated information in a relevance-ranked search result.
In the wake of government funding cuts since 2015, the National Library and other organisations have struggled to keep content flowing into Trove and up to date.
Trove's origins can be seen in the development of earlier services such as the Australian Bibliographic Network (ABN),[1] a shared cataloguing service launched in 1981.
The "Single Business Discovery Project" was launched in August 2008.[2] The intention was to create a single point of entry for the public to the various online discovery services developed by the library between 1997 and 2008, including:[2][3][4]
The service developed by the project was called Single Business Discovery Service, and was also briefly known by the staff as Girt. The name Trove was suggested by a staff member, with the associations of a treasure trove and the French verb trouver (to find or discover).[4]
The key features of the service were designed to create a faceted search system specifically for Australian content. Tight integration with the provider databases has allowed "Find and Get" functions (e.g. viewing digitally, borrowing, buying, copying). Important extra features include the provision of a "check copyright" tool and persistent identifiers (which enable stable URLs).[7]
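As an illustration of what a faceted search does in general, the following minimal sketch counts the values of a facet over a small in-memory set of records and then narrows the set by one chosen value. The records, the field names (`format`, `decade`) and the function names are hypothetical and are not taken from Trove's implementation.

```python
from collections import Counter

# Hypothetical records; titles and field names are illustrative only.
RECORDS = [
    {"title": "Record A", "format": "newspaper", "decade": 1850},
    {"title": "Record B", "format": "magazine", "decade": 1880},
    {"title": "Record C", "format": "newspaper", "decade": 1800},
    {"title": "Record D", "format": "newspaper", "decade": 1880},
]


def facet_counts(records, field):
    """Count how many records carry each value of the given facet field."""
    return Counter(record[field] for record in records)


def apply_facet(records, field, value):
    """Keep only the records whose facet field equals the chosen value."""
    return [record for record in records if record[field] == value]


# Narrow to newspapers, then show which decades remain available as facets.
newspapers = apply_facet(RECORDS, "format", "newspaper")
print(facet_counts(RECORDS, "format"))
print(facet_counts(newspapers, "decade"))
```

Each facet selection shrinks the result set, and the counts are recomputed over what remains, which is what lets a user drill down step by step.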
The first version of Trove was released to the public in late 2009.[7]
The National Library of Australia combined eight different online discovery tools that had been developed over a period of twelve years into a new single discovery interface that was released as a prototype in May 2009 for public comment before launching in November 2009 as Trove.[8] It is continually updated to expand its reach.[9][10] With the notable exception of the newspaper "zone", none of the material that appears in Trove search results is hosted by Trove itself. Instead, it indexes the content of its content partners' collection metadata and displays the aggregated information in a relevance-ranked search result.[11]
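The aggregation model described above, in which partner metadata is indexed centrally and returned as a relevance-ranked list, can be sketched roughly as follows. This is a toy under stated assumptions: the partner names, record fields and the term-counting score are hypothetical and say nothing about how Trove actually ranks results.

```python
def aggregate(partner_feeds):
    """Flatten per-partner metadata lists into one index, tagging each record with its source."""
    index = []
    for partner, records in partner_feeds.items():
        for record in records:
            index.append({**record, "source": partner})
    return index


def score(record, query):
    """Toy relevance: query-term matches in the title count double, description matches once."""
    terms = query.lower().split()
    title = record["title"].lower().split()
    description = record.get("description", "").lower().split()
    return sum(2 * title.count(term) + description.count(term) for term in terms)


def search(index, query):
    """Return matching records, most relevant first; non-matching records are dropped."""
    scored = [(score(record, query), record) for record in index]
    hits = [pair for pair in scored if pair[0] > 0]
    hits.sort(key=lambda pair: pair[0], reverse=True)
    return [record for _, record in hits]


# Hypothetical partner feeds: only metadata is held, not the items themselves.
FEEDS = {
    "library_a": [{"title": "Gold rush diaries", "description": "Letters from the gold fields"}],
    "museum_b": [
        {"title": "Mining tools", "description": "Tools used during the gold rush"},
        {"title": "Wool trade maps", "description": "Maps of shipping routes"},
    ],
}

results = search(aggregate(FEEDS), "gold rush")
for record in results:
    print(record["source"], "-", record["title"])
```

Because only metadata is indexed, each result would still point back to the partner institution that holds the item.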
The service is built using a variety of open source software.[12][13] Trove provides a free, public Application Programming Interface (API).[14] This allows developers to search across the records for books, images, maps, video, archives, music, sound, journal articles, newspaper articles and lists, and to retrieve the associated metadata using XML and JSON encoding.[15][16] The full text of digitised newspaper articles is also available.[17]
Several citation styles are automatically produced by the software, giving a stable URL at the edition, page or article level for any newspaper. Wikipedia was closely integrated from the beginning of the project, making Trove the first GLAM website in the world to integrate the Wikipedia API into its product.[18]
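A client for a search API of this kind typically builds a query URL and then decodes the returned metadata. The sketch below shows that pattern only; the base URL is a placeholder, and the parameter names (`q`, `category`, `encoding`) and the response layout are assumptions made for illustration, so the current API documentation should be consulted for the real endpoint, authentication and payload.

```python
import json
from urllib.parse import urlencode

# Assumption: placeholder endpoint, not the real service address.
BASE_URL = "https://api.example.org/result"


def build_search_url(query, category, encoding="json"):
    """Build a search URL; the parameter names here are assumed, not confirmed."""
    params = {"q": query, "category": category, "encoding": encoding}
    return f"{BASE_URL}?{urlencode(params)}"


# Hypothetical response body; the real payload layout may differ.
SAMPLE_RESPONSE = (
    '{"records": [{"id": "1", "title": "Example article", '
    '"url": "https://example.org/1"}]}'
)


def extract_titles(body):
    """Decode a JSON body and return the title of each record in it."""
    payload = json.loads(body)
    return [record["title"] for record in payload.get("records", [])]


print(build_search_url("gold rush", "newspaper"))
print(extract_titles(SAMPLE_RESPONSE))
```

Keeping URL construction and response parsing in separate functions makes it straightforward to swap in the documented endpoint or an XML decoder without touching the rest of a script.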
Trove has continued to evolve and take on new services and collections.
In 2012, Music Australia was integrated with Trove, and ceased to exist as a separate entity.[19]
In 2016, in collaboration with the State Library of New South Wales, Trove launched the Government Gazettes zone, and continues to collect the official gazettes of all levels of government (Commonwealth and State and Territory) where possible.[20]
In March 2019 PANDORA became part of the larger Australian Web Archive, which comprises the PANDORA archive, the Australian Government Web Archive (AGWA) and the National Library's ".au" domain collections, using a single interface in Trove which is publicly available.[21][22][23][24]
Trove has grown beyond its original aims, and has become "a community, a set of services, an aggregation of metadata, and a growing repository of full text digital resources" and "a platform on which new knowledge is being built". It is now a collaboration between the National Library, Australia's State and Territory libraries and hundreds of other cultural and research institutions around Australia.[25]
It is an Australian online library database aggregator; a free faceted-search engine hosted by the National Library of Australia,[26] in partnership with content providers, including members of the National and State Libraries Australia (NSLA).[7]
Trove "brings together content from libraries, museums, archives, repositories and other research and collecting organisations big and small" in order to help users find and use resources relating to Australia, and therefore the content is Australian-focused.[25] Much of the material may be difficult to retrieve with other search tools, for example in cases where it is part of the deep web, including records held in collection databases,[7] or in projects such as the PANDORA web archive, Australian Research Online, Australian National Bibliographic Database and others mentioned above.[3]
Since 2019, Trove has included access to all electronic documents deposited by Australian publishers under the legal deposit provisions of the Copyright Act 1968, as amended in 2017 to include such publications.[27] These resources are identifiable by a display in the top right-hand corner in both the ebook and PDF viewers, saying "National edeposit collection". Many of these are readable and some are downloadable, depending on the access conditions.[28]
The site's content is split into "zones" designating different forms of content which can be searched all together, or separately.[29]
The book zone allows searching of the collective catalogues of institutions findable in Libraries Australia using the Australian National Bibliographic Database (ANBD). It provides access to books, audio books, e-books, theses, conference proceedings and pamphlets listed in ANBD, which is a union catalogue of items held in Australian libraries and a national bibliographic database of resources including Australian online publications.[30] Bibliographic records from the ANBD are also uploaded into the WorldCat global union catalogue.[31] The results can be filtered by format if searching for braille, audio books, theses or conference proceedings, and also by decade and language of publication.[32] A filter for Australian content is also provided.[8][33]


Trove allows text-searching of digitised historic newspapers, with the Newspapers zone replacing the previous "Australian Newspapers" website.[citation needed] It provides text-searchable access to over 700 historic Australian newspapers from each State and Territory.[35] By 2014, over 13.5 million digitised newspaper pages had been made available through Trove as part of the Australian Newspaper Plan (ANPlan),[36] a "collaborative program to collect and preserve every newspaper published in Australia, guaranteeing public access" to these important historical records.[37]
The digitised newspaper archives are wide-reaching and include now-defunct publications, such as the Australian Home Companion and Band of Hope Journal and The Barrier Miner in New South Wales and The Argus in Victoria.[note 1][38] They include the earliest published Australian newspaper, the Sydney Gazette (which dates to 1803), and some community language newspapers.[36] Also included is The Australian Women's Weekly.[39][note 2]
The Canberra Times is the only major newspaper available beyond 1957. It allowed publication of its in-copyright archive up to 1995 as part of the "centenary of Canberra" in 2013,[41] and the digitisation costs were raised with a crowdfunding campaign.[42] Also crowdfunded, the Australian feminist magazine The Dawn was included on International Women's Day 2012.[43][44]
As of 10 May 2020, 23,498,368 newspaper pages and 2,026,782 government gazette pages were available to view.
On 25 July 2008 the "Australian Newspapers Beta" service was released to the public as a standalone website and a year later became a fully integrated part of the newly launched Trove. The service contains millions of articles from 1803 onwards, with more content being added regularly.[45] The website was the public face of the Australian Newspapers Digitisation Project, a coordination of major libraries in Australia to convert historic newspapers to text-searchable digital files. The Australian Newspapers website allowed users to search the database of digitised newspapers from 1803 to 1954, which are now in the public domain.
The newspapers (frequently microfiche or other photographic facsimiles) were scanned and the text of the articles was captured by optical character recognition (OCR) to facilitate easy searching, but the captured text contains many OCR errors, often due to poor-quality facsimiles.[46][47]
Since August 2008 the system has incorporated crowdsourced text-correction as a major feature, allowing the public to change the searchable text. Many users have contributed tens of thousands of corrected lines, and some have contributed millions.[48] As of January 2022, 5.82% of articles had at least one correction.[49] This collaborative participation allows users to give back to the service and over time improves the database's searchability.[50][51] The text-correcting community and other Trove users have been referred to as "Trovites"[52] or, less euphoniously, "Voluntroves".[53]
The Australian Web Archive, created in March 2019,[54] includes websites archived from 1996 until the present. This is the primary search portal of the PANDORA web-archiving service, and also includes the Australian Government Web Archive (AGWA) as well as websites from the ".au" domain, which are collected annually through large crawl harvests.[55]
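The effect of crowdsourced correction on searchability can be illustrated with a small sketch in which user corrections are stored as an overlay on the original OCR lines and the search text is built from the corrected version. The `Article` class, its fields and the sample text are hypothetical and are not a description of how Trove stores corrections.

```python
from dataclasses import dataclass, field


@dataclass
class Article:
    """An article as OCR lines plus a per-line overlay of user corrections."""
    ocr_lines: list
    corrections: dict = field(default_factory=dict)  # line index -> corrected text

    def correct(self, line_no, text):
        """Record a user's corrected text for one line."""
        self.corrections[line_no] = text

    def searchable_text(self):
        """Prefer the corrected line where one exists, else fall back to raw OCR."""
        return "\n".join(
            self.corrections.get(i, line) for i, line in enumerate(self.ocr_lines)
        )


def corrected_share(articles):
    """Percentage of articles that have at least one correction."""
    if not articles:
        return 0.0
    corrected = sum(1 for article in articles if article.corrections)
    return 100 * corrected / len(articles)


# Invented OCR output containing typical character-recognition errors.
article = Article(["Tlie govemor arrived", "in Sydney"])
found_before = "governor" in article.searchable_text()
article.correct(0, "The governor arrived")
found_after = "governor" in article.searchable_text()
print(found_before, found_after)
print(corrected_share([article, Article(["untouched text"])]))
```

Storing corrections as an overlay keeps the original OCR output intact, so a mistaken correction can be reverted without rescanning.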
(In order of presentation along the top tab.)
In a keynote address to the 14th National Australian Library and Information Association (ALIA) Conference in Melbourne in 2014, Roly Keating, Chief Executive of the British Library, described Trove as "exemplary" – a "both-end choice" of deep rich interconnected archive.[58]
Digital humanities researcher and Trove manager Tim Sherratt noted that in relation to the Trove Application Programming Interface (API) "delivery of cultural heritage resources in a machine-readable form, whether through a custom API or as Linked Open Data, provides more than just improved access or possibilities for aggregation. It opens those resources to transformation. It empowers us to move beyond 'discovery' as a mode of interaction to analyse, extract, visualise and play".[59] The subsequent development of the GLAM Workbench[60] aims to utilise such machine-readable data.[61] Since 2018 the Australian Academic and Research Network (AARNet) has provided a dedicated Jupyter Notebooks environment that enables researchers to "easily explore and analyse data held in the National Library of Australia (and Cloudstor) using Jupyter Notebooks created and openly shared by Associate Professor Tim Sherratt via the 'GLAM Workbench'."[62]
The site has been described as "a model for collaborative digitization projects and serves to inform cultural heritage institutions building both large and small digital collections".[63]
The reach of the newspaper archives makes the service attractive to genealogists[64][65][66] and knitters.[9] It is one of the most well-respected[67] and accessed GLAM (galleries, libraries, archives and museums) services in Australia, with over 70,000 daily users.[68][9]
Dr Liz Stainforth of the University of Leeds calls it "that rare beast: a digital heritage platform with popular appeal" and one "of the most successful of its kind among aggregators such as Europeana, the Digital Public Library of America and... DigitalNZ". What distinguishes it from the other three is that it also delivers content and engages with the general public, which has created a form of virtual community amongst its text correctors. Users can log in to create their own lists and to correct the text of newspapers scanned using optical character recognition (OCR), with an honour board for the top correctors. International researchers also use Trove: a 2018 study showed the site among the top 15 for external citations in the English-language version of Wikipedia. The breadth of its audience adds to its uniqueness.[69]
Trove received the 2011 Excellence in eGovernment Award and the 2011 Service Delivery Category Award.[70][71]
In the wake of the Australian Government's 2015 Mid-Year Economic and Fiscal Outlook Statement, Trove funding was cut with the result that the National Library of Australia would cease "aggregating content in Trove from museums and universities unless ... fully funded to do so".[72] In addition, it was argued that the cuts would further "result in many smaller institutions across Australia being unable to afford to add their digital collections to this national knowledge infrastructure".[73] Those smaller institutions would include local historical societies, clubs, schools, and commercial and public organisations, as well as private collections.
In March 2016 ten major Australian galleries, libraries, archives and museums (commonly referred to as the GLAM sector) signed a statement of support for Trove, in which they warned that the budgetary cuts would "hamper the development of our world leading portal and will be a major obstacle to exposing the collections of smaller and regional institutions" and that "without additional funding, Trove will not fulfil its promise as the discovery site for all Australian cultural content".[74] Similar statements were issued by the Australian Academy of the Humanities[75] and the National Trust (NSW).[76]
Tim Sherratt, a former manager of Trove, warned in early 2016 that fewer collections would be added and that less digitised content would be available – "not quite a content freeze, but certainly a slowdown".[77]
Following extensive campaigning, including a public campaign on Twitter, Trove received a commitment of A$16.4 million in December 2016, spread over four years.[69][78]
By early 2020, with the surge in demand for all types of digital services, the National Library was having to cope with dwindling staff resources to develop services on Trove and National edeposit, and undertook a restructure of its staffing and operations.[79]
The Age and The Sydney Morning Herald revealed in 2022 that the current funding arrangements for Trove would cease at the end of June 2023, leading to its closure.[80] In April 2023, it was announced that the federal government had pledged emergency funding of $33 million over the next four years to the NLA.[81][82][83]
In July–August 2020 a redesigned user interface was rolled out, with a more open display of search results and a new logo reminiscent of a keyhole.[84]
Pilot testing of handwritten text recognition using Optical Character Recognition (OCR) and Handwritten Text Recognition (HTR) began in October 2023, with text-correcting functionality appearing on some handwritten and unpublished material.[85]