Language documentation (also:documentary linguistics) is a subfield oflinguistics which aims to describe the grammar and use ofhuman languages. It aims to provide a comprehensive record of the linguistic practices characteristic of a givenspeech community.[1][2][3]
Language documentation seeks to create as thorough a record as possible of the speech community for both posterity andlanguage revitalization. This record can be public or private depending on the needs of the community and the purpose of the documentation. In practice, language documentation can range from solo linguistic anthropological fieldwork to the creation of vast online archives that contain dozens of different languages, such asFirstVoices orOLAC.[4]
Language documentation provides a firmer foundation for linguistic analysis in that it creates acorpus of materials in the language. The materials in question can range from vocabulary lists and grammar rules to children's books and translated works. These materials can then support claims about the structure of the language and its usage.[5] This should be seen as a basictaxonomic task for linguistics, identifying the range of languages and their characteristics.
While various linguists had already been practising language documentation during the 20th century, the discipline was established as a specific field within linguistics following a 1998 publication byNikolaus Himmelmann,[1] which many understand as foundational.[6]
Typical steps involve recording, maintaining metadata, transcribing (often using theInternational Phonetic Alphabet and/or a "practicalorthography" made up for that language), annotation and analysis, translation into a language of wider communication, archiving and dissemination.[7]Critical is the creation of good records in the course of doing language description. The materials can be archived, but not all archives are equally adept at handling language materials preserved in varying technological formats, and not all are equally accessible to potential users.[8]
Language documentation complementslanguage description, which aims to describe a language's abstract system of structures and rules in the form of a grammar or dictionary. By practising good documentation in the form of recordings with transcripts and then collections of texts and a dictionary, a linguist works better and can provide materials for use by speakers of the language. New technologies permit better recordings with better descriptions which can be housed in digital archives such asAILLA,Pangloss, orParadisec. These resources can then be made available to the speakers. The first example of a grammar with a media corpus isThieberger's grammar ofSouth Efate (2006).[9]
Language documentation has also given birth to new specialized publications, such as the free online and peer-reviewed journalLanguage Documentation & Conservation and the SOAS working papersLanguage Documentation & Description.
Thedigitization of archives is a critical component of language documentation andrevitalization projects.[10] There are descriptive records of local languages that could be put to use in language revitalization projects that are overlooked due to obsolete formatting, incomplete hard-copy records, or systematic inaccessibility. Local archives in particular, which may have vital records of the area'sindigenous languages, are chronically underfunded and understaffed.[11] Historic records relating to language that have been collected by non-linguists such as missionaries can be overlooked if the collection is not digitized.[12] Physical archives are naturally more vulnerable to damage and information loss.[10]
Language documentation can be beneficial to individuals who would like to teach or learn an endangered language.[13] If a language has limited documentation this also limits how it can be used in a language revitalization context. Teaching with documentation and linguist's field notes can provide more context for those teaching the language and can add information they were not aware of.[13] Documentation can be useful for understanding culture and heritage, as well as learning the language. Important components when teaching a language includes: Listening, reading, speaking, writing, and cultural components. Documentation gives resources to further the skills for learning a language.[13] For example, the Kaurna language was revitalized through written resources.[14] These written documents served as the only resource and were used to re-introduce the language and one way was through teaching, which also included the making of a teaching guide for the Kaurna language.[14] Language documentation and teaching have a relationship because if there are no fluent speakers of a language, documentation can be used as a teaching resource.
Language description, as a task within linguistics, may be divided into separate areas of specialization: