Movatterモバイル変換


[0]ホーム

URL:



Full-Text Electronic Databases of Pre-modern Japanese Literature

By Satoko Shimazaki



Introduction

This project seeks to introduce significant electronic full-text databasesfor pre-modern Japanese literature. There are numerous on-line sites reproducingclassical works today, but databases that are useful for researchers inJapanese studies are rare. Since e-texts cannot substitute for annotatedscholarly print editions, their major utility is the capacity to searchthe texts for specific words and word combinations.  For those withfull Japanese language capability on their computers, of course, you cansimply download the texts and then use the search functions in your word-processingprograms.  Insection 4 below, a list of full-textdatabases without search functions is provided for those using a Japaneseinterface with search capabilities.

But for the rest, it will be necessary to rely on the search functionsprovided by the sites themselves, on which I have focused in this report,specifically the Japanese Text Initiative at University of Virginia, thevarious electronic databases at Kokubungaku Kenkyû Shiryôkan(NIJL), and the Kichô shiryô gazô at Kyoto University,which offers digitalized images of original manuscripts.  Online full-textsof Japanese literature are still fairly new and there are still many inconveniencesfor users. But e-texts are a rapidly expanding field, which are sure tobring about significant changes in textual research methods.

Each site described below is is divided into the following four sections:

Introduction: a general overview of the project
Basic information: background information on the site and itspurpose.
Content: features of its text and search functions
Future prospects: future plans on the improvement and goalsof the site.
 


CONTENTS

1.Introduction: Index of E-texts
2. Searchable Full-Text Databases
    A)Japanese Text Initiative
    B)Kokubungaku kenkyûshiryôkan (NIJL)
        a.NKBT Full-TextDatabase
        b.Nijûichi-daishû Database
        c.Renga Database
3.Imaged Manuscripts
4.Other On-line Texts for Japanese Literature
 


1.  INTRODUCTION: INDEX OFE-TEXTS

The best starting point for locating particular electronic text versionsof classical (but not modern) Japanese literature texts is the PMJS indexof"Translationsof Classical Japanese Works."   This crucial resource liststranslations into European languages of all major classical works, as wellas where they can be found in electronic form, either on the Web or CD-ROM.


2.  SEARCHABLE FULL-TEXT DATABASES

A)  JAPANESE TEXT INITIATIVE

 Japanese TextInitiative
URL: http://etext.lib.virginia.edu/japanese

Introduction:The Japanese Text Initiative (JTI) is presentlythe most important site for pre-modern electronic texts. JTI was createdfor research purposes and contains a search function that allows one tolook for any word or character in the entire JTI text, either by itselfor in a selected context. As JTI is primarily intended for English-speakingscholars, the users are less likely to face technical problems in downloadingthe materials as long as one’s interface can read and input Japanese characters.Although the database is still limited in selection at present, it is constantlybeing updated with new materials. The project has received internationalacclaim as JTI was named the winner of the second annual Digital ArchivesAward by Digital Frontier Kyoto, Japan in October 2000. With the inclusionof a greater number of texts, JTI will become an indispensable site forscholars in pre-modern Japanese literature.

Basic information: JTI is an ongoing collaborative electronicproject between the University of Virginia library and the University ofPittsburgh library based at the University of Virginia’s Electronic TextCenter. Scholars both in the United States and Japan have participatedin this project those including Professor J. Thomas Rimer and Mae J. Smethurstof the University of Pittsburgh and Professor Lewis Cook of Queens College.

The site intends to provide on-line web texts of both classical andmodern Japanese literature in Japanese characters. The e-texts are intendednot as a substitute for the current printed editions but to add searchcapabilities for words and characters to overcome the limitations of hardcopies. The site is originally aimed at English-speaking scholars and studentsand thus English translations are provided for works where possible. Materialsdownload in HTML format and thus it is possible to search both in Englishand Japanese.

As of December 2000, JTI has included nearly 44 works, mostly pre-moderntexts, and scholars’ introductions. The site is constantly up-dated. Itis the most rapidly growing database for on-line Japanese texts with acomprehensive word search function.

Content:The key feature of this site is the useful organizationof the text and the rapid and comprehensive search functions.

Each text is visually very well structured. For long works such as theTaleof Genji, a small frame with a table of contents appear on the righthand side while the text itself is displayed in the larger window. Thetable of content is not provided for shorter works, instead, it appearswith a break between each chapter.

Source information and annotations on the text are provided for everywork. Each text contains an acknowledgment section to clarify who has input,proofread, and tagged the text as well as an editorial note to show themanuscript used to reproduce the work on-line and links to other siteswhere the work is also available. Some materials contain an introduction,for example,Kokinshû, where scholars give an overview ofthe work and bibliographical information. The most complete section inJTI is the Noh database where notes, English translations, a list of technicalterms in Japanese, and a glossary of Japanese Noh terms are provided.

What distinguishes JTI from other electronic literature text sites isthat it permits rapid word or a character searches. Two methods of searchare possible: simple search and compound search. In the simple search,one can input a word or character and select to perform a search withinthe full text or within the bibliographical information. As an option,the search can be limited to the entire work of a particular author orto a single work. The compound search function, in addition to the optionoffered in simple search, allows one to look for a character, characterstring or words near another character or string. Thus one may conductsearch of word X near word Y within a range of 40, 80, or 120, characters.

In the "Tips for search," JTI states that search can be performed inJapanese characters, English, Rômaji, or combinations of Japaneseand English. Despite the notice, the results tend to appear only in theform it is provided in the text. A word could also appear both in kanjiand kana equivalent. Therefore, it is safest to try the search in variousformats to get the most accurate data.

Excerpts of the identified part and the number of matches in the textare presented in the result. If one has cookies enabled in one’s browser,the selected characters or words appear highlighted in red in the results.

Future prospects:In the short term, JTI intends to put all TwentyClassical Works in J. Thomas Rimer’sA Reader’s Guide to Japanese Literature(New York: Kodansha, 1988) on-line. In the longer goal, they plan to addpre-twentieth-century works such as Shinkokinshû, other anthologiesofHachidaishû,and twenty-century literature in ProfessorRimer’s guide, those that are free from copyright law restriction.


B)  KOKUBUNGAKU KENKYUSHIRYOKAN (NIJL)

KokubungakuKenkyû Shiryôkan
URL: http://www.nijl.ac.jp/databases/databases.htm

Introduction: The NIJL (National Institute of JapaneseLiterature = Kokuritsu Kokubungaku Kenkyû Shiryôkan) in Tokyooffers various on-line text databases, including the IwanamiNihon Kotenbungaku taikei, theNijûichi-dai shû, a collectionof renga, theEiri Genji monogatari, and a collectionof texts of modern poetry. Here, I will introduce only the first threedatabases, since the others are either still under construction or inaccessibleto those without a Japanese operating system. All three sites are temporarilyopened to the public before the replacement of a final version. Each hasan excellent search function and is, therefore, a useful site for researchersin pre-modern Japanese literature.

TheNKBT  Database is particularly pioneering in offeringthe entire digitalized text of  Iwanami’sNihon Koten bungaku taikeionthe Web. The full text can be downloaded, and the online version offerstwo search functions: word search and frequency order search by charactersor word. One drawback of this database compared to JTI at University ofVirginia is that it is extremely slow and some characters are not distinguishableon a non-Japanese Windows system. If this improves after the drastic reconstructionto take place in January 2000, this database will be the most comprehensiveone for researchers in pre-modern Japanese literature.

With theNijûichi-dai shû andrenga databases,even though both are still in progress, rapid word search and downloadingis possible and the text is free from “mojibake”. Since JTI does not includetheNijûichi-dai shû nor anyrenga collectionsat present, this is the only site with search functions for both sources.TheNijûichi-dai shû database can to a limited extentbe used as a substitute for theKokka taikan for those without accessto the CD-ROM version of the latter.


a)NKBT FULLTEXT DATABASE

Basic information: Nihon koten bungaku honbun database(jikken ban) [NKBT-DB] is a site operated by the Kokubungaku kenkyûshiryôkan and it contains the 560 full texts of pre-modern Japaneseliterature included in 100-volumeNihon Koten Bungaku Taikei(kyûban) from Iwanami Shoten. The site provides the full text of a work, wordsearch, and a list of letters and characters in frequency order in thetext. NKBT-DB is the largest on-line pre-modern text database.

NKBT-DB has been running from April 1999 as a tentative research site.The purpose was to improve the system before releasing the final versionon web. The test version will be suspended from January 12th,2001 for the construction of the improved version, which will open, possibly,in February 1st, 2001. The present tentative site performs slowlysince all interactions are processed via the main computer. In the newsystem, Kokubungaku kenkyû shiryôkan will move all databasesto an exclusive server, which will improve the downloading and search procedure.The renewed version is also expected to come out with corrections of thepresent version where necessary.

NKBT-DB is intended solely for research purpose and thus the users arelimited: the site allows access only to researchers, participating scholarsat the center, those with special admission, faculties in academia, librarians,undergraduate and graduate students. As for undergraduate students, thesite can be used only for their thesis project. In order to gain accessto the database, a membership is required. This is free at the moment butmight change after the renewal of the system. Membership can be obtainedby filling out an application on-line. Within a few weeks, a user ID anda password are sent to you via e-mail. As for undergraduate students, inaddition to the application, Kokubungaku kenkyû shiryôkan requiresa proposal of their thesis project and a letter of approval from theiradvisor.

Since it is controlled by the main computer of the Kokubungaku kenkyûshiryôkan, the database has a time restriction for users. Downloadingof materials and research functions can be performed only between 9:30-21:00,Japan time, and the main server is also down on Saturdays, Sundays, Holidays,between new years, and the end of each month. This inconvenience shouldbe resolved after the database is transferred to a private server.

Content: NKBT-DB contains the largest pre-modern textsand has a useful search function, however, often characters are not distinguishableon a non-Japanese operating system and the download time is extremely slow.

NKBT-DB offers two options in accessing the database: in HTML formator in Tel-net. The HTML format, which is the only option for those usingEnglish windows, is recommended because it is visually easier to read andfaster. However, Kokubungaku kenkyû shiryôkan states that theTel-net format includes more comprehensive information. After selectingthe format, one must limit the search by the first hiragana of the titleof the text and the period it belongs to, jôdai, chûko, chûse,or kinse. It is only after this procedure that a list of title appears.In this sense, this site is designed for those with a particular text inmind. An ID and pass-word is required and one can either select to downloadthe full text, search for a word in the text, or attain the list of lettersand characters in frequency order.

This database is not designed for mere viewing purpose, as the full-textis hard to read due to its small font. Rather, it is useful as a supplementafter the word search. Also, in this tentative version, it is importantto pay attention to the size of a document indicated before downloadingsince even the smallest material requires time to download.

Despite the disadvantage, NKBT-DB is still extremely useful due to itsexcellent search function. First of all, the letter and character frequencysearch (moji hindo risuto no sakusei), organizes all words, letters, charactersand symbols in the text in frequency order. Each entry is provided withthe number of times it appears in the text. At the top of the result, thenumber of different words, characters, and symbols in the text as wellas their total cumulation is indicated. As for its word search (goi mataha moji retsu kensaku to hyôji), this must be done only in Japanesecharacters. After entering a word, letter, character, or short sentencefor search, the result provides the number of hits in the text with thepage and line number. The context, in which the intended words, character,or short sentence appears is also shown. These informations allow one toreturn to the text and easily find the intended place.

Future prospects:NKBT-DB's slow downloading process, which hasbeen its major concern, should improve as soon as the database is switchedto an exclusive server. As for the other problem, the minor "mojibake"that appears in an English Windows, the only solution seems to be to accessthe site through a Japanese operating system. As the database is not originallydesigned for those accessing from a different interface, it is less likelythat Kokubungaku kenkyû shiryôkan would change the entire systemfor this purpose.


b)NIJUICHI-DAI SHU DATABASE

Basic information:Nijûichi-dai shû database(NDS-DB) contains the full text of twenty-one major waka collection fromKokinwakashû toshin zoku kokin waka shû. The source is takenfrom Shôho versionNijûichi-dai shûat the Kokubungakukenkyû shiryôkan. This is still an ongoing project and thesite, at present, is still a tentative one.  Unlike the NKBT database,NDS-DB requires no ID or password and it is opened to the public. The siteallows for rapid search and every character is distinguishable even ona non-Japanese computer.

Content: NDS-DB is visually very well organized and containsa good word search function.

Word search can be done for the word itself or in a certain context.Also, one can limit the search to a particular collection or look for itin the entire twenty-one collections. There is an option to further limitthe word search to its preface, annotation, poem, or footnotes.

Future Prospects:The completed version of the NDS-DB is almostfinished as the site is going under the supervision of the specialistsfor each collection. Eventually, NDS-DB will provide a bridge to the newestversion of theKokka Taikan by appending the poem number. Also,minor interpretation of the reading of the poem will be added as referencefor research. The completed version should allow one to cut, paste, andwrite into the poems freely after downloading them on one’s server.


c)RENGA DATABASE

Basic information:Kokubungaku Kenkyû Shiryôkan providesvery little information on this database. Basically this is a site to lookfor a renga through period, key words, or author. It contains a large numberof renga from various different sources, however, a full list of textsused to create this site is not indicated. The source information is providedonly in the search result. This site is, at present, also a tentative one.It still performs very well even on a non-Japanese operating system.

Content: The most useful feature of the site is its comprehensivesearch function. One can limit a search by the specific year (up to two),key words (up to two) in the title or the text, and author of the hokkuor wakiku. The result includes the title, renga, authors, composed year,and bibliographical information. The result of the word or character searchis highlighted.
 


3.  IMAGED MANUSCRIPTS

KichôShiryô Gazô (Kyôto University)
URL: http://ddb.libnet.kulib.kyoto-u.ac.jp/exhibit/index.html

Introduction: Kichô Shiryô Gazô [KSG]database created by Kyôto University is the most significant sitefor original materials on pre-modern Japanese literature. It has a largecollection and is reproduced on-line in clear picture. In the United States,it is difficult for scholars to gain access to these valuable materials,however, through this site, one can read them on screen by enlarging theimage to its original size. Every page of the text is very well scannedand all characters are distinguishable on-screen. Kyoto University aimsto eventually add the full text beside the images to allow for word searches.Once this becomes possible, KSG can be used both to look at the originalmaterial and to search for a key term in the text.

Basic information: KSG is an on-line project owned by KyotoUniversity. It aims to create a "desktop library" to provide access tothe valuable original pre-modern texts owned by the University. The projecthas started in 1996 and has been open to the public from 1998. By as ofDecember 2000, Kyoto University has included one hundred and twenty sixfull texts.

KSG started with the intention of allowing public access to the valuablepre-modern text and preserving the old materials on the web while the writingsare still distinguishable. As a result, the site is open to anyone; however,it is aimed at researchers in pre-modern Japanese studies. Kyoto Universityhas spent more time putting as many works as possible on-line instead ofproviding commentaries, modern translation or English translation for thegeneral viewer. The only exception to these criteria isKonjyaku monogatarishû, which includes a full introduction with a scholar’s interpretation,annotation, and translation.

In putting up the actual pages of the old materials, Kyoto Universityhaspaid much attention to the following three points: 1.Making all words decipherable:2. Providing an image where one can distinguish between the original writingand a stain or worm eaten parts. Also, making clear the red writings addedlater: 3. Allowing enlargement to view small details especially for pictorialimages. The result is satisfactory. All text downloads fast and all ofthe writing is distinguishable when enlarged.

Content: Each text is provided with a small window withthe table of content on the left, and is well organized. First the imageappears in the size to fit the screen, but by clicking on the image, itenlarges to the actual size of the document. Each material is providedwith bibliographical information and some include commentaries and introduction.

There are five ways of finding materials. To get an overview of thewhole collection, it is easiest to click on "subete no shiryô woichiran suru." Another way is to limit the text by its type: newly addeddocument (shinki tsuika gazô), national treasure (kokuhô),cultural assets (jûyô bunkazai), valuable document (kichôshiryô), materials from the bureau (bukyoku shozô gazô),and for beginners (shoshinsha muke), which includes a summary of the contextand introduction. Subject search is also possible by going to the "naiyôkara sagasu". If you have a particular title in mind, you can search bythe first hiragana sound of its title. Also, if the work is included ina special collection such as Ishinshiryô gazô database, zôkeishoinbon, fujikawa, bunko, or tanimura bunko, one can directly jump to the indexin each database.

Future Prospects:Kyoto University aims to eventually improvethe quality of the pictures to be able to put up works that require a clearerimage such as maps. Also, in the long run, they hope to be able to includetext information and data along with the images. The word search functionis another concern. Once the text is created, and the keywords are organized,this will be possible in the KSG. The word search function, which mighttake a while before it appears, would turn the site into a powerful researchtool as well as a significant resource for viewing original materials.
 


4.  OTHER ONLINE TEXTS FOR JAPANESE LITERATURE

Introduction: The list of on-line sources provide hereis open to the public and does not have a word search function. In thissense, these are not intended as a research tool except if one uses a Japaneseinterface with a search function. Most of the works provided here havenot been finally checked by a scholar. Therefore, some information mightbe less accurate compared to University of Virginia or Kokubungaku KenkyuShiryôkan sites.


NihonBungaku tô Tekisuto Fairu  (Text Files for JapaneseLiterature)
URL: http://kuzan.f-edu.fukui-u.ac.jp/bungaku.htm

This site is maintained by Okajima Akihiro of Fukui University, providinglinks to downloadable pre-modern Japanese texts, and itself contains alarge private collection. Most of the entries are based on the list byM. Shibata’s at Meisei University and other private sources. Works areorganized in chronological order and download quickly. Among the privatesites without search functions, Okajima’s is the most comprehensive.


AozoraBunko
URL: http://www.aozora.gr.jp/main.html

Aozora Bunko is an internet library created by volunteers under thesupport of the Toyota Foundation. It puts up works of which the copyrightshave expired (fifty years after the death of the author). The collectionis large and it is constantly updated.  Most of the emphasis is ontexts of modern Japanese literature, particularly in the first half ofthe 20th century.  All works can be downloaded in HTML, text file,or richer text format.


Yôkyoku350-banshû Nyûryoku

This is an ongoing project among volunteers to input the 350 plays inthe Noh repertoire, based on the Meicho zenshû edition ofYôkyokusanbyaku gojyû banand the Akao Shômondô versionYôkyokusanbyaku gojyû ban shû. Roughly 70% of the project hasalready been completed. The name of the person in charge of each work isindicated in the entire list of  works. The purpose of the projectis not for research but to provide access to Noh texts for anyone interested.The site does not include any search functions for the texts themselves.


[8]ページ先頭

©2009-2026 Movatter.jp