JP2003108278A

Movatterモバイル変換

Info

Publication number: JP2003108278A
Application number: JP2001303205A
Authority: JP
Inventors: Yasuo Kasai; 庸雄河西; Koji Yamada; 孝司山田; Kazuyoshi Nagaho; 和義長保; Katsuhito Suzuki; 勝仁鈴木
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 2001-09-28
Filing date: 2001-09-28
Publication date: 2003-04-11
Anticipated expiration: 2021-09-28
Also published as: JP4058928B2

Abstract

(57)【要約】【課題】往信−返信関係を有する一連の文書につい
て、各文書の要旨を迅速かつ的確に把握可能とし、文書
情報の取捨選択を簡単かつ効率的に行なえるようにする
ことを目的とする。【解決手段】掲示板表示システムＫＳでは、電子掲示
板の各スレッドＴＨにおいて、スレッドＴＨに属する各
文書（投稿文書Ａ，返信文書Ａ−１および返信文書Ａ−
１−ａ）から重要文を抽出し、各重要文からスレッドＴ
Ｈの要約を作成して画面に表示する。上記の重要文の抽
出においては、一のスレッドＴＨ内に属する全文書の本
文によって表わされる内容を考慮し、全文書の本文によ
って表わされる内容に最も近い意味の文を各文書の本文
から重要文として抽出する。(57) [Summary] [Problem] To make it possible to quickly and accurately grasp the gist of each document in a series of documents having a forward / backward relationship, and to easily and efficiently select document information. With the goal. SOLUTION: In a bulletin board display system KS, in each thread TH of the electronic bulletin board, each document (posted document A, reply document A-1 and reply document A-
1-a), extract important sentences from each important sentence
A summary of H is created and displayed on the screen. In the extraction of the important sentence described above, considering the contents represented by the texts of all the documents belonging to one thread TH, a sentence having the meaning closest to the content represented by the texts of all the documents is extracted from the text of each document. Extract as

Description

Translated fromJapanese

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、文書表示システム
に関し、詳しくは、所定の関係を有する複数の文書から
構成された文書群に関し、該文書群に属する文書の少な
くとも一部を画面上に閲覧可能に表示する技術に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document display system, and more particularly, to a document group composed of a plurality of documents having a predetermined relationship, and browsing at least a part of the documents belonging to the document group on a screen. Regarding display technology possible.

【０００２】[0002]

【従来の技術】通信技術の発達した今日では、インター
ネットやパソコン通信等のネットワーク上で複数人が文
字や画像等を用いて情報を伝達し合うことが日常的に行
なわれている。このような情報伝達は、ネットワーク上
以外での情報伝達と同様に、他人に向けて自発的に情報
を発信する者（発信者）および該情報発信者によって発
信された情報を受け取る者（受領者）、受け取った情報
に対する回答や意見等を情報発信者に返信する者（返信
者）が存在することにより成立している。2. Description of the Related Art Today, with the development of communication technology, it is commonplace for a plurality of people to communicate information on the network such as the Internet and personal computer communication by using characters and images. Such information transmission is similar to information transmission other than on the network, such as a person who voluntarily transmits information to another person (sender) and a person who receives the information transmitted by the information sender (recipient). ), It is established because there is a person (replyer) who replies to the information sender with an answer or opinion to the received information.

【０００３】一方、ネットワーク上での情報伝達は、発
信者と受領者ないし返信者とのやりとりが、データの送
受信や受信データの閲覧等のサービスを提供するサーバ
を介して行なわれるという特徴がある。即ち、発信者な
いし返信者は、発信ないし返信しようとする情報をネッ
トワークを介してサーバに送る。情報を受け取ったサー
バは、受け取った情報をネットワークを介して受領者に
送信したり、受け取った情報を受領者がネットワークを
介して閲覧可能に表示したり、前記受領者への送信や受
領者による閲覧のために保存したりするのである。な
お、サーバに接続して上記のサービスを受ける者（上記
の発信者や受領者，返信者）のことを、以下、クライア
ントという。On the other hand, the information transmission on the network is characterized in that the sender and the receiver or the replyer communicate with each other through a server which provides services such as data transmission / reception and reception data browsing. . That is, the sender or replyer sends information to be sent or reply to the server via the network. The server that receives the information sends the received information to the recipient via the network, displays the received information so that the recipient can view it via the network, and sends it to the recipient or by the recipient. Save it for viewing. A person who connects to the server and receives the above service (the above-mentioned sender, receiver, and replyer) is hereinafter referred to as a client.

【０００４】近年では、ネットワーク上での情報伝達の
手法として、発信者と特定の受領者との間で文書を送り
合うことにより情報伝達を行なう手法（例えば、プッシ
ュ型の電子メール等）以外に、サーバに設けられた電子
掲示板（ＢＢＳ）への文書の書き込みにより多数の者の
間で情報伝達を行なう手法が用いられている。具体的に
は、ある主題の文書を他人に発信しようとする発信者
は、発信しようとする文書（以下、投稿文書という）の
内容を電子掲示板に書き込む。電子掲示板に書き込まれ
た投稿文書の内容は、ネットワークを介して多数の者
（不特定人若しくは特定のグループ全員）に閲覧可能と
なる。閲覧により投稿文書の内容を受け取った受領者
は、この投稿文書に対する返答や意見等を内容とする文
書（以下、投稿に対する返信文書という）を電子掲示板
に書き込むことができる。書き込まれた投稿に対する返
信文書の内容は、投稿文書と同様に多数の者に閲覧可能
となる。閲覧により返信文書の内容を受け取った受領者
は、この返信文書に対する返答や意見等を内容とする文
書（以下、返信に対する返信文書という）を電子掲示板
に書き込むことができる。書き込まれた返信に対する返
信文書は、投稿文書等と同様に多数の者に閲覧可能とな
る。[0004] In recent years, as a method of transmitting information on a network, other than a method of transmitting information by sending a document between a sender and a specific recipient (for example, push type electronic mail). , A method of transmitting information among a large number of people by writing a document on a bulletin board (BBS) provided in a server is used. Specifically, a sender who wants to send a document of a certain subject to another person writes the content of the document to be sent (hereinafter referred to as posted document) on the electronic bulletin board. The content of the posted document written on the electronic bulletin board can be viewed by a large number of people (unspecified people or all specified groups) via the network. Recipients who have received the content of the posted document by browsing can write a document (hereinafter referred to as a reply document to the post) containing the reply, opinion, etc., to the electronic bulletin board. The content of the reply document to the written post can be viewed by many people like the posted document. The recipient who has received the contents of the reply document by browsing can write a document (hereinafter referred to as a reply document to the reply) containing the reply, opinion, etc., to the electronic bulletin board. The reply document to the written reply can be viewed by a large number of people like the posted document.

【０００５】投稿文書と投稿に対する返信文書は、主題
が共通する文書として一連のつながりを有しており、所
定の関係を有している。このため、従来の電子掲示板で
は、往信−返信関係を有する一連の文書を一の文書群と
みなし、各文書群に属する文書をツリー構造（枝分かれ
をした階層構造）に整理していた。ここで、ツリー構造
の最上層に位置する一の投稿文書から該投稿文書が属す
るツリー構造の最下層に位置する返信文書までが一の往
信−返信関係で接続されるとき、該一の往信−返信関係
で接続される全ての文書のまとまりを、以下、スレッド
という。The posted document and the reply document to the post have a series of connections as a document having a common subject, and have a predetermined relationship. For this reason, in the conventional electronic bulletin board, a series of documents having an outgoing-reply relationship is regarded as one document group, and the documents belonging to each document group are arranged in a tree structure (branched hierarchical structure). Here, when one post document located at the uppermost layer of the tree structure to a reply document located at the bottom layer of the tree structure to which the posted document belongs is connected in one reply-reply relationship, the one reply- A group of all documents connected in a reply relationship is hereinafter called a thread.

【０００６】従来の電子掲示板では、上記のツリー構造
に基づいて各文書群に属する文書の系列図表を作成し
て、この系列図表を閲覧可能に表示し、文書間の往信−
返信関係をクライアントが容易に把握できるようにして
いた。また、表示される系列図表中に各文書の表題や各
文書の本文の冒頭部分を併記することにより、クライア
ントが、各文書の本文にアクセスする前に文書の概要を
把握できるようにしていた。In the conventional electronic bulletin board, a series diagram of documents belonging to each document group is created based on the above tree structure, the series diagram is displayed so that it can be viewed, and communication between documents
The client was able to easily understand the reply relationship. Also, by adding the title of each document and the beginning of the body of each document in the displayed sequence chart, the client can grasp the outline of the document before accessing the body of each document.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら、従来の
ネットワーク上での情報伝達では、所定の関係を有する
各文書の要旨を一目で正確に把握することは困難であ
り、各文書の要旨を正確に把握するためには、最上層に
位置する文書から順番に全ての文書の本文にアクセスし
て本文の内容を理解しなければならず、正確な文書の要
旨の把握に時間と労力がかかってしまうという問題があ
った。特に、クライアントに対して一度に表示される文
書数が多い場合には、上記の問題はより顕著であった。However, in the conventional information transmission on the network, it is difficult to accurately grasp the gist of each document having a predetermined relationship at a glance, and the gist of each document can be accurately determined. In order to understand, it is necessary to access the texts of all the documents in order from the document located in the top layer and understand the contents of the texts, and it takes time and labor to accurately grasp the outline of the document. There was a problem. In particular, when the number of documents displayed to the client at one time is large, the above problem is more remarkable.

【０００８】例えば、多数の文書を含む系列図表が表示
される電子掲示板の場合には、系列図表中に表示される
各文書の表題から文書の正確な要旨を把握することは難
しかった。各文書の表題は、種々の発信者ないし返信者
が思い思いの自由なスタイルで記載したものであるた
め、文書の正確な要旨を表現しているとは言い難いから
である。特に、返信者による返信文書においては、投稿
文書の表題の冒頭に「Ｒｅ：」のようなレス表示を付加
したものが自動的に表題とされるような場合があり、こ
のような場合には返信文書の要旨を表題から全く理解す
ることができなかった。また、系列図表に各文書の本文
の冒頭部分を併記しても、冒頭に文書の要旨とは関係な
いことが記載されている場合には、文書の要旨を把握す
ることはできなかった。[0008] For example, in the case of an electronic bulletin board displaying a series chart including a large number of documents, it is difficult to grasp the exact gist of the document from the title of each document displayed in the series chart. This is because the title of each document is written by various senders or responders in a freely styled manner, and it is difficult to say that the title of each document accurately represents the document. In particular, in a reply document sent by a replyer, there may be a case where a reply message such as "Re:" is automatically added to the beginning of the title of a posted document, and in such a case, I could not understand the gist of the reply document from the title at all. In addition, even if the beginning part of the text of each document is added to the sequence chart, if the beginning does not relate to the gist of the document, the gist of the document could not be grasped.

【０００９】また、携帯電話等において多数のメールを
一度に受信した場合には多数の各メールの一部（例え
ば、表題や本文の冒頭部分）がディスプレイに初期表示
されるが、この初期表示されるメールの表題からメール
の正確な要旨を把握することは上記電子掲示板と同様の
理由により難しく、多数のメールの正確な要旨を把握す
るために、受領者は、初期表示後に、全てのメールにつ
いて、メール本文を更に表示するための操作を行なうと
共に、メール本文が表示された画面をスクロール等しな
がらメール本文を読まなければならなかった。When a large number of mails are received at once by a mobile phone or the like, a part of each of the many mails (for example, the title or the beginning of the text) is initially displayed on the display. It is difficult to understand the exact gist of the mail from the subject of the e-mail for the same reason as the above electronic bulletin board. , It was necessary to perform an operation to further display the mail text and read the mail text while scrolling the screen on which the mail text was displayed.

【００１０】また、発信者と特定の受領者との間で文書
を送り合う電子メール等の場合にも、送信メールと受信
メールとが往信−返信関係にあり、主題が共通する文書
として一連のつながりを有している。このような往信−
返信関係を有する一連のメールの要旨を、各メール本文
にアクセスすることなく、一目で正確に把握する手法に
ついては、従来において何ら提案されていなかった。Also, in the case of electronic mail or the like in which documents are sent between a sender and a specific recipient, the sent mail and the received mail have a forward-return relationship and a series of documents having common subjects are used. Have a connection. Such communication-
In the past, no method has been proposed for accurately grasping the gist of a series of emails having a reply relationship at a glance without accessing each email text.

【００１１】本発明は、かかる問題を解決し、往信−返
信関係を有する一連の文書について、各文書の要旨を迅
速かつ的確に把握可能とし、文書情報の取捨選択を簡単
かつ効率的に行なえるようにすることを目的として、以
下の構成を採った。The present invention solves such a problem, makes it possible to quickly and accurately grasp the gist of each document in a series of documents having a back-and-forth reply, and to select document information easily and efficiently. The following configuration was adopted for the purpose of doing so.

【００１２】[0012]

【課題を解決するための手段およびその作用・効果】本
発明の文書表示システムは、所定の関係を有する複数の
文書の少なくとも一部を画面上に閲覧可能に表示する文
書表示システムであって、前記複数の文書からひとまと
まりの文書群を抽出する文書群抽出手段と、前記文書群
に属する文書全体によって表わされる内容を考慮して、
前記文書群に属する各文書から該文書に所定の相関を有
する文を重要文として抽出する重要文抽出手段と、該重
要文抽出手段により抽出された重要文を、前記文書群に
属する文書の一部として前記画面上に表示する重要文表
示手段とを備えたことを要旨とする。A document display system of the present invention is a document display system for displaying at least a part of a plurality of documents having a predetermined relationship on a screen so that the document can be browsed. Considering the document group extraction means for extracting a group of documents from the plurality of documents, and the content represented by all the documents belonging to the document group,
An important sentence extracting unit that extracts a sentence having a predetermined correlation with the document as an important sentence from each document belonging to the document group, and an important sentence extracted by the important sentence extracting unit as one of the documents belonging to the document group. The gist is to provide an important sentence display means for displaying on the screen as a part.

【００１３】上記発明の文書表示システムによれば、重
要文抽出手段が、ひとまとまりの文書群に属する文書全
体によって表わされる内容を考慮して、文書群に属する
各文書から該文書に所定の相関を有する文を重要文とし
て抽出し、抽出された重要文を重要文表示手段が文書の
一部として画面上に表示する。従って、文書群における
主題の展開を、表示された重要文を見ただけで正確に把
握することができる。上記所定の関係を有する複数の文
書を、往信と返信の関係にある文書上とすることも好ま
しい。According to the document display system of the invention described above, the important sentence extracting means considers the contents represented by the entire documents belonging to the group of documents and makes a predetermined correlation with each document belonging to the documents. The sentence having "is extracted as an important sentence, and the extracted important sentence is displayed on the screen by the important sentence display means as a part of the document. Therefore, the development of the subject in the document group can be accurately grasped only by looking at the displayed important sentence. It is also preferable that the plurality of documents having the above-mentioned predetermined relationship are on a document having a relationship between an outgoing message and a reply.

【００１４】文書群の種類を判別する種類判別手段を備
え、該種類判別手段により判別された種類に応じて重要
文を抽出する構成としてもよい。こうすれば、文書群の
種類に応じた重要文が表示されるので、主題の展開をよ
り正確に把握することができる。It is also possible to provide a type discriminating means for discriminating the type of the document group and to extract an important sentence according to the type discriminated by the type discriminating means. In this way, since the important sentence corresponding to the type of document group is displayed, the development of the subject can be grasped more accurately.

【００１５】この場合において、文書群の種類を、質問
表現を含む質問型，複数の話題を含む複数話題型，会話
文を含む会話型のうち少なくとも一つとすることも好適
である。また、重要文抽出手段が、文書群に属する各文
書から該文書に所定の相関を有する文を重要文として抽
出する基準を該文書群に属する文書の種類に応じて異な
らせる相関設定手段を備えることも望ましい。In this case, it is also preferable that the type of the document group is at least one of a question type including a question expression, a plural topic type including a plurality of topics, and a conversation type including a conversation sentence. Further, the important sentence extracting means is provided with a correlation setting means for changing a criterion for extracting a sentence having a predetermined correlation with the document as a significant sentence from each document belonging to the document group according to a type of the document belonging to the document group. Is also desirable.

【００１６】重要文抽出手段により抽出された重要文に
基づいて文書群の要約を作成する要約作成手段を備え、
重要文表示手段に替えて、該要約作成手段により作成さ
れた要約を画面上に表示する要約表示手段を備えること
も望ましい。こうすれば、文書群ごとの話題の内容を迅
速かつ的確に把握することができる。A summary creating means for creating a summary of the document group based on the important sentences extracted by the important sentence extracting means,
It is also desirable to provide a summary display means for displaying the summary created by the summary creation means on the screen, instead of the important sentence display means. By doing so, it is possible to quickly and accurately grasp the content of the topic for each document group.

【００１７】こうした要約表示手段と共に、重要文抽出
手段により抽出された重要文を文書群に属する文書の一
部として前記画面上に表示する重要文表示手段を備える
構成としても差し支えない。In addition to the summary display means, the important sentence display means may be arranged to display the important sentence extracted by the important sentence extracting means on the screen as a part of the document belonging to the document group.

【００１８】所定の基準に基づいて前記各文書の文を標
準的な表現に書き換える文標準化手段を備えることも好
適である。また、要約作成手段が、少なくとも前記重要
文抽出手段により抽出された重要文の表現を標準的な表
現に書き換える手段を備える構成とすれば、文書作成者
の文書作成に関する技量に拘らず均一なレベルの文が文
書や文書群の要旨として表示されるので、文書群ごとの
話題の内容をより一層把握しやすくなる。It is also preferable to include sentence standardization means for rewriting the sentence of each document into a standard expression based on a predetermined standard. Further, if the summary creating means is provided with at least means for rewriting the expression of the important sentence extracted by the important sentence extracting means into a standard expression, a uniform level can be obtained regardless of the skill of the creator of the document creation. Is displayed as the summary of the document or document group, it becomes easier to understand the content of the topic for each document group.

【００１９】上記の文標準化手段としては、例えば、冗
長な表現の短文化や自立語の他の自立語への置換，付属
語の用法の訂正，方言から標準語への置換，予め定めた
文字への置き換え，表記のゆれの統一等を考えることが
できる。Examples of the sentence standardization means include, for example, a short culture of redundant expressions and replacement of an independent word with another independent word, correction of usage of ancillary words, replacement of dialects with standard words, and predetermined characters. It is possible to think about replacement with, unification of notations, etc.

【００２０】所定の基準に基づいて文書の中から文書の
要旨を構成し得ない表現を除いた文を有意味文として抽
出する有意味文抽出手段を備え、重要文抽出手段または
要約作成手段を、該有意味文を用いて重要文の抽出また
は要約の作成を行なう手段とすることも好適である。上
記の表現抽出手段により抽出される表現としては、例え
ば、以前の発言の引用箇所や挨拶文、発信者の署名等を
考えることができる。こうすれば、重要文の抽出または
要約の作成を効率的かつ精度よく行なうことができる。Meaningful sentence extracting means for extracting a sentence excluding expressions that cannot constitute the gist of the document from the document on the basis of a predetermined criterion as a meaningful sentence is provided. It is also preferable that the meaningful sentence is used as a means for extracting an important sentence or creating a summary. The expression extracted by the expression extracting means may be, for example, a quoted part of a previous statement, a greeting, a signature of the sender, or the like. In this way, important sentences can be extracted or a summary can be created efficiently and accurately.

【００２１】重要文表示手段または要約表示手段を、文
書群についての重要文または要約を時系列順に一覧表示
する手段とすることも好適である。こうすれば、最新の
主題や話題に容易にアクセスすることができる。It is also preferable that the important sentence display means or the summary display means is a means for displaying a list of important sentences or summaries of the document group in chronological order. This way you can easily access the latest themes and topics.

【００２２】重要文表示手段または要約表示手段を、文
書群に属する文書の作成者を重要文または要約と関連付
けて表示する手段としてもよい。こうすれば、各作成者
による発言内容の変化を把握しやすくなる。The important sentence display means or the summary display means may be a means for displaying the creator of the document belonging to the document group in association with the important sentence or the summary. This makes it easier to understand changes in the content of comments made by each creator.

【００２３】重要文抽出手段による抽出に基づいて文書
群の表題を作成する表題作成手段を備え、該表題作成手
段により作成された各文書群の表題を重要文または要約
と関連付けて表示しても差し支えない。こうすれば、作
成者によって付けられた各文書の表題を羅列する場合と
比較して、各文書群の話題の種類を把握しやすくなる。A title creating means for creating a title of a document group based on the extraction by the important sentence extracting means is provided, and the title of each document group created by the title creating means is displayed in association with the important sentence or summary. It doesn't matter. This makes it easier to understand the type of topic of each document group compared to the case where the titles of each document given by the creator are listed.

【００２４】本発明の受信装置は、通信回線を介して文
書を受信する受信装置であって、前記受信した複数の文
書からひとまとまりの文書群を抽出する文書群抽出手段
と、該文書群抽出手段により抽出された一の文書群に関
し、該文書群に属する各文書から少なくとも本文を取り
出す本文取出手段と、該本文取出手段により取り出され
た本文全体によって表わされる内容を考慮し、前記文書
群に属する各文書から該文書に所定の相関を有する文を
重要文として抽出する抽出手段と、該抽出手段により抽
出された前記一の文書群についての重要文を通信回線を
介して出力する出力手段とを備えたことを要旨とする。The receiving device of the present invention is a receiving device for receiving a document via a communication line, and a document group extracting means for extracting a group of documents from the received plurality of documents, and the document group extraction. Regarding the one document group extracted by the means, the text group extracting means for extracting at least the body of each document belonging to the document group and the content represented by the entire body extracted by the body extracting means are taken into account in the document group. Extraction means for extracting a sentence having a predetermined correlation with the document as an important sentence from each document to which it belongs, and output means for outputting the important sentence for the one document group extracted by the extracting means via a communication line. The summary is that

【００２５】上記発明の受信装置によれば、文書群抽出
手段が、通信回線を介して受信した複数の文書からひと
まとまりの文書群を抽出し、抽出された一の文書群に属
する各文書から本文取出手段が少なくとも本文を取り出
す。抽出手段は、取り出された本文全体によって表わさ
れる内容を考慮して、文書群に属する各文書から該文書
に所定の相関を有する文を重要文として抽出し、抽出さ
れた一の文書群についての重要文を出力手段が通信回線
を介して出力する。従って、出力された重要文を通信回
線を介して得ることにより、一の文書群における主題の
展開を正確に把握可能なデータを得ることが可能とな
る。例えば、このデータを表示や印刷等すれば、文書群
における主題の展開を、表示ないし印刷された重要文を
見ただけで正確に把握することができる。According to the receiving apparatus of the above invention, the document group extracting means extracts a group of documents from a plurality of documents received via the communication line, and extracts from each document belonging to the extracted one document group. The text extracting means extracts at least the text. The extracting means extracts, from the respective documents belonging to the document group, a sentence having a predetermined correlation with the document as an important sentence in consideration of the content represented by the entire extracted body, and The output means outputs the important sentence via the communication line. Therefore, by obtaining the output important sentence via the communication line, it is possible to obtain data that can accurately grasp the development of the subject in one document group. For example, if this data is displayed or printed, the development of the subject in the document group can be accurately grasped only by looking at the important sentence displayed or printed.

【００２６】なお、上記の文書群形成手段，本文取出手
段，抽出手段および出力手段を二以上の装置に分けて設
け、これら二以上の装置を組み合わせて上記発明の受信
装置を実現することも可能である。The above-mentioned document group forming means, text extracting means, extracting means and output means may be provided separately in two or more devices, and these two or more devices may be combined to realize the receiving device of the present invention. Is.

【００２７】抽出手段により抽出された重要文に基づい
て文書の要約を作成する作成手段を備え、該作成手段に
より作成された要約を通信回線を介して出力する要約出
力手段を出力手段に替えて備えることも望ましい。こう
すれば、出力された要約を通信回線を介して得ることに
より、文書群ごとの話題の内容を迅速かつ的確に把握可
能なデータを得ることが可能となる、データの利用価値
を高めることができる。There is provided a creating means for creating the abstract of the document based on the important sentence extracted by the extracting means, and the abstract outputting means for outputting the abstract created by the creating means via the communication line is replaced with the output means. It is also desirable to be prepared. In this way, by obtaining the output summary via the communication line, it becomes possible to obtain data that allows the content of the topic for each document group to be grasped quickly and accurately, and the utility value of the data can be increased. it can.

【００２８】本発明の文書表示方法は、所定の関係を有
する複数の文書の少なくとも一部を画面上に閲覧可能に
表示する文書表示方法であって、前記複数の文書からひ
とまとまりの文書群を抽出し、前記文書群に属する文書
全体によって表わされる内容を考慮して、前記文書群に
属する各文書から該文書に所定の相関を有する文を重要
文として抽出し、該抽出された重要文を、前記文書群に
属する文書の一部として前記画面上に表示することを要
旨とする。The document display method of the present invention is a document display method for displaying at least a part of a plurality of documents having a predetermined relationship in a viewable manner on a screen, and a group of documents is selected from the plurality of documents. In consideration of the content represented by the entire document belonging to the document group, a sentence having a predetermined correlation with the document is extracted as an important sentence from each document belonging to the document group, and the extracted important sentence is extracted. The gist is to display on the screen as a part of the documents belonging to the document group.

【００２９】上記発明の文書表示方法によれば、ひとま
とまりの文書群に関し、該文書群に属する文書全体によ
って表わされる内容を考慮して、各文書から該文書に所
定の相関を有する文を重要文として抽出し、抽出された
重要文を文書の一部として画面上に表示する。従って、
文書群における主題の展開を、表示された重要文を見た
だけで正確に把握することができる。According to the document display method of the present invention, regarding a group of documents, considering the contents represented by the entire documents belonging to the document group, the sentences having a predetermined correlation with each document are important. The sentence is extracted as a sentence, and the extracted important sentence is displayed on the screen as a part of the document. Therefore,
The development of the subject in the document group can be accurately grasped only by looking at the displayed important sentence.

【００３０】本発明のコンピュータプログラムを記録し
た記録媒体は、所定の関係を有する複数の文書の少なく
とも一部を画面上に閲覧可能に表示するためのコンピュ
ータプログラムを記録した記録媒体であって、前記複数
の文書からひとまとまりの文書群を抽出する工程と、前
記文書群に属する文書全体によって表わされる内容を考
慮して、前記文書群に属する各文書から該文書に所定の
相関を有する文を重要文として抽出する工程と、該抽出
された重要文を、前記文書群に属する文書の一部として
前記画面上に表示する工程とをコンピュータに実行させ
るためのプログラムをコンピュータに読み取り可能に記
録したことを要旨とする。A recording medium recording a computer program of the present invention is a recording medium recording a computer program for displaying at least a part of a plurality of documents having a predetermined relationship on a screen so that the document can be browsed. Considering the process of extracting a group of documents from a plurality of documents and the content represented by the entire documents belonging to the document group, it is important to select a sentence having a predetermined correlation from each document belonging to the document group. A program for causing a computer to execute a step of extracting the sentence as a sentence and a step of displaying the extracted important sentence on the screen as a part of a document belonging to the document group is recorded in a computer-readable manner. Is the gist.

【００３１】上記発明の記録媒体によれば、記録された
プログラムがコンピュータに読み取られることにより、
ひとまとまりの文書群に属する文書全体によって表わさ
れる内容を考慮して、各文書から該文書に所定の相関を
有する文を重要文として抽出し、抽出された重要文を文
書の一部として画面上に表示する。従って、文書群にお
ける主題の展開を、表示された重要文を見ただけで正確
に把握することができる。According to the recording medium of the above invention, the recorded program is read by the computer,
Considering the content represented by the entire document belonging to a group of documents, a sentence having a predetermined correlation with the document is extracted as an important sentence from each document, and the extracted important sentence is displayed on the screen as a part of the document. To display. Therefore, the development of the subject in the document group can be accurately grasped only by looking at the displayed important sentence.

【００３２】なお、上記の記録媒体が備える各工程を二
以上の記録媒体に分けて記録し、これら二以上の記録媒
体を組み合わせることにより上記発明の記録媒体を実現
することも可能である。It is also possible to realize the recording medium of the present invention by recording the respective steps of the above recording medium separately on two or more recording media and recording these two or more recording media in combination.

【００３３】なお、記録媒体としては、フレキシブルデ
ィスク，ＣＤ−ＲＯＭ，ＤＶＤ−ＲＯＭ，半導体メモリ
（ＲＯＭ，ＰＲＯＭ，ＥＥＰＲＯＭ，フラッシュメモリ
等）など、種々の記録媒体を用いることができる。もと
より、インターネットなどのネットワーク上に置かれた
サーバにこれらのプログラムを記憶しておき、クライア
ントのコンピュータにダウンロードして利用することも
可能である。As the recording medium, various recording media such as flexible disk, CD-ROM, DVD-ROM, semiconductor memory (ROM, PROM, EEPROM, flash memory, etc.) can be used. Of course, it is also possible to store these programs in a server placed on a network such as the Internet and download them to the client computer for use.

【００３４】また、重要文や要約等の表示は、ディスプ
レイや用紙上に表示するものにとどまらず、重要文を音
声合成などを利用して読み上げることで表示することも
可能である。耳から聞いて内容を認識する場合、何が重
要かをすぐに判別することは、文字を読み取って判断す
ることより困難なので、重要文を抽出して読み上げる構
成は、極めて有用である。携帯電話でメール文書を確認
するような場合には、重要文を音声で読み上げれば、携
帯電話の小さなディスプレイに制約されることがなく、
この点でも有用である。Further, the display of the important sentence or the summary is not limited to the one displayed on the display or the paper, and it is also possible to display the important sentence by reading it out by using voice synthesis or the like. When recognizing the content by hearing from the ear, it is more difficult to immediately judge what is important than to read the character, and thus the structure in which the important sentence is extracted and read out is extremely useful. When you check a mail document on a mobile phone, you can read out important sentences aloud without being restricted by the small display of the mobile phone.
This point is also useful.

【００３５】[0035]

【発明の実施の形態】以下、本発明の実施の形態を実施
例に基づいて説明する。（１）実施例の構成：はじめに、実施例の構成について
図１を用いて説明する。図１は、本発明の第１実施例で
ある掲示板表示システムＫＳの概略構成を示す説明図で
ある。図１に示すように、掲示板表示システムＫＳで
は、インターネットのような大規模なネットワーク１０
０に、配信サーバ３００，３１０，３２０・・・および
データベースサーバ２００が接続されている。配信サー
バ３００には、クライアントのパーソナルコンピュータ
９００，９１０，９２０・・・が電話回線を介して接続
されている。また、電子掲示板のウェブページが格納さ
れたサーバ２００には、電子掲示板の管理人のパーソナ
ルコンピュータ８００が電話回線を介して接続されてい
る。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below based on Examples. (1) Configuration of Embodiment: First, the configuration of the embodiment will be described with reference to FIG. FIG. 1 is an explanatory diagram showing a schematic configuration of a bulletin board display system KS that is a first embodiment of the present invention. As shown in FIG. 1, in the bulletin board display system KS, a large-scale network 10 such as the Internet is used.
0, the distribution servers 300, 310, 320, ... And the database server 200 are connected. Personal computers 900, 910, 920 ... Of clients are connected to the distribution server 300 via a telephone line. Further, a personal computer 800 of the administrator of the electronic bulletin board is connected to the server 200 in which the web page of the electronic bulletin board is stored via a telephone line.

【００３６】パーソナルコンピュータ９００から配信サ
ーバ３００に対して電子掲示板のウェブページへのアク
セス信号が送出されると、配信サーバ３００とデータベ
ースサーバ２００内の電子掲示板のウェブページとがネ
ットワーク１００を介して接続される。これにより、パ
ーソナルコンピュータ９００のディスプレイに電子掲示
板のウェブページが表示される。この後、クライアント
が電子掲示板への文書の掲載を希望する場合には、掲載
しようとする文書をパーソナルコンピュータ９００を用
いて作成し、作成した文書を電子掲示板のアドレスに送
る。作成された文書は、配信サーバ３００からネットワ
ーク１００を介してデータベースサーバ２００内の電子
掲示板のウェブページに到達する。これにより、クライ
アントの作成した文書が電子掲示板に書き込まれ、パー
ソナルコンピュータ９００のディスプレイには、書き込
み後の電子掲示板が表示される。なお、他のクライアン
トのパーソナルコンピュータ９１０，９２０・・・を用
いた場合も、上記パーソナルコンピュータ９００と同様
に、電子掲示板へのアクセスないし書き込みが可能であ
る。When an access signal to the electronic bulletin board web page is sent from the personal computer 900 to the distribution server 300, the distribution server 300 and the electronic bulletin board web page in the database server 200 are connected via the network 100. To be done. As a result, the web page of the electronic bulletin board is displayed on the display of the personal computer 900. After that, when the client wants to post the document on the electronic bulletin board, the document to be posted is created using the personal computer 900, and the created document is sent to the address of the electronic bulletin board. The created document reaches the web page of the electronic bulletin board in the database server 200 from the distribution server 300 via the network 100. As a result, the document created by the client is written on the electronic bulletin board, and the post-writing electronic bulletin board is displayed on the display of the personal computer 900. Even when the personal computers 910, 920, ... Of other clients are used, the electronic bulletin board can be accessed or written in the same manner as the personal computer 900.

【００３７】データベースサーバ２００の構造を図２に
示す。データベースサーバ２００は、ネットワーク１０
０とのデータのやり取りを制御するネットワークインタ
フェース（ＮＴ−Ｉ／Ｆ）２１０、処理を行なうＣＰＵ
２２０、処理プログラムや固定的なデータを記憶するＲ
ＯＭ２３０、ワークエリアとしてのＲＡＭ２４０、時間
を管理するタイマ２５０、後述する各種のデータを蓄積
するデータベース蓄積部２６０、辞書などを記憶してい
るハードディスク２７０等を備える。なお、データベー
ス蓄積部２６０は、実際には、ハードディスクなどの記
憶装置に格納されているが、ここでは、説明の都合上、
独立の装置として扱うものとする。The structure of the database server 200 is shown in FIG. The database server 200 is the network 10
Network interface (NT-I / F) 210 for controlling data exchange with 0, CPU for processing
220, R for storing processing programs and fixed data
An OM 230, a RAM 240 as a work area, a timer 250 for managing time, a database accumulating section 260 for accumulating various kinds of data described later, a hard disk 270 for storing a dictionary and the like are provided. Note that the database storage unit 260 is actually stored in a storage device such as a hard disk, but here, for convenience of explanation,
It shall be treated as an independent device.

【００３８】データベース蓄積部２６０には、電子掲示
板に書き込まれている多数の文書が蓄積されている。こ
れらの文書は、前述した往信−返信関係に基づいて整理
された状態で蓄積されている。具体的には、データベー
ス蓄積部２６０には、投稿文書やこの投稿に対する返信
文書，返信に対する返信文書に関する情報として、図３
に示すような文書の系列図表が格納されている。A large number of documents written on the electronic bulletin board are stored in the database storage unit 260. These documents are accumulated in a state in which they are organized based on the above-mentioned outgoing-return relationship. Specifically, the database storage unit 260 stores information about the posted document, a reply document to this post, and information about the reply document to the reply as shown in FIG.
A series chart of documents as shown in is stored.

【００３９】図３に示すように、系列図表には、各投稿
文書Ａ〜Ｚおよび返信文書に関する往信−返信関係がツ
リー構造で表されている。例えば、投稿文書Ａに対する
返信としては四つの返信文書Ａ−１〜４があり、返信文
書Ａ−１に対する返信文書としては二つの返信文書Ａ−
１−ａ〜ｂがある。なお、系列図表には、各投稿文書Ａ
〜Ｚおよび返信文書の近傍に、各文書が電子掲示板に書
き込まれた日付け，メッセージ番号，作成者名等が付記
されている（図示せず）。また、各投稿文書Ａ〜Ｚおよ
び返信文書の本文は、系列図表とリンクされてデータベ
ース蓄積部２６０に蓄積されている。As shown in FIG. 3, the series chart shows the tree-structure of the correspondence between the posted documents A to Z and the reply document-the reply. For example, there are four reply documents A-1 to A-4 as replies to the posted document A, and two reply documents A- to the reply document A-1.
There are 1-ab. In addition, in the sequence chart, each posted document A
The date when each document was written on the electronic bulletin board, the message number, the name of the creator, and the like are added to the vicinity of Z to the reply document (not shown). Further, the body text of each of the posted documents A to Z and the reply document is stored in the database storage unit 260 in association with the series chart.

【００４０】前述したスレッドの一例を、図３に二点鎖
線で示した。即ち、ツリー構造の最上層である第１層に
位置する一の投稿文書Ａから該投稿文書Ａが属するツリ
ー構造の最下層である第３層に位置する返信文書Ａ−１
−ａまでは、第２層に位置する返信文書Ａ−１を介して
一の往信−返信関係で接続されている。こうした往信−
返信関係で接続される全ての文書のまとまり（つまり、
投稿文書Ａ，返信文書Ａ−１，返信文書Ａ−１−ａ）が
スレッドＴＨとなる。図３の系列図表には、図３に表さ
れている限りにおいて、「投稿文書Ａ，返信文書Ａ−
１，返信文書Ａ−１−ａ」、「投稿文書Ａ，返信文書Ａ
−１，返信文書Ａ−１−ｂ」、「投稿文書Ａ，返信文書
Ａ−２」、「投稿文書Ａ，返信文書Ａ−３，返信文書Ａ
−３−ａ」、「投稿文書Ａ，返信文書Ａ−４」、「投稿
文書Ｂ，返信文書Ｂ−１，返信文書Ｂ−１−ａ」、「投
稿文書Ｚ，返信文書Ｚ−１，返信文書Ｚ−１−ａ」とい
う７個のスレッドＴＨが存在している。An example of the thread described above is shown by a chain double-dashed line in FIG. That is, from one posted document A located in the first layer which is the uppermost layer of the tree structure, to the reply document A-1 located in the third layer which is the lowermost layer of the tree structure to which the posted document A belongs.
Up to -a, they are connected in one outgoing-replying relationship via the reply document A-1 located in the second layer. Such communication-
A collection of all documents connected in a reply relationship (that is,
The posted document A, the reply document A-1, and the reply document A-1-a) form a thread TH. In the sequence chart of FIG. 3, as long as it is shown in FIG. 3, “posted document A, reply document A-
1, reply document A-1-a "," posted document A, reply document A "
-1, reply document A-1-b "," posted document A, reply document A-2 "," posted document A, reply document A-3, reply document A
-3-a "," Posted Document A, Reply Document A-4 "," Posted Document B, Reply Document B-1, Reply Document B-1-a "," Posted Document Z, Reply Document Z-1, Reply " There are seven threads TH named "Document Z-1-a".

【００４１】データベースサーバ２００のＲＯＭ２３０
内には、上記の系列図表を作成する処理（以下、系列図
表作成処理という）を記述したプログラムが格納されて
いる。このプログラムは、ＣＰＵ２２０によって実行さ
れることにより、多数の文書を文書群としてのスレッド
ごとに分類する分類手段として機能する。この系列図表
作成処理の内容および処理手順を系列図表作成処理ルー
チンとして図４に示す。本ルーチンは、投稿文書や返信
文書が書き込まれたときにＣＰＵ２２０が実行する処理
である。本ルーチンが起動されると、まず、書き込まれ
た文書の宛先とされた文書を系列図表中で特定する処理
を行なう（ステップＳ１００）。次に、特定された文書
の下層に、特定された文書と関連付けながら、書き込ま
れた文書の一部（例えば、表題等）を追加する処理を行
なう（ステップＳ１２０）。次に、系列図表中の投稿文
書や返信文書をスレッドごとに分類し直し（ステップＳ
１４０）、分類後の系列図表をデータベース蓄積部２６
０に更新して記憶して（ステップＳ１６０）、本ルーチ
ンを終了する。ROM 230 of database server 200
A program in which a process for creating the above-mentioned sequence chart (hereinafter referred to as a sequence diagram creation process) is stored therein. This program, when executed by the CPU 220, functions as a classifying unit that classifies a large number of documents for each thread as a document group. FIG. 4 shows the contents and processing procedure of this series chart creation processing as a series chart creation processing routine. This routine is a process executed by the CPU 220 when a posted document or a reply document is written. When this routine is activated, first, a process for specifying the document as the destination of the written document in the sequence chart is performed (step S100). Next, a process of adding a part (for example, a title) of the written document to the lower layer of the identified document in association with the identified document is performed (step S120). Next, the posted documents and reply documents in the sequence chart are reclassified for each thread (step S
140), the database storage unit 26
It is updated to 0 and stored (step S160), and this routine ends.

【００４２】データベースサーバ２００のＲＯＭ２３０
には、上記の系列図表作成処理の他に、電子掲示板に書
き込まれた投稿文書および返信文書の要約を作成し、こ
の作成された要約を系列図表と共にパーソナルコンピュ
ータ９００のディスプレイ上に表示する処理（以下、要
約作成・表示処理という）の内容が記述されたプログラ
ムが格納されている。この要約作成・表示処理の内容お
よび処理手順を要約作成・表示処理ルーチンとして図５
に示す。本ルーチンは、電子掲示板に書き込まれた文書
の要約を作成する旨の実行指示がなされたときにＣＰＵ
２２０が実行する処理である。ROM 230 of database server 200
In addition to the above sequence diagram creation process, a process of creating a summary of posted documents and reply documents written on the electronic bulletin board and displaying the created summary together with the sequence diagram on the display of the personal computer 900 ( Hereinafter, a program in which the content of the summary creation / display processing) is described is stored. The contents and processing procedure of this summary creation / display processing are shown in FIG. 5 as a summary creation / display processing routine.
Shown in. This routine is executed by the CPU when an execution instruction to create a summary of a document written on the electronic bulletin board is issued.
This is the process executed by 220.

【００４３】本ルーチンが起動されると、まず、スレ
ッド特定処理（ステップＳ２００）を実行する。スレッ
ド特定処理は、系列図表の中から要約を作成する対象と
なる一のスレッドＴＨを特定する処理である。以後、こ
の特定された一のスレッドＴＨを対象として、有効本
文特定処理（ステップＳ２１０）、重要文抽出処理
（ステップＳ２２０）、要約作成処理（ステップＳ２
４０）、表題作成処理（ステップＳ２６０）を実行す
る。これらの処理の詳細については後述する。これらの
ステップＳ２２０〜Ｓ２６０の処理が全てのスレッドＴ
Ｈについて終了したときに（ステップＳ２７０）、後述
する表示データ出力処理（ステップＳ２８０）を実行
して、本ルーチンを終了する。When this routine is activated, first, a thread specifying process (step S200) is executed. The thread specifying process is a process of specifying one thread TH from which a summary is created from the sequence chart. After that, for the specified one thread TH, the effective text specifying process (step S210), the important sentence extracting process (step S220), and the abstract creating process (step S2) are performed.
40), and a title creating process (step S260) is executed. Details of these processes will be described later. The processing of these steps S220 to S260 is performed for all threads T
When the process for H has been completed (step S270), a display data output process (step S280) described later is executed, and this routine is completed.

【００４４】以下、有効本文特定処理、重要文抽出
処理、要約作成処理、表題作成処理、表示データ
出力処理の内容について順に説明する。なお、重要文
抽出処理には全体評価後切出処理が含まれており、要
約作成処理には文標準化処理が含まれているが、これら
の処理についても重要文抽出処理、要約作成処理と
併せて説明する。The contents of the effective text specifying process, the important sentence extracting process, the abstract creating process, the title creating process, and the display data output process will be described below in order. Note that the important sentence extraction process includes the post-evaluation cutout process, and the summary creation process includes the sentence standardization process. These processes are also combined with the important sentence extraction process and the summary creation process. Explain.

【００４５】有効本文特定処理（図５のステップＳ２
１０）有効本文特定処理は、スレッド特定処理により特定され
たスレッドＴＨにつき、該スレッドＴＨ内に属する各文
書の本文から文書の要旨を構成し得ない表現を抽出し、
抽出された表現を除外した本文（以下、有効本文とい
う）を特定する処理である。具体的には、図３で二点鎖
線で囲ったスレッドＴＨの場合には、投稿文書Ａの本文
を読み出し、挨拶文や投稿者の署名等の文書の主題の特
定に関係の薄い表現を本文から除外し、削除後の本文を
有効本文とする。返信文書Ａ−１や返信文書Ａ−１−ａ
についても、本文を読み出して、投稿文書Ａと同様の処
理を行なう。但し、返信文書Ａ−１や返信文書Ａ−１−
ａの場合には、投稿文書Ａの本文を引用した箇所につい
ても本文から削除し、削除後の本文を有効本文とする。
こうした有効本文は、ＲＡＭ２４０の所定領域に一時的
に記憶される。Valid Body Specification Processing (Step S2 of FIG. 5)
10) The effective body identifying process extracts, for the thread TH identified by the thread identifying process, an expression that cannot form the gist of the document from the body of each document belonging to the thread TH,
This is a process for identifying a body text (excluding the valid body text) excluding the extracted expressions. Specifically, in the case of the thread TH surrounded by a chain double-dashed line in FIG. 3, the text of the posted document A is read, and an expression unrelated to the identification of the subject of the document such as a greeting or a signature of the poster is written. Exclude from the text and make the text after deletion the valid text. Reply document A-1 and reply document A-1-a
With respect to, the text is read and the same processing as that of the posted document A is performed. However, the reply document A-1 and the reply document A-1-
In the case of a, the part of the text of the posted document A that has been quoted is also deleted from the text, and the text after deletion is made the valid text.
Such valid text is temporarily stored in a predetermined area of the RAM 240.

【００４６】図３で二点鎖線で囲ったスレッドＴＨに属
する各文書（投稿文書Ａ，返信文書Ａ−１および返信文
書Ａ−１−ａ）の例文を図６に示す。投稿文書Ａは甲に
より作成されて電子掲示板に書き込まれた文書であり、
返信文書Ａ−１は、投稿文書Ａに対する返信として乙に
より作成され、電子掲示板に書き込まれた文書である。
返信文書Ａ−１−ａは返信文書Ａ−１に対する返信とし
て甲により作成され、電子掲示板に書き込まれた文書で
ある。図６に示すように、返信文書Ａ−１，返信文書Ａ
−１，返信文書Ａ−１−ａには、それぞれ４つの文，６
つの文，５つの文が含まれているが、上記の有効本文特
定処理の実行により、返信文書Ａ−１の第１文の「いつ
もお世話になります。」という定型的な挨拶文は削除さ
れ、残りの５つの文が有効本文とされる。なお、後の説
明をわかりうやすくするため、図６における各文の文頭
に、有効本文とされる文の文番号（１〜１４）を示して
いる。FIG. 6 shows an example sentence of each document (posted document A, reply document A-1 and reply document A-1-a) belonging to the thread TH surrounded by the chain double-dashed line in FIG. Posted document A is a document created by A and written on the electronic bulletin board,
The reply document A-1 is a document created by the second party as a reply to the posted document A and written in the electronic bulletin board.
The reply document A-1-a is a document created by A as a reply to the reply document A-1 and written on the electronic bulletin board. As shown in FIG. 6, reply document A-1, reply document A
-1, Reply document A-1-a has four sentences, 6 respectively.
One sentence and five sentences are included, but by executing the above-mentioned effective text identification process, the standard greeting sentence "I always appreciate your help." In the first sentence of reply document A-1 is deleted. , The remaining five sentences are regarded as valid texts. Note that the sentence numbers (1 to 14) of the sentences to be the effective text are shown at the beginning of each sentence in FIG. 6 for easy understanding of the description below.

【００４７】重要文抽出処理（図５のステップＳ２２
０）重要文抽出処理は、有効本文特定処理により特定された
各文書の有効本文から各文書に所定の相関を有する文を
重要文として抽出する処理である。本実施例では、重要
文抽出処理において、一のスレッドＴＨ内に属する全有
効本文によって表わされる内容を考慮し、各文書に所定
の相関を有する文として、全有効本文によって表わされ
る内容に最も近い意味の文を各文書の本文から切り出す
処理（以下、全体評価後切出処理という）を行なう。こ
の全体評価後切出処理の実行により切り出された文が重
要文として抽出される。Important sentence extraction processing (step S22 in FIG. 5)
0) The important sentence extraction process is a process of extracting, as an important sentence, a sentence having a predetermined correlation with each document from the effective body of each document specified by the effective body specification process. In the present embodiment, in the important sentence extraction process, the contents represented by all the valid texts belonging to one thread TH are considered, and the sentence having a predetermined correlation with each document is the closest to the content represented by the all valid texts. A process of cutting out a sentence of meaning from the body of each document (hereinafter referred to as cutout process after overall evaluation) is performed. The sentence cut out by the execution of the cutting process after the overall evaluation is extracted as an important sentence.

【００４８】重要文の抽出処理には、種々の手法が考え
られるが、この実施例では、次の手法を採用した。重要
文抽出処理の内容及び手順を、重要文抽出処理ルーチン
として図７に示す。本ルーチンは有効本文特定処理の終
了後に起動する。本ルーチンが起動されると、まず、電
子掲示板が有する各スレッドＴＨの種類を判別する処理
を行なう（ステップＳ３００）。スレッドＴＨの種類に
応じてスレッドＴＨに含まれる各文書からの重要文の抽
出手法を変えるためである。このスレッドの種類として
は、例えば、スレッド内の各文書により最初の投稿文書
の主題とは異なった主題に展開されているか否か（スレ
ッド内の文書の現実の内容）、Ｑ＆Ａ方式のスレッドか
否か（スレッドの形式）、スレッドがチャットのように
往信と返信が頻繁に繰り返される性質のものか否か（ス
レッドの性質，特質）、スレッド内の一つの文書の長さ
が長文若しくは短文のいずれの傾向にあるか（スレッド
内の文書の傾向）等を考えることができる。本実施例で
は、スレッドＴＨの種類を、質問型，複数話題型，会話
型の３つのうちのいずれかに判別することとしている。Various methods are conceivable for the process of extracting the important sentence, but the following method is adopted in this embodiment. The content and procedure of the important sentence extraction processing are shown in FIG. 7 as an important sentence extraction processing routine. This routine is started after the valid text specifying process is completed. When this routine is activated, first, a process of determining the type of each thread TH included in the electronic bulletin board is performed (step S300). This is because the method of extracting the important sentence from each document included in the thread TH is changed according to the type of the thread TH. The type of this thread includes, for example, whether or not each document in the thread is developed into a subject different from the subject of the first posted document (actual content of the document in the thread), whether or not it is a Q & A thread. Whether the thread has the property of frequently repeating incoming and outgoing responses such as chat (the property and characteristics of the thread), whether the length of one document in the thread is long or short. It is possible to consider whether there is a tendency (the tendency of documents in the thread). In the present embodiment, the type of the thread TH is determined to be one of three types: question type, multiple topic type, and conversation type.

【００４９】スレッドＴＨの種類の判別は、以下の要領
で行なうことができる。以下、図３で二点鎖線で囲った
スレッドＴＨを例として説明する。まず、主題を提供す
る最初の文書である投稿文書Ａの有効本文を取り出し、
有効本文内に「〜の質問」や「〜がわからない」等の表
現がある場合には、このスレッドＴＨは質問型であると
判別される。質問型にあてはまらない場合には、スレッ
ドＴＨ内の全有効本文（投稿文書Ａ，返信文書Ａ−１お
よび返信文書Ａ−１−ａの有効本文）を取り出し、全有
効本文の一定の範囲内に複数の主題があるか否かを判定
する。複数の主題がある場合には、このスレッドＴＨは
複数話題型であると判別され、複数の主題がない場合に
は、このスレッドＴＨは会話型であると判別される。な
お、複数の主題の有無の判定は、全有効本文をＴＦＩＤ
Ｆ法を用いてベクトル表現に変換することにより行なう
ことができるが、ここでは詳しい説明は省略する。The type of thread TH can be determined in the following manner. Hereinafter, the thread TH surrounded by a chain double-dashed line in FIG. 3 will be described as an example. First, take out the effective text of posted document A, which is the first document that provides the subject,
If the valid text includes expressions such as "question for" and "don't know", this thread TH is determined to be a question type. If it does not apply to the question type, all valid texts in the thread TH (valid texts of posted document A, reply document A-1, and reply document A-1-a) are taken out, and within a certain range of all valid texts. Determine if there are multiple subjects. If there are a plurality of subjects, the thread TH is determined to be a multi-topic type, and if there are no plurality of subjects, the thread TH is determined to be a conversational type. In addition, to determine the presence or absence of multiple subjects, all valid texts are
This can be done by converting into a vector expression using the F method, but a detailed description is omitted here.

【００５０】こうしてスレッドＴＨの種類を判別した
後、スレッドＴＨの種類に応じて重要文を抽出する処理
を行なう（ステップＳ３２０）。この処理の詳細につ
き、以下、質問型，複数話題型，会話型に分けて説明す
る。After determining the type of thread TH in this way, a process of extracting an important sentence is performed according to the type of thread TH (step S320). The details of this processing will be described below separately for question type, multiple topic type, and conversation type.

【００５１】スレッドＴＨの種類が質問型である場合の
重要文の抽出処理手法を図８のフローチャートに示す。
質問型である場合には、まず、主題を提供する最初の文
書であ（図３の例では投稿文書Ａ）の有効本文から「〜
の質問」や「〜がわからない」等の表現を含む文を抽出
し、これを最初の文書の重要文とする（ステップＳ４０
０、Ｓ４１０）。次に、最初の文書と同じスレッドＴＨ
内に最初の文書と同じ者により作成された他の文書（図
３の例では返信文書Ａ−１−ａ）を探し、他の文書の有
効本文から「うまくいった」や「解決した」，「わかり
ました」等の表現を含む文を抽出する。抽出された文
が、最初の文書と同じ者により作成された他の文書の重
要文となる（ステップＳ４２０，Ｓ４３０）。続いて、
上記他の文書の有効本文の「うまくいった」等の表現を
含む文の前後において質問に対する解決策が含まれてい
る文を取り出し、この文に含まれる単語を解決策を示す
単語として切り出す。次に、最初の文書とは異なる者に
より作成された他人の文書（図３の例では返信文書Ａ−
１）の有効本文から上記解決策を示す単語が含まれてい
る文を抽出する。抽出された文が、他人の文書の重要文
となる（ステップＳ４４０〜Ｓ４６０）。A flowchart of FIG. 8 shows a method of extracting an important sentence when the type of the thread TH is a question type.
In the case of the question type, first, from the effective text of the first document (posted document A in the example of FIG. 3) that provides the subject, “...
Of a sentence including an expression such as "question" or "I don't know" is used as the important sentence of the first document (step S40).
0, S410). Then the same thread TH as the first document
Search for another document (reply document A-1-a in the example of FIG. 3) created by the same person as the first document, and "success" or "solved" from the valid text of the other document. Extract sentences that include expressions such as "I understand." The extracted sentence becomes an important sentence of another document created by the same person as the first document (steps S420 and S430). continue,
A sentence including a solution to the question is extracted before and after a sentence including an expression such as “successful” in the valid text of the other document, and the word included in this sentence is cut out as a word indicating the solution. Next, another person's document created by a person different from the first document (in the example of FIG. 3, the reply document A-
A sentence including a word indicating the above solution is extracted from the valid text of 1). The extracted sentence becomes an important sentence of another person's document (steps S440 to S460).

【００５２】このように、本実施例では、スレッドＴＨ
の種類が質問型である場合には、スレッドＴＨ内の各文
書から該文書に所定の相関を有する文（本実施例では、
全有効本文によって表わされる内容に最も近い意味の
文）を切り出す基準を、該文書群に属する文書の種類
（図３の例では投稿文書Ａ、返信文書Ａ−１、返信文書
Ａ−１−ａ）に応じて異ならせている。As described above, in this embodiment, the thread TH
Is a question type, a sentence having a predetermined correlation with each document in the thread TH (in the present embodiment,
The criterion for cutting out a sentence having a meaning closest to the content represented by the entire valid text is the type of document belonging to the document group (posted document A, reply document A-1, reply document A-1-a in the example of FIG. 3). ) Differently.

【００５３】スレッドＴＨの種類が複数話題型および会
話型である場合の重要文の抽出処理手法を図９のフロー
チャートに示す。複数話題型および会話型の場合には、
まず、一のスレッドＴＨ内に属する全文書の有効本文
（以下、全有効本文という）を取り出す処理を行なう
（ステップＳ５００）。FIG. 9 is a flowchart showing an important sentence extraction processing method in the case where the types of threads TH are a multi-topic type and a conversation type. In case of multiple topics and conversations,
First, a process of extracting valid texts of all documents (hereinafter referred to as all valid texts) belonging to one thread TH is performed (step S500).

【００５４】本実施例では、図６に示した例文を有する
スレッドＴＨを会話型とみなし、図９に示す重要文の抽
出処理を適用している。図６に示す例文の場合には、投
稿文書Ａの有効本文（図６に示す文番号１〜４の文），
返信文書Ａ−１の有効本文（図６に示す文番号５〜９の
文）および返信文書Ａ−１−ａの有効本文（図６に示す
文番号１０〜１４の文）の計１４個の文が、全有効本文
として取り出される。In this embodiment, the thread TH having the example sentence shown in FIG. 6 is regarded as conversational, and the important sentence extraction process shown in FIG. 9 is applied. In the case of the example sentence shown in FIG. 6, the effective text of the posted document A (sentences of sentence numbers 1 to 4 shown in FIG. 6),
A total of 14 valid texts of the reply document A-1 (sentences of sentence numbers 5 to 9 shown in FIG. 6) and valid texts of the reply document A-1-a (sentences of sentence numbers 10 to 14 shown in FIG. 6). The sentence is retrieved as the entire valid body.

【００５５】なお、複数話題型の場合には、ステップＳ
５００の処理において、全有効本文のうち、ある一の主
題に属する本文と他の主題に属する本文とが別々に取り
出され、取り出されたそれぞれの本文を全有効本文とし
てステップＳ５１０以下の処理を行なう。In the case of plural topics, step S
In the processing of 500, out of all the valid texts, the texts belonging to a certain subject and the texts belonging to other themes are extracted separately, and the extracted texts are regarded as the entire valid texts, and the processing from step S510 is performed. .

【００５６】次に、全有効本文から名詞である単語を抽
出し、抽出された単語が全有効本文を構成する各文にお
いて偏って頻出する程度を求める処理を行なう（ステッ
プＳ５１０）。Next, a word that is a noun is extracted from all the valid texts, and the degree to which the extracted words appear frequently in a biased manner in each of the sentences forming the entire valid text is performed (step S510).

【００５７】単語が偏って頻出する程度は、その単語
が、全有効本文内で出現する回数を全有効本文を構成す
る文の数により正規化した値によって評価することがで
きる。これは、例えばＴＦＩＤＦとして知られている。
ＴＦＩＤＦは、次の式で定義される。なお、以下の式
で、ｄｂは、対象となっているひとまとまりのテキスト
データ（ここでは全有効本文）であり、ｄは、全有効本
文を構成している各文、ｔはこのテキストに含まれる単
語、とする。The degree of frequent occurrence of a word can be evaluated by a value obtained by normalizing the number of times the word appears in the entire valid text by the number of sentences forming the entire valid text. This is known as, for example, TFIDF.
TFIDF is defined by the following formula. In the following equation, db is a set of target text data (all valid texts in this case), d is each sentence that constitutes all valid texts, and t is included in this text. The word is

【００５８】ＴＦＩＤＦ（ｄ，ｔ）＝ＴＦ（ｄ，ｔ）×Ｉｄｆ（ｔ） …（１）但し：ＴＦ（ｄ，ｔ）は、各文ｄ内において単語ｔが出
現する回数、Ｉｄｆは、次式（２）による。Ｉｄｆ（ｔ）＝ＬＯＧｅ｛ＤＢ（ｄｂ）／ｆ（ｔ，ｄｂ）｝ …（２）ここで、ＤＢ（ｄｂ）は、全有効本文を構成する文の
数、従って、図６に示した例では、値１４、ｆ（ｔ，ｄ
ｂ）は、全有効本文において、単語ｔが出現する文の
数、である。TFIDF (d, t) = TF (d, t) × Idf (t) (1) where: TF (d, t) is the number of times the word t appears in each sentence d, and Idf is According to the following equation (2). Idf (t) = LOGe {DB (db) / f (t, db)} (2) Here, DB (db) is the number of sentences forming the entire valid text, and thus the example shown in FIG. Then, the value 14, f (t, d
b) is the number of sentences in which the word t appears in all valid texts.

【００５９】図１０は、図６に示した例文において、該
例文に含まれている名詞である単語の出現頻度を数えた
結果を示す。こうして名詞である単語の出現頻度を求め
た後、本実施例では、出現頻度が２以上の単語につき、
上式に基づいてＴＦＩＤＦ値を求めた。こうして求めら
れた各単語のＴＦＩＤＦ値を図１１に示す。FIG. 10 shows the result of counting the frequency of appearance of words that are nouns included in the example sentence shown in FIG. After the appearance frequency of the word that is a noun is obtained in this way, in the present embodiment, for words having an appearance frequency of 2 or more,
The TFIDF value was calculated based on the above equation. The TFIDF value of each word thus obtained is shown in FIG.

【００６０】続いて、各文を構成する名詞である単語の
ＴＦＩＤＦを合算する処理を行なう（ステップＳ５２
０）。この合算処理の結果を図１１の合算値の欄に示し
た。Then, the process of adding up the TFIDF of the words which are the nouns forming the sentences is performed (step S52).
0). The result of this summing process is shown in the summed value column of FIG.

【００６１】次に、スレッドＴＨに含まれる各文書の各
有効本文ごとに各文の合算値の大小を判定し、合算値が
最も大きい２つの文を抽出する処理を行なう（ステップ
Ｓ５３０）。図１１に星印付きで示すように、図６の例
文の場合には、投稿文書Ａからは文番号１，２の文が、
返信文書Ａ−１からは文番号７，８の文が、および返信
文書Ａ−１−ａからは文番号１１，１４の文が、それぞ
れ抽出される。Next, the size of the combined value of each sentence is determined for each valid text of each document included in the thread TH, and the process of extracting the two sentences having the largest combined value is performed (step S530). As shown by a star mark in FIG. 11, in the case of the example sentence of FIG. 6, the sentences of the sentence numbers 1 and 2 from the posted document A are
The sentences with sentence numbers 7 and 8 are extracted from the reply document A-1, and the sentences with sentence numbers 11 and 14 are extracted from the reply document A-1-a.

【００６２】以上のステップＳ５１０〜Ｓ５３０までの
処理が、前述した全体評価後切出処理に相当する。この
全体評価後切出処理によれば、各有効本文内における出
現頻度が高くなければ、単語のＴＦＩＤＦ値は高くなら
ない。反面、全有効本文にまんべんなく出現するような
単語（例えば、「こと」や「場合」）の場合には、ｆ
（ｔ，ｄｂ）が大きな値となるために、ＴＦＩＤＦ値は
結局小さな値となる。つまり、各有効本文を構成する各
文において高い頻度で出現する単語が存在する文ほど、
ＴＦＩＤＦ値は大きな値となり、しかもそういう単語が
多いほど、合算値は高い値となるのである。The above-mentioned steps S510 to S530 correspond to the above-mentioned post-evaluation cut-out processing. According to the cutout processing after the overall evaluation, the TFIDF value of the word does not become high unless the appearance frequency in each valid text is high. On the other hand, in the case of words that appear evenly in all valid texts (for example, "koto" and "case"), f
Since (t, db) has a large value, the TFIDF value eventually has a small value. In other words, sentences with more frequently appearing words in each sentence that make up each valid text,
The TFIDF value is a large value, and the more such words are, the higher the sum value is.

【００６３】以上のように、スレッドＴＨに含まれる各
文書から２つの重要文が抽出される。この後、抽出され
た重要文をＲＡＭ２４０の所定領域に記憶して（図７の
ステップＳ３４０）本ルーチンを終了し、要約作成処理
（図５のステップＳ２４０）に移る。As described above, two important sentences are extracted from each document included in the thread TH. Thereafter, the extracted important sentence is stored in a predetermined area of the RAM 240 (step S340 in FIG. 7), the present routine is terminated, and the process proceeds to the abstract creating process (step S240 in FIG. 5).

【００６４】要約作成処理（図５のステップＳ２４
０）要約作成処理次に、重要文抽出処理により抽出された重
要文に基づいてスレッドＴＨの要約を作成する処理であ
る。図３の例では、投稿文書Ａ，返信文書Ａ−１および
返信文書Ａ−１−ａからなるスレッドＴＨの要約が作成
される。この要約の作成は、具体的には、スレッド内の
各文書から抽出された重要文を所定のアルゴリズムに基
づいて組み合わせることにより作成される。例えば、図
３に二点鎖線で囲ったスレッドＴＨが質問型であり、投
稿文書Ａが質問を含み，返信文書Ａ−１が解決策の提案
を含み、返信文書Ａ−１−ａが解決したという結果を含
む場合には、投稿文書Ａから抽出された重要文と返信文
書Ａ−１−ａから抽出された重要文とを組み合わせるウ
エイトを大きくして、要約を作成すればよい。「…とい
う疑問点が解決した」という文は、一般に、閲覧者に興
味を生じさせると考えられるからである。Summary creation processing (step S24 in FIG. 5)
0) Summary Creation Process Next, a process of creating a summary of the thread TH based on the important sentence extracted by the important sentence extraction process. In the example of FIG. 3, a summary of the thread TH including the posted document A, the reply document A-1, and the reply document A-1-a is created. Specifically, this summary is created by combining important sentences extracted from each document in the thread based on a predetermined algorithm. For example, a thread TH surrounded by a chain double-dashed line in FIG. 3 is a question type, a posted document A includes a question, a reply document A-1 includes a solution proposal, and a reply document A-1-a is solved. When the result is included, the weight combining the important sentence extracted from the posted document A and the important sentence extracted from the reply document A-1-a may be increased to create the summary. This is because the sentence "the question" ... has been solved "is generally considered to cause interest to the viewer.

【００６５】本実施例では、重要文に重要文抽出処理に
より抽出された重要文を標準的な表現に書き換える処理
（以下、文標準化処理という）を行なうことにより、要
約を作成している。具体的には、文標準化処理におい
て、冗長な表現の短文化や自立語の他の自立語への置
換，付属語の用法の訂正，方言から標準語への置換，予
め定めた文字への置き換え，表記のゆれの統一等の処理
を行なう。標準化の処理には大別すると、文字の標準
化、表記のゆれの統一、自立語の統一などを考えること
ができる。これらの処理については説明は省略するが、
大まかな例示を挙げると以下の通りである。In this embodiment, a summary is created by performing a process of rewriting an important sentence extracted by the important sentence extraction process into a standard expression (hereinafter referred to as a sentence standardization process). Specifically, in sentence standardization processing, the short culture of redundant expressions and replacement of independent words with other independent words, correction of usage of adjuncts, replacement of dialects with standard words, replacement with predetermined characters , Performs processing such as unification of notation fluctuations. The standardization process can be broadly divided into standardization of characters, unification of writing variations, unification of independent words, etc. Description of these processes is omitted,
A rough example is as follows.

【００６６】文字の標準化を例示すると、（Ａ）括弧：『』と「」の置き換えを行なうなど、（Ｂ）引用符：“”と””の置き換えを行なうなど、（Ｃ）一般記号：種々の記号（例えば「：，？！」な
ど）について、半角／全角の置き換えを行なうなど、（Ｄ）カタカナや英数字：全角／半角や大文字／小文字
の置き換えを行なうなど、（Ｅ）句点・読点：句点、読点を「、」「。」に統一す
るなど、（Ｆ）名前の繋文字：「クイーン＝エリザベス」を「ク
イーン・エリザベス」に置き換えるなど、がある。Examples of standardization of characters include (A) parentheses: "" and "" are replaced, (B) quotation marks: "" and "" are replaced, (C) General symbols: various Symbols (eg, ":,?!") Are replaced with half-width / full-width characters, (D) Katakana and alphanumeric characters: Full-width / half-width characters and uppercase / lowercase letters are replaced, (E) Punctuation / punctuation : Unify punctuation and reading marks into "," and ".". (F) Concatenation of names: "Queen Elizabeth" is replaced with "Queen Elizabeth".

【００６７】表記のゆれとは、日本語における表記の曖
昧さ、許容幅を言い、例えば、長音記号のゆれ：例、ウィンドウズ、ウィンドーズ、送り仮名のゆれ：例、売上げ、売り上げ、拗音表記のゆれ：例、ウィザード、ウイザード、複合語のかな表記のゆれ：例、売り上げ、売りあげ、外来語表記のゆれ：エンゼル、エンジェル、繰り返し文字のゆれ：例、正正堂堂、正々堂々、など
を挙げることができる。The fluctuation of the notation means the ambiguity of the notation in Japanese, and the allowable width. For example, the fluctuation of the long-sound mark: eg, Windows, Windows, the fluctuation of the syllabary: eg, the sales, the sales, the fluctuation of the notation : Examples, wizards, wizards, compound word kana fluctuations: eg, sales, sales, foreign word fluctuations: angel, angel, repeated character fluctuations: eg, Shoshodo, Seisodo, etc. it can.

【００６８】更に、自立語の統一処理としては、（イ）修飾語：すごく、最高の、一番の、高い、など（ロ）名詞：パーソナルコンピュータ、パソコン、Ｐ
Ｃ、など（ハ）動詞：知らせる、連絡する、通知する、などなどを例示することができる。Further, as a unified process of independent words, (a) modifier: very, best, highest, high, etc. (b) noun: personal computer, personal computer, P
C, etc. (C) Verb: Inform, contact, notify, etc. can be illustrated.

【００６９】こうした処理を行なうための基準データや
ルールは、データベースサーバ２００のハードディスク
２７０内の辞書に格納されている。予め定めた文字への
置き換えを例にとると、「ＰＣ」といった単語は「パソ
コン」という単語に置き換えられる。これにより、一の
スレッドＴＨ内における文書や重要文の用語を統一する
ことができる。Reference data and rules for performing such processing are stored in the dictionary in the hard disk 270 of the database server 200. Taking the replacement with a predetermined character as an example, the word “PC” is replaced with the word “PC”. As a result, the terms of documents and important sentences in one thread TH can be unified.

【００７０】表題作成処理（図５のステップＳ２６
０）表題作成処理は、重要文抽出処理により抽出された重要
文に基づいてスレッドの表題を作成する処理である。図
３の例では、投稿文書Ａ，返信文書Ａ−１および返信文
書Ａ−１−ａからなるスレッドＴＨの表題が作成され
る。具体的には、重要文の文法情報を解析し、必要な単
語を切り出して付属語を補完することにより、表題を作
成する。Title creation processing (step S26 in FIG. 5)
0) The title creating process is a process for creating a thread title based on the important sentence extracted by the important sentence extracting process. In the example of FIG. 3, the title of the thread TH including the posted document A, the reply document A-1, and the reply document A-1-a is created. Specifically, the grammatical information of the important sentence is analyzed, necessary words are cut out, and supplementary words are complemented to create a title.

【００７１】表示データ出力処理（図５のステップＳ
２８０）表示データ出力処理は、上記作成された要約，表題およ
び系列図表を表示データとして出力する処理である。こ
の処理は、図５のステップＳ２７０の処理において電子
掲示板の全てのスレッドＴＨについて要約および表題が
作成されたと判断されたときに実行される。Display data output process (step S in FIG. 5)
280) The display data output process is a process of outputting the created summary, title, and series chart as display data. This process is executed when it is determined in the process of step S270 of FIG. 5 that the abstract and the title have been created for all the threads TH of the electronic bulletin board.

【００７２】表示データ出力処理において出力されるデ
ータイメージの一例を図１２に示す。図１２では、図３
に示した系列図表に基づくデータイメージを表してい
る。このデータイメージは、表示データがデータベース
サーバ２００からネットワーク１００を介してクライア
ントのパーソナルコンピュータ９００に到達したとき
に、パーソナルコンピュータ９００のディスプレイのプ
レビュー画面に初期表示される内容である。FIG. 12 shows an example of a data image output in the display data output process. In FIG. 12, FIG.
A data image based on the series chart shown in FIG. This data image is the content initially displayed on the preview screen of the display of the personal computer 900 when the display data reaches the client personal computer 900 from the database server 200 via the network 100.

【００７３】図１２に示すように、プレビュー画面に
は、各スレッドごとに要約および表題が表示されてい
る。スレッド名は、系列図表の最下層に位置する返信文
書に基づいて付けられており、例えば、図３で二点鎖線
で囲ったスレッドＴＨのスレッド名は、最下層の返信文
書名である「Ａ−１−ａ」とされている。なお、本実施
例では、投稿日（投稿文書が書き込まれた日）、更新日
（返信文書が書き込まれた最近の日）、階層数（スレッ
ド内における返信文書の数）が併せて表示されるので、
これらの表示を話題の選択に関する参考資料とすること
ができる。As shown in FIG. 12, the preview screen displays a summary and a title for each thread. The thread name is given based on the reply document located in the lowest layer of the sequence chart. For example, the thread name of the thread TH surrounded by the chain double-dashed line in FIG. 3 is the reply document name of the lowest layer "A. -1-a ". In this embodiment, the posting date (the date when the posted document was written), the update date (the latest date when the reply document was written), and the number of layers (the number of reply documents in the thread) are also displayed. So
These displays can be used as reference materials for selecting topics.

【００７４】各スレッドについての表題および要約は時
系列順に一覧表示される。本実施例では、更新日が新し
いスレッドから順に表示している。図１２で言えば、ス
レッド名が「Ｂ−１−ａ」のものが最も最近に更新され
たスレッドである。これにより、最新の主題や話題に容
易にアクセスすることができる。The titles and summaries for each thread are listed in chronological order. In this embodiment, the update dates are displayed in order from the newest thread. In FIG. 12, the thread having the thread name “B-1-a” is the most recently updated thread. This allows you to easily access the latest subjects and topics.

【００７５】なお、図１２に示すプレビュー画面におい
て、表題や要約等の他、各スレッドに属する投稿文書や
返信文書の作成者の名前を表示することとしてもよい。
こうすれば、各作成者による発言内容の変化を把握しや
すくなる。Note that the preview screen shown in FIG. 12 may display the names of the creators of posted documents and reply documents belonging to each thread, in addition to the title and summary.
This makes it easier to understand changes in the content of comments made by each creator.

【００７６】クライアントは、プレビュー画面において
表題や要約を参照し、興味のある話題のスレッドを選択
することができる。こうしてスレッドが選択されると、
選択されたスレッド内に属する全ての文書情報が、文書
の往信−返信関係と共に、一覧表示画面に表示される。
スレッド名が「Ａ−１−ａ」のスレッドＴＨが選択され
たときに一覧表示画面に表示される内容を図１３に示
す。この一覧表示画面により、興味のある話題の内容を
詳細に確認することができる。The client can select the thread of the topic of interest by referring to the title and the summary on the preview screen. When a thread is selected in this way,
All document information belonging to the selected thread is displayed on the list display screen together with the document's outgoing-reply relationship.
FIG. 13 shows the contents displayed on the list display screen when the thread TH having the thread name “A-1-a” is selected. With this list display screen, the details of the topic of interest can be confirmed in detail.

【００７７】以上説明した本実施例の掲示板表示システ
ムＫＳは、往信−返信関係を有する投稿文書，投稿に対
する返信文書，返信に対する返信文書から構成されたス
レッドＴＨに関し、スレッドＴＨに属する文書全体によ
って表わされる内容を考慮して、スレッドＴＨに属する
各文書から文書全体の内容に最も近い意味の文を重要文
として抽出し、抽出された重要文に基づいてスレッドＴ
Ｈの要約を作成し、作成された要約を画面上に表示す
る。従って、スレッドＴＨごとの話題の内容を迅速かつ
的確に把握することが可能となり、話題の取捨選択を迅
速かつ容易に行なうことができる。The bulletin board display system KS of the present embodiment described above relates to the thread TH composed of the posted document having the outgoing-reply relationship, the reply document for the post, and the reply document for the reply, and is represented by the entire document belonging to the thread TH. In consideration of the contents to be read, a sentence having a meaning closest to the contents of the entire document is extracted as an important sentence from each document belonging to the thread TH, and the thread T
Create a summary of H and display the created summary on the screen. Therefore, the content of the topic for each thread TH can be grasped quickly and accurately, and the selection of the topic can be performed quickly and easily.

【００７８】本実施例の掲示板表示システムＫＳは、ス
レッドＴＨの種類を判別し、判別されたスレッドＴＨの
種類に応じて重要文を抽出し、この重要文を表示する。
従って、主題の展開や話題の内容をより正確に把握する
ことができる。The bulletin board display system KS of this embodiment discriminates the type of the thread TH, extracts the important sentence according to the discriminated type of the thread TH, and displays the important sentence.
Therefore, it is possible to more accurately grasp the development of the subject and the content of the topic.

【００７９】本実施例の掲示板表示システムＫＳは、抽
出された重要文に文標準化手段を適用して、重要文を標
準的な表現に書き換え、要約を作成する。こうすれば、
文書作成者の文書作成に関する技量に拘らず均一なレベ
ルの文がスレッドの要旨として表示されるので、文書群
ごとの話題の内容をより一層把握しやすくなる。例え
ば、冗長な表現を有する重要文を短文化した場合には、
限られた表示領域において内容の把握しやすい文を表示
することができる。The bulletin board display system KS of the present embodiment applies sentence standardization means to the extracted important sentences, rewrites the important sentences into a standard expression, and creates a summary. This way
Sentences of a uniform level are displayed as the gist of the thread regardless of the skill of the document creator in creating the document, which makes it easier to understand the content of the topic for each document group. For example, when an important sentence with a redundant expression is short-cultured,
It is possible to display a sentence whose content is easy to understand in a limited display area.

【００８０】本実施例の掲示板表示システムＫＳは、各
スレッドＴＨに属する文書から文書の要旨を構成し得な
い表現を除外した有効本文に基づいて重要文の抽出また
は要約の作成を行なう。従って、重要文の抽出または要
約の作成を効率的かつ精度よく行なうことができる。The bulletin board display system KS of the present embodiment extracts important sentences or creates summaries from the documents belonging to each thread TH, excluding the expressions that cannot constitute the gist of the documents. Therefore, it is possible to efficiently and accurately extract important sentences or create abstracts.

【００８１】本実施例の掲示板表示システムＫＳは、重
要文の抽出結果を利用してスレッドＴＨの表題を作成
し、スレッドＴＨの要約と共に表示する。従って、作成
者によって付けられた各文書の表題を羅列する場合と比
較して、各スレッドＴＨの話題の種類を把握しやすくな
る。The bulletin board display system KS of this embodiment creates the title of the thread TH by using the extraction result of the important sentence and displays it together with the summary of the thread TH. Therefore, as compared with the case where the titles of the respective documents given by the creator are listed, it becomes easier to understand the type of topic of each thread TH.

【００８２】（２）第２実施例：往信−返信関係を有す
る文書には、電子掲示板への書き込み文書以外に、発信
者と特定の受領者との間で文書を送り合う電子メールが
ある。このような電子メールのまとまりに対して重要文
を抽出し、要約を作成する実施例をメール表示システム
ＭＳとして説明する。図１４は、本発明の第２実施例で
あるメール表示システムＭＳの概略構成を示す説明図で
ある。図１４に示すように、メール表示システムＭＳ
は、掲示板表示システムＫＳとほぼ共通の構成を備えて
いるが、データベースサーバ２００を備えない点で掲示
板表示システムＫＳと異なる。なお、図１４では、この
共通の構成につき、符号の上二桁を図１と同じ数字を用
いて表わしている。(2) Second Embodiment: The documents having the outgoing-return relationship include not only the documents written on the electronic bulletin board but also electronic mails for sending and receiving documents between the sender and a specific recipient. An embodiment will be described as a mail display system MS in which an important sentence is extracted from such a group of electronic mails and a summary is created. FIG. 14 is an explanatory diagram showing a schematic configuration of the mail display system MS which is the second embodiment of the present invention. As shown in FIG. 14, the mail display system MS
Has substantially the same configuration as the bulletin board display system KS, but differs from the bulletin board display system KS in that the database server 200 is not provided. Note that, in FIG. 14, the upper two digits of the reference numerals are represented by using the same numerals as those in FIG. 1 for this common configuration.

【００８３】配信サーバ３０，３１のＲＯＭには、特定
の発信者と特定の受領者との間で送受信されたメールの
まとまり（以下、メール束という）を受信ないし送信日
時順に整理して時系列図表を作成する処理を記述したプ
ログラムが格納されている。このプログラムが実行され
ると、図１５に示すような時系列図表が作成される。図
１５に示すように、Ｘ氏からＹ氏に送信したメールＸ１
を受け取ったＹ氏は、Ｘ氏に対してメールＹ１を送信
し、このメールＹ１を受け取ったＸ氏は、Ｙ氏に対して
メールＸ２を送信している。図１５では、こうしたＸ氏
とＹ氏との間のメールの送受信が５回続いている。これ
により、Ｘ氏とＹ氏との間でやりとりされたメールは、
「メールＸ１→メールＹ１→メールＸ２→メールＹ１→
メールＸ３→メールＹ３→メールＸ４→メールＹ４→メ
ールＸ５→メールＹ５」というように時系列順に連鎖し
ている。このように連鎖する合計１０通のメールがメー
ル束ＭＢとなる。なお、時系列図表中の各メールの本文
は、時系列図表とリンクされて配信サーバ３０，３１内
のハードディスクに蓄積されている。In the ROMs of the distribution servers 30 and 31, a group of mails (hereinafter, referred to as a mail bundle) sent and received between a specific sender and a specific recipient is arranged in order of reception or transmission date and time series. A program that describes the process of creating a chart is stored. When this program is executed, a time series chart as shown in FIG. 15 is created. As shown in FIG. 15, mail X1 sent from Mr. X to Mr. Y
Mr. Y who has received the mail has transmitted the mail Y1 to Mr. X, and Mr. X who has received this mail Y1 has transmitted the mail X2 to Mr. Y. In FIG. 15, such mail transmission / reception between Mr. X and Mr. Y continues five times. As a result, the mail exchanged between Mr. X and Mr. Y is
"Mail X1 → Email Y1 → Email X2 → Email Y1 →
Mail X3 → mail Y3 → mail X4 → mail Y4 → mail X5 → mail Y5 are linked in chronological order. A total of 10 mails that are chained in this way becomes a mail bundle MB. The text of each mail in the time-series chart is linked to the time-series chart and is stored in the hard disk in the distribution servers 30 and 31.

【００８４】配信サーバ３０，３１のＲＯＭには、上記
の第１実施例と同様の、要約作成・表示処理の内容が記
述されたプログラムが格納されている。このプログラム
が配信サーバ３０，３１のＣＰＵの命令によって実行さ
れることにより、メール束に対して、第１実施例におけ
るスレッドＴＨと同様に、スレッド特定処理、有効本
文特定処理、重要文抽出処理、要約作成処理、表
題作成処理、表示データ出力処理と同様の処理が行な
われる。この結果、メール束に属するメール全体によっ
て表わされる内容を考慮して、メール束に属する各メー
ルからメール全体の内容に最も近い意味の文が重要文と
して抽出され、抽出された重要文に基づいてメール束の
要約が作成される。こうしたメール束の要約は、Ｘ氏の
パーソナルコンピュータ９０とＹ氏のパーソナルコンピ
ュータ９１の双方のプレビュー画面上に表示される。従
って、クライアントは、メール束ごとの話題の内容を迅
速かつ的確に把握することが可能となり、メールの取捨
選択を迅速かつ容易に行なうことができる。The ROMs of the distribution servers 30 and 31 store the programs in which the contents of the summary creating / displaying process are described, which is the same as in the first embodiment. By executing this program according to the instructions of the CPUs of the distribution servers 30 and 31, a thread identification process, an effective text identification process, an important sentence extraction process are performed on the mail bundle, similar to the thread TH in the first embodiment. The same processes as the summary creating process, the title creating process, and the display data output process are performed. As a result, considering the content represented by the entire email belonging to the email bundle, the sentences with the meanings closest to the content of the entire email are extracted as important sentences from each email belonging to the email bundle, and based on the extracted important sentences. A bunch of mail bundles is created. The summary of the mail bundle is displayed on the preview screens of both Mr. X's personal computer 90 and Mr. Y's personal computer 91. Therefore, the client can quickly and accurately grasp the content of the topic for each mail bundle, and can quickly and easily select mails.

【００８５】なお、上記の実施例では、メールの本文の
表示を例として取り上げたが、こうした表示の手法は、
メール本文の表示に限定されるものではなく、複数の文
から構成されたデータについても適用することができ
る。例えば、論文データベースの抄録文を対象として、
抄録文の中から更に重要文を抽出し、これを論文の表題
と共に表示するといった態様で実現することができる。
あるいは電子カルテの記載や、電子的に配信される新聞
などの記事から重要文を抽出して表示するといった実現
形態も存在する。抽出した重要文は、モニタに表示する
だけでなく、モニタへの表示に代えて、あるいは表示と
共に、音声により読み上げることも好適である。もとよ
り、プリンタなどで印字することも差し支えない。In the above embodiment, the display of the body of the mail is taken as an example, but the display method is as follows.
The present invention is not limited to the display of the mail text, but can be applied to data composed of a plurality of sentences. For example, targeting abstracts in a thesis database,
This can be realized by extracting more important sentences from the abstracts and displaying them together with the title of the paper.
Alternatively, there is a realization form in which an electronic medical record is described or an important sentence is extracted and displayed from an electronically delivered article such as a newspaper. It is preferable not only to display the extracted important sentence on the monitor, but also to read aloud by voice instead of or together with the display on the monitor. Of course, you can print with a printer.

【００８６】以上、本発明の実施の形態について説明し
たが、本発明はこうした実施の形態に何等限定されるも
のではなく、例えば、重要文を他とは異なる態様（例え
ば色違い）で表示するなど、本発明の要旨を逸脱しない
範囲内において、更に種々なる形態で実施し得ることは
勿論である。Although the embodiments of the present invention have been described above, the present invention is not limited to these embodiments. For example, important sentences are displayed in a mode different from the others (for example, different colors). Needless to say, the present invention can be implemented in various forms without departing from the scope of the present invention.

【００８７】第１実施例では、系列図表作成処理（図
４）や要約作成・表示処理（図５）をデータベースサー
バ２００が実行することとしたが、系列図表作成処理や
要約作成・表示処理の内容を記述したプログラムをデー
タベースサーバ２００以外の装置に格納し、データベー
スサーバ２００以外の装置が実行する構成としてもよ
い。例えば、配信サーバ３００，３１０，３２０やクラ
イアントのパーソナルコンピュータ９００，９１０，９
２０が実行する構成等を考えることができる。In the first embodiment, the database server 200 executes the sequence chart creation process (FIG. 4) and the summary creation / display process (FIG. 5). A program in which the content is described may be stored in a device other than the database server 200 and executed by a device other than the database server 200. For example, distribution servers 300, 310, 320 and client personal computers 900, 910, 9
It is possible to think of a configuration or the like executed by the 20.

【００８８】また、第２実施例では、系列図表作成処理
や要約作成・表示処理を配信サーバ３０，３１が実行す
ることとしたが、系列図表作成処理や要約作成・表示処
理の内容を記述したプログラムを配信サーバ３０，３１
以外の装置に格納し、配信サーバ３０，３１以外の装置
が実行する構成としてもよい。例えば、上記プログラム
を読み取ったパーソナルコンピュータ９０，９１が実行
する構成等を考えることができる。In the second embodiment, the distribution server 30 or 31 executes the sequence chart creation process or the summary creation / display process, but the contents of the sequence diagram creation process or the summary creation / display process are described. Program distribution server 30, 31
It may be configured to be stored in a device other than the above and executed by a device other than the distribution servers 30 and 31. For example, a configuration executed by the personal computers 90 and 91 that read the program can be considered.

【００８９】また、上記実施例では、重要文抽出処理に
おいて、電子掲示板への書き込み文書が、平均的な文の
数が約５つ程度であり、一文が比較的短い傾向にあるこ
とを考慮し、抽出する文の数を２つと定めたが（図９の
ステップＳ５３０の処理）、抽出する文の数は、スレッ
ドＴＨに含まれる各文書（図３で言えば、投稿文書Ａ，
返信文書Ａ−１，返信文書Ａ−１−ａ）の長さ等に応じ
て任意に定めることができる。Further, in the above-described embodiment, in the important sentence extraction processing, considering that the average number of sentences written in the electronic bulletin board is about 5, and one sentence tends to be relatively short. Although the number of sentences to be extracted is set to be two (processing of step S530 in FIG. 9), the number of sentences to be extracted is determined by the number of documents included in the thread TH (posted document A, in FIG. 3).
It can be arbitrarily determined according to the length of the reply document A-1, reply document A-1-a).

【００９０】上記実施例では、重要文を抽出した後、こ
の重要文に対して文標準化処理を行なったが、重要文を
抽出する前に有効本文（投稿文書や返信文書の本文等）
に対して文標準化処理を行なう構成としてもよい。In the above embodiment, after the important sentence is extracted, the sentence standardization process is performed on this important sentence. However, before extracting the important sentence, the effective text (the text of the posted document or the reply document) is extracted.
A sentence standardization process may be performed for the.

【００９１】上記実施例では、重要文抽出処理（図５の
ステップＳ２２０）の終了後、要約作成処理（図５のス
テップＳ２４０），表題作成処理（図５のステップＳ２
６０）を行なったが、この要約作成処理や表題作成処理
を行なうことなく、重要文抽出処理で抽出された各文書
の重要文を表示データ出力処理（図５のステップＳ２８
０）により表示可能に出力する構成としてもよい。この
構成によっても、スレッドＴＨメール束に属する文書全
体によって表わされる内容を考慮して、スレッドＴＨに
属する各文書から文書全体の内容に最も近い意味の文が
重要文として抽出され、抽出された重要文が画面上に表
示される。従って、各スレッドＴＨや各メール束におけ
る主題の展開を、表示された重要文を見ただけで正確に
把握することができる。In the above embodiment, after the important sentence extracting process (step S220 in FIG. 5) is completed, the abstract creating process (step S240 in FIG. 5) and the title creating process (step S2 in FIG. 5).
60) was performed, but the important sentence of each document extracted by the important sentence extracting process is displayed without outputting the summary creating process and the title creating process (step S28 in FIG. 5).
It may be configured to output so that it can be displayed by 0). With this configuration also, in consideration of the content represented by the entire document belonging to the thread TH mail bundle, the sentence having the meaning closest to the content of the entire document is extracted as an important sentence from each document belonging to the thread TH, and the extracted important The sentence is displayed on the screen. Therefore, the development of the subject matter in each thread TH or each mail bundle can be accurately grasped only by looking at the displayed important sentence.

【００９２】特に、メール束に属する各メールから重要
文を抽出し、この重要文を時系列に配列した場合には、
このように配列された重要文をモニタ上に表示したり、
プリントアウトすることにより、過去のやり取りを一目
で把握することが可能となり、ビジネスシーン等におい
て有効に活用することができる。過去のやり取りにおけ
る特定の内容（例えば、双方が合意した納期）を調べよ
うとする際に、メール束に属する各メールを順次にモニ
タに表示したり若しくはプリントアウトし、メール束か
ら特定の内容を探すという必要がないからである。Particularly, when an important sentence is extracted from each mail belonging to the mail bundle and the important sentences are arranged in time series,
Display important sentences arranged in this way on the monitor,
By printing out, it becomes possible to grasp past exchanges at a glance, and it can be effectively utilized in business scenes and the like. When trying to find out the specific content in the past exchange (for example, the delivery date agreed by both parties), each mail belonging to the mail bundle is sequentially displayed on the monitor or printed out, and the specific content is extracted from the mail bundle. Because there is no need to search.

【００９３】以上本発明のいくつかの実施例と変形例に
ついて説明したが、本発明は、これらの実施例に何ら限
定されるものではなく、本発明の要旨を逸脱しない範囲
内において種々の態様で実施することができる。例え
ば、複数の文書の関係は、往信−返信の関係に限られる
ものではなく、例えば、同一の作成者が作成した複数の
文書、コメントや日記、報告書などであっても差し支え
ない。また、本発明の文書表示を行なう部分は、複数の
文書を蓄積しているサーバ上に置いても良いし、複数の
文書を閲覧するクライアント側においても良い。あるい
はこれらのサーバ−クライアントシステムの中間に、専
用のサーバとして置くことも可能である。これらのシス
テムを構成するためのプログラムは、ＣＤ−ＲＯＭなど
の媒体の形態で取り扱っても良いし、プログラムをサー
バ上に置き、使用するものがこれをダウンロードして、
自らのコンピュータ上で、上述した文書表示システムの
少なくとも一部を構築するものとしても良い。Although some embodiments and modifications of the present invention have been described above, the present invention is not limited to these embodiments, and various embodiments are possible without departing from the gist of the present invention. Can be implemented in. For example, the relationship between the plurality of documents is not limited to the relationship between the outgoing communication and the reply, and may be, for example, a plurality of documents created by the same creator, a comment, a diary, or a report. Further, the document display portion of the present invention may be placed on a server that stores a plurality of documents, or may be on the client side that browses a plurality of documents. Alternatively, it can be placed as a dedicated server between these server-client systems. The program for configuring these systems may be handled in the form of a medium such as a CD-ROM, or the program used on the server may be downloaded by a person who uses it.
At least a part of the document display system described above may be constructed on its own computer.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の第１実施例である掲示板表示システム
ＫＳの概略構成を示す説明図である。FIG. 1 is an explanatory diagram showing a schematic configuration of a bulletin board display system KS that is a first embodiment of the present invention.

【図２】データベースサーバ２００の構造を示す説明図
である。FIG. 2 is an explanatory diagram showing a structure of a database server 200.

【図３】電子掲示板に書き込まれている文書の系列図表
を示す説明図である。FIG. 3 is an explanatory diagram showing a sequence chart of documents written on an electronic bulletin board.

【図４】系列図表作成処理ルーチンを示すフローチャー
トである。FIG. 4 is a flowchart showing a sequence chart creation processing routine.

【図５】要約作成・表示処理ルーチンを示すフローチャ
ートである。FIG. 5 is a flowchart showing a summary creation / display processing routine.

【図６】図３で二点鎖線で囲ったスレッドＴＨに属する
各文書の例文を示す説明図である。FIG. 6 is an explanatory diagram showing an example sentence of each document belonging to a thread TH surrounded by a chain double-dashed line in FIG. 3;

【図７】重要文抽出処理ルーチンを示すフローチャート
である。FIG. 7 is a flowchart showing an important sentence extraction processing routine.

【図８】スレッドＴＨの種類が質問型である場合の重要
文の抽出処理を示すフローチャートである。FIG. 8 is a flowchart showing an important sentence extraction process when the type of thread TH is a question type.

【図９】スレッドＴＨの種類が複数話題型および会話型
である場合の重要文の抽出処理を示すフローチャートで
ある。FIG. 9 is a flowchart showing an important sentence extraction process when the types of threads TH are a multi-topic type and a conversation type.

【図１０】図９の例文を用いて各単語のＴＦＩＤＦ値を
計算した計算例を示す説明図である。10 is an explanatory diagram showing a calculation example in which the TFIDF value of each word is calculated using the example sentence of FIG.

【図１１】図９の例文におけるＴＦＩＤＦ値の文単位の
合算値を例示する説明図である。FIG. 11 is an explanatory diagram exemplifying a total value of TFIDF values in sentence units in the example sentence of FIG. 9;

【図１２】表示データの出力後にパーソナルコンピュー
タ９００のディスプレイのプレビュー画面に初期表示さ
れる内容を示す説明図である。FIG. 12 is an explanatory diagram showing the contents initially displayed on the preview screen of the display of the personal computer 900 after the display data is output.

【図１３】一覧表示画面の一例を示す説明図である。FIG. 13 is an explanatory diagram showing an example of a list display screen.

【図１４】メール表示システムＭＳの概略構成を他の実
施例として示す説明図である。FIG. 14 is an explanatory diagram showing a schematic configuration of a mail display system MS as another embodiment.

【図１５】配信サーバ３０，３１により作成されるメー
ルの時系列図表を示す説明図である。FIG. 15 is an explanatory diagram showing a time series chart of mails created by distribution servers 30 and 31.

【符号の説明】[Explanation of symbols]

１０…ネットワーク３０，３１…配信サーバ９０，９１…パーソナルコンピュータ１００…ネットワーク２００…データベースサーバ２１０…ネットワークインタフェース２２０…ＣＰＵ２３０…ＲＯＭ２４０…ＲＡＭ２５０…タイマ２６０…データベース蓄積部２７０…ハードディスク３００，３１０，３２０…配信サーバ８００…パーソナルコンピュータ９００，９１０，９２０…パーソナルコンピュータＫＳ…掲示板表示システムＭＳ…メール表示システムＭＢ…メール束ＴＨ…スレッド10 ... Network30, 31 ... Delivery server90, 91 ... Personal computer100 ... Network200 ... Database server210 ... Network interface220 ... CPU230 ... ROM240 ... RAM250 ... Timer260 ... Database storage unit270 ... Hard disk300, 310, 320 ... Delivery server800 ... Personal computer900, 910, 920 ... Personal computerKS ... Bulletin board display systemMS ... Mail display systemMB ... Email bundleTH ... thread

─────────────────────────────────────────────────────フロントページの続き (72)発明者長保和義長野県諏訪市大和三丁目３番５号セイコーエプソン株式会社内(72)発明者鈴木勝仁長野県諏訪市大和三丁目３番５号セイコーエプソン株式会社内Ｆターム(参考） 5E501 AA02 AB19 AC25 AC34 BA05 CA02 CB02 CB09 EA12 EB17 FA13 FA23 FB45 ─────────────────────────────────────────────────── ───Continued front page (72) Inventor Kazuyoshi Nagabo Seiko, 3-3 Yamato 3-chome, Suwa City, Nagano Prefecture -In Epson Corporation(72) Inventor Katsuhito Suzuki Seiko, 3-3 Yamato 3-chome, Suwa City, Nagano Prefecture -In Epson CorporationF-term (reference) 5E501 AA02 AB19 AC25 AC34 BA05 CA02 CB02 CB09 EA12 EB17 FA13 FA23 FB45