JP7570088B2

Info

Publication number: JP7570088B2
Application number: JP2020105587A
Authority: JP
Inventors: 英明間世田; 賢一郎今井
Original assignee: National Institute of Advanced Industrial Science and Technology AIST
Current assignee: National Institute of Advanced Industrial Science and Technology AIST
Priority date: 2020-06-18
Filing date: 2020-06-18
Publication date: 2024-10-21
Anticipated expiration: 2040-06-18
Also published as: WO2021256525A1; JP2021197100A

Description

Translated from Japanese

The present invention relates to an information processing system, an information processing method, an identification method, and a program.

Mutations in genes (the genome) are known to cause a variety of diseases. For example, cancer cells are known to become malignant through genetic mutation. Identifying the genetic mutations characteristic of a given type of cancer is expected to contribute not only to elucidating the mechanism of carcinogenesis but also to cancer diagnosis, cancer prediction, and the development of treatments (see Patent Document 1).

Patent Document 1: Japanese Re-publication of PCT International Publication No. 2015-93557

However, there is a first problem: for a test subject (examinee) undergoing a health checkup or the like, it is difficult to grasp the future risk of disease associated with genetic mutations. There is also a second problem: because mutations in genes (the genome) occur randomly in the first place, it is difficult to pick out the mutations involved in a disease from the enormous number of mutations that occur.

One aspect of the present invention has been made in view of the first problem, and aims to provide an information processing system, an information processing method, and a program that make it easier to grasp the risk of disease associated with a genetic mutation, or to grasp the likelihood that a genetic mutation will occur.
Another aspect of the present invention has been made in view of the second problem, and aims to provide an information processing system, an identification method, and a program that make it easier to find genetic mutations involved in disease.

The problem of making it easier to grasp the risk of disease associated with a genetic mutation is addressed by the following first to seventh aspects.
An information processing system according to a first aspect of the present invention is an information processing system relating to a genetic mutation from a base sequence of a first formula: X-Y-X-Y-X [where X and Y denote mutually different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not immediately precede the first X] to a base sequence of a second formula: X-Y-X. The system includes an execution unit and an output unit. The execution unit executes a search for whether the genetic sequence of a test subject has a mutation to the base sequence of the second formula: X-Y-X at a specific gene position corresponding to a target disease, and/or executes extraction of the amount of the base sequence of the second formula: X-Y-X transcribed into RNA in a sample from the test subject. The output unit uses the result of at least one of these executions to output information on the risk of the test subject contracting the target disease, or information on the genetic mutation.

According to this configuration, when information on the risk of the test subject contracting the target disease is output, and the mutation at the specific gene position from the base sequence of the first formula: X-Y-X-Y-X to the base sequence of the second formula: X-Y-X is involved in the target disease, information on that risk can be obtained, which makes it easier to grasp the risk of disease associated with a genetic mutation. When information on the genetic mutation is output, on the other hand, the likelihood of a mutation to the base sequence of the second formula: X-Y-X can be known, which makes it easier to grasp the likelihood that a genetic mutation will occur. The sample here is tissue or a body fluid.
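The search performed by the execution unit can be illustrated with a minimal Python sketch. It assumes that the candidate X and Y for the site are already known, that sequences are plain strings over A/C/G/T, and that positions are 0-based; the function name `classify_site` and these conventions are hypothetical and are not taken from the specification.

```python
def classify_site(seq: str, pos: int, x: str, y: str) -> str:
    """Classify the site at 0-based index `pos` of `seq`.

    Returns "first_formula" for X-Y-X-Y-X, "second_formula" for X-Y-X,
    and "other" when neither form is present at `pos`.
    """
    if len(x) < 2 or len(y) < 2 or x == y:
        raise ValueError("X and Y must be different sequences of 2+ bases")

    def not_flanked_by_y(end: int) -> bool:
        # Y must not immediately precede the first X or follow the last X.
        return not seq.endswith(y, 0, pos) and not seq.startswith(y, end)

    first = x + y + x + y + x
    second = x + y + x
    # Test the longer form first, because X-Y-X is a prefix of X-Y-X-Y-X.
    if seq.startswith(first, pos) and not_flanked_by_y(pos + len(first)):
        return "first_formula"
    if seq.startswith(second, pos) and not_flanked_by_y(pos + len(second)):
        return "second_formula"
    return "other"


# Illustrative (made-up) sequences: X = "AC", Y = "GTT", site at index 2.
reference = "GG" + "ACGTTACGTTAC" + "TT"
subject = "GG" + "ACGTTAC" + "TT"
```

With these made-up inputs, the reference is classified as the first formula and the subject as the second formula, which is the condition the search is meant to detect.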

An information processing system according to a second aspect of the present invention is the information processing system according to the first aspect, in which the output unit compares the transcription amount of the test subject with the transcription amount of healthy individuals and/or patients with the target disease, and uses the comparison result and/or the result of the search to output information on the risk of the test subject contracting the target disease.

According to this configuration, the risk of the test subject contracting the target disease can be estimated from the result of comparing the test subject's transcription amount with that of healthy individuals and/or patients with the target disease.
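This passage does not say how the comparison is carried out. As one hedged illustration, the sketch below compares the subject's transcription amount with the mean of each reference group and reports which mean it lies closer to. The function name, the use of group means, and the example values are all assumptions made for illustration, and the sketch assumes both reference groups are available.

```python
from statistics import mean


def compare_transcription(subject_amount, healthy_amounts, patient_amounts):
    """Compare a subject's X-Y-X transcription amount with two reference groups."""
    healthy_mean = mean(healthy_amounts)
    patient_mean = mean(patient_amounts)
    # Hypothetical rule: report the group whose mean is nearer to the subject.
    if abs(subject_amount - patient_mean) < abs(subject_amount - healthy_mean):
        closer_to = "patient"
    else:
        closer_to = "healthy"
    return {
        "healthy_mean": healthy_mean,
        "patient_mean": patient_mean,
        "closer_to": closer_to,
    }


# Made-up amounts in arbitrary units.
result = compare_transcription(9.0, [1.0, 2.0, 3.0], [8.0, 10.0, 12.0])
```

A real system would likely account for the spread within each group as well; the nearest-mean rule is used here only to keep the sketch short.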

An information processing system according to a third aspect of the present invention is the information processing system according to the second aspect, further including a storage that stores the amount of the base sequence of the second formula: X-Y-X transcribed into RNA in samples from healthy individuals and/or patients with the target disease. The output unit refers to the storage to compare the transcription amount of the test subject with the transcription amount of the healthy individuals and/or the patients with the target disease, and uses the comparison result and/or the result of the search to output information on the risk of the test subject contracting the target disease.

This configuration makes it easier to grasp the risk of disease associated with a genetic mutation.

An information processing system according to a fourth aspect of the present invention is the information processing system according to any one of the first to third aspects, further including a storage that stores, in association with each other, the base sequence of the second formula: X-Y-X resulting from a genetic mutation involved in the target disease and the gene position at which the mutation to that base sequence occurs. The execution unit reads the base sequence of the second formula: X-Y-X and the gene position from the storage, and executes a search for whether the genetic sequence of the test subject has a mutation to the read base sequence of the second formula: X-Y-X at the read gene position.
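The table-driven search of the fourth aspect might look like the following sketch. The record layout is hypothetical: the specification stores the X-Y-X sequence together with its gene position, whereas this sketch stores X and Y separately so that an unmutated X-Y-X-Y-X site (which begins with X-Y-X) is not reported as a hit. Positions are assumed to be 0-based string indices.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MutationRecord:
    disease: str
    position: int  # 0-based start of the site (assumed convention)
    x: str
    y: str


def search_stored_mutations(subject_seq, table, disease):
    """Return the stored records whose X-Y-X mutation is found in the subject."""
    found = []
    for rec in table:
        if rec.disease != disease:
            continue
        first = rec.x + rec.y + rec.x + rec.y + rec.x
        second = rec.x + rec.y + rec.x
        # A hit needs X-Y-X at the stored position but not the longer X-Y-X-Y-X.
        if subject_seq.startswith(second, rec.position) and not subject_seq.startswith(
            first, rec.position
        ):
            found.append(rec)
    return found


# Made-up table and sequences for illustration only.
table = [
    MutationRecord("disease A", 2, "AC", "GTT"),
    MutationRecord("disease B", 0, "GG", "ACG"),
]
hits = search_stored_mutations("GGACGTTACTT", table, "disease A")
```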

An information processing system according to a fifth aspect of the present invention is the information processing system according to any one of the first to fourth aspects, in which the total length of the base sequences before and after the base sequence of the second formula: X-Y-X is 40 mer or more.

According to this configuration, sequences with a high probability of mutating can be taken into account in the risk estimate as genetic mutations involved in the disease, which safeguards the accuracy of the disease risk estimate.

An information processing method according to a sixth aspect of the present invention is an information processing method relating to a genetic mutation from a base sequence of a first formula: X-Y-X-Y-X [where X and Y denote mutually different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not immediately precede the first X] to a base sequence of a second formula: X-Y-X. The method includes an execution step and an output step. The execution step executes a search for whether the genetic sequence of a test subject has a mutation to the base sequence of the second formula: X-Y-X at a specific gene position corresponding to a target disease, and/or executes extraction of the amount of the base sequence of the second formula: X-Y-X transcribed into RNA in a sample from the test subject. The output step uses the result of at least one of these executions to output information on the risk of the test subject contracting the target disease, or information on the genetic mutation.

According to this configuration, when information on the risk of the test subject contracting the target disease is output, and the mutation at the specific gene position from the base sequence of the first formula: X-Y-X-Y-X to the base sequence of the second formula: X-Y-X is involved in the target disease, information on that risk can be obtained, which makes it easier to grasp the risk of disease associated with a genetic mutation. When information on the genetic mutation is output, on the other hand, the likelihood of a mutation to the base sequence of the second formula: X-Y-X can be known, which makes it easier to grasp the likelihood that a genetic mutation will occur.

A program according to a seventh aspect of the present invention is a program relating to a genetic mutation from a base sequence of a first formula: X-Y-X-Y-X [where X and Y denote mutually different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not immediately precede the first X] to a base sequence of a second formula: X-Y-X. The program causes a computer to function as an execution unit and an output unit. The execution unit executes a search for whether the genetic sequence of a test subject has a mutation to the base sequence of the second formula: X-Y-X at a specific gene position corresponding to a target disease, and/or executes extraction of the amount of the base sequence of the second formula: X-Y-X transcribed into RNA in a sample from the test subject. The output unit uses the result of at least one of these executions to output information on the risk of the test subject contracting the target disease, or information on the genetic mutation.

The problem of making it easier to find genetic mutations involved in disease is addressed by the following eighth to tenth aspects.
An information processing system according to an eighth aspect of the present invention includes an extraction unit. When a base sequence of a first formula: X-Y-X-Y-X [where X and Y denote mutually different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not immediately precede the first X] is present in the genetic sequence of a standard or healthy individual, while a base sequence of a second formula: X-Y-X, rather than the first formula: X-Y-X-Y-X, is present at the corresponding gene position in the genetic sequence of a patient with a target disease, the extraction unit extracts the mutation to the base sequence of the second formula: X-Y-X at that gene position as a genetic mutation involved in the target disease.

According to this configuration, genetic mutations involved in the target disease can be extracted, which makes it easier to find genetic mutations involved in the disease.

An identification method according to a ninth aspect of the present invention includes an extraction step. When a base sequence of a first formula: X-Y-X-Y-X [where X and Y denote mutually different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not immediately precede the first X] is present in the genetic sequence of a standard or healthy individual, while a base sequence of a second formula: X-Y-X, rather than the first formula: X-Y-X-Y-X, is present at the corresponding position in the genetic sequence of a patient with a target disease, the step extracts the base sequence of the second formula: X-Y-X as a genetic mutation involved in the target disease.

A program according to a tenth aspect of the present invention causes a computer to function as an extraction unit. When a base sequence of a first formula: X-Y-X-Y-X [where X and Y denote mutually different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not immediately precede the first X] is present in the genetic sequence of a standard or healthy individual, while a base sequence of a second formula: X-Y-X, rather than the first formula: X-Y-X-Y-X, is present at the corresponding position in the genetic sequence of a patient with a target disease, the extraction unit extracts the base sequence of the second formula: X-Y-X as a genetic mutation involved in the target disease.

According to this configuration, genetic mutations involved in the target disease can be extracted automatically, which makes it easier to find genetic mutations involved in the disease.

According to one aspect of the present invention, information on the risk of a test subject contracting a target disease can be obtained, which makes it easier to grasp the risk of disease associated with a genetic mutation. According to another aspect of the present invention, genetic mutations involved in a target disease can be extracted automatically, which makes it easier to find genetic mutations involved in the disease.

FIG. 1 is a schematic diagram for explaining the genetic mutation according to the first embodiment.
FIG. 2 is a schematic diagram of the genetic sequences of a standard or healthy individual and of a patient with a disease A according to the first embodiment.
FIG. 3 is a schematic configuration diagram of the information processing system according to the first embodiment.
FIG. 4 is a schematic configuration diagram of a terminal according to the first embodiment.
FIG. 5 is a schematic configuration diagram of the server according to the first embodiment.
FIG. 6 is an example of screen transitions on the terminal 1 according to the first embodiment.
FIG. 7 is a flowchart showing a first example of the processing flow according to the first embodiment.
FIG. 8 is an example of the mutant base sequence table T stored in the storage of the server according to the first embodiment.
FIG. 9 is a schematic diagram for explaining a disease risk of a test subject in the second embodiment.
FIG. 10 is a schematic diagram for explaining the process by which an X-Y base sequence drops out of the base sequence of the first formula: X-Y-X-Y-X in the second embodiment.
FIG. 11 is a schematic diagram for explaining another disease risk of a test subject in the second embodiment.
FIG. 12 is a schematic diagram for explaining the transcription amount and an outline of how it is measured.
FIG. 13 is an example of screen transitions on the terminal 1 during testing according to the second embodiment.
FIG. 14 is an example of the contents of an RNA transcription amount information file.
FIG. 15 is a table for explaining an example of a method of calculating a risk value.
FIG. 16 is an example of the per-disease gene mutation table stored in the storage.
FIG. 17 is an example of the per-disease transcription amount table stored in the storage.
FIG. 18A is a flowchart showing an example of processing that displays a list of the test subject's disease risks using both a DNA sequence and an RNA transcription amount.
FIG. 18B is a flowchart continuing from FIG. 18A.
FIG. 19 is a flowchart showing an example of processing that displays a list of the test subject's disease risks using a DNA sequence.
FIG. 20A is a flowchart showing an example of processing that displays a list of the test subject's disease risks using RNA transcription amount information.
FIG. 20B is a flowchart continuing from FIG. 20A.

Each embodiment is described below with reference to the drawings. Unnecessarily detailed description may be omitted. For example, detailed description of matters that are already well known, and repeated description of substantially identical configurations, may be omitted. This keeps the following description from becoming needlessly redundant and makes it easier for those skilled in the art to understand.

First Embodiment
Of the problems addressed by the present invention, the first embodiment addresses the problem of making it easier to find genetic mutations involved in disease. FIG. 1 is a schematic diagram for explaining the genetic mutation according to the first embodiment. The inventors have newly discovered that a base sequence of the first formula: X-Y-X-Y-X [where X and Y denote mutually different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not immediately precede the first X] contained in an organism's genetic sequence (specifically, for example, a DNA sequence or an RNA sequence) mutates, with a certain probability during transcription from DNA to RNA, into a base sequence of the second formula: X-Y-X, which is made up of the same constituent base sequences X and Y. As shown in FIG. 1, this happens when, for example, the X-Y or Y-X base sequence enclosed by any one of the four dashed frames drops out of the base sequence of the first formula: X-Y-X-Y-X. The way the sequence drops out is not limited to these cases, as long as X-Y-X ultimately results. When the base sequence is shortened to X-Y-X, a frameshift can change the codon reading frame and produce an abnormal protein, which can cause disease. Even when no frameshift occurs, the protein is shortened and becomes abnormal, which can also cause disease.
The mutation from the base sequence of the first formula: X-Y-X-Y-X to the base sequence of the second formula: X-Y-X may be observed either (1) by reading the base sequence of genomic DNA or (2) by reading the base sequence of mRNA. Either case applies in the first embodiment.
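The four deletions described for FIG. 1 can be checked with a short sketch: removing any adjacent X-Y or Y-X pair from X-Y-X-Y-X leaves X-Y-X. The sketch also flags whether the deleted length is a multiple of three, since a deletion whose length is not a multiple of three shifts the reading frame when it falls inside a protein-coding region. The function name and the example sequences are hypothetical.

```python
def deletion_outcomes(x: str, y: str) -> list[dict]:
    """Enumerate the four adjacent-pair deletions from X-Y-X-Y-X."""
    units = [x, y, x, y, x]
    outcomes = []
    for i in range(4):
        removed = units[i] + units[i + 1]  # X-Y when i is even, Y-X when odd
        remaining = "".join(units[:i] + units[i + 2:])
        outcomes.append(
            {
                "removed": removed,
                "remaining": remaining,
                # Applies only if the site lies within a coding region.
                "frameshift": len(removed) % 3 != 0,
            }
        )
    return outcomes


# Made-up units: deleting 5 bases shifts the frame, deleting 6 does not.
shifted = deletion_outcomes("AC", "GTT")
in_frame = deletion_outcomes("ACG", "TTA")
```

Both branches of the description appear here: a five-base deletion changes the reading frame, while a six-base deletion keeps the frame but still shortens the product.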

FIG. 2 is a schematic diagram of the genetic sequences of a standard or healthy individual and of a patient with a disease A according to the first embodiment. In FIG. 2, the genetic sequence (specifically, for example, a DNA sequence or an RNA sequence) of the standard or healthy individual has the base sequence X₁-Y₁-X₁-Y₁-X₁, an example of the first formula, at gene position 23995, whereas the genetic sequence of the patient with disease A has the base sequence X₁-Y₁-X₁, an example of the second formula, at gene position 23995. As FIG. 2 shows, in the first embodiment, when the base sequence of the first formula: X-Y-X-Y-X is present in the genetic sequence of the standard or healthy individual, but the base sequence of the second formula: X-Y-X, rather than the first formula: X-Y-X-Y-X, is present at the corresponding position in the genetic sequence of a patient with a specific disease, the base sequence of the second formula: X-Y-X is extracted as a genetic mutation involved in disease A. The samples from which the base sequences of the healthy individual and the patient with disease A are obtained may be samples for obtaining DNA or samples for obtaining RNA. When the base sequence of genomic DNA is read as in (1) above, the sample contains at least DNA. When the base sequence of mRNA is read as in (2) above, the sample contains at least RNA.

FIG. 3 is a schematic configuration diagram of the information processing system according to the first embodiment. As shown in FIG. 3, the information processing system S includes, as an example, a server 2 connected to terminals 1-1 to 1-N via a communication network CN. The terminals 1-1, ..., 1-N (N is a natural number) are, as an example, provided outside the information processing system S, although the information processing system S may instead include them. The example here shows the information processing system S configured as a single server 2, but it is not limited to this and may be configured from multiple computers, as in a cloud service.

The terminals 1-1 to 1-N are terminal devices used by different users, for example mobile phones such as multifunction mobile phones (so-called smartphones), tablets, notebook computers, or desktop computers. An application containing the program according to the first embodiment is installed on each of the terminals 1-1 to 1-N, for example, and launching the application displays information provided by the server 2 on the screen that the application presents. The terminals 1-1 to 1-N are not limited to this and may, for example, display the information provided by the server 2 in a web browser. In the following description of the first embodiment, the terminal 1 is assumed to be a desktop computer as an example.

The server 2 is an example of an information processing device and provides information to the terminals 1-1 to 1-N. Hereinafter, the terminals 1-1 to 1-N are also referred to collectively as the terminal 1.

FIG. 4 is a schematic configuration diagram of a terminal according to the first embodiment. As shown in FIG. 4, the terminal 1 includes, for example, an input interface 11, a communication circuit 12, a storage 13, a memory 14, an output interface 15, a processor 16, and a display 17.
The input interface 11 accepts input from the user and outputs an input signal corresponding to the accepted input to the processor 16. The input interface 11 is, for example, a keyboard.
The communication circuit 12 is connected to the communication network CN and communicates with the server 2, which is also connected to the communication network CN. This communication may be wired or wireless.

The storage 13 stores the application program that the processor 16 reads and executes, together with various data. The application has been downloaded and installed via a server or the cloud.
The memory 14 temporarily holds data and programs. The memory 14 is volatile memory, for example RAM (Random Access Memory).
The output interface 15 is connected to the display 17 and outputs a video signal to the display in accordance with commands from the processor 16. The display 17 may be built into the terminal 1 rather than attached externally.

The processor 16 loads the application program according to the first embodiment from the storage 13 into the memory 14 and executes the series of instructions contained in the program.

FIG. 5 is a schematic configuration diagram of the server according to the first embodiment. As shown in FIG. 5, the server 2 includes an input interface 21, a communication circuit 22, a storage 23, a memory 24, an output interface 25, and a processor 26.
The input interface 21 accepts input from the administrator of the server 2 and outputs an input signal corresponding to the accepted input to the processor 26.
The communication circuit 22 is connected to the communication network CN and communicates with the terminals 1-1 to 1-N, which are also connected to the communication network CN. This communication may be wired or wireless.

The storage 23 stores the programs that the processor 26 reads and executes, together with various data. For example, the storage 23 stores the genetic sequences and transcription amount information of standard or healthy individuals.
The memory 24 temporarily holds data and programs. The memory 24 is volatile memory, for example RAM (Random Access Memory).
The output interface 25 is connected to an external device (for example, a display) and outputs a signal (for example, a video signal) to that external device in accordance with commands from the processor 26. As a result, for example, the video signal is input to the display and information is displayed.

The processor 26 loads a program from the storage 23 into the memory 24 and executes the series of instructions contained in the program, thereby functioning as an extraction unit 261, an execution unit 262, an output unit 263, and a communication control unit 264.

Fig. 6 shows an example of screen transitions on the terminal 1 according to the first embodiment. Screen G1 displays a text box B1 in which a disease name can be entered and a box B2 that accepts the gene sequence of a patient with that disease. Here, the patient's gene sequence is, for example, data acquired independently or data acquired from a public server. Screen G1 also displays a send button B3; when the send button B3 is pressed, the entered disease name and the gene sequence of the patient with that disease are sent to the server 2.

When the server 2 receives the disease name and the gene sequence of a patient with the disease, the extraction unit 261 of the server 2 checks whether a base sequence of the first formula: X-Y-X-Y-X [where X and Y are different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not appear immediately before the first X] exists in the gene sequence of a standard or healthy individual stored in the storage 23 while, at the corresponding position in the gene sequence of the patient with the disease, a base sequence of the second formula: X-Y-X exists instead of the first formula: X-Y-X-Y-X. If so, the extraction unit 261 extracts the base sequence of the second formula: X-Y-X as a gene mutation involved in the disease.
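The check described above can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the function names `find_xyxyx` and `is_xy_deletion`, the `max_unit` search bound, and the assumption that the reference and patient sequences share coordinates upstream of the repeat are all hypothetical. A practical pipeline would align the sequences first.

```python
from typing import Iterator, Tuple


def find_xyxyx(seq: str, min_unit: int = 2, max_unit: int = 6) -> Iterator[Tuple[int, str, str]]:
    """Yield (start, X, Y) for each X-Y-X-Y-X occurrence in seq.

    X and Y are different sub-sequences of at least min_unit bases.
    max_unit is an assumed search bound, not a value from the source.
    """
    n = len(seq)
    for start in range(n):
        for lx in range(min_unit, max_unit + 1):
            for ly in range(min_unit, max_unit + 1):
                end = start + 3 * lx + 2 * ly
                if end > n:
                    break  # longer Y only makes the pattern longer
                x = seq[start:start + lx]
                y = seq[start + lx:start + lx + ly]
                if x == y or seq[start:end] != x + y + x + y + x:
                    continue
                # Boundary conditions: no Y right before the first X,
                # and no Y right after the last X.
                if start >= ly and seq[start - ly:start] == y:
                    continue
                if seq[end:end + ly] == y:
                    continue
                yield start, x, y


def is_xy_deletion(reference: str, patient: str, start: int, x: str, y: str) -> bool:
    """True if the patient carries X-Y-X where the reference carries X-Y-X-Y-X."""
    ref_len = 3 * len(x) + 2 * len(y)
    collapsed = x + y + x
    return (
        patient[:start] == reference[:start]
        and patient[start:start + len(collapsed)] == collapsed
        and patient[start + len(collapsed):] == reference[start + ref_len:]
    )


reference = "TTTACGTACGTACGGG"  # TTT + AC-GT-AC-GT-AC + GGG
patient = "TTTACGTACGGG"        # TTT + AC-GT-AC + GGG
hits = list(find_xyxyx(reference))
```

The boundary checks matter here: without them, shifted readings of the same repeat (for example X = CG, Y = TA starting one base later) would also be reported.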

Here, the case where the base sequence of the first formula: X-Y-X-Y-X exists in the gene sequence of the standard or healthy individual but the base sequence of the second formula: X-Y-X exists instead at the corresponding position in the gene sequence of the patient with the target disease may be the case where, at the corresponding position, the base sequence of the second formula: X-Y-X is present statistically significantly more often in the gene sequences of patients with the target disease than in the gene sequences of healthy individuals. To establish statistical significance, a statistical test is performed on the gene sequences of a plurality of patients with the target disease. With each of those gene sequences obtained in advance, the extraction unit 261 extracts the base sequence of the second formula: X-Y-X as a gene mutation involved in the target disease (or extracts the gene as a disease-related mutant gene) when, at the corresponding position, it is present statistically significantly more often in the gene sequences of patients with the target disease than in those of healthy individuals. This reduces the probability of false positives, in which a gene mutation that is not involved in the disease is extracted as being involved in it.
Even among patients with the target disease, some have the gene mutation to the base sequence of the second formula: X-Y-X at the target gene position and some do not. Therefore, when the extraction unit 261 has extracted the gene mutation to the base sequence of the second formula: X-Y-X at the target gene position as a gene mutation involved in the target disease, it may also calculate the probability that this gene mutation has occurred at the target gene position within the group of patients with the target disease (hereinafter referred to as the mutation occurrence rate).
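As an illustration of the statistical test and the mutation occurrence rate, the sketch below uses a one-sided Fisher exact test. The description does not name a specific test, so the choice of Fisher's test, the function names, and the example counts are assumptions made only for this example.

```python
from math import comb


def fisher_one_sided(pat_with: int, pat_total: int, ctl_with: int, ctl_total: int) -> float:
    """One-sided Fisher exact p-value.

    Probability, under the hypergeometric null, of seeing at least
    pat_with carriers of the X-Y-X variant among the patients.
    """
    carriers = pat_with + ctl_with
    total = pat_total + ctl_total
    upper = min(carriers, pat_total)
    tail = sum(
        comb(carriers, k) * comb(total - carriers, pat_total - k)
        for k in range(pat_with, upper + 1)
    )
    return tail / comb(total, pat_total)


def mutation_occurrence_rate(pat_with: int, pat_total: int) -> float:
    """Fraction of patients carrying the X-Y-X variant at the target position."""
    return pat_with / pat_total


# Hypothetical counts: 8 of 10 patients and 1 of 10 healthy individuals carry X-Y-X.
p_value = fisher_one_sided(8, 10, 1, 10)
rate = mutation_occurrence_rate(8, 10)
```

With these counts the p-value is 506/184756 (about 0.0027), so the variant would be extracted at a conventional 0.05 threshold; the threshold itself is also an assumption.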

In the gene sequence (e.g., DNA sequence), the total length of the base sequence abc (where a, b, and c are each a base sequence of one or more bases) preceding the base sequence of the second formula: X-Y-X and the base sequence def (where d, e, and f are each a base sequence of one or more bases) following it is preferably 40 mer or more. This is because, when the base sequences flanking the base sequence of the second formula: X-Y-X total less than 40 mer, dropout of X-Y or Y-X occurs less frequently (that is, the probability of dropout decreases); the base sequences flanking the base sequence of the second formula: X-Y-X extracted by the extraction unit 261 are therefore preferably 40 mer or more. This makes it possible to identify mutations that are highly likely to occur as gene mutations involved in a disease. However, if the base sequence of the second formula: X-Y-X is itself sufficiently long, the total length of the flanking base sequences abc and def may be 0; it is preferably 1 mer or more, more preferably 20 mer or more, and still more preferably 40 mer or more.

The gene sequence of a patient with the target disease is extracted from the tissue in which the target disease occurs or from a bodily fluid such as blood. This is because, even in a patient with the target disease, the gene mutation involved in the disease may not have occurred in tissue where the disease has not developed. A bodily fluid such as blood may also be used, because RNA carrying a gene mutation involved in the disease can leak into such fluids. This improves the accuracy of identifying gene mutations involved in the disease.

In addition, as an example, the storage 23 stores pairs of a gene mutation sequence position and a gene name. In this case, the extraction unit 261, for example, reads the gene name corresponding to the sequence position of the gene mutation from the storage 23. The extraction unit 261 then generates information (for example, information for displaying screen G2) for displaying the read gene name in association with the sequence position where the XY deletion occurred, the original base sequence, and the base sequence after mutation. The communication control unit 264 performs control so that this information is transmitted to the terminal 1. On receiving this information, the terminal 1 displays it on the display 17. As a result, screen G2 of Fig. 6 is displayed on the display 17 of the terminal 1.

Next, the processing flow is described using Fig. 7, with reference to Fig. 6. Fig. 7 is a flowchart showing a first example of the processing flow according to the first embodiment. The processing in the flowcharts below is performed by the processor 16 of the terminal 1 or the processor 26 of the server 2, but for brevity the processors are not mentioned.

(Step S10) First, the terminal 1 accepts a disease name and the gene sequence of a patient with that disease, as shown on screen G1 of Fig. 6.

(Step S20) Next, the terminal 1 transmits the disease name and the gene sequence of the patient with that disease.

(Step S30) Next, the server 2 compares the standard gene sequence (or the gene sequence of a healthy individual) with the gene sequence of the patient with the disease.

(Step S40) Next, the extraction unit 261 of the server 2 compares the gene sequence of the standard or healthy individual with the gene sequence of the patient with the disease and searches for a case where a base sequence of the first formula: X-Y-X-Y-X [where X and Y are different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not appear immediately before the first X] exists in the gene sequence of the standard or healthy individual while, at the corresponding gene position in the gene sequence of the patient with the disease, a base sequence of the second formula: X-Y-X exists, for example with statistical significance.

(Step S50) Next, when the extraction unit 261 of the server 2 detects that a base sequence of the first formula: X-Y-X-Y-X, as defined in step S40, exists in the gene sequence of the standard or healthy individual while a base sequence of the second formula: X-Y-X exists, for example with statistical significance, at the corresponding gene position in the gene sequence of the patient with the disease (step S40: YES), it extracts the X-Y-X base sequence at the detected position as a gene mutation involved in the disease and saves the detected position and the X-Y-X base sequence in the storage 23 in association with the disease.

(Step S60) Next, the server 2 transmits information indicating that the X-Y-X base sequence at the detected position is a gene mutation involved in the disease.

(Step S70) On receiving the information transmitted in step S60, the terminal 1 uses it to perform display control so as to indicate, as shown on screen G2 of Fig. 6, that the X-Y-X base sequence at the detected position is a gene mutation involved in the disease.

As described above, the information processing system according to the first embodiment includes an extraction unit 261 that, when a base sequence of the first formula: X-Y-X-Y-X [where X and Y are different base sequences, each two or more bases long, provided that Y does not immediately follow the last X and Y does not appear immediately before the first X] exists in the gene sequence of a standard or healthy individual while a base sequence of the second formula: X-Y-X exists instead of the first formula: X-Y-X-Y-X at the corresponding position in the gene sequence of a patient with the target disease, extracts the base sequence of the second formula: X-Y-X as a gene mutation involved in the target disease.

With this configuration, the extraction unit 261 can automatically extract gene mutations involved in the target disease, which makes it easier to find gene mutations involved in a disease.

By repeating the processing of Fig. 7, a mutant base sequence table T1 is accumulated in the storage 23, as shown in Fig. 8. Fig. 8 shows an example of the mutant base sequence table T1 stored in the storage of the server according to the first embodiment. As shown in Fig. 8, the mutant base sequence table T1 stored in the storage 23 of the server 2 accumulates records each consisting of, for example, a disease name, the sequence position where the X-Y deletion occurred, the original base sequence, and the base sequence after mutation. In this way, the storage 23 stores a gene mutation to the base sequence of the second formula: X-Y-X, the position on the gene sequence where the mutation occurred, and the disease in which the mutation is involved, in association with one another.
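One possible in-memory shape for the records of table T1 is sketched below. The class and field names are hypothetical; the description only states which items each record holds (disease name, sequence position, original base sequence, base sequence after mutation) and does not prescribe a storage layout. The example record reuses position 23995 and disease A from the description, with placeholder sequences.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import DefaultDict, List


@dataclass(frozen=True)
class MutationRecord:
    disease: str    # disease name
    position: int   # sequence position where the X-Y deletion occurred
    original: str   # original base sequence (X-Y-X-Y-X)
    mutated: str    # base sequence after mutation (X-Y-X)


class MutantSequenceTable:
    """Accumulates records and looks them up by disease name."""

    def __init__(self) -> None:
        self._by_disease: DefaultDict[str, List[MutationRecord]] = defaultdict(list)

    def add(self, record: MutationRecord) -> None:
        self._by_disease[record.disease].append(record)

    def for_disease(self, disease: str) -> List[MutationRecord]:
        # .get avoids creating empty entries for unknown diseases.
        return list(self._by_disease.get(disease, []))


table = MutantSequenceTable()
table.add(MutationRecord("disease A", 23995, "ACGTACGTAC", "ACGTAC"))
```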

＜Second embodiment: method for testing disease risk＞
Next, a method for testing whether a test subject (examinee) is at risk for a disease according to the second embodiment is described below. Of the problems addressed by the present invention, the second embodiment addresses that of making it easier to grasp the risk of a disease associated with a gene mutation.

Fig. 9 is a schematic diagram for explaining the disease risk of a test subject in the second embodiment. Suppose that, as shown in Fig. 2, a gene mutation to the base sequence X₁-Y₁-X₁, an example of the second formula, at or near gene position 23995 has been found to be involved in disease A. In that case, as shown in Fig. 9, if the base sequence X₁-Y₁-X₁ is detected at gene position 23995 in the gene sequence of the test subject, the subject is found to be at risk of contracting disease A. In one example of this embodiment, this is used to calculate a risk value representing the risk of disease A.

On the left side of Fig. 10, the base sequence of the first formula: X-Y-X-Y-X in the DNA sequence is preceded by a base sequence abc and followed by a base sequence def. When this DNA is transcribed into RNA, X-Y is skipped from the first formula: X-Y-X-Y-X, and only the partial base sequence X-Y-X is transcribed into RNA. This transcript then acts as if it were a template and converts the target DNA sequence from the first formula: X-Y-X-Y-X to the second formula: X-Y-X. For this reaction to proceed, the total length of the base sequences abc and def flanking the base sequence of the second formula: X-Y-X may be 0 if the base sequence of the second formula: X-Y-X is itself sufficiently long, but it is preferably 1 mer or more, more preferably 20 mer or more, and still more preferably 40 mer or more. As shown on the right side of Fig. 10, an RNA that originates from somewhere other than the above gene position behaves in the same way as the RNA produced from the DNA sequence on the left side of Fig. 10 when the base sequence abc preceding its base sequence of the second formula: X-Y-X and the base sequence def following it match. In this case too, for X-Y to drop out, the total length of the flanking base sequences abc and def may be 0 if the base sequence of the second formula: X-Y-X is itself sufficiently long, but it is preferably 1 mer or more, more preferably 20 mer or more, and still more preferably 40 mer or more.

Fig. 11 is a schematic diagram for explaining another disease risk of a test subject in the second embodiment. As shown in Fig. 11, the inventors have newly discovered that when, for example, the base sequence of the second formula: X₁-Y₁-X₁ is detected at gene position 23821 in the gene sequence of a healthy individual, there is a risk that X₁-Y₁ or Y₁-X₁ drops out of the base sequence of the first formula: X₁-Y₁-X₁-Y₁-X₁ at another gene position, position 23995, so that it mutates to the base sequence of the second formula: X₁-Y₁-X₁. The inventors have also discovered that the larger the transcription amount of the second formula: X₁-Y₁-X₁, the higher the risk that the base sequence of the first formula: X₁-Y₁-X₁-Y₁-X₁ mutates to the base sequence of the second formula: X₁-Y₁-X₁.

If it is known in advance that the mutation to the second formula: X₁-Y₁-X₁ at gene position 23995 is involved in disease A, then the larger the transcription amount of the second formula: X₁-Y₁-X₁, the higher the risk of contracting disease A. In one example of this embodiment, this is used to calculate a risk value representing the risk of disease A.

Here, the transcription amount and an outline of how it is measured are described with reference to Fig. 12. Fig. 12 is a schematic diagram for explaining the transcription amount and its measurement. As shown in Fig. 12, transcription amount information is obtained, for example, from RNA sequencing. To quantify the transcription amounts of all transcripts, RNA, which is the transcript, is first extracted from a sample (for example, a specimen obtained from the test subject). The specimen can be any specimen containing RNA, such as tissue, cells, or a body fluid. Examples of tissue include organs, parts of organs, biopsy specimens, hair, body hair, and skin, and examples of body fluids include parts of organs, blood, serum, plasma, saliva, lymph, cerebrospinal fluid, urine, and semen, but the specimen is not limited to these.
In the second embodiment, the amount of transcription into RNA of the base sequence of the second formula: X-Y-X in the test subject's specimen may be used to determine the risk of contracting the target disease (see Figs. 18A, 18B, 20A, and 20B) or may not be used (see Fig. 19). When the RNA transcription amount is used, any specimen from which the RNA transcription amount can be detected is acceptable. When the RNA transcription amount is not used, there are two options: (1) reading the base sequence of genomic DNA and (2) reading the base sequence of mRNA. In case (1), the specimen contains at least DNA; in case (2), the specimen contains at least RNA.
Next, the RNA isolated from the specimen can be sequenced. In one aspect, the RNA is cut into short fragments, a cDNA library is created from the resulting RNA fragments, and finally a next-generation sequencer is used to read the cDNA one base at a time from both ends. In another aspect, cDNA is synthesized from the RNA and the base sequences of the synthesized DNA fragments are read. A base sequence read from both ends in this way is called a read.

Next, to determine which transcript each read comes from, mapping is performed: the base sequence of each read is compared with the base sequences of the transcripts to find the transcript to which the read aligns. The number of reads mapped to each transcript is then counted (for example, a forward read and its reverse read together count as one). However, the more abundant a transcript is, the more reads it yields, and the longer a transcript is, the more RNA fragments it produces, which also increases the number of reads. The read count therefore cannot be taken directly as the expression level of a transcript. Instead, the read count is normalized, for example by correcting for transcript length, and the resulting value is the transcription amount.
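The length correction described above can be illustrated with transcripts per million (TPM), one common normalization. The description does not specify which normalization is used, so TPM, the function name, and the example numbers are assumptions for illustration only.

```python
from typing import Dict


def tpm(read_counts: Dict[str, float], lengths_bp: Dict[str, float]) -> Dict[str, float]:
    """Convert per-transcript read counts to transcripts per million.

    Each count is first divided by the transcript length, then the
    resulting rates are rescaled so that they sum to one million.
    """
    rates = {name: read_counts[name] / lengths_bp[name] for name in read_counts}
    scale = sum(rates.values())
    if scale == 0:
        return {name: 0.0 for name in rates}
    return {name: rate / scale * 1_000_000 for name, rate in rates.items()}


# Two hypothetical transcripts with equal read counts but different lengths.
values = tpm({"tx_short": 100, "tx_long": 100}, {"tx_short": 1000, "tx_long": 2000})
```

Here the shorter transcript receives twice the normalized value of the longer one, reflecting that the same read count from a shorter transcript implies higher abundance.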

Fig. 13 shows an example of screen transitions on the terminal 1 during testing according to the second embodiment. As shown in Fig. 13, screen G3 displays a radio button RB1 for selecting "Diagnosis by DNA sequence and transcription amount", a radio button RB2 for selecting "Diagnosis by DNA sequence only", and a radio button RB3 for selecting "Diagnosis by transcription amount only". For diagnosis by DNA sequence and transcription amount, screen G3 provides a box B4 that accepts the gene sequence of the test subject (here, a DNA sequence as an example) and a box B5 that accepts gene transcription amount information (for example, RNA transcription amount information). Here, as an example, the DNA sequence is entered when the user presses the file selection button in box B4 and selects a file containing the DNA sequence, and the RNA transcription amount information is entered when the user presses the file selection button in box B5 and selects a file containing the RNA transcription amount information.

For diagnosis by DNA sequence only, a box B6 is provided that accepts the gene sequence of the test subject (here, a DNA sequence as an example). Here, as an example, the DNA sequence is entered when the user presses the file selection button in box B6 and selects a file containing the DNA sequence.

For diagnosis by transcription amount only, a box B7 is provided that accepts transcription amount information for the test subject's genes (for example, RNA transcription amount information). Here, as an example, the RNA transcription amount information is entered when the user presses the file selection button in box B7 and selects a file containing the RNA transcription amount information.

When the send button B8 is pressed on screen G3, the accepted gene sequence of the test subject (e.g., DNA sequence) and/or gene transcription amount information (e.g., RNA transcription amount information) is sent from the terminal 1 to the server 2. The server 2 then executes processing and sends information to the terminal 1, which displays it, for example, as shown in Fig. 13. As a result, a disease risk table is displayed on screen G4. The disease risk table shows pairs of the name of a disease that the test subject is at risk of contracting and the risk value for that disease.

Here, the file containing RNA transcription amount information is described with reference to Fig. 14. Fig. 14 shows an example of the contents of an RNA transcription amount information file. The file shown in Fig. 14 lists, for example, sets of a gene name, the transcription amount of the gene, and the gene position of the gene. The RNA transcription amount information thus includes, for example, a list of such sets. The gene position may be omitted from the RNA transcription amount information if the gene name is present, because the position can be determined from the name; likewise, the gene name may be omitted if the gene position is present, because the name can be determined from the position.

Fig. 15 is a table for explaining an example of a method for calculating a risk value. In Fig. 15, a risk value is associated with each combination of the presence or absence of a gene mutation to X_a-Y_a-X_a at gene position a, the presence or absence of a gene mutation to X_b-Y_b-X_b at gene position b, and the presence or absence of a gene mutation to X_c-Y_c-X_c at gene position c. In the table of Fig. 15, the transcription amount levels of X_a-Y_a-X_a, X_b-Y_b-X_b, and X_c-Y_c-X_c are each expressed on a three-level scale. Let Tn be the transcription amount of a healthy individual, Tp the transcription amount of a patient with the target disease, and Ts the transcription amount of the test subject. At level 1, the test subject's transcription amount Ts is equal to or less than the healthy individual's transcription amount Tn (Ts ≦ Tn). At level 2, Ts exceeds Tn but is below the patient's transcription amount Tp (Tn < Ts < Tp). At level 3, Ts is equal to or greater than Tp (Tp ≦ Ts).
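The three-level scale can be written directly from the inequalities above. The function name and the requirement that Tn be smaller than Tp are assumptions added for this sketch; the numeric inputs in the example are arbitrary.

```python
def transcription_level(ts: float, tn: float, tp: float) -> int:
    """Return the transcription level (1, 2, or 3) of a test subject.

    ts: transcription amount of the test subject (Ts)
    tn: transcription amount of a healthy individual (Tn)
    tp: transcription amount of a patient with the target disease (Tp)
    """
    if tn >= tp:
        # Assumed precondition: the scale only makes sense when Tn < Tp.
        raise ValueError("expected Tn < Tp")
    if ts <= tn:
        return 1  # Ts <= Tn
    if ts < tp:
        return 2  # Tn < Ts < Tp
    return 3      # Tp <= Ts


levels = [transcription_level(ts, tn=10.0, tp=50.0) for ts in (5.0, 10.0, 30.0, 50.0, 80.0)]
```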

Even among patients with the target disease, some have the gene mutation to X_a-Y_a-X_a at gene position a and some do not. Let Pa be the probability that the gene mutation to X_a-Y_a-X_a has occurred at gene position a within the group of patients with the target disease (hereinafter referred to as the mutation occurrence rate). Similarly, let Pb be the mutation occurrence rate of the gene mutation to X_b-Y_b-X_b at gene position b, and let Pc be the mutation occurrence rate of the gene mutation to X_c-Y_c-X_c at gene position c, each within the group of patients with the target disease.

In cases 1 to 4, all three gene mutations have occurred, so the risk value representing the magnitude of the risk of contracting the target disease is expressed as, for example, Pa + Pb + Pc. In this way, the risk value can be calculated from the sequence information alone.

When the transcription amount is at level 1, the transcription amount is small and the probability that a normal X-Y-X-Y-X sequence mutates to X-Y-X is low, so the weight w_L1 by which the mutation occurrence rate is multiplied is set to, for example, 0. When the transcription amount is at level 2, the transcription amount is moderate and the probability that a normal X-Y-X-Y-X sequence mutates to X-Y-X is moderate, so the weight w_L2 is set to, for example, 0.5. When the transcription amount is at level 3, the transcription amount is at least that of patients with the target disease and the probability that a normal X-Y-X-Y-X sequence mutates to X-Y-X is high, so the weight w_L3 is 1.

In case 5, on the other hand, mutations have occurred at gene positions a and b but not at gene position c. Even so, the greater the transcription amount of X_c-Y_c-X_c corresponding to gene position c, the higher the probability that a mutation to X_c-Y_c-X_c will occur at gene position c in the future. Here, the transcription amount level of X_c-Y_c-X_c is 3, so w_L3·Pc is calculated as the risk contribution. The risk value is then expressed as, for example, Pa + Pb + w_L3·Pc. In this way, the risk value is calculated using both the sequence information and the transcription amount information. As described later, the risk value may also be calculated using the transcription amount information alone.

In case 6, a mutation has occurred at gene position b but not at gene positions a and c. Even so, the greater the transcription amount of X_a-Y_a-X_a corresponding to gene position a, the higher the probability that a mutation to X_a-Y_a-X_a will occur at gene position a in the future. Here, for example, the transcription amount level of X_a-Y_a-X_a is 1, so w_L1·Pa is calculated as the risk contribution. Likewise, the greater the transcription amount of X_c-Y_c-X_c corresponding to gene position c, the higher the probability that a mutation to X_c-Y_c-X_c will occur at gene position c in the future. Here, for example, the transcription amount level of X_c-Y_c-X_c is 1, so w_L1·Pc is calculated as the risk contribution. The risk value is then expressed as, for example, w_L1·Pa + Pb + w_L1·Pc. In this way, the risk value is calculated using both the sequence information and the transcription amount information. As described later, the risk value may also be calculated using the transcription amount information alone.

Cases 7 to 9 are examples in which no mutation has occurred at any of gene positions a, b, and c. In this case, the risk contribution from the transcription amount of X_a-Y_a-X_a is w_La·Pa, that from the transcription amount of X_b-Y_b-X_b is w_Lb·Pb, and that from the transcription amount of X_c-Y_c-X_c is w_Lc·Pc. The risk value is then expressed as, for example, w_La·Pa + w_Lb·Pb + w_Lc·Pc. In this way, the risk value is calculated using both the sequence information and the transcription amount information. As described later, the risk value may also be calculated using the transcription amount information alone.
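The case-by-case sums above can be illustrated with a minimal Python sketch. The function name `risk_value`, the tuple layout, and the numeric rates are hypothetical and chosen only for illustration; the weights 0, 0.5, and 1 are the example values given in the text.

```python
# Example weights from the text: level 1 -> 0, level 2 -> 0.5, level 3 -> 1.
LEVEL_WEIGHTS = {1: 0.0, 2: 0.5, 3: 1.0}

def risk_value(positions):
    """Sum the risk contributions of all gene positions.

    positions: iterable of (incidence_rate, mutated, transcription_level).
    """
    total = 0.0
    for rate, mutated, level in positions:
        # A mutation that is already present contributes its full incidence
        # rate; otherwise the rate is scaled by the transcription-level weight.
        total += rate if mutated else LEVEL_WEIGHTS[level] * rate
    return total

# Case 5 with hypothetical rates Pa = 0.2, Pb = 0.3, Pc = 0.4:
# positions a and b are mutated, position c is not and has level 3.
case5 = [(0.2, True, 3), (0.3, True, 2), (0.4, False, 3)]
print(risk_value(case5))
```

With these assumed rates, case 5 evaluates Pa + Pb + w_L3·Pc, and setting every `mutated` flag to `True` reproduces the plain sum Pa + Pb + Pc of cases 1 to 4.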

FIG. 16 is an example of the per-disease gene mutation tables stored in the storage. As shown in FIG. 16, assuming for example that the diseases run from disease 1 to disease N (N is a natural number), N tables, from the gene mutation table TA₁ for disease 1 to the gene mutation table TA_N for disease N, are stored in the storage 23 of the server 2. N1, N2, and N3 are indexes. With i (a natural number) as the disease index, the gene mutation table TA_i for disease i accumulates records, each of which is a set of: a gene associated with disease i; the gene position; the pre-mutation sequence; the post-mutation sequence; the total transcription amount, in healthy individuals, of genes having the X_i-Y_i-X_i mutation; the total transcription amount, in patients with disease i, of genes having the X_i-Y_i-X_i mutation; and the mutation incidence rate of the X_i-Y_i-X_i mutation at that gene position in patients with disease i. These per-disease gene mutation tables are used to calculate the disease risk value from the DNA sequence and the RNA transcription amount in FIG. 18, described later.

FIG. 17 is an example of the per-disease transcription amount tables stored in the storage. As shown in FIG. 17, assuming that the diseases run from disease 1 to disease N, N tables, from the transcription amount table for disease 1 to the transcription amount table for disease N, are stored in the storage 23 of the server 2. The transcription amount table TB_i for disease i accumulates records, each of which is a set of: a gene in which the X_j-Y_j-X_j mutation is associated with disease i; the mutation incidence rate of that X_j-Y_j-X_j mutation; and, for each of the other genes that have the X_j-Y_j-X_j sequence, however many there are, a set consisting of the gene name, the transcription amount in patients, and the transcription amount in healthy individuals. These per-disease transcription amount tables are used to calculate the disease risk value from the RNA transcription amount in FIG. 20, described later.
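One way to picture a single record of each table is the following Python sketch. All class and field names are hypothetical; only the kinds of data per record come from the descriptions of FIG. 16 and FIG. 17. The patient and healthy transcription amounts of the disease-associated gene itself are included in the second record type because step S660, described later, reads them from the transcription amount table.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class MutationRecord:
    """One record of a gene mutation table TA_i (FIG. 16); names are assumed."""
    gene: str
    position: int
    pre_sequence: str                    # X-Y-X-Y-X before mutation
    post_sequence: str                   # X-Y-X after mutation
    healthy_total_transcription: float
    patient_total_transcription: float
    incidence_rate: float                # mutation incidence rate in patients

@dataclass
class OtherGene:
    """Another gene that has the same X-Y-X sequence."""
    name: str
    patient_transcription: float
    healthy_transcription: float

@dataclass
class TranscriptionRecord:
    """One record of a transcription amount table TB_i (FIG. 17); names are assumed."""
    gene: str
    incidence_rate: float
    patient_transcription: float         # read in step S660
    healthy_transcription: float         # read in step S660
    other_genes: List[OtherGene] = field(default_factory=list)

# Hypothetical record: gene A plus two other genes carrying the same sequence.
row = TranscriptionRecord("A", 0.4, 40.0, 10.0,
                          [OtherGene("A'", 20.0, 5.0), OtherGene("A''", 10.0, 5.0)])
```

Keeping the other genes as a nested list mirrors the column-direction index k used in the transcription-only flow.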

FIG. 18A is a flowchart showing an example of a process for displaying a list of the disease risks of a test subject using both the DNA sequence and the RNA transcription amount. FIG. 18B is a continuation of the flowchart in FIG. 18A. The flowcharts in FIGS. 18A and 18B show an example of the process flow for the transition from screen G3 to screen G4 when the radio button RB1, "Diagnosis using DNA sequence and transcription amount", is selected on screen G3 in FIG. 13. In the processes of FIGS. 18A to 20B below, the processing of the terminal 1 is performed by the processor 16 of the terminal 1 and the processing of the server 2 is performed by the processor 26 of the server 2; the processors 16 and 26 are not mentioned explicitly in each step.

(Step S110) The terminal 1 accepts the DNA sequence and the RNA transcription amount information of the test subject. Specifically, for example, when a DNA sequence file is selected in box B4 and an RNA transcription amount information file is selected in box B5 on screen G3 in FIG. 13, the terminal 1 accepts the test subject's DNA sequence and RNA transcription amount information.

(Step S120) Then, for example, when the send button B8 is pressed on screen G3 in FIG. 13, the terminal 1 transmits the DNA sequence and RNA transcription amount information accepted in step S110 to the server 2.

(Step S130) On receiving the DNA sequence and RNA transcription amount information, the server 2 initializes the disease index i and the disease risk table.

(Step S140) Next, the server 2 initializes the X-Y-X sequence index j and the risk value.

(Step S150) Next, the server 2 obtains the list of X-Y-X sequences associated with disease i from the gene mutation table TA_i for disease i stored in the storage 23. For example, in the first loop the disease index i has been initialized to 1 in step S130, so the list of post-mutation X-Y-X sequences is obtained from the gene mutation table TA₁ for disease 1 in FIG. 16.

(Step S160) Next, the server 2 searches the test subject's DNA sequence, at the corresponding sequence position, for the j-th sequence X_j-Y_j-X_j in the list of X-Y-X sequences associated with disease i stored in the storage.

(Step S170) Next, the server 2 determines whether the X_j-Y_j-X_j sequence has been detected at the corresponding sequence position.

(Step S180) If the X_j-Y_j-X_j sequence is detected at the corresponding sequence position in step S170, the server 2 obtains the mutation incidence rate in patients of the j-th X_j-Y_j-X_j sequence from the gene mutation table TA_i for disease i stored in the storage 23.

(Step S190) Following step S180, the server 2 adds the obtained mutation incidence rate of the j-th X_j-Y_j-X_j sequence to the risk value of disease i. The process then proceeds to step S240.

(Step S200) If the X_j-Y_j-X_j sequence is not detected at the corresponding sequence position in step S170, the server 2 searches for the X_j-Y_j-X_j sequence at positions other than the corresponding sequence position.

(Step S210) The server 2 determines whether the X_j-Y_j-X_j sequence has been detected at a position other than the corresponding sequence position.

(Step S220) If the X_j-Y_j-X_j sequence is detected at a position other than the corresponding sequence position in step S210, the server 2 obtains, from the test subject's transcription amount information, the transcription amount of the gene having the detected X_j-Y_j-X_j sequence and adds it to the transcription amount of X_j-Y_j-X_j. The process then returns to step S200 to search for any further X_j-Y_j-X_j sequence at positions other than the corresponding sequence position.

(Step S230) If no (further) X_j-Y_j-X_j sequence is detected at a position other than the corresponding sequence position in step S210, the server 2 obtains from the storage 23 the total transcription amount information, for patients and for healthy individuals, of genes having the detected X_j-Y_j-X_j sequence, calculates the weight w_j by comparing these with the test subject's transcription amount, reads the j-th mutation incidence rate S_j from the gene mutation table TA_i for disease i, estimates the risk contribution of the j-th X_j-Y_j-X_j sequence as w_j·S_j, and adds this risk contribution w_j·S_j to the risk value of disease i. The process then proceeds to step S240.

(Step S240) Next, the server 2 determines whether the gene mutation table TA_i for disease i in the storage 23 contains a record for the next sequence X_j+1-Y_j+1-X_j+1.

(Step S250) If it is determined in step S240 that the gene mutation table TA_i for disease i in the storage 23 contains a record for the next sequence X_j+1-Y_j+1-X_j+1, the server 2 increments the X-Y-X sequence index j by 1. The process then returns to step S160, and the processing from step S160 onward is repeated.

(Step S260) If it is determined in step S240 that the gene mutation table TA_i for disease i in the storage 23 contains no record for the next sequence X_j+1-Y_j+1-X_j+1, the server 2 adds, for example, the name of disease i and its risk value to the disease risk table.

(Step S270) Next, the server 2 determines whether the storage 23 contains a gene mutation table for the next disease i+1.

(Step S280) If it is determined in step S270 that the storage 23 contains a gene mutation table for the next disease i+1, the server 2 increments the disease index i by 1. The process then returns to step S140, and the processing from step S140 onward is repeated for disease i+1.

(Step S290) If it is determined in step S270 that the storage 23 contains no gene mutation table for the next disease i+1, the server 2 transmits to the terminal 1 the information of the disease risk table, which shows the diseases that the test subject is at risk of developing and their risk values.

(Step S300) On receiving the information from the server 2, the terminal 1 uses it to control the display of the disease risk table for the test subject. As a result, for example, screen G4 in FIG. 13 is displayed on the display 17. Because the risk value of each disease is displayed for the test subject, the test subject can grasp the risk of each disease. This ends the processing of this flowchart.
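The server-side loop of steps S130 to S290 can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the patented implementation: the subject's DNA is modelled as a hypothetical mapping from gene name to sequence string, "positions other than the corresponding sequence position" is narrowed to "other genes", the record keys are invented, and the weight thresholds are the example values quoted later for step S740.

```python
def weight(ts, tn, tp):
    """Example thresholds (assumed): 0 if ts <= tn, 0.5 if tn < ts < tp, else 1."""
    if ts <= tn:
        return 0.0
    return 0.5 if ts < tp else 1.0

def disease_risk_table(tables, subject_genes, subject_transcription):
    """tables: {disease: [record, ...]}; each record is a dict with the assumed
    keys gene, offset, seq, rate, healthy_total, patient_total."""
    risk_table = {}                                    # S130
    for disease, records in tables.items():            # S270 / S280
        risk = 0.0                                     # S140
        for rec in records:                            # S240 / S250
            seq, off = rec["seq"], rec["offset"]
            target = subject_genes.get(rec["gene"], "")
            if target[off:off + len(seq)] == seq:      # S160 / S170
                risk += rec["rate"]                    # S180 / S190
                continue
            # S200-S220: total the subject's transcription amounts of the
            # other genes that contain the post-mutation sequence.
            ts = sum(subject_transcription.get(name, 0.0)
                     for name, dna in subject_genes.items()
                     if name != rec["gene"] and seq in dna)
            # S230: weight from comparison with healthy and patient totals.
            risk += weight(ts, rec["healthy_total"], rec["patient_total"]) * rec["rate"]
        risk_table[disease] = risk                     # S260
    return risk_table                                  # sent in S290

# Hypothetical data for one disease with two records.
tables = {"disease 1": [
    {"gene": "A", "offset": 2, "seq": "ACGTAC", "rate": 0.3,
     "healthy_total": 10.0, "patient_total": 50.0},
    {"gene": "B", "offset": 0, "seq": "GGCCGG", "rate": 0.4,
     "healthy_total": 10.0, "patient_total": 50.0},
]}
subject_genes = {"A": "TTACGTACTT", "B": "AAAAAAAA", "C": "TTGGCCGGTT"}
subject_transcription = {"C": 30.0}
print(disease_risk_table(tables, subject_genes, subject_transcription))
```

In this made-up example the first record is found at its expected position and contributes its full rate, while the second is found only in another gene and contributes half its rate.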

FIG. 19 is a flowchart showing an example of a process for displaying a list of the disease risks of a test subject using the DNA sequence. The flowchart in FIG. 19 shows an example of the process flow for the transition from screen G3 to screen G4 when the radio button RB2, "Diagnosis using DNA sequence only", is selected on screen G3 in FIG. 13.

(Step S410) The terminal 1 accepts the DNA sequence of the test subject. Specifically, for example, when a DNA sequence file is selected in box B6 on screen G3 in FIG. 13, the terminal 1 accepts the test subject's DNA sequence.

(Step S420) Then, for example, when the send button B8 is pressed on screen G3 in FIG. 13, the terminal 1 transmits the DNA sequence accepted in step S410 to the server 2.

(Step S430) On receiving the DNA sequence, the server 2 initializes the disease index i and the disease risk table.

(Step S440) Next, the server 2 initializes the X-Y-X sequence index j and the risk value.

(Step S450) Next, the server 2 obtains the list of X-Y-X sequences associated with disease i and the list of corresponding gene positions from the gene mutation table TA_i for disease i stored in the storage 23. For example, in the first loop the disease index i has been initialized to 1 in step S430, so the list of post-mutation X-Y-X sequences and the list of corresponding gene positions are obtained from the gene mutation table TA₁ for disease 1 in FIG. 16.

(Step S460) The server 2 searches the test subject's DNA sequence, at the corresponding gene position, for the j-th sequence X_j-Y_j-X_j in the list of X-Y-X sequences associated with disease i stored in the storage 23.

(Step S470) The server 2 determines whether the X_j-Y_j-X_j sequence has been detected at the corresponding gene position. If the X_j-Y_j-X_j sequence is not detected at the corresponding position in step S470, the process proceeds to step S500.

(Step S480) If the X_j-Y_j-X_j sequence is detected at the corresponding position in step S470, the server 2 obtains the mutation incidence rate in patients of the j-th X_j-Y_j-X_j sequence from the gene mutation table TA_i for disease i stored in the storage 23.

(Step S490) The server 2 increases the risk value of disease i by the mutation incidence rate of the X-Y-X sequence associated with disease i.

(Step S500) If the X_j-Y_j-X_j sequence is not detected at the corresponding position in step S470, or after the processing of step S490, the server 2 determines whether the gene mutation table TA_i for disease i contains a record for the next sequence X_j+1-Y_j+1-X_j+1.

(Step S510) If it is determined in step S500 that the gene mutation table TA_i for disease i contains a record for the next sequence X_j+1-Y_j+1-X_j+1, the server 2 increments the X-Y-X sequence index j by 1. The process then returns to step S460, and the processing from step S460 onward is repeated for the incremented X-Y-X sequence index j.

(Step S520) If it is determined in step S500 that the gene mutation table TA_i for disease i contains no record for the next sequence X_j+1-Y_j+1-X_j+1, the server 2 adds the name of disease i and its risk value to the disease risk table.

(Step S530) Next, the server 2 determines whether the storage 23 contains the gene mutation table TA_i+1 for the next disease i+1.

(Step S540) If it is determined in step S530 that the gene mutation table TA_i+1 for the next disease i+1 exists, the server 2 increments the disease index i by 1. The process then returns to step S440, and the processing from step S440 onward is repeated for the next disease i+1.

(Step S550) If it is determined in step S530 that no gene mutation table TA_i+1 for the next disease i+1 exists, the server 2 transmits to the terminal 1 the information of the disease risk table, which shows the diseases that the test subject is at risk of developing and their risk values.

(Step S560) On receiving the information from the server 2, the terminal 1 uses it to control the display of the disease risk table for the test subject. As a result, for example, screen G4 in FIG. 13 is displayed on the display 17. Because the risk value of each disease is displayed for the test subject, the test subject can grasp the risk of each disease. This ends the processing of this flowchart.
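For a single disease, the DNA-only loop of steps S440 to S520 reduces to a position check and a sum. The following Python sketch assumes a hypothetical record layout of (gene position, post-mutation sequence, mutation incidence rate) and treats the subject's DNA as one string; neither the function name nor the layout comes from the source.

```python
def dna_only_risk(records, subject_dna):
    """records: list of (position, post_mutation_sequence, incidence_rate)."""
    risk = 0.0                                         # S440
    for pos, seq, rate in records:                     # S500 / S510
        if subject_dna[pos:pos + len(seq)] == seq:     # S460 / S470
            risk += rate                               # S480 / S490
    return risk                                        # value stored in S520

# Hypothetical subject DNA and two records; only the first one matches.
subject_dna = "TTACGTACTTGG"
records = [(2, "ACGTAC", 0.3), (0, "GGCC", 0.4)]
print(dna_only_risk(records, subject_dna))
```

Unlike the combined flow, a record whose sequence is absent at its position contributes nothing here, because no transcription amount is available to weight it.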

FIG. 20A is a flowchart showing an example of a process for displaying a list of the disease risks of a test subject using the RNA transcription amount information. FIG. 20B is a continuation of the flowchart in FIG. 20A. The flowcharts in FIGS. 20A and 20B show an example of the process flow for the transition from screen G3 to screen G4 when the radio button RB3, "Diagnosis using transcription amount only", is selected on screen G3 in FIG. 13.

(Step S610) The terminal 1 accepts the RNA transcription amount information of the test subject. Specifically, for example, when a file of the test subject's RNA transcription amount information is selected in box B7 on screen G3 in FIG. 13, the terminal 1 accepts the test subject's RNA transcription amount information.

(Step S620) Then, for example, when the send button B8 is pressed on screen G3 in FIG. 13, the terminal 1 transmits the RNA transcription amount information accepted in step S610 to the server 2.

(Step S630) On receiving the RNA transcription amount information, the server 2 initializes the disease index i and the disease risk table.

(Step S640) Next, the server 2 initializes the gene index j and the risk value. Here, the gene index j is the row-direction index in the transcription amount table TB_i for disease i, as shown in FIG. 17.

(Step S650) Next, the server 2 refers to the transcription amount table TB_i for disease i stored in the storage 23 and obtains the list of genes having an X-Y-X sequence associated with disease i and the list of other genes having that X-Y-X sequence. For example, for disease 1, by referring to the transcription amount table TB₁ for disease 1, gene A in which a mutation to X₁-Y₁-X₁ is seen, gene B in which a mutation to X₂-Y₂-X₂ is seen, gene C in which a mutation to X₃-Y₃-X₃ is seen, and so on are obtained as the "list of genes having an X-Y-X sequence associated with disease i". In addition, for example, by referring to the same table TB₁, gene name A' having X₁-Y₁-X₁, gene name A'' having X₁-Y₁-X₁, gene name B' having X₂-Y₂-X₂, and so on are obtained as the "list of other genes having the X-Y-X sequence".

(Step S660) Next, the server 2 obtains, from the transcription amount table TB_i for disease i in the storage 23, the transcription amounts in patients and in healthy individuals of the j-th gene in the list of genes associated with disease i, that is, the gene in which the mutation to X_j-Y_j-X_j is seen. For example, the transcription amounts in patients and in healthy individuals are obtained for the first gene in the list of genes associated with disease 1, in which the mutation to X₁-Y₁-X₁ is seen.

(Step S670) Next, the server 2 obtains, from the test subject's transcription amount information received from the terminal 1, the transcription amount of the j-th gene having the X_j-Y_j-X_j sequence.

(Step S680) Next, the server 2 determines whether the list of other genes obtained in step S650 contains another gene having the X_j-Y_j-X_j sequence. For example, when i = 1 and j = 1, the transcription amount table TB₁ for disease 1 in the storage 23 contains gene name A' having X₁-Y₁-X₁ and gene name A'' having X₁-Y₁-X₁, so it is determined that the list of other genes contains another gene having the X_j-Y_j-X_j sequence. If it is determined that the list of other genes contains no other gene having the X_j-Y_j-X_j sequence, the process proceeds to step S740.

(Step S690) If it is determined in step S680 that the list of other genes contains another gene having the X_j-Y_j-X_j sequence, the server 2 initializes the index k. Here, the index k is the column-direction index in the transcription amount table TB_i for disease i that starts from the item "other gene having X-Y-X, 1", as shown in FIG. 17.

(Step S700) Following step S690, the server 2 obtains from the storage 23 the transcription amounts in patients and in healthy individuals of the k-th gene having the X_j-Y_j-X_j sequence in the list of other genes obtained in step S650. For example, when i = 1, j = 1, and k = 1, the transcription amount in patients and the transcription amount in healthy individuals of the first gene name A' having X₁-Y₁-X₁ are obtained from the transcription amount table TB₁ for disease 1 in the storage 23. The server 2 adds the obtained transcription amount in patients to the patient transcription amount Tp of genes having the X_j-Y_j-X_j sequence, and adds the obtained transcription amount in healthy individuals to the healthy-individual transcription amount Tn of genes having the X_j-Y_j-X_j sequence.

(Step S710) Following step S700, the server 2 obtains, from the test subject's transcription amount information, the transcription amount of the k-th other gene having the X_j-Y_j-X_j sequence and adds it to the test subject's transcription amount Ts of genes having the X_j-Y_j-X_j sequence.

(Step S720) The server 2 determines whether the list of other genes obtained in step S650 contains a record for a (k+1)-th gene having the X_j-Y_j-X_j sequence.

(Step S730) If it is determined in step S720 that the list of other genes contains a record for a (k+1)-th gene having the X_j-Y_j-X_j sequence, the server 2 increments the index k by 1. The process then returns to step S700, and the processing from step S700 onward is repeated for index k+1.

(Step S740) If it is determined in step S680 that the list of other genes contains no other gene having the X_j-Y_j-X_j sequence (step S680: NO), or if it is determined in step S720 that the list of other genes contains no record for a (k+1)-th gene having the X_j-Y_j-X_j sequence (step S720: NO), the server 2 executes the following processing. The server 2 calculates the weight w_j by comparing the transcription amounts Tp and Tn of patients and healthy individuals with the transcription amount Ts of the test subject, reads the j-th mutation incidence rate S_j from the transcription amount table for disease i, estimates the risk contribution of the j-th X_j-Y_j-X_j sequence as w_j·S_j, and adds this risk contribution to the risk value of disease i. As described above, for example, w_j = 0 is calculated when Ts ≤ Tn, w_j = 0.5 when Tn < Ts < Tp, and w_j = 1 when Tp ≤ Ts.
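Steps S660 to S740 for a single gene j can be sketched in Python as follows. The dictionary keys and function names are hypothetical. The sketch also assumes that Tp, Tn, and Ts start from the values of gene j itself obtained in steps S660 and S670 before the other genes are added, which the text implies but does not state outright. The thresholds are the example values given in step S740.

```python
def weight(ts, tn, tp):
    """Example thresholds from step S740."""
    if ts <= tn:
        return 0.0
    if ts < tp:
        return 0.5
    return 1.0

def risk_contribution(row, subject_transcription):
    """row: dict with assumed keys gene, rate (S_j), patient, healthy, and
    others = [(name, patient_amount, healthy_amount), ...]."""
    tp, tn = row["patient"], row["healthy"]              # S660
    ts = subject_transcription.get(row["gene"], 0.0)     # S670
    for name, patient, healthy in row["others"]:         # S680-S730
        tp += patient                                    # S700
        tn += healthy                                    # S700
        ts += subject_transcription.get(name, 0.0)       # S710
    return weight(ts, tn, tp) * row["rate"]              # S740: w_j * S_j

# Hypothetical row for gene A with two other genes sharing the sequence.
row = {"gene": "A", "rate": 0.4, "patient": 40.0, "healthy": 10.0,
       "others": [("A'", 20.0, 5.0), ("A''", 10.0, 5.0)]}
subject = {"A": 15.0, "A'": 10.0, "A''": 5.0}
print(risk_contribution(row, subject))
```

With these made-up amounts, Tp = 70, Tn = 20, and Ts = 30, so the middle branch applies and the contribution is half of S_j. Summing `risk_contribution` over all rows of one disease's table would give that disease's risk value.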

(Step S750) The server 2 determines whether the gene list obtained in step S650 contains a record for a (j+1)-th gene.

(Step S760) If it is determined in step S750 that the gene list contains a record for a (j+1)-th gene, the server 2 increments the gene index j by 1. The process then returns to step S660, and the processing from step S660 onward is repeated for index j+1.

(Step S770) If it is determined in step S750 that the gene list contains no record for a (j+1)-th gene, the server 2 adds the name of disease i and its risk value to the disease risk table.

(Step S780) Next, the server 2 determines whether the storage 23 contains the gene mutation table TA_i+1 for the next disease i+1.

(Step S790) If it is determined in step S780 that the gene mutation table TA_i+1 for the next disease i+1 exists, the server 2 increments the disease index i by 1. The process then returns to step S640, and the processing from step S640 onward is repeated for the next disease i+1.

(Step S800) If it is determined in step S780 that no gene mutation table TA_i+1 for the next disease i+1 exists, the server 2 transmits to the terminal 1 the information of the disease risk table, which shows the diseases that the test subject is at risk of developing and their risk values.

(Step S810) On receiving the information from the server 2, the terminal 1 uses it to control the display of the disease risk table for the test subject. As a result, for example, screen G4 in FIG. 13 is displayed on the display 17. Because the risk value of each disease is displayed for the test subject, the test subject can grasp the risk of each disease. This ends the processing of this flowchart.

As described above, the information processing system according to the second embodiment is premised on the involvement, in the target disease, of a mutation at a specific gene position from the first formula: X-Y-X-Y-X [where X and Y denote different base sequences, each two or more bases long; Y does not immediately follow the last X, and Y does not appear immediately before the first X] to the base sequence of the second formula: X-Y-X. The information processing system according to the second embodiment includes an execution unit 262 and an output unit 263. The execution unit 262 searches the gene sequence of the test subject for a mutation to the base sequence of the second formula: X-Y-X at the specific gene position, and/or extracts the amount of transcription into RNA of the base sequence of the second formula: X-Y-X in a sample from the test subject. The output unit 263 uses the result of at least one of these operations to output information regarding the risk of the test subject contracting the target disease.

With this configuration, information regarding the test subject's risk of contracting the target disease can be obtained, which makes it easier to grasp the risk of disease associated with gene mutation.

The output unit 263 also compares the transcription amount of the test subject with the transcription amount of healthy individuals and/or patients with the target disease, and uses the comparison result and/or the result of the search to output information regarding the risk of the test subject contracting the target disease.

With this configuration, the test subject's risk of contracting the target disease can be estimated from the result of comparing the test subject's transcription amount with that of healthy individuals and/or patients with the target disease.

As a specific example of this comparison of transcription amounts, the information processing system according to the second embodiment includes a storage 23 that stores the amount of transcription into RNA of the base sequence of the second formula: X-Y-X in samples from healthy individuals and/or patients with the target disease. The output unit 263 refers to the storage 23 to compare the transcription amount of the test subject with the transcription amount of the healthy individuals and/or the patients with the target disease, and uses the comparison result and/or the result of the search to output information regarding the risk of the test subject contracting the target disease.
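One way such a comparison might look is sketched below. The text does not specify the comparison rule, so the use of group means and a nearest-mean decision is an assumption made purely for illustration, and the function name `compare_transcription` and the returned labels are hypothetical.

```python
from statistics import mean
from typing import Optional, Sequence


def compare_transcription(
    subject_amount: float,
    healthy_amounts: Optional[Sequence[float]] = None,
    patient_amounts: Optional[Sequence[float]] = None,
) -> str:
    """Label the subject's X-Y-X transcription amount against stored reference groups."""
    healthy = mean(healthy_amounts) if healthy_amounts else None
    patient = mean(patient_amounts) if patient_amounts else None
    if healthy is not None and patient is not None:
        # Both groups available: report whichever group mean is nearer (assumed rule).
        nearer_patient = abs(subject_amount - patient) < abs(subject_amount - healthy)
        return "closer to patients" if nearer_patient else "closer to healthy"
    if healthy is not None:
        return "above healthy mean" if subject_amount > healthy else "not above healthy mean"
    if patient is not None:
        return "at or above patient mean" if subject_amount >= patient else "below patient mean"
    raise ValueError("at least one reference group is required")
```

The optional arguments mirror the "and/or" in the text: either reference group, or both, may be present in the storage.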

As a specific example of the search performed by the execution unit 262, the system includes a storage 23 that stores, in association with each other, the post-mutation base sequence of the second formula: X-Y-X involved in the target disease and the gene position at which the mutation to that base sequence of the second formula: X-Y-X occurs. The execution unit 262 reads the base sequence of the second formula: X-Y-X and the gene position from the storage 23, and searches the gene sequence of the test subject for a mutation to the read base sequence of the second formula: X-Y-X at the read gene position.
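The lookup described above can be illustrated with a minimal sketch. The record layout `StoredMutation` (a 0-based position plus the X and Y units) and the function names are hypothetical. Treating "mutated" as "X-Y-X is present at the stored position but the longer X-Y-X-Y-X is not" is an assumption about how the two formulas would be distinguished.

```python
from typing import List, NamedTuple


class StoredMutation(NamedTuple):
    """Hypothetical storage record: 0-based gene position plus the X and Y units."""
    position: int
    x: str
    y: str


def has_xyx_mutation(sequence: str, rec: StoredMutation) -> bool:
    """True if X-Y-X starts at rec.position and is not the unmutated X-Y-X-Y-X."""
    second = rec.x + rec.y + rec.x   # second formula: X-Y-X
    first = second + rec.y + rec.x   # first formula: X-Y-X-Y-X
    return sequence.startswith(second, rec.position) and not sequence.startswith(
        first, rec.position
    )


def search(sequence: str, records: List[StoredMutation]) -> List[StoredMutation]:
    """Return the stored records whose X-Y-X mutation is present in the sequence."""
    return [r for r in records if has_xyx_mutation(sequence, r)]
```

For example, with X = `AC` and Y = `GT`, the sequence `TTACGTACTTT` contains the mutated form at position 2, whereas `TTACGTACGTACTT` still carries the original X-Y-X-Y-X and is not reported.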

The total length of the base sequences before and after the base sequence of the second formula: X-Y-X is preferably 40 mer or more. This allows sequences with a high probability of gene mutation to be taken into account in risk estimation as gene mutations involved in disease, which helps ensure the accuracy of the disease risk estimate.

In FIGS. 18A to 20B, the output unit outputs information regarding the risk of the test subject contracting the target disease; however, this is not limiting, and the output unit may instead output information regarding gene mutation. Here, the information regarding gene mutation may be, for example, the possibility or risk of a gene mutation, information alerting the user that a gene mutation may occur in the future, the possibility or risk of a mutation to the base sequence of the second formula: X-Y-X, or information alerting the user that a mutation to the base sequence of the second formula: X-Y-X may occur. The information regarding gene mutation may also be, for example, information on the base sequences of the first formula: X-Y-X-Y-X and/or the second formula: X-Y-X listed in order of transcription amount, information on only those base sequences of the first formula: X-Y-X-Y-X and/or the second formula: X-Y-X whose transcription amount exceeds a reference value, or information on the base sequences of the first formula: X-Y-X-Y-X and/or the second formula: X-Y-X down to a predetermined rank in transcription amount.

In this case, for example, the larger the amount of transcription into RNA of the base sequence of the second formula: X-Y-X in the sample from the test subject, the more the output unit may output, as an example of information regarding gene mutation, "information indicating that a mutation to the base sequence of the second formula: X-Y-X is highly likely to occur." Because this reveals the possibility of a mutation to the base sequence of the second formula: X-Y-X, it becomes easier to grasp the possibility that a gene mutation will occur.
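The three output variants mentioned above (ordering by transcription amount, keeping only sequences above a reference value, and cutting off at a predetermined rank) could be combined in a small helper such as the following. The function name `select_sequences` and its parameters are hypothetical; this is a sketch of the selection logic only, not of how the output unit is actually implemented.

```python
from typing import Dict, List, Optional, Tuple


def select_sequences(
    amounts: Dict[str, float],
    threshold: Optional[float] = None,
    top_n: Optional[int] = None,
) -> List[Tuple[str, float]]:
    """Rank sequences by transcription amount, optionally filtering the result."""
    # Highest transcription amount first.
    ranked = sorted(amounts.items(), key=lambda kv: kv[1], reverse=True)
    if threshold is not None:
        # Keep only sequences whose amount exceeds the reference value.
        ranked = [(seq, amt) for seq, amt in ranked if amt > threshold]
    if top_n is not None:
        # Keep only sequences down to the predetermined rank.
        ranked = ranked[:top_n]
    return ranked
```

Calling it with neither `threshold` nor `top_n` yields the plain ordering by transcription amount; supplying either argument gives the corresponding filtered variant.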

At least part of the information processing system S described in the above embodiments may be implemented in hardware or in software. When implemented in software, a program that realizes at least some of the functions of the information processing system S may be stored on a recording medium such as a flexible disk or CD-ROM and read and executed by a computer. The recording medium is not limited to removable media such as magnetic disks and optical disks, and may be a fixed recording medium such as a hard disk device or memory.

A program that realizes at least some of the functions of the information processing system S may also be distributed via a communication line (including wireless communication) such as the Internet. Furthermore, the program may be encrypted, modulated, or compressed and then distributed via a wired or wireless line such as the Internet, or stored on a recording medium for distribution.

Furthermore, the information processing system S may be operated by one or more information processing devices. When multiple information processing devices are used, one of them may be a computer, and that computer may execute a predetermined program to realize the function of at least one means of the information processing system S.

Some or all of the functions of the information processing system S may also be executed by the terminal 1.

The present invention is not limited to the above embodiments as they stand; at the implementation stage, the components can be modified and embodied without departing from the gist of the invention. Various inventions can also be formed by appropriately combining the components disclosed in the above embodiments. For example, some components may be deleted from the full set of components shown in an embodiment. Furthermore, components from different embodiments may be combined as appropriate.

Reference Signs List
1 Terminal
11 Input interface
12 Communication circuit
13 Storage
14 Memory
15 Output interface
16 Processor
17 Display
2 Server
21 Input interface
22 Communication circuit
23 Storage
24 Memory
25 Output interface
26 Processor
261 Extraction unit
262 Search unit
263 Output unit
264 Communication control unit
S Information processing system

Claims

2. The information processing system according to claim 1, comprising a storage that stores, in association with each other, the post-mutation base sequence of the second formula: X-Y-X involved in the target disease and the gene position at which the mutation to that base sequence of the second formula: X-Y-X involved in the target disease occurs,
wherein the execution unit reads the base sequence of the second formula: X-Y-X and the gene position from the storage, and searches the gene sequence of the test subject for a mutation to the read base sequence of the second formula: X-Y-X at the read gene position.

3. The information processing system according to claim 1 or 2, wherein the total length of the base sequences before and after the base sequence of the second formula: X-Y-X is 40 mer or more.