JP7660782B1

Movatterモバイル変換

Info

Publication number: JP7660782B1
Application number: JP2025507750A
Authority: JP
Inventors: 秀幸鈴木; ティボヴェルモデロシュ; 廉枋桑原
Original assignee: Hirata Corp
Current assignee: Hirata Corp
Priority date: 2024-09-30
Filing date: 2024-09-30
Publication date: 2025-04-11
Anticipated expiration: 2044-09-30

Abstract

情報処理装置は、相関ネットワーク解析を行う対象の多変量データが有する要素のうちシード要素との相関が上位の非シード要素から前記シード要素のモジュールを構成する際に、前記シード要素との相関が上位の非シード要素の中から偽陽性の非シード要素を取り除くＦＰＯ解析処理を、前記シード要素のモジュールに組み入れる前記シード要素以外の要素の数であるランクの下限値から上限値まで、ランク毎に実行するＦＰＯ解析部と、要素別に、ランク毎に最良のモジュールＦスコアのモジュールの構成情報を格納するモジュール情報記憶部と、ランクの下限値から上限値までの各ランクにおける最良のモジュールＦスコアに基づいてＦＰＯ解析結果の最適モジュールを選定するＦＰＯモジュール選定部と、を備える。The information processing device includes an FPO analysis unit that performs an FPO analysis process for each rank from a lower limit to an upper limit of a rank, which is the number of elements other than the seed element to be incorporated into the module of the seed element, to remove false positive non-seed elements from among the non-seed elements that have a high correlation with the seed element among the elements contained in the multivariate data that is the target of correlation network analysis, when constructing a module of the seed element from the non-seed elements that have a high correlation with the seed element; a module information storage unit that stores module configuration information of the best module F score for each rank, by element; and an FPO module selection unit that selects an optimal module of the FPO analysis results based on the best module F score at each rank from the lower limit to the upper limit of the rank.

Description

Translated fromJapanese

本発明は、影響しあう複数の構成要素の関係の解析を行うこと、および特定の効果に影響の高いモジュールを特定し、候補の抽出を行う情報処理装置、情報処理方法、コンピュータプログラム、及びコンピュータ読み取り可能な記録媒体に関する。The present invention relates to an information processing device, an information processing method, a computer program, and a computer-readable recording medium that analyze the relationships between multiple influencing components, identify modules that have a high influence on a particular effect, and extract candidates.

特許文献１には、オミックス解析で取得された網羅的分子情報である多変量データを相関ネットワーク解析するための技術が記載されている。非特許文献１には、ビッグデータ等の多変量データを相関ネットワーク解析するための技術が記載されている。Patent Document 1 describes a technique for performing correlation network analysis on multivariate data, which is comprehensive molecular information obtained by omics analysis. Non-PatentDocument 1 describes a technique for performing correlation network analysis on multivariate data such as big data.

特許第６３１８３３４号公報Patent No. 6318334

萬年一斗、尾形善之、鈴木秀幸、“多変量解析に一石を投じる－ConfeitoGUIplusの開発”、生物工学会誌、第95巻第7号、２０１７年７月Kazuto Mannen, Yoshiyuki Ogata, Hideyuki Suzuki, "A new direction in multivariate analysis - Development of ConfeitoGUIplus", Journal of Bioengineering, Vol. 95, No. 7, July 2017

しかし、特許文献１や非特許文献１に記載された技術では、解析精度が不十分であった。However, the techniques described inPatent Document 1 andNon-Patent Document 1 had insufficient analytical accuracy.

本発明の目的は、多変量データに対する相関ネットワーク解析の精度向上を図ることにある。The object of the present invention is to improve the accuracy of correlation network analysis for multivariate data.

本発明の一態様は、相関ネットワーク解析を行う対象の多変量データが有する要素のうちシード要素との相関が上位の非シード要素から前記シード要素のモジュールを構成する際に、前記シード要素との相関が上位の非シード要素の中から偽陽性の非シード要素を取り除くＦＰＯ解析処理を、前記シード要素のモジュールに組み入れる前記シード要素以外の要素の数であるランクの下限値から上限値まで、ランク毎に実行するＦＰＯ解析部と、要素別に、ランク毎に最良のモジュールＦスコアのモジュールの構成情報を格納するモジュール情報記憶部と、ランクの下限値から上限値までの各ランクにおける最良のモジュールＦスコアに基づいてＦＰＯ解析結果の最適モジュールを選定するＦＰＯモジュール選定部と、を備える情報処理装置である。One aspect of the present invention is an information processing device that includes an FPO analysis unit that performs an FPO analysis process for each rank from a lower limit to an upper limit of a rank, which is the number of elements other than the seed elements to be incorporated into the module of the seed element, to remove false positive non-seed elements from among the non-seed elements that have a high correlation with the seed element among elements contained in multivariate data that is the subject of a correlation network analysis, when constructing a module of the seed element from the non-seed elements that have a high correlation with the seed element; a module information storage unit that stores module configuration information of the best module F score for each rank by element; and an FPO module selection unit that selects an optimal module of the FPO analysis results based on the best module F score at each rank from the lower limit to the upper limit of the rank.

本発明の一態様は、情報処理装置が実行する情報処理方法であって、相関ネットワーク解析を行う対象の多変量データが有する要素のうちシード要素との相関が上位の非シード要素から前記シード要素のモジュールを構成する際に、前記シード要素との相関が上位の非シード要素の中から偽陽性の非シード要素を取り除くＦＰＯ解析処理を、前記シード要素のモジュールに組み入れる前記シード要素以外の要素の数であるランクの下限値から上限値まで、ランク毎に実行するＦＰＯ解析ステップと、要素別に、ランク毎に最良のモジュールＦスコア値のモジュールの構成情報を格納するモジュール情報記憶ステップと、ランクの下限値から上限値までの各ランクにおける最良のモジュールＦスコアに基づいてＦＰＯ解析結果の最適モジュールを選定するＦＰＯモジュール選定ステップと、を含む情報処理方法である。One aspect of the present invention is an information processing method executed by an information processing device, which includes an FPO analysis step in which, when constructing a module of a seed element from non-seed elements having a high correlation with a seed element among elements contained in multivariate data that is the subject of correlation network analysis, an FPO analysis process is performed for each rank from a lower limit value to an upper limit value of a rank, which is the number of elements other than the seed element to be incorporated into the module of the seed element, to remove false positive non-seed elements from among the non-seed elements having a high correlation with the seed element; a module information storage step in which, for each element, module configuration information of the best module F-score value for each rank is stored; and an FPO module selection step in which an optimal module of the FPO analysis results is selected based on the best module F-score at each rank from the lower limit value to the upper limit value of the rank.

本発明の一態様は、コンピュータに、相関ネットワーク解析を行う対象の多変量データが有する要素のうちシード要素との相関が上位の非シード要素から前記シード要素のモジュールを構成する際に、前記シード要素との相関が上位の非シード要素の中から偽陽性の非シード要素を取り除くＦＰＯ解析処理を、前記シード要素のモジュールに組み入れる前記シード要素以外の要素の数であるランクの下限値から上限値まで、ランク毎に実行するＦＰＯ解析ステップと、要素別に、ランク毎に最良のモジュールＦスコアのモジュールの構成情報を格納するモジュール情報記憶ステップと、ランクの下限値から上限値までの各ランクにおける最良のモジュールＦスコアに基づいてＦＰＯ解析結果の最適モジュールを選定するＦＰＯモジュール選定ステップと、を実行させるためのコンピュータプログラムである。One aspect of the present invention is a computer program for causing a computer to execute an FPO analysis step in which, when constructing a module of a seed element from non-seed elements having a high correlation with a seed element among elements contained in multivariate data that is the subject of a correlation network analysis, an FPO analysis process is executed for each rank from a lower limit value to an upper limit value of a rank, which is the number of elements other than the seed element to be incorporated into the module of the seed element, to remove false positive non-seed elements from among the non-seed elements having a high correlation with the seed element; a module information storage step for storing module configuration information of the best module F score for each rank by element; and an FPO module selection step for selecting an optimal module of the FPO analysis results based on the best module F score at each rank from the lower limit value to the upper limit value of the rank.

本発明の一態様は、コンピュータに、相関ネットワーク解析を行う対象の多変量データが有する要素のうちシード要素との相関が上位の非シード要素から前記シード要素のモジュールを構成する際に、前記シード要素との相関が上位の非シード要素の中から偽陽性の非シード要素を取り除くＦＰＯ解析処理を、前記シード要素のモジュールに組み入れる前記シード要素以外の要素の数であるランクの下限値から上限値まで、ランク毎に実行するＦＰＯ解析ステップと、要素別に、ランク毎に最良のモジュールＦスコアのモジュールの構成情報を格納するモジュール情報記憶ステップと、ランクの下限値から上限値までの各ランクにおける最良のモジュールＦスコアに基づいてＦＰＯ解析結果の最適モジュールを選定するＦＰＯモジュール選定ステップと、を実行させるためのコンピュータプログラムを記録したコンピュータ読み取り可能な記録媒体である。One aspect of the present invention is a computer-readable recording medium having recorded thereon a computer program for causing a computer to execute an FPO analysis step in which, when constructing a module of a seed element from non-seed elements having a high correlation with a seed element among elements contained in multivariate data that is the subject of a correlation network analysis, an FPO analysis process is performed for each rank from a lower limit value to an upper limit value of a rank, which is the number of elements other than the seed element to be incorporated into the module of the seed element, to remove false positive non-seed elements from among the non-seed elements having a high correlation with the seed element; a module information storage step for storing module configuration information of the best module F score for each rank by element; and an FPO module selection step for selecting an optimal module of the FPO analysis results based on the best module F score at each rank from the lower limit value to the upper limit value of the rank.

本発明によれば、多変量データに対する相関ネットワーク解析の精度向上を図ることができるという効果が得られる。According to the present invention, it is possible to improve the accuracy of correlation network analysis for multivariate data.

一実施形態に係る情報処理装置の概略の構成例を示す図である。FIG. 1 is a diagram illustrating an example of a schematic configuration of an information processing device according to an embodiment.一実施形態に係る情報処理方法の手順の一例を示すフローチャートである。1 is a flowchart illustrating an example of a procedure of an information processing method according to an embodiment.一実施形態に係る多変量データの構成例を示す図である。FIG. 2 is a diagram illustrating an example of a configuration of multivariate data according to an embodiment.一実施形態に係る要素間の相関係数の一例を説明するための図である。FIG. 11 is a diagram for explaining an example of a correlation coefficient between elements according to an embodiment.一実施形態に係る要素間の相関係数の一例を説明するための図である。FIG. 11 is a diagram for explaining an example of a correlation coefficient between elements according to an embodiment.一実施形態に係るＦＰＯモジュール選定方法を説明するための図である。1 is a diagram for explaining an FPO module selection method according to an embodiment;一実施形態に係るＦＰＯモジュール選定方法を説明するための図である。1 is a diagram for explaining an FPO module selection method according to an embodiment;一実施形態に係るＦＰＯ解析の一例を説明するための図である。FIG. 13 is a diagram for explaining an example of FPO analysis according to an embodiment.一実施形態に係る最終要素Ｆスコアの一例を説明するための図である。FIG. 13 is a diagram for explaining an example of a final element F-score according to an embodiment.一実施形態に係る最終要素Ｆスコアの一例を説明するための図である。FIG. 13 is a diagram for explaining an example of a final element F-score according to an embodiment.一実施形態に係る最終要素Ｆスコアの算出方法の一例を説明するための図である。FIG. 13 is a diagram for explaining an example of a method for calculating a final element F-score according to an embodiment.一実施形態に係る統合処理の一例を説明するための図である。FIG. 11 is a diagram illustrating an example of an integration process according to an embodiment.一実施形態に係る統合処理の一例を説明するための図である。FIG. 11 is a diagram illustrating an example of an integration process according to an embodiment.一実施形態に係る最終的ＦＰＯモジュールの統合結果の例を示す図である。FIG. 13 illustrates an example of a final FPO module integration result according to one embodiment.図８の最終的なＦＰＯモジュールの統合結果に対応するＦＰＯモジュールの統合結果の例を示す図である。9 is a diagram showing an example of an FPO module integration result corresponding to the final FPO module integration result of FIG. 8 .図８の最終的なＦＰＯモジュールの距離指標の計算の一例を説明するための図である。FIG. 9 is a diagram for explaining an example of calculation of a distance metric of the final FPO module of FIG. 8 .一実施形態に係るＦＮＩ解析処理を説明するための図である。FIG. 13 is a diagram for explaining an FNI analysis process according to an embodiment.再帰公式の有無による要素Ｐ値の解析時間を比較した結果の例を示す図である。FIG. 13 is a diagram showing an example of a result of comparing analysis times for element P values with and without a recursive formula;

次に、本実施形態の情報処理装置、情報処理方法及びコンピュータプログラムを、図面を参照しつつ説明する。以下で説明する実施形態は一例に過ぎず、本発明が適用される実施形態は、以下の実施形態に限られない。なお、実施形態を説明するための全図において、同一の機能を有するものは同一符号を用い、繰り返しの説明は省略する。
また、本願でいう「ＸＸに基づいて」とは、「少なくともＸＸに基づく」ことを意味し、ＸＸに加えて別の要素に基づく場合も含む。また、「ＸＸに基づいて」とは、ＸＸを直接に用いる場合に限定されず、ＸＸに対して演算や加工が行われたものに基づく場合も含む。「ＸＸ」は、任意の要素（例えば、任意の情報）である。 Next, the information processing device, the information processing method, and the computer program of the present embodiment will be described with reference to the drawings. The embodiment described below is merely an example, and the embodiment to which the present invention is applied is not limited to the following embodiment. In all the drawings for explaining the embodiment, the same reference numerals are used for the parts having the same functions, and the repeated description will be omitted.
In addition, "based on XX" in this application means "based on at least XX," and includes cases where it is based on other elements in addition to XX. Furthermore, "based on XX" is not limited to cases where XX is directly used, but also includes cases where it is based on XX that has been calculated or processed. "XX" is any element (for example, any information).

本実施形態に係る用語の説明を以下の表１、表２及び表３に示す。Explanations of the terms used in this embodiment are shown in Tables 1, 2 and 3 below.

図１は、本実施形態に係る情報処理装置１の概略の構成例を示す図である。図１において、情報処理装置１は、入出力部１１と、記憶部１２と、制御部１３とを備える。Figure 1 is a diagram showing an example of the general configuration of aninformation processing device 1 according to this embodiment. In Figure 1, theinformation processing device 1 includes an input/output unit 11, amemory unit 12, and acontrol unit 13.

入出力部１１は、情報処理装置１の外部の装置との間で、データの入力の受付及びデータの出力を行う。データの入出力方法は限定されない。入出力部１１は、例えば通信回線を介して通信によりデータを送受信してもよい。入出力部１１は、例えば記録媒体を介してデータを入出力してもよい。The input/output unit 11 accepts data input and outputs data between theinformation processing device 1 and devices external to theinformation processing device 1. The method of inputting and outputting data is not limited. The input/output unit 11 may transmit and receive data by communication, for example, via a communication line. The input/output unit 11 may input and output data, for example, via a recording medium.

記憶部１２は、制御部１３が実行するプログラムや制御部１３が使用するデータ等の各種のデータを記憶する。なお、記憶部１２は、ハードディスク装置や光磁気ディスク装置、フラッシュメモリ等の不揮発性のメモリや、ＣＤ－ＲＯＭ等の読み出しのみが可能な記録媒体、ＤＲＡＭ（Dynamic Random Access Memory）等の揮発性のメモリ、あるいはこれらの組み合わせにより構成されるものとする。Thememory unit 12 stores various data such as programs executed by thecontrol unit 13 and data used by thecontrol unit 13. Thememory unit 12 is configured from a hard disk device, a magneto-optical disk device, a non-volatile memory such as a flash memory, a read-only recording medium such as a CD-ROM, a volatile memory such as a dynamic random access memory (DRAM), or a combination of these.

制御部１３は、ＣＰＵ（Central Processing Unit：中央演算処理装置）を備え、記憶部１２に記憶されたプログラムを実行することによって各種の機能を実現する。本実施形態に係る制御部１３が記憶部１２に記憶された相関ネットワーク解析プログラム１２１を実行することによって、相関係数算出部１３１、ＦＰＯ解析部１３２、ＦＰＯモジュール選定部１３３、ＦＰＯモジュール統合部１３４、ＦＮＩ解析部１３５、ＦＮＩアサイン処理部１３６、Ｐ値算出部１３７、及び出力データ生成部１３８の機能が実現される。Thecontrol unit 13 includes a CPU (Central Processing Unit) and realizes various functions by executing programs stored in thememory unit 12. When thecontrol unit 13 according to this embodiment executes the correlationnetwork analysis program 121 stored in thememory unit 12, the functions of a correlationcoefficient calculation unit 131, anFPO analysis unit 132, an FPOmodule selection unit 133, an FPOmodule integration unit 134, anFNI analysis unit 135, an FNIassignment processing unit 136, a P-value calculation unit 137, and an outputdata generation unit 138 are realized.

なお、情報処理装置１として、汎用のコンピュータ装置を使用して構成してもよく、又は、専用のハードウェア装置として構成してもよい。例えば、情報処理装置１は、インターネット等の通信ネットワークに接続されるサーバコンピュータを使用して構成されてもよい。また、情報処理装置１の各機能はクラウドコンピューティングにより実現されてもよい。また、情報処理装置１は、単独のコンピュータにより実現するものであってもよく、又は情報処理装置１の機能を複数のコンピュータに分散させて実現するものであってもよい。また、情報処理装置１として、例えばＷＷＷシステム等を利用してウェブサイトを開設するように構成してもよい。Theinformation processing device 1 may be configured using a general-purpose computer device, or may be configured as a dedicated hardware device. For example, theinformation processing device 1 may be configured using a server computer connected to a communication network such as the Internet. Furthermore, each function of theinformation processing device 1 may be realized by cloud computing. Furthermore, theinformation processing device 1 may be realized by a single computer, or may be realized by distributing the functions of theinformation processing device 1 among multiple computers. Furthermore, theinformation processing device 1 may be configured to open a website, for example, by using a WWW system or the like.

記憶部１２は、相関係数リスト１２２、要素別にランク毎のモジュール情報１２３、モジュールＦスコア（ＭＦ）及び要素Ｆスコア（ＶＦ）閾値毎のモジュール統合情報１２４、最終モジュール統合情報１２５、対象ステップ毎の評価値情報１２６、最終モジュール情報１２７、及びＦＰＯ解析済モジュールリスト１２８をそれぞれ格納する記憶領域を有する。なお、それぞれの情報に対してハッシュ値を計算し付加することによって、記憶部１２に格納された情報の検索を高速にできるようにしてもよい。Thememory unit 12 has a memory area for storing acorrelation coefficient list 122, module information for each rank byelement 123, module integration information for each module F score (MF) and element F score (VF)threshold 124, finalmodule integration information 125, evaluation value information for eachtarget step 126,final module information 127, and FPO analyzedmodule list 128. Note that a hash value may be calculated and added to each piece of information to speed up searching for information stored in thememory unit 12.

次に図２を参照して本実施形態に係る情報処理方法を説明する。図２は、本実施形態に係る情報処理方法の手順の一例を示すフローチャートである。Next, the information processing method according to this embodiment will be described with reference to Figure 2. Figure 2 is a flowchart showing an example of the procedure of the information processing method according to this embodiment.

（ステップＳ１）入出力部１１は、入力データを受付ける。入力データは、相関ネットワーク解析を行う対象の多変量データである。入力データは、例えば、オミックス解析で取得された網羅的分子情報であってもよい。入力データは、例えば、不良品の原因分析を目的として製品の製造及び検査の工程で取得された製品に関する情報であってもよい。入力データは、例えば、ビッグデータであってもよい。入力データは、例えば、時系列データであってもよい。(Step S1) The input/output unit 11 accepts input data. The input data is multivariate data on which correlation network analysis is performed. The input data may be, for example, comprehensive molecular information obtained by omics analysis. The input data may be, for example, information about a product obtained during the manufacturing and inspection process of the product for the purpose of analyzing the cause of defective products. The input data may be, for example, big data. The input data may be, for example, time series data.

なお、本実施形態に係る相関ネットワーク解析の対象の多変量データは、その取得経路や取得方法や種類や属性等などに特に制限はない。例えば、オミックス解析で得られた多変量データを挙げることができる。「オミックス解析」とは、一般に、個々の網羅的分子情報を統合解析することを意味し、網羅的分子情報の代表的例として遺伝子の転写物に関する網羅的情報であるトランスクリトームデータ、代謝産物に関する網羅的情報であるメタボロームデータ等を挙げることができる。これらの網羅的データは、任意の方法や手段、例えば、各種遺伝子解析、遺伝子発現解析、並びに、液体クロマトグラフィー質量分析（ＬＣ－ＭＳ）、ガスクロマトグラフィー質量分析（ＧＣ－ＭＳ）およびキャピラリー電気泳動－質量分析（ＣＥ－ＭＳ）等の各種質量分析等によって取得することができる。更に、これら網羅的データの取得源に特に制限はなく、様々な種類の動植物・微生物・細菌由来の、部位、器官、組織および細胞等を挙げることができる。更に、多変量データは或る環境から取得した試料および人工製造物（例えば、加工食品等）等から任意の方法で取得された情報でもよい。The multivariate data to be subjected to the correlation network analysis according to this embodiment is not particularly limited in terms of its acquisition route, acquisition method, type, attributes, etc. For example, multivariate data obtained by omics analysis can be cited. "Omics analysis" generally means an integrated analysis of individual comprehensive molecular information, and representative examples of comprehensive molecular information include transcriptome data, which is comprehensive information on gene transcripts, and metabolome data, which is comprehensive information on metabolites. These comprehensive data can be obtained by any method or means, for example, various gene analyses, gene expression analyses, and various mass analyses such as liquid chromatography mass spectrometry (LC-MS), gas chromatography mass spectrometry (GC-MS), and capillary electrophoresis-mass spectrometry (CE-MS). Furthermore, there is no particular limit to the source of acquisition of these comprehensive data, and examples include parts, organs, tissues, and cells derived from various types of animals, plants, microorganisms, and bacteria. Furthermore, the multivariate data may be information obtained by any method from samples obtained from a certain environment and artificial products (e.g., processed foods, etc.).

入力データの形式は、複数の要素それぞれに対応する定量値から成る複数のデータセットを有すればよい。図３は、本実施形態に係る多変量データ（入力データ）の構成例を示す図である。入力データは、図３に例示されるように、各要素に対応する行と、各データセットに対応する列とから構成される定量値テーブルであってもよい。図３に示される定量値テーブルおいて、第１列は要素名（例えば要素の識別子（図３中のＩＤ００１、ＩＤ００２等））であり、第１行はデータセット名（図３中のＤＡＴＡ＿ａ、ＤＡＴＡ＿ｂ等）であり、ｘ行ｙ列（ｘ及びｙは２以上の整数）には要素ｘのデータセットｙの定量値が格納される。例えば、３行３列のデータ「ｄａｔａ＿ｂ００２」は、要素「ＩＤ００２」のデータセット「ＤＡＴＡ＿ｂ」の定量値である。The input data may be in the form of a plurality of data sets each consisting of a plurality of quantitative values corresponding to each of the elements. FIG. 3 is a diagram showing an example of the configuration of multivariate data (input data) according to this embodiment. The input data may be a quantitative value table consisting of rows corresponding to each element and columns corresponding to each data set, as exemplified in FIG. 3. In the quantitative value table shown in FIG. 3, the first column is the element name (e.g., element identifier (ID001, ID002, etc. in FIG. 3)), the first row is the data set name (DATA_a, DATA_b, etc. in FIG. 3), and the quantitative value of data set y of element x is stored in x row and y column (x and y are integers of 2 or more). For example, the data "data_b002" inrow 3 andcolumn 3 is the quantitative value of data set "DATA_b" of element "ID002".

（ステップＳ２）相関係数算出部１３１は、入力データが有する定量値に基づいて、要素間の相関係数を算出する。相関係数算出部１３１は、入力データが有する全要素に関して、要素毎に、相関係数の降順にソートした相関係数リスト１２２を生成する。これにより、入力データが有する全要素に関してそれぞれの相関係数リスト１２２が生成される。要素ｘの相関係数リスト１２２は、要素ｘとの相関係数が大きい方から順番に、要素名と相関係数の組がソートされたリストである。相関係数リスト１２２が生成した各要素の相関係数リスト１２２は記憶部１２に格納される。(Step S2) The correlationcoefficient calculation unit 131 calculates the correlation coefficient between elements based on the quantitative values of the input data. The correlationcoefficient calculation unit 131 generates acorrelation coefficient list 122 for all elements of the input data, sorting the correlation coefficients in descending order for each element. In this way, correlation coefficient lists 122 are generated for all elements of the input data. Thecorrelation coefficient list 122 for element x is a list in which pairs of element names and correlation coefficients are sorted in descending order of correlation coefficient with element x. The correlation coefficient lists 122 for each element generated by thecorrelation coefficient list 122 are stored in thememory unit 12.

本実施形態に係る要素間の相関係数は、例えば、ピアソンの積率相関係数、スピアマンの順位相関係数およびコサイン類似度のいずれかの指標である。要素間の相関係数は、－１以上１以下の実数である。The correlation coefficient between elements in this embodiment is, for example, any one of the following indices: Pearson's product-moment correlation coefficient, Spearman's rank correlation coefficient, and cosine similarity. The correlation coefficient between elements is a real number between -1 and 1.

なお、本実施形態に係る要素間の相関係数としては、負の相関係数を「０」に置き換えてもよいし、負の相関係数を「０」に置き換えなくてもよい。負の相関係数を「０」に置き換えない場合は、「負の相関」および「無相関」を判別できる。このため、他の要素に対して強い正の相関があり、且つ同じモジュールに負の相関を有する２つの要素が所属する場合でも、それらを対応させることができる。このため、モジュールを解釈しやすくできる。In the correlation coefficient between elements according to this embodiment, negative correlation coefficients may or may not be replaced with "0". When negative correlation coefficients are not replaced with "0", "negative correlation" and "no correlation" can be distinguished. Therefore, even if two elements that have a strong positive correlation with another element and a negative correlation belong to the same module, they can be made to correspond to each other. This makes it easier to interpret the modules.

図４Ａおよび図４Ｂは、本実施形態に係る要素間の相関係数の一例を説明するための仮想のイメージ図である。図４Ａはモジュールの一例を示し、要素が丸で示される。特定の相関係数以上の値の要素間を線で繋いでいる。要素には、シード要素、モジュール内要素が含まれる。丸の中に示される記号は、要素の識別情報である。図４Ａに示される例では、要素Ｘと要素Ｙとの相関係数ρ_ＸＹは零より大きく、要素Ｘと要素Ｚとの相関係数ρ_ＸＺは零より大きい場合を考える。この場合、要素Ｙと要素Ｚとの相関係数ρ_ＹＺは零より大きいか否かは不明である。図４Ｂは、相関係数ρ_ＹＺが取り得る値のイメージ図を示す。図４Ｂによれば、相関係数ρ_ＹＺは、零よりも大きい値を取り得ることも、零未満の値を取り得ることも、零よりも大きいか零未満であるか予測不明であることもあることが分かる。なお、ρは相関係数を示す。図４Ｂはピアソン相関値の値の範囲（－１から１）をＸ軸、Ｙ軸で示した。 4A and 4B are imaginary image diagrams for explaining an example of the correlation coefficient between elements according to this embodiment. FIG. 4A shows an example of a module, with elements shown as circles. Elements with values equal to or greater than a specific correlation coefficient are connected by lines. The elements include seed elements and elements within a module. Symbols shown in circles are identification information of the elements. In the example shown in FIG. 4A, consider a case in which the correlation coefficient ρ_XY between element X and element Y is greater than zero, and the correlation coefficient ρ_XZ between element X and element Z is greater than zero. In this case, it is unclear whether the correlation coefficient ρ YZ between element Y and element_Z is greater than zero. FIG. 4B shows an image diagram of values that the correlation coefficient ρ_YZ can take. According to FIG. 4B, it can be seen that the correlation coefficient ρ_YZ can take a value greater than zero, can take a value less than zero, or can be predicted as to whether it is greater than zero or less than zero. Note that ρ indicates the correlation coefficient. FIG. 4B shows the range of values of the Pearson correlation value (-1 to 1) on the X axis and the Y axis.

（ステップＳ３）ＦＰＯ解析部１３２は、入力データが有する全要素をそれぞれシード要素とし、シード要素の相関係数リスト１２２において上位（相関係数が大きい方）に位置する非シード要素の中から偽陽性（False-Positive）の非シード要素を除外する（Out）処理（ＦＰＯ解析処理）を実行する。ＦＰＯ解析処理について以下に説明する。(Step S3) TheFPO analysis unit 132 performs a process (FPO analysis process) in which all elements in the input data are treated as seed elements, and false-positive non-seed elements are excluded (Out) from among the non-seed elements positioned at the top (those with larger correlation coefficients) in thecorrelation coefficient list 122 of the seed elements. The FPO analysis process is described below.

ＦＰＯ解析部１３２は、シード要素の相関係数リスト１２２において所定の上位（Ｍ位まで）の非シード要素をシード要素とエッジで結んだ仮モジュールを構成する。Ｍは、モジュールサイズの所定範囲（モジュールサイズの下限値から上限値まで）のいずれかの値である。シード要素のモジュールを構成する要素（シード要素を除く）の数はランク（ランク＝Ｍ）である。モジュールサイズの所定範囲は、例えば、予め情報処理装置１にＭＩＮ値とＭＡＸ値を設定しておくことで作業者が任意に設定可能とする。ここでは、ＭＩＮ値及びＭＡＸ値はシード要素を除いたモジュールサイズであり、ランクがとり得る最小値と最大値である。従って、モジュールを構成する要素（シード要素と非シード要素）の総数（モジュールサイズ）の下限値は「ＭＩＮ値＋１」であり、上限値は「ＭＡＸ値＋１」である。TheFPO analysis unit 132 constructs a provisional module by connecting a predetermined number of top (up to Mth) non-seed elements in thecorrelation coefficient list 122 of the seed element with an edge. M is any value in a predetermined range of module sizes (from the lower limit to the upper limit of the module size). The number of elements (excluding the seed element) that constitute the module of the seed element is the rank (rank = M). The predetermined range of the module size can be arbitrarily set by the operator, for example, by setting a MIN value and a MAX value in advance in theinformation processing device 1. Here, the MIN value and the MAX value are the module size excluding the seed element, and are the minimum and maximum values that the rank can take. Therefore, the lower limit of the total number of elements (seed elements and non-seed elements) that constitute the module (module size) is "MIN value + 1", and the upper limit is "MAX value + 1".

次いで、ＦＰＯ解析部１３２は、仮モジュールに対して、シード要素及び各非シード要素の相関係数リスト１２２に含まれる全ての要素間の相関係数を降順でソートしたリストを生成する。ＦＰＯ解析部１３２は、仮モジュールに対して、生成したリストにおいて所定の相関閾値よりも大きい相関係数の要素間をさらにエッジで結ぶ。ここで、相関閾値が複数設定されることにより、エッジパターンが異なる複数の仮モジュールが構成される。例えば、相関閾値として取り得る全ての値が相関閾値として設定されることにより、当該全ての相関閾値毎のエッジパターンの仮モジュールが構成される。相関閾値を低くすればするほど、エッジの数が増える。例えば、同時に複数の相関閾値を設定するのではなく、相関閾値を最大値「１」に初期化した後に、相関閾値を減らしながら、各相関閾値で仮モジュールを構成するようにしてもよい。
次いで、ＦＰＯ解析部１３２は、エッジパターンが異なる複数の仮モジュールそれぞれに対して、仮モジュールのＦスコア、ＭＦを次式により算出する。 Next, theFPO analysis unit 132 generates a list of correlation coefficients between all elements included in thecorrelation coefficient list 122 of the seed element and each non-seed element, sorted in descending order, for the provisional module. TheFPO analysis unit 132 further connects elements in the generated list having a correlation coefficient greater than a predetermined correlation threshold with an edge for the provisional module. Here, multiple correlation thresholds are set, thereby configuring multiple provisional modules with different edge patterns. For example, all possible values of the correlation threshold are set as the correlation threshold, thereby configuring provisional modules with edge patterns for each of the correlation thresholds. The lower the correlation threshold, the greater the number of edges. For example, instead of setting multiple correlation thresholds at the same time, the correlation threshold may be initialized to the maximum value "1", and then the correlation threshold may be reduced while configuring a provisional module with each correlation threshold.
Next, theFPO analysis unit 132 calculates the F score, MF, of each of the provisional modules having different edge patterns using the following formula.

ｎはモジュール内の要素数である。ｅｄｇｅ（ｉ）は要素ｉのエッジ数であり、現状の相関係数に該当するエッジパターンに対して、要素ｉが持つ同じモジュール内の繋がりの要素の数である。ｉは１からｎの間の整数値であり、要素のｉｎｄｅｘ（指標）である。ｄｅｇｒｅｅ（ｉ）は要素ｉの次数（ディグリー）であり、現状の相関係数に該当するエッジパターンに対して、要素ｉが持つモジュール外とモジュール内の繋がりの要素の総数である。n is the number of elements in the module. edge(i) is the number of edges of element i, and is the number of elements connected to element i within the same module for the edge pattern corresponding to the current correlation coefficient. i is an integer value between 1 and n, and is the index of the element. degree(i) is the degree of element i, and is the total number of elements connected to element i both outside and within the module for the edge pattern corresponding to the current correlation coefficient.

ＭＦは、モジュール密度（ＭＤ）とモジュール特異率（ＭＳ）の調和平均である。ＭＤは、モジュール全体として、モジュールが内部で繋がり合っている程度を表す。ＭＳは、モジュールが外部と分離している（排他的に繋がっている）程度を表す。したがって、ＭＦが大きいモジュールは、モジュール外部と分離しておりモジュール全体として強い結びつきを有するモジュールと言える。MF is the harmonic mean of module density (MD) and module specificity (MS). MD represents the degree to which a module is internally connected as a whole. MS represents the degree to which a module is isolated from the outside (exclusively connected). Therefore, a module with a large MF can be said to be a module that is isolated from the outside and has strong connections as a whole.

ＦＰＯ解析部１３２は、エッジパターンが異なる複数の仮モジュールそれぞれに対して算出されたＭＦのうち最大値、及びＭＦの最大値に対応する相関閾値とエッジパターンを記憶部１２に格納する。
ＦＰＯ解析部１３２は、ＭＦの最大値に対応する相関閾値とエッジパターンに対する各非シード要素ｉのＦスコア、ＶＦ（ｉ）を次式により算出する。 TheFPO analysis unit 132 stores in thestorage unit 12 the maximum value of the MFs calculated for each of a plurality of provisional modules having different edge patterns, and the correlation threshold value and edge pattern corresponding to the maximum value of the MF.
TheFPO analysis unit 132 calculates the correlation threshold corresponding to the maximum value of MF and the F score, VF(i), of each non-seed element i for the edge pattern according to the following formula:

ＶＦ（ｉ）は、要素密度（ＶＤ（ｉ））と要素特異率（ＶＳ（ｉ））の調和平均である。ＶＤ（ｉ）は、要素ｉがモジュール内部の他の要素と繋がっている割合を表す。ＶＳ（ｉ）は、要素ｉがネットワーク全体に対して持つディグリーのうち、モジュールの要素と繋がっている割合を表す。ｉは１からｎの間の整数値であり、要素のｉｎｄｅｘ（指標）である。したがって、ＶＦ（ｉ）が大きい要素ｉは、モジュールと排他的に密に繋がっており、当該モジュールの構成要素として相応しい要素と言える。VF(i) is the harmonic mean of element density (VD(i)) and element singularity (VS(i)). VD(i) represents the proportion of element i that is connected to other elements within the module. VS(i) represents the proportion of the degree that element i has relative to the entire network that is connected to elements of the module. i is an integer value between 1 and n, and is the index of the element. Therefore, an element i with a large VF(i) is closely and exclusively connected to the module, and can be said to be an element suitable as a component of the module.

ＦＰＯ解析部１３２は、仮モジュールのＶＦ（ｉ）が最小となる非シード要素ｉを偽陽性と判定し、偽陽性と判定された非シード要素ｉを仮モジュールから取り除く。次いで、ＦＰＯ解析部１３２は、偽陽性と判定された非シード要素ｉを取り除いた仮モジュールに対して、再度、ＭＦの最大値、および各非シード要素ｉのＶＦ（ｉ）を算出する。この偽陽性の非シード要素ｉを取り除くＦＰＯ解析処理は、仮モジュールの要素数が「モジュールサイズ下限値」、つまり「ＭＩＮ値＋１」になるまで繰り返される。ＦＰＯ解析部１３２は、ＭＦが最大であった時点を過不足なく偽陽性が除かれた状態であると判定し、ＭＦが最大であった時点の仮組みモジュールの構成情報を、シード要素の「ランク」における最大ＭＦモジュールとしてモジュール情報１２３に格納する。TheFPO analysis unit 132 judges the non-seed element i with the smallest VF(i) of the provisional module to be a false positive, and removes the non-seed element i judged to be a false positive from the provisional module. Next, theFPO analysis unit 132 calculates the maximum MF and the VF(i) of each non-seed element i again for the provisional module from which the non-seed element i judged to be a false positive has been removed. This FPO analysis process for removing the false positive non-seed element i is repeated until the number of elements of the provisional module becomes the "module size lower limit", that is, "MIN value + 1". TheFPO analysis unit 132 judges the time when the MF was maximum as a state in which the false positives have been removed without excess or deficiency, and stores the configuration information of the provisional assembly module at the time when the MF was maximum in themodule information 123 as the maximum MF module in the "rank" of the seed element.

ＦＰＯ解析部１３２は、ランクの所定範囲（仮モジュールを構成した時点のモジュールサイズの下限値（ＭＩＮ値＋１）から上限値（ＭＡＸ値＋１）まで）において、各ランクに対して、ＦＰＯ解析処理を実行する。これにより、一のシード要素に関して、ランクの所定範囲の全てのランク毎に、ＭＦが最大であった時点の仮組みモジュールの構成情報が最大ＭＦモジュールとしてモジュール情報１２３に格納される。TheFPO analysis unit 132 executes an FPO analysis process for each rank within a specified range of ranks (from the lower limit (MIN value + 1) to the upper limit (MAX value + 1) of the module size at the time the provisional module is constructed). As a result, for one seed element, for each rank within the specified range of ranks, the configuration information of the provisionally assembled module at the time when the MF was maximum is stored in themodule information 123 as a maximum MF module.

なお、ＦＰＯ解析部１３２は、ランクの下限値（ＭＩＮ値）から上限値（ＭＡＸ値）までランクを増やしながらステップＳ３のＦＰＯ解析を行い、ステップＳ３のＦＰＯ解析が済んだモジュール（ＦＰＯ解析済モジュール）の構成情報をＦＰＯ解析済モジュールリスト１２８に格納し、既にＦＰＯ解析済モジュールリスト１２８に存在するＦＰＯ解析済モジュールと同じモジュールが構築された場合には、偽陽性の非シード要素を取り除くＦＰＯ解析処理を止めて当該ランクの最大ＭＦモジュールの検索を早期終了してもよい。In addition, theFPO analysis unit 132 performs the FPO analysis in step S3 while increasing the rank from the lower limit value (MIN value) of the rank to the upper limit value (MAX value), and stores the configuration information of the module (FPO analyzed module) for which the FPO analysis in step S3 has been completed in the FPO analyzedmodule list 128. If a module identical to an FPO analyzed module that is already present in the FPO analyzedmodule list 128 is constructed, the FPO analysis process for removing false positive non-seed elements may be stopped and the search for the maximum MF module of that rank may be terminated early.

ＦＰＯ解析部１３２は、相関閾値とエッジパターンを用いてＭＦの最大値が更新されないか否かを判定し、エッジパターンが異なる複数の仮モジュールの中でＭＦが未計算の仮モジュールでＭＦの最大値が更新されないと判定した場合に、ＭＦの計算を終了し、当該ランクの最大ＭＦモジュールの検索を早期終了してもよい。
このように構成することによって、ＦＰＯ解析部１３２は、エッジパターンが異なる複数の仮モジュールの中でＭＦが未計算の仮モジュールでＭＦの最大値が更新されないと判定した場合に、ＭＦの計算を終了し、当該ランクの最大ＭＦモジュールの検索を早期終了できるため、ＭＦの最大値が更新されない場合でも、ＭＦの計算が継続され、モジュールの最適化処理が継続される場合と比較して、計算量を低減できるとともに、ＦＰＯ解析処理時間を短縮できる。 TheFPO analysis unit 132 uses a correlation threshold and an edge pattern to determine whether the maximum MF value is not updated, and if it determines that the maximum MF value is not updated in a provisional module whose MF has not been calculated among multiple provisional modules with different edge patterns, it may terminate the MF calculation and early terminate the search for the maximum MF module of that rank.
By configuring in this manner, when theFPO analysis unit 132 determines that the maximum MF value will not be updated for a provisional module for which the MF has not been calculated among multiple provisional modules with different edge patterns, it can terminate the MF calculation and early terminate the search for the maximum MF module of that rank.Therefore, even if the maximum MF value is not updated, the amount of calculation can be reduced and the FPO analysis processing time can be shortened compared to when the MF calculation continues and the module optimization processing continues.

（ステップＳ４）ＦＰＯモジュール選定部１３３は、ある一つのシード要素に関して、モジュール情報１２３に基づいて、ＦＰＯ解析結果の最適モジュール（ＦＰＯモジュールと称する）を選定する。(Step S4) The FPOmodule selection unit 133 selects an optimal module (referred to as an FPO module) based on themodule information 123 for a given seed element based on the FPO analysis results.

図５Ａは、本実施形態に係るＦＰＯモジュール選定方法を説明するための図である。図５Ａの例は、すりおろし大根の時系列メタボロームデータ（要素数２３０５ｘ時系列１０ポイントデータセット）である。ピアソンの積率相関係数、ＭＩＮ値「２」、ＭＡＸ値「２３０５」で、識別子「ｃ１４８７」を持つ要素をシード要素としてＦＰＯ解析処理が行われた結果である。図５Ａには、ランク（横軸）に対応するＭＦの最大値（縦軸）が示されている。ＦＰＯモジュール選定部１３３は、ランクの値を増やしてもＭＦの最大値が更新されない期間の最大値ＰＴｍａｘ（最長プラトー長）を示すランクのモジュールＭ＿ＰＴｍａｘを判定する。ＦＰＯモジュール選定部１３３は、判定したモジュールＭ＿ＰＴｍａｘを、ＦＰＯモジュールに選定する。Figure 5A is a diagram for explaining the FPO module selection method according to this embodiment. The example of Figure 5A is time-series metabolome data of grated radish (number of elements 2305 x time-series 10-point data set). The result of FPO analysis processing was performed using an element with identifier "c1487" as a seed element with Pearson's product-moment correlation coefficient, MIN value "2", and MAX value "2305". Figure 5A shows the maximum value of MF (vertical axis) corresponding to the rank (horizontal axis). The FPOmodule selection unit 133 determines the module M_PTmax of the rank that shows the maximum value PTmax (longest plateau length) of the period in which the maximum value of MF is not updated even if the rank value is increased. The FPOmodule selection unit 133 selects the determined module M_PTmax as the FPO module.

図５Ａに示される例では、「ランク＝６２３」で構築されたモジュールのＭＦが０．６８３と計算され、ＭＦが０．６８３を超えるモジュールが「ランク＝７６１」まで構築されなかった。従って、ランクの値を増やしてもＭＦの最大値が更新されない期間であるプラトー長は、「ランク＝７６１」から「ランク＝６２３」を減算した「１３８」となる。この例では「１３８」が最も長いプラトー長（最長プラトー長）ＰＴｍａｘとなったため、「ランク＝６２３」で構築されたモジュールが最長プラトー長ＰＴｍａｘを示すランク「６２３」のモジュールＭ＿ＰＴｍａｘに選定された。In the example shown in Figure 5A, the MF of the module constructed at "Rank = 623" was calculated to be 0.683, and no modules with an MF exceeding 0.683 were constructed up until "Rank = 761". Therefore, the plateau length, which is the period during which the maximum MF value is not updated even if the rank value is increased, is "138", which is obtained by subtracting "Rank = 623" from "Rank = 761". In this example, "138" was the longest plateau length (longest plateau length) PTmax, so the module constructed at "Rank = 623" was selected as the module M_PTmax of rank "623", which shows the longest plateau length PTmax.

従来は、全てのランクにおいて最大のＭＦを示すランクのモジュールをＦＰＯモジュールに選定していた。しかし、全てのランクにおいて最大のＭＦを示すランクは、ランクの上限値（ＭＡＸ値）または上限値付近であることが多い。このため、従来は、巨大な偏ったモジュールが構成されることが多かった。また、従来は、ランクの上限値（ＭＡＸ値）の設定に依存するので、最適なＦＰＯ解析結果が得られない可能性があった。Conventionally, the module of the rank showing the maximum MF among all ranks was selected as the FPO module. However, the rank showing the maximum MF among all ranks is often at or near the upper limit value (MAX value) of the rank. For this reason, conventionally, huge biased modules were often constructed. Furthermore, conventionally, because it depended on the setting of the upper limit value (MAX value) of the rank, there was a possibility that optimal FPO analysis results could not be obtained.

これに対して本実施形態によれば、ＭＦが大きく且つ適切な要素数のモジュールを得ることができることにより、ＦＰＯ解析結果の品質が向上する効果が得られる。また、ＦＰＯ解析結果は、ランクの上限値（ＭＡＸ値）の設定に影響されづらい。このため、よりロバスト性の高いＦＰＯ解析を行うことができる。In contrast, according to the present embodiment, a module with a large MF and an appropriate number of elements can be obtained, which has the effect of improving the quality of the FPO analysis results. In addition, the FPO analysis results are less affected by the setting of the upper limit value (MAX value) of the rank. This allows for a more robust FPO analysis.

また、ランク毎のモジュール情報１２３は、ＦＰＯモジュールの選定基準の検証に用いることができる。出力データ生成部１３８は、ランク毎のモジュール情報１２３を用いて、図５Ａに例示されるＭＦの最大値のグラフのデータを生成してもよい。このグラフのデータは、表示画面上にグラフを表示したり、又はグラフを印字したりすることができるデータである。これにより、作業者は、ＦＰＯモジュールの選定基準の検証を容易に行うことができる。Furthermore, themodule information 123 for each rank can be used to verify the selection criteria for the FPO module. The outputdata generation unit 138 may use themodule information 123 for each rank to generate data for a graph of the maximum MF value exemplified in FIG. 5A. This graph data is data that can be used to display the graph on a display screen or print the graph. This allows the operator to easily verify the selection criteria for the FPO module.

なお、他のＦＰＯモジュール選定方法として、モジュールの要素数や要素間の相関係数の分布を問わずロバスト性を維持する基準を採用してもよい。例えば、図５Ｂに例示されるＭＦの最大値の移動平均のグラフのプラトー長を用いた基準、又は当該グラフの増加率が一定値を下回る基準を採用してもよい。また、要素間の相関係数の分布を基にした、データセットごとに最適化した基準を採用してもよい。この場合は、ランクの上限値、つまりＭＡＸ値は設定パラメータとして調整する必要がなくなるので、作業者の負担を軽減できる。As another method for selecting an FPO module, a criterion that maintains robustness regardless of the number of elements in the module or the distribution of correlation coefficients between elements may be adopted. For example, a criterion using the plateau length of the graph of the moving average of the maximum MF values exemplified in FIG. 5B, or a criterion that the increase rate of the graph is below a certain value, may be adopted. Also, a criterion optimized for each data set based on the distribution of correlation coefficients between elements may be adopted. In this case, the upper limit value of the rank, i.e., the MAX value, does not need to be adjusted as a setting parameter, thereby reducing the burden on the operator.

図５Ｂは、本実施形態に係るＦＰＯモジュール選定方法を説明するための図である。図５Ｂの例は、すりおろし大根の時系列メタボロームデータ（要素数２３０５ｘ時系列１０ポイントデータセット）である。ピアソンの積率相関係数、ＭＩＮ値「２」、ＭＡＸ値「２３０５」で、識別子「ｃ１４８７」を持つ要素をシード要素としてＦＰＯ解析処理が行われた結果である。図５Ｂには、ランク（横軸）に対応するＭＦの最大値の１０点移動平均（縦軸）が示されている。図５Ｂに示される例では、「ランク＝１５２」で構築されたモジュールのＭＦが０．５９０と計算され、ＭＦが０．５９０を超えるモジュールが「ランク＝３８５」まで構築されなかった。従って、ランクの値を増やしてもＭＦの最大値が更新されない期間である移動平均のプラトー長は、「ランク＝３８５」から「ランク＝１５２」を減算した「２３３」となる。この例では「２３３」が移動平均の最も長いプラトー長（最長プラトー長）ＭＡＰＴｍａｘとなったため、「ランク＝１５２」で構築されたモジュールが最長プラトー長ＭＡＰＴｍａｘを示すランク「１５２」のモジュールＭ＿ＭＡＰＴｍａｘに選定された。Figure 5B is a diagram for explaining the FPO module selection method according to this embodiment. The example of Figure 5B is time-series metabolome data of grated radish (number of elements 2305 x time-series 10-point data set). The result of FPO analysis processing was performed using an element with identifier "c1487" as a seed element with Pearson's product-moment correlation coefficient, MIN value "2", and MAX value "2305". Figure 5B shows a 10-point moving average (vertical axis) of the maximum value of MF corresponding to the rank (horizontal axis). In the example shown in Figure 5B, the MF of the module constructed at "rank = 152" was calculated to be 0.590, and no module with an MF exceeding 0.590 was constructed up to "rank = 385". Therefore, the plateau length of the moving average, which is the period during which the maximum value of MF is not updated even if the rank value is increased, is "233", which is obtained by subtracting "rank = 152" from "rank = 385". In this example, since "233" was the longest plateau length (longest plateau length) MAPTmax of the moving average, the module constructed with "rank = 152" was selected as the module M_MAPTmax with rank "152" indicating the longest plateau length MAPTmax.

なお、ＦＰＯモジュール選定部１３３は、所定の条件を満たす場合に、ＦＰＯ解析及びＦＰＯモジュールの選定を終了してもよい。図５Ｃは、本実施形態に係るＦＰＯ解析の一例を説明するための図である。図５Ｃは、すりおろし大根の時系列メタボロームデータ（要素数２３０５ｘ時系列１０ポイントデータセット）である。ピアソンの積率相関係数、ＭＩＮ値「２」、ＭＡＸ値「２３０５」で、識別子「ｃ１４８７」を持つ要素をシード要素としてＦＰＯ解析処理が行われた結果である。図５Ｃには、ランク（横軸）に対応するＭＦの最大値（縦軸）が示されている。なお、この例では、識別子「ｃ１４８７」に対して正の相関係数を持つ要素は１５６９個あるため、識別子「ｃ１４８７」の最大ランクは、１５６９となる。図５Ｃは、ステップＳ３において、ＦＰＯ解析部１３２がランク毎の最大ＭＦの計算を終了した度、ＦＰＯモジュール選定部１３３が実行され、最長プラトー長が更新されないか否かを判定した場合の処理を示す。ＦＰＯモジュール選定部１３３は、最長プラトー長が更新されないと判定した場合に、ランクが変化しても（ランクが増加しても）プラトー長の最大値が更新されないと判定し、ランクの増加を終了し、ＦＰＯ解析及びＦＰＯモジュールの選定を終了してもよい。図５Ｃでランクが１４３２を超える時点で残りのランク数は、最大ランク「１５６９」からランク「１４３２」を減算した「１３７」となる。ランク数「１３７」は、ＰＴｍａｘ「１３８」を下回るので、ランクが１４３２以上の部分でＰＴｍａｘを超えないと判定し、計算不要と記載された部分（ランクが１４３２以上の部分）の計算が行われない。
このように構成することによって、ＦＰＯモジュール選定部１３３は、要素を追加しても（ランクが増加しても）最長プラトー長が更新されない場合に、ＦＰＯ解析及びＦＰＯモジュールの選定を終了できるため、最長プラトー長が更新されない場合でも、ランク毎のＦＰＯ解析及びＦＰＯモジュールの選定が継続される場合と比較して、計算量を低減できるとともに、ＦＰＯ解析及びＦＰＯモジュールの選定に要する時間を短縮できる。 In addition, the FPOmodule selection unit 133 may end the FPO analysis and the selection of the FPO module when a predetermined condition is satisfied. FIG. 5C is a diagram for explaining an example of the FPO analysis according to this embodiment. FIG. 5C is time-series metabolome data of grated radish (number of elements 2305 x time-series 10-point data set). The result is an FPO analysis process performed with an element having an identifier "c1487" as a seed element with a Pearson product-moment correlation coefficient, a MIN value of "2", and a MAX value of "2305". FIG. 5C shows the maximum value (vertical axis) of MF corresponding to the rank (horizontal axis). In this example, since there are 1569 elements having a positive correlation coefficient with the identifier "c1487", the maximum rank of the identifier "c1487" is 1569. 5C shows a process in the case where the FPOmodule selection unit 133 is executed and determines whether the longest plateau length is not updated each time theFPO analysis unit 132 finishes the calculation of the maximum MF for each rank in step S3. When the FPOmodule selection unit 133 determines that the longest plateau length is not updated, it may determine that the maximum value of the plateau length is not updated even if the rank changes (even if the rank increases), end the increase in rank, and end the FPO analysis and the selection of the FPO module. In FIG. 5C, when the rank exceeds 1432, the remaining number of ranks is "137", which is the maximum rank "1569" minus the rank "1432". Since the number of ranks "137" is below the PTmax "138", it is determined that the part with the rank of 1432 or more does not exceed PTmax, and the calculation of the part described as not requiring calculation (the part with the rank of 1432 or more) is not performed.
By configuring in this manner, the FPOmodule selection unit 133 can terminate FPO analysis and selection of FPO modules when the longest plateau length is not updated even when an element is added (even when the rank is increased). Therefore, even if the longest plateau length is not updated, the amount of calculations can be reduced and the time required for FPO analysis and selection of FPO modules can be shortened compared to when FPO analysis and selection of FPO modules for each rank are continued.

（ステップＳ５）入力データが有する全要素をそれぞれシード要素としてＦＰＯモジュールの選定が完了した場合は（ステップＳ５、ＹＥＳ）ステップＳ６－１に進む。一方、まだＦＰＯモジュールが選定されていない要素が残っている場合は（ステップＳ５、ＮＯ）ステップＳ３に戻り、まだＦＰＯモジュールが選定されていない要素をシード要素としてＦＰＯ解析及びＦＰＯモジュール選定を行う。(Step S5) If the selection of FPO modules has been completed using all elements in the input data as seed elements (Step S5, YES), proceed to Step S6-1. On the other hand, if there are still elements remaining for which an FPO module has not yet been selected (Step S5, NO), return to Step S3 and perform FPO analysis and FPO module selection using the elements for which an FPO module has not yet been selected as seed elements.

（ステップＳ６－１）ＦＰＯ解析部１３２は、各シード要素のＦＰＯモジュールに対して、ＦＰＯモジュール内の各要素（ｉ）についてのＶＦ（ｉ）の最大値（最終ＶＦと称する）、及び最終ＶＦに基づくＭＦ（最終ＭＦと称する）を算出する。各シード要素のＦＰＯモジュールに対して算出された最終ＭＦ及び最終ＶＦは、評価値情報１２６に格納される。(Step S6-1) TheFPO analysis unit 132 calculates, for the FPO module of each seed element, the maximum value of VF(i) for each element (i) in the FPO module (referred to as the final VF) and an MF based on the final VF (referred to as the final MF). The final MF and final VF calculated for the FPO module of each seed element are stored in theevaluation value information 126.

図６Ａおよび図６Ｂは、本実施形態に係る最終ＶＦの一例を説明するための仮想のイメージ図である。図６Ａはモジュールおよび要素の一例を示す。図６Ａでは、要素が丸で示される。要素には、注目要素（最終ＶＦを算出する対象の要素）、モジュール内要素、モジュール外要素が含まれる。丸の中に示される数値は、注目要素に対する相関順位である。ここでの一例として、相関順位は、相関が高いほど上位（値が小さい）である。図６Ａでは、注目要素とモジュール内要素、及び注目要素とモジュール外要素において、特定の相関係数以上の値の要素間を線で繋いでいる。実線は、エッジに含まれる要素、すなわち、モジュール内要素間の関係性を示す。点線は、エッジに含まれない要素、すなわち、モジュール外要素間の関係性を示す。6A and 6B are hypothetical image diagrams for explaining an example of the final VF according to this embodiment. FIG. 6A shows an example of a module and an element. In FIG. 6A, elements are indicated by circles. The elements include the element of interest (the element for which the final VF is calculated), elements within the module, and elements outside the module. The numbers indicated in the circles are correlation rankings for the element of interest. In this example, the higher the correlation ranking, the higher (the smaller the value). In FIG. 6A, lines are used to connect elements with correlation coefficients equal to or greater than a certain value between the element of interest and the elements within the module, and between the element of interest and the elements outside the module. Solid lines indicate the relationships between elements included in the edge, i.e., between the elements within the module. Dotted lines indicate the relationships between elements not included in the edge, i.e., between the elements outside the module.

ＦＰＯ解析部１３２は、一のＦＰＯモジュールに対して、ＦＰＯモジュール内の要素（ｉ）毎に最終ＶＦを計算する。具体的には、ＦＰＯ解析部１３２は、ＦＰＯモジュール内の各要素（ｉ）を注目要素にし、その注目要素についてＶＦ（ｉ）を計算する。図６Ｂは、一の要素（ｉ）（注目要素）に対するＶＦ（ｉ）の計算結果を示す。図６Ｂにおいて、横軸は相関順位を示し、縦軸はＶＦ（ｉ）を示す。ＦＰＯ解析部１３２は、一の要素（ｉ）（注目要素）に関して、ＶＦ（ｉ）の最大値（最終ＶＦ）を取得する。図６Ｂの例では、相関順位「５」の場合、モジュール内要素へは３本、モジュール外要素へは２本であり、ディグリー（ｄｅｇｒｅｅ）は「５＝２＋３」、エッジ（ｅｄｇｅ）は３、モジュールの要素数（ｎ）は５であり、ＶＦ（ｉ）は、上記（２）式より「ＶＦ（ｉ）＝（２×ｅｄｇｅ（ｉ））／（ｄｅｇｒｅｅ（ｉ）＋ｎ－１）＝（２×３）／（５＋５－１）＝０．６７」となり、ＶＦ（ｉ）の最大値（最終ＶＦ）を示す。For one FPO module, theFPO analysis unit 132 calculates the final VF for each element (i) in the FPO module. Specifically, theFPO analysis unit 132 sets each element (i) in the FPO module as a focus element and calculates a VF(i) for the focus element. Figure 6B shows the calculation result of VF(i) for one element (i) (focus element). In Figure 6B, the horizontal axis indicates the correlation rank, and the vertical axis indicates VF(i). TheFPO analysis unit 132 obtains the maximum value of VF(i) (final VF) for one element (i) (focus element). In the example of Figure 6B, when the correlation rank is "5", there are three lines to elements within the module and two lines to elements outside the module, the degree is "5 = 2 + 3", the edge is 3, the number of elements in the module (n) is 5, and VF(i) is calculated from equation (2) above as "VF(i) = (2 x edge(i)) / (degree(i) + n-1) = (2 x 3) / (5 + 5-1) = 0.67", which shows the maximum value of VF(i) (final VF).

なお、ＦＰＯ解析部１３２は、最終ＶＦの算出において、ＶＦ（ｉ）が相関順位を増加させても最大値を超えないか否かを判定するようにしてもよい。ＦＰＯ解析部１３２は、ＶＦ（ｉ）が相関順位を増加させても最大値を超えないと判定した場合に、ＶＦ（ｉ）を計算する処理を終了するようにしてもよい。In addition, theFPO analysis unit 132 may determine whether or not VF(i) does not exceed the maximum value even if the correlation rank is increased in calculating the final VF. TheFPO analysis unit 132 may terminate the process of calculating VF(i) when it determines that VF(i) does not exceed the maximum value even if the correlation rank is increased.

図６Ｃは、本実施形態に係る最終ＶＦの算出方法の一例を説明するための仮想のイメージ図である。
ＦＰＯ解析部１３２は、一のＦＰＯモジュールに対して、ＦＰＯモジュール内の要素（ｉ）毎に最終ＶＦを計算する。具体的には、ＦＰＯ解析部１３２は、ＦＰＯモジュール内の各要素（ｉ）を注目要素にし、その注目要素についてＶＦ（ｉ）を計算する。図６Ｃは、一の要素（ｉ）（注目要素）に対するＶＦ（ｉ）の計算結果を示す。図６Ｃにおいて、横軸は相関順位を示し、縦軸はＶＦ（ｉ）を示す。ＦＰＯ解析部１３２は、相関順位が８の時点でＶＦ（ｉ）が最大値を超えないと判定し、ＶＦ（ｉ）を計算する処理を終了する。ＶＦ（ｉ）の計算を終了できる理由を以下に述べる。 FIG. 6C is a hypothetical image diagram for explaining an example of a method for calculating the final VF according to this embodiment.
TheFPO analysis unit 132 calculates the final VF for each element (i) in an FPO module for one FPO module. Specifically, theFPO analysis unit 132 sets each element (i) in the FPO module as an element of interest, and calculates VF(i) for the element of interest. FIG. 6C shows the calculation result of VF(i) for one element (i) (element of interest). In FIG. 6C, the horizontal axis indicates the correlation rank, and the vertical axis indicates VF(i). TheFPO analysis unit 132 determines that VF(i) does not exceed the maximum value when the correlation rank is 8, and ends the process of calculating VF(i). The reason why the calculation of VF(i) can be ended will be described below.

上記（３）式において、ＶＦは、要素Ｆスコアである。ｎは、対象モジュールの要素数である。ｅｄｇｅは、対象モジュール内のエッジ数である。ｄｅｇｒｅｅは、要素のディグリー数である。上記の式（３）より、モジュール内要素が他のモジュールの要素と全て繋いでもｅｄｇｅ＝ｎ‐１になるので、ｅｄｇｅ≦ｎ‐１を言える。最終ＶＦを計算する時点で、モジュールの要素数ｎは固定値であるので、ディグリー数のみを用いて達成可能な最終ＶＦの上限値を求めることができる。In the above formula (3), VF is the element F score. n is the number of elements in the target module. edge is the number of edges in the target module. degree is the number of degrees of the elements. From the above formula (3), it can be said that edge≦n-1 because edge=n-1 even if all elements in a module are connected to elements in other modules. At the time of calculating the final VF, the number of elements n in the module is a fixed value, so it is possible to find the upper limit of the achievable final VF using only the number of degrees.

上記（４）式において、ＶＦは、要素Ｆスコアである。ＶＦ_ａは、相関順位ａの時点で最大ＶＦ値とする。ｎは、対象モジュールの要素数である。ｄｅｇｒｅｅは、要素のディグリー数である。図６Ｃの例ではｎ＝５、また、相関順位ａ＝５の時点において、ＶＦ_ａ＝０．６７となることから、ＶＦ_ａを超えるために、上の式（４）によりｄ≦８である必要がある。ここで、ｄｅｇｒｅｅの値は相関順位に依存し、相関順位「９」の場合、ｄｅｇｒｅｅの値は９となる。従って、相関順位「８」の場合を計算した後に、相関順位「９」以上の場合の計算は不要である。なお、図６Ｃの例では、相関順位「６」の場合、モジュール内要素へは３本、モジュール外要素へは３本であり、ディグリー数（ｄｅｇｒｅｅ）は「６＝３＋３」、エッジ数（ｅｄｇｅ）は３、モジュールの要素数（ｎ）は５であり、ＶＦ（ｉ）は、上記（２）式より「ＶＦ（ｉ）＝（２×ｅｄｇｅ（ｉ））／（ｄｅｇｒｅｅ（ｉ）＋ｎ－１）＝（２×３）／（６＋５－１）＝０．６０」となる。相関順位「７」の場合、モジュール内要素へは３本、モジュール外要素へは４本であり、ディグリー数（ｄｅｇｒｅｅ）は「７＝４＋３」、エッジ数（ｅｄｇｅ）は３、モジュールの要素数（ｎ）は５であり、ＶＦ（ｉ）は、上記（２）式より「ＶＦ（ｉ）＝（２×ｅｄｇｅ（ｉ））／（ｄｅｇｒｅｅ（ｉ）＋ｎ－１）＝（２×３）／（７＋５－１）＝０．５４」となる。相関順位「８」の場合、モジュール内要素へは３本、モジュール外要素へは５本であり、ディグリー数（ｄｅｇｒｅｅ）は「８＝５＋３」、エッジ数（ｅｄｇｅ）は３、モジュールの要素数（ｎ）は５であり、ＶＦ（ｉ）は、上記（２）式より「ＶＦ（ｉ）＝（２×ｅｄｇｅ（ｉ））／（ｄｅｇｒｅｅ（ｉ）＋ｎ－１）＝（２×３）／（８＋５－１）＝０．５０」となる。このように、式（4）を用いて判定したように、相関順位の高い順に要素を追加しても、要素がモジュール外要素の場合は、ＶＦ（ｉ）が０．６７（相関順位「５」の場合）を超えることがないことが確認できる。このように構成することによって、相関順位の全てについてＶＦ（ｉ）を計算する場合と比較して、計算量を低減できるとともに、処理時間を短縮できる。 In the above formula (4), VF is the element F score. VF_a is the maximum VF value at the time of correlation rank a. n is the number of elements in the target module. degree is the number of degrees of the elements. In the example of FIG. 6C, n=5, and VF_a =0.67 at the time of correlation rank a=5, so in order to exceed VF_a , d≦8 is required according to the above formula (4). Here, the value of degree depends on the correlation rank, and in the case of correlation rank "9", the value of degree is 9. Therefore, after calculating the case of correlation rank "8", it is not necessary to calculate the case of correlation rank "9" or higher. In the example of Figure 6C, when the correlation rank is "6", there are three connections to elements within the module and three connections to elements outside the module, the number of degrees is "6 = 3 + 3", the number of edges is 3, the number of elements in the module (n) is 5, and VF(i) is calculated from equation (2) above as "VF(i) = (2 x edge(i)) / (degree(i) + n-1) = (2 x 3) / (6 + 5-1) = 0.60". In the case of a correlation ranking of "7", there are three connections to elements within the module and four connections to elements outside the module, the number of degrees is "7 = 4 + 3", the number of edges is 3, the number of elements in the module (n) is 5, and VF(i) is calculated from equation (2) above as "VF(i) = (2 x edge(i)) / (degree(i) + n-1) = (2 x 3) / (7 + 5-1) = 0.54". In the case of a correlation rank of "8", there are three connections to elements within the module and five connections to elements outside the module, the number of degrees is "8 = 5 + 3", the number of edges is 3, the number of elements in the module (n) is 5, and VF(i) is "VF(i) = (2 x edge (i)) / (degree (i) + n-1) = (2 x 3) / (8 + 5-1) = 0.50" according to the above formula (2). As described above, as determined using formula (4), it can be confirmed that VF(i) does not exceed 0.67 (in the case of correlation rank "5") when the element is an element outside the module, even if the element is added in descending order of correlation rank. By configuring in this way, the amount of calculation can be reduced and the processing time can be shortened compared to the case where VF(i) is calculated for all correlation ranks.

ＦＰＯ解析部１３２は、一のＦＰＯモジュールに対して、ＦＰＯモジュール内の各要素（ｉ）の最終ＶＦに基づいてＭＦ（最終ＭＦと称する）を算出する。具体的には、ＦＰＯ解析部１３２は、ＦＰＯモジュール内の要素（ｉ）毎に最終ＶＦが得られたときのｅｄｇｅ（ｉ）及びｄｅｇｒｅｅ（ｉ）を取得し、取得した各要素（ｉ）のｅｄｇｅ（ｉ）及びｄｅｇｒｅｅ（ｉ）を用いて上記（１）式よりＭＦ（最終ＭＦ）を算出する。TheFPO analysis unit 132 calculates an MF (referred to as the final MF) for one FPO module based on the final VF of each element (i) in the FPO module. Specifically, theFPO analysis unit 132 obtains edge(i) and degree(i) when the final VF is obtained for each element (i) in the FPO module, and calculates the MF (final MF) from the above formula (1) using the obtained edge(i) and degree(i) of each element (i).

なお、最終ＶＦおよび最終ＭＦの算出方法は、最終ＶＦおよび最終ＭＦの算出対象のモジュールにおけるシード要素に依存しないようにＶＦ（ｉ）およびＭＦを算出する方法であればよい。例えば、モジュール内のある要素（ｉ）がモジュール内の他の要素に対して、後述する要素Ｐ値が最小である場合のＶＦ（ｉ）を最終ＶＦに用いてもよい。The method for calculating the final VF and final MF may be any method that calculates VF(i) and MF without depending on the seed element in the module for which the final VF and final MF are calculated. For example, the VF(i) in the module where an element P value (described later) is the smallest for an element (i) in the module compared to other elements in the module may be used as the final VF.

（ステップＳ６－２）Ｐ値算出部１３７は、各シード要素のＦＰＯモジュールに対して、次式により、モジュールＰ値を算出する。(Step S6-2) The P-value calculation unit 137 calculates the module P-value for the FPO module of each seed element using the following formula.

Ｎは、入力データである多変量データの全要素数である。ｎは、モジュールの要素数である。ｅｄｇｅ_ａｖｇは、最終ＭＦの算出に用いた各要素（ｉ）のｅｄｇｅ（ｉ）の平均値に最も近い整数である。ｄｅｇｒｅｅ_ａｖｇは、最終ＭＦの算出に用いた各要素（ｉ）のｄｅｇｒｅｅ（ｉ）の平均値に最も近い整数である。ｑ、ｒは、正の整数であって、ｒ≦ｑである。
なお、ｅｄｇｅ_ａｖｇの代わりに、ＦＰＯモジュールのＭＦの最大値が得られたときの相関閾値以上の相関係数を有する要素同士を結んだモジュール内のエッジ数を用い、またｄｅｇｒｅｅ_ａｖｇの代わりに、ＦＰＯモジュールのＭＦの最大値が得られたときのディグリー（ｄｅｇｒｅｅ）を用いてもよい。 N is the total number of elements of the multivariate data that is the input data. n is the number of elements of the module. edge_avg is the integer closest to the average value of edge(i) of each element (i) used to calculate the final MF. degree_avg is the integer closest to the average value of degree(i) of each element (i) used to calculate the final MF. q and r are positive integers, and r≦q.
In addition, instead of edge_avg , the number of edges in the module connecting elements having a correlation coefficient equal to or greater than the correlation threshold when the maximum value of the MF of the FPO module is obtained may be used, and instead of degree_avg , the degree when the maximum value of the MF of the FPO module is obtained may be used.

各シード要素のＦＰＯモジュールに対して算出されたモジュールＰ値は、ＦＰＯ解析後のモジュールＰ値として評価値情報１２６に格納される。モジュールＰ値は、統計処理で用いられる確率と同じ意味を持ち、観測した事象よりも稀な事象が起こる確率を表す。モジュールＰ値によれば、形成されたモジュールが正しいか否かを検定することができる。上記（５）式で算出されるモジュールＰ値は、ランダムでモジュールを構成した場合において、同じＭＤを与える確率と同じＭＳを与える確率との積から求められる。
ＦＰＯ解析後のモジュールＰ値によって、作業者は各シード要素のＦＰＯモジュールの有意性を評価することができる。これにより、モジュールの信憑性を付加することができるようになる。 The module P value calculated for the FPO module of each seed element is stored in theevaluation value information 126 as the module P value after FPO analysis. The module P value has the same meaning as the probability used in statistical processing, and represents the probability of an event occurring that is rarer than the observed event. The module P value can be used to verify whether the formed module is correct or not. The module P value calculated by the above formula (5) is obtained by multiplying the probability of giving the same MD and the probability of giving the same MS when the module is randomly constructed.
The module P-value after FPO analysis allows the operator to evaluate the significance of the FPO module of each seed element, which allows the operator to add credibility to the module.

（ステップＳ７）ＦＰＯモジュール統合部１３４は、各シード要素のＦＰＯモジュールに対して、モジュール統合処理（モジュラライズ処理）を実行する。ＦＰＯモジュールは、上述したステップＳ３－Ｓ５によって、シード要素毎にそれぞれ別個に構成される。このため、全要素の数だけＦＰＯモジュールが構成され、ＦＰＯモジュール間には要素の重複が生じる。この要素の重複をモジュール統合処理によって解消する。モジュール統合処理について以下に説明する。(Step S7) The FPOmodule integration unit 134 executes a module integration process (modularization process) for the FPO modules of each seed element. The FPO modules are configured separately for each seed element by steps S3-S5 described above. As a result, FPO modules are configured for the total number of elements, and overlapping elements occur between the FPO modules. This overlapping of elements is eliminated by the module integration process. The module integration process is described below.

ＦＰＯモジュール統合部１３４は、ＶＦ閾値の所定範囲（ＶＦ閾値の最小値から最大値まで）において、ＶＦ閾値毎に、モジュールを再構成する。ＶＦ閾値は、ＶＦ閾値の最小値から最大値まで一定の刻み幅で予め設定される。モジュール再構成方法としては、全てのＦＰＯモジュールを対象にして、既に在るエッジを全て削除し、ＶＦ閾値以上の最終ＶＦである要素同士を改めてエッジで結ぶ。このとき、ＦＰＯモジュール間に要素の重複がある場合、その重複している要素を介してＦＰＯモジュール同士の統合も行う。これにより、ＶＦ閾値毎に、ＦＰＯモジュールの統合結果が得られる。ＦＰＯモジュール統合部１３４は、ＶＦ閾値毎のＦＰＯモジュールの統合結果をモジュール統合情報１２４に格納する。
なお、ＶＦ閾値は、ＶＦ閾値の最小値から最大値まで一定の刻み幅で設定されることに限定されない。例えば、ＶＦ閾値としてＭＦ及びＶＦの全ての取りえる値をリスト化し、全てのＭＦとＶＦにおいて統合化を行ってもよい。この場合、ＶＦ閾値は、一定の刻みでなくてもよい。 The FPOmodule integration unit 134 reconfigures modules for each VF threshold within a predetermined range of the VF threshold (from the minimum to the maximum value of the VF threshold). The VF threshold is preset at a constant interval from the minimum to the maximum value of the VF threshold. The module reconfiguration method is to delete all existing edges for all FPO modules, and to reconnect elements with final VFs equal to or greater than the VF threshold with edges. At this time, if there is an overlap of elements between FPO modules, the FPO modules are also integrated via the overlapping elements. This allows the integration result of the FPO modules to be obtained for each VF threshold. The FPOmodule integration unit 134 stores the integration result of the FPO modules for each VF threshold in themodule integration information 124.
The VF threshold is not limited to being set at a constant interval from the minimum to maximum value of the VF threshold. For example, all possible values of MF and VF may be listed as the VF threshold, and integration may be performed for all MF and VF. In this case, the VF threshold does not have to be set at a constant interval.

なお、ＦＰＯモジュール統合部１３４は、ステップＳ３－Ｓ５によって構成された全てのＦＰＯモジュールに統合処理を実施するのではなく、一部のＦＰＯモジュールにのみ統合処理を実施するようにしても良い。例えば、評価値情報１２６に格納された、各ＦＰＯモジュールの最終ＶＦや最終ＭＦやモジュールＰ値に基づいて、所定の条件を満たさないＦＰＯモジュールを統合処理から除外しても良い。例えば、ＶＦ閾値がＶＦ閾値の最小値から最大値まで設定された際、最終ＭＦがＶＦ閾値未満のＦＰＯモジュール、及びシード要素の最終ＶＦがＶＦ閾値未満のＦＰＯモジュールを統合処理から除外しても良い。このように、評価値に基づいて品質または有意性の低いＦＰＯモジュールを統合処理から全体的に除外することで、統合されていくモジュールの品質の向上または有意性の向上を図ることができる。The FPOmodule integration unit 134 may perform the integration process only on some of the FPO modules, rather than on all the FPO modules constructed by steps S3-S5. For example, FPO modules that do not satisfy a predetermined condition may be excluded from the integration process based on the final VF, final MF, and module P value of each FPO module stored in theevaluation value information 126. For example, when the VF threshold is set from the minimum value to the maximum value of the VF threshold, FPO modules whose final MF is less than the VF threshold and FPO modules whose seed element final VF is less than the VF threshold may be excluded from the integration process. In this way, by excluding FPO modules with low quality or significance from the integration process based on the evaluation value, it is possible to improve the quality or significance of the modules being integrated.

図７は、本実施形態に係る統合処理の一例を説明するための仮想のイメージ図である。図７Ａは、モジュラライズ処理において、最終ＶＦを用いて、要素間のエッジを除外して、モジュールを統合化するイメージ図である。図７Ａには、一例として、３つのＦＰＯモジュールＭ７‐１，Ｍ７‐２，Ｍ７‐３を示し、要素が丸で示され、要素の識別子「ｃ００１」－「ｃ０１１」、及びステップＳ６―１で計算された各ＦＰＯモジュールの最終ＭＦ、各要素の最終ＶＦが示されている。この例では、要素「ｃ００２」がＦＰＯモジュールＭ７‐１とＭ７‐２で重複し、要素「ｃ００８」がＦＰＯモジュールＭ７‐２とＭ７‐３で重複する。ＶＦ閾値が０．５に設定された場合、統合処理により全ての要素を含むモジュールＭ７‐４が構成される。ＶＦ閾値が０．７５に設定された場合、モジュールＭ７‐２内の要素「ｃ００２」、及びモジュールＭ７‐３内の要素「ｃ０１１」の最終ＶＦがＶＦ閾値未満であるためエッジで結ばれず、統合処理により除外される。この結果、統合処理によりモジュールＭ７‐５及びモジュールＭ７‐６が構成される。ＶＦ閾値が０．８５に設定された場合、統合処理を同様に実施した際、モジュールＭ７‐７、Ｍ７‐８、Ｍ７‐９が構成される。ここに、モジュールＭ７‐８とＭ７‐９の要素数はそれぞれ２と１と少なく、低く評価されると共に、複数の要素間に有意な繋がりを示さないため、作業者によるＦＰＯモジュールの統合結果の解析を妨げる。Figure 7 is a virtual image diagram for explaining an example of the integration process according to this embodiment. Figure 7A is an image diagram of integrating modules by removing edges between elements using the final VF in the modularization process. Figure 7A shows three FPO modules M7-1, M7-2, and M7-3 as an example, with elements shown as circles, element identifiers "c001"-"c011", and the final MF and final VF of each FPO module calculated in step S6-1. In this example, element "c002" overlaps between FPO modules M7-1 and M7-2, and element "c008" overlaps between FPO modules M7-2 and M7-3. When the VF threshold is set to 0.5, a module M7-4 including all elements is constructed by the integration process. When the VF threshold is set to 0.75, the final VF of element "c002" in module M7-2 and element "c011" in module M7-3 is less than the VF threshold, so they are not connected by an edge and are excluded by the integration process. As a result, modules M7-5 and M7-6 are formed by the integration process. When the VF threshold is set to 0.85, modules M7-7, M7-8, and M7-9 are formed when the integration process is performed in the same manner. Here, modules M7-8 and M7-9 have a small number of elements, 2 and 1, respectively, and are evaluated low. In addition, they do not show significant connections between multiple elements, which hinders the operator from analyzing the integration results of the FPO modules.

一方、図７Ｂは、モジュラライズ処理において、最終ＭＦ及びシード要素の最終ＶＦを用いてモジュール毎の構成要素を除外し、かつ、最終ＶＦを用いて要素間のエッジを除外することにより、モジュールを統合化するイメージ図である。図７Ｂは、ＶＦ閾値がＶＦ閾値の最小値から最大値まで設定された際、最終ＭＦがＶＦ閾値未満のＦＰＯモジュール、及びシード要素の最終ＶＦがＶＦ閾値未満のＦＰＯモジュールを統合処理から除外した場合の統合処理を示す。ＶＦ閾値が０．８５に設定された場合、最終ＭＦが０．８５未満のモジュールＭ７‐２とＭ７‐３が統合処理から除外され、統合処理を実施した際、モジュールＭ７‐７のみが構成される。このように、統合処理により評価値の低いモジュールの構成を防ぐことができ、構成されるモジュールの品質の向上または有意性の向上を図ることができる。On the other hand, FIG. 7B is an image diagram of integrating modules by excluding components for each module using the final MF and the final VF of the seed element in the modularization process, and excluding edges between elements using the final VF. FIG. 7B shows the integration process in the case where the VF threshold is set from the minimum value to the maximum value of the VF threshold, and FPO modules whose final MF is less than the VF threshold and whose seed element's final VF is less than the VF threshold are excluded from the integration process. When the VF threshold is set to 0.85, modules M7-2 and M7-3 whose final MF is less than 0.85 are excluded from the integration process, and only module M7-7 is configured when the integration process is performed. In this way, the integration process can prevent the construction of modules with low evaluation values, and can improve the quality or significance of the modules that are configured.

ＦＰＯモジュール統合部１３４は、モジュールサイズの所定範囲に収まるＦＰＯモジュールの個数が最大であるＶＦ閾値のＦＰＯモジュールの統合結果（ＦＰＯモジュール群）を、最終的なＦＰＯモジュールの統合結果として最終モジュール統合情報１２５に格納する。The FPOmodule integration unit 134 stores the integration result (FPO module group) of the FPO modules with the VF threshold value that has the largest number of FPO modules that fall within a specified range of module sizes in the finalmodule integration information 125 as the final FPO module integration result.

従来、最終的なＦＰＯモジュールの統合結果のみでは、要素の位置関係の全体像が分からず、ＦＰＯモジュールの内部構造の解析が難しかった。しかし、本実施形態によれば、ＶＦ閾値毎のＦＰＯモジュールの統合結果がモジュール統合情報１２４に格納されるので、ＶＦ閾値毎のＦＰＯモジュールの統合結果によって、作業者が、ＶＦ閾値の変化に伴ってＦＰＯモジュールの統合結果が変化する様子を確認することができる。これにより、最終的なＦＰＯモジュールの統合結果における要素の位置関係の全体像を把握し、ＦＰＯモジュールの内部構造の解析に貢献することができる。Conventionally, the final FPO module integration result alone did not provide an overall picture of the positional relationships of the elements, making it difficult to analyze the internal structure of the FPO module. However, according to this embodiment, the FPO module integration result for each VF threshold is stored in themodule integration information 124, so that the FPO module integration result for each VF threshold allows the operator to check how the FPO module integration result changes as the VF threshold changes. This allows the operator to grasp the overall picture of the positional relationships of the elements in the final FPO module integration result, which contributes to the analysis of the internal structure of the FPO module.

図８及び図９は、本実施形態に係る最終的なＦＰＯモジュールの統合結果の例を示す図である。図８及び図９は、米国のある大学の空手クラブに所属する３４人の交友関係を示したデータセットを用いて、ピアソンの積率相関係数、ＭＩＮ値「５」、ＭＡＸ値「３４」でＦＰＯ解析を実施した結果である。構成要素は丸で示し、分かれたメンバー毎に丸の中の模様で区別した。番号は、空手クラブの３４人の識別番号である。実線は、モジュラライズ処理で得られた結びつきの関係性を示す。図８は、得られたモジュール毎にネットワーク描写した結果である。図９Ａは、最終的なＦＰＯモジュールの統合結果に対応するＶＦ閾値毎のＦＰＯモジュールの統合結果の例を示す図である。図９Ａの例では、ＶＦ閾値（横軸）の変化に応じてＦＰＯモジュールの統合結果（各要素（縦軸）のモジュール化）が変化する様子がグラフで示されている。例えば、ＶＦ閾値（ＶＦ_ｔｈ）が０．８１の場合、図８に示されるモジュールＭ８‐１，Ｍ８‐２，Ｍ８‐３の３個のモジュールに関してモジュールＭ８‐１，Ｍ８‐２，Ｍ８‐３の関係を図９Ａから把握することができる。 8 and 9 are diagrams showing an example of the final FPO module integration result according to the present embodiment. FIG. 8 and FIG. 9 are the results of FPO analysis using a data set showing the friendships of 34 people belonging to a karate club at a university in the United States, with Pearson's product-moment correlation coefficient, MIN value "5", and MAX value "34". Components are shown as circles, and the patterns inside the circles distinguish each divided member. The numbers are the identification numbers of the 34 members of the karate club. The solid lines show the relationship of the connections obtained by the modularization process. FIG. 8 is the result of network depiction for each obtained module. FIG. 9A is a diagram showing an example of the FPO module integration result for each VF threshold corresponding to the final FPO module integration result. In the example of FIG. 9A, a graph is shown showing how the FPO module integration result (modularization of each element (vertical axis)) changes according to the change in the VF threshold (horizontal axis). For example, when the VF threshold (VF_th ) is 0.81, the relationship between the modules M8-1, M8-2, and M8-3 shown in FIG. 8 can be understood from FIG. 9A.

出力データ生成部１３８は、ＶＦ閾値毎のモジュール統合情報１２４を用いて、図９に例示されるＶＦ閾値（横軸）とＦＰＯモジュールの統合結果（各要素（縦軸）のモジュール化）との対応関係を示すデータ（例えばグラフ等）を生成してもよい。このグラフのデータは、表示画面上にグラフを表示したり、又はグラフを印字したりすることができるデータである。これにより、作業者は、最終的なＦＰＯモジュールの統合結果の解析を容易に行うことができる。The outputdata generation unit 138 may use themodule integration information 124 for each VF threshold to generate data (e.g., a graph) showing the correspondence between the VF threshold (horizontal axis) and the FPO module integration result (modularization of each element (vertical axis)) illustrated in FIG. 9. This graph data is data that can be used to display the graph on a display screen or print the graph. This allows the operator to easily analyze the final FPO module integration result.

ＦＰＯモジュール統合部１３４は更に、モジュール統合情報１２４に格納されたＶＦ閾値毎のＦＰＯモジュールの統合結果を基に、モジュール統合後の最終的なＦＰＯモジュールの距離を示す距離指標ｄを計算しても良い。距離指標は正の実数であり、全てのＦＰＯモジュールのペアにおいて計算される。距離指標ｄは、その値が低ければ低いほど、２つのモジュールの距離が近いと判定する。距離指標ｄを算出することによって、ＦＰＯモジュール同士の位置関係の全体像を容易に、かつ定量的に評価できるようになる。The FPOmodule integration unit 134 may further calculate a distance index d indicating the distance of the final FPO module after module integration based on the integration results of the FPO modules for each VF threshold stored in themodule integration information 124. The distance index is a positive real number and is calculated for all pairs of FPO modules. The lower the value of the distance index d, the closer the distance between the two modules is determined to be. By calculating the distance index d, it becomes possible to easily and quantitatively evaluate the overall positional relationship between the FPO modules.

図９Ｂは、本実施形態に係る最終的なＦＰＯモジュールの距離指標ｄの計算方法の一例を説明するための図である。図９Ｂは、図９Ａに例示される最終的なＦＰＯモジュールの統合結果を基に距離指標ｄの一つの計算方法を示す。図９Ｂでは、図９Ａに示されるＶＦ_ｔｈ＝０．８１で分離したモジュールＭ８‐１，Ｍ８‐２，Ｍ８‐３の３個のモジュールに対して、ＶＦ閾値を上げた際のモジュールの構造の情報を更に含める仮想モジュールＭ９‐１，Ｍ９‐２，Ｍ９‐３を考慮する。グラフ内の点Ｇは、分離された仮想モジュールＭ９‐１，Ｍ９‐２，Ｍ９‐３の中心点として定義する。得られたＦＰＯモジュール同士の距離に関して、ＶＦ閾値の差分と構成要素の数を利用して、仮想モジュールの中心点までの距離を演算することにより、ＦＰＯモジュールの統合結果における要素の位置関係の全体像をより正確的に把握する。 FIG. 9B is a diagram for explaining an example of a method for calculating the distance index d of the final FPO module according to the present embodiment. FIG. 9B shows one method for calculating the distance index d based on the integration result of the final FPO module illustrated in FIG. 9A. In FIG. 9B, virtual modules M9-1, M9-2, and M9-3 that further include information on the structure of the module when the VF threshold is increased are considered for the three modules M8-1, M8-2, and M8-3 separated at VF_th =0.81 shown in FIG. 9A. Point G in the graph is defined as the center point of the separated virtual modules M9-1, M9-2, and M9-3. With regard to the distance between the obtained FPO modules, the difference in the VF threshold and the number of components are used to calculate the distance to the center point of the virtual module, thereby more accurately grasping the overall picture of the positional relationship of the elements in the integration result of the FPO module.

例えばモジュールＭ９‐１とＭ９‐２の距離指標を計算した際、以下の流れ（１）～（３）で処理を行う。
（１）モジュールＭ８‐１とＭ８‐２の構成要素がモジュール統合処理により分割される最小のＶＦ閾値、ＶＦ_{ｔｈ，ｍｉｎ}を検索する。この例では、ＶＦ_{ｔｈ，ｍｉｎ} ＝０．７６となる。
（２）仮想モジュールＭ９‐１とＭ９‐２の構成要素が、ＶＦ閾値をＶＦ_ｔｈ以上に設定した際、モジュール統合処理により除外される最小のＶＦ閾値の平均値ＶＦ_{ｔｈ, ｍｉｎａｖｇ}を計算する。例えば、仮想モジュールＭ９‐１の場合、要素「２３」、「２１」、「１９」、「１６」、「１５」は、ＶＦ閾値が０．９５以上の場合に除外され、要素「３３」、「３１」、「３０」、「２４」は、ＶＦ閾値が０．９２以上の場合に除外され、要素「３４」、「２９」、「２８」、「２７」、「１０」は、ＶＦ閾値が０．９１以上の場合に除外され、要素「９」は、ＶＦ閾値が０．９０以上の場合に除外され、要素「３２」は、ＶＦ閾値が０．８６以上の場合に除外されることから、１６個の構成要素が除外される最小のＶＦ閾値の平均値、ＶＦ_{ｔｈ, ｍｉｎａｖｇ}（Ｍ８－１）で算出される。具体的には、ＶＦ_{ｔｈ, ｍｉｎａｖｇ}（Ｍ８－１）＝（０．９５×５＋０．９２×４＋０．９１×５＋０．９０×１＋０．８６×１）／１６＝０．９２１２と算出できる。
同様に、仮想モジュールＭ９‐２の場合、要素「８」、「４」、「２２」、「２」、「１８」は、ＶＦ閾値が０．９０以上の場合に除外され、要素「１」、「１３」、「１４」は、ＶＦ閾値が０．８８以上の場合に除外され、要素「２０」は、ＶＦ閾値が０．８７以上の場合に除外され、要素「３」は、ＶＦ閾値が０．８６以上の場合に除外されることから、１０個の構成要素が除外される最小のＶＦ閾値の平均値ＶＦ_{ｔｈ, ｍｉｎａｖｇ}（Ｍ８－２）で算出される。具体的には、ＶＦ_{ｔｈ, ｍｉｎａｖｇ}（Ｍ８－２）＝（０．９×５＋０．８８×３＋０．８７×１＋０．８６×１）／１０＝０．８８７０と算出できる。
（３）距離指標ｄ（Ｍ８－１, Ｍ８－２）を次式（６）により計算する。 For example, when calculating the distance index between modules M9-1 and M9-2, the process is carried out in the following steps (1) to (3).
(1) Find the minimum VF threshold, VF_th,min , at which the components of modules M8-1 and M8-2 are split by the module merging process. In this example, VF_th,min =0.76.
(2) When the components of the virtual modules M9-1 and M9-2 have their VF thresholds set to VF_th or more, the average value VF_th,minavg of the minimum VF thresholds to be excluded by the module integration process is calculated. For example, in the case of the virtual module M9-1, the elements "23", "21", "19", "16", and "15" are excluded when the VF threshold is 0.95 or more, the elements "33", "31", "30", and "24" are excluded when the VF threshold is 0.92 or more, the elements "34", "29", "28", "27", and "10" are excluded when the VF threshold is 0.91 or more, the element "9" is excluded when the VF threshold is 0.90 or more, and the element "32" is excluded when the VF threshold is 0.86 or more. Therefore, the average value VF_th,minavg (M8-1) of the minimum VF thresholds from which 16 components are excluded is calculated. Specifically, VF_th,minavg (M8-1) can be calculated as (0.95×5+0.92×4+0.91×5+0.90×1+0.86×1)/16=0.9212.
Similarly, in the case of the virtual module M9-2, the elements "8", "4", "22", "2", and "18" are excluded when the VF threshold is 0.90 or more, the elements "1", "13", and "14" are excluded when the VF threshold is 0.88 or more, the element "20" is excluded when the VF threshold is 0.87 or more, and the element "3" is excluded when the VF threshold is 0.86 or more, so that the average value VF_th,minavg (M8-2) of the minimum VF threshold from which 10 components are excluded is calculated. Specifically, it can be calculated as VF_th,minavg (M8-2) = (0.9 x 5 + 0.88 x 3 + 0.87 x 1 + 0.86 x 1) / 10 = 0.8870.
(3) Calculate the distance index d(M8-1, M8-2) using the following equation (6).

上記（６）式において、ｄはモジュール同士の距離指標を示す。ＶＦ_ｔｈは、ＶＦ閾値を示す。ＶＦ_{ｔｈ，ｍｉｎ}は、モジュールＭ８‐１とＭ８‐２の構成要素がモジュール統合処理により分割される最小のＶＦ閾値を示す。ＶＦ_{ｔｈ, ｍｉｎａｖｇ}は、両モジュールの構成要素が除外される最小のＶＦ閾値のそれぞれの平均値を示す。この例では、ｄ（Ｍ８－１，Ｍ８－２）＝２（０．８１－０．７６）＋（０．８８７０－０．８１）／２＋（０．９２１２－０．８１）／２＝０．１０００＋０．０３８５＋０．０５５６＝０．１９４１と算出できる。 In the above formula (6), d indicates the distance index between modules. VF_th indicates the VF threshold._{VF th,min} indicates the minimum VF threshold at which the components of modules M8-1 and M8-2 are divided by the module integration process._{VF th,minavg} indicates the average value of the minimum VF threshold at which the components of both modules are excluded. In this example, it can be calculated as d(M8-1, M8-2) = 2 (0.81 - 0.76) + (0.8870 - 0.81) / 2 + (0.9212 - 0.81) / 2 = 0.1000 + 0.0385 + 0.0556 = 0.1941.

仮想モジュールＭ９‐２とＭ９‐３に対して同様の計算を施すと、ｄ（Ｍ８－２，Ｍ８－３）＝０．０５８５、ｄ（Ｍ８－１, Ｍ８－３）＝０．１７５６と計算される。ｄ（Ｍ８－２, Ｍ８－３）の場合では、仮想モジュールＭ９－３の場合、要素「７」、「６」、「５」、「１２」、「１１」は、ＶＦ閾値が０．８５以上の場合に除外されることから、５個の構成要素が除外されるＶＦ閾値の平均値、ＶＦ_{ｔｈ, ｍｉｎａｖｇ}（Ｍ８－３）で算出される。具体的には、ＶＦ_{ｔｈ, ｍｉｎａｖｇ}（Ｍ８－３）＝０．８５×５／５＝０．８５００と算出できる。モジュールＭ８‐２とＭ８‐３の構成要素がモジュール統合処理により分割される最小のＶＦ閾値、ＶＦ_{ｔｈ，ｍｉｎ}を検索する。この例では、ＶＦ_{ｔｈ，ｍｉｎ}＝０．８１となる。この例では、ｄ（Ｍ８－２，Ｍ８－３）＝２（０．８１－０．８１）＋（０．８８７０－０．８１）／２＋（０．８５００－０．８１）／２＝０．０３８５＋０．０２００＝０．０５８５と算出できる。
また、ｄ（Ｍ８－１，Ｍ８－３）の場合では、モジュールＭ８‐１とＭ８‐３の構成要素がモジュール統合処理により分割される最小のＶＦ閾値、ＶＦ_{ｔｈ，ｍｉｎ}を検索する。この例では、ＶＦ_{ｔｈ，ｍｉｎ}＝０．７６となる。この例では、ｄ（Ｍ８－１，Ｍ８－３）＝２（０．８１－０．７６）＋（０．９２１２－０．８１）／２＋（０．８８７０－０．８１）／２＝０．１０００＋０．０５５６＋０．０２００＝０．１７５６と算出できる。
以上のことから、Ｍ８－２はＭ８－３よりＭ８－１に近いと判断できる。このように、最終的なＦＰＯモジュールの統合結果における要素の位置関係の全体像をより正確的に把握することができる。 When the same calculation is performed for the virtual modules M9-2 and M9-3, d(M8-2, M8-3)=0.0585 and d(M8-1, M8-3)=0.1756 are calculated. In the case of d(M8-2, M8-3), in the case of the virtual module M9-3, the elements "7", "6", "5", "12", and "11" are excluded when the VF threshold is 0.85 or more, so the average value of the VF threshold at which five components are excluded, VF_th,minavg (M8-3), is calculated. Specifically, it can be calculated as VF_th,minavg (M8-3)=0.85×5/5=0.8500. The minimum VF threshold, VF_th,min , at which the components of the modules M8-2 and M8-3 are divided by the module integration process is searched for. In this example, VF_th,min = 0.81. In this example, it can be calculated that d(M8-2, M8-3) = 2(0.81-0.81) + (0.8870-0.81)/2 + (0.8500-0.81)/2 = 0.0385 + 0.0200 = 0.0585.
In addition, in the case of d(M8-1, M8-3), the minimum VF threshold, VF_th,min , at which the components of modules M8-1 and M8-3 are divided by the module integration process is searched for. In this example, VF_th,min = 0.76. In this example, it can be calculated as d(M8-1, M8-3) = 2 (0.81 - 0.76) + (0.9212 - 0.81) / 2 + (0.8870 - 0.81) / 2 = 0.1000 + 0.0556 + 0.0200 = 0.1756.
From the above, it can be determined that M8-2 is closer to M8-1 than to M8-3. In this way, it is possible to more accurately grasp the overall picture of the positional relationship of elements in the final integration result of the FPO module.

図９Ｂには、図９Ａに例示される最終的なＦＰＯモジュールの統合結果を基に算出した距離指標を数値として表記する。モジュールＭ８－１において、距離０．１０５６は、モジュールＭ８－２が分割される最小のＶＦ閾値からモジュールＭ８－１を構成する要素が除外される最小のＶＦ閾値までの各要素の距離の平均値を示す。モジュールＭ８－１において、距離０．０５５６は、モジュールＭ８－３が分割される最小のＶＦ閾値からモジュールＭ８－１を構成する要素が除外される最小のＶＦ閾値までの距離の平均値を示す。モジュールＭ８－２において、距離０．０８８５は、モジュールＭ８－１が分割される最小のＶＦ閾値からモジュールＭ８－２を構成する要素が除外される最小のＶＦ閾値までの各要素の距離の平均値を示す。モジュールＭ８－２において、距離０．０３８５は、モジュールＭ８－３が分割される最小のＶＦ閾値からモジュールＭ８－２を構成する要素が除外される最小のＶＦ閾値までの各要素の距離の平均値を示す。モジュールＭ８－３において、距離０．０７００は、モジュールＭ８－１が分割される最小のＶＦ閾値からモジュールＭ８－３を構成する要素が除外される最小のＶＦ閾値までの距離の平均値を示す。モジュールＭ８－３において、距離０．０２００は、モジュールＭ８－２が分割される最小のＶＦ閾値からモジュールＭ８－３を構成する要素が除外される最小のＶＦ閾値までの各要素の距離の平均値を示す。In FIG. 9B, the distance index calculated based on the final FPO module integration result illustrated in FIG. 9A is expressed as a numerical value. In module M8-1, the distance 0.1056 indicates the average value of the distance of each element from the minimum VF threshold at which module M8-2 is divided to the minimum VF threshold at which the elements constituting module M8-1 are excluded. In module M8-1, the distance 0.0556 indicates the average value of the distance from the minimum VF threshold at which module M8-3 is divided to the minimum VF threshold at which the elements constituting module M8-1 are excluded. In module M8-2, the distance 0.0885 indicates the average value of the distance of each element from the minimum VF threshold at which module M8-1 is divided to the minimum VF threshold at which the elements constituting module M8-2 are excluded. In module M8-2, the distance 0.0385 indicates the average value of the distance of each element from the minimum VF threshold at which module M8-3 is divided to the minimum VF threshold at which the elements constituting module M8-2 are excluded. In module M8-3, the distance 0.0700 indicates the average value of the distance from the minimum VF threshold at which module M8-1 is divided to the minimum VF threshold at which the elements constituting module M8-3 are excluded, and the distance 0.0200 indicates the average value of the distance of each element from the minimum VF threshold at which module M8-2 is divided to the minimum VF threshold at which the elements constituting module M8-3 are excluded.

（ステップＳ８－１）ＦＰＯモジュール統合部１３４は、上記ステップＳ６－１のＦＰＯ解析部１３２と同様に、モジュール統合後の最終的なＦＰＯモジュールそれぞれに対して、最終ＶＦ及び最終ＭＦを算出する。モジュール統合後の最終的なＦＰＯモジュールそれぞれに対して算出された最終ＶＦ及び最終ＭＦは、評価値情報１２６に格納される。(Step S8-1) The FPOmodule integration unit 134, similar to theFPO analysis unit 132 in step S6-1 above, calculates the final VF and final MF for each of the final FPO modules after the module integration. The final VF and final MF calculated for each of the final FPO modules after the module integration are stored in theevaluation value information 126.

（ステップＳ８－２）Ｐ値算出部１３７は、モジュール統合後の最終的なＦＰＯモジュールそれぞれに対して、上記の（５）式により、モジュールＰ値を算出する。モジュール統合後の最終的なＦＰＯモジュールそれぞれに対して算出されたモジュールＰ値は、モジュール統合後のモジュールＰ値として評価値情報１２６に格納される。
モジュール統合後のモジュールＰ値によって、作業者はモジュール統合後の最終的なＦＰＯモジュールそれぞれの有意性を評価することができる。これにより、モジュールの信憑性を付加することができるようになる。(Step S8-2) The Pvalue calculation unit 137 calculates a module P value for each of the final FPO modules after the module integration by the above formula (5). The module P value calculated for each of the final FPO modules after the module integration is stored in theevaluation value information 126 as the module P value after the module integration.
The module P value after module integration allows the operator to evaluate the significance of each final FPO module after module integration, which makes it possible to add credibility to the module.

（ステップＳ９）ＦＮＩ解析部１３５は、モジュール統合後の最終的なＦＰＯモジュール（最終ＦＰＯモジュール）のいずれか複数または全てをそれぞれ対象にして、偽陰性（False-Negative）の要素を取り入れる（In）処理（ＦＮＩ解析処理）を実行する。(Step S9) TheFNI analysis unit 135 performs a process (FNI analysis process) to incorporate (In) false negative elements on some or all of the final FPO modules (final FPO modules) after module integration.

ＦＮＩ解析処理の対象にする最終ＦＰＯモジュールは、例えば、ユーザが複数を指定できるようにしてもよい。又は、ＦＮＩ解析処理の対象にする最終ＦＰＯモジュールの条件（ＦＮＩ解析処理対象条件）を予め情報処理装置１に設定しておき、ＦＮＩ解析部１３５が、ＦＮＩ解析処理対象条件を満たさない最終ＦＰＯモジュールを除外し、残りの最終ＦＰＯモジュールをＦＮＩ解析処理の対象に決定してもよい。ＦＮＩ解析処理対象条件は、例えば、一定の有意性（モジュールＰ値）以上を持つモジュールである。
ＦＮＩ解析処理について以下に説明する。 For example, the user may be allowed to specify multiple final FPO modules to be the target of the FNI analysis process. Alternatively, conditions for the final FPO module to be the target of the FNI analysis process (FNI analysis process target conditions) may be set in advance in theinformation processing device 1, and theFNI analysis unit 135 may exclude final FPO modules that do not satisfy the FNI analysis process target conditions, and determine the remaining final FPO modules as the target of the FNI analysis process. The FNI analysis process target condition is, for example, a module having a certain level of significance (module P value) or more.
The FNI analysis process is described below.

図１０は、本実施形態に係るＦＮＩ解析処理を説明するための仮想のイメージ図である。図１０では、要素が丸で示される。要素には、注目要素、候補要素、対象モジュール内の要素、対象モジュール外の要素が含まれる。対象モジュール内の要素とは、モジュール統合後の最終的なＦＰＯモジュール（最終ＦＰＯモジュール）に含まれる要素であり、対象モジュール外の要素とは、モジュール統合後の最終的なＦＰＯモジュール（最終ＦＰＯモジュール）に含まれない要素である。また、丸の中に示す数は、候補要素に対する相関順位である。各要素間は線で繋がれる。実線は、注目要素を含めてモジュール内要素間の関係性を示す。点線は、候補要素を含めてモジュール外要素間の関係性を示す。
ＦＮＩ解析部１３５は、ＦＮＩ解析処理の対象の全ての最終ＦＰＯモジュールに含まれる全要素それぞれを順次注目要素に設定しながら、注目要素に関するＦＮＩ解析処理を繰り返す。したがって、ＦＮＩ解析処理の対象の全ての最終ＦＰＯモジュールに含まれる全要素の数だけ、ＦＮＩ解析処理が実行される。ここでは、図１０を参照して、図１０に示される１個の注目要素を例に挙げて説明する。 FIG. 10 is a virtual image diagram for explaining the FNI analysis process according to the present embodiment. In FIG. 10, elements are indicated by circles. The elements include an element of interest, a candidate element, an element in a target module, and an element outside the target module. An element in a target module is an element included in a final FPO module (final FPO module) after module integration, and an element outside the target module is an element not included in a final FPO module (final FPO module) after module integration. The number shown in the circle is a correlation rank with respect to the candidate element. Each element is connected by a line. A solid line indicates a relationship between elements in a module including an element of interest. A dotted line indicates a relationship between elements outside the module including a candidate element.
TheFNI analysis unit 135 repeats the FNI analysis process for the element of interest while sequentially setting all elements included in all final FPO modules targeted for the FNI analysis process as an element of interest. Therefore, the FNI analysis process is executed as many times as the number of elements included in all final FPO modules targeted for the FNI analysis process. Here, with reference to FIG. 10, an explanation will be given by taking one element of interest shown in FIG. 10 as an example.

ＦＮＩ解析部１３５は、注目要素が属する最終ＦＰＯモジュール（対象モジュール）に取り入れる候補要素を設定する。例えば、対象モジュールに取り入れる候補要素は、対象モジュールに属さない全ての要素（つまり、対象モジュール外の全要素）であってもよい。例えば、対象モジュールに取り入れる候補要素は、対象モジュール外の全要素のうち、注目要素に関する相関順位が所定の上位の順位までに限定してもよい。例えば、対象モジュールに取り入れる候補要素は、対象モジュール外の全要素のうち、注目要素に関する相関係数が所定の相関閾値以上に限定してもよい。例えば、対象モジュールに取り入れる候補要素は、対象モジュール外の全要素のうち、注目要素に関する相関順位が所定の上位の順位まで且つ注目要素に関する相関係数が所定の相関閾値以上に限定してもよい。例えば、対象モジュールに取り入れる候補要素は、どの最終ＦＰＯモジュールにも属さない要素に限定してもよい。ＦＮＩ解析部１３５は、候補要素毎に、対象モジュールに関する偽陰性の判定を行う。TheFNI analysis unit 135 sets candidate elements to be incorporated into the final FPO module (target module) to which the element of interest belongs. For example, the candidate elements to be incorporated into the target module may be all elements that do not belong to the target module (i.e., all elements outside the target module). For example, the candidate elements to be incorporated into the target module may be limited to all elements outside the target module whose correlation ranking with respect to the element of interest is a predetermined high rank. For example, the candidate elements to be incorporated into the target module may be limited to all elements outside the target module whose correlation coefficient with respect to the element of interest is equal to or greater than a predetermined correlation threshold. For example, the candidate elements to be incorporated into the target module may be limited to all elements outside the target module whose correlation ranking with respect to the element of interest is equal to or greater than a predetermined correlation threshold. For example, the candidate elements to be incorporated into the target module may be limited to elements that do not belong to any final FPO module. TheFNI analysis unit 135 performs a false negative determination for the target module for each candidate element.

具体的には、ＦＮＩ解析部１３５は、候補要素ｉの相関係数リスト１２２の上位（相関が高い方からの順位（相関順位））から順番に参照し、各順位において、候補要素ｉの対象モジュールに関する上記（２）式のＶＳ（ｉ）「ＶＳ（ｉ）＝（候補要素ｉの対象モジュール内のエッジ数ｅｄｇｅ（ｉ））÷（候補要素ｉのディグリー数ｄｅｇｒｅｅ（ｉ））」を算出する。ＦＮＩ解析部１３５は、算出した全てのＶＳ（ｉ）のうち、所定のＶＳ閾値を超えるＶＳ（ｉ）に関する相関順位等の情報を記憶部１２に記録する。ＶＳ閾値は、例えば作業者が任意に設定可能とする。ＦＮＩ解析部１３５は、所定のＶＳ閾値を超えるＶＳ（ｉ）が算出された場合、候補要素が注目要素に関する偽陰性の要素であると判定し、候補要素を対象モジュールに取り入れる。Specifically, theFNI analysis unit 135 refers to thecorrelation coefficient list 122 of the candidate element i in order from the top (rank from the highest correlation (correlation rank)), and calculates, for each rank, the VS(i) of the above formula (2) for the target module of the candidate element i, "VS(i) = (number of edges in the target module of the candidate element i edge(i)) ÷ (number of degrees of the candidate element i degree(i))". TheFNI analysis unit 135 records information such as the correlation rank for VS(i) that exceeds a predetermined VS threshold among all the calculated VS(i) in thestorage unit 12. The VS threshold can be set arbitrarily by, for example, an operator. When a VS(i) that exceeds a predetermined VS threshold is calculated, theFNI analysis unit 135 determines that the candidate element is a false negative element for the target element, and incorporates the candidate element into the target module.

（ステップＳ１０）Ｐ値算出部１３７は、ＶＳ閾値を超えるＶＳ（ｉ）に対応する相関順位における対象モジュールに関する要素Ｐ値を、次式により算出する。(Step S10) The P-value calculation unit 137 calculates the element P-value for the target module in the correlation ranking corresponding to the VS(i) that exceeds the VS threshold using the following formula.

上記（７）式において、Ｎは、入力データである多変量データの全要素数である。ｎは、対象モジュールの要素数である。ｅｄｇｅは、候補要素ｉの対象モジュール内のエッジ数ｅｄｇｅ（ｉ）である。ｄｅｇｒｅｅは、候補要素ｉのディグリー数ｄｅｇｒｅｅ（ｉ）である。ｑ、ｒは、正の整数であって、ｒ≦ｑである。In the above formula (7), N is the total number of elements of the multivariate data that is the input data. n is the number of elements in the target module. edge is the number of edges edge(i) in the target module of candidate element i. degree is the number of degrees degree(i) of candidate element i. q and r are positive integers, and r≦q.

なお、要素Ｐ値は、ステップＳ９の各相関順位におけるＶＳ（ｉ）を計算しながら（８）式の再帰公式を用いて計算してもよい。The element P value may be calculated using the recursive formula (8) while calculating VS(i) at each correlation rank in step S9.

上記（８）式において、Ｎは、入力データである多変量データの全要素数である。ｎは、モジュールの要素数である。ｅｄｇｅは、候補要素ｉの対象モジュール内のエッジ数ｅｄｇｅ（ｉ）である。ｄｅｇｒｅｅは、候補要素ｉのディグリー数ｄｅｇｒｅｅ（ｉ）である。
ステップＳ９のＶＳ（ｉ）を計算しながら（８）式の再帰公式を用いて要素Ｐ値を計算することによって、組み合わせの数の計算を省略することができ、処理の効率向上を図ることができる。 In the above formula (8), N is the total number of elements of the multivariate data that is the input data. n is the number of elements in the module. edge is the number of edges edge(i) in the target module of the candidate element i. degree is the number of degrees degree(i) of the candidate element i.
By calculating the element P value using the recursive formula of equation (8) while calculating VS(i) in step S9, it is possible to omit the calculation of the number of combinations, thereby improving the processing efficiency.

図１１は、シロイヌナズナのマイクロアレイデータ（要素数２７１２７ｘ９４４２サンプルのデータセット）を用いて、ピアソンの積率相関係数、ＭＩＮ値「２」、ＭＡＸ値「２５０」でＦＰＯ解析を完了した上、ＦＮＩ解析（ステップＳ９からＳ１３－２（後述）までを含め）の実施に要した時間を示す。本発明のとおり、ステップＳ９とステップＳ１０を同時に実行させて要素Ｐ値の計算に再帰公式を採用したことで、解析時間が１／９２と大幅に減少したことを表している。Figure 11 shows the time required to complete the FPO analysis using Arabidopsis microarray data (a data set of 27,127 elements x 9,442 samples) with Pearson's product-moment correlation coefficient, MIN value of "2", and MAX value of "250", and to perform the FNI analysis (including steps S9 to S13-2 (described below)). As per the present invention, steps S9 and S10 are executed simultaneously and a recursive formula is used to calculate the element P value, which shows that the analysis time was significantly reduced to 1/92.

Ｐ値算出部１３７は、ＶＳ閾値を超えるＶＳ（ｉ）に対応する相関順位のうち要素Ｐ値が最小である相関順位に関する情報を、ＦＮＩ解析における評価値情報１２６に格納する。要素Ｐ値が最小である相関順位の情報は、最も有意性があると判定できる相関順位の情報である。The P-value calculation unit 137 stores information about the correlation rank with the smallest element P value among the correlation ranks corresponding to VS(i) exceeding the VS threshold in theevaluation value information 126 in the FNI analysis. The information about the correlation rank with the smallest element P value is the information about the correlation rank that can be determined to be the most significant.

（ステップＳ１１）ＦＮＩ解析処理の対象の最終ＦＰＯモジュールの全てについて全要素に対するＦＮＩ解析処理が完了した場合（ステップＳ１１、ＹＥＳ）ステップＳ１２に進む。一方、まだＦＮＩ解析処理が未実行の要素がある場合（ステップＳ１１、ＮＯ）ステップＳ９に戻り、ＦＮＩ解析処理が未実行の要素を注目要素としてＦＮＩ解析処理を実行する。(Step S11) If the FNI analysis process has been completed for all elements of all final FPO modules that are the subject of the FNI analysis process (Step S11, YES), proceed to Step S12. On the other hand, if there are still elements for which the FNI analysis process has not been performed (Step S11, NO), return to Step S9 and perform the FNI analysis process on the elements for which the FNI analysis process has not been performed as the focus elements.

（ステップＳ１２）ＦＮＩアサイン処理部１３６は、ＦＮＩ解析処理によって発生した複数の最終ＦＰＯモジュールで重複する候補要素に対して重複を解消するアサイン処理を実行する。アサイン処理では、次のアサイン判定基準を用いる。(Step S12) The FNIassignment processing unit 136 performs an assignment process to eliminate overlapping candidate elements in multiple final FPO modules generated by the FNI analysis process. The assignment process uses the following assignment criteria:

アサイン判定基準：候補要素ｉが重複する複数の対象モジュールに対して、以下の優先順位を用いる。
ＶＳ（ｉ）＞「候補要素ｉを対象モジュールに取り入れた場合の上記（５）式のモジュールＰ値」＞モジュールの要素数
ＶＳ（ｉ）が最優先であり、次いでモジュールＰ値が優先であり、次いでモジュールの要素数である。 Assignment Criteria: For multiple target modules with overlapping candidate elements i, the following priorities are used:
VS(i)>"module P value of equation (5) above when candidate element i is incorporated into the target module">number of elements in the module VS(i) has top priority, followed by the module P value, and then the number of elements in the module.

まず、ＶＳ（ｉ）が最大の対象モジュールのみに候補要素ｉを残し、それ以外の対象モジュールから候補要素ｉを削除する。
一方、ＶＳ（ｉ）が同等である場合には、「候補要素ｉを対象モジュールに取り入れた場合の上記（５）式のモジュールＰ値」が最小の対象モジュールのみに候補要素ｉを残し、それ以外の対象モジュールから候補要素ｉを削除する。
ＶＳ（ｉ）が同等である、且つ「候補要素ｉを対象モジュールに取り入れた場合の上記（５）式のモジュールＰ値」が対象モジュール間で同等のときは、モジュールの要素数が最大である対象モジュールのみに候補要素ｉを残し、それ以外の対象モジュールから候補要素ｉを削除する。 First, the candidate element i is left only in the target module with the largest VS(i), and the candidate element i is removed from the other target modules.
On the other hand, if VS(i) is equivalent, the candidate element i is left only in the target module having the smallest “module P value of equation (5) above when the candidate element i is incorporated into the target module,” and the candidate element i is removed from the other target modules.
When VS(i) is equivalent and the "module P value of equation (5) above when candidate element i is incorporated into the target module" is equivalent between the target modules, the candidate element i is left only in the target module with the largest number of elements, and the candidate element i is deleted from the other target modules.

なお、アサイン処理は、ＦＮＩ解析処理によって発生した複数のモジュール間の要素の重複を解消できればよく、上記したアサイン判定基準による処理に限定されない。例えば、モジュール統合処理（モジュラライズ処理）によって、重複する要素を持つモジュールの統合を行ってもよい。The assignment process is not limited to the process based on the above-mentioned assignment criteria, as long as it can eliminate overlapping elements between multiple modules that occur as a result of the FNI analysis process. For example, modules with overlapping elements may be integrated by a module integration process (modularization process).

ＦＮＩアサイン処理部１３６は、アサイン処理後の最終ＦＰＯモジュールの情報を、最終モジュール情報１２７に格納する。最終モジュール情報１２７は、モジュール個別のファイルであってもよいし、全てのモジュールをまとめたファイルであってもよいし、又は、モジュール個別のファイル及び全てのモジュールをまとめたファイルの両方であってもよい。なお、アサイン処理前の最終ＦＰＯモジュールの情報を、記憶部１２に記録してもよい。The FNIassignment processing unit 136 stores information on the final FPO module after the assignment process in thefinal module information 127. Thefinal module information 127 may be a file for each module, a file that includes all modules, or both a file for each module and a file that includes all modules. Information on the final FPO module before the assignment process may be recorded in thememory unit 12.

本実施形態によれば、複数の最終ＦＰＯモジュールに対してＦＮＩ解析処理を行った結果として複数の最終ＦＰＯモジュールで重複する候補要素が発生した場合に、候補要素の重複を解消することができる。これにより、作業者は、ＦＮＩ解析処理を実行する最終ＦＰＯモジュールを個別に指定する手間が省け、複数の最終ＦＰＯモジュールに対して一括でＦＮＩ解析処理を実行することができると共に、要素の重複がないＦＮＩ解析処理結果を得ることができる。According to this embodiment, when overlapping candidate elements occur in multiple final FPO modules as a result of performing FNI analysis processing on multiple final FPO modules, it is possible to eliminate the overlapping candidate elements. This saves the operator the trouble of individually specifying the final FPO module on which to perform the FNI analysis processing, and allows the operator to perform the FNI analysis processing on multiple final FPO modules in a batch, while also obtaining an FNI analysis processing result without overlapping elements.

（ステップＳ１３－１）ＦＮＩアサイン処理部１３６は、上記ステップＳ６－１のＦＰＯ解析部１３２と同様に、アサイン処理後の最終ＦＰＯモジュールそれぞれに対して、最終ＶＦ及び最終ＭＦを算出する。アサイン処理後の最終ＦＰＯモジュールそれぞれに対して算出された最終ＶＦ及び最終ＭＦは、評価値情報１２６に格納される。(Step S13-1) The FNIassignment processing unit 136, similar to theFPO analysis unit 132 in step S6-1 above, calculates the final VF and final MF for each final FPO module after the assignment process. The final VF and final MF calculated for each final FPO module after the assignment process are stored in theevaluation value information 126.

（ステップＳ１３－２）Ｐ値算出部１３７は、アサイン処理後の最終ＦＰＯモジュールそれぞれに対して、上記の（５）式により、モジュールＰ値を算出する。アサイン処理後の最終ＦＰＯモジュールそれぞれに対して算出されたモジュールＰ値は、アサイン処理後のモジュールＰ値として評価値情報１２６に格納される。
アサイン処理後のモジュールＰ値によって、作業者はアサイン処理後の最終ＦＰＯモジュールそれぞれの有意性を評価することができる。これにより、モジュールの信憑性を付加することができるようになる。(Step S13-2) The Pvalue calculation unit 137 calculates a module P value for each of the final FPO modules after the assignment process by the above formula (5). The module P value calculated for each of the final FPO modules after the assignment process is stored in theevaluation value information 126 as the module P value after the assignment process.
The module P-value after the assignment process allows the operator to evaluate the significance of each final FPO module after the assignment process, thereby adding credibility to the module.

（ステップＳ１４）出力データ生成部１３８は、各種の出力データを生成する。出力データは、表示画面上にデータを表示したり、又はデータを印字したりすることができるデータである。例えば、出力データ生成部１３８は、アサイン処理後の最終ＦＰＯモジュールそれぞれの構成を表現する描画データを生成する。この描画データによって、アサイン処理後の最終ＦＰＯモジュールそれぞれの構成が描画されることで可視化される。(Step S14) The outputdata generation unit 138 generates various output data. The output data is data that can be displayed on a display screen or printed. For example, the outputdata generation unit 138 generates drawing data that represents the configuration of each final FPO module after the assignment process. The configuration of each final FPO module after the assignment process is drawn and visualized by this drawing data.

例えば、出力データ生成部１３８は、ＦＰＯ解析処理により得られた、各シード要素についての各ランクに対応するＭＦを示すグラフデータ（例えば図５Ａに例示されるグラフデータなど）を生成してもよい。例えば、出力データ生成部１３８は、ＦＰＯモジュール統合処理により得られた、ＶＦ閾値別での統合結果を示すグラフデータ（例えば図９に例示されるグラフデータなど）を生成してもよい。For example, the outputdata generation unit 138 may generate graph data (such as the graph data illustrated in FIG. 5A) showing the MF corresponding to each rank for each seed element obtained by the FPO analysis process. For example, the outputdata generation unit 138 may generate graph data (such as the graph data illustrated in FIG. 9) showing the integration results by VF threshold obtained by the FPO module integration process.

出力データ生成部１３８が生成した出力データは、入出力部１１によって情報処理装置１の外部の装置へ出力されてもよいし、記憶部１２に格納されてもよい。記憶部１２に格納された出力データは、入出力部１１によって、外部の装置へ出力可能である。The output data generated by the outputdata generation unit 138 may be output by the input/output unit 11 to a device external to theinformation processing device 1, or may be stored in thememory unit 12. The output data stored in thememory unit 12 can be output by the input/output unit 11 to an external device.

本実施形態によれば、例えばオミックス解析で取得された網羅的分子情報やビッグデータ等の多変量データに対する相関ネットワーク解析の精度向上を図ることができるという効果が得られる。According to this embodiment, it is possible to obtain the effect of improving the accuracy of correlation network analysis of multivariate data such as comprehensive molecular information obtained by omics analysis and big data.

以上、本発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、本発明の要旨を逸脱しない範囲の設計変更等も含まれる。Although an embodiment of the present invention has been described in detail above with reference to the drawings, the specific configuration is not limited to this embodiment and also includes design modifications within the scope that does not deviate from the gist of the present invention.

また、上述した各装置の機能を実現するためのコンピュータプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行するようにしてもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものであってもよい。
また、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。
また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、フラッシュメモリ等の書き込み可能な不揮発性メモリ、ＤＶＤ（Digital Versatile Disc）等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。 In addition, a computer program for implementing the functions of each of the above-mentioned devices may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read into a computer system and executed. Note that the term "computer system" may include hardware such as an OS and peripheral devices.
Furthermore, if the "computer system" utilizes a WWW system, it also includes the home page providing environment (or display environment).
In addition, the term "computer-readable recording medium" refers to a storage device such as a flexible disk, a magneto-optical disk, a ROM, a writable non-volatile memory such as a flash memory, a portable medium such as a DVD (Digital Versatile Disc), or a hard disk built into a computer system.

さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムが送信された場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリ（例えばＤＲＡＭ）のように、一定時間プログラムを保持しているものも含むものとする。
また、上記プログラムは、このプログラムを記憶装置等に格納したコンピュータシステムから、伝送媒体を介して、あるいは、伝送媒体中の伝送波により他のコンピュータシステムに伝送されてもよい。ここで、プログラムを伝送する「伝送媒体」は、インターネット等のネットワーク（通信網）や電話回線等の通信回線（通信線）のように情報を伝送する機能を有する媒体のことをいう。
また、上記プログラムは、前述した機能の一部を実現するためのものであってもよい。さらに、前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるもの、いわゆる差分ファイル（差分プログラム）であってもよい。 Furthermore, the term "computer-readable recording medium" includes devices that retain a program for a certain period of time, such as volatile memory (e.g., DRAM) within a computer system that serves as a server or client when a program is transmitted via a network such as the Internet or a communication line such as a telephone line.
The program may be transmitted from a computer system in which the program is stored in a storage device or the like to another computer system via a transmission medium, or by a transmission wave in the transmission medium. Here, the "transmission medium" that transmits the program refers to a medium that has a function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line.
The program may be a program for implementing some of the above-mentioned functions, or may be a so-called differential file (differential program) that can implement the above-mentioned functions in combination with a program already recorded in the computer system.

１…情報処理装置、１１…入出力部、１２…記憶部、１２１…相関ネットワーク解析プログラム、１２２…相関係数リスト、１２３…モジュール情報、１２４…モジュール統合情報、１２５…最終モジュール統合情報、１２６…評価値情報、１２７…最終モジュール情報、１２８…ＦＰＯ解析済モジュールリスト、１３…制御部、１３１…相関係数算出部、１３２…ＦＰＯ解析部、１３３…ＦＰＯモジュール選定部、１３４…ＦＰＯモジュール統合部、１３５…ＦＮＩ解析部、１３６…ＦＮＩアサイン処理部、１３７…Ｐ値算出部、１３８…出力データ生成部1...information processing device, 11...input/output unit, 12...memory unit, 121...correlation network analysis program, 122...correlation coefficient list, 123...module information, 124...module integration information, 125...final module integration information, 126...evaluation value information, 127...final module information, 128...FPO analyzed module list, 13...control unit, 131...correlation coefficient calculation unit, 132...FPO analysis unit, 133...FPO module selection unit, 134...FPO module integration unit, 135...FNI analysis unit, 136...FNI assignment processing unit, 137...P value calculation unit, 138...output data generation unit

Claims

Translated fromJapanese

相関ネットワーク解析を行う対象の多変量データが有する要素のうちシード要素との相関が上位の非シード要素から前記シード要素のモジュールを構成する際に、前記シード要素との相関が上位の非シード要素の中から偽陽性の非シード要素を取り除くＦＰＯ解析処理を、前記シード要素のモジュールに組み入れる前記シード要素以外の要素の数であるランクの下限値から上限値まで、ランク毎に実行するＦＰＯ解析部と、
要素別に、ランク毎に最良のモジュールＦスコアのモジュールの構成情報を格納するモジュール情報記憶部と、
ランクの下限値から上限値までの各ランクにおける最良のモジュールＦスコアに基づいてＦＰＯ解析結果の最適モジュールを選定するＦＰＯモジュール選定部と、
を備える情報処理装置。 an FPO analysis unit that, when constructing a module of seed elements from non-seed elements having a high correlation with a seed element among elements contained in multivariate data that is the subject of correlation network analysis, executes an FPO analysis process for removing false positive non-seed elements from the non-seed elements having a high correlation with the seed element, for each rank from a lower limit value to an upper limit value of a rank that is the number of elements other than the seed element to be incorporated into the module of the seed element;
A module information storage unit that stores configuration information of modules with the best module F scores for each rank by element;
an FPO module selection unit that selects an optimal module of an FPO analysis result based on the best module F-score in each rank from a lower limit value to an upper limit value of the rank;
An information processing device comprising:

前記ＦＰＯモジュール選定部は、ランクの下限値から上限値までの各ランクにおけるモジュールＦスコアに関して、ランクを大きくしてもモジュールＦスコアの最良値が更新されない期間の最大値を示すランクのモジュールを、ＦＰＯ解析結果の最適モジュールに選定する、
請求項１に記載の情報処理装置。 The FPO module selection unit selects, as an optimal module of the FPO analysis result, a module having a rank that indicates a maximum value of a period during which the best value of the module F score is not updated even if the rank is increased, with respect to the module F score at each rank from the lower limit to the upper limit of the rank.
The information processing device according to claim 1 .

モジュールＦスコアは、モジュール密度とモジュール特異率の調和平均である、
請求項１又は２のいずれか１項に記載の情報処理装置。 The module F-score is the harmonic mean of the module density and the module specificity.
The information processing device according to claim 1 .

ランク毎の最良のモジュールＦスコアのグラフのデータを生成する出力データ生成部、
をさらに備える請求項１又は２のいずれか１項に記載の情報処理装置。 an output data generator for generating data for a graph of the best module F-scores per rank;
The information processing device according to claim 1 , further comprising:

情報処理装置が実行する情報処理方法であって、
相関ネットワーク解析を行う対象の多変量データが有する要素のうちシード要素との相関が上位の非シード要素から前記シード要素のモジュールを構成する際に、前記シード要素との相関が上位の非シード要素の中から偽陽性の非シード要素を取り除くＦＰＯ解析処理を、前記シード要素のモジュールに組み入れる前記シード要素以外の要素の数であるランクの下限値から上限値まで、ランク毎に実行するＦＰＯ解析ステップと、
要素別に、ランク毎に最良のモジュールＦスコア値のモジュールの構成情報を格納するモジュール情報記憶ステップと、
ランクの下限値から上限値までの各ランクにおける最良のモジュールＦスコアに基づいてＦＰＯ解析結果の最適モジュールを選定するＦＰＯモジュール選定ステップと、
を含む情報処理方法。 An information processing method executed by an information processing device,
an FPO analysis step in which, when constructing a module of seed elements from non-seed elements having a high correlation with a seed element among elements contained in multivariate data that is the subject of correlation network analysis, an FPO analysis process is performed for each rank from a lower limit value to an upper limit value of a rank, which is the number of elements other than the seed element to be incorporated into the module of the seed element, to remove false positive non-seed elements from the non-seed elements having a high correlation with the seed element;
A module information storage step for storing configuration information of modules with the best module F-score value for each rank by element;
an FPO module selection step of selecting an optimal module of the FPO analysis result based on the best module F-score at each rank from the lower limit to the upper limit of the rank;
An information processing method comprising:

コンピュータに、
相関ネットワーク解析を行う対象の多変量データが有する要素のうちシード要素との相関が上位の非シード要素から前記シード要素のモジュールを構成する際に、前記シード要素との相関が上位の非シード要素の中から偽陽性の非シード要素を取り除くＦＰＯ解析処理を、前記シード要素のモジュールに組み入れる前記シード要素以外の要素の数であるランクの下限値から上限値まで、ランク毎に実行するＦＰＯ解析ステップと、
要素別に、ランク毎に最良のモジュールＦスコアのモジュールの構成情報を格納するモジュール情報記憶ステップと、
ランクの下限値から上限値までの各ランクにおける最良のモジュールＦスコアに基づいてＦＰＯ解析結果の最適モジュールを選定するＦＰＯモジュール選定ステップと、
を実行させるためのコンピュータプログラム。 On the computer,
an FPO analysis step in which, when constructing a module of seed elements from non-seed elements having a high correlation with a seed element among elements contained in multivariate data that is the subject of correlation network analysis, an FPO analysis process is performed for each rank from a lower limit value to an upper limit value of a rank, which is the number of elements other than the seed element to be incorporated into the module of the seed element, to remove false positive non-seed elements from the non-seed elements having a high correlation with the seed element;
A module information storage step for storing configuration information of the module with the best module F score for each rank by element;
an FPO module selection step of selecting an optimal module of the FPO analysis result based on the best module F-score at each rank from the lower limit to the upper limit of the rank;
A computer program for executing the above.

コンピュータに、
相関ネットワーク解析を行う対象の多変量データが有する要素のうちシード要素との相関が上位の非シード要素から前記シード要素のモジュールを構成する際に、前記シード要素との相関が上位の非シード要素の中から偽陽性の非シード要素を取り除くＦＰＯ解析処理を、前記シード要素のモジュールに組み入れる前記シード要素以外の要素の数であるランクの下限値から上限値まで、ランク毎に実行するＦＰＯ解析ステップと、
要素別に、ランク毎に最良のモジュールＦスコアのモジュールの構成情報を格納するモジュール情報記憶ステップと、
ランクの下限値から上限値までの各ランクにおける最良のモジュールＦスコアに基づいてＦＰＯ解析結果の最適モジュールを選定するＦＰＯモジュール選定ステップと、
を実行させるためのコンピュータプログラムを記録したコンピュータ読み取り可能な記録媒体。 On the computer,
an FPO analysis step in which, when constructing a module of seed elements from non-seed elements having a high correlation with a seed element among elements contained in multivariate data that is the subject of correlation network analysis, an FPO analysis process is performed for each rank from a lower limit value to an upper limit value of a rank, which is the number of elements other than the seed element to be incorporated into the module of the seed element, to remove false positive non-seed elements from the non-seed elements having a high correlation with the seed element;
A module information storage step for storing configuration information of the module with the best module F score for each rank by element;
an FPO module selection step of selecting an optimal module of the FPO analysis result based on the best module F-score at each rank from the lower limit to the upper limit of the rank;
A computer-readable recording medium having a computer program recorded thereon for executing the above.