Table 1

LM-based and LMMA-based angiogenesis network structures (Thp = 0.150)

	LM-EC	LMMA-EC	LM-ST	LMMA-ST
Common nodes^a	1257	1031	1258	1162
Connections^a	6761	2848	6884	3935
Average path length^a	2.9810	3.6101	2.9741	3.3487
Average degree^b	5.3738	2.2777	5.4722	3.1375
SSE^c	522.3206	380.1941	520.2295	479.0745
SS_mse^c	0.0669	0.057	0.0614	0.0589
Microarray size	1257*53	1257*53	1258*119	1258*119

^aIn the largest connected sub-network.

^bIn the whole network.

^cAll nodes except for the isolated ones.

Table 1 lists the network parameters for the LM- and LMMA-based angiogenesis networks. It shows that redundant connections are eliminated after multivariate selection: the LMMA-EC and LMMA-ST networks have far fewer connections than the predominant sub-networks of LM-EC and LM-ST, respectively (2848 versus 6761 and 3935 versus 6884). Eliminating these connections sharply reduces the average degree of the genes, with a slight reduction in node number and a modest increase in average path length. Moreover, as shown in Figure 1a and b, compared with the LM-random filtering networks derived from the permutation test, the LMMA-EC network has not only a significantly larger cluster size (P < 0.0001, Kolmogorov–Smirnov test) but also a shorter path length in the largest cluster (P < 0.001, t-test). These results suggest that LMMA is more stable and integrative than LM-random filtering. Similar performance is observed with the LMMA-ST network (Supplementary Fig. S1). Thus, LMMA seems to maintain the backbone of the LM-based angiogenesis network.

Fig. 1

(a) Comparison of the cluster sizes between the LMMA-EC network and the LM-EC random filtering networks (P < 0.0001, Kolmogorov–Smirnov test). The other clusters have <10 nodes (data not shown). (b) Comparison of the normalized average path length in the largest cluster between the LMMA-EC and the LM-EC random filtering networks (P < 0.001, t-test). (c) Relationship between the number of nodes and the degree of nodes in the whole LM angiogenesis network, LMMA-EC network and LMMA-ST network (Thp = 0.150). The degree distributions of the three networks follow a power law, consistent with a scale-free topology.

Figure 1c shows the relationship between the number of nodes and the degree of nodes in both LM- and LMMA-based angiogenesis networks. The profiles follow a power-law distribution, indicating that both networks are scale-free (Jeong et al., 2000; Song et al., 2005). Recent studies (Han et al., 2004; Ozier et al., 2003) show that centrally located, highly connected hub nodes dominate the operation of a scale-free network.
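
As a rough illustration of how a power-law degree profile can be inspected, the sketch below counts node degrees in an undirected edge list and fits a straight line to the log-log degree distribution. It is only a crude diagnostic under stated assumptions, not the procedure used in this study: the helper names `degree_counts` and `loglog_slope` and the toy edge list are hypothetical.

```python
import math
from collections import Counter


def degree_counts(edges):
    """Return {degree: number of nodes with that degree} for an undirected edge list."""
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return Counter(degree.values())


def loglog_slope(counts):
    """Least-squares slope of log(node count) against log(degree).

    Assumes at least two distinct degrees are present.
    """
    xs = [math.log(k) for k in counts]
    ys = [math.log(n) for n in counts.values()]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Toy hub-centred network (illustrative edges only, not data from the paper).
toy_edges = [("VEGF", g) for g in ("KDR", "TNF", "IL8", "EGF")] + [("TNF", "IL6")]
toy_counts = degree_counts(toy_edges)
toy_slope = loglog_slope(toy_counts)
print(dict(toy_counts), toy_slope)
```

A clearly negative slope over a wide range of degrees is consistent with a power law, although a straight line on a log-log plot is not by itself proof of scale-free structure.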

3.2 Comparison of LM- and LMMA-based angiogenesis networks

The top 15 hub genes in the LM-based and LMMA-based angiogenesis networks are listed in Table 2. Vascular endothelial growth factor (VEGF) is identified in both LM and LMMA networks as the hub gene with the highest degree. VEGF is known to be a multi-functional cytokine that plays an important role in vasculogenesis (Mukhopadhyay and Datta, 2004). The activation of endothelial cells by VEGF sets in motion a series of steps towards the creation of new blood vessels (Folkman, 1995).

Table 2

The top 15 hub genes identified in LM-based and LMMA-based angiogenesis networks (Thp = 0.150)

Gene	Degree (LM-EC; LM-ST)^a	Degree (LMMA-EC)	Degree (LMMA-ST)	P-value^b (EC)	P-value^b (ST)
VEGF	554	51	117	0	0
NUDT6	211	51	25	0	0
KDR	182	51	117	0	3.59e−06
SIAT7B	156	51	44	1.19e−07	0
TNF	149	51	46	0	0
IL8	148	26	27	0	0
MVD	126	19	28	0	0
CD34	111	51	22	1.19e−07	0
EGF	104	32	40	1.35e−13	0
IL6	97	31	24	0	0
CDH17	96	30	27	0	0
HIF1A	93	21	38	1.65e−12	0
SOS1	87	14	25	1.54e−11	0
CCM1	83	51	14	6.92e−06	0
PSME3	78	18	34	0	0

^a The degrees of these hub genes are the same in the LM-EC and LM-ST networks.

^b P-values are calculated from the F-test for the unit-network of each gene (Equation 3).

Table 2 lists the P-values, derived from the F-test, for the unit-networks of the 15 hub genes in the LMMA-EC and LMMA-ST networks. These results suggest that the LMMA-based angiogenesis network is more reliable than the LM-based one. Figure 2 illustrates the LOOCV gene expression values for the unit-networks of VEGF, EGF, TNF and IL6. The MSE values of the LMMA unit-networks are smaller than those of the LM unit-networks, indicating that the LMMA network fits the angiogenesis microarray data better. Table 1 also lists the SSE and SS_mse scores of the LM- and LMMA-based networks; the reduced errors in LMMA again suggest that the LMMA-based networks are an improvement.

Fig. 2

Gene expression values derived from leave-one-out cross-validation (LOOCV) for the four hub genes VEGF, EGF, TNF and IL6 in both the LM-EC and LMMA-EC networks. All 53 experiments in the EC microarray dataset are tested.
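
The LOOCV error used in this comparison can be sketched as follows. This is a minimal illustration that assumes a one-predictor least-squares model for brevity, whereas the multivariate analysis described here involves several genes at once; `fit_line` and `loocv_mse` are hypothetical helper names and the expression values are toy numbers, not data from the study.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept.

    Assumes xs contains at least two distinct values.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def loocv_mse(xs, ys):
    """Mean squared prediction error when each experiment is held out in turn."""
    squared_errors = []
    for i in range(len(xs)):
        # Refit the model without experiment i, then predict experiment i.
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        prediction = slope * xs[i] + intercept
        squared_errors.append((ys[i] - prediction) ** 2)
    return sum(squared_errors) / len(squared_errors)


# Toy expression values for a predictor gene and a hub gene (illustrative only).
predictor = [1.0, 2.0, 3.0, 4.0, 5.0]
hub_gene = [1.1, 1.9, 3.2, 3.9, 5.1]
print(loocv_mse(predictor, hub_gene))
```

A smaller LOOCV MSE for one set of predictors than for another indicates that the first set generalizes better to held-out experiments, which is the sense in which the LMMA unit-networks are compared with the LM ones.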

Figure 3a and b show the precision and recall of the LM- and LMMA-based angiogenesis networks at different thresholds Thp. The LMMA-based networks exhibit higher precision and lower recall than the LM-based one. The recall of the LMMA-based networks, however, increases gradually as the threshold increases. We select Thp = 0.150 as a suitable threshold for the LMMA-based EC and ST networks.

Fig. 3

Comparison of (a) precision and (b) recall in LM, LMMA-EC and LMMA-ST angiogenesis networks at different thresholds. Here LM represents LM-EC and LM-ST, since genes in LM-EC and LM-ST are identical when mapping to KEGG. The X axis denotes the P-value thresholds calculated from the F-test in the step of statistical multivariate selection. Both the precision and the recall rates are calculated against KEGG.
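
For readers who want to reproduce this kind of benchmark comparison, a minimal sketch is given below. It assumes the usual definitions, precision = TP/(TP + FP) and recall = TP/(TP + FN), and treats gene relations as undirected pairs; the function names and the toy relations are hypothetical, and the exact mapping of genes to KEGG used in the study is not reproduced.

```python
def normalize(relations):
    """Treat gene pairs as undirected: (a, b) and (b, a) are the same relation."""
    return {frozenset(pair) for pair in relations}


def precision_recall(predicted, benchmark):
    """Precision and recall of predicted relations against a benchmark set."""
    predicted = normalize(predicted)
    benchmark = normalize(benchmark)
    tp = len(predicted & benchmark)   # predicted and in the benchmark
    fp = len(predicted - benchmark)   # predicted but not in the benchmark
    fn = len(benchmark - predicted)   # in the benchmark but not predicted
    precision = tp / (tp + fp) if predicted else 0.0
    recall = tp / (tp + fn) if benchmark else 0.0
    return precision, recall


# Toy relations (illustrative only, not taken from KEGG or from the networks).
toy_predicted = [("A", "B"), ("B", "C"), ("C", "D")]
toy_benchmark = [("B", "A"), ("C", "D"), ("D", "E"), ("E", "F")]
print(precision_recall(toy_predicted, toy_benchmark))
```

Under the same definition of precision, the KG counts reported in Table 3 at Thp = 0.150 give about 0.18 for LM (237/1285) and about 0.22 for LMMA-EC (98/447).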

Both the LM-EC and LM-ST networks contain the same 474 genes, corresponding to 355 KO entities, that are covered by the KEGG database. When the LM-based network is refined by LMMA, the proportion of TP relations increases significantly, while the proportion of FP relations decreases correspondingly. Table 3 shows the statistical comparison between the LM- and LMMA-based angiogenesis networks, which demonstrates that the LMMA approach significantly eliminates false positive relations.

Table 3

The true positive (TP), false positive (FP) and the statistical P-values of the TP/FP ratio (by Fisher Exact Test) between LM and LMMA networks^a

KEGG	Network	Thp	0.025	0.050	0.075	0.100	0.125	0.150	0.175	0.200
KG	LM	TP	237	237	237	237	237	237	237	237
	LM	FP	1048	1048	1048	1048	1048	1048	1048	1048
	LMMA-EC	TP	39	49	56	76	83	98	108	111
	LMMA-EC	FP	121	175	241	267	303	349	417	458
	TP/FP (LMMA-EC versus LM)	P-value	0.017004	0.034679	0.064785	0.01837	0.02372	0.01526	0.03012	0.04408
	LMMA-ST	TP	71	83	93	101	111	130	137	135
	LMMA-ST	FP	223	300	350	392	436	471	499	513
	TP/FP (LMMA-ST versus LM)	P-value	0.0056928	0.021672	0.02762	0.032833	0.033552	0.013196	0.013274	0.021953
KO	LM	TP	139	139	139	139	139	139	139	139
	LM	FP	170	170	170	170	170	170	170	170
	LMMA-EC	TP	29	33	37	52	57	70	74	77
	LMMA-EC	FP	19	34	40	44	46	55	60	63
	TP/FP (LMMA-EC versus LM)	P-value	0.017279	0.087676	0.090294	0.027146	0.017372	0.0097982	0.011667	0.011803
	LMMA-ST	TP	43	54	62	67	73	86	90	87
	LMMA-ST	FP	38	46	52	64	69	80	82	87
	TP/FP (LMMA-ST versus LM)	P-value	0.042857	0.026912	0.020102	0.041336	0.036214	0.028095	0.023083	0.04316

^aHere LM represents LM-EC and LM-ST since genes in LM-EC and LM-ST are identical when mapping to KEGG database. KG = KEGG Gene; KO = KEGG Orthology.
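
A Fisher exact test on TP/FP counts such as those in Table 3 can be sketched in a few lines. The version below is an illustration under assumptions: it takes the 2x2 table to be [[TP, FP] of one network, [TP, FP] of the other] and computes a two-sided P-value, whereas the text does not state the table layout or whether a one- or two-sided test was used, so the result is not expected to reproduce the tabulated P-values exactly. The function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact P-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum over every table that is no more likely than the observed one.
    return sum(
        p
        for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= observed * (1 + 1e-9)
    )


# KG counts from Table 3 at Thp = 0.150: LMMA-EC (TP 98, FP 349) versus LM (TP 237, FP 1048).
p_value = fisher_exact_two_sided(98, 349, 237, 1048)
print(p_value)
```

Exact integer binomial coefficients are used so that the large counts in the table do not cause floating-point overflow.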

3.3 Pathway extraction from networks

The statistical significance of pathways in the LMMA-based angiogenesis networks is derived from the Fisher Exact Test. The results are shown in Table 4 and illustrated by an example, the EGF (epidermal growth factor) unit-network, in Figure 4 (see the Discussion for details). Although many co-occurrence relations are eliminated from the LM-based network, the main pathway information, such as the focal adhesion pathway and the TGF-beta, MAPK, calcium and Wnt signaling pathways, is still observed in the LMMA-based networks with significant P-values. Thus, pathways in the LMMA-based networks are significantly enriched.

Fig. 4

An EGF (epidermal growth factor) unit-network derived from the co-occurrence literature mining and from the LMMA approach, respectively. A total of 21 genes co-cited with EGF in LM are removed by LMMA. Manually revisiting the PubMed records shows that these 21 genes are in false relations with EGF, resulting from homonymic mis-matches and confused lexical orders (in the blue pane), unknown relations (in the purple pane) and isolated relations (in the yellow pane). The Neato program in the Graphviz software (AT&T) is used to visualize the constructed network.

Table 4

KEGG pathways with significant P-values in LMMA-based angiogenesis networks (Thp = 0.150)^a

	LMMA-EC (KG)	LMMA-EC (KO)	LMMA-ST (KG)	LMMA-ST (KO)
Focal adhesion pathway	0.00087	1.09e−07	0.00084	2.84e−08
MAPK signaling pathway	0.02825	0.013338	0.015779	0.00910
Adherens junction	2.14e−22	1.31e−13	3.08e−24	2.19e−14
TGF-beta signaling pathway	0.00010	0.00540	8.76e−06	0.00585
Insulin signaling pathway	1.27e−06	0.00264	1.33e−07	0.00225
Calcium signaling pathway	0.00011	0.00373	1.86e−08	6.66e−05
Wnt signaling pathway	—	0.03010	—	0.00548
Regulation of actin cytoskeleton	0.01100	—	0.00020	—
Cytokine-cytokine receptor interaction	5.13e−09	—	9.52e−16	—
Apoptosis	0.00127	—	0.03001	—
Cell cycle	—	0.04594	—	0.02220

	LMMA-EC (KG)	LMMA-EC (KO)	LMMA-ST (KG)	LMMA-ST (KO)
Focal adhesion pathway	0.00087	1.09e − 07	0.00084	2.84e − 08
MAPK signaling pathway	0.02825	0.013338	0.015779	0.00910
Adherens junction	2.14e − 22	1.31e − 13	3.08e − 24	2.19e − 14
TGF-beta signaling pathway	0.00010	0.00540	8.76e − 06	0.00585
Insulin signaling pathway	1.27e − 06	0.00264	1.33e − 07	0.00225
Calcium signaling pathway	0.00011	0.00373	1.86e − 08	6.66e − 05
Wnt signaling pathway	—	0.03010	—	0.00548
Regulation of actin cytoskeleton	0.01100	—	0.00020	—
Cytokine-cytokine receptor interaction	5.13E-09	—	9.52E-16	—
Apoptosis	0.00127	—	0.03001	—
Cell cycle	—	0.04594	—	0.02220

^a P-values are calculated from Fisher Exact Test. KG = KEGG Gene. KO = KEGG Orthology.
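
Pathway enrichment of this kind is commonly scored with the one-sided form of the Fisher exact test, which is the upper tail of a hypergeometric distribution. The sketch below assumes that formulation and a user-supplied background size; the text does not specify how the background gene set was defined, so both the function name and the toy gene sets are hypothetical.

```python
from math import comb


def enrichment_p_value(network_genes, pathway_genes, background_size):
    """P(X >= k): chance of at least k pathway genes among the network genes.

    Assumes both gene sets are drawn from a background of `background_size` genes.
    """
    n = len(network_genes)
    pathway_size = len(pathway_genes)
    k = len(network_genes & pathway_genes)
    denom = comb(background_size, n)
    upper = min(n, pathway_size)
    # Sum the hypergeometric probabilities for overlaps of size k, k + 1, ..., upper.
    tail = sum(
        comb(pathway_size, x) * comb(background_size - pathway_size, n - x)
        for x in range(k, upper + 1)
    )
    return tail / denom


# Toy example: 2 network genes, both in a 3-gene pathway, background of 10 genes.
toy_network = {"EGF", "SOS1"}
toy_pathway = {"EGF", "SOS1", "KDR"}
print(enrichment_p_value(toy_network, toy_pathway, 10))
```

A small P-value means the overlap between the network and the pathway is larger than expected by chance for sets of these sizes.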

4 DISCUSSION AND CONCLUSION

A high false positive rate is a well-known problem in most high-throughput methods for detecting molecular interactions (von Mering et al., 2002). In this work, we developed an LMMA approach to construct networks based on both existing knowledge (literature) and experimental information (microarray). This approach uses multivariate analysis to refine the literature-derived holistic network with subject-oriented gene expression profiles. To analyze the hidden network buried in microarray datasets, it is necessary to construct the LM-based network beforehand, for two reasons. First, it is not advisable to construct the network directly from thousands of candidate variables when no prior knowledge about the network is available. Second, the number of variables should not exceed the number of observations (i.e. microarray experiments); otherwise the model can fit the data spuriously well. Thus, a certain number of arrays is required in LMMA for multivariate selection.

As an application, PubMed literature and microarray datasets from both EC and ST are used to reconstruct the LMMA networks for angiogenesis. Compared with LM-random filtering, the LMMA approach yields a larger cluster size and a smaller average path length, while preserving topological properties similar to those of the LM-based network. This indicates that LMMA can eliminate redundant relations while maintaining the backbone of the LM-based network.

The angiogenesis networks constructed by LM and LMMA are tested for accuracy on confident sets of interactions. Both precision and recall are calculated against KEGG, a commonly used benchmark. We show that LMMA significantly improves precision compared with LM alone. On the other hand, as Bork et al. (2004) reported, the choice of benchmark set is still a knotty problem because the agreement among different benchmark sets is surprisingly poor. For example, less than half of all pairs in the KEGG benchmark set are present in the Gene Ontology biological process benchmark set (Bork et al., 2004). Moreover, co-occurrence in the literature often describes or reflects more general relationships between genes. Some of these may be implicit and/or so novel that they have not yet reached the status of common knowledge or accepted fact often required for inclusion in databases such as KEGG. These two aspects may explain why both the LM and LMMA approaches result in a low recall (Fig. 3) when calculated against KEGG. Even so, we show that integration with microarray data can significantly increase the reliability of gene co-occurrence networks extracted from the literature.

To demonstrate how LMMA reduces the false positive rate and improves precision, we take EGF (epidermal growth factor), a key player in angiogenesis, as an example. As shown in Figure 4, LMMA removes a total of 21 falsely related genes from the LM-based EGF unit-network. First, LMMA deletes five mis-matched genes in LM: SC, SF, AA and PC are abbreviations of stem cells, scatter factor, arachidonic acid or anaplastic astrocytoma, and prostate cancer, respectively; IL8RA (interleukin 8 receptor, alpha) is misinterpreted from IL8 and EGF receptor appearing in lexical order. Second, LMMA removes eight genes with unknown relations (few co-citations) to EGF in LM: CCR6, FGF16, MAP3K8 and EGF are co-cited in only one PubMed sentence, which reports a gene expression experiment (Gerritsen et al., 2003); the same holds for IL11, IL10, IL3, IL4 and CCR2. Third, LMMA removes eight genes that seldom co-occur with EGF even when their aliases are used: NRG2, Scube1, NPY6R, ZNF78L2, IFI44, RNU106, AXPC1 and ANGPTL6. Thus, our results indicate that common errors that lead to false relations in LM can be effectively removed by the LMMA approach.

Moreover, 11 KEGG pathways are statistically significant in the LMMA-based angiogenesis networks. See Table 4 for the P-value of each pathway calculated by the Fisher Exact Test. Among them, the focal adhesion pathway, the adherens junction pathway and the regulation of actin cytoskeleton pathway contribute to complex processes such as endothelial cell migration, morphogenesis and angiogenesis (Bix et al., 2004). TGF-beta regulates angiogenesis by affecting proliferation, differentiation and migration of endothelial cells (Lomnytska et al., 2004). The insulin signaling pathway is implicated in cellular mitogenesis, angiogenesis, tumor cell survival and tumorigenesis (Cohen et al., 2005). Many Wnt proteins act through a canonical, beta-catenin signaling pathway (Masckauchan et al., 2005) and are able to control diverse biological processes, such as cell differentiation, proliferation (Masckauchan et al., 2005) and vasculature (Goodwin and D'Amore, 2002). Among the intracellular kinases implicated in angiogenesis, p38 MAPK has been shown to transduce signals critical for vascular remodeling and maturation (Zhu et al., 2003). Ca(2+) signaling is involved in virtually all cellular processes (Munaron et al., 2004). In addition, a variety of stimulatory cytokines, such as tumor necrosis factor (TNF)-alpha, interleukin (IL)-1, -6 and interferon (IFN)-gamma, and growth factors can promote the development of functional and structural vascular changes (Kofler et al., 2005). Therefore, the pathway information in the LMMA-based angiogenesis networks suggests that multiple pathway interactions boost the activity of either EC or ST, which is in accordance with recent reports (Mukhopadhyay and Datta, 2004; McCarty, 2004). Since multiple pathways are dysfunctional in angiogenesis-related disorders such as cancers, a multifocal signal modulation therapy has recently been proposed (McCarty, 2004). The LMMA network should therefore be helpful for analyzing the interactions of multiple pathways in such complex biological processes.

As for usability, LMMA can be applied to any biological topic for which related literature and microarray data are available. Note that to construct an LMMA network, the number of candidate variables (genes) should be kept to a proper size, and the accuracy of the LMMA approach increases with the number of candidate variables within a certain range. The LMMA-based angiogenesis network summarizes a large amount of angiogenesis-related literature and high-throughput microarray data. The LMMA approach enables researchers not only to keep up to date with the relevant literature on specialized biological topics, but also to make sense of the relevant large-scale microarray datasets. It also serves as a useful tool for constructing specific biological networks and for experimental design. Thus, LMMA provides a valuable computer representation of the known angiogenesis-related pathways, as well as of the interactions among multiple pathways. Such a representation will enable a systemic understanding of angiogenesis in the context of complex gene interactions, which is also helpful for studying the regulation of various complex biological, physiological and pathological systems. In the ‘omics’ field, the LMMA approach can be further explored to study protein–protein and other interactions.

The authors would like to express their great appreciation to B. Li (Boston University, USA), X. G. Zhang and C. Zhang in their lab for helpful discussions and comments. The authors would like to acknowledge the financial support from FANEDD (No. 200366), the Key Project of Chinese MOE (No. 104009) and the Basic Research Foundation of TNList.

Conflict of Interest: none declared.

REFERENCES

Barabasi A.L., Oltvai Z.N. Network biology: understanding the cell's functional organization, Nat. Rev. Genet., 2004 (pg. 101-113)

Bix et al. Endorepellin causes endothelial cell disassembly of actin cytoskeleton and focal adhesions through alpha2beta1 integrin, J. Cell Biol., 2004, vol. 166 (pg. 109)

Bork et al. Protein interaction networks from yeast to human, Curr. Opin. Struct. Biol., 2004 (pg. 292-299)

Carmeliet. Angiogenesis in health and disease, Nat. Med., 2003 (pg. 653-660)

Cary M.P. et al. Pathway information for systems biology, FEBS Lett., 2005, vol. 579 (pg. 1815-1820)

Cohen B.D. et al. Combination therapy enhances the inhibition of tumor growth with the fully human anti-type 1 insulin-like growth factor receptor monoclonal antibody CP-751,871, Clin. Cancer Res., 2005 (pg. 2063-2073)

Dennis et al. DAVID: Database for Annotation, Visualization, and Integrated Discovery, Genome Biol., 2003, pg. R60

D'haeseleer et al. Linear modeling of mRNA expression levels during CNS development and injury, Pac. Symp. Biocomput., 1999

D'haeseleer et al. Genetic network inference: from co-expression clustering to reverse engineering, Bioinformatics, 2000 (pg. 707-726)

de Jong. Modeling and simulation of genetic regulatory systems: a literature review, J. Comput. Biol., 2002 (pg. 103)

Ding et al. Mining Medline: abstracts, sentences, or phrases?, Pac. Symp. Biocomput., 2002 (pg. 326-337)

Folkman. Angiogenesis in cancer, vascular, rheumatoid and other diseases, Nat. Med., 1995

et al. Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae, Nat. Genet., 2001 (pg. 482-486)

Gerritsen M.E. et al. Using gene expression profiling to identify the molecular basis of the synergistic actions of hepatocyte growth factor and vascular endothelial growth factor in human endothelial cells, Br. J. Pharmacol., 2003, vol. 140 (pg. 595-610)

Goodwin A.M., D'Amore P.A. Wnt signaling in the vasculature, Angiogenesis, 2002

Han J.D. et al. Evidence for dynamically organized modularity in the yeast protein–protein interaction network, Nature, 2004, vol. 430

Jenssen T.K. et al. A literature network of human genes for high-throughput analysis of gene expression, Nat. Genet., 2001

Jeong et al. The large-scale organization of metabolic networks, Nature, 2000, vol. 407 (pg. 651-654)

Kanehisa, Goto. KEGG: Kyoto encyclopedia of genes and genomes, Nucl. Acids. Res., 2000

Kofler et al. Role of cytokines in cardiovascular diseases: a focus on endothelial responses to inflammation, Clin. Sci (Lond)., 2005, vol. 108 (pg. 205-213)

Küffner et al. Expert knowledge without the expert: integrated analysis of gene expression and literature to derive active functional contexts, Bioinformatics, 2005 (pg. ii259-ii267)

Lachenbruch P.A., Mickey M.R. Estimation of error rates in discriminant analysis, Technometrics, 1968

Le Phillip et al. Using prior knowledge to improve genetic network reconstruction from microarray data, In Silico Biol., 2004 (pg. 335-353)

Liang et al. Reveal a general reverse engineering algorithm for inference of genetic network architectures, Pac. Symp. Biocomput., 1998

Lomnytska et al. Transforming growth factor-beta1-regulated proteins in human endothelial cells identified by two-dimensional gel electrophoresis and mass spectrometry, Proteomics, 2004 (pg. 995-1006)

Mao et al. Automated genome annotation and pathway identification using the KEGG Orthology (KO) as a controlled vocabulary, Bioinformatics, 2005 (pg. 3787-3793)

Masckauchan T.N. et al. Wnt/beta-catenin signaling induces proliferation, survival and interleukin-8 in human endothelial cells, Angiogenesis, 2005

McCarty M.F. Targeting multiple signaling pathways as a strategy for managing prostate cancer: multifocal signal modulation therapy, Integr. Cancer Ther., 2004 (pg. 349-380)

Mukhopadhyay, Datta. Multiple regulatory pathways of vascular permeability factor/vascular endothelial growth factor (VPF/VEGF) expression in tumors, Semin. Cancer Biol., 2004 (pg. 123-130)

Munaron et al. Blocking Ca²⁺ entry: a way to control cell proliferation, Curr. Med. Chem., 2004 (pg. 1533-1543)

Ozier et al. Global architecture of genetic interactions on the protein network, Nat. Biotechnol., 2003 (pg. 490-491)

Segal M.R. et al. Regression approaches for microarray data analysis, J. Comput. Biol., 2003 (pg. 961-980)

Shatkay, Feldman. Mining the biomedical literature in the genomic era: an overview, J. Comput. Biol., 2003 (pg. 821-855)

Sherlock et al. The Stanford Microarray Database, Nucleic Acids. Res., 2001 (pg. 152-155)

Song et al. Self-similarity of complex networks, Nature, 2005, vol. 433 (pg. 392-395)

Stapley B.J., Benoit. Information retrieval and visualization from co-occurrences of gene names in Medline abstracts, Pac. Symp. Biocomput., 2000 (pg. 529-540)

Troyanskaya et al. Missing value estimation methods for DNA microarrays, Bioinformatics, 2001 (pg. 520-525)

van Someren E.P. et al. Genetic network modeling, Pharmacogenomics, 2002 (pg. 507-525)

von Mering et al. Comparative assessment of large scale data sets of protein–protein interactions, Nature, 2002, vol. 417 (pg. 399-403)

West et al. Predicting the clinical status of human breast cancer using gene expression profiles, Proc. Natl Acad. Sci. USA, 2001 (pg. 11462-11467)

L.J. Combined literature mining and gene expression analysis for modeling neuro-endocrine-immune interactions, Lect. Notes Comput. Sci., 2005, vol. 3645

Zhang. Modeling of neuro-endocrine-immune network via subject oriented literature mining, Proc. BGRS, 2004 (pg. 167-170)

Zhu et al. A probabilistic model for mining implicit ‘chemical compound-gene’ relations from literature, Bioinformatics, 2005 (pg. ii245-ii251)

Zhu W.H. et al. Requisite role of p38 MAPK in mural cell recruitment during angiogenesis in the rat aorta model, J. Vasc. Res., 2003 (pg. 140-148)

Author notes

Associate Editor: Alfonso Valencia
