YOMEDIA
ADSENSE
Computational reconstruction of metabolic networks from high throughput profiling data
19
lượt xem 2
download
lượt xem 2
download
Download
Vui lòng tải xuống để xem tài liệu đầy đủ
In this paper, we develop a computational method for the metabolic network reconstruction that can uncover not only pairwise interactions but also interactions involving more than two substrates/products such as triple interactions, quartic interactions, etc.
AMBIENT/
Chủ đề:
Bình luận(0) Đăng nhập để gửi bình luận!
Nội dung Text: Computational reconstruction of metabolic networks from high throughput profiling data
Journal of Computer Science and Cybernetics, V.27, N.1 (2011), 23-35<br />
<br />
COMPUTATIONAL RECONSTRUCTION OF METABOLIC NETWORKS<br />
FROM HIGH-THROUGHPUT PROFILING DATA<br />
NGUYEN QUYNH DIEP1 , PHAM THO HOAN1 , HO TU BAO2<br />
TRAN DANG HUNG1 , PHAM QUOC THANG3<br />
1 Hanoi<br />
2<br />
<br />
National University of Education, 136 Xuan-Thuy, Cau-Giay, Hanoi, Vietnam<br />
<br />
Japan Advanced Institute of Science and Technology, 1-1 Asahidai, Nomi, Ishikawa<br />
923-1292, Japan<br />
3<br />
<br />
Tay-Bac University, Son-La city, Vietnam<br />
<br />
´<br />
`<br />
´<br />
’ a<br />
T´m t˘t. Hˆu hˆt c´c phu.o.ng ph´p t´ to´n t´i hiˆn mang sinh hoc hiˆn nay m´.i chı tˆp trung<br />
o<br />
a<br />
a e a<br />
a ınh a a e<br />
e<br />
o<br />
.<br />
.<br />
.<br />
.<br />
.<br />
.o.ng t´c gi˜.a hai phˆn tu., trong khi d´ mang chuyˆn h´a lai bao gˆ m c´c phan u.ng liˆn<br />
’ o .<br />
` a<br />
’ ´<br />
t` c´c tu<br />
ım a<br />
a<br />
u<br />
a ’<br />
o .<br />
e<br />
o<br />
e<br />
`<br />
´<br />
´<br />
´<br />
e u<br />
e<br />
a<br />
ı a<br />
a a<br />
o .<br />
o<br />
quan dˆn t`. 2 dˆn 6 chˆt. V` vˆy m` c´c phu.o.ng ph´p kh´m ph´ mang sinh hoc dang tˆ n tai khˆng<br />
a<br />
a<br />
a .<br />
.<br />
.<br />
’<br />
`<br />
´<br />
’ ´<br />
th´ ho.p dˆ t´i hiˆn c´c phan u.ng sinh h´a c´ nhiˆu ho.n hai chˆt tham gia.<br />
ıch .<br />
e a e a<br />
o o<br />
e<br />
a<br />
.<br />
’<br />
´<br />
B`i b´o n`y gi´.i thiˆu mˆt phu.o.ng ph´p t´ to´n t´i hiˆn mang c´c chˆ t chuyˆn h´a t`. d˜. liˆu<br />
a a a<br />
o<br />
e<br />
o<br />
a ınh a a e<br />
a<br />
a<br />
e o u u e<br />
.<br />
.<br />
.<br />
.<br />
.<br />
.o.ng c´c chˆ t o. c´c diˆu kiˆn ho˘c th`.i diˆm kh´c nhau. Phu.o.ng ph´p khˆng chı<br />
’<br />
`<br />
´<br />
´<br />
’<br />
do nˆ ng dˆ/khˆi lu .<br />
o<br />
o<br />
o<br />
e<br />
e<br />
a<br />
o<br />
e<br />
a<br />
a<br />
a ’ a `<br />
a<br />
o<br />
.<br />
.<br />
.<br />
`<br />
ph´t hiˆn c´c tu.o.ng t´c gi˜.a hai phˆn tu. m` c`n ph´t hiˆn du.o.c c´c tu.o.ng t´c nhiˆu ho.n hai phˆn<br />
a<br />
e a<br />
a<br />
u<br />
a ’ a o<br />
a<br />
e<br />
a<br />
a<br />
e<br />
a<br />
.<br />
.<br />
.<br />
´<br />
´<br />
´<br />
´<br />
’ o a a<br />
e a<br />
u<br />
o ’<br />
tu., d´ l` c´c tu.o.ng t´c ba chˆt, tu.o.ng t´c bˆn chˆt, v.v. Trong phu.o.ng ph´p dˆ xuˆt, ch´ ng tˆi su.<br />
a<br />
a<br />
a o<br />
a<br />
a `<br />
.o.ng t´c da chˆt. Ch´ ng tˆi cung cˆp mˆt<br />
’<br />
´<br />
´<br />
o<br />
o<br />
o a<br />
e o ım a<br />
a<br />
u<br />
o<br />
a<br />
o<br />
dung dˆ do thˆng tin phu thuˆc bˆc ba dˆ d` t` c´c tu<br />
a<br />
.<br />
.<br />
.<br />
.<br />
.<br />
.<br />
c´ch nh` m´.i vˆ dˆ do thˆng tin phu thuˆc bˆc ba m` th´ ho.p trong viˆc ph´t hiˆn c´c tu.o.ng<br />
a<br />
ın o ` o<br />
e .<br />
o<br />
o a<br />
a ıch .<br />
e<br />
a<br />
e a<br />
.<br />
.<br />
.<br />
.<br />
.<br />
´<br />
`<br />
´<br />
’<br />
e<br />
e a<br />
a `<br />
o<br />
t´c nhiˆu ho.n hai biˆn. Hiˆu n˘ng cua phu.o.ng ph´p dˆ xuˆt d˜ du.o.c d´nh gi´ trˆn c´c d˜. liˆu mˆ<br />
a<br />
e<br />
e a a<br />
a e a u e<br />
.<br />
. a<br />
.<br />
’ o<br />
’ o<br />
’<br />
phong c´c hˆ chuyˆn h´a sinh hoc. T´ ch´ x´c cua phu.o.ng ph´p t´i hiˆn lai mang chuyˆn h´a<br />
a e<br />
e<br />
ınh ınh a ’<br />
a a e .<br />
e<br />
.<br />
.<br />
.<br />
.<br />
´<br />
´<br />
´<br />
’ a<br />
’<br />
du.o.c d´nh gi´ o. hai m´.c: c´c tu.o.ng t´c hai chˆt v` c´c tu.o.ng t´c ba chˆt. Kˆt qua t´i hiˆn cua<br />
a<br />
a ’<br />
u<br />
a<br />
a<br />
a a a<br />
a<br />
a<br />
e<br />
e<br />
.<br />
.<br />
.o.ng ph´p dˆ xuˆt l` rˆt triˆn vong.<br />
’<br />
´<br />
´<br />
e a a a<br />
e<br />
Phu<br />
a `<br />
.<br />
Abstract. All computational methods of biological network reconstruction up to now aim only to<br />
find pairwise interactions. While metabolic networks composed mainly of reactions that often consist<br />
of from 2 to 6 substrates/products, the existing computational methods may not be appropriate to<br />
reconstruct interactions of more than two variables like reactions in the metabolic networks.<br />
In this paper, we develop a computational method for the metabolic network reconstruction<br />
that can uncover not only pairwise interactions but also interactions involving more than two substrates/products such as triple interactions, quartic interactions, etc. In the proposed method we<br />
use the ternary mutual information to capture high order interactions. The key idea is to propose a<br />
novel view on the ternary mutual information that can be appropriately used to reconstruct reactions<br />
involving more than two substrates/products. We have applied the proposed method to synthesized<br />
metabolome data; the reconstruction accuracy has been evaluated at the levels of pairwise and triple<br />
interactions. The performance of the method is promising.<br />
Keywords: Mutual information, entropy, biological network reconstruction.<br />
<br />
24<br />
<br />
NGUYEN Q.D., PHAM T.H., HO T.B., TRAN D.H., PHAM Q.T.<br />
<br />
1.<br />
<br />
INTRODUCTION<br />
<br />
Thanks to the advancement of high-throughput technologies, we can now measure simultaneously the concentrations of thousands of molecular species in a biological system, such<br />
as mRNAs [22] and metabolites [18]. These high-throughput data are snapshots of a biological system and are informative to infer what has happened in the system. The analysis of<br />
the high-throughput data to uncover underlying biological mechanisms, e.g. gene regulatory<br />
networks (see [12] for an overview) or metabolic networks [6, 20] is one of the challenges in<br />
systems biology.<br />
Computational reconstruction of gene regulatory networks from transcriptome data has<br />
been deeply investigated by different approaches. These reverse engineering methods fall into<br />
three broad categories: (1) information theory models [24, 5, 19] with a variety of measures of<br />
pairwise mutual information between genes; (2) Bayesian and graphical networks [10, 25] that<br />
maximize a scoring function over some alternative network models to find the best model fitting<br />
the data; (3) differential and difference equations [11, 4] that explain the data by a system<br />
of mathematical equations. All the work on the gene regulatory network reconstruction until<br />
now aims to find only pairwise interactions (concerning with two genes).<br />
Different from gene regulatory networks that mainly concern with pairwise interactions,<br />
metabolic networks are composed mainly of reactions that often consist of from 2 to 6 metabolites (substrates/products). Thus, the metabolic network reconstruction should aim to find<br />
groups of metabolites that each involves in the same reaction. Up to now, there have been<br />
efforts to reconstruct metabolic networks that use methodologies of gene regulatory network<br />
reconstruction [6, 20]. As a consequence, they can only detect pairwise interactions but not<br />
interactions of more than two metabolites.<br />
In this work, we develop a computational method net-reconstruct for the metabolic network reconstruction that can uncover not only pairwise interactions but also interactions<br />
involving more than two substrates/products, for example, triple interactions, quartic interactions, etc. In this method we use the interaction mutual information [9] to capture multiple<br />
interactions. The key idea is to propose a novel view on the interaction mutual information that<br />
can be appropriately use to reconstruct reactions involving more than two substrates/products.<br />
When applying on the synthetic perturbation data of full-random networks (all structures,<br />
kinetic laws and parameter values are randomly generated, [2]) as well as of a semi-random<br />
networks, the human red blood cell metabolism ([14, 20]), our method gave promising results<br />
of interaction subsets that are close to the validated metabolic reactions. The interaction<br />
subsets with highest mutual information found from our method often correspond to metabolic<br />
reactions in the original networks, also many original reactions have been found in the results<br />
of our software. When evaluating accuracy at the level of pairwise interactions, the results of<br />
our method agreed with those of recent research on reconstruction methods.<br />
2.<br />
2.1.<br />
<br />
METHODS<br />
<br />
Mutual information between two variables<br />
<br />
Mutual information measure is more general than Pearson’s correlation coefficient (P P C )<br />
to capture dependency between two variables. While P P C accounts only for linear or monotonic relationships, the mutual information takes into account all types of dependence. Given<br />
<br />
RECONSTRUCTION OF METABOLIC NETWORKS FROM HIGH-THROUGHPUT PROFILING DATA<br />
H(X)<br />
<br />
25<br />
<br />
H(Y)<br />
<br />
H(X|Y)<br />
<br />
H(Y|X)<br />
<br />
MI(2)(X,Y) = H(X) - H(X|Y)<br />
= H(Y) - H(Y|X)<br />
<br />
Figure 2.1. The Venn diagram for mutual information M I (2) of two variables<br />
<br />
two random variables X and Y with the joint density function fX,Y and marginal density<br />
functions fX , fY , the mutual information M I (2) of two variables X and Y [8] is defined as<br />
follows:<br />
fX,Y (x, y)<br />
M I (2)(X, Y ) =<br />
fX,Y (x, y) log<br />
dxdy<br />
(2.1)<br />
fX (x)fY (y)<br />
(we use the superscript number 2 to emphasize that the mutual information here is for 2<br />
variables)<br />
If X and Y are independent, the mutual information M I (2)(X, Y ) = 0; if they are perfectly<br />
dependent, M I (2)(X, Y ) approaches infinity.<br />
The mutual information M I (2)(X, Y ) can also be interpreted in terms of information<br />
entropy [8] as<br />
M I (2)(X, Y ) = H(X) + H(Y ) − H(X, Y )<br />
= H(X) − H(X|Y )<br />
= H(Y ) − H(Y |X)<br />
<br />
(2.2)<br />
(2.3)<br />
(2.4)<br />
<br />
From Eq. 2.3 and Eq. 2.4 we can interpret the meaning of M I (2)(X, Y ) as it measures<br />
the reduction the uncertainty of X due to the knowledge of Y , or vice versa [3]. The above<br />
interpretation of Shannon entropy can be visualized by the Venn diagram in Figure 2.1, where<br />
M I (2)(X, Y ) is the intersection of two entropy circles H(X) and H(Y ), and H(X, Y ) is the<br />
union of two sets H(X) and H(Y ) [3, 13].<br />
2.2.<br />
<br />
Mutual information for more than two variables<br />
<br />
The mutual information M I (2) can detect interactions (edges) between two variables in<br />
a network. However, in most biological networks, each node (variable) may interact (link)<br />
with some others in the same or different mechanisms. Metabolic networks are an example of<br />
such networks, where each metabolite may interact with some others in different reactions. In<br />
this section, we present an extension of M I (2) that allows capturing the interactions of three<br />
variables.<br />
The generalization of mutual information of three variables from that of two variables is<br />
not trivial [3, 13]. One of those generations is i nteraction mutual information [9] that has<br />
received much attention but with controversial interpretations, defined as follows:<br />
M I (3)(X, Y, Z) = H(X) + H(Y ) + H(Z) − H(X, Y )<br />
−H(Y, Z) − H(X, Z) + H(X, Y, Z)<br />
= MI<br />
<br />
(2)<br />
<br />
(X, Y ) − M I<br />
<br />
(2)<br />
<br />
(X, Y |Z)<br />
<br />
(2.5)<br />
(2.6)<br />
<br />
26<br />
<br />
NGUYEN Q.D., PHAM T.H., HO T.B., TRAN D.H., PHAM Q.T.<br />
<br />
(a)<br />
<br />
(b)<br />
H(X)<br />
<br />
H(X)<br />
<br />
H(Z)<br />
<br />
H(Y)<br />
<br />
H(Y)<br />
H(Z)<br />
MI(3)(X,Y,Z)>0<br />
<br />
(3)<br />
<br />
MI (X,Y,Z)
ADSENSE
CÓ THỂ BẠN MUỐN DOWNLOAD
Thêm tài liệu vào bộ sưu tập có sẵn:
Báo xấu
LAVA
AANETWORK
TRỢ GIÚP
HỖ TRỢ KHÁCH HÀNG
Chịu trách nhiệm nội dung:
Nguyễn Công Hà - Giám đốc Công ty TNHH TÀI LIỆU TRỰC TUYẾN VI NA
LIÊN HỆ
Địa chỉ: P402, 54A Nơ Trang Long, Phường 14, Q.Bình Thạnh, TP.HCM
Hotline: 093 303 0098
Email: support@tailieu.vn