The effects of incomplete protein interaction data on structural and evolutionary inferences
-
Published:2006-11-03
Issue:1
Volume:4
Page:
-
ISSN:1741-7007
-
Container-title:BMC Biology
-
language:en
-
Short-container-title:BMC Biol
Author:
de Silva Eric,Thorne Thomas,Ingram Piers,Agrafioti Ino,Swire Jonathan,Wiuf Carsten,Stumpf Michael PH
Abstract
Background
Present protein interaction network data sets include only interactions among subsets of the proteins in an organism. This incompleteness has previously been ignored, but in principle any global network analysis that considers only partial data may be biased. Here we demonstrate the need to consider network sampling properties explicitly and from the outset in any analysis.
Results
Here we study how properties of the yeast protein interaction network are affected by random and non-random sampling schemes using a range of different network statistics. Effects are shown to be independent of the inherent noise in protein interaction data. The effects of the incomplete nature of network data become very noticeable, especially for so-called network motifs. We also consider the effect of incomplete network data on functional and evolutionary inferences.
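As a rough illustration of why motif counts can be especially sensitive to incomplete data, the sketch below samples nodes uniformly at random from a toy random graph and compares mean degree and triangle counts before and after sampling. Everything here is an assumption made for illustration: the graph is a synthetic Erdős–Rényi graph rather than the yeast network, and the function names (`random_graph`, `sample_nodes`, `mean_degree`, `count_triangles`) and parameter values are hypothetical, not taken from the paper. Under independent node sampling with probability p, a triangle survives only if all three of its nodes are kept, so the expected triangle count shrinks by a factor of p^3, whereas the expected mean degree shrinks only by about p.

```python
import random
from itertools import combinations


def random_graph(n, p_edge, rng):
    """Toy Erdos-Renyi graph as an adjacency dict (a stand-in, not real interaction data)."""
    adj = {v: set() for v in range(n)}
    for u, v in combinations(range(n), 2):
        if rng.random() < p_edge:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def sample_nodes(adj, p_keep, rng):
    """Keep each node independently with probability p_keep; return the induced subgraph."""
    kept = {v for v in adj if rng.random() < p_keep}
    return {v: adj[v] & kept for v in kept}


def mean_degree(adj):
    """Average number of neighbours per node (0.0 for an empty graph)."""
    return sum(len(nb) for nb in adj.values()) / len(adj) if adj else 0.0


def count_triangles(adj):
    """Count triangles, visiting each one once via the ordering u < v < w."""
    total = 0
    for u in adj:
        for v in adj[u]:
            if v > u:
                total += sum(1 for w in adj[u] & adj[v] if w > v)
    return total


rng = random.Random(0)                # fixed seed so the run is repeatable
full = random_graph(300, 0.05, rng)   # assumed toy parameters
sub = sample_nodes(full, 0.5, rng)    # keep roughly half of the nodes

print("nodes:      ", len(full), "->", len(sub))
print("mean degree:", round(mean_degree(full), 2), "->", round(mean_degree(sub), 2))
print("triangles:  ", count_triangles(full), "->", count_triangles(sub))
```

Non-random schemes could be explored in the same framework by replacing `sample_nodes` with a rule that, for example, favours high-degree nodes; that variant is not shown here.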
Conclusion
Crucially, when only small, partial network data sets are considered, bias is virtually inevitable. Given the scope of effects considered here, previous analyses may have to be carefully reassessed: ignoring the fact that present network data are incomplete will severely affect our ability to understand biological systems.
Publisher
Springer Science and Business Media LLC
Subject
Cell Biology,Developmental Biology,Plant Science,General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,Physiology,Ecology, Evolution, Behavior and Systematics,Structural Biology,Biotechnology
Cited by
56 articles.