The interdisciplinary nature of bioinformatics presents a research challenge in integrating concepts, methods, software and multiplatform data. Although there have been rapid developments in new technology and an inundation of statistical methods for addressing other types of high-throughput data, such as proteomic profiles that arise from mass spectrometry experiments. Bayesian Inference for Gene Expression and Proteomics discusses the development and application of Bayesian methods in the analysis of high-throughput bioinformatics data that arise from medical, in particular, cancer research, as well as molecular and structural biology. The Bayesian approach has the advantage that evidence can be easily and flexibly incorporated into statistical methods. A basic overview of the biological and technical principles behind multi-platform high-throughput experimentation is followed by expert reviews of Bayesian methodology, tools and software for single group inference, group comparisons, classification and clustering, motif discovery and regulatory networks, and Bayesian networks and gene interactions.
1. An introduction to high-throughput bioinformatics data Keith Baggerly, Kevin Coombes and Jeffrey S. Morris
2. Hierarchical mixture models for expression profiles Michael Newton, Ping Wang and Christina Kendziorski
3. Bayesian hierarchical models for inference in microarray data Anne-Mette K. Hein, Alex Lewin and Sylvia Richardson
4. Bayesian process-based modeling of two-channel microarray experiments estimating absolute mRNA concentrations Mark A. van de Wiel, Marit Holden, Ingrid K. Glad, Heidi Lyng and Arnoldo Frigessi
5. Identification of biomarkers in classification and clustering of high-throughput data Mahlet Tadesse, Marina Vannucci, Naijun Sha and Sinae Kim
6. Modeling nonlinear gene interactions using Bayesian MARS Veerabhadran Baladandayuthapani, Chris C. Holmes, Bani K. Mallick and Raymond J. Carroll
7. Models for probability of under- and over-expression: the POE scale Elizabeth Garrett-Mayer and Robert Scharpf
8. Sparse statistical modelling in gene expression genomics Joseph Lucas, Carlos Carvalho, Quanli Wang, Andrea Bild, Joseph Nevins and Mike West
9. Bayesian analysis of cell-cycle gene expression Chuan Zhou, Jon Wakefield and Linda L. Breeden
10. Model-based clustering for expression data via a Dirichlet process mixture model David Dahl
11. Interval mapping for Expression Quantitative Trait Loci mapping Meng Chen and Christina Kendziorski
12. Bayesian mixture model for gene expression and protein profiles Michele Guindani, Kim-Anh Do, Peter Muller and Jeffrey S. Morris
13. Shrinkage estimation for SAGE data using a mixture Dirichlet prior Jeffrey S. Morris, Kevin Coombes and Keith Baggerly
14. Analysis of mass spectrometry data using Bayesian wavelet-based functional mixed models Jeffrey S. Morris, Philip J. Brown, Keith Baggerly and Kevin Coombes
15. Nonparametric models for proteomic peak identification and quantification Merlise Clyde, Leanna House and Robert Wolpert
16. Bayesian modeling and inference for sequence motif discovery Mayetri Gupta and Jun S. Liu
17. Identifying of DNA regulatory motifs and regulators by integrating gene expression and sequence data Deuk Woo Kwon, Sinae Kim, David Dahl, Michael Swartz, Mahlet Tadesse and Marina Vannucci
18. A misclassification model for inferring transcriptional regulatory networks Ning Sun and Hongyu Zhao
19. Estimating cellular signaling from transcription data Andrew V. Kossenkov, Ghislain Bidaut and Michael Ochs
20. Computational methods for learning Bayesian networks from high-throughput biological data Bradley Broom and Devika Subramanian
21. Modeling transcriptional regulation: Bayesian networks and informative priors Alexander J. Hartemink
22. Sample size choice for microarray experiments Peter Muller, Christian Robert and Judith Rousseau
Kim-Anh Do is a Professor in the Department of Biostatistics and Applied Mathematics at the University of Texas M. D. Anderson Cancer Center. Her research interests are in computer-intensive statistical methods with recent focus in the development of methodology and software to analyze data produced from high-throughput optimization.
Peter Müller is a Professor in the Department of Biostatistics and Applied Mathematics at the University of Texas M. D. Anderson Cancer Center. His research interests and contributions are in the areas of Markov chain Monte Carlo posterior simulation, nonparametric Bayesian inference, hierarchical models, mixture models and Bayesian decisions problems.
Marina Vannucci is a Professor of Statistics at Rice University. Her research focuses on the theory and practice of Bayesian variable selection techniques and on the development of wavelet-based statistical models and their applications. Her work is often motivated by real problems that need to be addressed with suitable statistical methods.
"[...] an authoritative volume [...] presents the state of the art statistical techniques that are starting to make an impact at the forefronts of modern scientific discovery."
- Journal of the RSS