A multivariate prediction model for microarray cross-hybridization

Show full item record

Please use this identifier to cite or link to this item: http://hdl.handle.net/1853/13197

Title: A multivariate prediction model for microarray cross-hybridization
Author: Chen, Yian A. ; Chou, Cheng-Chung ; Lu, Xinghua ; Slate, Elizabeth H. ; Peck, Konan ; Xu, Wenying ; Voit, Eberhard O. ; Almeida, Jonas S.
Abstract: Background: Expression microarray analysis is one of the most popular molecular diagnostic techniques in the post-genomic era. However, this technique faces the fundamental problem of potential cross-hybridization. This is a pervasive problem for both oligonucleotide and cDNA microarrays; it is considered particularly problematic for the latter. No comprehensive multivariate predictive modeling has been performed to understand how multiple variables contribute to (cross-) hybridization. Results: We propose a systematic search strategy using multiple multivariate models [multiple linear regressions, regression trees, and artificial neural network analyses (ANNs)] to select an effective set of predictors for hybridization. We validate this approach on a set of DNA microarrays with cytochrome p450 family genes. The performance of our multiple multivariate models is compared with that of a recently proposed third-order polynomial regression method that uses percent identity as the sole predictor. All multivariate models agree that the 'most contiguous base pairs between probe and target sequences,' rather than percent identity, is the best univariate predictor. The predictive power is improved by inclusion of additional nonlinear effects, in particular target GC content, when regression trees or ANNs are used. Conclusion: A systematic multivariate approach is provided to assess the importance of multiple sequence features for hybridization and of relationships among these features. This approach can easily be applied to larger datasets. This will allow future developments of generalized hybridization models that will be able to correct for false-positive cross-hybridization signals in expression experiments.
Description: © 2006 Chen et al; licensee BioMed Central Ltd. The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2105/7/101 doi:10.1186/1471-2105-7-101
Type: Article
URI: http://hdl.handle.net/1853/13197
ISSN: 1471-2105
Citation: BMC Bioinformatics 2006, 7:101
Date: 2006
Publisher: Georgia Institute of Technology
BioMed Central
Subject: Artificial Neural Network Analyses
Multiple linear regressions
Multiple multivariate models
Regression trees

All materials in SMARTech are protected under U.S. Copyright Law and all rights are reserved, unless otherwise specifically indicated on or in the materials.

Files in this item

Files Size Format View
EOVOIT_1471-2105-7-101.pdf 427.9Kb PDF View/ Open

This item appears in the following Collection(s)

Show full item record