-
Articles citing this article
- PubMed -
Same authors
- PubMed - Recommend this article
- Download citation
- Alert me if this article is corrected
|
||||||||||||||||||
Genet. Sel. Evol. 39 (2007) 669-683
DOI: 10.1051/gse:2007031
Analysis of a simulated microarray dataset: Comparison of methods for data normalisation and detection of differential expression (Open Access publication)
Michael Watsona, Mónica Pérez-Alegreb, Michael Denis Baronc, Céline Delmasd, Peter Dovce, Mylène Duvald, Jean-Louis Foulleyf, Juan José Garrido-Pavónb, Ina Hulseggeg, Florence Jaffrézicf, Ángeles Jiménez-Marínb, Miha Lavrice, Kim-Anh Lê Caoh, Guillemette Marotf, Daphné Mouzakih, Marco H. Poolc, Christèle Robert-Graniéd, Magali San Cristobald, Gwenola Tosser-Kloppd, David Waddingtonh and Dirk-Jan de Koningha Institute for Animal Health, Compton, UK (IAH_C)
b University of Cordoba, Cordoba, Spain (CDB)
c Institute for Animal Health, Pirbright, UK (IAH_P)
d INRA, Castanet-Tolosan, France (INRA_T)
e University of Ljubljana, Slovenia (SLN)
f INRA, Jouy-en-Josas, France (INRA_J)
g Animal Sciences Group Wageningen UR, Lelystad, NL (IDL)
h Roslin Institute, Roslin, UK (ROSLIN)
(Received 10 May 2007; accepted 10 July 2007; published online 6 December 2007)
Abstract - Microarrays allow researchers to measure the expression of thousands of genes in a single experiment. Before statistical comparisons can be made, the data must be assessed for quality and normalisation procedures must be applied, of which many have been proposed. Methods of comparing the normalised data are also abundant, and no clear consensus has yet been reached. The purpose of this paper was to compare those methods used by the EADGENE network on a very noisy simulated data set. With the a priori knowledge of which genes are differentially expressed, it is possible to compare the success of each approach quantitatively. Use of an intensity-dependent normalisation procedure was common, as was correction for multiple testing. Most variety in performance resulted from differing approaches to data quality and the use of different statistical tests. Very few of the methods used any kind of background correction. A number of approaches achieved a success rate of 95% or above, with relatively small numbers of false positives and negatives. Applying stringent spot selection criteria and elimination of data did not improve the false positive rate and greatly increased the false negative rate. However, most approaches performed well, and it is encouraging that widely available techniques can achieve such good results on a very noisy data set.
Key words: gene expression / two colour microarray / simulation / statistical analysis
Correspondence and reprints: michael.watson@bbsrc.ac.uk
© INRA, EDP Sciences 2007
| What is OpenURL? |
- If your librarian has set up your subscription with an OpenURL resolver, OpenURL links appear automatically on the abstract pages.
- You can define your own OpenURL resolver with your EDPS Account. In this case your choice will be given priority over that of your library.
- You can use an add-on for your browser (Firefox or I.E.) to display OpenURL links on a page (see http://www.openly.com/openurlref/). You should disable this module if you wish to use the OpenURL server that you or your library have defined.


Document
BibSonomy
CiteUlike
Connotea
Del.icio.us
Digg
Facebook