Open Access
Genet. Sel. Evol.
Volume 39, Number 6, November-December 2007
Page(s) 633 - 650
Published online 06 December 2007
Genet. Sel. Evol. 39 (2007) 633-650
DOI: 10.1051/gse:2007029

Analysis of the real EADGENE data set: Comparison of methods and guidelines for data normalisation and selection of differentially expressed genes (Open Access publication)

Florence Jaffrézica, Dirk-Jan de Koningb, Paul J. Boettcherc, Agnès Bonnetd, Bart Buitenhuise, Rodrigue Clossetf, Sébastien Déjeang, Céline Delmash, Johanne C. Detilleuxi, Peter Dovcj, Mylène Duvalh, Jean-Louis Foulleya, Jakob Hedegaarde, Henrik Hornshøje, Ina Hulseggek, Luc Jansse, Kirsty Jensenb, Li Jiange, Miha Lavricj, Kim-Anh Lê Caog, h, Mogens Sandø Lunde, Roberto Malinvernic, Guillemette Marota, Haisheng Niel, Wolfram Petzlm, Marco H. Poolk, Christèle Robert-Graniéh, Magali San Cristobald, Evert M. van Schothorstn, Hans-Joachim Schubertho, Peter Sørensene, Alessandra Stellac, Gwenola Tosser-Kloppd, David Waddingtonb, Michael Watsonp, Wei Yangq, Holm Zerbem and Hans-Martin Seyfertq

a  INRA, UR337, Jouy-en-Josas, France (INRA_J);
b  Roslin Institute, Roslin, UK (ROSLIN);
c  Parco Tecnologico Padano, Lodi, Italy (PTP);
d  INRA, UMR444, Castanet-Tolosan, France (INRA_T);
e  University of Aarhus, Tjele, Denmark (AARHUS);
f  University of Liège, Liège, Belgium (ULg2);
g  Université Paul Sabatier, Toulouse, France (INRA_T);
h  INRA, UR631, Castanet-Tolosan, France (INRA_T);
i  Faculty of Veterinary Medicine, University of Liège, Liège, Belgium (ULg1);
j  University of Ljubljana, Slovenia (SLN);
k  Animal Sciences Group Wageningen UR, Lelystad, The Netherlands;
l  Wageningen University and Research Centre, Wageningen, The Netherlands (WUR);
m  Ludwig-Maximilians-University, Munich, Germany;
n  RIKILT-Institute of Food Safety, Wageningen, The Netherlands (WUR);
o  University of Veterinary Medicine, Hannover, Germany;
p  Institute for Animal Health, Compton, UK (IAH);
q  Research Institute for the Biology of Farm Animals, Dummerstorf, Germany

(Received 10 May 2007; accepted 6 July 2007 ; published online 6 December 2007)

Abstract - A large variety of methods has been proposed in the literature for microarray data analysis. The aim of this paper was to present techniques used by the EADGENE (European Animal Disease Genomics Network of Excellence) WP1.4 participants for data quality control, normalisation and statistical methods for the detection of differentially expressed genes in order to provide some more general data analysis guidelines. All the workshop participants were given a real data set obtained in an EADGENE funded microarray study looking at the gene expression changes following artificial infection with two different mastitis causing bacteria: Escherichia coli and Staphylococcus aureus. It was reassuring to see that most of the teams found the same main biological results. In fact, most of the differentially expressed genes were found for infection by E. coli between uninfected and 24 h challenged udder quarters. Very little transcriptional variation was observed for the bacteria S. aureus. Lists of differentially expressed genes found by the different research teams were, however, quite dependent on the method used, especially concerning the data quality control step. These analyses also emphasised a biological problem of cross-talk between infected and uninfected quarters which will have to be dealt with for further microarray studies.

Key words: quality control / differentially expressed genes / mastitis resistance / microarray data / normalisation

Correspondence and reprints:

© INRA, EDP Sciences 2007