Status and Investigators to Contact on Data Use and Publication modify this page

MOUSE BXD: The University of Western Australia data sets (Thymus, Spleen, Peripheral Blood Leucocytes) Status: Currently these are unpublished and private data source with usage restrictions. Please contact Dr. Grant Morahan regarding access and use of these data.
Data that are accessible via GeneNetwork belong to several research groups listed below. Some of the data sets are still actively being generated and analyzed. The scientists who are generating these data have often agreed to remove password protection and let the research community view, share, and analyze data. Although they are willing and enthusiastic about sharing these data, they have not relinquished interest or ownership. If you are planning to use results and data extracted from GeneNetwork in publication, we request that you contact the data owners prior to submission.

MOUSE BXD: Thymus, Spleen, Peripheral Blood Leucocytes data sets:

Status: These are unpublished and private data source with usage restrictions. Error-checking and refinement of this data set is still in progress. Data were first entered November 2008.

References and Contact: For access to data prior to publication, please contact Grant Morahan (gem at waimr. uwa. edu. au) regarding use of these data sets on a collaborative basis.

These expression data sets are being generated by investigators at The Western Australian Institute for Medical Research and The University of Western Australia (Grant Morahan, Munish Mehta, Quang Nguyen, James Jooste, and Violet Peeva). Samples are generated by Quang Nguyen and James Jooste. Arrays are all processed by Quang Nguyen.

MOUSE BXD: UTHSC Brain U74Av2 data sets:

Status: DEPRECATED, PUBLISHED, and OPEN data source, NO usage restrictions as of March 1, 2005. Even the most recent release of these data is now obsolete. We recommend using more recent Brain data sets from the University of Colorado Denver (Tabakoff and colleagues) or from INIA.

References: Chesler EJ, Lu L, Shou S, Qu Y, Gu J, Wang J, Hsu HC, Mountz JD, Baldwin N, Langston MA, Threadgill DW, Manly KF, Williams RW (2005) Genetic dissection of gene expression reveals polygenic networks modulating brain structure and function. Nature Genetics 37:233-242

The forebrain and midbrain expression data were generated using The William and Dorothy Dunavant Endowment to RWW and published in early 2005 (Chesler et al., 2005). Arrays were processed at Genome Explorations (Divyen Patel) or by Dr. Tom Sutter and colleagues at the University of Memphis. Data are now available on the GeneNetwork simply by clicking on the Information Page associated with the many different versions (transforms). Please contact Rob Williams if you have any questions on the use of these open data. We now consider these data to be somewhat archaic and of relatively poor quality compared to recent data sets such as the INIA BXD brain data and the UCHSC BXD brain data.

MOUSE BXD: GNF-Groningen Stem Cell U74Av2 data sets:

Status: PUBLISHED and OPEN data source, NO usage restrictions as of March 1, 2005. Error-checked and complete as of March 1, 2005. There are no plans to modify or expand this data set.

References: Bystrykh L, Weersing E, Dontje B, Sutton S, Pletcher MT, Wiltshire T, Su AI, Vellenga E, Wang J, Manly KF, Lu L, Chesler EJ, Alberts R, Jansen RC, Williams RW, Cooke M, de Haan G (2005) Uncovering regulatory pathways affecting hematopoietic stem cell function using “genetical genomics” Nature Genetics 37:225-232

These hematopoietic stem cell expression data were generated by collaborators at The University of Groningen (Gerald de Haan) and The Genomics Institute of the Novartis Research Foundation (Mike Cooke). Flow-sorted cells were generated in Holland. RNA samples and arrays were processed at GNF in La Jolla, CA. This team incorporated all of their data into the GeneNetwork before publication. The GNF-Groningen stem cell data are now available on the NCBI GEO site using the accession identifier GSE2031. If you have questions on the use of these data, please contact Gerald de Haan and Michael Cooke .

MOUSE BXD: SJUT M430 Cerebellum data sets:

Status: Unpublished and open data source with usage restrictions only on large-scale and global analysis. This data set is now complete as of 2005.

These cerebellar expression data are being generated by a consortium of investigators at St. Jude Children's Research Hospital (Clayton Naeve, Tom Curran, Peter McKinnon, Jim Morgan, Rich Smeyne) and at UTHSC (Dan Goldowitz, Lu Lu, Kristin Hamre, and Rob Williams). Samples are generated at UTHSC by Lu Lu and colleagues. Arrays are all processed at SJCRH by Clayton Naeve and colleagues. Please contact Rob Williams, or Dan Goldowitz regarding use of these data sets in publications or projects.

MOUSE BXD: Helmholtz Centre for Infection Research data sets:

Status: Currently these are unpublished and private data source with usage restrictions.

Please contact Dr. Klaus Schughart regarding access and use of these data.

November 17, 2006.

4. MOUSE BXD: INIA Brain mRNA M430 data sets:

Status: PUBLISHED and OPEN data source, with no usage restrictions. Error-checked and complete as of March 1, 2005. There are no plans to modify or expand this data set. Error-checked and nearly complete as of Sept 1, 2005.

    INIA data access:

Normalized data are available for this INIA data set at

  • Jan 2006, PDNN normalization (17 Mb file with strain means): ftp://atlas.utmem.edu/Public/Mouse_bxd/INIA_M_0106_PDNN.txt
  • Jan 2006, RMA normalization (17 Mb file with strain means): ftp://atlas.utmem.edu/Public/Mouse_bxd/INIA_M_0106_RMA.txt
  • June 2006, QTL results from RMA normalized data (5.7 Mb, no strain means): ftp://atlas.utmem.edu/Public/Mouse_bxd/INIA_M_0606_RMA.txt
  • All data in ZIP format: ftp://atlas.utmem.edu/Public/Mouse_bxd/INIA_mRNA_data_sets.zip
  • References: Peirce JL, Lu L, Li H, Wang J, Manly KF, Hitzemann RJ, Belknap JK, Rosen GD, Goodwin S, Sutter TR, Williams RW (2006) How replicable are mRNA expression QTLs. Mammalian Genome 17:643-642

    These forebrain and midbrain expression sets were generated with continued support from NIAAA-INIA. Arrays were processed at the University of Memphis by Thomas Sutter and Shirlean Goodwin. These data are openly avaiable at all levels (CEL files, etc). Please contact Robert W. Williams for access to orginal data.

    5. MOUSE BXD: HBP/Rosen Striatum data sets:

    Status: Unpublished and open data source with usage restrictions only on large-scale and global analysis. Error-checked but still incomplete. We plan to significantly expand the size of this data set in 2005-2006.

    The striatal expresssion data sets are being generated by Glenn D. Rosen, Robert W. Williams, and colleagues with continued support from an NIH Human Brain Project award. Tissue and arrays are processed at Beth Israel Deaconess Medical Center by Glenn Rosen and colleagues. Please contact Glenn Rosen regarding extensive use of this data set in publications or projects.

    MOUSE LXS Illumina Hippocampus data sets:

    Status: Unpublished and open data source with usage restrictions only on large-scale and global analysis. Error-checked and complete.

    References: Lu Lu et al. RSA July and CTC May 2007

    This data set was initially entered in GeneNetwork, October 2006. Please contact Dr. Lu Lu if you have any questions on the use of these open data.

    MOUSE AKXD: NCI Mammary tumor mRNA M43 data sets:

    Status: Unpublished and open data source with usage restrictions only on large-scale and global analysis. Error-checked and complete.

    The mammary tumor expresssion data sets have been generated by Kent Hunter and colleagues with support from NCI Laboratory of Population Genetics. Tissue and arrays were processed at NCI. Please contact Kent Hunter regarding extensive use of this data set in publications or projects.

    B6D2F2 database:

    Status: Unpublished but submitted; an open data source with usage restrictions only on large-scale and global analysis. Error-checked and complete as of Sept 1, 2005.

    References: 105. Peirce JL, Lu L, Li H, Wang J, Manly KF, Hitzemann RJ, Belknap JK, Rosen GD, Goodwin S, Sutter TR, Williams RW (2006) How replicable are mRNA expression QTLs. Mammalian Genome 17:643-642

    All of the OHSU/VA B6D2F2 Brain mRNA M430AB data sets have been generated by Robert Hitzemann and John Belknap at The Oregon Health Sciences University in Portland. For contact and citations and other information on these data sets please review the INFO pages and contact Drs. Belknap or Hitzemann regarding use of this data set in publications or projects.

    B6BTBRF2-ob Mouse Liver and Metabolic Trait data sets:

    Status: Published open data source with no usage restrictions. Error-checked and complete. The F2 data set used in the manuscript is available at GEO under the accession number "GSE3330".

    References: Lan H, Chen M, Byers JE, Yandell BS, Stapleton DS, Mata CM, Mui TK, Flowers MT, Schueler KL, Manly KF, Williams RW, Kendziorski, CM, Attie AD (2006) Combined expression trait correlations and expression quantitative trait locus mapping. PLoS Genetics 2:51-61

    The liver expression data were generated by Dr. Alan Attie and colleagues at the University of Wisconsin. All original Affymetrix data files were generated by Attie and colleagues. Data sets in The GeneNetwork were processed as described in the Information pages. Data sets were opened to the public by Dr. Alan Attie and colleagues, August 15, 2005. Please contact Alan Attie for help in the use of these data.

    UNC Agilent Liver data sets:

    Status: PUBLISHED and OPEN data source, NO usage restrictions as of February 1, 2007. Error-checked and complete as of March 1, 2005. There are no plans to modify or expand this data set.

    References: Gatti D, Maki A, Chesler EJ, Kosyk O, Kirova R, Lu L, Manly KF, Qu Y, Williams RW, Perkins A, Langston ME, Threadgill DW, Rusyn I (2007) Genome-level analysis of genetic regulation of liver gene expression networks. Hepatology in press

    The BXD liver expression data sets were generated by Ivan Rusyn, David Threadgill, and colleagues with support from an NIEHS Toxicogenomics award. Arrays and the final data sets in The GeneNetwork were generated at the UNC Array Core. Please contact Ivan Rusyn to obtain original data files or for help in the use of these data.

    10. Hippocampus Consortium BXD and CXB data sets:

    Status: Unpublished and open data source for BXD, CXB, and diverse inbred strains of mice with a usage restrictions on large-scale and global analysis. Error-checked and essentially complete.

    The hippocampus expresssion data sets were generated by a consortium of investigator with support from a large number of funding agencies. Tissue and arrays are processed at the University of Memphis by Thomas Sutter and Shirlean Goodwin. Please contact Robert W. Williams regarding extensive use of this data set in publications or projects.

    Hamilton Eye Institute Mouse Eye data sets:

    Status: Unpublished and open data source with usage restrictions only on large-scale and global analysis. Error-checked and essentially complete by September 2006.

    This eye expresssion data set were generated by Robert W. Williams, E. E. Geisert, L. Lu, and W. Gu with support solely from Barrett G. Haik. Tissue and arrays were processed at the VA Medical Center, Memphis, by Weikuan Gu and Yan Jiao. Please contact Robert W. Williams regarding extensive use of this data set in publications or projects.

    Barley seedling leaf and embryo 22K GeneChip data sets from SCRI:
    Status: unpublished. The data sets will become OPEN with no usage restrictions as soon as publication is accepted (expected by mid 2007). The data sets will also be available from the ArrayExpress: accessions E-TABM-111 and E-TABM-112. Please contact Arnis Druka if you are interested in using these data sets before they are released.

    The barley expression data sets were funded by the BBSRC grant SCR/910/04 to Michael Kearsey (University of Birmingham, UK) and Robbie Waugh (SCRI, UK). Tissue and RNA isolation was performed by Arnis Druka at SCRI. Arrays were processed by Roger Wise (Iowa State University). Genetic linkage map was assembled by Arnis Druka by integrating updated RFLP-based mapping data provided by Andris Kleinhofs (Washington State University, Pullman) with SNP genotyping data (based on pilot barley OPA1) provided by Timothy Close (University of California, Riverside).

    Genotypes Files:

    All microsatellite and SNP marker genotype data files for mouse genetic reference populations (AXB/BXA, BXD, LXS, CXB, BXH) are public. Genotype files for the B6D2F2 and B6BTBRF2 are public. The rat HXB/BXH genotypes are public. BayxSha Arabidopsis genotype files are also public. Any of the genotype files is available upon request to R. W. Williams or original data providers.

    The majority of mouse genotypes in use after May 2005 are SNPs that were genotyped by the Jonathan Flint, Richard Mott (Wellcome Trust, Oxford), and Robert Williams. DNA samples from more than ~480 strains of mice were genotyped at 15,360 SNPs at Illumina in early 2005. The entire set is referred to as the Wellcome-CTC SNP data set. The appropriate URL citation for these data is currently www.well.ox.ac.uk/mouse/INBREDS/.

    The specific mouse genotype files used by WebQTL incorporated both SNPs and microsatellite have been substantially modified and error-checked and will not correspond precisely to the original Wellcome-CTC SNP data set files. All of these new markers are public and can be used. Please contact Rob Williams if you would like the specific files used in any of the mouse genetic reference populations.

    Phenotypes databases:

    Mouse phenotype databases were generated primarily by extracting trait values from the literature. All of the phenotype databases (BXD, AXB, CXB, BXH, LXS, LGXSM, HXB/BXH, and BayxSha) are curated by Elissa Chesler and Robert W. Williams. In several cases, these databases include extensive and still unpublished traits. Please contact Elissa Chesler or Rob Williams regarding new phenotypes you would like entered into any of the databases or regarding appropriate use of the entire database.

    Genomic and Array Annotation databases:

    The GeneNetwork relies on custom and public databases for Affymetrix, Illumina, and Agilent array platforms. In the case of the mouse Affymetrix U74Av2 and M430 arrays, and the rat RAE230A array, we have extensively annotated probes and probe set data. Our files are manually curated and will NOT correspond 1-to-1 with any other publically available annotation of these particular Affymetrix platforms. In the case of mouse, we have data on the positions of a large number of SNPs. These data were contributed by a number of colleagues and integrated by Rob Williams, Rob Crowell, and colleagues. Please contact Rob Williams if you would like access to parts of this data set. Detailed SNP data have now been placed on WebQTL maps and we thank Celera for providing early access to these data in 2003.

        Information about this text file:

    This text file originally generated by RWW, March 2004. Updated by RWW, Nov 12, 2004; Dec 4, 2004; EJC, Aug 29, 2005; RWW, Sept 4, 2005; Nov 5, 2005; Jan 26, 2007; AD, Jan 28, 2007.