Back to Downloads

Contents of this Demonstration Database

The demo database contains Gene clusters linked to a gene ontology graph.

Gene information comes from LocusLink (http://www.ncbi.nih.gov/LocusLink/). All Human, Mouse, and Rat LocusLink records from the LL_tmpl flatfile were loaded into Gene clusters, which each Gene node a member of a unique Cluster of Orthologous Groups (COG) node. No attempt was made to preserve orthologous relationships by linking genes from different organisms to the same COG. Such an effort would be highly desirable. The following information was imported from each LocusLink record: gene names, symbols, organism, locuslink ID, and Gene Ontology (GO) IDs. The cluster's primary name was designated based on the value of one of the following LocusLink tags (in order of precedence): OFFICIAL_GENE_NAME, PREFERRED_GENE_NAME, OFFICIAL_SYMBOL, PREFERRED_SYMBOL.

Gene ontology information comes from the Gene Ontology Consortium (http://www.geneontology.org/). The node/edge structure follows that specified in the xml flatfile, with the exception that the root node is ignored. Links between Genes and GO nodes are made by matching the GO identifier specified in the LocusLink record to the appropriate GO node.