*--------------------------------------------------------------------------------------------------* Release information of H-InvDB_7.5 http://www.h-invitational.jp Dataset fixed on July 23, 2010. Released on September 10, 2010. *--------------------------------------------------------------------------------------------------* --------------------------- Erratum for Evola annotation --------------------------- In H-InvDB_7.5 Evola, annotation for 19,561 IDs below will be revised in the comming release. Link : http://h-invitational.jp/hinv/download/etc/erratum20100910.txt --------------------------- H-InvDB statistics --------------------------- 1. number of H-Invitational transcripts (HIT) all HIT: 242,813 * protein coding transcripts: 220,739 * non-protein-coding transcripts: 20,969 * psudogene candidates: 1,105 2. number of H-Invitational clusters (HIX) all HIX: 44,806 * protein coding: 36,355 * non-protein-coding: 7,692 * psudogene candidates: 759 3. number of H-Invitational proteins (HIP) all HIP: 137,607 --------------------------- Human nucleotide datasets --------------------------- 1. Human full-length cDNA dataset The dataset contains sequences produced by six institutes. All the sequences are already in DDBJ/EMBL/GenBank. 2. Human mRNA dataset Human mRNA sequences registered in DDBJ/EMBL/GenBank other than full-length cDNA were extracted from DDBJ release 73 obtained on May 9, 2008. 3. Human genome dataset Repeat masked human genome assembly NCBI build 37.1 was obtained from UCSC. (UCSC hg19, Feb. 2009: human genome NCBI b37.1) --------------------------- Databases --------------------------- 1. RefSeq mRNA RefSeq curated mRNAs were obtained from NCBI on July 1, 2009. (RefSeq release 36) 2. Ensembl transcripts Ensembl transcripts were obtained from Ensembl on July 1, 2009. Ensembl [release 54] [NCBI36.54] 2. RefSeq protein RefSeq proteins were obtained from NCBI on July 1, 2009. (RefSeq release 36) 3. UniProt(SWISS-PROT/TrEMBL) UniProt(SWISS-PROT/TrEMBL) entries were obtained from EBI on July 1, 2009. 4. HUGO approved gene symbol http://www.gene.ucl.ac.uk/nomenclature/ Human gene name data fixed on July 23, 2010. 5. Entrez Gene database http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene Relations of H-InvDB genes to Entrez Genes were fixed on July 23, 2010. 6. dbSNP Relations of H-InvDB genes to dbSNP build130 were fixed on July 1, 2009. *--------------------------------------------------------------------------------------------------*