|
|
|
| Home Quick guide Navi BLAST Site map Download Contact us Help |
| H-InvDB_8.3 released on March 26, 2013. | |
![]() |
List of H-Invitational IDs |
| 3.0M | Text File | |
| Format 1: DNA databank accession numbers, 2: HIT : H-Invitational transcript, 3: HIX : H-Invitational clusters, 4: cDNA data provider | ||
![]() |
List of new, deleted and updated H-Invitational IDs |
| 256K | Text File |
![]() |
Annotation data sets |
| H-InvDB-HIX_8.1 | ||
| 52M | Flat File | |
| 68M | XML File | |
| Annotation for all H-Inv clusters are provided in both flat file and XML format | ||
| H-InvDB-HIT_8.2 | ||
| 555M | Flat File | |
| 148M | XML File | |
| 138M | XML File | |
| 141M | XML File | |
| 135M | XML File | |
| 82M | XML File | |
| NOTES: Annotation for all H-Inv transcripts are provided in both flat file and XML format | ||
| H-InvDB-HIP_8.1 | 311M | Flat File |
| 106M | XML File | |
| 105M | XML File | |
| 112M | XML File | |
| 3.7M | XML File | |
| NOTES: Annotation for all H-Inv proteins are provided in both flat file and XML format | ||
![]() |
Section data of annotation |
| H-InvDB-HIX-Annotation_8.1 | ||
| 42M | Flat File | |
| Section of basic annotation for H-Inv clusters are provided flat file format | ||
| H-InvDB-HIT-Annotation_8.2 | ||
| 387M | Flat File | |
| NOTES: Section of basic annotation for H-Inv transcripts are provided flat file format. | ||
| H-InvDB-Expression_8.1 | ||
| 7.9M | Flat File | |
| Section of tissue-specific expression data is provided flat file format. | ||
| H-InvDB-DiseaseInfo_8.0 | ||
| 8.8M | Flat File | |
| Section of disease information is provided flat file format. | ||
| H-InvDB-Evolution_7.1 | ||
| 24M | Flat File | |
| Section of molecular evolutionary analysis is provided flat file format. | ||
| H-InvDB-3Dstructure_8.0 | ||
| 29M | Flat File | |
| NOTES: Section of prediction of 3-D structure of H-Inv transcripts by GTOP (http://sybock.genes.nig.ac.jp/~hinv3/gtop.html) are provided flat file format | ||
| H-InvDB-Subcellular_8.0 | ||
| 19M | Flat File | |
| NOTES: Section of prediction of subcellular localization by WoLF PSORT, TargetP, TMHMM and SOSUI are provided flat file form | ||
![]() |
Sequence data sets |
| Nucleotide data sets | ||
| 138M | Flat File | |
| 27M | Flat File | |
| 46M | Flat File | |
| 45M | Flat File | |
| 501M | Flat File | |
|
NOTES: 1) Nucleotide sequences of all H-Inv transcripts (HITs) in FASTA format. 2) Nucleotide sequences of representative H-Inv transcripts (HITs) in FASTA format. 3) Nucleotide sequences of representative splicing variants (RASV) of H-Inv transcripts (HITs) in FASTA format. 4) Nucleotide sequences of all H-Inv cDNAs (HITs) in FASTA format. 5) Nucleotide sequences of the genome for the all H-Inv clusters (HIXs) in FASTA format. Headline> HIT version | HIX version | HIP version | DNA databank accession number | "FS": frame shift error if revised or "NO" for no revision | "IM": remaining intronic sequence if revised or "NO" for no revision | "HC": function if human curated or "AA" for auto-annotated| Frame (+3 to -3) | Position of CDS (start..end) | Definition Body: cDNA sequences |
||
| Amino acid data sets | ||
| 55M | Flat File | |
| 8.9M | Flat File | |
| 19M | Flat File | |
| 14M | Flat File | |
| 54M | Flat File | |
|
1) Translation of all H-Inv transcripts (HITs) in FASTA format. 2) Translation of representative H-Inv transcripts (HITs) in FASTA format. 3) Translation of representative alternative variants (RASV) of H-Inv transcripts (HITs) in FASTA format. 4) Translation of all H-Inv cDNAs (HITs) in FASTA format. 5) Translation of all H-Inv proteins (HIPs) in FASTA format. Headline> HIT version | HIX version | HIP version | DNA databank accession number | "FS": frame shift error if revised or "NO" for no revision | "IM": remaining intronic sequence if revised or "NO" for no revision | "HC": function if human curated or "AA" for auto-annotated| Frame (+3 to -3) | Position of CDS (start..end) | Definition Body: translation |
||
![]() |
Results of computational analysis |
| Multiple alignment information | ||
| 312M | Flat File | |
| NOTES: "FMULTI" provides multiple-alignments of all the H-Inv transcripts and RefSeq sequences mapped in the same H-Inv cluster(HIX), against human genome sequence. | ||
| Positional information of mapping | ||
| 95M | Flat File | |
|
NOTES: "FMULTIP"; FMULTIP.tar.gz (mALNp/.tbl) provides positional information for all the exon of all the H-Inv transcripts and RefSeq sequences mapped in the same H-Inv cluster(HIX), against human genome sequence. Format: 01:HIT (or acc), 02:HIX (or temporally cluster-id), 03:key (=acc_chr_sno), 04:seq1 ("genome"), 05:seq2 (cDNA_acc), 06:strand, 07:exon_no, 08:type[N:not aligned part of seq1(genome); U:unmapped part of seq2(cDNA); A:alignment, D:deletion, I:insertion; G:gap due to other member)], 09:start_seq1 (genome), 10:end_seq1 (genome), 11:start_seq2 (cDNA), 12:end_seq2 (cDNA) (or gap_length if 08:type='G') |
||
| 6.7M | GFF3 File | |
| 28M | GFF3 File | |
|
NOTES: "h-inv_pub_rep.gff3.gz","h-inv_pub.gff3.gz"; The positional information of the exon, CDS and UTR in GFF3 format. Format:01: "seqid (HIT)", 02: "source", 03: "type", 04: "start", 05: "end", 06: "score", 07: "strand", 08: "phase", 09: "attributes" Attributes (exon):ID=HIX, Name=HIT, Note=HUGO gene symbol Attributes (CDS):ID=HIT, Parent=HIX, Name=HIT, Alias=definition, Note=HUGO gene symbol, accession Attributes (UTR):Parent=HIT, Name=HIT |
||
| H-ANGEL matrix | ||
| 25M | Flat File | |
|
NOTES: "H-ANGEL_matrix.txt.gz" provides gene expression matrix of H-ANGEL. The followings are the No. of column and each description. Format: 1: Type of platform, 2: experimental ID added by each provider, 3: primer/probe ID. e.g. GeneChip Identifier, 4: 10 categories collapsed by the avarage of 40 categories, 5: 10 categories collapsed by the maximum value of 40 categories, 6: 40 categories, 7: Acc corresponding to the primer/probe, 8: start site of the primer/probe on the genome, 9: end site of the primer/probe on the genome, 10: Absolute value of the expression data for EST and SAGE only, NaN otherwise, 11: HIX, 12: UniGene ID, 13: start site of HIX on the genome, 14: end site of HIX on the genome, 15: strand of the locus, 16: Acc(s) included within HIX. |
||
| Molecular evolutionary annotation (Evola) | ||
| Flat File | ||
|
"Evola.txt.gz": provides ortholog accession number list of Evola |
||
| Inter-species multi-FASTA (Transcript) | ||
| Flat File | ||
| NOTES: "NFAS.tar.gz" provides multiple FASTA of transcript nucleotide sequences of human and other species orthologs. | ||
| Inter-species multi-FASTA (Protein) | ||
| Flat File | ||
| NOTES: "PFAS.tar.gz" provides multiple FASTA of protein amino acid sequences of human and other species orthologs. | ||
| Phylogenetic trees | ||
| Flat File | ||
| NOTES: "NJ.tar.gz" provides the phylogenetic trees (phb files) constructed by the Neighbor-joining method (amino acid) | ||
| Human protein complex database with quality index (PCDq), data set | ||
| 6.5M | TSV Files (tar.gz file) | |
| FTP Download |
|
FTP site for downloading data files in H-Invitational Database (DNA Data Bank of Japan). ftp://ftp.ddbj.nig.ac.jp/mirror_database/hinv/ |


