H-InvDB x AHG DB
Transcript view
H-InvDB_8.3 released on March 26, 2013.
H-Invitational ID: HIT000243877
Accession number: AJ001481
Created date: 26-Mar-2013
Last modified: 20-Apr-2012
Definition: Homeobox domain containing protein.
 
 

Transcript original information
Accession number AJ001481.1
CAGE tag ID NA
EST ID NA
Clone Number NA
Experimental resources NBRC (NITE Biological Resource Center); HGPD (Human Gene and Protein Database); Antibody (DUX1): search for human antibodies at "BIO-kaimono.com"; Catalog (DUX1): search for experimental product catalogs at "BIO-kaimono.com"
Sequence data provider NA
Annotation project NA
Length of cDNA 1227 bp (No. of exons: 0) [A:239 T:195 G:381 C:412]
Division HUM
Molecular type mRNA
Library origin Cell type rhabdomyosarcoma
Tissue type NA
Developmental stage NA
Mini-G
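The base counts in the "Length of cDNA" field can be cross-checked against the stated length. The following is a minimal sketch in Python that uses only the numbers reported in this record; the variable names are illustrative and are not part of H-InvDB.

```python
# Base counts as reported in the "Length of cDNA" field of this record.
counts = {"A": 239, "T": 195, "G": 381, "C": 412}

# The four counts should add up to the stated cDNA length (1227 bp).
total = sum(counts.values())

# GC fraction: share of G and C bases in the whole cDNA.
gc_fraction = (counts["G"] + counts["C"]) / total

print(f"total length: {total} bp")
print(f"GC fraction: {gc_fraction:.3f}")
```

A GC fraction well above 0.5, as here, means the transcript is GC-rich.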
Sequence quality information
CDS feature Complete CDS
Kozak sequence NA
PolyA NA
Vector/adapter sequence NA
Frame shift NA
Remaining intron NA
Splice site acceptor (NAGNAG) NA
Transcript quality feature NA
Notes UM gene supported by high-quality evidence.

Gene structure information
H-Inv cluster ID HIX0041649
Genomic location Chromosome UM
Location NA
Position NA
Strand NA
Possible duplicated location(s) NA
Gene structure  exon(s)
Database links RefSeq NM_012146; NM_012148; NM_012149
Ensembl NA
Entrez Gene Entrez Gene ID:26584
KEGG GENES KEGG GENES(26584)
GeneCard DUX1 (GeneCards is provided free to academic non-profit institutions.)
Related H-InvDB links H-DBAS; G-integra; cDNA-genome alignment

Predicted CDS information
HIP ID HIP000029642
Predicted CDS 113..625;  170[aa];  Orientation:+2; 
Codon Adaptation Index (CAI). 0.785
Database links RefSeq NP_036278
UniProt O43812
CCDS NA
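The predicted CDS coordinates can be checked arithmetically. The sketch below assumes that the interval 113..625 is 1-based and inclusive, that the final codon is the stop codon (the ORF listed under "Motif information" ends in `*`), and that "Orientation:+2" denotes forward reading frame 2. These are interpretations of the record, and `cds_summary` is a hypothetical helper, not an H-InvDB function.

```python
def cds_summary(start: int, end: int) -> dict:
    """Summarise a 1-based, inclusive CDS interval on the forward strand."""
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_nt // 3
    return {
        "length_nt": length_nt,
        # Assumption: the last codon is a stop codon and encodes no residue.
        "amino_acids": codons - 1,
        # Forward reading frame (1, 2 or 3) of the first CDS base.
        "frame": (start - 1) % 3 + 1,
    }


summary = cds_summary(113, 625)
print(summary)
```

Under these assumptions the interval spans 513 nucleotides, which gives 170 residues plus a stop codon in frame 2, in agreement with the "170[aa]" and "+2" values in the record.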

Motif information
ORF

length(170),orf(113:625)
MALLTALDDTLPEEAQGPGRRMILLSTPSQSDALRACFERNLYPGIATKE
ELAQGIDIPEPRVQIWFQNERSCQLRQHRRQSRPWPGRRDPQKGRRKRTA
ITGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHR
GQSGRAPTQASIRCNAAPIG*
Motif position, a.a. length, InterPro ID, Name
length(75), motif(10:84) 75 IPR009057 Homeodomain-like [Domain]
length(61), motif(17:77) 61 IPR001356 Homeobox [Domain]
length(66), motif(18:83) 66 IPR012287 Homeodomain-related [Domain]
length(63), motif(19:81) 63 IPR001356 Homeobox [Domain]
length(46), motif(27:72) 46 IPR001356 Homeobox [Domain]
length(79), motif(85:163) 79 IPR009057 Homeodomain-like [Domain]
length(70), motif(90:159) 70 IPR012287 Homeodomain-related [Domain]
length(61), motif(92:152) 61 IPR001356 Homeobox [Domain]
length(63), motif(94:156) 63 IPR001356 Homeobox [Domain]
length(56), motif(95:150) 56 IPR001356 Homeobox [Domain]
length(10), motif(123:132) 10 IPR000047 Helix-turn-helix motif, lambda-like repressor [Domain]
length(24), motif(127:150) 24 IPR017970 Homeobox, conserved site [Conserved_site]
length(17), motif(132:148) 17 IPR000047 Helix-turn-helix motif, lambda-like repressor [Domain]
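Each motif row gives a span in amino-acid coordinates together with its length. A small sketch, assuming the spans are 1-based and inclusive, parses a few of the rows above and checks that the stated length equals `end - start + 1`. The regular expression and the `parse_row` helper are illustrative only and describe this text layout, not an official H-InvDB export format.

```python
import re

# Three rows copied from the motif table above.
rows = [
    "length(75), motif(10:84) 75 IPR009057 Homeodomain-like [Domain]",
    "length(61), motif(17:77) 61 IPR001356 Homeobox [Domain]",
    "length(24), motif(127:150) 24 IPR017970 Homeobox, conserved site [Conserved_site]",
]

# Matches the leading "length(N), motif(START:END)" part of a row.
ROW_PATTERN = re.compile(r"length\((\d+)\), motif\((\d+):(\d+)\)")


def parse_row(row: str) -> tuple:
    """Return (length, start, end) as integers for one motif row."""
    match = ROW_PATTERN.match(row)
    if match is None:
        raise ValueError(f"unrecognised row: {row!r}")
    length, start, end = (int(value) for value in match.groups())
    return length, start, end


parsed = [parse_row(row) for row in rows]

# A span is consistent when its length equals end - start + 1.
consistent = all(length == end - start + 1 for length, start, end in parsed)
print(parsed, consistent)
```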

Gene function information
H-Inv ID HIT000243877
H-Inv cluster ID HIX0041649
Accession number AJ001481.1
CAGE tag ID NA
EST ID NA
Transcript feature Representative transcript
Coding potential Protein coding
Definition Homeobox domain containing protein.
Similarity category Category: IPR domain containing protein (Category III).
InterPro domain (IPR001356)-containing protein.
Experimental evidence NA
PubMed ID NA
Gene family/group H-Inv gene family/group ID NA
Gene family/group name NA
Evidence motif (InterPro) ID NA
Gene symbol/name HGNC symbol DUX1
HGNC aliases "double homeobox, 1"
HGNC name double homeobox 1
DDBJ DUX1
UniProt NA
EC number NA
GGDB (GlycoGene Database) Gene symbol NA
Family NA
Designation NA
Expression NA
KEGG metabolic pathway NA
Protein-protein interaction (PPI) H-Inv protein ID HIP000029642
No. of interaction NA
Interaction partner(s) NA
BIND NA
DIP NA
MINT NA
HPRD NA
IntAct NA
Database links RefSeq NM_012146; NM_012148; NM_012149
Ensembl NA
Entrez Gene Entrez Gene ID:26584
KEGG GENES KEGG GENES(26584)
GeneCard DUX1 (GeneCards is provided free to academic non-profit institutions.)
etc H-GOLD (Human-Gene diversity Of Life-style related Diseases)
Curation status Human curated
Notes UM gene supported by high-quality evidence.
Related H-InvDB links Gene family; Similarity Search Tool; TACT
fRNAdb (The functional RNA database): fRNAdb entries that overlap H-InvDB transcripts by genomic location.
NA

Gene ontology information
Molecular function DNA binding (GO:0003677);  sequence-specific DNA binding (GO:0043565);  transcription factor activity (GO:0003700); 
Biological process regulation of transcription (GO:0045449);  regulation of transcription, DNA-dependent (GO:0006355); 
Cellular component nucleus (GO:0005634); 

Subcellular localization information  Last modified:20-Apr-2012
WoLF PSORT nuclear; 
Target P Other
SOSUI soluble protein
TMHMM soluble protein
PTS1 Not targeted
Related H-InvDB links LIFEdb

Protein structure information (GTOP) Last modified: 20-Apr-2012
Start End PDB_ID E-value Identity Coverage SCOP_ID
27 71 1akhA 9e-09 31.1 45/49 a.4.1.1
82 150 1mnmC 8e-14 18.8 69/77 a.4.1.1
Related H-InvDB links GTOP

Gene expression information Last modified: 20-Apr-2012
Tissue-specific expression NA
Probe information
AceGene AGhsB040402; 
Affymetrix GeneChip
HG-Focus 208176_at;  208582_s_at; 
HG-U133 208176_at;  208582_s_at; 
HG-U133A 208176_at;  208582_s_at; 
HG-U133A_2 208176_at;  208582_s_at; 
HG-U133B NA
HG-U133_Plus_2 208176_at;  208582_s_at; 
HG-U95 31346_at;  31387_at; 
HG-U95A 31346_at;  31387_at; 
HG-U95B NA
HG-U95C NA
HG-U95D NA
HG-U95E NA
HG-U95Av2 NA
HuEx-1_0 3273097;  4038018;  4038043; 
HuGeneFL NA
Agilent Human 1A Oligo Microarray:PGID215 A_23_P146876;  A_23_P158257; 
Whole Human Genome Oligo Microarray:PGID247 A_23_P158257; 
Related H-InvDB links H-ANGEL; DNAProbeLocator

Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
 Single Nucleotide Polymorphism (SNP) and indel
No data available
 Microsatellite (Short Tandem Repeat, STR)
No data available
 Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
 Repeat
Type Start End Strand
LSAU 3 108 -
Database links H-GOLD (Human-Gene diversity Of Life-style related Diseases)
Related H-InvDB links VaryGene; Repeat mask viewer