H-InvDB x AHG DB
Transcript view
H-InvDB_8.3 released on March 26, 2013.
H-Invitational ID: HIT000081651 Accession number: AK074616 Created date: 26-Mar-2013 Last modified: 20-Apr-2012
Definition: Similar to WD repeat-containing protein 47;
 
 

Transcript original information
Accession number AK074616.1
CAGE tag ID NA
EST ID NA
Clone Number HEMBB1000668
Experimental resources NBRC (NITE Biological Resource Center); HGPD (Human Gene and Protein Database); Antibody (WDR47): search for human antibodies at "BIO-kaimono.com"; Catalog (WDR47): search experimental product catalogs at "BIO-kaimono.com";
Sequence data provider NA
Annotation project NA
Length of cDNA 2375 bp (No. of exons: 8) [A:684 T:751 G:498 C:442]
Division HUM
Molecular type mRNA
Library origin Cell type NA
Tissue type whole embryo, mainly body
Developmental stage embryo, 10 weeks
Mini-G
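As a quick consistency check, the per-base counts reported under "Length of cDNA" should add up to the stated length. The following is a minimal Python sketch with the four counts from this record hard-coded; the variable names are illustrative and not part of H-InvDB. It also derives the GC content, which the record does not state directly.

```python
# Base counts copied from the "Length of cDNA" field of this record.
base_counts = {"A": 684, "T": 751, "G": 498, "C": 442}
reported_length = 2375  # bp, as reported for AK074616.1

# The four counts should sum to the reported cDNA length.
total = sum(base_counts.values())
counts_match = total == reported_length

# GC content is derived here; it is not a field of the record itself.
gc_fraction = (base_counts["G"] + base_counts["C"]) / total

print(f"sum of base counts: {total} (matches reported length: {counts_match})")
print(f"GC content: {gc_fraction:.1%}")
```

With these counts the sum equals the reported 2375 bp, and the GC content comes out at roughly 39.6%.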
Sequence quality information
CDS feature N-truncated
Kozak sequence NA
PolyA NA
Vector/adapter sequence NA
Frame shift NA
Remaining intron NA
Splice site acceptor (NAGNAG) NA
Transcript quality feature Truncation; 
Notes NA

Gene structure information
H-Inv cluster ID HIX0199788
Genomic location Chromosome 1
Location NA
Position 109512840-109538295
Strand -
Possible duplicated location(s) NA
Gene structure 8 exon(s)
Database links RefSeq NM_014969; NM_001142550; NM_001142551
Ensembl ENST00000357672; ENST00000361054; ENST00000369962; ENST00000369965; ENST00000400794; ENST00000528747; ENST00000529074; ENST00000530772; ENST00000531337
Entrez Gene Entrez Gene ID:22911
KEGG GENES KEGG GENES(22911)
GeneCard WDR47 *GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links H-DBAS; G-integra; cDNA-genome alignment

Predicted CDS information
HIP ID HIP000003890
Predicted CDS 3..1163;  386[aa];  Orientation:+3; 
Codon Adaptation Index (CAI) 0.695
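The predicted CDS coordinates and the reported protein length can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, that the range includes the terminal stop codon (the ORF listed under "Motif information" ends in `*`), and that "Orientation:+3" denotes the forward reading frame of the CDS start. These are assumptions about the record's conventions, and the function names are hypothetical.

```python
def cds_protein_length(start: int, end: int) -> int:
    """Amino acids encoded by a 1-based, inclusive CDS range.

    Assumes the range ends with a stop codon, which is not counted.
    """
    nt_length = end - start + 1
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nt_length // 3 - 1


def reading_frame(start: int) -> int:
    """Forward reading frame (1, 2 or 3) of a 1-based start position."""
    return (start - 1) % 3 + 1


# Values from the "Predicted CDS" field: 3..1163
print(cds_protein_length(3, 1163))  # 1161 nt = 387 codons, minus the stop
print(reading_frame(3))
```

Under these assumptions the range 3..1163 gives 386 amino acids in frame 3, which agrees with the "386[aa]" and "+3" values in the record.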

Motif information
ORF

length(386),orf(3:1163)
DASNIHTSTPRNPGSTNHIPFLEESPCGSQISSEHSVIKPPLGDSPGSLS
RSKGEEDDKSKKQFVCINILEDTQAVRAVAFHPAGGLYAVGSNSKTLRVC
AYPDVIDPSAHETPKQPVVRFKRNKHHKGSIYCVAWSPCGQLLATGSNDK
YVKVLPFNAETCNATGPDLEFSMHDGTIRDLAFMEGPESGGAILISAGAG
DCNIYTTDCQRGQGLHALSGHTGHILALYTWSGWMIASGSQDKTVRFWDL
RVPSCVRVVGTTFHGTGSAVASVAVDPSGRLLATGQEDSSCMLYDIRGGR
MVQSYHPHSSDVRSVRFSPGAHYLLTGSYDMKIKVTDLQGDLTKQLPIMV
VGEHKDKVIQCRWHTQDLSFLSSSADRTVTLWTYNG*
Motif position; a.a. length; InterPro ID; Name [Type]
length(39), motif(63:101) 39 IPR001680 WD40 repeat [Repeat]
length(315), motif(71:385) 315 IPR015943 WD40/YVTN repeat-like-containing domain [Domain]
length(27), motif(73:99) 27 IPR019781 WD40 repeat, subgroup [Repeat]
length(311), motif(73:383) 311 IPR011046 WD40 repeat-like-containing domain [Domain]
length(43), motif(114:156) 43 IPR001680 WD40 repeat [Repeat]
length(31), motif(124:154) 31 IPR019782 WD40 repeat 2 [Repeat]
length(263), motif(124:386) 263 IPR017986 WD40-repeat-containing domain [Domain]
length(30), motif(125:154) 30 IPR019781 WD40 repeat, subgroup [Repeat]
length(45), motif(164:208) 45 IPR001680 WD40 repeat [Repeat]
length(39), motif(211:249) 39 IPR001680 WD40 repeat [Repeat]
length(37), motif(213:249) 37 IPR019781 WD40 repeat, subgroup [Repeat]
length(41), motif(218:258) 41 IPR019782 WD40 repeat 2 [Repeat]
length(15), motif(236:250) 15 IPR020472 G-protein beta WD-40 repeat [Repeat]
length(15), motif(236:250) 15 IPR019775 WD40 repeat, conserved site [Conserved_site]
length(40), motif(256:295) 40 IPR001680 WD40 repeat [Repeat]
length(35), motif(261:295) 35 IPR019781 WD40 repeat, subgroup [Repeat]
length(42), motif(263:304) 42 IPR019782 WD40 repeat 2 [Repeat]
length(40), motif(298:337) 40 IPR001680 WD40 repeat [Repeat]
length(34), motif(302:335) 34 IPR019781 WD40 repeat, subgroup [Repeat]
length(35), motif(305:339) 35 IPR019782 WD40 repeat 2 [Repeat]
length(15), motif(324:338) 15 IPR020472 G-protein beta WD-40 repeat [Repeat]
length(40), motif(344:383) 40 IPR001680 WD40 repeat [Repeat]
length(35), motif(348:382) 35 IPR019781 WD40 repeat, subgroup [Repeat]
length(36), motif(351:386) 36 IPR019782 WD40 repeat 2 [Repeat]
length(15), motif(370:384) 15 IPR020472 G-protein beta WD-40 repeat [Repeat]
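Each motif row above repeats the same information twice: a `length(...)` value and a `motif(start:end)` range whose span should equal it. The following Python sketch parses rows in the layout shown above and checks that consistency. The regular expression and the function name are illustrative assumptions written for this page's text layout, not an H-InvDB or InterPro API.

```python
import re

# Matches rows such as:
#   length(39), motif(63:101) 39 IPR001680 WD40 repeat [Repeat]
ROW = re.compile(
    r"length\((\d+)\), motif\((\d+):(\d+)\) (\d+) (IPR\d+) (.+) \[(\w+)\]"
)


def parse_motif_row(line: str) -> dict:
    """Parse one motif row into a dict; raise ValueError on unexpected layout."""
    m = ROW.fullmatch(line.strip())
    if m is None:
        raise ValueError(f"unrecognised motif row: {line!r}")
    length, start, end, aa_length = (int(m.group(i)) for i in range(1, 5))
    return {
        "length": length,
        "start": start,
        "end": end,
        "aa_length": aa_length,
        "interpro_id": m.group(5),
        "name": m.group(6),
        "type": m.group(7),
    }


# Three rows copied from the table above.
rows = [
    "length(39), motif(63:101) 39 IPR001680 WD40 repeat [Repeat]",
    "length(315), motif(71:385) 315 IPR015943 WD40/YVTN repeat-like-containing domain [Domain]",
    "length(15), motif(236:250) 15 IPR019775 WD40 repeat, conserved site [Conserved_site]",
]

for row in rows:
    motif = parse_motif_row(row)
    span = motif["end"] - motif["start"] + 1  # inclusive coordinates
    consistent = span == motif["length"] == motif["aa_length"]
    print(motif["interpro_id"], motif["start"], motif["end"], consistent)
```

The inclusive span (end minus start plus one) is the convention that makes the listed lengths agree with the coordinates, for example 101 - 63 + 1 = 39.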

Gene function information
H-Inv ID HIT000081651
H-Inv cluster ID HIX0199788
Accession number AK074616.1
CAGE tag ID NA
EST ID NA
Transcript feature NO;
Coding potential Protein coding;
Definition Similar to WD repeat-containing protein 47;
Similarity category Category: Similar to known protein (Category II).
Similar to a known Homo sapiens (Human) protein (O94967) [Identity/coverage = 100.0%/42.0%].
Experimental evidence Protein evidence
PubMed ID 10048485; 14702039; 15489334; 16710414; 18220336; 18669648
Gene family/group H-Inv gene family/group ID NA
Gene family/group name NA
Evidence motif (InterPro) ID NA
Gene symbol/name HGNC symbol WDR47
HGNC aliases NA
HGNC name WD repeat domain 47
DDBJ NA
UniProt WDR47
EC number NA
GGDB (GlycoGene Database)
Gene symbol NA
Family NA
Designation NA
Expression NA
KEGG metabolic pathway NA
Protein-protein interaction (PPI) H-Inv protein ID HIP000003890
No. of interactions NA
Interaction partner(s) NA
BIND NA
DIP NA
MINT NA
HPRD NA
IntAct NA
Database links RefSeq NM_014969; NM_001142550; NM_001142551
Ensembl ENST00000357672; ENST00000361054; ENST00000369962; ENST00000369965; ENST00000400794; ENST00000528747; ENST00000529074; ENST00000530772; ENST00000531337
Entrez Gene Entrez Gene ID:22911
KEGG GENES KEGG GENES(22911)
GeneCard WDR47 *GeneCards is provided free to academic non-profit institutions.
etc H-GOLD (Human-Gene diversity Of Life-style related Diseases)
Curation status Auto-annotated
Notes NA
Related H-InvDB links Gene family; Similarity Search Tool; TACT
fRNAdb (The functional RNA database): fRNAdb entries that overlap H-InvDB transcripts, based on genomic location.
NA


Subcellular localization information  Last modified:20-Apr-2012
WoLF PSORT nuclear;  cytosol;  extracellular; 
Target P Other
SOSUI soluble protein
TMHMM soluble protein
PTS1 Not targeted
Related H-InvDB links LIFEdb;

Protein structure information (GTOP) Last modified:20-Apr-2012
Start End PDB_ID E-value Identity Coverage SCOP_ID
29 210 1u4cA 3e-20 13.3 181/322 b.69.4.2
Related H-InvDB links GTOP

Gene expression information Last modified:20-Apr-2012
Tissue-specific expression NA
Probe information
AceGene AGhsA240912; 
Affymetrix GeneChip
HG-Focus NA
HG-U133 203855_at; 
HG-U133A 203855_at; 
HG-U133A_2 203855_at; 
HG-U133B NA
HG-U133_Plus_2 203855_at; 
HG-U95 35720_at; 
HG-U95A 35720_at; 
HG-U95B NA
HG-U95C NA
HG-U95D NA
HG-U95E NA
HG-U95Av2 NA
HuEx-1_0 2350464;  2426841;  2426842;  2426843;  2426844;  2426846;  2426848;  2426849;  2426850;  2426852;  2426854;  2426857;  4052680;  4052682; 
HuGeneFL NA
Agilent Human 1A Oligo Microarray:PGID215 A_23_P23748; 
Whole Human Genome Oligo Microarray:PGID247 A_23_P23748; 
Related H-InvDB links H-ANGEL; DNAProbeLocator

Disease/pathology information Last modified:20-Apr-2012
Disease relation Disease name:NA
Related information in OMIM OMIM ID:NA Title:NA
Co-localized orphan diseases OMIM ID: 115665; 116600; 155600; 600975; 605225; 605606; 606788; 606852; 606928; 607317; 607671; 608543; 608553; 608995; 610320; 612367; 612596
Disease related mutation NA
Literature-Extracted GENe-Disease Associations (LEGENDA) Gene name Entrez Gene ID:(22911)
Disease Entrez Gene ID:(22911)
Substance Entrez Gene ID:(22911)
Related H-InvDB links DiseaseInfo Viewer; LEGENDA

Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
 Single Nucleotide Polymorphism (SNP) and indel
Location Variation dbSNP ID Strand CDS/UTR Translation
7 .. 7 C/T rs113434051 - CDS Nonsynonymous[Ala2Val]
260 .. 260 T/C rs41299563 - CDS Synonymous[Gly86Gly]
609 .. 609 A/C rs1538137 - CDS Nonsynonymous[Asn203His]
623 .. 623 C/T rs76370963 - CDS Synonymous[Thr207Thr]
661 .. 661 G/T rs74576781 - CDS Nonsynonymous[Gly220Val]
958 ^ 959 -/C rs35988108 - CDS
1525 .. 1525 T/G rs112609892 - 3'UTR
1718 .. 1718 A/G rs507776 - 3'UTR
1936 .. 1936 T/A rs114053618 - 3'UTR
1941 .. 1941 T/C rs11803800 - 3'UTR
2361 .. 2361 G/C rs12068536 - 3'UTR
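The "Translation" column can be reproduced from the cDNA position and the predicted CDS range (3..1163). The sketch below maps a 1-based cDNA position to a codon number, assuming the SNP locations are given in transcript coordinates and that residue numbering starts at the first codon of the predicted CDS. Both are assumptions inferred from this record, and the function name is hypothetical.

```python
from typing import Optional

CDS_START, CDS_END = 3, 1163  # from the "Predicted CDS" field


def codon_number(cdna_pos: int,
                 cds_start: int = CDS_START,
                 cds_end: int = CDS_END) -> Optional[int]:
    """Return the 1-based codon number for a cDNA position, or None outside the CDS."""
    if not cds_start <= cdna_pos <= cds_end:
        return None  # position lies in a UTR
    return (cdna_pos - cds_start) // 3 + 1


# cDNA positions of the coding SNPs listed above, plus one 3'UTR position.
for pos in (7, 260, 609, 623, 661, 1525):
    print(pos, codon_number(pos))
```

Under these assumptions, positions 7, 260, 609, 623 and 661 map to codons 2, 86, 203, 207 and 220, matching the residue numbers in Ala2Val, Gly86Gly, Asn203His, Thr207Thr and Gly220Val, while position 1525 falls outside the CDS, consistent with its 3'UTR annotation.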
 Microsatellite (Short Tandem Repeat, STR)
No data available
 Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
 Repeat
No data available
Database links H-GOLD (Human-Gene diversity Of Life-style related Diseases)
Related H-InvDB links VaryGene; Repeat mask viewer