H-InvDB x AHG DB
Transcript view
H-InvDB_8.3 released on March 26, 2013.
Search by for Advanced Search
Home Quick guide Navi BLAST Site map Download Contact us Help
H-Invitational ID: HIT000384911 Accession number: AK225781 Created date: 26-Mar-2013 Last modified: 26-Mar-2013
Definition: WD repeat-containing protein 47 isoform 1.
 
 

Transcript original information
Accession number AK225781.1
CAGE tag ID NA
EST ID NA
Clone Number FCC132C06
Experimental resources NBRC: NITE Biological Resource Center  NBRC ; HGPD: Human Gene and Protein Database HGPD ; Antibody: searching human antibodies at "BIO-kaimono.com" Antibody (WDR47) ; Catalog: searching experimental product catalogs at "BIO-kaimono.com" Catalog (WDR47);
Sequence data provider NA
Annotation project NA
Length of cDNA 4154[bp] (No. of exon:15)[A:1220 T:1230 G:884 C:820]
Devision HUM
Molecular type mRNA
Library origin Cell type NA
Tissue type brain
Develpmental stage NA
Mini-G
Sequence quality information
CDS feature N-truncated
Kozak sequence NA
PolyA Site: 4133(+) Signal: 4112-4116(+)
Vector/adapter sequence NA
Frame shift NA
Remaining intron NA
Splice site acceptor (NAGNAG) AAGAAG; 
Transcript quality feature NA
Notes NA

Gene structure information  G-integra H-DBAS cDNA-genome alignment
H-Inv cluster ID HIX0199788
Genomic location  G-integra Help Chromosome 1
Location NA
Position 109512839- 109584608
Strand -
Possible duplicated location(s) NA
Gene structure 15 exon(s)
Database links RefSeq NM_014969NM_001142550NM_001142551
Ensembl ENST00000357672ENST00000361054ENST00000369962ENST00000369965ENST00000400794ENST00000528747ENST00000529074ENST00000530772ENST00000531337
Entrez Gene Entrez Gene ID:22911
KEGG GENES KEGG GENES(22911)
GeneCard GeneCardWDR47*GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links H-DBASH-DBAS G-integraG-integra cDNA-genome alignmentcDNA-genome alignment

Predicted CDS information
HIP ID HIP000353499
Predicted CDS 1130..2920;  596[aa];  Orientation:+2; 
Codon Adaptation Index (CAI). 0.697

Motif information
ORF

length(596),orf(1130:2920)
PVMIREFQTLETKLLQCHTPLLTSIIQGYKTSVEVSCLRIQNVTVFTKNP
LSEVIHLLMHRGLSAVKSWARVQFQKKSLQMEHRIQDQLNKKKNELRDST
EQFQEYYRQRLRYQQHLEQKEQQRQIYQQMLLEGGVNQEDGPDQQQNLTE
QFLNRSIQKLGELNIGMDGLGNEVSALNQQCNGSKGNGSNGSSVTSFTTP
PQDSSQRLTHDASNIHTSTPRNPGSTNHIPFLEESPCGSQISSEHSVIKP
PLGDSPGSLSRSKGEEDDKSKKQFVCINILEDTQAVRAVAFHPAGGLYAV
GSNSKTLRVCAYPDVIDPSAHETPKQPVVRFKRNKHHKGSIYCVAWSPCG
QLLATGSNDKYVKVLPFNAETCNATGPDLEFSMHDGTIRDLAFMEGPESG
GAILISAGAGDCNIYTTDCQRGQGLHALSGHTGHILALYTWSGWMIASGS
QDKTVRFWDLRVPSCVRVVGTTFHGTGSAVASVAVDPSGRLLATGQEDSS
CMLYDIRGGRMVQSYHPHSSDVRSVRFSPGAHYLLTGSYDMKIKVTDLQG
DLTKQLPIMVVGEHKDKVIQCRWHTQDLSFLSSSADRTVTLWTYNG*
a.a.
length
InterPro Name
length(39), motif(273:311) 39 IPR001680 WD40 repeat [Repeat]
length(315), motif(281:595) 315 IPR015943 WD40/YVTN repeat-like-containing domain [Domain]
length(27), motif(283:309) 27 IPR019781 WD40 repeat, subgroup [Repeat]
length(311), motif(283:593) 311 IPR011046 WD40 repeat-like-containing domain [Domain]
length(43), motif(324:366) 43 IPR001680 WD40 repeat [Repeat]
length(31), motif(334:364) 31 IPR019782 WD40 repeat 2 [Repeat]
length(263), motif(334:596) 263 IPR017986 WD40-repeat-containing domain [Domain]
length(30), motif(335:364) 30 IPR019781 WD40 repeat, subgroup [Repeat]
length(45), motif(374:418) 45 IPR001680 WD40 repeat [Repeat]
length(39), motif(421:459) 39 IPR001680 WD40 repeat [Repeat]
length(37), motif(423:459) 37 IPR019781 WD40 repeat, subgroup [Repeat]
length(41), motif(428:468) 41 IPR019782 WD40 repeat 2 [Repeat]
length(15), motif(446:460) 15 IPR019775 WD40 repeat, conserved site [Conserved_site]
length(40), motif(466:505) 40 IPR001680 WD40 repeat [Repeat]
length(35), motif(471:505) 35 IPR019781 WD40 repeat, subgroup [Repeat]
length(42), motif(473:514) 42 IPR019782 WD40 repeat 2 [Repeat]
length(40), motif(508:547) 40 IPR001680 WD40 repeat [Repeat]
length(34), motif(512:545) 34 IPR019781 WD40 repeat, subgroup [Repeat]
length(35), motif(515:549) 35 IPR019782 WD40 repeat 2 [Repeat]
length(40), motif(554:593) 40 IPR001680 WD40 repeat [Repeat]
length(35), motif(558:592) 35 IPR019781 WD40 repeat, subgroup [Repeat]
length(36), motif(561:596) 36 IPR019782 WD40 repeat 2 [Repeat]

Gene function information  Gene family PPI viewer Similarity Search Tool TACT
H-Inv ID HIT000384911
H-Inv cluster ID Locus viewHIX0199788
Accession number AK225781.1
CAGE tag ID NA
EST ID NA
Transcript feature  Help NO;  Splicing isoformSplicing isoform
Coding potential  Help Protein coding; 
Definition WD repeat-containing protein 47 isoform 1.
Similarity category  Help Category: Identical to known human protein(Category I).
Identical to known human protein (NP_001136022)  [Identity/coverage = 100.0%/100.0%] to Homo sapiens protein.
Experimental evidence Protein evidence
PubMed ID NA
Gene family/group Gene family H-Inv gene family/group ID NA
Gene family/group name NA
Evidence motif (InterPro) ID NA
Gene symbol/name HGNC symbol WDR47
HGNC aliases NA
HGNC name WD repeat domain 47
DDBJ NA
UniProt NA
EC number NA
GGDB
(GlycoGene Database)
Gene symbol NA
Familly NA
Designation NA
Expression NA
KEGG metabolic pathway NA
Protein-protein interaction (PPI) PPI viewer H-Inv protein ID HIP000353499
No. of interaction NA
Interaction partner(s) NA
BIND NA
DIP NA
MINT NA
HPRD NA
IntAct NA
Database links RefSeq NM_014969NM_001142550NM_001142551
Ensembl ENST00000357672ENST00000361054ENST00000369962ENST00000369965ENST00000400794ENST00000528747ENST00000529074ENST00000530772ENST00000531337
Entrez Gene Entrez Gene ID:22911
KEGG GENES KEGG GENES(22911)
GeneCard GeneCardWDR47*GeneCards is provided free to academic non-profit institutions.
etc H-GOLDHuman-Gene diversity Of Life-style related Diseases
Curation status Auto-annotated
Notes NA
Related H-InvDB links Gene familyGene family;  Similarity Search ToolSimilarity Search Tool;  TACTTACT
fRNAdb (The functional RNA database) : overlapping fRNAdb entries with H-InvDB transcripts based on the location of the genome. 
NA


Subcellular localization information  Last modified:26-Mar-2013
WoLF PSORT nuclear;  cytosol; 
Target P Other
SOSUI soluble protein
TMHMM soluble protein
PTS1 Not targeted
Related H-InvDB links LIFEdb LIFEdb; 
JRE-1.4.0 or later is required. Download JRE at Sun's web site.

Protein structure information (GTOP) GTOP Last modified:26-Mar-2013
Start End PDB_ID E-value Identity Coverage SCOP_ID
13 76 1uujA 3e-11 19.7 61/76 a.221.1.1
602 923 1nr0A1 4e-29 16.2 296/311 b.69.4.1
Related H-InvDB links GTOP GTOP

Disease/pathology information  DiseaseInfo Viewer LEGENDA Last modified:26-Mar-2013
Disease relation Disease name:NA
Related information in OMIM OMIM ID:NA Title:NA
Co-localized orphan diseases OMIM ID:  115665116600155600600975605225605606606788606852606928607317607671608543608553608995610320612367612596
Disease related mutation NA
Literature-Extracted GENe-Disease Associations (LEGENDA) LEGENDA Gene name Entrez Gene ID:(22911)
Disease Entrez Gene ID:(22911)
Substance Entrez Gene ID:(22911)
Related H-InvDB links DiseaseInfo ViewerDiseaseInfo ViewerLEGENDALEGENDA

Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information  VaryGene Repeat mask viewer
 Single Nucleotide Polymorphism (SNP) and indel  VaryGene
Location Variation dbSNP ID Strand CDS/UTR Translation
334 .. 334 T/C rs112410635 - 5'UTR
394 .. 394 C/T rs55730953 - 5'UTR
1328 .. 1328 A/G rs79997809 - CDS Nonsynonymous[Lys67Glu]
1764 .. 1764 C/T rs113434051 - CDS Nonsynonymous[Ala212Val]
2017 .. 2017 T/C rs41299563 - CDS Synonymous[Gly296Gly]
2366 .. 2366 A/C rs1538137 - CDS Nonsynonymous[Asn413His]
2380 .. 2380 C/T rs76370963 - CDS Synonymous[Thr417Thr]
2418 .. 2418 G/T rs74576781 - CDS Nonsynonymous[Gly430Val]
2715 ^ 2716 -/C rs35988108 - CDS
3282 .. 3282 T/G rs112609892 - 3'UTR
3475 .. 3475 A/G rs507776 - 3'UTR
3693 .. 3693 T/A rs114053618 - 3'UTR
3698 .. 3698 T/C rs11803800 - 3'UTR
4118 .. 4118 G/C rs12068536 - 3'UTR
 Microsatellite (Short Tandem Repeat, STR)
No data available
 Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
 Repeat  Repeat mask viewer
No data available
Database links H-GOLDHuman-Gene diversity Of Life-style related Diseases(H-GOLD)
Related H-InvDB links VaryGeneVaryGene Repeat mask viewerRepeat Mask Viewer