H-InvDB x AHG DB
Transcript view
H-InvDB_9.0 released on May 27, 2015.
H-Invitational ID: HIT000384911
Accession number: AK225781
Created date: 26-Mar-2013
Last modified: 27-May-2015
Definition: Similar to WD repeat-containing protein 47 isoform 1.
 
 

Transcript original information
Accession number AK225781.1
CAGE tag ID NA
EST ID NA
Clone Number FCC132C06
Experimental resources NBRC: NITE Biological Resource Center; HGPD: Human Gene and Protein Database; Antibody: search for human antibodies at "BIO-kaimono.com" (WDR47); Catalog: search for experimental product catalogs at "BIO-kaimono.com" (WDR47)
Sequence data provider NA
Annotation project NA
Length of cDNA 4153[bp] (No. of exon:15)[A:1219 T:1230 G:884 C:820]
Division HUM
Molecular type mRNA
Library origin Cell type NA
Tissue type brain
Developmental stage NA
Mini-G
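The per-base counts in the "Length of cDNA" field can be cross-checked against the stated length, and a GC fraction can be derived from them. The following is a minimal sketch using the counts copied from this record; the names `base_counts` and `gc_fraction` are illustrative and are not part of H-InvDB.

```python
# Base counts copied from the "Length of cDNA" field of this record.
base_counts = {"A": 1219, "T": 1230, "G": 884, "C": 820}
reported_length = 4153  # [bp], as stated in the record


def gc_fraction(counts):
    """Return the fraction of G and C bases among all counted bases."""
    total = sum(counts.values())
    return (counts["G"] + counts["C"]) / total


total_bases = sum(base_counts.values())
# The four counts should add up to the reported cDNA length.
print(total_bases == reported_length)
print(round(gc_fraction(base_counts), 3))
```

If the sum did not match the reported length, ambiguous bases (for example N) absent from the four counts would be one possible explanation.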
Sequence quality information
CDS feature Complete CDS
Kozak sequence NA
PolyA Site: 4133(+) Signal: 4112-4116(+)
Vector/adapter sequence NA
Frame shift 1410-1410[bp] Insertion(A);
Remaining intron NA
Splice site acceptor (NAGNAG) AAGAAG; 
Transcript quality feature NA
Notes NA

Gene structure information
H-Inv cluster ID HIX0199788
Genomic location Chromosome 1
Location 1p13.3
Position 109512839-109584608
Strand -
Possible duplicated location(s) NA
Gene structure 15 exon(s)
Database links RefSeq NA
Ensembl NA
Entrez Gene Entrez Gene ID:22911
KEGG GENES KEGG GENES(22911)
GeneCard WDR47 (*GeneCards is provided free to academic non-profit institutions.)
Related H-InvDB links H-DBAS; G-integra; cDNA-genome alignment

Predicted CDS information
HIP ID HIP000174754
Predicted CDS 136..2919;  927[aa];  Orientation:+1; 
Codon Adaptation Index (CAI) 0.696
Database links RefSeq NP_001136022
UniProt NA
CCDS CCDS44186
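The predicted CDS coordinates and the reported protein length are related by simple arithmetic: an inclusive span of 136..2919 covers 2784 bases, or 928 codons, the last of which is the stop codon. A small sketch of that check follows; it assumes 1-based inclusive coordinates and a complete CDS ending in a stop codon, and the function name is hypothetical.

```python
def cds_to_protein_length(start, end):
    """Amino-acid count for a complete CDS given 1-based inclusive coordinates.

    Assumes the CDS ends with a stop codon, which is not counted as a residue.
    """
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_bases // 3 - 1


# Coordinates copied from "Predicted CDS 136..2919" in this record.
cds_start, cds_end = 136, 2919
print(cds_to_protein_length(cds_start, cds_end))
```

The result can be compared with the 927 [aa] stated in the record and with the `length(927)` header of the ORF listing.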

Motif information
ORF

length(927),orf(136:2919)
MTAEETVNVKEVEIIKLILDFLNSKKLHISMLALEKESGVINGLFSDDML
FLRQLILDGQWDEVLQFIQPLECMEKFDKKRFRYIILKQKFLEALCVNNA
MSAEDEPQHVRFLFLKLEFTMQEAVQCLHALEEYCPSKDDYSKLCLLLTL
PRLTNHAEFKDWNPSTARVHCFEEACVMVAEFIPADRKLSEAGFKASNNR
LFQLVMKGLLYECCVEFCQSKATGEEITESEVLLGIDLLCGNGCDDLDLS
LLSWLQNLPSSVFSCAFEQKMLNIHVDKLLKPTKAAYADLLTPLISKLSP
YPSSPMRRPQSADAYMTRSLNPALDGLTCGLTSHDKRISDLGNKTSPMSH
SFANFHYPGVQNLSRSLMLENTECHSIYEESPERSDTPVDAQRPIGSEIL
GQSSVSEKEPANGAQNPGPAKQEKNELRDSTEQFQEYYRQRLRYQQHLEQ
KEQQRQIYQQMLLEGGVNQEDGPDQQQNLTEQFLNRSIQKLGELNIGMDG
LGNEVSALNQQCNGSKGNGSNGSSVTSFTTPPQDSSQRLTHDASNIHTST
PRNPGSTNHIPFLEESPCGSQISSEHSVIKPPLGDSPGSLSRSKGEEDDK
SKKQFVCINILEDTQAVRAVAFHPAGGLYAVGSNSKTLRVCAYPDVIDPS
AHETPKQPVVRFKRNKHHKGSIYCVAWSPCGQLLATGSNDKYVKVLPFNA
ETCNATGPDLEFSMHDGTIRDLAFMEGPESGGAILISAGAGDCNIYTTDC
QRGQGLHALSGHTGHILALYTWSGWMIASGSQDKTVRFWDLRVPSCVRVV
GTTFHGTGSAVASVAVDPSGRLLATGQEDSSCMLYDIRGGRMVQSYHPHS
SDVRSVRFSPGAHYLLTGSYDMKIKVTDLQGDLTKQLPIMVVGEHKDKVI
QCRWHTQDLSFLSSSADRTVTLWTYNG*
a.a. length  InterPro  Name
length(33), motif(10:42) 33 IPR006594 LIS1 homology motif [Domain]
length(58), motif(45:102) 58 IPR006595 CTLH, C-terminal LisH motif [Domain]
length(39), motif(604:642) 39 IPR001680 WD40 repeat [Repeat]
length(314), motif(610:923) 314 IPR017986 WD40-repeat-containing domain [Domain]
length(315), motif(612:926) 315 IPR015943 WD40/YVTN repeat-like-containing domain [Domain]
length(27), motif(614:640) 27 IPR001680 WD40 repeat [Repeat]
length(43), motif(655:697) 43 IPR001680 WD40 repeat [Repeat]
length(263), motif(665:927) 263 IPR017986 WD40-repeat-containing domain [Domain]
length(31), motif(665:695) 31 IPR001680 WD40 repeat [Repeat]
length(30), motif(666:695) 30 IPR001680 WD40 repeat [Repeat]
length(45), motif(705:749) 45 IPR001680 WD40 repeat [Repeat]
length(39), motif(752:790) 39 IPR001680 WD40 repeat [Repeat]
length(37), motif(754:790) 37 IPR001680 WD40 repeat [Repeat]
length(41), motif(759:799) 41 IPR001680 WD40 repeat [Repeat]
length(15), motif(777:791) 15 IPR019775 WD40 repeat, conserved site [Conserved_site]
length(40), motif(797:836) 40 IPR001680 WD40 repeat [Repeat]
length(35), motif(802:836) 35 IPR001680 WD40 repeat [Repeat]
length(42), motif(804:845) 42 IPR001680 WD40 repeat [Repeat]
length(40), motif(839:878) 40 IPR001680 WD40 repeat [Repeat]
length(34), motif(843:876) 34 IPR001680 WD40 repeat [Repeat]
length(35), motif(846:880) 35 IPR001680 WD40 repeat [Repeat]
length(40), motif(885:924) 40 IPR001680 WD40 repeat [Repeat]
length(35), motif(889:923) 35 IPR001680 WD40 repeat [Repeat]
length(36), motif(892:927) 36 IPR001680 WD40 repeat [Repeat]

Gene function information
H-Inv ID HIT000384911
H-Inv cluster ID HIX0199788
Accession number AK225781.1
CAGE tag ID NA
EST ID NA
Transcript feature NO; Splicing isoform
Coding potential Protein coding;
Definition Similar to WD repeat-containing protein 47 isoform 1.
Similarity category Category: Similar to known protein (Category II).
Similar to known protein (NP_001136022)  [Identity/coverage = 100.0%/100.0%] to Homo sapiens protein.
Experimental evidence Protein evidence
PubMed ID NA
Gene family/group Gene family H-Inv gene family/group ID NA
Gene family/group name NA
Evidence motif (InterPro) ID NA
Gene symbol/name HGNC symbol WDR47
HGNC aliases NA
HGNC name WD repeat domain 47
DDBJ NA
UniProt NA
EC number NA
GGDB
(GlycoGene Database)
Gene symbol NA
Family NA
Designation NA
Expression NA
KEGG metabolic pathway NA
Protein-protein interaction (PPI) H-Inv protein ID HIP000174754
No. of interaction NA
Interaction partner(s) NA
BIND NA
DIP NA
MINT NA
HPRD NA
IntAct NA
Database links RefSeq NA
Ensembl NA
Entrez Gene Entrez Gene ID:22911
KEGG GENES KEGG GENES(22911)
GeneCard WDR47 (*GeneCards is provided free to academic non-profit institutions.)
etc H-GOLD: Human-Gene diversity Of Life-style related Diseases
Curation status Auto-annotated
Notes NA
Related H-InvDB links Gene family; Similarity Search Tool; TACT
fRNAdb (The functional RNA database): fRNAdb entries that overlap H-InvDB transcripts, based on genomic location.
NA

Gene ontology information
Molecular function protein binding (GO:0005515); 

Subcellular localization information  Last modified:27-May-2015
WoLF PSORT nuclear;  cytosol; 
TargetP Other
SOSUI soluble protein
TMHMM soluble protein
PTS1 Not targeted
Related H-InvDB links LIFEdb

Protein structure information (GTOP) Last modified:27-May-2015
Start End PDB_ID E-value Identity Coverage SCOP_ID
13 76 1uujA 3e-11 19.7 61/76 a.221.1.1
602 923 1nr0A1 4e-29 16.2 296/311 b.69.4.1
Related H-InvDB links GTOP

Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
 Single Nucleotide Polymorphism (SNP) and indel
Location Variation dbSNP ID Strand CDS/UTR Translation
134 .. 134 A/G rs145802839 - 5'UTR
334 .. 334 T/C rs112410635 - CDS Nonsynonymous[Phe67Leu]
371 .. 371 A/G rs184073827 - CDS Nonsynonymous[Lys79Arg]
382 .. 382 C/T rs140178030 - CDS Nonsynonymous[Arg83Cys]
394 .. 394 C/T rs55730953 - CDS Synonymous[Leu87Leu]
405 .. 405 G/A rs200850819 - CDS Synonymous[Lys90Lys]
434 .. 434 C/T rs143131789 - CDS Nonsynonymous[Ala100Val]
734 .. 734 G/A rs150518582 - CDS Nonsynonymous[Arg200His]
866 .. 866 G/A rs201047851 - CDS Nonsynonymous[Cys244Tyr]
870 .. 870 T/C rs147738484 - CDS Synonymous[Asp245Asp]
876 .. 876 G/A rs145639482 - CDS Synonymous[Leu247Leu]
993 .. 993 A/G rs181292753 - CDS Synonymous[Ala286Ala]
1012 .. 1012 C/T rs139429053 - CDS Nonsynonymous[Pro293Ser]
1107 .. 1107 A/C rs146207643 - CDS Nonsynonymous[Leu324Phe]
1155 .. 1155 C/T rs189692872 - CDS Synonymous[Asp340Asp]
1188 .. 1188 C/T rs143800309 - CDS Synonymous[Ser351Ser]
1217 .. 1217 A/T rs184515595 - CDS Nonsynonymous[Gln361Leu]
1285 .. 1285 C/T rs200519984 - CDS AA-STOP[Arg384*]
1308 .. 1308 A/G rs199535710 - CDS Synonymous[Ala391Ala]
1327 ^ 1328 -/G rs145298974 - CDS
1379 .. 1379 A/G rs188128837 - CDS Nonsynonymous[Gln415Arg]
1448 .. 1448 A/G rs181475108 - CDS Nonsynonymous[Tyr438Cys]
1512 .. 1512 A/G rs200572013 - CDS Synonymous[Gln459Gln]
1611 .. 1611 T/G rs150410604 - CDS Synonymous[Gly492Gly]
1614 .. 1614 A/G rs143316105 - CDS Synonymous[Glu493Glu]
1652 .. 1652 C/T rs141359680 - CDS Nonsynonymous[Ser506Leu]
1724 .. 1724 C/T rs144487132 - CDS Nonsynonymous[Thr530Ile]
1763 .. 1763 C/T rs113434051 - CDS Nonsynonymous[Ala543Val]
1789 .. 1789 C/T rs141474361 - CDS Nonsynonymous[Arg552Cys]
1847 .. 1847 A/T rs150746223 - CDS Nonsynonymous[Gln571Leu]
1865 .. 1865 C/T rs140523650 - CDS Nonsynonymous[Ser577Leu]
1934 .. 1934 A/G rs202151026 - CDS Nonsynonymous[Lys600Arg]
1943 .. 1943 A/C rs199887849 - CDS Nonsynonymous[Lys603Thr]
2016 .. 2016 T/C rs41299563 - CDS Synonymous[Gly627Gly]
2117 .. 2117 G/A rs141008401 - CDS Nonsynonymous[Arg661His]
2163 .. 2163 C/A rs138774417 - CDS Synonymous[Ala676Ala]
2247 .. 2247 C/T rs151087323 - CDS Synonymous[Asn704Asn]
2248 .. 2248 G/C rs141314928 - CDS Nonsynonymous[Ala705Pro]
2252 .. 2252 C/T rs145842744 - CDS Nonsynonymous[Thr706Ile]
2365 .. 2365 A/C rs1538137 - CDS Nonsynonymous[Asn744His]
2379 .. 2379 C/T rs76370963 - CDS Synonymous[Thr748Thr]
2417 .. 2417 G/T rs74576781 - CDS Nonsynonymous[Gly761Val]
2559 .. 2559 C/T rs189774030 - CDS Synonymous[Gly808Gly]
2560 .. 2560 A/G rs142485597 - CDS Nonsynonymous[Ser809Gly]
2565 .. 2565 A/G rs150602693 - CDS Synonymous[Ala810Ala]
2688 .. 2688 T/C rs185973202 - CDS Synonymous[Ser851Ser]
2714 ^ 2715 -/C rs35988108 - CDS
2769 .. 2769 C/T rs140431320 - CDS Synonymous[Asp878Asp]
2772 .. 2772 A/G rs141843546 - CDS Synonymous[Leu879Leu]
2856 .. 2856 G/A rs139179400 - CDS Synonymous[Gln907Gln]
2910 .. 2910 C/A rs150803164 - CDS AA-STOP[Tyr925*]
3144 .. 3144 T/A rs187261735 - 3'UTR
3145 .. 3145 C/A rs182608349 - 3'UTR
3194 .. 3194 A/G rs139950222 - 3'UTR
3218 .. 3218 G/A rs189925843 - 3'UTR
3281 .. 3281 T/G rs112609892 - 3'UTR
3382 .. 3382 C/G rs185990741 - 3'UTR
3406 .. 3406 C/A rs144156480 - 3'UTR
3474 .. 3474 A/G rs507776 - 3'UTR
3501 .. 3501 G/A rs182805601 - 3'UTR
3692 .. 3692 T/A rs114053618 - 3'UTR
3697 .. 3697 T/C rs11803800 - 3'UTR
4117 .. 4117 G/C rs12068536 - 3'UTR
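The residue numbers in the Translation column can be reproduced from the cDNA positions in the Location column and the CDS start. The sketch below assumes 1-based cDNA coordinates and the CDS span 136..2919 given in this record; `codon_number` is a hypothetical helper, not an H-InvDB function.

```python
CDS_START = 136   # from "Predicted CDS 136..2919" in this record
CDS_END = 2919


def codon_number(cdna_pos):
    """Map a 1-based cDNA position inside the CDS to a 1-based codon number."""
    if not CDS_START <= cdna_pos <= CDS_END:
        raise ValueError("position lies outside the CDS (5'UTR or 3'UTR)")
    return (cdna_pos - CDS_START) // 3 + 1


# (cDNA position, residue number reported in the table above)
examples = [(334, 67), (371, 79), (1285, 384), (2910, 925)]
for pos, reported in examples:
    print(pos, codon_number(pos), codon_number(pos) == reported)
```

Positions below 136 or above 2919 raise an error in this sketch, which matches the table's 5'UTR and 3'UTR labels for entries such as 134 and 3144.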
 Microsatellite (Short Tandem Repeat, STR)
No data available
 Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
 Repeat
No data available
Database links H-GOLD: Human-Gene diversity Of Life-style related Diseases
Related H-InvDB links VaryGene; Repeat mask viewer