H-InvDB_8.3 released on March 26, 2013.
Search by
Keyword
H-Inv ID (HIT)
H-Inv cluster ID (HIX)
H-Inv protein ID (HIP)
H-Inv gene family/group (HIF)
Accession number
Chromosome number
Chromosome band
Definition*
Data source ID
---
CCDS ID
dbSNP ID (rs number)
EC number
Ensembl ID
EntrezGene ID
FR ID
FR Accession number
GO ID
GO name*
HGNC gene symbol
HGNC gene name*
InterPro ID
InterPro name*
OMIM ID
OMIM title*
Pathway ID
Pathway name*
RefSeq (gene) ID
RefSeq (protein) ID
SCOP ID
UniProt
for
Advanced Search
Home
Quick guide
Navi
BLAST
Site map
Download
Contact us
Help
Locus view
Protein view
G-integra
DiseaseInfo Viewer
H-ANGEL
Evola
PPI view
Gene Family/Group
Hyperlink MS
H-Invitational ID:
HIT000081651
Accession number:
AK074616
Created date:
26-Mar-2013
Last modified:
20-Apr-2012
Definition:
Similar to WD repeat-containing protein 47;
Select format
Flat file
XML file
Nucleotide sequence fasta
Protein sequence fasta
Transcript original information
Accession number
AK074616.1
CAGE tag ID
NA
EST ID
NA
Clone Number
HEMBB1000668
Experimental resources
NBRC
;
HGPD
;
Antibody (WDR47)
;
Catalog (WDR47)
;
Sequence data provider
NA
Annotation project
NA
Length of cDNA
2375[bp] (No. of exon:8)[A:684 T:751 G:498 C:442]
Devision
HUM
Molecular type
mRNA
Library origin
Cell type
NA
Tissue type
whole embryo, mainly body
Develpmental stage
embryo, 10 weeks
Sequence quality information
CDS feature
N-truncated
Kozak sequence
NA
PolyA
NA
Vector/adapter sequence
NA
Frame shift
NA
Remaining intron
NA
Splice site acceptor (NAGNAG)
NA
Transcript quality feature
Truncation;
Notes
NA
ATGATGCTTCAAATATTCATACAAGCACTCCTCGTAATCCTGGATCAACA AATCACATACCTTTTCTGGAGGAATCACCTTGTGGAAGCCAAATCTCTTC AGAACATTCGGTCATTAAGCCACCTCTTGGAGATTCTCCAGGGAGTCTTT CAAGGTCGAAAGGGGAAGAGGATGACAAATCAAAAAAGCAGTTTGTTTGT ATTAATATCCTAGAAGACACACAAGCTGTTAGAGCAGTGGCTTTTCATCC AGCTGGAGGTTTATATGCTGTTGGTTCAAATTCAAAAACTCTGAGAGTAT GTGCCTATCCAGATGTAATTGATCCAAGTGCACATGAGACTCCTAAGCAG CCGGTGGTACGTTTTAAAAGGAATAAACATCATAAAGGATCCATTTACTG TGTGGCCTGGAGTCCTTGTGGGCAGTTATTAGCAACAGGATCAAATGACA AATACGTCAAAGTGCTGCCCTTCAATGCAGAGACTTGTAACGCAACAGGA CCAGATCTGGAATTTAGTATGCATGATGGAACAATTAGAGACTTGGCATT TATGGAAGGCCCAGAAAGCGGAGGAGCTATTTTAATAAGTGCTGGAGCAG GGGATTGTAACATTTATACAACCGATTGTCAAAGAGGACAGGGCCTCCAT GCTTTGAGTGGACATACTGGGCATATTTTAGCACTTTATACCTGGAGTGG CTGGATGATTGCATCTGGTTCCCAAGATAAGACTGTTAGATTTTGGGATC TTCGAGTACCAAGTTGTGTTCGTGTTGTTGGCACAACATTTCATGGAACT GGCAGTGCAGTGGCATCTGTAGCTGTAGATCCCAGTGGTCGTCTCTTAGC CACAGGTCAAGAAGATTCTAGCTGCATGTTGTATGACATAAGAGGAGGAA GAATGGTACAAAGTTATCATCCTCATTCCAGTGATGTTCGCTCTGTTCGA TTCTCCCCTGGAGCTCACTACTTGCTAACAGGCTCTTATGATATGAAAAT AAAGGTGACAGACCTACAAGGGGACCTCACCAAGCAGCTCCCTATCATGG TGGTGGGGGAGCACAAGGACAAAGTGATTCAGTGCAGATGGCACACCCAG GATCTTTCCTTCCTGTCATCCTCTGCAGATAGAACTGTCACCCTCTGGAC TTACAATGGGTAGAGCACACCGCATGTCAGTCTATGCAGCAAAAGCACAG AGACTTAAGACTACTGAGTTGTGAAAATTACAAATCTGAAGAACATAGTG TCCAGGAAAGTGGTTTAGCACGAAGAGGCCCCTTATTACCATGTATCCCA CTGATAGGAGGTGTTGGGTGGTGTTATTCCGCAGTGCTTTCAGTCTTCCA TGTGAGCTCGTGCTGCTGTGACCTGCTATATGTAGTCTCGTTGCCAAAGT CTGCAGAAGAGCTCTTCAGTTGTTGGTGTGCACTCCAAGTCAGGATGGAC AATGTGTTTACGGTTTAGTATTCAATGCATTCCTTGGTCTTTGCCTAAAT AACAGTTTTATATGCACATTGAAATGGAATTATACTTCAACTATATTATT AAATGTAATGCAACCAAGTTCCTCCCAGATTAAACTTCCCAGGTGTTCAG AATTACTTTTGCTCTTCTCACGATCCCATATTGTATTATCACTTGTCTTC TAGAGGTCAGAATTCCATAATATATGTCACTCAAAAGTTACATGGTTGCT TTCACTTAAGGATCATTATGGAGTTTAAAGATGAATGAAAAACTGCTTCT TAGTTTACTACATGGTATAGGCCCTTTTTTCTTAAACCCAGGGATATGAT TATTTTGTCATATAATTTTGTTTCAGGCTAAAAGGTAAATGTGTTTGCTT CAGAAACTTGTTAACTTCAGTTTTTTGAATGCAACAGGATACCTCCCTTC CAAACTGAACTGTAGAAGCAGAGCAGCAGCAGTTATGTGATGCAACACTT GATGGTACAGTAAATTTACTGGCATTTTTCTCCTTAAAAATTAAAATCCT TGACATAGACCATAGCATGGCTTGAAATGCTATGTCTGCATGATAATTTA AAATGGAAGATTTAAACTTTGCACTCCAAAAGCTTATTTGGATTTTTTTC TTGCACTGTTTTGTGTAATGCAGAATAATGATTTTATTTCTACAGCTTTG TAGATTCTAACATTTATGTATCTTTATTTTCATATTGTACAGTAATTTTA CTTTAAATTATTTAAATAGGCTATTTTATTTATTTCAAATGCAGTTGTAT TAGTTCTCATTATTGAACTGTCTGTGCACTGTATGTAGCAAGCATTTTTC ATCTGTTGTATACAAGTGGAAAGGGTATTAGAAGTGTAACTGTGCTATTA TTTCAATAAAGACCTCTTGACATTT
Gene structure information
H-Inv cluster ID
HIX0199788
Genomic location
Chromosome
1
Location
NA
Position
109512840- 109538295
Strand
-
Possible duplicated location(s)
NA
Gene structure
8 exon(s)
Database links
RefSeq
NM_014969
;
NM_001142550
;
NM_001142551
;
Ensembl
ENST00000357672
;
ENST00000361054
;
ENST00000369962
;
ENST00000369965
;
ENST00000400794
;
ENST00000528747
;
ENST00000529074
;
ENST00000530772
;
ENST00000531337
;
Entrez Gene
Entrez Gene ID:22911
;
KEGG GENES
KEGG GENES(22911)
;
GeneCard
WDR47
;
*GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links
H-DBAS
;
G-integra
;
cDNA-genome alignment
;
Predicted CDS information
HIP ID
HIP000003890
Predicted CDS
3..1163; 386[aa]; Orientation:+3;
Codon Adaptation Index (CAI).
0.695
DASNIHTSTPRNPGSTNHIPFLEESPCGSQISSEHSVIKPPLGDSPGSLS RSKGEEDDKSKKQFVCINILEDTQAVRAVAFHPAGGLYAVGSNSKTLRVC AYPDVIDPSAHETPKQPVVRFKRNKHHKGSIYCVAWSPCGQLLATGSNDK YVKVLPFNAETCNATGPDLEFSMHDGTIRDLAFMEGPESGGAILISAGAG DCNIYTTDCQRGQGLHALSGHTGHILALYTWSGWMIASGSQDKTVRFWDL RVPSCVRVVGTTFHGTGSAVASVAVDPSGRLLATGQEDSSCMLYDIRGGR MVQSYHPHSSDVRSVRFSPGAHYLLTGSYDMKIKVTDLQGDLTKQLPIMV VGEHKDKVIQCRWHTQDLSFLSSSADRTVTLWTYNG*
Motif information
a.a.
length
InterPro
Name
39
IPR001680
WD40 repeat [Repeat]
315
IPR015943
WD40/YVTN repeat-like-containing domain [Domain]
27
IPR019781
WD40 repeat, subgroup [Repeat]
311
IPR011046
WD40 repeat-like-containing domain [Domain]
43
IPR001680
WD40 repeat [Repeat]
31
IPR019782
WD40 repeat 2 [Repeat]
263
IPR017986
WD40-repeat-containing domain [Domain]
30
IPR019781
WD40 repeat, subgroup [Repeat]
45
IPR001680
WD40 repeat [Repeat]
39
IPR001680
WD40 repeat [Repeat]
37
IPR019781
WD40 repeat, subgroup [Repeat]
41
IPR019782
WD40 repeat 2 [Repeat]
15
IPR020472
G-protein beta WD-40 repeat [Repeat]
15
IPR019775
WD40 repeat, conserved site [Conserved_site]
40
IPR001680
WD40 repeat [Repeat]
35
IPR019781
WD40 repeat, subgroup [Repeat]
42
IPR019782
WD40 repeat 2 [Repeat]
40
IPR001680
WD40 repeat [Repeat]
34
IPR019781
WD40 repeat, subgroup [Repeat]
35
IPR019782
WD40 repeat 2 [Repeat]
15
IPR020472
G-protein beta WD-40 repeat [Repeat]
40
IPR001680
WD40 repeat [Repeat]
35
IPR019781
WD40 repeat, subgroup [Repeat]
36
IPR019782
WD40 repeat 2 [Repeat]
15
IPR020472
G-protein beta WD-40 repeat [Repeat]
Gene function information
H-Inv ID
HIT000081651
H-Inv cluster ID
HIX0199788
Accession number
AK074616.1
CAGE tag ID
NA
EST ID
NA
Transcript feature
NO;
Coding potential
Protein coding;
Definition
Similar to WD repeat-containing protein 47;
Similarity category
Category: Similar to known protein(Category II).
Similar to known protein (
O94967
) [Identity/coverage = 100.0%/42.0%] to Homo sapiens (Human). protein.
Experimental evidence
Protein evidence
PubMed ID
10048485
;
14702039
;
15489334
;
16710414
;
18220336
;
18669648
;
ALL
;
Gene family/group
H-Inv gene family/group ID
NA
Gene family/group name
NA
Evidence motif (InterPro) ID
NA
Gene symbol/name
HGNC symbol
WDR47
HGNC aliases
NA
HGNC name
WD repeat domain 47
DDBJ
NA
UniProt
WDR47
EC number
NA
GGDB
(GlycoGene Database)
Gene symbol
NA
Familly
NA
Designation
NA
Expression
NA
KEGG metabolic pathway
NA
Protein-protein interaction (PPI)
H-Inv protein ID
HIP000003890
No. of interaction
NA
Interaction partner(s)
NA
BIND
NA
DIP
NA
MINT
NA
HPRD
NA
IntAct
NA
Database links
RefSeq
NM_014969
;
NM_001142550
;
NM_001142551
;
Ensembl
ENST00000357672
;
ENST00000361054
;
ENST00000369962
;
ENST00000369965
;
ENST00000400794
;
ENST00000528747
;
ENST00000529074
;
ENST00000530772
;
ENST00000531337
;
Entrez Gene
Entrez Gene ID:22911
;
KEGG GENES
KEGG GENES(22911)
;
GeneCard
WDR47
;
*GeneCards is provided free to academic non-profit institutions.
etc
Human-Gene diversity Of Life-style related Diseases
;
Curation status
Auto-annotated
Notes
NA
Related H-InvDB links
Gene family;
Similarity Search Tool;
TACT
;
fRNAdb (The functional RNA database) : overlapping fRNAdb entries with H-InvDB transcripts based on the location of the genome.
NA
Subcellular localization information
Last modified:20-Apr-2012
WoLF PSORT
nuclear; cytosol; extracellular;
Target P
Other
SOSUI
soluble protein
TMHMM
soluble protein
PTS1
Not targeted
Related H-InvDB links
LIFEdb;
JRE-1.4.0 or later is required.
Download JRE at
Sun's web site.
Protein structure information (GTOP)
Last modified:20-Apr-2012
Start
End
PDB_ID
E-value
Identity
Coverage
SCOP_ID
29
210
1u4cA
3e-20
13.3
181/322
b.69.4.2
Related H-InvDB links
GTOP
Gene expression information
Last modified:20-Apr-2012
Tissue-specific expression
NA
Probe
information
AceGene
AGhsA240912;
Affymetrix
GeneChip
HG-Focus
NA
HG-U133
203855_at;
HG-U133A
203855_at;
HG-U133A_2
203855_at;
HG-U133B
NA
HG-U133_Plus_2
203855_at;
HG-U95
35720_at;
HG-U95A
35720_at;
HG-U95B
NA
HG-U95C
NA
HG-U95D
NA
HG-U95E
NA
HG-U95Av2
NA
HuEx-1_0
2350464; 2426841; 2426842; 2426843; 2426844; 2426846; 2426848; 2426849; 2426850; 2426852; 2426854; 2426857; 4052680; 4052682;
HuGeneFL
NA
Agilent
Human 1A Oligo Microarray:PGID215
A_23_P23748;
Whole Human Genome Oligo Microarray:PGID247
A_23_P23748;
Related H-InvDB links
H-ANGEL
;
DNAProbeLocator
;
Disease/pathology information
Last modified:20-Apr-2012
Disease relation
Disease name:NA
Related information in OMIM
OMIM ID:NA Title:NA
Co-localized orphan diseases
OMIM ID:
115665
;
116600
;
155600
;
600975
;
605225
;
605606
;
606788
;
606852
;
606928
;
607317
;
607671
;
608543
;
608553
;
608995
;
610320
;
612367
;
612596
;
Disease related mutation
NA
Literature-Extracted GENe-Disease Associations (LEGENDA)
Gene name
Entrez Gene ID:(22911)
Disease
Entrez Gene ID:(22911)
Substance
Entrez Gene ID:(22911)
Related H-InvDB links
DiseaseInfo Viewer
;
LEGENDA
;
Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
Single Nucleotide Polymorphism (SNP) and indel
Location
Variation
dbSNP ID
Strand
CDS/UTR
Translation
7 .. 7
C/T
rs113434051
-
CDS
Nonsynonymous[Ala2Val]
260 .. 260
T/C
rs41299563
-
CDS
Synonymous[Gly86Gly]
609 .. 609
A/C
rs1538137
-
CDS
Nonsynonymous[Asn203His]
623 .. 623
C/T
rs76370963
-
CDS
Synonymous[Thr207Thr]
661 .. 661
G/T
rs74576781
-
CDS
Nonsynonymous[Gly220Val]
958 ^ 959
-/C
rs35988108
-
CDS
1525 .. 1525
T/G
rs112609892
-
3'UTR
1718 .. 1718
A/G
rs507776
-
3'UTR
1936 .. 1936
T/A
rs114053618
-
3'UTR
1941 .. 1941
T/C
rs11803800
-
3'UTR
2361 .. 2361
G/C
rs12068536
-
3'UTR
Microsatellite (Short Tandem Repeat, STR)
No data available
Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
Repeat
No data available
Database links
Human-Gene diversity Of Life-style related Diseases(H-GOLD)
;
Related H-InvDB links
VaryGene
;
Repeat Mask Viewer
;