H-InvDB x AHG DB
Transcript view
H-InvDB_8.3 released on March 26, 2013.
Search by for Advanced Search
Home Quick guide Navi BLAST Site map Download Contact us Help
Locus viewLocus view Protein view Protein view G-integraG-integra DiseaseInfo ViewerDiseaseInfo Viewer H-ANGELH-ANGEL EvolaEvola PPI viewerPPI view Gene Family/GroupGene Family/Group Hyperlink managemnet system (All databases)Hyperlink MS
H-Invitational ID: HIT000018866 Accession number: AK094009 Created date: 26-Mar-2013 Last modified: 20-Apr-2012
Definition: Peptidase S1A, chymotrypsin-type family protein.
 
 

Transcript original information
Accession number AK094009.1
CAGE tag ID NA
EST ID NA
Clone Number UTERU2008707
Experimental resources NBRC: NITE Biological Resource Center  NBRC ; HGPD: Human Gene and Protein Database HGPD ;
Sequence data provider Project:FLJ; Provider:FLJ/HRI; 
Annotation project H-Invitational FLcDNA
Length of cDNA 2580[bp] (No. of exon:0)[A:636 T:620 G:659 C:665]
Devision HUM
Molecular type mRNA
Library origin Cell type NA
Tissue type uterus
Develpmental stage NA
Mini-G
Sequence quality information
CDS feature N-truncated
Kozak sequence NA
PolyA NA
Vector/adapter sequence NA
Frame shift NA
Remaining intron NA
Splice site acceptor (NAGNAG) NA
Transcript quality feature NA
Notes NA

Gene structure information  G-integra H-DBAS cDNA-genome alignment
H-Inv cluster ID HIX0060170
Genomic location  G-integra Help Chromosome UM
Location NA
Position NA
Strand NA
Possible duplicated location(s) NA
Gene structure  exon(s)
Database links RefSeq NA
Ensembl NA
Entrez Gene Entrez Gene ID:715
KEGG GENES KEGG GENES(715)
GeneCard NA *GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links H-DBASH-DBAS;  G-integraG-integra;  cDNA-genome alignment; 

Predicted CDS information
HIP ID HIP000336703
Predicted CDS 1172..2467;  431[aa];  Orientation:+2; 
Codon Adaptation Index (CAI). 0.822

Motif information
ORF

length(431),orf(1172:2467)
LAVVWLNAPVCNGLGRRKKEPHPWGSHIHGGATSMGEPRDVLVLPPTPVW
KGQEGDLHCLLALMAPLSSAQQGNQVLHSFTAVCQDDGTWHRAMPRCKIK
DCGQPRNLPNGDFRYTTTMGVNTYKARIQYYCHEPYYKMQTRAGSRESEQ
GVYTCTAQGIWKNEQKGEKIPRCLPVCGKPVNPVEQRQRIIGGQKAKMGN
FPWQVFTNIHGRGGGALLGDRWILTAAHTLYPKEHEAQSNASLDVFLGHT
NVEELMKLGNHPIRRVSVHPDYRQDESYNFEGDIALLELENSVTLGPNLL
PICLPDNDTFYDLGLMGYVSGFGVMEEKIAHDLRFVRLPVANPQACENWL
RGKNRMDVFSQNMFCAGHPSLKQDACQGDSGGVFAVRDPNTDRWVATGIV
SWGIGCSRGYGFYTKVLNYVDWIKKEMEEED*
a.a.
length
InterPro Name
length(28), motif(71:98) 28 IPR016060 Complement control module [Domain]
length(27), motif(77:103) 27 IPR016060 Complement control module [Domain]
length(76), motif(100:175) 76 IPR000436 Sushi/SCR/CCP [Domain]
length(82), motif(102:183) 82 IPR016060 Complement control module [Domain]
length(64), motif(102:165) 64 IPR000436 Sushi/SCR/CCP [Domain]
length(72), motif(102:173) 72 IPR000436 Sushi/SCR/CCP [Domain]
length(77), motif(104:180) 77 IPR016060 Complement control module [Domain]
length(255), motif(177:431) 255 IPR009003 Peptidase cysteine/serine, trypsin-like [Domain]
length(235), motif(189:423) 235 IPR001254 Peptidase S1/S6, chymotrypsin/Hap [Domain]
length(234), motif(190:423) 234 IPR001254 Peptidase S1/S6, chymotrypsin/Hap [Domain]
length(239), motif(190:428) 239 IPR001254 Peptidase S1/S6, chymotrypsin/Hap [Domain]
length(16), motif(214:229) 16 IPR001314 Peptidase S1A, chymotrypsin-type [Family]
length(15), motif(279:293) 15 IPR001314 Peptidase S1A, chymotrypsin-type [Family]
length(13), motif(373:385) 13 IPR001314 Peptidase S1A, chymotrypsin-type [Family]
length(12), motif(374:385) 12 IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site [Active_site]

Gene function information  Gene family PPI viewer Similarity Search Tool TACT
H-Inv ID HIT000018866
H-Inv cluster ID Locus viewHIX0060170
Accession number AK094009.1
CAGE tag ID NA
EST ID NA
Transcript feature  Help Representative H-Inv IDRepresentative transcript; 
Coding potential  Help Protein coding; 
Definition Peptidase S1A, chymotrypsin-type family protein.
Similarity category  Help Category: IPR domain containing protein(Category III).
InterPro domain (IPR001314) -containing protein.
Experimental evidence NA
PubMed ID NA
Gene family/group Gene family H-Inv gene family/group ID HIF0004019
Gene family/group name Peptidase S1 and S6, chymotrypsin/Hap (IPR001254).
Evidence motif (InterPro) ID IPR001254
Gene symbol/name HGNC symbol NA
HGNC aliases NA
HGNC name NA
DDBJ NA
UniProt NA
EC number NA
GGDB
(GlycoGene Database)
Gene symbol NA
Familly NA
Designation NA
Expression NA
KEGG metabolic pathway NA
Protein-protein interaction (PPI) PPI viewer H-Inv protein ID HIP000336703
No. of interaction 89
Interaction partner(s) HIP000002695HIP000012362HIP000021611HIP000021611HIP000022683HIP000022683HIP000023224HIP000027846HIP000028776HIP000031121HIP000032033HIP000032033HIP000033567HIP000038044HIP000038917HIP000040104HIP000040104HIP000040325HIP000040325HIP000040335HIP000040335HIP000041964HIP000044948HIP000045138HIP000045138HIP000045470HIP000047547HIP000047547HIP000048209HIP000048478HIP000048478HIP000049023HIP000051303HIP000051303HIP000055598HIP000055598HIP000065041HIP000066234HIP000067782HIP000067782HIP000069782HIP000069782HIP000069885HIP000071711HIP000078313HIP000078313HIP000079932HIP000079932HIP000081613HIP000087439HIP000087439HIP000088108HIP000095623HIP000095623HIP000097730HIP000098499HIP000109009HIP000109009HIP000110854HIP000110854HIP000111022HIP000117125HIP000117125HIP000133002HIP000133002HIP000146646HIP000148859HIP000162041HIP000162041HIP000193974HIP000195336HIP000195642HIP000205642HIP000212383HIP000219887HIP000247333HIP000247912HIP000251455HIP000336295HIP000336784HIP000345964HIP000346720HIP000346720HIP000349874HIP000358744HIP000360676HIP000360879HIP000360879HIP000361685
BIND NA
DIP 100211E;  100366E;  100532E;  100609E;  100625E;  100726E;  101191E;  101443E;  102007E;  102223E;  102293E;  18532E;  47837E;  47898E;  47934E;  48016E;  48443E;  48849E;  48906E;  48953E;  49105E;  49197E;  49199E;  49449E;  50033E;  50141E;  50417E;  50716E;  50760E;  51035E;  51217E;  51254E;  51455E;  51513E;  51558E;  51690E;  51835E;  51967E;  52031E;  52230E;  52250E;  52271E;  52359E;  52870E;  52957E;  53435E;  96492E;  96537E;  96621E;  96987E;  97096E;  97835E;  98140E;  98247E;  98297E;  98317E;  98322E;  98352E;  98475E;  98746E;  98762E;  98906E;  99026E;  99093E;  99178E;  99193E;  99200E;  99222E;  99282E;  99323E;  99378E;  99433E;  99449E;  99484E;  99593E;  99624E;  99764E;  99869E;  99955E; 
MINT NA
HPRD 00284;  01677;  03396;  04435;  04471;  04866;  05344;  05728;  06861;  09001;  09490;  12692;  17624;  18253; 
IntAct EBI-906966;  EBI-880595;  EBI-895331;  EBI-881486;  EBI-879304;  EBI-1147250;  EBI-886936;  EBI-887918;  EBI-883767;  EBI-895222;  EBI-883170;  EBI-883484;  EBI-883046;  EBI-882652;  EBI-881701;  EBI-881754;  EBI-886668;  EBI-883980;  EBI-883848;  EBI-884415;  EBI-886556;  EBI-884810;  EBI-893916;  EBI-887783;  EBI-893650;  EBI-895127;  EBI-894927;  EBI-888366;  EBI-893285;  EBI-893204;  EBI-892264;  EBI-888757;  EBI-891913;  EBI-891424;  EBI-889682;  EBI-891089;  EBI-892437;  EBI-892692;  EBI-887368;  EBI-887304; 
Database links RefSeq NA
Ensembl NA
Entrez Gene Entrez Gene ID:715
KEGG GENES KEGG GENES(715)
GeneCard NA *GeneCards is provided free to academic non-profit institutions.
etc H-GOLDHuman-Gene diversity Of Life-style related Diseases; 
Curation status Human curated
Notes NA
Related H-InvDB links Peptidase S1 and S6, chymotrypsin/Hap (IPR001254). Similarity Search ToolSimilarity Search Tool TACTTACT
fRNAdb (The functional RNA database) : overlapping fRNAdb entries with H-InvDB transcripts based on the location of the genome. 
NA

Gene ontology information
Molecular function catalytic activity (GO:0003824);  serine-type endopeptidase activity (GO:0004252); 
Biological process proteolysis (GO:0006508); 

Subcellular localization information  Last modified:20-Apr-2012
WoLF PSORT mitochondria;  cytosol;  extracellular;  peroxisome; 
Target P Not predicted
SOSUI soluble protein
TMHMM soluble protein
PTS1 Not targeted
Related H-InvDB links LIFEdb LIFEdb; 
JRE-1.4.0 or later is required. Download JRE at Sun's web site.

Protein structure information (GTOP) GTOP Last modified:20-Apr-2012
Start End PDB_ID E-value Identity Coverage SCOP_ID
40 66 1gpzA2 2e-05 100.0 27/68 g.18.1.1
67 142 1elvA2 2e-17 40.0 65/66 g.18.1.1
117 395 1qvh.1 1e-32 28.5 267/290 b.47.1.2
Related H-InvDB links GTOP GTOP

Gene expression information  H-ANGEL DNAProbeLocator Last modified:20-Apr-2012
Tissue-specific expression  H-ANGEL NA
Probe
information DNAProbeLocator
AceGene AGhsC080524; 
Affymetrix
GeneChip
HG-Focus 212067_s_at; 
HG-U133 212067_s_at; 
HG-U133A 212067_s_at; 
HG-U133A_2 212067_s_at; 
HG-U133B NA
HG-U133_Plus_2 212067_s_at; 
HG-U95 39409_at; 
HG-U95A 39409_at; 
HG-U95B NA
HG-U95C NA
HG-U95D NA
HG-U95E NA
HG-U95Av2 NA
HuEx-1_0 3403217;  3442478;  3442479;  3442480;  3442481;  3442482;  3442483;  3442484; 
HuGeneFL M14058_at; 
Agilent Human 1A Oligo Microarray:PGID215 NA
Whole Human Genome Oligo Microarray:PGID247 NA
Related H-InvDB links H-ANGELH-ANGEL;  DNAProbeLocatorDNAProbeLocator

Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information  VaryGene Repeat mask viewer
 Single Nucleotide Polymorphism (SNP) and indel  VaryGene
No data available
 Microsatellite (Short Tandem Repeat, STR)
No data available
 Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
 Repeat  Repeat mask viewer
Type Start End Strand
L2a 18 135 -
AluSz6 442 762 -
Database links H-GOLDHuman-Gene diversity Of Life-style related Diseases(H-GOLD); 
Related H-InvDB links VaryGeneVaryGene;  Repeat mask viewerRepeat Mask Viewer