Integrase_H2C2 domain-containing protein

Overview
Name: Integrase_H2C2 domain-containing protein
Smed ID: SMED30002976
Length (bp): 1681
Neoblast Clusters

Zeng et al., 2018




Overview

 

Single-cell RNA-seq of pluripotent neoblasts and their early progeny


We isolated X1 neoblasts enriched for high piwi-1 expression (Neoblast Population) and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes among 7,088 high-quality cells. We designated these classes Nb1 to Nb12, ordered from high (Nb1) to low (Nb12) piwi-1 expression. We further defined the groups of genes that best distinguished the 12 cell clusters and used them to generate a scaled expression heat map of discriminative gene sets for each cluster. Each cluster's gene signature was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1, which largely confirmed the cell clusters revealed by scRNA-seq.
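The ordering of clusters by piwi-1 level can be illustrated with a minimal Python sketch. The function name, the toy expression values, and the temporary cluster labels are hypothetical and are not taken from the Zeng et al. dataset. The source does not state which summary statistic was used for the ordering, so the per-cluster mean used here is an assumption.

```python
from collections import defaultdict
from statistics import mean


def order_clusters_by_marker(cells):
    """Rank clusters from high to low mean marker expression.

    cells: iterable of (cluster_label, marker_expression) pairs.
    Returns a dict mapping each original label to "Nb<rank>".
    """
    per_cluster = defaultdict(list)
    for label, expression in cells:
        per_cluster[label].append(expression)
    # Sort cluster labels by mean expression, highest first.
    ranked = sorted(per_cluster, key=lambda c: mean(per_cluster[c]), reverse=True)
    return {label: f"Nb{rank}" for rank, label in enumerate(ranked, start=1)}


# Toy data (hypothetical): three clusters with different piwi-1 levels.
toy_cells = [
    ("a", 2.0), ("a", 2.4),
    ("b", 5.1), ("b", 4.9),
    ("c", 0.3), ("c", 0.5),
]
names = order_clusters_by_marker(toy_cells)
```

With real data the same function would take one (cluster, piwi-1 expression) pair per cell and return the Nb1 to Nb12 naming.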

We also profiled cells that survived sub-lethal irradiation. To capture rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we identified a time point after sub-lethal irradiation (7 DPI) at which piwi-1+ cells were minimal, then isolated 1,200 individual cells from the X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations and profiled them by single-cell RNA-seq (Sub-lethal Irradiated Surviving X1 and X2 Cell Population).




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot showing a two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filtering). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Nb, neoblast groups.


Expression of Integrase_H2C2 domain-containing protein (SMED30002976) in the t-SNE clustered cells.

Violin plots show distribution of expression levels for Integrase_H2C2 domain-containing protein (SMED30002976) in cells (dots) of each of the 12 neoblast clusters.

 



 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Integrase_H2C2 domain-containing protein (SMED30002976) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Integrase_H2C2 domain-containing protein (SMED30002976) in cells (dots) of each of the 10 clusters of sub-lethally irradiated X1 and X2 cells.

 



 

Embryonic Expression

Davies et al., 2017




Expression values are average RPKM per sample.

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods.
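As a reminder of what an RPKM value represents, the following Python sketch computes reads per kilobase of transcript per million mapped reads. The read counts in the example are hypothetical; only the transcript length (1681 bp) comes from this page. This is the standard definition of RPKM, not a description of the exact pipeline used for the embryonic dataset.

```python
def rpkm(read_count, total_mapped_reads, transcript_length_bp):
    """Reads per kilobase of transcript per million mapped reads.

    RPKM = read_count * 1e9 / (total_mapped_reads * transcript_length_bp)
    """
    if total_mapped_reads <= 0 or transcript_length_bp <= 0:
        raise ValueError("library size and transcript length must be positive")
    return read_count * 1e9 / (total_mapped_reads * transcript_length_bp)


# Transcript length of SMED30002976 as listed in the Overview.
TRANSCRIPT_LENGTH_BP = 1681

# Hypothetical sample: 500 reads on this transcript in a 20-million-read library.
example_value = rpkm(
    read_count=500,
    total_mapped_reads=20_000_000,
    transcript_length_bp=TRANSCRIPT_LENGTH_BP,
)
```

Averaging such values across replicate samples of one stage would give a per-stage figure of the kind the graph reports.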

 

Anatomical Expression

PAGE et al., 2020




SMED30002976 has been reported as being expressed in these anatomical structures and/or regions.



PAGE Curations: 8

  
Expressed In    Reference Transcript  Gene Models       Published Transcript  Transcriptome       Publication                              Specimen                     Lifecycle      Evidence
head region     SMED30002976          SMESG000031440.1  SmedASXL_012832       SmedAsxl_ww_GCZZ01  PMID:27034770 (Currie et al., 2016)      whole organism               asexual adult  RNA-sequencing evidence
pharynx         SMED30002976          SMESG000031440.1  dd_Smed_v4_9924_0_1   dd_Smed_v4          PMID:29674431 (Fincher et al., 2018)     FACS sorted cell population  asexual adult  single-cell RNA-sequencing evidence
nervous system  SMED30002976          SMESG000031440.1  dd_Smed_v4_9924_0_1   dd_Smed_v4          PMID:29674431 (Fincher et al., 2018)     FACS sorted cell population  asexual adult  single-cell RNA-sequencing evidence
epidermis       SMED30002976          SMESG000031440.1  dd_Smed_v4_9924_0_1   dd_Smed_v4          PMID:29674431 (Fincher et al., 2018)     FACS sorted cell population  asexual adult  single-cell RNA-sequencing evidence
head region     SMED30002976          SMESG000031440.1  dd_Smed_v6_9924_0_1   dd_Smed_v6          PMID:28171748 (Stückemann et al., 2017)  whole organism               asexual adult  RNA-sequencing evidence
epidermis       SMED30002976          SMESG000031440.1  dd_Smed_v6_9924_0_1   dd_Smed_v6          PMID:30399335 (Ross et al., 2018)        whole organism               asexual adult  colorimetric in situ hybridization evidence
neuron          SMED30002976          SMESG000031440.1  dd_Smed_v6_9924_0_1   dd_Smed_v6          PMID:30399335 (Ross et al., 2018)        whole organism               asexual adult  colorimetric in situ hybridization evidence
epidermis       SMED30002976          SMESG000031440.1  dd_Smed_v4_9924_0_1   dd_Smed_v4          PMID:28292427 (Wurtzel et al., 2017)     whole organism               asexual adult  single-cell RNA-sequencing evidence
Homology
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Human
Match: C2orf81 (chromosome 2 open reading frame 81 [Source:HGNC Symbol;Acc:HGNC:34350])

HSP 1 Score: 96.2857 bits (238), Expect = 2.818e-22
Identity = 53/138 (38.41%), Positives = 74/138 (53.62%), Query Frame = 2
Query:  161 KMSKQGTSRSRAEKTRGTVSVVVPVTN-DIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFEH-IWTEDEEPPSCMIDSWAQGYFPKRQAS 568
            ++  +G +RS+AEK R      VPV   DIVPG+ +E +       E  ++ +  ++ +L         + YL  + +PFT   A+ AML+I EW FL+ D GES   E   W EDEEP +C  DSWAQG  P   AS
Sbjct:   14 QVRDRGVTRSKAEKVR---PPTVPVPQVDIVPGRLSEAEWMALTALEEGEDVVGDILADLLARVMDSAFKVYLTQQCIPFTISQAREAMLQITEWRFLARDEGESAVAEDPTWGEDEEPSACTTDSWAQGSVPVLHAS 148          
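The statistics in an HSP block can be cross-checked with a few lines of Python. The sketch below parses the "Identity = ..." line, recomputes the percentages, and derives the number of gap positions in the query from the nucleotide coordinates (BLASTX query coordinates are in nucleotides, three per aligned residue). The function names are illustrative, and the pattern accepts both the "Postives" spelling emitted by the page and the corrected "Positives".

```python
import re

# Matches e.g. "Identity = 53/138 (38.41%), Postives = 74/138 (53.62%), Query Frame = 2"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos\w*ives = (\d+)/\d+ \([\d.]+%\), "
    r"Query Frame = (-?\d+)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identities, alignment_length, positives, frame = map(int, match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": alignment_length,
        "pct_identity": round(100 * identities / alignment_length, 2),
        "pct_positives": round(100 * positives / alignment_length, 2),
        "frame": frame,
    }


def aligned_query_residues(query_start, query_end):
    """Number of codons covered by 1-based nucleotide coordinates."""
    return (abs(query_end - query_start) + 1) // 3


# Values from the first Ensembl Human HSP (query nucleotides 161 to 568).
stats = parse_hsp_stats(
    "Identity = 53/138 (38.41%), Postives = 74/138 (53.62%), Query Frame = 2"
)
residues = aligned_query_residues(161, 568)
query_gaps = stats["alignment_length"] - residues
```

For this HSP the query spans 136 residues, so the alignment length of 138 implies two gap positions in the query, which matches the two "-" characters in the Query line.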
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Human
Match: C2orf81 (chromosome 2 open reading frame 81 [Source:HGNC Symbol;Acc:HGNC:34350])

HSP 1 Score: 96.2857 bits (238), Expect = 2.529e-20
Identity = 53/134 (39.55%), Positives = 72/134 (53.73%), Query Frame = 2
Query:  173 QGTSRSRAEKTRGTVSVVVPVTN-DIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFE-HIWTEDEEPPSCMIDSWAQGYFPKRQAS 568
            +G +RS+AEK R      VPV   DIVPG+ +E +       E  ++ +  ++ +L         + YL  + +PFT   A+ AML+I EW FL+ D GES   E   W EDEEP +C  DSWAQG  P   AS
Sbjct:   18 RGVTRSKAEKVR---PPTVPVPQVDIVPGRLSEAEWMALTALEEGEDVVGDILADLLARVMDSAFKVYLTQQCIPFTISQAREAMLQITEWRFLARDEGESAVAEDPTWGEDEEPSACTTDSWAQGSVPVLHAS 148          
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Human
Match: C2orf81 (chromosome 2 open reading frame 81 [Source:HGNC Symbol;Acc:HGNC:34350])

HSP 1 Score: 95.9005 bits (237), Expect = 3.049e-20
Identity = 53/134 (39.55%), Positives = 72/134 (53.73%), Query Frame = 2
Query:  173 QGTSRSRAEKTRGTVSVVVPVTN-DIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFE-HIWTEDEEPPSCMIDSWAQGYFPKRQAS 568
            +G +RS+AEK R      VPV   DIVPG+ +E +       E  ++ +  ++ +L         + YL  + +PFT   A+ AML+I EW FL+ D GES   E   W EDEEP +C  DSWAQG  P   AS
Sbjct:   12 RGVTRSKAEKVR---PPTVPVPQVDIVPGRLSEAEWMALTALEEGEDVVGDILADLLARVMDSAFKVYLTQQCIPFTISQAREAMLQITEWRFLARDEGESAVAEDPTWGEDEEPSACTTDSWAQGSVPVLHAS 142          
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Human
Match: C2orf81 (chromosome 2 open reading frame 81 [Source:HGNC Symbol;Acc:HGNC:34350])

HSP 1 Score: 56.9954 bits (136), Expect = 5.257e-9
Identity = 28/50 (56.00%), Positives = 31/50 (62.00%), Query Frame = 2
Query:  422 MLKIIEWNFLSHDSGESDKFE-HIWTEDEEPPSCMIDSWAQGYFPKRQAS 568
            ML+I EW FL+ D GES   E   W EDEEP +C  DSWAQG  P   AS
Sbjct:    1 MLQITEWRFLARDEGESAVAEDPTWGEDEEPSACTTDSWAQGSVPVLHAS 50          
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Human
Match: C2orf81 (chromosome 2 open reading frame 81 [Source:HGNC Symbol;Acc:HGNC:34350])

HSP 1 Score: 57.7658 bits (138), Expect = 3.279e-8
Identity = 28/50 (56.00%), Positives = 31/50 (62.00%), Query Frame = 2
Query:  422 MLKIIEWNFLSHDSGESDKFE-HIWTEDEEPPSCMIDSWAQGYFPKRQAS 568
            ML+I EW FL+ D GES   E   W EDEEP +C  DSWAQG  P   AS
Sbjct:    1 MLQITEWRFLARDEGESAVAEDPTWGEDEEPSACTTDSWAQGSVPVLHAS 50          
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Mouse
Match: 1700003E16Rik (RIKEN cDNA 1700003E16 gene [Source:MGI Symbol;Acc:MGI:1919087])

HSP 1 Score: 92.4337 bits (228), Expect = 3.184e-19
Identity = 54/147 (36.73%), Positives = 76/147 (51.70%), Query Frame = 2
Query:  173 QGTSRSRAEKTRGTVSVVVPVTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFE-HIWTEDEEPPSCMIDSWAQGYFPKRQASFEIAHHPTEIDECI 610
            +G +RS+AEK R     V  V  DIVPG+  E +    +  E  ++ +  ++ +L     +   + YL  + VPFT   A+ AML+I EW FL+ D GES   E   W EDEEP +C  DSWAQG  P       + H P  +  C+
Sbjct:   12 RGVTRSKAEKARPPTQPVPQV--DIVPGRLNEAEWIAFMSLEEGEDVVGDILADLMTRVMECAFKVYLTQQCVPFTISQAREAMLQITEWRFLARDEGESAVAEDPTWGEDEEPLACTTDSWAQGSVP-------VLHTPAPV--CV 147          
BLAST of Integrase_H2C2 domain-containing protein vs. UniProt/SwissProt
Match: sp|A6NN90|CB081_HUMAN (Uncharacterized protein C2orf81 OS=Homo sapiens OX=9606 GN=C2orf81 PE=3 SV=3)

HSP 1 Score: 95.9005 bits (237), Expect = 1.464e-19
Identity = 53/134 (39.55%), Positives = 72/134 (53.73%), Query Frame = 2
Query:  173 QGTSRSRAEKTRGTVSVVVPVTN-DIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFE-HIWTEDEEPPSCMIDSWAQGYFPKRQAS 568
            +G +RS+AEK R      VPV   DIVPG+ +E +       E  ++ +  ++ +L         + YL  + +PFT   A+ AML+I EW FL+ D GES   E   W EDEEP +C  DSWAQG  P   AS
Sbjct:   12 RGVTRSKAEKVR---PPTVPVPQVDIVPGRLSEAEWMALTALEEGEDVVGDILADLLARVMDSAFKVYLTQQCIPFTISQAREAMLQITEWRFLARDEGESAVAEDPTWGEDEEPSACTTDSWAQGSVPVLHAS 142          
BLAST of Integrase_H2C2 domain-containing protein vs. UniProt/SwissProt
Match: sp|A8NIX5|CB081_BOVIN (Uncharacterized protein C2orf81 homolog OS=Bos taurus OX=9913 PE=2 SV=2)

HSP 1 Score: 92.4337 bits (228), Expect = 1.635e-18
Identity = 51/129 (39.53%), Positives = 70/129 (54.26%), Query Frame = 2
Query:  173 QGTSRSRAEKTRGTVSVVVPVTN-DIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFE-HIWTEDEEPPSCMIDSWAQGYFP 553
            +G +RS+AEK R      VPV   DIVPG+ TE +       E  ++ +  ++ +L         + YL  + +PFT   A+ AML+I EW FL+ D GES   E   W EDEEP +C  D+WAQG  P
Sbjct:   12 RGMTRSKAEKVR---PPTVPVPQVDIVPGRLTEAEWIAFTALEEGEDVVGDILADLVARVIDSAFKVYLTQQCIPFTISQAREAMLQITEWRFLARDEGESAVAEDPTWGEDEEPLACTTDAWAQGSVP 137          
BLAST of Integrase_H2C2 domain-containing protein vs. UniProt/SwissProt
Match: sp|Q9DAQ4|CB081_MOUSE (Uncharacterized protein C2orf81 homolog OS=Mus musculus OX=10090 PE=2 SV=3)

HSP 1 Score: 92.4337 bits (228), Expect = 2.227e-18
Identity = 54/147 (36.73%), Positives = 76/147 (51.70%), Query Frame = 2
Query:  173 QGTSRSRAEKTRGTVSVVVPVTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFE-HIWTEDEEPPSCMIDSWAQGYFPKRQASFEIAHHPTEIDECI 610
            +G +RS+AEK R     V  V  DIVPG+  E +    +  E  ++ +  ++ +L     +   + YL  + VPFT   A+ AML+I EW FL+ D GES   E   W EDEEP +C  DSWAQG  P       + H P  +  C+
Sbjct:   12 RGVTRSKAEKARPPTQPVPQV--DIVPGRLNEAEWIAFMSLEEGEDVVGDILADLMTRVMECAFKVYLTQQCVPFTISQAREAMLQITEWRFLARDEGESAVAEDPTWGEDEEPLACTTDSWAQGSVP-------VLHTPAPV--CV 147          
BLAST of Integrase_H2C2 domain-containing protein vs. UniProt/SwissProt
Match: sp|Q6AXP4|CB081_RAT (Uncharacterized protein C2orf81 homolog OS=Rattus norvegicus OX=10116 PE=1 SV=1)

HSP 1 Score: 91.6633 bits (226), Expect = 3.489e-18
Identity = 51/137 (37.23%), Positives = 72/137 (52.55%), Query Frame = 2
Query:  173 QGTSRSRAEKTRGTVSVVVPVTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFE-HIWTEDEEPPSCMIDSWAQGYFPKRQASFEIA 580
            +G +RS+AEK R     V  V  DIVPG+  E +    +  E  ++ +  ++ +L     +   + YL  + VPFT   A+ AML+I EW FL+ D GES   E   W EDEEP +C  D+WAQG  P   A   + 
Sbjct:   12 RGVTRSKAEKARPPTQPVPQV--DIVPGRLNEAEWIAFMSLEEGEDIVGDILADLVTRVMECAFKVYLTQQCVPFTISQAREAMLQITEWRFLARDEGESAVAEDPTWGEDEEPLACTTDAWAQGSVPVLHAPAPVG 146          
BLAST of Integrase_H2C2 domain-containing protein vs. TrEMBL
Match: A0A267E8G7 (Uncharacterized protein (Fragment) OS=Macrostomum lignano OX=282301 GN=BOX15_Mlig015441g2 PE=4 SV=1)

HSP 1 Score: 134.035 bits (336), Expect = 3.770e-32
Identity = 65/136 (47.79%), Positives = 91/136 (66.91%), Query Frame = 2
Query:  158 TKMSKQGTSRSRAEKTRGTVSVVVP-VTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGES-DKF-EHIWTEDEEPPSCMIDSWAQGYFPK 556
            ++MS+Q  ++ RA++ +G  S   P V N+I+PGKFTEHD NL +     +EF+  +V ++      +I+EKYL  +++P+T  LAK+AML+IIEWNFLSHD GES D+     W+ED EP S  IDSW QG  PK
Sbjct:    9 SQMSRQ-PAKGRADRGKGAQSTPNPAVQNEIIPGKFTEHDWNLAMGPSENEEFVLGIVNDVVDTALGEIYEKYLQRQLIPYTVSLAKDAMLQIIEWNFLSHDEGESPDRLTGEAWSEDREPASATIDSWGQGCIPK 143          
BLAST of Integrase_H2C2 domain-containing protein vs. TrEMBL
Match: A0A267DCI5 (Uncharacterized protein (Fragment) OS=Macrostomum lignano OX=282301 GN=BOX15_Mlig015441g3 PE=4 SV=1)

HSP 1 Score: 135.191 bits (339), Expect = 2.450e-30
Identity = 65/136 (47.79%), Positives = 91/136 (66.91%), Query Frame = 2
Query:  158 TKMSKQGTSRSRAEKTRGTVSVVVP-VTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGES-DKF-EHIWTEDEEPPSCMIDSWAQGYFPK 556
            ++MS+Q  ++ RA++ +G  S   P V N+I+PGKFTEHD NL +     +EF+  +V ++      +I+EKYL  +++P+T  LAK+AML+IIEWNFLSHD GES D+     W+ED EP S  IDSW QG  PK
Sbjct:    9 SQMSRQ-PAKGRADRGKGAQSTPNPAVQNEIIPGKFTEHDWNLAMGPSENEEFVLGIVNDVVDTALSEIYEKYLQRQLIPYTVSLAKDAMLQIIEWNFLSHDEGESPDRLTGEAWSEDREPASATIDSWGQGCIPK 143          
BLAST of Integrase_H2C2 domain-containing protein vs. TrEMBL
Match: A0A1I8JKD8 (Uncharacterized protein OS=Macrostomum lignano OX=282301 PE=4 SV=1)

HSP 1 Score: 133.65 bits (335), Expect = 3.234e-30
Identity = 65/134 (48.51%), Positives = 89/134 (66.42%), Query Frame = 2
Query:  164 MSKQGTSRSRAEKTRGTVSVVVP-VTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGES-DKF-EHIWTEDEEPPSCMIDSWAQGYFPK 556
            MS+Q  ++ RA++ +G  S   P V N+I+PGKFTEHD NL +     +EF+  +V ++      +I+EKYL  +++P+T  LAK+AML+IIEWNFLSHD GES D+     W+ED EP S  IDSW QG  PK
Sbjct:    1 MSRQ-PAKGRADRGKGAQSTPNPAVQNEIIPGKFTEHDWNLAMGPSENEEFVLGIVNDVVDTALSEIYEKYLQRQLIPYTVSLAKDAMLQIIEWNFLSHDEGESPDRLTGEAWSEDREPASATIDSWGQGCIPK 133          
BLAST of Integrase_H2C2 domain-containing protein vs. TrEMBL
Match: A0A1I8IKS1 (Integrase_H2C2 domain-containing protein OS=Macrostomum lignano OX=282301 PE=4 SV=1)

HSP 1 Score: 133.265 bits (334), Expect = 9.599e-29
Identity = 65/136 (47.79%), Positives = 91/136 (66.91%), Query Frame = 2
Query:  158 TKMSKQGTSRSRAEKTRGTVSVVVP-VTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGES-DKFE-HIWTEDEEPPSCMIDSWAQGYFPK 556
            ++MS+Q  ++ RA++ +G  S   P V N+I+PGKFTEHD NL +     +EF+  +V ++      +I+EKYL  +++P+T  LAK+AML+IIEWNFLSHD GES D+     W+ED EP S  IDSW QG  PK
Sbjct:  510 SQMSRQ-PAKGRADRGKGAQSTPNPAVQNEIIPGKFTEHDWNLAMGPSENEEFVLGIVNDVVDTALSEIYEKYLQRQLIPYTVSLAKDAMLQIIEWNFLSHDEGESPDRLTGEAWSEDREPASATIDSWGQGCIPK 644          
BLAST of Integrase_H2C2 domain-containing protein vs. TrEMBL
Match: A0A1S3IJN5 (uncharacterized protein C2orf81 homolog OS=Lingula unguis OX=7574 GN=LOC106164921 PE=4 SV=1)

HSP 1 Score: 125.946 bits (315), Expect = 2.946e-27
Identity = 131/504 (25.99%), Positives = 232/504 (46.03%), Query Frame = 2
Query:  164 MSKQGTSRSRAEK---TRGTVSVVVPVTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESD-KFEHIWTEDEEPPSCMIDSWAQGYFPKRQASFEIAHHPTEIDECIFXXXXXXXXXXXXIYDSPTIFYHLNENEPTKYTPIKNNQIYEID---AESERNTENLRLDTEPEYMINDXXXXXXXXXXXYRGKINFDDLHMLTETLSETEQRIELEEYKNAAKLLRGTSGRSNTNRPVENVDYEKKVIQKLFGTTVKNYQ--QLRSG------DILSNKNDQLVTLMKIPSEQLMLQRVKANHKALDVGMTKGH-------------------------------LSNKENASKKKTVHMEKVNHSQIKQEEFRPPISQNDKRPLPMSMLDSIDPVPGVIVQEGDRTKRSNVKKLDREKQMEREFLS---LRPIKNQTTKATFMIQDIINVNKVNIKPLS 1528
            MS+   S+SRAEK   T+G+      ++++IVPGKF + D N  +E+++ D+F+  +VE L  +T + IH+KY+  +++P+T   AK A+L+IIEW FLS D GE + + +  W EDEEP   + D WAQG  PK +        P E++E    E +  E +E    + P      N N    + P+++     ID    ES+ +T++ R                K+ FKPY G++    +  +TE+L +TE ++  EE++  ++ +               +D +K+   KL    +  +   +++SG      D+  ++   +V +MKI  ++L   RVK  +  +D  +                                   L   +  S  KTV++       + Q+  R    +    PLP  +++S+D  PGV+V+EG R     VKK  R++  + + ++   LRP+          + +++      ++PL 
Sbjct:    1 MSRAAVSKSRAEKGGKTQGSAPTPA-ISHEIVPGKFNDTDWNFMLERDDGDDFLDDIVEELCSKTMQIIHDKYIQRQLLPYTISQAKEAILQIIEWQFLSRDEGEPEAESDPSWLEDEEPQPAVSDCWAQGSVPKHKVEVSREPTPVEVEEPEEAEVSTEEPAETAPIEEPVDKIEENNNF-IPHPPLEDTADKAIDDKKPESKESTKSKR----------------KIKFKPYTGRLKSAGVSKMTESLEDTEMKMLEEEWRKQSQSI---------------IDEQKEKYDKLLQMPLSCHSILKMQSGRPPGHKDVTYDEMGNVVAVMKINPDKLPTHRVKVKYHVIDPAVEAAQARLEAMRTGRYLTAKTKSKKTAKPSTTVTDRLPTTDIVSTAKTVNLLSTAGMSMTQQAVR----KETVGPLPPPLIESMDVSPGVVVREGHR-----VKKGPRQQPRQADVITEQQLRPVNLHKQVPVMTVTELLEGQSPILRPLG 462          
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Nematostella
Match: EDO36270 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7SJ36])

HSP 1 Score: 108.997 bits (271), Expect = 2.005e-25
Identity = 49/127 (38.58%), Positives = 84/127 (66.14%), Query Frame = 2
Query:  182 SRSRAEKTRGTV--SVVVPVTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESD-KFEHIWTEDEEPPSCMIDSWAQGYFP 553
            S+SR EKT+     +   PV NDIVPG+FTE + NL +E E  ++F+  +V+ + + T + +H++Y+ ++ +P+T   A++ +L++IEW FL+ D GE++   +  W E++EP + + DSWAQG  P
Sbjct:    2 SKSRQEKTKTPAQSAPPPPVNNDIVPGRFTEAEWNLMVEGEEGEDFVCDIVQEIVESTMQVLHDRYITSQTLPYTVQEARDLLLQMIEWQFLACDGGENNPAIDATWLEEDEPIAPITDSWAQGSVP 128          
BLAST of Integrase_H2C2 domain-containing protein vs. Planmine SMEST
Match: SMESG000031440.1 (SMESG000031440.1)

HSP 1 Score: 910.598 bits (2352), Expect = 0.000e+0
Identity = 466/471 (98.94%), Positives = 467/471 (99.15%), Query Frame = 2
Query:  164 MSKQGTSRSRAEKTRGTVSVVVPVTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFEHIWTEDEEPPSCMIDSWAQGYFPKRQASFEIAHHPTEIDECIFXXXXXXXXXXXXIYDSPTIFYHLNENEPTKYTPIKNNQIYEIDAESERNTENLRLDTEPEYMINDXXXXXXXXXXXYRGKINFDDLHMLTETLSETEQRIELEEYKNAAKLLRGTSGRSNTNRPVENVDYEKKVIQKLFGTTVKNYQQLRSGDILSNKNDQLVTLMKIPSEQLMLQRVKANHKALDVGMTKGHLSNKENASKKKTVHMEKVNHSQIKQEEFRPPISQNDKRPLPMSMLDSIDPVPGVIVQEGDRTKRSNVKKLDREKQMEREFLSLRPIKNQTTKATFMIQDIINVNKVNIKPLSTSSPIPPIADGFKVSQ 1576
            MSKQGTSRSRAEKTRGTVSVVVPVTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFEHIWTEDEEPPSCMIDSWAQGYFPKRQASFEIAHHPTEIDECIFEEDNKLEESEEEIYDSPTIFYHLNENE TKYTPIKNNQIYEI+AESERNTENLRLDTEPEYMINDKVKKPKVPFKPYRGKINFDDLHMLTETLSETEQRIELEEYKNAAKLLRGTS RSNTNRPVENVDYEKKVIQKLFGTTVKNYQQLRSGDILSNKNDQLVTLMKIPSEQLMLQRVKANHKALDVGMTKGHLSNKENASKKKTVH EKVNHSQI QEEFRPPISQNDKRPLPMSMLDSIDPVPGVIVQEGDRTKRSNVKKLDREKQMEREFLSLRPIKNQTTKATFMIQDIINVNKVNIKPLSTSSPIPPIADGFKVSQ
Sbjct:    1 MSKQGTSRSRAEKTRGTVSVVVPVTNDIVPGKFTEHDLNLCIEQENLDEFIYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDSGESDKFEHIWTEDEEPPSCMIDSWAQGYFPKRQASFEIAHHPTEIDECIFEEDNKLEESEEEIYDSPTIFYHLNENESTKYTPIKNNQIYEIEAESERNTENLRLDTEPEYMINDKVKKPKVPFKPYRGKINFDDLHMLTETLSETEQRIELEEYKNAAKLLRGTSSRSNTNRPVENVDYEKKVIQKLFGTTVKNYQQLRSGDILSNKNDQLVTLMKIPSEQLMLQRVKANHKALDVGMTKGHLSNKENASKKKTVHTEKVNHSQINQEEFRPPISQNDKRPLPMSMLDSIDPVPGVIVQEGDRTKRSNVKKLDREKQMEREFLSLRPIKNQTTKATFMIQDIINVNKVNIKPLSTSSPIPPIADGFKVSQ 471          
The following BLAST results are available for this feature:
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 5
Match Name  E-value    Identity (%)  Description
C2orf81     2.818e-22  38.41         chromosome 2 open reading frame 81 [Source:HGNC Sy...
C2orf81     2.529e-20  39.55         chromosome 2 open reading frame 81 [Source:HGNC Sy...
C2orf81     3.049e-20  39.55         chromosome 2 open reading frame 81 [Source:HGNC Sy...
C2orf81     5.257e-9   56.00         chromosome 2 open reading frame 81 [Source:HGNC Sy...
C2orf81     3.279e-8   56.00         chromosome 2 open reading frame 81 [Source:HGNC Sy...
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 0
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 0
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 1
Match Name     E-value    Identity (%)  Description
1700003E16Rik  3.184e-19  36.73         RIKEN cDNA 1700003E16 gene [Source:MGI Symbol;Acc:...
BLAST of Integrase_H2C2 domain-containing protein vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 4
Match Name             E-value    Identity (%)  Description
sp|A6NN90|CB081_HUMAN  1.464e-19  39.55         Uncharacterized protein C2orf81 OS=Homo sapiens OX...
sp|A8NIX5|CB081_BOVIN  1.635e-18  39.53         Uncharacterized protein C2orf81 homolog OS=Bos tau...
sp|Q9DAQ4|CB081_MOUSE  2.227e-18  36.73         Uncharacterized protein C2orf81 homolog OS=Mus mus...
sp|Q6AXP4|CB081_RAT    3.489e-18  37.23         Uncharacterized protein C2orf81 homolog OS=Rattus ...
BLAST of Integrase_H2C2 domain-containing protein vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match Name  E-value    Identity (%)  Description
A0A267E8G7  3.770e-32  47.79         Uncharacterized protein (Fragment) OS=Macrostomum ...
A0A267DCI5  2.450e-30  47.79         Uncharacterized protein (Fragment) OS=Macrostomum ...
A0A1I8JKD8  3.234e-30  48.51         Uncharacterized protein OS=Macrostomum lignano OX=...
A0A1I8IKS1  9.599e-29  47.79         Integrase_H2C2 domain-containing protein OS=Macros...
A0A1S3IJN5  2.946e-27  25.99         uncharacterized protein C2orf81 homolog OS=Lingula...
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 0
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 0
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 1
Match Name  E-value    Identity (%)  Description
EDO36270    2.005e-25  38.58         Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
BLAST of Integrase_H2C2 domain-containing protein vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 0
BLAST of Integrase_H2C2 domain-containing protein vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 1
Match Name        E-value   Identity (%)  Description
SMESG000031440.1  0.000e+0  98.94         SMESG000031440.1
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30002976 ID=SMED30002976|Name=Integrase_H2C2 domain-containing protein|organism=Schmidtea mediterranea sexual|type=transcript|length=1681bp
CTGTATCATAATTTATATTGATAATAGTTACGTAAGGCAAAACATTAAAT
ATATTTTAGAAATAAAATTATTAGAAATTTCACTGTAATTATTCAGAAAA
TCATCATTTATTGTTACAATAACTAAAATTTTTATGTACTTAATTTTTCT
ATAATTTACAAAAATGTCAAAGCAAGGAACATCGAGAAGCCGAGCTGAAA
AAACTAGAGGTACAGTTTCTGTTGTCGTTCCTGTTACTAATGACATAGTT
CCTGGGAAATTTACGGAGCACGACTTAAATTTGTGCATAGAACAAGAAAA
TTTAGATGAATTTATTTATTCTCTGGTTGAAAATTTATATCAAGAAACGG
AAAAGCAAATTCATGAGAAATATCTCAATGCCAGAATTGTGCCATTTACT
GCCGATTTGGCTAAAAATGCAATGTTGAAGATTATTGAATGGAATTTTCT
TTCTCACGATTCAGGCGAATCTGATAAATTCGAACATATATGGACAGAAG
ACGAAGAACCTCCGTCTTGCATGATTGATTCATGGGCTCAGGGATATTTT
CCTAAACGGCAAGCTAGTTTTGAAATCGCTCATCATCCGACGGAAATAGA
TGAATGTATATTTGAAGAAGACAATAAGTTAGAGGAAAGTGAAGAAGAGA
TTTATGACTCCCCGACAATATTTTACCATTTAAATGAAAATGAACCAACC
AAATATACGCCAATAAAAAATAATCAGATTTATGAAATAGATGCCGAAAG
TGAAAGAAACACAGAAAATTTACGTTTAGACACTGAACCAGAATATATGA
TTAATGATAAAGTTAAAAAACCTAAGGTACCATTCAAACCATATCGTGGG
AAAATAAATTTTGATGACCTCCACATGTTGACAGAGACTTTATCAGAAAC
AGAACAGCGAATTGAATTGGAAGAATATAAAAATGCCGCTAAATTACTTA
GGGGAACCAGTGGTCGTTCCAATACGAATCGTCCAGTTGAAAATGTTGAT
TATGAAAAGAAAGTTATTCAGAAACTTTTTGGTACGACTGTCAAAAATTA
TCAACAACTACGATCGGGGGATATTTTATCAAATAAAAATGATCAACTTG
TCACTTTGATGAAAATTCCAAGCGAACAATTAATGCTACAAAGAGTAAAA
GCCAACCACAAAGCTTTGGATGTAGGAATGACTAAAGGACATTTATCAAA
CAAAGAAAATGCATCGAAAAAGAAAACAGTCCATATGGAGAAAGTTAACC
ATAGCCAAATAAAGCAGGAAGAATTTCGACCTCCAATTTCTCAAAATGAT
AAGCGTCCACTACCTATGTCAATGCTAGATAGTATAGATCCGGTACCTGG
GGTTATTGTACAAGAAGGGGATAGAACGAAACGATCAAATGTTAAAAAAC
TTGACAGAGAAAAACAAATGGAGAGAGAATTCCTTAGTTTGCGGCCAATT
AAAAATCAAACTACAAAAGCTACCTTTATGATACAAGATATCATTAATGT
TAACAAAGTCAACATCAAACCCCTTTCAACATCCTCGCCAATACCTCCAA
TTGCAGATGGGTTCAAAGTTTCTCAGTGATCTTTATGTGAACTGAATCAA
GATGACGTTGAAATATATTTGTCTTAATGGTCATGATTCCTTGTTTTAGT
TTATTTTTTAAAATTGTTTATTATAAAAAAA
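The FASTA header above packs several key=value fields after the accession, separated by "|". A minimal Python sketch for reading such a header and checking the declared length against the sequence lines is shown here. The function names are illustrative, and the field layout is assumed only from the headers shown on this page.

```python
def parse_fasta_header(header):
    """Split '>ACCESSION key=value|key=value|...' into accession and fields."""
    if not header.startswith(">"):
        raise ValueError("FASTA headers start with '>'")
    accession, _, rest = header[1:].strip().partition(" ")
    fields = {}
    for item in rest.split("|"):
        key, sep, value = item.partition("=")
        if sep:  # ignore anything that is not key=value
            fields[key.strip()] = value.strip()
    return accession, fields


def declared_length(fields):
    """Return the numeric part of a 'length' field such as '1681bp'."""
    text = fields["length"]
    if text.endswith("bp"):
        text = text[:-2]
    return int(text)


def sequence_length(lines):
    """Total number of residues across wrapped sequence lines."""
    return sum(len(line.strip()) for line in lines)


header = (
    ">SMED30002976 ID=SMED30002976|Name=Integrase_H2C2 domain-containing protein"
    "|organism=Schmidtea mediterranea sexual|type=transcript|length=1681bp"
)
accession, fields = parse_fasta_header(header)
```

Comparing `declared_length(fields)` with `sequence_length(...)` over the sequence lines is a quick way to detect a truncated copy of the record.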

protein sequence of SMED30002976-orf-1

>SMED30002976-orf-1 ID=SMED30002976-orf-1|Name=SMED30002976-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=472bp
MSKQGTSRSRAEKTRGTVSVVVPVTNDIVPGKFTEHDLNLCIEQENLDEF
IYSLVENLYQETEKQIHEKYLNARIVPFTADLAKNAMLKIIEWNFLSHDS
GESDKFEHIWTEDEEPPSCMIDSWAQGYFPKRQASFEIAHHPTEIDECIF
EEDNKLEESEEEIYDSPTIFYHLNENEPTKYTPIKNNQIYEIDAESERNT
ENLRLDTEPEYMINDKVKKPKVPFKPYRGKINFDDLHMLTETLSETEQRI
ELEEYKNAAKLLRGTSGRSNTNRPVENVDYEKKVIQKLFGTTVKNYQQLR
SGDILSNKNDQLVTLMKIPSEQLMLQRVKANHKALDVGMTKGHLSNKENA
SKKKTVHMEKVNHSQIKQEEFRPPISQNDKRPLPMSMLDSIDPVPGVIVQ
EGDRTKRSNVKKLDREKQMEREFLSLRPIKNQTTKATFMIQDIINVNKVN
IKPLSTSSPIPPIADGFKVSQ*
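The relationship between the transcript and SMED30002976-orf-1 can be checked with a small translation sketch. It assumes the standard genetic code (an assumption on our part, although it reproduces the protein shown above) and uses the ORF start at nucleotide 164, which is the query coordinate of the initial methionine in the Planmine self-hit. The fragment below is the first 48 nucleotides of the ORF copied from the transcript sequence; the function names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, start=1):
    """Translate from 1-based position `start` up to the first stop codon."""
    sequence = nucleotides.upper()
    protein = []
    for position in range(start - 1, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE.get(sequence[position:position + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def reading_frame(start):
    """Forward reading frame (1, 2, or 3) for a 1-based start position."""
    return (start - 1) % 3 + 1


ORF_START = 164  # position of the A in the initiating ATG
# Transcript nucleotides 164 to 211 (first 16 codons of the ORF).
orf_fragment = "ATGTCAAAGCAAGGAACATCGAGAAGCCGAGCTGAAAAAACTAGAGGT"
peptide = translate(orf_fragment)
frame = reading_frame(ORF_START)
# 471 residues plus the stop codon occupy 3 * 472 nucleotides.
orf_end = ORF_START + 3 * 472 - 1
```

The fragment translates to the first 16 residues of the protein, the start position falls in frame 2 (as reported in every BLAST hit), and the ORF including its stop codon ends inside the 1681-nt transcript.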
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
Term           Definition
PLANA:0000016  pharynx
PLANA:0000034  epidermis
PLANA:0000099  neuron
PLANA:0000418  head
Vocabulary: INTERPRO
Term       Definition
IPR028042  DUF4639
InterPro
Analysis Name: Schmidtea mediterranea smed_20140614 Interproscan
Date Performed: 2020-05-01
IPR Term   IPR Description                      Source       Source Term  Source Description   Alignment
IPR028042  Protein of unknown function DUF4639  PFAM         PF15479      DUF4639              coord: 4..138; e-value: 5.2E-25; score: 88.4
IPR028042  Protein of unknown function DUF4639  PANTHER      PTHR34438    FAMILY NOT NAMED     coord: 3..363; coord: 327..460
None       No IPR available                     MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 346..365
None       No IPR available                     MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 345..365
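InterPro coordinates are given in protein residues, while the BLASTX query coordinates in the Homology section are given in transcript nucleotides. The following sketch converts between the two, assuming the ORF begins at nucleotide 164 (the query position of the first methionine in the Planmine self-hit). The function name is illustrative.

```python
ORF_START = 164  # transcript position of the first nucleotide of the start codon


def protein_to_transcript(first_residue, last_residue, orf_start=ORF_START):
    """Map a 1-based, inclusive residue range to 1-based transcript nucleotides."""
    if first_residue < 1 or last_residue < first_residue:
        raise ValueError("expected 1 <= first_residue <= last_residue")
    nt_start = orf_start + 3 * (first_residue - 1)
    nt_end = orf_start + 3 * last_residue - 1
    return nt_start, nt_end


# Pfam PF15479 (DUF4639) match at residues 4..138.
pfam_span = protein_to_transcript(4, 138)
# mobidb-lite disorder prediction at residues 346..365.
disorder_span = protein_to_transcript(346, 365)
```

Under this assumption the Pfam match begins at nucleotide 173, the same query position at which several of the C2orf81 alignments start, so the conserved region found by BLAST coincides with the DUF4639 domain.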