Protein CBG04106

Overview
Name: Protein CBG04106
Smed ID: SMED30007456
Length (bp): 2782
Neoblast Clusters

Zeng et al., 2018





 



 

Overview

 

Single-cell RNA-seq of pluripotent neoblasts and their early progeny


We isolated X1 neoblast cells enriched for high piwi-1 expression (Neoblast Population) and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them from high (Nb1) to low (Nb12) piwi-1 expression levels. We further defined the groups of genes that best classified the cells into the 12 clusters and used them to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signatures was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1, which largely confirmed the cell clusters revealed by scRNA-seq.
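The ordering of classes from high to low piwi-1 expression can be illustrated with a minimal sketch. It assumes that clusters are ranked by their mean piwi-1 expression, which the text above does not state explicitly; the function name, the input layout, and the toy values are hypothetical and are not taken from the published analysis.

```python
from collections import defaultdict


def order_clusters_by_marker(cells):
    """Rank clusters by mean marker expression (assumed ranking statistic).

    cells: iterable of (cluster_id, marker_expression) pairs.
    Returns a dict mapping each original cluster id to a label 'Nb<rank>',
    where Nb1 is the cluster with the highest mean expression.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for cluster_id, expression in cells:
        totals[cluster_id] += expression
        counts[cluster_id] += 1

    means = {c: totals[c] / counts[c] for c in totals}
    ranked = sorted(means, key=lambda c: means[c], reverse=True)
    return {c: f"Nb{rank}" for rank, c in enumerate(ranked, start=1)}


# Toy data only: three unnamed clusters with made-up piwi-1 values.
toy_cells = [("a", 2.0), ("a", 4.0), ("b", 9.0), ("c", 0.5), ("c", 1.5)]
labels = order_clusters_by_marker(toy_cells)
```

With the toy input, cluster "b" has the highest mean and is therefore labelled Nb1.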

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, then isolated and performed single-cell RNA-seq on 1,200 individual cells derived from the X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population).




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

The t-SNE plot shows a two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filtering). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Nb, neoblast groups.


Expression of Protein CBG04106 (SMED30007456) in the t-SNE clustered cells

Violin plots show distribution of expression levels for Protein CBG04106 (SMED30007456) in cells (dots) of each of the 12 neoblast clusters.

 



 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Protein CBG04106 (SMED30007456) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Protein CBG04106 (SMED30007456) in cells (dots) of each of the 10 clusters of sub-lethally irradiated X1 and X2 cells.

 



 

Embryonic Expression

Davies et al., 2017




The graph shows average RPKM values per sample.

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods.
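As a reminder of what an RPKM value expresses, the sketch below applies the standard definition (reads per kilobase of transcript per million mapped reads). The function name is hypothetical, and the read count and library size in the example are made-up numbers, not values from this dataset; only the transcript length of 2782 bp comes from this page.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and library size must be positive")
    # 1e9 = 1e3 (bp per kilobase) * 1e6 (reads per million)
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Hypothetical example: 500 reads on a 2782 bp transcript
# in a library of 20 million mapped reads.
example = rpkm(read_count=500, transcript_length_bp=2782,
               total_mapped_reads=20_000_000)
```

Because the value is normalized by both transcript length and library size, it can be compared across samples of different sequencing depth.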

 

Anatomical Expression

PAGE et al., 2020




SMED30007456 has been reported as being expressed in these anatomical structures and/or regions.



PAGE Curations: 3

  
Expressed In | Reference Transcript | Gene Models | Published Transcript | Transcriptome | Publication | Specimen | Lifecycle | Evidence
head | SMED30007456 | SMESG000073042.1 | SmedASXL_014557 | SmedAsxl_ww_GCZZ01 | PMID:27034770, Currie et al., 2016 | whole organism | asexual adult | RNA-sequencing evidence
neuron | SMED30007456 | SMESG000073042.1 | dd_Smed_v4_13178_0_1 | dd_Smed_v4 | PMID:29674431, Fincher et al., 2018 | whole organism | asexual adult | single-cell RNA-sequencing evidence
head | SMED30007456 | SMESG000073042.1 | dd_Smed_v6_13178_0_1 | dd_Smed_v6 | PMID:28171748, Stückemann et al., 2017 | whole organism | asexual adult | RNA-sequencing evidence
Homology
BLAST of Protein CBG04106 vs. Ensembl Celegans
Match: F32B5.7 (pep chromosome:WBcel235:I:2673049:2682797:-1 gene:WBGene00017980.1 transcript:F32B5.7a.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:F32B5.7)

HSP 1 Score: 164.466 bits (415), Expect = 1.327e-41
Identity = 91/263 (34.60%), Positives = 144/263 (54.75%), Query Frame = 1
Query:  493 IKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXX--IDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAME-VTAELWGIVFKDNYIKPGCQYRGRSV 1272
            + L +A +   +F   +DR    L     +N+A+RRYET WLP+ AA+  L     +DVHW+W  HMLSPI Y++DC     K+IDH+ L   E+QK  ++S + W S    EP++F    ++     +  +K + ++  ++ +   F YQVSLPHY   KFL D+V RY +F+ L+  Y DQ + P  D  +IWH H+VHPS Y +D  +    ++  +  + D       +  E +T +LW   F + + + GC +RG + 
Sbjct:   58 VDLVVAAQREANFLRMIDRKAPLLYEPDVVNHALRRYETFWLPMQAAHPDLNVIPPLDVHWVWHTHMLSPIHYQEDCEKLVGKIIDHKLLSSDEIQKRYDSSVRAWDSYCSAEPYDF---LASQTPPTAYKTKCNYDIAGAVQRQRNFNYQVSLPHYTSAKFLSDAVKRYIQFLLLKQTYADQFLTPCYDFDIIWHTHQVHPSSYLRDCTAIFGSLLKHDDTVNDRTKGSKLLKGEALTKKLWTTHFDEPFWRRGCMFRGHNA 317          
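The HSP summary lines in this report follow a regular layout, so they can be read programmatically. The following is a minimal sketch, assuming the two-line layout shown on this page; the function name and the returned keys are hypothetical, and the pattern tolerates either spelling of the "Positives" label.

```python
import re

# Patterns for the two summary lines that precede each alignment block.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(r"Identity = (\d+)/(\d+) .*?Posi?tives = (\d+)/(\d+)")


def parse_hsp(score_line, identity_line):
    """Extract scores and recompute percentages from one HSP summary."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("line does not match the expected HSP layout")
    identical, aligned, positive, _ = map(int, i.groups())
    return {
        "bit_score": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "pct_identity": round(100 * identical / aligned, 2),
        "pct_positive": round(100 * positive / aligned, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 164.466 bits (415), Expect = 1.327e-41",
    "Identity = 91/263 (34.60%), Postives = 144/263 (54.75%), Query Frame = 1",
)
```

The recomputed percentages (91/263 and 144/263) agree with the 34.60% and 54.75% printed in the report for the first C. elegans hit.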
BLAST of Protein CBG04106 vs. Ensembl Celegans
Match: F32B5.7 (pep chromosome:WBcel235:I:2672940:2682797:-1 gene:WBGene00017980.1 transcript:F32B5.7a.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:F32B5.7)

HSP 1 Score: 164.466 bits (415), Expect = 1.327e-41
Identity = 91/263 (34.60%), Positives = 144/263 (54.75%), Query Frame = 1
Query:  493 IKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXX--IDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAME-VTAELWGIVFKDNYIKPGCQYRGRSV 1272
            + L +A +   +F   +DR    L     +N+A+RRYET WLP+ AA+  L     +DVHW+W  HMLSPI Y++DC     K+IDH+ L   E+QK  ++S + W S    EP++F    ++     +  +K + ++  ++ +   F YQVSLPHY   KFL D+V RY +F+ L+  Y DQ + P  D  +IWH H+VHPS Y +D  +    ++  +  + D       +  E +T +LW   F + + + GC +RG + 
Sbjct:   58 VDLVVAAQREANFLRMIDRKAPLLYEPDVVNHALRRYETFWLPMQAAHPDLNVIPPLDVHWVWHTHMLSPIHYQEDCEKLVGKIIDHKLLSSDEIQKRYDSSVRAWDSYCSAEPYDF---LASQTPPTAYKTKCNYDIAGAVQRQRNFNYQVSLPHYTSAKFLSDAVKRYIQFLLLKQTYADQFLTPCYDFDIIWHTHQVHPSSYLRDCTAIFGSLLKHDDTVNDRTKGSKLLKGEALTKKLWTTHFDEPFWRRGCMFRGHNA 317          
BLAST of Protein CBG04106 vs. Ensembl Celegans
Match: F32B5.7 (pep chromosome:WBcel235:I:2673049:2680644:-1 gene:WBGene00017980.1 transcript:F32B5.7b.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:F32B5.7)

HSP 1 Score: 159.844 bits (403), Expect = 2.659e-40
Identity = 88/247 (35.63%), Positives = 137/247 (55.47%), Query Frame = 1
Query:  541 VDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXX--IDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAME-VTAELWGIVFKDNYIKPGCQYRGRSV 1272
            +DR    L     +N+A+RRYET WLP+ AA+  L     +DVHW+W  HMLSPI Y++DC     K+IDH+ L   E+QK  ++S + W S    EP++F  + +    +    +K + ++  ++ +   F YQVSLPHY   KFL D+V RY +F+ L+  Y DQ + P  D  +IWH H+VHPS Y +D  +    ++  +  + D       +  E +T +LW   F + + + GC +RG + 
Sbjct:    2 IDRKAPLLYEPDVVNHALRRYETFWLPMQAAHPDLNVIPPLDVHWVWHTHMLSPIHYQEDCEKLVGKIIDHKLLSSDEIQKRYDSSVRAWDSYCSAEPYDFLASQTPPTAY---KTKCNYDIAGAVQRQRNFNYQVSLPHYTSAKFLSDAVKRYIQFLLLKQTYADQFLTPCYDFDIIWHTHQVHPSSYLRDCTAIFGSLLKHDDTVNDRTKGSKLLKGEALTKKLWTTHFDEPFWRRGCMFRGHNA 245          
BLAST of Protein CBG04106 vs. Ensembl Celegans
Match: F32B5.7 (pep chromosome:WBcel235:I:2672950:2680644:-1 gene:WBGene00017980.1 transcript:F32B5.7b.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:F32B5.7)

HSP 1 Score: 159.844 bits (403), Expect = 2.659e-40
Identity = 88/247 (35.63%), Positives = 137/247 (55.47%), Query Frame = 1
Query:  541 VDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXX--IDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAME-VTAELWGIVFKDNYIKPGCQYRGRSV 1272
            +DR    L     +N+A+RRYET WLP+ AA+  L     +DVHW+W  HMLSPI Y++DC     K+IDH+ L   E+QK  ++S + W S    EP++F  + +    +    +K + ++  ++ +   F YQVSLPHY   KFL D+V RY +F+ L+  Y DQ + P  D  +IWH H+VHPS Y +D  +    ++  +  + D       +  E +T +LW   F + + + GC +RG + 
Sbjct:    2 IDRKAPLLYEPDVVNHALRRYETFWLPMQAAHPDLNVIPPLDVHWVWHTHMLSPIHYQEDCEKLVGKIIDHKLLSSDEIQKRYDSSVRAWDSYCSAEPYDFLASQTPPTAY---KTKCNYDIAGAVQRQRNFNYQVSLPHYTSAKFLSDAVKRYIQFLLLKQTYADQFLTPCYDFDIIWHTHQVHPSSYLRDCTAIFGSLLKHDDTVNDRTKGSKLLKGEALTKKLWTTHFDEPFWRRGCMFRGHNA 245          
BLAST of Protein CBG04106 vs. UniProt/SwissProt
Match: sp|Q9ZQ47|GRDP1_ARATH (Glycine-rich domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=GRDP1 PE=2 SV=1)

HSP 1 Score: 115.161 bits (287), Expect = 8.957e-25
Identity = 82/283 (28.98%), Positives = 129/283 (45.58%), Query Frame = 1
Query:  475 KSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIW--------XXXXXXXXXXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQ-----NLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFL----QLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQS-SVAMEVTAELWGIVFKDNYIKPGCQYRGRS 1269
            + ++  + L  A K  L F   VDR    L   PA+  AI RY   W           + + G L PP+D  WIW  H L+P+ Y  DC  F  +++D+  +        +  +  +W   YPDEP+E D + + +L+ IS  S   +     +L  ++ +   FYYQVS  H     FL+++V RYK F++L    + +   +  +P  D+ LIWH H++HP  Y  D    I K++  +    D    +        T   W   F   Y K G  +RG++
Sbjct:   15 QKIEISVDLLAAAKQHLLFLETVDRN-RWLYDGPALEKAIYRYNACWLPLLVKYSESSSVSEGSLVPPLDCEWIWHCHRLNPVRYNSDCEQFYGRVLDNSGVLSSVDGNCKLKTEDLWKRLYPDEPYELDLD-NIDLEDISEKSSALEKCTKYDLVSAVKRQSPFYYQVSRSHVNSDIFLQEAVARYKGFLYLIKMNRERSLKRFCVPTYDVDLIWHTHQLHPVSYCDDMVKLIGKVLEHDDTDSDRGKGKKLDTGFSKTTAQWEETFGTRYWKAGAMHRGKT 295          
BLAST of Protein CBG04106 vs. UniProt/SwissProt
Match: sp|Q9SZJ2|GRDP2_ARATH (Glycine-rich domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=GRDP2 PE=2 SV=1)

HSP 1 Score: 110.538 bits (275), Expect = 2.106e-23
Identity = 81/274 (29.56%), Positives = 123/274 (44.89%), Query Frame = 1
Query:  499 LSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXX--------IDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFD-PNSSANLDHISCNSKLSQ-NLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVM----LPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQS-SVAMEVTAELWGIVFKDNYIKPGCQYRGRSVK 1275
            L+ A KH L F   VDR    L   PA+  AI RY   WLPL A Y              +D  W+W  H L+P+ YK DC  F  +++D+  +        +  +  +W   YP EP++ D  N+ +    +S   K +  +L  ++ +   F+YQVS  H  +  FL+++V RYK F++L     ++ +    +P  DI LIWH H++H   Y  D    I K++  +    D    +        T   W   F   Y K G   RG + K
Sbjct:   24 LAAAKKHLL-FLGAVDRN-RCLYDGPALQRAIYRYNAYWLPLLAQYTESSSICQGPLVPPLDCEWVWHCHRLNPVRYKTDCEQFYGRVLDNSGVVSSVNGNCKSQTETLWKRLYPTEPYDLDFANAISEPADVSALEKCTTYDLVLAVKRQSPFFYQVSRAHVDNDVFLQEAVARYKAFLYLIKGNRERSIKLFCVPTYDIDLIWHTHQLHAISYCNDLTKMIGKVLEHDDTDSDRSKGKKLDTGFSGTTAQWEETFGRRYWKAGAMNRGNTPK 295          
BLAST of Protein CBG04106 vs. TrEMBL
Match: A0A267G859 (Uncharacterized protein OS=Macrostomum lignano OX=282301 GN=BOX15_Mlig027047g4 PE=4 SV=1)

HSP 1 Score: 298.516 bits (763), Expect = 1.536e-83
Identity = 186/669 (27.80%), Positives = 332/669 (49.63%), Query Frame = 1
Query:  466 FRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIW---XXXXXXXXXXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEV-TAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLF-CANNSILTSKKICLLDYIENFLGTDSQTLEMNFDI---------GKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRS----NDISNVLPYTTEDQFSRFSIRSYWLRGEKK------IQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSV 2400
            F+ +S+Q  + L  A    L F   V+        T A ++A+ RYE  W   +        L PP+DV W    HMLSP++YK+D +  T + ID+   P K   +    +R++W   YP   FE D    A+   I   S    ++R SI++   FYYQVSLPHY+D  FL+ +++RYKK++FL+   P + ++P  D+ L+WH H +HP +Y KDT + + + +  + ++ D          ++ T ELW + +K+++ + GC +RG   + K+  L     Y +S+K +++ +  V +    +L++Y +   +    +   +  L K+ +P      +  +R +     +TG      +++VDRRG+F C N+ ++ S ++ +L+ +E     +   +E    +            +D     +RL+L   PPERH+  LK+  G   +  +P Y  S WG + +       + +    +C + +H +INH+ +    V+IIH   +  S+VQV Y N+++A+   +G DQLP  +   + + + L+ ++ +RAF++KD+ GDWG + G W   IG  +G         +S  +P           +R   LR   K      ++  ++PD +    + + D+S DL +G + +S  C D+ Q V
Sbjct:   19 FKERSIQISVNLVDAAIRQLDFLEAVNARPLLFDPTVA-DHAVCRYEKYWLPLVQKEGQSKDLMPPLDVRWAMHCHMLSPLAYKEDVVRLTGRFIDNTLHPDKVTARLLAQTRELWERTYPGVSFELD-LEWASRQPIEYQSVSRFDIRGSISRQKAFYYQVSLPHYRDGDFLKSALLRYKKYLFLKKNNPGEFLVPCYDMDLVWHSHMLHPVLYEKDTTAVLGRALNHDDSVNDRGEGSKLFKSDLRTRELWRLFYKEDFARQGCMFRGDPPRGKLQQLPQESLYMLSSKRSRISLNLVELTLPPELNRYKVVIES--YNVFGGAERLFKMSNPNTVAERR--ERPLAVQDFVTGHHRFLKVDLVDRRGVFGCCNSDVMCSVQLPMLERLEQHYSVEPIAIEEELRLTDYRRPAAAAGPEDSVWAKVRLSLTCSPPERHSCLLKLEPGSYLVAVIPEYVESLWGPVPL-----GRLPAGQENQCLVATHRLINHMNQLAFTVRIIHSTRLTMSVVQVFYANQMIAVGHLVGTDQLPLVSQV-SKQCIALDVARMQRAFLVKDHLGDWGVVQGTW---IGFRKGVPGKPGVKGVSRGVPGQPGSPGC-LQLRFNGLRARGKSGALPAMEQRNLPDDLGHMKITVRDMSVDLKSGVVSVSASCKDVGQLV 671          
BLAST of Protein CBG04106 vs. TrEMBL
Match: A0A1I8HGI9 (Uncharacterized protein OS=Macrostomum lignano OX=282301 GN=BOX15_Mlig025032g1 PE=4 SV=1)

HSP 1 Score: 296.59 bits (758), Expect = 2.522e-83
Identity = 185/669 (27.65%), Positives = 332/669 (49.63%), Query Frame = 1
Query:  466 FRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIW---XXXXXXXXXXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEV-TAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLF-CANNSILTSKKICLLDYIENFLGTDSQTLEMNFDI---------GKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRS----NDISNVLPYTTEDQFSRFSIRSYWLRGEKK------IQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSV 2400
            F+ +S+Q  + L  A    L F   V+        T A ++A+ RYE  W   +        L PP+DV W    HMLSP++YK+D +  T + ID+   P K   +    +R++W   YP   FE D    A+   I   S    ++R SI++   FYYQVSLPHY+D  FL+ +++RYKK++FL+   P + ++P  D+ L+WH H +HP +Y KDT + + + +  + ++ D          ++ T ELW + +K+++ + GC +RG   + K+  L     Y +S+K +++ +  V +    +L++Y +   +    +   +  L K+ +P      +  +R +     +TG      +++VDRRG+F C N+ ++ S ++ +L+ +E     +   +E    +            +D     +RL+L   PPERH+  LK+  G   +  +P Y  S WG + +       + +    +C + +H +INH+ +    V+IIH   +  S+VQV Y ++++A+   +G DQLP  +   + + + L+ ++ +RAF++KD+ GDWG + G W   IG  +G         +S  +P           +R   LR   K      ++  ++PD +    + + D+S DL +G + +S  C D+ Q V
Sbjct:   19 FKERSIQISVNLVDAAIRQLDFLETVNARPLLFDPTVA-DHAVCRYEKYWLPLVQKEGQGKDLMPPLDVRWAMHCHMLSPLAYKEDVVRLTGRFIDNTLHPDKVTARLLAQTRELWERTYPGVSFELD-LEWASRQPIEYQSVSRFDIRGSISRQKAFYYQVSLPHYRDGDFLKSALLRYKKYLFLKKNNPGEFLVPCYDMDLVWHSHMLHPVLYEKDTTAVLGRALNHDDSVNDRGEGSKLFKSDLRTRELWRLFYKEDFARQGCMFRGDPPRGKLQQLPQESLYMLSSKRSRISLNLVELTLPPELNRYKVVIES--YNVFGGAERLFKMSNPNTVAERR--ERPLAVQDFVTGHHRFLKVDLVDRRGVFGCCNSDVMCSVQLPMLERLEQHYSVEPIAIEEELRLTDYRRPAAAAGPEDPVWAKVRLSLTCSPPERHSCLLKLEPGSYLVAVIPEYVESLWGPVPL-----GRLPAGQENQCLVATHRLINHMNQLAFTVRIIHSTRLTMSVVQVFYADQMIAVGHLVGTDQLPLVSQV-SKQCIALDVARMQRAFLVKDHLGDWGVVQGTW---IGFRKGVPGKPGVKGVSRGVPGQPGSPGC-LQLRFNGLRARGKSGALPAMEQRNLPDDLGHMKITVRDMSVDLKSGVVSVSASCKDVGQLV 671          
BLAST of Protein CBG04106 vs. TrEMBL
Match: A0A1I8HKD2 (OTU domain-containing protein OS=Macrostomum lignano OX=282301 PE=4 SV=1)

HSP 1 Score: 272.322 bits (695), Expect = 3.526e-72
Identity = 180/669 (26.91%), Positives = 321/669 (47.98%), Query Frame = 1
Query:  466 FRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIW---XXXXXXXXXXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEV-TAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLF-CANNSILTSKKICLLDYIENFLGTDSQTLEMNFDI---------GKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRS----NDISNVLPYTTEDQFSRFSIRSYWLRGEKK------IQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSV 2400
            F+ +S+Q  + L  A    L F   V+        T A ++A+ RYE  W   +        L PP+DV W    HMLSP++YK+D +  T + ID+   P K   +    +R++W   YP   FE D    A+   I   S    ++R SI++   FYYQVSLPHY+D  FL+ +++RYKK++FL+   P + ++P  D+ L+WH H +HP +Y KDT + + + +  + ++ D          ++ T ELW + +K+++ + GC +RG   + K+  L     Y +S+K +++ +  V +    +L++Y +   +    +   +  L K+ +P      +  +R +     +TG      +++VDRRG+F C N+ ++ S ++ +L+ +E     +   +E    +            +D     +RL+L   PPERH+  LK+  G   +  +P Y  S WG + +       + +    +C   S                IH   +  S+VQV Y ++++A+   +G DQLP  +   + + + L+ ++ +RAF++KD+ GDWG + G W   IG  +G         +S  +P           +R   LR   K      ++  ++PD +    + + D+S DL +G + +S  C D+ Q V
Sbjct:   19 FKERSIQISVNLVDAAIRQLDFLETVNARPLLFDPTVA-DHAVCRYEKYWLPLVQKEGQGKDLMPPLDVRWAMHCHMLSPLAYKEDVVRLTGRFIDNTLHPDKVTARLLAQTRELWERTYPGVSFELD-LEWASRQPIEYQSVSRFDIRGSISRQKAFYYQVSLPHYRDGDFLKSALLRYKKYLFLKKNNPGEFLVPCYDMDLVWHSHMLHPVLYEKDTTAVLGRALNHDDSVNDRGEGSKLFKSDLRTRELWRLFYKEDFARQGCMFRGDPPRGKLQQLPQESLYMLSSKRSRISLNLVELTLPPELNRYKVVIES--YNVFGGAERLFKMSNPNTVAERR--ERPLAVQDFVTGHHRFLKVDLVDRRGVFGCCNSDVMCSVQLPMLERLEQHYSVEPIAIEEELRLTDYRRPAAAAGPEDPVWAKVRLSLTCSPPERHSCLLKLEPGSYLVAVIPEYVESLWGPVPL-----GRLPAGQENQCLCAS----------------IHSTRLTMSVVQVFYADQMIAVGHLVGTDQLPLVSQV-SKQCIALDVARMQRAFLVKDHLGDWGVVQGTW---IGFRKGVPGKPGVKGVSRGVPGQPGSPGC-LQLRFNGLRARGKSGALPAMEQRNLPDDLGHMKITVRDMSVDLKSGVVSVSASCKDVGQLV 655          
BLAST of Protein CBG04106 vs. TrEMBL
Match: R7T5P1 (Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_202493 PE=4 SV=1)

HSP 1 Score: 253.062 bits (645), Expect = 4.202e-68
Identity = 187/667 (28.04%), Positives = 315/667 (47.23%), Query Frame = 1
Query:  493 IKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXX--XXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKE--LQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIW---IQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQ---PFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENF--------LGTDSQTLEMNFDI---GKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRG----------SRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSV 2400
            + L  A+   L+F  +V+R+ E L + P   NAIRRYET+WLPLAA         P+D+HW+W  HML+P  Y++DC+    K+IDH  L   E   +KA + +  +W      EPF    ++      +   SK S  L+++I++   FYYQVSLPHY+D  FL+ ++ RYKK++ L+ + PD+ ++P  D  LIWH H++HP +Y  DT + + +++  + ++ D      + SS A   T +LW   + + +   GC +RG     K+ ++S A  Y   +K + + I +V + P    D+   + S  +         S  L+K K  E  W  K    FV N++      +    +++D++G  C     L S + C    I NF        +    Q +E+   +   G    I L  +  +L +  P      L +  G      +P      WG I +          DN   C + SH + NH ++ +  +++IH I +  S VQV + +++++++  IG DQLPT +     +   L+ S  ERA IIKD+ GDW   +GRW   +G  +G          + S+ I  V            S+  Y LR     +   +    + ++  ++D+  D+    + +  +C  I Q++
Sbjct:   50 VDLVEASVRQLNFLCQVNRHPE-LYSGPVALNAIRRYETVWLPLAAQCHGNRLIAPLDIHWVWHCHMLAPYFYEKDCLKLAGKIIDHSLLAPDEHEYKKALKHTESLWSQHANGEPFNV-LSTDCPPRCMEYTSKCSYQLQDAIDRQRMFYYQVSLPHYRDSVFLKKALSRYKKYLALKRRNPDEFLVPCYDFDLIWHSHQLHPLLYRNDTGAILGRMLNHDDSVNDRSENSKLNSSDAN--TRDLWRKAYNEEFAACGCMFRGDPPDGKLGTMSKAQIYGAVSKKSSIAIHSVKLQPDHINDEQRQKLSLKVCLTHRDMTSSETLLKFKGAE--WEPKSQCEFVCNSAF----HQFLQFDLMDKKGFLC-----LGSNQSC---GIHNFPLAERVDAVNPAGQKIELAIPLHEGGNGPSIGL-SVGFSLTIGQPTIGPCMLSLQSGPFQSCTIPENAEKLWGPIPL-----PKRPVDNTNTCNVASHRLFNHAQQLMFTIRVIHSIPLMMSAVQVYHLDQMISVAHLIGTDQLPTDDMIVGMKCPTLDYSDGERALIIKDHAGDWAIAVGRW---VGFRKGVPGRPGIPGTADSDPIPGVP--GVPGSPGHLSVFVYNLRKSSYTKMDILDKPSSGYAFEVDDILVDMKHANMTLLPECDTIGQNI 687          
BLAST of Protein CBG04106 vs. TrEMBL
Match: V4A3I0 (Uncharacterized protein OS=Lottia gigantea OX=225164 GN=LOTGIDRAFT_153725 PE=4 SV=1)

HSP 1 Score: 250.366 bits (638), Expect = 3.980e-67
Identity = 162/574 (28.22%), Positives = 287/574 (50.00%), Query Frame = 1
Query:  487 SGIKLSL----ATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXX--XXXXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEV-TAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNV---SIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPN-TTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRG 2175
            +GIK SL    A+   L F  ++ ++ + L   P +  AI RY  +WLPLAA     +L  P+D+ W W  HML P++Y++DC+A   K++D +    +EL+K    ++K W   YP+ P E            +   + S N+ E+  +   FYYQVSLPHY+D  FL+++++RYKK++FL+L+ P   ++P  DI LIWH H++HP  Y  DT + + ++   +  + D          ++ T  +W   F +++   G  YRG   + K+  ++  DS++  TK +++ I+++   ++ P+ K  +  I  +     LQ  ++    LK P+  W       F T  S       +    + ++ G  C  + +   + +   D + +   T   +  +N  +     ++L  +    I   P R    L++V G      +P+   S WG + +         +DN+  C + SH++ NH+ K V   +IIH I +  S+V V +++++  ++  +G DQLP P+  T +   + L P + ERA +IK+N GDWG ++G W   +G  +G
Sbjct:    8 NGIKFSLDLPQASLLHLDFLKQISKHPD-LEEGPILTQAIHRYINLWLPLAAKHDQEVLPAPLDIQWAWHCHMLCPVTYREDCMALVGKVVDSKLFTPQELKKIYPVTQKYWSEAYPNHPLEV--GGDVVDPRYTTLPQTSYNIAEAAGRQKVFYYQVSLPHYKDESFLKNALLRYKKYLFLKLKNPGMFLVPCYDIDLIWHTHQLHPLKYQNDTTNLLGRVFNHDDTVNDRTEGSKLYNADLETRTVWKNTFNESFSMYGAMYRGDPPRGKLYGMTQEDSFYACTKMSEIVIESIQLSNLPPNAKKFRLKISIAVNKKELQTVAS----LKGPKTSWNGLAKFNFDTKQS------NNLKFRLYEKTGFLCLGSDLSFGESV--YDMLPHIEATPPISSRLNIAVTLGTSVNLNIVGTVQI---PRRGWCTLQLVPGSYEQCMMPVDVESMWGPVPL---PKLPPGTDNV--CDVASHTLRNHLGKPVFTARIIHSIPLMTSVVHVFFKDKLSVVAHLVGSDQLPLPSLVTSSKSNITLNPRRGERAVLIKNNEGDWGVVVGSW---VGFKKG 555          
BLAST of Protein CBG04106 vs. Ensembl Nematostella
Match: EDO46334 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7RQB8])

HSP 1 Score: 132.494 bits (332), Expect = 1.430e-31
Identity = 79/239 (33.05%), Positives = 124/239 (51.88%), Query Frame = 1
Query:  580 INNAIRRYETIWXXXXXXXX-------XXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHA----IPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRG 1263
            + NAIRRYE  WLPLA+          +L  P+DV W+W VHML+P+ Y  DC     K+I+H+  P           RK+W  ++PDEPF++    + +       SKL  ++  +  +  +FYY VSL HY+D  FL  ++ RY++ + ++   P+   +P  D  LIWH H+++P  Y  D  S + K++  + +    +P     +S +   +  E  G+VF     KPG  YRG
Sbjct:   43 LKNAIRRYEQFWLPLASDLTDEHIPLSVLSAPLDVAWVWHVHMLAPVRYHADCERIVGKIINHKFDPYSPRDSLLHRGRKLWNKRHPDEPFDYHATKTVS----GYTSKLQYDICAASLRQSKFYYNVSLTHYRDPVFLTAALERYEQHIQIKKANPELFAVPCYDFDLIWHAHQLNPLTYRDDMISILGKVLSHDDSETGRVPGAFLYESEMRTRLAWEKAGLVFA----KPGTMYRG 273          
BLAST of Protein CBG04106 vs. Ensembl Nematostella
Match: EDO35759 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7SKP3])

HSP 1 Score: 125.946 bits (315), Expect = 1.112e-29
Identity = 86/257 (33.46%), Positives = 134/257 (52.14%), Query Frame = 1
Query:  511 TKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXX-----XXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGR 1266
            TK  L F A+  RY   +     + NA+RRYE +WLPL   +G +       P+DV W+W +HML+P +Y+++C        +HR    ++L++A   SR++WV  YP EPFE   N S  +   S  +++  +   +  K+ EF YQVSLPH+ D  FL+ +  RY +++ LQ + P+ V+ P +DI LI H H+++P  Y+  +      +IL      DL   ++        E  GI     Y +PG   R R
Sbjct:   24 TKAHLKFVAKATRYPLLIDGC-HLENAVRRYEKLWLPLCRRHGTMSCDEWAAPLDVAWVWILHMLAPTNYRRECSRLIKNPPNHRSKSGEDLEQALRLSRRLWVDAYPREPFEV--NLSVPIAKDSFITRIRYDFNSASIKYSEFCYQVSLPHFCDDNFLDVATQRYVRYLDLQSKRPNVVLRPPLDIKLILHSHQLNPIFYASQS-----GVILGEVQDIDLALSEACEDSRRAFECEGI----EYARPGTICRER 268          
BLAST of Protein CBG04106 vs. Planmine SMEST
Match: SMESG000073042.1 (SMESG000073042.1)

HSP 1 Score: 1520.75 bits (3936), Expect = 0.000e+0
Identity = 781/781 (100.00%), Positives = 781/781 (100.00%), Query Frame = 1
Query:  421 MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVXXXXXXXXXXXXXRTSRTEAVSIASHSPEIKPPSESFSGSLYLXXXXXXXXXXXXHDFLIRKNRNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNRVKVAEVDSINCPETSEE 2763
            MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWLPLAAAYGLLGPPIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVCLAFSCAALLALCRTSRTEAVSIASHSPEIKPPSESFSGSLYLPPNSEIINIPPIHDFLIRKNRNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNRVKVAEVDSINCPETSEE
Sbjct:    1 MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWLPLAAAYGLLGPPIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVCLAFSCAALLALCRTSRTEAVSIASHSPEIKPPSESFSGSLYLPPNSEIINIPPIHDFLIRKNRNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNRVKVAEVDSINCPETSEE 781          
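The query coordinates in these alignments are nucleotide positions on the 2782 bp transcript, while the subject coordinates are amino acid positions. A short sketch of the arithmetic, assuming a forward-strand translated query and the usual convention that a start position of 1, 2, or 3 modulo 3 corresponds to frame 1, 2, or 3; the function name is hypothetical.

```python
def translated_span(query_start, query_end):
    """Return (reading frame, number of codons) for a forward-strand span.

    Coordinates are 1-based and inclusive, as printed in the report.
    """
    if query_end < query_start:
        raise ValueError("only forward-strand spans are handled here")
    n_nucleotides = query_end - query_start + 1
    if n_nucleotides % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    frame = (query_start - 1) % 3 + 1
    return frame, n_nucleotides // 3


# Query span of the full-length Planmine hit above.
frame, n_codons = translated_span(421, 2763)
```

For the span 421 to 2763 this gives frame 1 and 781 codons, which matches the "Query Frame = 1" label and the 781-residue subject in that hit.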
BLAST of Protein CBG04106 vs. Planmine SMEST
Match: SMESG000073042.1 (SMESG000073042.1)

HSP 1 Score: 1512.28 bits (3914), Expect = 0.000e+0
Identity = 779/781 (99.74%), Positives = 780/781 (99.87%), Query Frame = 1
Query:  421 MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVXXXXXXXXXXXXXRTSRTEAVSIASHSPEIKPPSESFSGSLYLXXXXXXXXXXXXHDFLIRKNRNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNRVKVAEVDSINCPETSEE 2763
            MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWLPLAAAYGLLGPPIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVCLAFSCAALLALCRTSRTEAVSIASHSPEIKPPSESFSGSLYLPPNSEIINIPPIHDFLIRK +NSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNRVKVAEVDSINCPETSEE
Sbjct:    1 MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWLPLAAAYGLLGPPIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVCLAFSCAALLALCRTSRTEAVSIASHSPEIKPPSESFSGSLYLPPNSEIINIPPIHDFLIRK-KNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNRVKVAEVDSINCPETSEE 780          
BLAST of Protein CBG04106 vs. Planmine SMEST
Match: SMESG000073042.1 (SMESG000073042.1)

HSP 1 Score: 1493.79 bits (3866), Expect = 0.000e+0
Identity = 764/764 (100.00%), Positives = 764/764 (100.00%), Query Frame = 1
Query:  421 MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVXXXXXXXXXXXXXRTSRTEAVSIASHSPEIKPPSESFSGSLYLXXXXXXXXXXXXHDFLIRKNRNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNR 2712
            MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWLPLAAAYGLLGPPIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVCLAFSCAALLALCRTSRTEAVSIASHSPEIKPPSESFSGSLYLPPNSEIINIPPIHDFLIRKNRNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNR
Sbjct:    1 MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWLPLAAAYGLLGPPIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVCLAFSCAALLALCRTSRTEAVSIASHSPEIKPPSESFSGSLYLPPNSEIINIPPIHDFLIRKNRNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNR 764          
BLAST of Protein CBG04106 vs. Planmine SMEST
Match: SMESG000073042.1 (SMESG000073042.1)

HSP 1 Score: 1485.32 bits (3844), Expect = 0.000e+0
Identity = 762/764 (99.74%), Positives = 763/764 (99.87%), Query Frame = 1
Query:  421 MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXXIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVXXXXXXXXXXXXXRTSRTEAVSIASHSPEIKPPSESFSGSLYLXXXXXXXXXXXXHDFLIRKNRNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNR 2712
            MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWLPLAAAYGLLGPPIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVCLAFSCAALLALCRTSRTEAVSIASHSPEIKPPSESFSGSLYLPPNSEIINIPPIHDFLIRK +NSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNR
Sbjct:    1 MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWLPLAAAYGLLGPPIDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLVLEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTEDQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHISRQCTDIAQSVCLAFSCAALLALCRTSRTEAVSIASHSPEIKPPSESFSGSLYLPPNSEIINIPPIHDFLIRK-KNSNNFSSTFQDPTKLLMCIGVGINFSEYLPSFRNLQKTNR 763          
BLAST of Protein CBG04106 vs. Planmine SMEST
Match: SMESG000047460.1 (SMESG000047460.1)

HSP 1 Score: 220.705 bits (561), Expect = 2.218e-60
Identity = 177/586 (30.20%), Positives = 284/586 (48.46%), Query Frame = 1
Query:  472 YKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSATPAINNAIRRYETIWXXXXXXXXXXXXX-IDVHWIWCVHMLSPISYKQDCIAFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEF-DPNSSANLDHISCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQLQYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIWIQSSVAME---VTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFISTKSAQLQI---KNVSIDPHQKLD----KYSIQFSNPLVALQPFSNYLIKLKSPERFWPHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDYIENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLGENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKEVLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNS-EMLVLEPSKSERAFIIKDNGGDWGFIIGRW-DQNIGV---HRGS 2178
            Y +V   I L  A +  L F  +++  + HL     IN A+RRYET+WLPL   Y  +    +D++W+W VHMLSPISYK D I    K+IDH+ L     + + +  + +W  KY  EPFE  DP            SK S ++  + ++   FYYQVSLPH++  KFL  S+ RYK F+ L+ +YPD+ ++P  DI +IWH H+VHPS+Y  D    + KI   + +  D     +S  ME    T ELW   FK ++   GC YRG S K K+  + L     I +K   + +      S   H KL+     +S++ S   +    F N + K              + +  ++  TG    F + ++ +  L C     L++  +   +  E     +S+ + ++ DI  +K+    ++ +  I +          ++LGE  ++ + L   +  G I +      +       E  + SHSV N   K+   V + H +++++SIVQV  + +I  I+  IG +QLP+P    ++ +   +  S + RA +IKDN GDW  ++ RW    +GV    RGS
Sbjct:    5 YHAVNFNINLLEAAEKELLFLKKINE-LSHLYDPNIINEALRRYETLWLPLVVNYDGIITAPLDIYWVWHVHMLSPISYKNDLIKKFGKIIDHKFLDSDIYESSIDRIKTLWKCKYISEPFEISDPEMRPKFV-----SKFSYDIESASSRQSSFYYQVSLPHFKSKKFLSKSIDRYKMFLHLKQKYPDKFIVPCYDIDIIWHTHQVHPSIYHHDCLVFLGKIFPHDDSFTDR--SANSKLMESEKFTRELWLENFKQSFSNNGCMYRGESSKGKLNDMPLNHLKTIVSKLLHISVDISSKKSNASHHKLEFHLTSHSLK-SKIYLEYDYFHNKIAK------------NNQILIQSNKNTG----FELHLIKKYPL-CGIKKTLSTTFVPTAESFEKLYEHNSKAI-VHCDIWDKKE-KFDEVEIAFIRNSMTYGCFDGYVILGEFLVVLLDLSNENLLGPIPIPKSDIQNF------ESLMASHSVKNSNGKQAFDVNVFHIMSLNQSIVQVFCEKKICVIAHLIGNEQLPSPAQVDSTFKYPAMSGSGASRAMLIKDNFGDWMIVVSRWLKMKVGVPGNMRGS 556          
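The percentages on each HSP summary line follow directly from the reported counts, and the Query coordinates are nucleotide positions because the search is a BLASTX of the transcript. The following is a minimal sketch for checking those numbers, assuming frame-1 Query coordinates are 1-based and inclusive; the function names are illustrative and not part of any BLAST tool.

```python
def percent(matches: int, aligned: int) -> float:
    """Share of aligned columns as a percentage, rounded to two decimals."""
    return round(100.0 * matches / aligned, 2)


def query_codons(start: int, end: int) -> int:
    """Number of codons spanned by 1-based, inclusive nucleotide coordinates."""
    span = end - start + 1
    if span % 3:
        raise ValueError("span is not a whole number of codons")
    return span // 3


# HSP against SMESG000073042.1: 762 identities, 763 positives, 764 columns
print(percent(762, 764), percent(763, 764))
# HSP against SMESG000047460.1: 177 identities, 284 positives, 586 columns
print(percent(177, 586), percent(284, 586))
# Query 421..2712 and Query 472..2178, expressed in codons
print(query_codons(421, 2712), query_codons(472, 2178))
```

For the second HSP the Query span covers fewer codons than the 586 aligned columns, because the column count also includes gap positions.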
The following BLAST results are available for this feature:
BLAST of Protein CBG04106 vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 0
BLAST of Protein CBG04106 vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 4
Match Name  E-value    Identity (%)  Description
F32B5.7     1.327e-41  34.60         pep chromosome:WBcel235:I:2673049:2682797:-1 gene:...
F32B5.7     1.327e-41  34.60         pep chromosome:WBcel235:I:2672940:2682797:-1 gene:...
F32B5.7     2.659e-40  35.63         pep chromosome:WBcel235:I:2673049:2680644:-1 gene:...
F32B5.7     2.659e-40  35.63         pep chromosome:WBcel235:I:2672950:2680644:-1 gene:...
BLAST of Protein CBG04106 vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
BLAST of Protein CBG04106 vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 0
BLAST of Protein CBG04106 vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 0
BLAST of Protein CBG04106 vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 0
BLAST of Protein CBG04106 vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 2
Match Name             E-value    Identity (%)  Description
sp|Q9ZQ47|GRDP1_ARATH  8.957e-25  28.98         Glycine-rich domain-containing protein 1 OS=Arabid...
sp|Q9SZJ2|GRDP2_ARATH  2.106e-23  29.56         Glycine-rich domain-containing protein 2 OS=Arabid...
BLAST of Protein CBG04106 vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match Name  E-value    Identity (%)  Description
A0A267G859  1.536e-83  27.80         Uncharacterized protein OS=Macrostomum lignano OX=...
A0A1I8HGI9  2.522e-83  27.65         Uncharacterized protein OS=Macrostomum lignano OX=...
A0A1I8HKD2  3.526e-72  26.91         OTU domain-containing protein OS=Macrostomum ligna...
R7T5P1      4.202e-68  28.04         Uncharacterized protein OS=Capitella teleta OX=283...
V4A3I0      3.980e-67  28.22         Uncharacterized protein OS=Lottia gigantea OX=2251...
BLAST of Protein CBG04106 vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 0
BLAST of Protein CBG04106 vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 0
BLAST of Protein CBG04106 vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
BLAST of Protein CBG04106 vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 2
Match Name  E-value    Identity (%)  Description
EDO46334    1.430e-31  33.05         Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
EDO35759    1.112e-29  33.46         Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
BLAST of Protein CBG04106 vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 0
BLAST of Protein CBG04106 vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match Name        E-value    Identity (%)  Description
SMESG000073042.1  0.000e+0   100.00        SMESG000073042.1
SMESG000073042.1  0.000e+0   99.74         SMESG000073042.1
SMESG000073042.1  0.000e+0   100.00        SMESG000073042.1
SMESG000073042.1  0.000e+0   99.74         SMESG000073042.1
SMESG000047460.1  2.218e-60  30.20         SMESG000047460.1
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30007456 ID=SMED30007456|Name=Protein CBG04106|organism=Schmidtea mediterranea sexual|type=transcript|length=2782bp
AACGATTGCAGTTTTCGTCTGTTGTTGTGTTTTTTTCTTCTGTTATATGT
TGTTGTTGTACTCTGAATGTTTGGAGTTATTGTTTCTTTGGCTATCGAGT
TCTGACTGTTGTTGACATTTATTGGTCATACATCTACAACCATTTCTGGG
CTTGCATTATTGTATAAATTCGTCAGAGATAACTTAGCTCTGCAATCAAA
TGGATTTTATTTGAGAGTTCACCGGCAATGTTGATTAGGATTTCGATTAC
AGTTCAACACACGATATTATTTGTTTCCATTGATGCAAGATAGAAGATAC
ATTTCACCCAATTGTAACTGGAGATAATGGATTAATTCTACTACTGGTTC
AGTTCAAGTAGTAGATTTATTTTTGGCATTTATTAAAATTCTACTGAAAT
TTTCAAAAGATTTGATTAAGATGCAAAAATCAAAAGAACAACCAACTGAT
GTTAGTTCTCCTAAATTTCGCTACAAATCCGTTCAATCAGGGATAAAATT
ATCATTGGCCACGAAACACTTTTTAAGCTTTTTTGCTGAAGTTGACAGAT
ATATAGAACATTTGTCTGCTACTCCCGCAATTAATAATGCAATAAGACGA
TATGAAACAATATGGTTACCCTTGGCAGCAGCGTACGGCTTGTTAGGGCC
CCCTATCGATGTACATTGGATATGGTGTGTCCATATGTTATCTCCAATAA
GCTACAAACAAGACTGTATTGCATTTACTAATAAATTAATCGACCATCGA
TGTTTGCCCTTGAAAGAATTACAAAAAGCTAGAGAAACATCAAGAAAAAT
ATGGGTTTCCAAATATCCAGATGAACCATTTGAATTTGATCCGAATTCCT
CAGCAAATTTAGATCACATTTCTTGCAATTCAAAATTATCTCAAAACCTC
CGAGAATCAATCAACAAGCATTTTGAGTTTTATTATCAAGTGTCCTTACC
ACATTATCAAGATATAAAATTCCTTGAAGATTCAGTTGTTAGATATAAGA
AATTTGTCTTTTTACAACTTCAATATCCGGACCAAGTCATGTTACCTTTA
ATTGACATAGCTCTGATATGGCATTGTCACAAGGTTCATCCTTCAGTTTA
TTCTAAAGACACAAAATCATCAATTGATAAAATAATCTTGAAAAATCATG
CGATTCCAGATTTAATATGGATTCAGTCATCTGTTGCGATGGAGGTTACT
GCTGAACTTTGGGGAATAGTATTCAAAGACAATTATATAAAGCCAGGTTG
CCAGTATCGCGGACGTTCAGTGAAGGATAAAATAATGAGTCTAAGCCTTG
CAGATTCTTACTTTATTTCTACTAAATCGGCTCAATTACAGATTAAAAAC
GTGTCCATTGATCCTCACCAAAAACTCGACAAATATTCAATTCAATTTTC
CAATCCATTAGTAGCTCTGCAGCCGTTTTCTAATTATTTGATAAAACTTA
AATCTCCAGAAAGGTTTTGGCCACATAAACCCGGTCAAAGGTTTGTTACG
AACACCTCTTTAATTACTGGGACGGTGGAATCGTTCACTATGGAAGTTGT
CGATAGACGGGGATTATTTTGTGCCAACAATTCAATTTTGACATCAAAGA
AAATTTGTCTATTGGATTATATTGAAAACTTTTTGGGCACTGACAGTCAA
ACACTGGAAATGAATTTCGATATTGGAAAAGAAAAAGATATTTCTCTCGG
TGATATTAGACTTACACTGATTTTACATCCCCCTGAACGTCATGCGACAA
GATTGAAAATGGTACTTGGTGAAAACGCATTGTTACGTGTCCCTCTTTAT
CAAAATAGTTTTTGGGGTGCAATAAGTGTTTTTAATGATATGACAACTTC
AATTACATCTGATAATCTTGTTGAATGTTATCTTCTTTCTCACAGTGTAA
TAAATCACATAAAAAAAGAGGTTTTGGTCGTAAAAATTATACATTGTATT
GCAATGGATAAATCAATTGTGCAAGTTTTGTATCAAAATGAAATTGTTGC
GATTTCAGAAACTATCGGTTTAGATCAATTGCCAACACCAAATACAACTC
ATAACAGCGAAATGCTAGTTTTGGAACCGAGCAAATCTGAACGAGCTTTC
ATTATCAAAGATAACGGTGGAGATTGGGGATTTATTATTGGGAGATGGGA
CCAAAACATTGGTGTCCATCGAGGATCTCGGTCTAATGACATTTCAAATG
TGTTACCCTATACCACTGAAGACCAGTTTTCAAGGTTCTCTATCCGATCG
TATTGGTTGAGAGGCGAAAAGAAAATTCAATATTCACACATACCAGATAG
CATGAACAGTTGGAGTGTTGTTTTAAATGATGTTTCTTTTGATCTTTCAA
CCGGAGAAATCCACATCAGTCGCCAGTGCACCGATATTGCACAGAGTGTC
TGTTTAGCTTTCAGCTGTGCAGCTTTGCTTGCTCTGTGTAGGACATCTCG
CACAGAAGCTGTATCAATTGCTTCACACAGTCCGGAAATAAAGCCCCCGT
CAGAAAGTTTCAGTGGATCTTTATATTTACCTCCAAATTCCGAAATAATT
AACATACCCCCTATTCACGATTTCCTGATTAGGAAAAATAGAAACTCGAA
TAATTTTTCAAGCACTTTTCAAGACCCAACAAAACTTTTGATGTGCATCG
GCGTCGGCATAAATTTCAGTGAATATTTACCAAGCTTTCGAAATTTACAG
AAGACGAACCGAGTAAAAGTCGCCGAGGTCGACAGTATCAATTGTCCGGA
GACCTCAGAAGAATAAGAAAATTTATGATGAT

protein sequence of SMED30007456-orf-1

>SMED30007456-orf-1 ID=SMED30007456-orf-1|Name=SMED30007456-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=782bp
MQKSKEQPTDVSSPKFRYKSVQSGIKLSLATKHFLSFFAEVDRYIEHLSA
TPAINNAIRRYETIWLPLAAAYGLLGPPIDVHWIWCVHMLSPISYKQDCI
AFTNKLIDHRCLPLKELQKARETSRKIWVSKYPDEPFEFDPNSSANLDHI
SCNSKLSQNLRESINKHFEFYYQVSLPHYQDIKFLEDSVVRYKKFVFLQL
QYPDQVMLPLIDIALIWHCHKVHPSVYSKDTKSSIDKIILKNHAIPDLIW
IQSSVAMEVTAELWGIVFKDNYIKPGCQYRGRSVKDKIMSLSLADSYFIS
TKSAQLQIKNVSIDPHQKLDKYSIQFSNPLVALQPFSNYLIKLKSPERFW
PHKPGQRFVTNTSLITGTVESFTMEVVDRRGLFCANNSILTSKKICLLDY
IENFLGTDSQTLEMNFDIGKEKDISLGDIRLTLILHPPERHATRLKMVLG
ENALLRVPLYQNSFWGAISVFNDMTTSITSDNLVECYLLSHSVINHIKKE
VLVVKIIHCIAMDKSIVQVLYQNEIVAISETIGLDQLPTPNTTHNSEMLV
LEPSKSERAFIIKDNGGDWGFIIGRWDQNIGVHRGSRSNDISNVLPYTTE
DQFSRFSIRSYWLRGEKKIQYSHIPDSMNSWSVVLNDVSFDLSTGEIHIS
RQCTDIAQSVCLAFSCAALLALCRTSRTEAVSIASHSPEIKPPSESFSGS
LYLPPNSEIINIPPIHDFLIRKNRNSNNFSSTFQDPTKLLMCIGVGINFS
EYLPSFRNLQKTNRVKVAEVDSINCPETSEE*
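The protein sequence can be related back to the transcript by translating from the start codon. The sketch below assumes the standard genetic code applies to this nuclear transcript and that the ORF begins at transcript position 421, which is the Query start reported in the BLASTX alignments; `translate_orf` and the variable names are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (assumed to apply here)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_orf(transcript: str, start: int) -> str:
    """Translate from a 1-based start position up to and including the first stop ('*')."""
    seq = "".join(transcript.split()).upper()  # drop FASTA line breaks
    protein = []
    for i in range(start - 1, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


# Nucleotides 401-450 of the transcript above (the line beginning TTTCAAAAGATT);
# position 421 of the transcript is position 21 of this line.
line_401_450 = "TTTCAAAAGATTTGATTAAGATGCAAAAATCAAAAGAACAACCAACTGAT"
print(translate_orf(line_401_450, 21))
```

Passing the full transcript with `start=421` should, under the same assumptions, yield a sequence that can be compared against SMED30007456-orf-1.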
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
Term           Definition
PLANA:0000099  neuron
PLANA:0000418  head
Vocabulary: INTERPRO
Term       Definition
IPR009836  GRDP-like
Vocabulary: molecular function
Term        Definition
GO:0003674  molecular_function
Vocabulary: cellular component
Term        Definition
GO:0005886  plasma membrane
Vocabulary: biological process
Term        Definition
GO:0009738  abscisic acid-activated signaling pathway
GO:0009787  regulation of abscisic acid-activated signaling pathway
GO:0071470  cellular response to osmotic stress
InterPro
Analysis Name: Schmidtea mediterranea smed_20140614 Interproscan
Date Performed: 2020-05-01
IPR Term   IPR Description                              Source   Source Term  Source Description  Alignment
IPR009836  Glycine-rich domain-containing protein-like  PFAM     PF07173      GRDP-like           coord: 94..230; e-value: 9.9E-32; score: 110.5
None       No IPR available                             PANTHER  PTHR34365    FAMILY NOT NAMED    coord: 17..294
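The InterPro alignment coordinates can be converted to domain lengths and to transcript positions. This is a rough sketch under two assumptions: the coordinates are 1-based, inclusive residue positions on SMED30007456-orf-1, and the first codon of that ORF sits at transcript position 421 (the BLASTX Query start). The helper names are hypothetical.

```python
ORF_START = 421  # assumed 1-based transcript position of the first codon


def domain_length(first: int, last: int) -> int:
    """Residues covered by 1-based, inclusive protein coordinates."""
    return last - first + 1


def to_transcript_coords(first: int, last: int, orf_start: int = ORF_START) -> tuple:
    """Map protein residue coordinates to the nucleotides that encode them."""
    nt_first = orf_start + 3 * (first - 1)  # first base of the first codon
    nt_last = orf_start + 3 * last - 1      # last base of the last codon
    return nt_first, nt_last


# PFAM PF07173 (GRDP-like), coord: 94..230
print(domain_length(94, 230), to_transcript_coords(94, 230))
# PANTHER PTHR34365, coord: 17..294
print(domain_length(17, 294), to_transcript_coords(17, 294))
```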