Surface protein

Overview
NameSurface protein
Smed IDSMED30003501
Length (bp)4907
Neoblast Clusters

Zeng et. al., 2018




▻ Overview

▻ Neoblast Population

▻ Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 



 

Overview

 

Single cell RNA-seq of pluripotent neoblasts and its early progenies


We isolated X1 neoblasts cells enriched in high piwi-1 expression (Neoblast Population), and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them based on high (Nb1) to low (Nb12) piwi-1 expression levels. We further defined groups of genes that best classified the cells parsed into 12 distinct cell clusters to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signatures was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1 and largely confirmed the cell clusters revealed by scRNA-seq.

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, followed by isolation and single-cell RNA-seq of 1,200 individual cells derived from X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population)




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of Surface protein (SMED30003501) t-SNE clustered cells

Violin plots show distribution of expression levels for Surface protein (SMED30003501) in cells (dots) of each of the 12 neoblast clusters.

 

back to top


 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Surface protein (SMED30003501) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Surface protein (SMED30003501) in cells (dots) of each of the 10 clusters of sub-leathally irradiated X1 and X2 cells.

 

back to top


 

Embryonic Expression

Davies et. al., 2017




Hover the mouse over a column in the graph to view average RPKM values per sample.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods

 

back to top
Anatomical Expression

PAGE et. al., 2020




SMED30003501

has been reported as being expressed in these anatomical structures and/or regions. Read more about PAGE



PAGE Curations: 3

  
Expressed InReference TranscriptGene ModelsPublished TranscriptTranscriptomePublicationSpecimenLifecycleEvidence
neuronSMED30003501SMESG000064752.1 dd_Smed_v4_3240_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
whole organism asexual adult single-cell RNA-sequencing evidence
parenchymal cellSMED30003501SMESG000064752.1 dd_Smed_v4_3240_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
whole organism asexual adult single-cell RNA-sequencing evidence
parenchymal cellSMED30003501SMESG000064752.1 dd_Smed_v6_3240_0dd_Smed_v6PMID:29674432
Plass et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
Note: Hover over icons to view figure legend
Homology
BLAST of Surface protein vs. Ensembl Fly
Match: Dscam1 (gene:FBgn0033159 transcript:FBtr0111085)

HSP 1 Score: 52.373 bits (124), Expect = 5.223e-6
Identity = 72/325 (22.15%), Postives = 132/325 (40.62%), Query Frame = 1
Query: 3190 GDYGCFTKPAASFGLAKITIVINDLILKP---KKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCL--KSIGEMKASQQFTLYGKEKLVITVKKSSELIGEITSLRCKADLATKNQKVVWETN--SVFDNKKKLVSTSYPN---LLDNSIQDIDIVNNLKV--HQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRA 4128
            G Y CF +       A   + +      P   +  + +  +P   + + C +       + W + D   I+N  N+ Y++ Q +T+   + S L I     N  G Y C+    +G  + S +  +YG    +  ++K + + GE   + C          +VWE +  ++  N+K+ V   +PN   +++N  ++ D      V  +Q       SLE     P        +TP    D PTN     ++S+TC         D++W F+NE  +   SG  V+    +           VLS+++V+   +G Y C+ +  A
Sbjct:  402 GMYQCFVRNDQESAEASAELKLGGRFDPPVIRQAFQEETMEPGPSVFLKCVAGGNPTPEISW-ELDGKKIAN--NDRYQVGQYVTVNGDVVSYLNITSVHANDGGLYKCIAKSKVGVAEHSAKLNVYGL-PYIRQMEKKAIVAGETLIVTCPV-AGYPIDSIVWERDNRALPINRKQKV---FPNGTLIIENVERNSDQATYTCVAKNQEGYSARGSLEVQVMVPP------KITPFDFGDEPTNFE--DSVSVTCLISSGDLPIDIEW-FFNEYGISSYSGISVVKGGKR---------NSVLSIDSVQARHAGNYSCRAKNHA 700          
BLAST of Surface protein vs. Ensembl Fly
Match: Dscam1 (gene:FBgn0033159 transcript:FBtr0111084)

HSP 1 Score: 52.373 bits (124), Expect = 5.312e-6
Identity = 72/325 (22.15%), Postives = 132/325 (40.62%), Query Frame = 1
Query: 3190 GDYGCFTKPAASFGLAKITIVINDLILKP---KKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCL--KSIGEMKASQQFTLYGKEKLVITVKKSSELIGEITSLRCKADLATKNQKVVWETN--SVFDNKKKLVSTSYPN---LLDNSIQDIDIVNNLKV--HQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRA 4128
            G Y CF +       A   + +      P   +  + +  +P   + + C +       + W + D   I+N  N+ Y++ Q +T+   + S L I     N  G Y C+    +G  + S +  +YG    +  ++K + + GE   + C          +VWE +  ++  N+K+ V   +PN   +++N  ++ D      V  +Q       SLE     P        +TP    D PTN     ++S+TC         D++W F+NE  +   SG  V+    +           VLS+++V+   +G Y C+ +  A
Sbjct:  403 GMYQCFVRNDQESAEASAELKLGGRFDPPVIRQAFQEETMEPGPSVFLKCVAGGNPTPEISW-ELDGKKIAN--NDRYQVGQYVTVNGDVVSYLNITSVHANDGGLYKCIAKSKVGVAEHSAKLNVYGL-PYIRQMEKKAIVAGETLIVTCPV-AGYPIDSIVWERDNRALPINRKQKV---FPNGTLIIENVERNSDQATYTCVAKNQEGYSARGSLEVQVMVPP------KITPFDFGDEPTNFE--DSVSVTCLISSGDLPIDIEW-FFNEYGISSYSGISVVKGGKR---------NSVLSIDSVQARHAGNYSCRAKNHA 701          
BLAST of Surface protein vs. Ensembl Fly
Match: Dscam1 (gene:FBgn0033159 transcript:FBtr0111050)

HSP 1 Score: 51.9878 bits (123), Expect = 6.617e-6
Identity = 71/325 (21.85%), Postives = 132/325 (40.62%), Query Frame = 1
Query: 3190 GDYGCFTKPAASFGLAKITIVINDLILKP---KKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCL--KSIGEMKASQQFTLYGKEKLVITVKKSSELIGEITSLRCKADLATKNQKVVWETN--SVFDNKKKLVSTSYPN---LLDNSIQDIDIVNNLKV--HQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRA 4128
            G Y CF +       A   + +      P   +  + +  +P   + + C +       + W + D   I+N  N+ Y++ Q +T+   + S L I     N  G Y C+    +G  + S +  +YG    +  ++K + + GE   + C          +VWE +  ++  N+K+ V   +PN   +++N  ++ D      V  +Q       SLE     P        +TP    + P NV    ++S+TC         D++W F+NE  +   SG  V+    +           VLS+++V+   +G Y C+ +  A
Sbjct:  402 GMYQCFVRNDQESAEASAELKLGGRFDPPVIRQAFQEETMEPGPSVFLKCVAGGNPTPEISW-ELDGKKIAN--NDRYQVGQYVTVNGDVVSYLNITSVHANDGGLYKCIAKSKVGVAEHSAKLNVYGL-PYIRQMEKKAIVAGETLIVTCPV-AGYPIDSIVWERDNRALPINRKQKV---FPNGTLIIENVERNSDQATYTCVAKNQEGYSARGSLEVQVMAPP------KITPFSFGEEPANVE--DSVSVTCLISTGDLPIDIEW-FFNEYGISSYSGISVMKGGKR---------NSVLSIDSVQARHAGNYSCRAKNHA 700          
BLAST of Surface protein vs. Ensembl Fly
Match: Dscam1 (gene:FBgn0033159 transcript:FBtr0111096)

HSP 1 Score: 51.9878 bits (123), Expect = 6.780e-6
Identity = 71/325 (21.85%), Postives = 132/325 (40.62%), Query Frame = 1
Query: 3190 GDYGCFTKPAASFGLAKITIVINDLILKP---KKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCL--KSIGEMKASQQFTLYGKEKLVITVKKSSELIGEITSLRCKADLATKNQKVVWETN--SVFDNKKKLVSTSYPN---LLDNSIQDIDIVNNLKV--HQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRA 4128
            G Y CF +       A   + +      P   +  + +  +P   + + C +       + W + D   I+N  N+ Y++ Q +T+   + S L I     N  G Y C+    +G  + S +  +YG    +  ++K + + GE   + C          +VWE +  ++  N+K+ V   +PN   +++N  ++ D      V  +Q       SLE     P        +TP    + P NV    ++S+TC         D++W F+NE  +   SG  V+    +           VLS+++V+   +G Y C+ +  A
Sbjct:  402 GMYQCFVRNDQESAEASAELKLGGRFDPPVIRQAFQEETMEPGPSVFLKCVAGGNPTPEISW-ELDGKKIAN--NDRYQVGQYVTVNGDVVSYLNITSVHANDGGLYKCIAKSKVGVAEHSAKLNVYGL-PYIRQMEKKAIVAGETLIVTCPV-AGYPIDSIVWERDNRALPINRKQKV---FPNGTLIIENVERNSDQATYTCVAKNQEGYSARGSLEVQVMAPP------KITPFSFGEEPANVE--DSVSVTCLISTGDLPIDIEW-FFNEYGISSYSGISVMKGGKR---------NSVLSIDSVQARHAGNYSCRAKNHA 700          
BLAST of Surface protein vs. TrEMBL
Match: G7Y9G3 (GPI-anchored surface glycoprotein OS=Clonorchis sinensis OX=79923 GN=CLF_103274 PE=4 SV=1)

HSP 1 Score: 213.772 bits (543), Expect = 7.058e-52
Identity = 322/1439 (22.38%), Postives = 600/1439 (41.70%), Query Frame = 1
Query:   94 FIIFILGFCNIGMLIGETMKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPD---IRFVVGNLING---QGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLV--KSSWASIYEIEVYATKKFTASNLISNNK-----LVLP------------DNCEEKSWLLTKSDL--SIKPKLQK--SGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDI-TKCSSSNSK---FTCQI-DRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNV-----NENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSDLNSF---VIINHGKLE----KIWSKTKFVVNSNTDEKMIVSMPRSXXXXXXXXCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLXXXXXXXXXXXXXXXETEFQTPGAPTELNIKAF---DDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSTDK------YNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGD----KSIRLTKSVITCTKSENKE----VALI-NRTKCMTFNEEASCSNPDQVNMITIS-KNRNIANKY-LYSIIENA-SNKTL-RTITVNNTLALDVKGIEINMKIVDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGKPAKKMVLQK-----SNITIPXXXXXXXXXXXXXXEIEGIKG-----LYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLS-EKFSSKLLV----TVVPLYKNSKRGQKKEIEKLSTISKS----TKLVIDGEVLPKQRIITLIPAQSDL-----PCSIKKVDFTIES----------DTVYNQXXXXXXXXXFK--YQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTANLGPVDKYRKTIEWRRADNSPLPEG------VLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAAS------FGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITV-KKSSELIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSI----QDIDIVNNLKVHQVSCKIV-------------PSLEANAKQ--------PKT-KIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETS 3981
            F +  L FC IG   G    V I+SH  LT+ I   ++ + +DND +T   F+    + V +  +L   + ++ +   I S+ P        VG L NG    G L Y C  +   +    C P C  +  +R G++   I  +F    ++ W  IY++ +  T +     + S++K      + P            + C E  W LT S    S  P + K    + ++ ++SY I ++ +I       N +  +    +K +  +  + + C+  N     + C   D + +P    TI + G+ ++ + G   +YPS+++   + V      E+LK+ C A  C    H + + D     VQ  C+   L R    S+ ++    V++ +G+++    ++ ++ K V +S T  K++ +MPR++ YYGQY+C C   D   + + S  T L  ++F ++L+F+     + DE      + I N +   +      Y T+ I  +SL   + +  +   +   T      A T  +++AF      S +W ++ Y+   E A+P+    G+       K        ++  +    I     +V    V +    + + I+  Y  +    +  +L    ++C+  EN E    V L  +  K      E  C        +  +  N + ++ Y LY ++E+  S++ L ++ +V   L  ++   +I +K+     ++ +P   ++ +     SK+I D   C    V +  P  P  + + Q+     +  T+ V      +   L++ IEG +        +  L ++NP  + + DL++ D  +  + W  + +   D+L  + V   +T   C   +++         +N    +L L+    F S++LV     V P++K    G    +     ISK     + +  D     +   I + P+Q  L     P    K+   +            D  + + +E      F   Y I  L PGR+Y++ A V Y  G  D+ S    F T D++ ++ ++   +  D + I CT  +GP D ++K +EW+RAD S LP+G        + ++P+  ++  L+ +K  A  +  Y CF +P+ +      +    + +++N L L    ++       +PI++ C +       L WT  D   +            ++  E+   S+L I +  L  +G Y C K  G     + F L  +E + I V ++SS+L GE  S+ C A L   +QKV W       +       SY  + D SI    Q+ ++ N   V     +++              S+  ++KQ        PKT  I  L  TP + LD    V +G  + + CTGYP +    L+W + + T 
Sbjct:    8 FSVITLTFC-IGA--GAYENVAIRSHLNLTQGIEFIDLGNSFDNDRTTQTLFHSEGGQLVSIVLELPGEYQVKSV--EIISDVPSNLKQAVHVGFLQNGVTRSGRLTYECVPQ---HTVSICRPSCMDEKNSREGVVGDSILWTFQRTKENGWTRIYDVII--TGRLFQQPVSSDDKAADITYLKPYFNRALGVDYEREQCAEPVWKLTDSSEANSQMPPVSKFHRNVTVYFDQSYSIAKVSLIASEMDSGNEECVIRFGGRKSVTKIFSLKSDCTMQNETIGLYVCSTPDLMGYPFTNATIDAKGLVRVHIFGMLWYYPSVKIVPFTKVIDPQSTEDLKIQCVASTC-AETHPNSNCDHYGHIVQ--CAYLELARTHESSNGHTIPDTVLVTNGRVKPEVAEVITELKIVHSSKT--KLVTTMPRTHSYYGQYKCRCQTDDPFTSVVESVETKLPATEFDQDLVFERSFTARLDE------QEIPNTLGFLEMPEITGYFTLIISGHSLTENLQLRVTYLESGEFTTITEEYAQTS-SVQAFPLAKGTSYAWPIRAYRIVQENAWPDVTDEGLTTIEWETKDEAVTVRKVRDELRTIVISPQRQNVVRAWVWRR-DSRVVDIRAHYQNEAHDTRKKKLYLQRVSCSTGENPEQMSSVTLSGDECKRTVRQGEIECERTKGGFAVQFTVTNPSASDSYHLYEVMEDQESDRELAKSTSVVQGLKAELDATDIRIKLAGIKGES-EPGRTQMVMVFTIHSKLITDNSACHPTFVKL-QPSWPVSEKLEQQLGSNQTQFTVDVPNDQQLLTFDLEVRIEGGEASGNEITTKRTLQLLNPRHVKL-DLKL-DTQEESITWTALPESISDVLDKFNVKMWSTNKVCGIREEQTPPRVDRPIENLMAYRLSLIDVPNFKSRVLVDHEIEVTPVFKTIDGGSITGVLANLAISKGKIGQSGVKWDKSAANRTARIYVTPSQPALCLRDNPAMTVKLAVLMMGPGAATPKQLVDGFHLEAEEGGLGSDFGKWYTIKDLQPGRRYEVKATVLYPTGIQDEPSASEPFWTTDEVFVSSSEVSVKPGDRLVIQCTGAVGPNDSFKKRLEWKRADGSQLPDGTRIETVTATESDPENMETVNLVFEKVEAGQANTYACFIRPSVASQVGVQYAPPTVKVIVNVLEL---DIQSHVANVADPITVQCRAHQ--GSTLTWTSPDGAVVRKDRQREEPYVTDVKNEQGTISLLHIPRVKLAQTGQYLC-KQDG-TDNHRAFKLKMQEDVRIVVDEESSQLAGENFSITCNALLGHMDQKVEWYRRE--GDGHPWTPVSYATMEDASIKITNQNTELSNGDNVRSSKLRMINSPASAGEFACALKSMSGDSKQFARETLELPKTSAIIELESTPVVNLDRVRRV-SGQQVVVKCTGYPAHEEDHLEWYYVSSTG 1409          
BLAST of Surface protein vs. TrEMBL
Match: A0A4S2M2V4 (Uncharacterized protein OS=Opisthorchis felineus OX=147828 GN=CRM22_003222 PE=4 SV=1)

HSP 1 Score: 191.43 bits (485), Expect = 6.025e-45
Identity = 322/1429 (22.53%), Postives = 598/1429 (41.85%), Query Frame = 1
Query:  154 VPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPD---IRFVVGNLING---QGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLV--KSSWASIYEIEVYATKKFTASNLISNNK-----LVLP------------DNCEEKSWLLTKSDL--SIKPKLQK--SGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDI-TKCSSSNSK---FTCQI-DRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQV-----SASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRL--PS-GSDLNSFVIINHGKLE----KIWSKTKFVVNSNTDEKMIVSMPRSXXXXXXXXCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLXXXXXXXXXXXXXXXETEFQTPGAPTELNIKAF---DDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSTDKYNLK-QNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISK----------------------------NRNIANKY-LYSIIENA-SNKTL-RTITVNNTLALDVKGIEINMKIVDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGKPAKKMVLQK-----SNITIPXXXXXXXXXXXXXXEIEGIKG-----LYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACN--SAQKKVITIKKSDNQNSYETKLDLLS-EKFSSKLLV----TVVPLYKNSKRGQKKEIEKLSTISKS----TKLVIDGEVLPKQRIITLIPAQSDL-----PCSIKKVDFTIES----------DTVYNQXXXXXXXXXFK--YQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTANLGPVDKYRKTIEWRRADNSPLPEG------VLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAAS------FGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITV-KKSSELIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSI----QDIDIVNNLKVHQVSCKIV-------------PSLEANAKQ--------PKTKI-YSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVF 3966
            V I+SH  LT+ I   ++ + +DND +T   F+    + V +  +L   + I+ +   I S+ P        VG L NG    G L Y C  +   +    C P C  +  +R G++   I  +F    ++ W  IY++ +  T +     + S++K      + P            + C E  W LT S    S  P + K    + I+ ++SY I ++ +I       N +  +    +K +  +  + + C+  N     + C   D + +P    TI + G+ ++ + G   +YPS+++     +   +  E+LK+ C A  C    HS+ + D     VQ  C+   L R+  PS G  +   V++ +G+++    ++ ++ K V +S T  K++++MPR++ YYGQY+C C   D   + + S  T L  ++F ++L+F+     + DE      + I N +   +      Y T+ I  +SL   + +  +   +   T      A T  +++AF     +S +W ++ Y+   E A+P+    G+    +T ++  K + + V+ +K     +      Q+I    +  +  ++ D  IR    V T    E K    + R  C T  EE    NP+QV+ +T+S                             N + ++ Y LY ++E+  S++ L ++ +V + L  ++   ++ +K+     ++ +P   ++ V     SK+I D   C    V +  P  P  + + Q+     +  T+ V      +   L++ IEG +        +  L ++NP  L + DL++ D  +  + W  + +   ++L  + V   +T   C     Q      +  DN  +Y  +L L+      S++LV     V P++K+        +     ISK     + +  D     +   I + P+Q  L     P    K+   +            D    + +E      F   Y I  L PGR+Y++TA V Y  G  D+ S    F T D++ ++ ++   +  D + I CT  +GP D ++K +EW+R D S LP+G        + ++ +  ++  L+ ++  A  +  Y CF +P+ +      +    I +++N L L    ++       +PI++ C +       L WT      +            ++  E+   S+L I +  L  +G Y C K  G     + F L  +E + I V ++SS+  GE  S+ C   L   +QKV W       +       SY  + D SI    Q+ ++ N   V     +++              S+  ++KQ        PKT     +   P I LD    V+ G  + + CTGYP +A   L+W +
Sbjct:   25 VAIRSHINLTQGIEFIDLGNSFDNDRTTQTLFHSEGGQLVSIVLELPGEYQIKSV--EIISDVPSNLKQAVHVGFLQNGLTRSGRLTYECVPQ---HTVSICRPSCMDERNSREGVVGDSILWTFQRTKENGWTRIYDVII--TGRLFQQPVSSDDKAADITFLKPYFNRALGVDYEREQCAEPVWKLTDSSEANSQMPPVSKFHRNVTIYFDQSYSIAKVSLIASEMDSGNEECVIRFGGRKSVTKIFSLKSDCTMENETVGLYVCSTPDLMGYPFTNATIDAKGLVRVHIFGLLWYYPSVKIVPFTKAIDPHSTEDLKIQCVASTC-AEKHSNSNCDHYGHIVQ--CAYLELARIHEPSNGHMIPDTVLVTNGRVKPEVAEVTTELKIVDSSKT--KLVITMPRTHSYYGQYKCRCQTDDPFTSVVESVETKLPATEFDQDLVFERSFTARLDE------QEIPNTLGFLEMPEVAGYFTLIISGHSLTENLQLRVTYLESGEFTTITEEYAQTS-SVQAFPRAKGSSYAWPIRAYRIVQENAWPDVTDEGL----TTIEWKTKDEAVTVRQVKDELRTIVISPQRQNIVRAWVWRRDSHVVD--IRAHYQVETHNTREKK--LYLQRVSCST--EE----NPEQVSSVTLSGDECKRIVRQGAVECERTKGGFAVQFTVTNPSASDSYHLYEVMEDQESDRELAKSASVVHGLKAELDATDMQIKLAGIKGES-EPGRTQMVVVFTIHSKLITDNSACHPTFVKL-QPSWPVSEKLEQQLGSNQTQFTVDVPNDQQLLTFDLEVRIEGGEASGSDITTKRTLQLLNPRHLKL-DLKL-DTQEEYITWTALPESISEILDKFNVKMWSTNKVCGIREEQTPPRVDRPIDNLMAY--RLSLIDVPNVKSRVLVDHEIEVTPIFKSIDGESITGVPANLAISKGKIGQSVVKWDKSAANRTARIYVTPSQPALCLRDNPAMTVKLAVLMMGPGTTTPKQLVDGFRLEAEEGGLGSDFGKWYIIKGLQPGRRYEVTATVLYPTGIQDEQSASEPFWTTDEVFVSSSEVSVKPGDRLVIQCTGAVGPNDSFKKRLEWKRVDGSQLPDGTRIETVTTTESDVENMETVNLVFEEVEAGQANTYACFIRPSVASQVGTQYAPPTIKVIVNVLEL---DIESHVANVADPITVQCRAHQ--GSTLTWTSPGGAVVRKDRQREEPYVTDVKNEQGTISLLHIPRVKLAQTGQYLC-KQDG-TDNHRAFKLKMQEDVRIVVDEESSQSAGETFSITCNVLLGHMDQKVEWYRRE--GDGHPWTPVSYATMEDASIKITNQNTELSNGDNVRSSQLRMINSPASAGEFACALKSMSGDSKQFARETMELPKTSASIQIKSHPVINLDRARRVS-GQQVVVKCTGYPAHAEDHLEWYY 1404          
BLAST of Surface protein vs. TrEMBL
Match: Q26607 (Surface protein (Fragment) OS=Schistosoma mansoni OX=6183 PE=2 SV=1)

HSP 1 Score: 187.963 bits (476), Expect = 6.127e-44
Identity = 340/1476 (23.04%), Postives = 619/1476 (41.94%), Query Frame = 1
Query:  154 VPIKSHE--ILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPD-------IRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLV-KSSWASIYEIEVYATKKFTASNLISNN----KLVLP-----------DNCEEKSWLLTK----SDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGK--APENLKIQLSNVEK--KQIDIVIDITKCSSSNS------KFTCQIDRLK-FPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSN------VNEN-LKLTCKAIPC----------DLSVHSDCSTDAM--KSGVQFPCSNKRLVRL---PSGSDLNSFVIINHGKLEKIWSK--TKFVVNSNTDEKMIVSMPRSXXXXXXXXCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQ--KTLKNTYITVKILSYSLXXXXXXXXXXXXXXXETEFQTPGAPTELNIKAFDD---NSESWSVKTYKYEIEEAFPE-FDKRGINVSHSTDKY-------NLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGD--------KSIRLTKSVITCTKSENKEVALIN--RTKCMTFNEE-ASCSNPDQVNMITISKNRNIANKYLYSI----------IENASNKTLRTI-TVNNTLALDVKGIEINMKI--VDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGK--VLVSITAPGKPAKKMVLQKSNITIPXXXXXXXXXXXXXXEIEGIKGLYETVLNVINPTSLAVK-------DLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQK--KVITIKKSDNQNSYETKLDLLSEKFSSKLLVT------VVPLYKNSK-----RGQKKEIEKLSTISKSTKL----------------------VIDGEVLPKQRIITLIPAQSDLPCSIKKVDF---TIESDTVYNQXXXXXXXXXFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTANLGPVDKYRKTIEWRRADNSPLPEGVLSY------NNPDFPQSAFLIMKKALADYSGDYGCFTKPA------ASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWT--DSDNLPISNK-------ENELYKIQ-QNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKL-VITVKKSSELIGEITSLRCKADLATKNQKVVW----ETNSVFDNKKKLVST---------------------------SYPNLLDN---SIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVF 3966
            V IK+H+  +LTK +   +++ ++DND +T G F+   ++ V +  DLG  + +  +   I S+ P+       I +          G  Y C+ +     +  CIP C + D +R G +   +  SF   K  W  IY++ +  T  F  +  + NN     L  P           ++C    W L      ++ S         + I+  K+Y +  +  +   +   P+   +  + +E   K I++  D T  +SS++      ++ C    L+ +    VT+   G+ +L + G P  YP +Q+  S        V++N L+++C A  C          D S     S+D     +G    C    L RL   P+  +     + N G +   +S+  T+  +  +  EK+IV+MPR++QYY QY+CSC   D   +++ S ST+L  +DF ++ +F+T   Y T     F +++I++ +   +  +    +Y  + I+ YSL         + +   +   +T     E++++AF      S+ WS+  YKYE+++ +P+  D   I V+ S  K        ++ ++IF   I  +    T    +       + I+  +  D        K + L +S  + ++SE++ VA +     +C T N +  +C+      +I    N N +   +Y +          +E+ S+  L T  ++  T+  D+KG  +++ +  +  N +  + E+D   V +   SK+I+D I C    +L+    P     K  +        + +  +  ++ LK+++  I  +  T        S+A +       D+++  + ++I  W G+  +  +LL  YE        AC  A +    IT ++ DN   Y   L  + +   +K  +       V P++K         G   +I   +  +  T L                       ++ E L  Q I+ +I    + P  IK+V++   T+++    N      ++    Y+I  L+PGR+Y+L AEV Y     DKISE       D++ +   +      + V I CT ++GP D  +K++EW+  D   LP+G  S       + P +     LI       + G Y CF +P+          L K+T+ ++DL +      V+F    E I I C   T   G L+W     + + I N+       +++ Y I+ +N  ++ +I   L I K +LN  G YTCL S    K  Q F+L  KE + ++   +SS+  G+   L C A+L   +Q VVW     +NS +    + + T                           + P ++     +IQ+I    N++  +    +    + +     T   SL    KIL   P  +  G  IS+ C GYP ++   LQWV+
Sbjct:   27 VDIKAHDYKLLTKILAARQLQDLFDNDKNTHGLFHAELNQKVYLIVDLGGLYQVSSV--EITSDEPENLKQTISIGYGFDKNTTSLFGSLYNCTYQ---QTSTLCIPACSQYDNSRTGFVGNSLLWSFQSDKEGWTRIYDLIIRGTP-FDENRSMKNNLDDITLFKPYADHLVPAESMESCNGLLWHLNDESSVTESSALAATDDRNVTIYFGKTYSVTSVHFVTSKRDDMPQEYTLHFNGMESLTKIINLQNDCTLTASSDNDGANFKEYNCPTTDLESYTFDYVTVNVKGLYKLHVYGLPFHYPKIQIIPSKEDDKTIMVDKNVLQISCIAQSCNSTSNVLLDVDDSYRRSRSSDKSCPMNGHIVRCEYLNLFRLSTSPNDDNNQKIQLTNQGIINTEFSEIMTELRIIQSNAEKLIVTMPRTHQYYSQYECSCQATDNKKSQLFSLSTNLKPTDFDQDFIFQTN--YTT----IFADQTIESHVGFIELPENEHESYFKLNIIGYSL-LNDVQIGVVFVEGGQAGSRTNANIEEISVQAFPGINVGSDLWSIPVYKYELQDVWPDSVDDSVITVTWSMPKALSATKSPSVIRDIFRALIITSSGVNTIRAWVWQRDSHLLDIRAYHQLDENNVDSQLKILTLQRSGCSPSESEDEVVASVQLKDGQCSTDNNDIITCTRTIHGQIIQFKLN-NPSTSDVYKLYMKSDGVEDNVESTSSIDLVTSGSLGETVKEDIKGAGLSLTVEGIHHNHETQETELD---VAVHIASKVISDNIACRPTYLLLEFIEPNIETLKSRVSSKQTMFRIKLPSNQKEINLKMQL-SIGSVDPTQSEATTNQSIAFQNPFYIPTDIKVDAENQLI-QWFGLPTIFNNLLHHYETKLSGLPKACEQASEFNLPITQQEIDNGTIYRVNLKNIPDPTITKNGLAIDYNFKVTPVFKGIDGKSITMGTSSDIRFSTGRTGQTDLKAPTSGRYYSLQVQVRPSQIPSCLNLETLNTQFILRVIGEVDEYPDYIKQVNYVPITMKTTETLNDKNNHVKL----YKIENLLPGRRYELQAEVIYTEDFRDKISEPVRLWIEDEVHVQTEEVFISPGERVVINCTGSVGPNDTSQKSLEWKLFDGGRLPDGSRSLKTQEAQSGPLWYAMESLIFDPVNKQHGGVYACFIRPSILELMNKPTELHKVTVTVSDLEVDINSKIVEFG---EKIIITC--RTASPGQLDWMLPSGEKVEIMNEMKSDDDNDDQPYTIKDENDDVKLSIK--LIIPKVNLNKVGKYTCLHSPSNNK--QTFSLKMKEVIKLVKSPESSDKPGKTLILDCTANLGNLHQSVVWYKRPNSNSPWLEITEAIQTIEHITIQQKNPEDTLSSGVWLSELKVKNSPGIIGEFMCTIQNIQTTMNIERMETGSIMTNDNDFSKITHATIKVSLKSVLKILT--PIKLENGQ-ISVHCQGYPAHSKDRLQWVY 1467          
BLAST of Surface protein vs. TrEMBL
Match: A0A3Q0KCT5 (200-kDa GPI-anchored surface glycoprotein OS=Schistosoma mansoni OX=6183 PE=4 SV=1)

HSP 1 Score: 187.193 bits (474), Expect = 1.091e-43
Identity = 340/1476 (23.04%), Postives = 619/1476 (41.94%), Query Frame = 1
Query:  154 VPIKSHE--ILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPD-------IRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLV-KSSWASIYEIEVYATKKFTASNLISNN----KLVLP-----------DNCEEKSWLLTK----SDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGK--APENLKIQLSNVEK--KQIDIVIDITKCSSSNS------KFTCQIDRLK-FPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSN------VNEN-LKLTCKAIPC----------DLSVHSDCSTDAM--KSGVQFPCSNKRLVRL---PSGSDLNSFVIINHGKLEKIWSK--TKFVVNSNTDEKMIVSMPRSXXXXXXXXCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQ--KTLKNTYITVKILSYSLXXXXXXXXXXXXXXXETEFQTPGAPTELNIKAFDD---NSESWSVKTYKYEIEEAFPE-FDKRGINVSHSTDKY-------NLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGD--------KSIRLTKSVITCTKSENKEVALIN--RTKCMTFNEE-ASCSNPDQVNMITISKNRNIANKYLYSI----------IENASNKTLRTI-TVNNTLALDVKGIEINMKI--VDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGK--VLVSITAPGKPAKKMVLQKSNITIPXXXXXXXXXXXXXXEIEGIKGLYETVLNVINPTSLAVK-------DLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQK--KVITIKKSDNQNSYETKLDLLSEKFSSKLLVT------VVPLYKNSK-----RGQKKEIEKLSTISKSTKL----------------------VIDGEVLPKQRIITLIPAQSDLPCSIKKVDF---TIESDTVYNQXXXXXXXXXFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTANLGPVDKYRKTIEWRRADNSPLPEGVLSY------NNPDFPQSAFLIMKKALADYSGDYGCFTKPA------ASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWT--DSDNLPISNK-------ENELYKIQ-QNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKL-VITVKKSSELIGEITSLRCKADLATKNQKVVW----ETNSVFDNKKKLVST---------------------------SYPNLLDN---SIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVF 3966
            V IK+H+  +LTK +   +++ ++DND +T G F+   ++ V +  DLG  + +  +   I S+ P+       I +          G  Y C+ +     +  CIP C + D +R G +   +  SF   K  W  IY++ +  T  F  +  + NN     L  P           ++C    W L      ++ S         + I+  K+Y +  +  +   +   P+   +  + +E   K I++  D T  +SS++      ++ C    L+ +    VT+   G+ +L + G P  YP +Q+  S        V++N L+++C A  C          D S     S+D     +G    C    L RL   P+  +     + N G +   +S+  T+  +  +  EK+IV+MPR++QYY QY+CSC   D   +++ S ST+L  +DF ++ +F+T   Y T     F +++I++ +   +  +    +Y  + I+ YSL         + +   +   +T     E++++AF      S+ WS+  YKYE+++ +P+  D   I V+ S  K        ++ ++IF   I  +    T    +       + I+  +  D        K + L +S  + ++SE++ VA +     +C T N +  +C+      +I    N N +   +Y +          +E+ S+  L T  ++  T+  D+KG  +++ +  +  N +  + E+D   V +   SK+I+D I C    +L+    P     K  +        + +  +  ++ LK+++  I  +  T        S+A +       D+++  + ++I  W G+  +  +LL  YE        AC  A +    IT ++ DN   Y   L  + +   +K  +       V P++K         G   +I   +  +  T L                       ++ E L  Q I+ +I    + P  IK+V++   T+++    N      ++    Y+I  L+PGR+Y+L AEV Y     DKISE       D++ +   +      + V I CT ++GP D  +K++EW+  D   LP+G  S       + P +     LI       + G Y CF +P+          L K+T+ ++DL +      V+F    E I I C   T   G L+W     + + I N+       +++ Y I+ +N  ++ +I   L I K +LN  G YTCL S    K  Q F+L  KE + ++   +SS+  G+   L C A+L   +Q VVW     +NS +    + + T                           + P ++     +IQ+I    N++  +    +    + +     T   SL    KIL   P  +  G  IS+ C GYP ++   LQWV+
Sbjct:   27 VDIKAHDYKLLTKILAARQLQDLFDNDKNTHGLFHAELNQKVYLIVDLGGLYQVSSV--EITSDEPENLKQTISIGYGFDKNTTSLFGSLYNCTYQ---QTSTLCIPACSQYDNSRTGFVGNSLLWSFQSDKEGWTRIYDLIIRGTP-FDENRSMKNNLDDITLFKPYVDHLVPAESMESCNGLLWHLNDESSVTESSALAATDDRNVTIYFGKTYSVTSVHFVTSKRDDMPQEYTLHFNGMESLTKIINLQNDCTLTASSDNDGANFKEYNCPTTDLESYTFDYVTVNVKGLYKLHVYGLPFHYPKIQIIPSKEDDKTIMVDKNVLQISCIAQSCNSTSNVLLDVDDSYRRSRSSDKSCPMNGHIVRCEYLNLFRLSTSPNDDNNQKIQLTNQGIINTEFSEIMTELRIIQSNAEKLIVTMPRTHQYYSQYECSCQATDNKKSQLFSLSTNLKPTDFDQDFIFQTN--YTT----IFADQTIESHVGFIELPENEHESYFKLNIIGYSL-LNDVQIGVVFVEGGQAGSRTNANIEEISVQAFPGINVGSDLWSIPVYKYELQDVWPDSVDDSVITVTWSMPKALSATKSPSVIRDIFRALIITSSGVNTIRAWVWQRDSHLLDIRAYHQLDENNVDSQLKILTLQRSGCSPSESEDEVVASVQLKDGQCSTDNNDIITCTRTIHGQIIQFKLN-NPSTSDVYKLYMKSDGVEDNVESTSSIDLVTSGSLGETVKEDIKGAGLSLTVEGIHHNHETQETELD---VAVHIASKVISDNIACRPTYLLLEFIEPNIETLKSRVSSKQTMFRIKLPSNQKEINLKMQL-SIGSVDPTQSEATTNQSIAFQNPFYIPTDIKVDAENQLI-QWFGLPTIFNNLLHHYETKLSGLPKACEQASEFNLPITQQEIDNGTIYRVNLKNIPDPTITKNGLAIDYNFKVTPVFKGIDGKSITMGTSSDIRFSTGRTGQTDLKAPTSGRYYSLQVQVRPSQIPSCLNLETLNTQFILRVIGEVDEYPDYIKQVNYVPITMKTTETLNDKNNHVKL----YKIENLLPGRRYELQAEVIYTEDFRDKISEPVRLWIEDEVHVQTEEVFISPGERVVINCTGSVGPNDTSQKSLEWKLFDGGRLPDGSRSLKTQEAQSGPLWYAMESLIFDPVNKQHGGVYACFIRPSILELMNKPTELHKVTVTVSDLEVDINSKIVEFG---EKIIITC--RTASPGQLDWMLPSGEKVEIMNEMKSDDDNDDQPYTIKDENDDVKLSIK--LIIPKVNLNKVGKYTCLHSPSNNK--QTFSLKMKEVIKLVKSPESSDKPGKTLILDCTANLGNLHQSVVWYKRPNSNSPWLEITEAIQTIEHITIQQKNPEDTLSSGVWLSELKVKNSPGIIGEFMCTIQNIQTTMNIERMETGSIMTNDNDFSKITHATIKVSLKSVLKILT--PIKLENGQ-ISVHCQGYPAHSKDRLQWVY 1467          
BLAST of Surface protein vs. TrEMBL
Match: A0A1S8X837 (Immunoglobulin domain protein (Fragment) OS=Opisthorchis viverrini OX=6198 GN=X801_01223 PE=4 SV=1)

HSP 1 Score: 172.17 bits (435), Expect = 4.471e-39
Identity = 297/1355 (21.92%), Postives = 553/1355 (40.81%), Query Frame = 1
Query:  361 VGNLING---QGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLV--KSSWASIYEIEVYA---TKKFTASNLISNNKLVLP------------DNCEEKSWLLTKSDL--SIKPKLQK--SGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDI-TKCSSSNSK---FTCQI-DRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQV-----SASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRL--PS-GSDLNSFVIINHGKLE----KIWSKTKFVVNSNTDEKMIVSMPRSXXXXXXXXCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLXXXXXXXXXXXXXXXETEFQTPGAPTELNIKAF---DDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSTDK------YNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGD----KSIRLTKSVITCTKSENKE---VALINRTKC--MTFNEEASCSNPDQVNMITIS-KNRNIANKY-LYSIIENA-SNKTL-RTITVNNTLALDVKGIEINMKIVDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGKPAKKMVLQK-----SNITIPXXXXXXXXXXXXXXEIEGIKG-----LYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKV-----------ITIKKSDNQNSYETKLDLLSEKFSSKLLV----TVVPLYKNSKRGQKKEIEKLSTISKS----TKLVIDGEVLPKQRIITLIPAQSDL-----PCSIKKVDFTIES----------DTVYNQXXXXXXXXXFK--YQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTANLGPVDKYRKTIEWRRADNSPLPEG------VLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKP--AASFGL----AKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITV-KKSSELIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSI----QDIDIVNNLKVHQVSCKIV-------------PSLEANAKQ--------PKTKI-YSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETS 3981
            VG L NG    G L Y C  +   +    C P C  +  +R G++   I  +F    ++ W  IY++ +      +  +  +  ++   + P            + C E  W LT S    S  P + K    + ++ ++SY I ++ +I       N +  +    +K +  +  + + C+  N     + C   D + +P    TI + G+    +  K  +YPS+++     +      E+LK+ C A  C    HS+ + D     VQ  C+   L R   PS G  +   V++ +G+++    ++ ++ K V +S T+  ++ ++PR++ YYGQY+C C   D   + + S  T L  ++F ++L+F+     K DE      + I N +   +      Y T+ I  +SL   + +  +   +   T      A T  +++AF        +W ++ Y+   E A+P+    G+       K        +K  +    I     +     V +      + I+  Y  +    +  +L    ++C+  EN E      I+  +C  +    E  C        +  +  N + ++ Y LY ++E+  S++ L ++ +V   L  ++   +I +K+    +++ +P   ++ V     SK+I D   C    V +  P  P  + + Q+     +  T+ V      +   L++ IEG          +  L ++NP  L + DL++ D  +  + W  + +   ++L  + V   +T   C   +++            I +   +N  +Y   L +    F S++LV     V P++K       K +     ISK     + +  D     +   I +IP+Q  L     P    K+   +            D  + + +E      F   Y I +L PGR+Y++ A V Y  G  D+ S    F T+D++ ++ ++   +  D + I CT  +GP D  +K +EW+RAD S LP+G        + ++P   ++  L+ ++  A  +  Y CF KP  A+  G+      + +V+N L L    ++       +PI++ C +       L WT      +            ++  E+   S+L I +  L  +G Y C K  G     + F L  +E + I V + SS+L GE  S+ C A L   +QKV W       +       SY +L D SI    Q+ ++ N   V     +++              S+  ++KQ        PKT     L   P I LD    V+    + + CTG+P +    L+W + + T 
Sbjct:   12 VGFLQNGVTRSGRLTYECIPQ---HTVSICRPSCMNEKNSREGVVGDSILWTFQRTKENGWTRIYDVVITGRLFQQPVSLEDKAADITYLKPYFNRALGVDYEREQCAEPVWKLTDSSEANSQMPPVSKFHRNVTVYFDQSYSIAKVSLIASEMDNGNEECVIRFGGRKSVTKIFSLKSDCTMENETIGLYVCSTPDLMGYPFTNATIDAKGLFPNAVLQKCGYYPSVKIVPFTKAIDPQSTEDLKIQCVASTCP-EKHSNSNCDQYGHIVQ--CAYLELARTHEPSNGHAIPDTVLVTNGRVKPEVAEVITELKIVHSSKTE--LVTTVPRTHSYYGQYKCRCQTDDPFTSVVESVETKLPATEFDQDLVFERSFTAKLDE------QEIPNTLGFLEMPEITGYFTLIISGHSLTENLQLRVTYLDSGEFTTITEEYAQTS-SVQAFPLAKGTLYAWPIRAYRIVQENAWPDVTDEGLTTIEWETKDEAVTVRKVKDELRTIVISPQRQNTVRAWVWRR-DSHVVDIRAHYQNEPHNSREKKLYLQRVSCSTGENPEQVSSVTISGDECKRIVRQGEVECERTKGGFAVQFTVTNPSASDSYHLYEVMEDQESDRELAKSTSVVQGLQAELDATDIRIKLAGIKAES-EPGRTQMVVVFTIHSKLITDNSACHPTFVKL-QPSWPVSEKLEQQLGSNQTQFTVDVPNDQQLLTFDLEVRIEGGDASGSDITTKRTLQLLNPRHLKL-DLKL-DTQEETVTWTSLPESITEVLDKFNVKMWSTNKVCGIREEQTPPRVDRPIETPINVSFVENLMAYRISL-IDVPNFKSRVLVDHEIEVTPIFKTIDGESIKGVPSNVAISKGKIGQSGVKWDKSAANRTARIYVIPSQPALCLRDNPAITVKLALLMMGPGTATPNQLVDGFHLEAEEGGLGSDFGKWYTIKELQPGRRYEVKATVLYPTGIQDERSAAEVFWTSDEVFVSSSEVSVKPGDRLEIQCTGAVGPNDSLKKRLEWKRADGSQLPDGTRIETVTATESDPKNMETVKLVFEEVEAGQANTYACFIKPSVASQVGVQYVPPTVRVVVNVLEL---DIQSHVANVADPITVQCRAHQ--GSTLTWTSPGGAVVRKDRQREEPYVTDVKNEQGTISLLHIPRVKLAQTGQYLC-KQDG-TDNHRAFKLKMQEDVRIVVDEDSSQLAGETFSITCNALLGHADQKVEWYRRE--GDGHPWTPVSYASLEDESIKITNQNAELSNGDNVRSSQLRMINSPASAGEFACALKSMSGDSKQFARETMELPKTSASVHLDSIPVIKLDRARRVSR-QQVVVKCTGFPAHEEDHLEWYYVSSTG 1335          
BLAST of Surface protein vs. Planmine SMEST
Match: SMESG000064752.1 (SMESG000064752.1)

HSP 1 Score: 2714.49 bits (7035), Expect = 0.000e+0
Identity = 1397/1408 (99.22%), Postives = 1402/1408 (99.57%), Query Frame = 1
Query:   82 MKNNFIIFILGFCNIGMLIGETMKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWASIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLTKSDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCSSSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSDLNSFVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSXXXXXXXXCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLXXXXXXXXXXXXXXXETEFQTPGAPTELNIKAFDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSTDKYNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTLRTITVNNTLALDVKGIEINMKIVDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGKPAKKMVLQKSNITIPXXXXXXXXXXXXXXEIEGIKGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLYKNSKRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIKKVDFTIESDTVYNQXXXXXXXXXFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTANLGPVDKYRKTIEWRRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITVKKSSELIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAKTELLTSKSISLSVDVSKVLAVLDPCTDCPKKSAHTAYNFNFPKSVFISVIAFALFCLF 4305
            MKNNFIIFILGFCNIGMLIGETMKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWASIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLT+SDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCSSSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSD NSFVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSYQYYGQYQCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLIAGININASIPIANAETEFQTPGAPTELNIKAFDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHS+DKYNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTLRTITVNNTLALDVKGIEINMK VDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGK AKKMVLQKSNITIPVSVVGSSVKLVLKLEIEGIKGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLYKN+KRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIK VDFTIESDTVYNQIKEIKEIEKFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTA+LGPVDKYRKTIEWRRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITVKKSS LIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAKTELLTSKSISLSVDVSKVLAVL+PCTDCPKKSAH AYNFNFPKSVFISVIAFALFCLF
Sbjct:    1 MKNNFIIFILGFCNIGMLIGETMKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWASIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLTQSDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCSSSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSDFNSFVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSYQYYGQYQCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLIAGININASIPIANAETEFQTPGAPTELNIKAFDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSSDKYNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTLRTITVNNTLALDVKGIEINMKYVDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGKTAKKMVLQKSNITIPVSVVGSSVKLVLKLEIEGIKGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLYKNNKRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIKNVDFTIESDTVYNQIKEIKEIEKFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTADLGPVDKYRKTIEWRRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITVKKSSGLIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAKTELLTSKSISLSVDVSKVLAVLNPCTDCPKKSAHAAYNFNFPKSVFISVIAFALFCLF 1408          
BLAST of Surface protein vs. Planmine SMEST
Match: SMESG000064752.1 (SMESG000064752.1)

HSP 1 Score: 2681.75 bits (6950), Expect = 0.000e+0
Identity = 1381/1392 (99.21%), Postives = 1386/1392 (99.57%), Query Frame = 1
Query:  130 MLIGETMKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWASIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLTKSDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCSSSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSDLNSFVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSXXXXXXXXCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLXXXXXXXXXXXXXXXETEFQTPGAPTELNIKAFDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSTDKYNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTLRTITVNNTLALDVKGIEINMKIVDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGKPAKKMVLQKSNITIPXXXXXXXXXXXXXXEIEGIKGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLYKNSKRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIKKVDFTIESDTVYNQXXXXXXXXXFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTANLGPVDKYRKTIEWRRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITVKKSSELIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAKTELLTSKSISLSVDVSKVLAVLDPCTDCPKKSAHTAYNFNFPKSVFISVIAFALFCLF 4305
            MLIGETMKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWASIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLT+SDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCSSSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSD NSFVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSYQYYGQYQCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLIAGININASIPIANAETEFQTPGAPTELNIKAFDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHS+DKYNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTLRTITVNNTLALDVKGIEINMK VDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGK AKKMVLQKSNITIPVSVVGSSVKLVLKLEIEGIKGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLYKN+KRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIK VDFTIESDTVYNQIKEIKEIEKFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTA+LGPVDKYRKTIEWRRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITVKKSS LIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAKTELLTSKSISLSVDVSKVLAVL+PCTDCPKKSAH AYNFNFPKSVFISVIAFALFCLF
Sbjct:    1 MLIGETMKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWASIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLTQSDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCSSSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSDFNSFVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSYQYYGQYQCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLIAGININASIPIANAETEFQTPGAPTELNIKAFDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSSDKYNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTLRTITVNNTLALDVKGIEINMKYVDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGKTAKKMVLQKSNITIPVSVVGSSVKLVLKLEIEGIKGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLYKNNKRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIKNVDFTIESDTVYNQIKEIKEIEKFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTADLGPVDKYRKTIEWRRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITVKKSSGLIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAKTELLTSKSISLSVDVSKVLAVLNPCTDCPKKSAHAAYNFNFPKSVFISVIAFALFCLF 1392          
BLAST of Surface protein vs. Planmine SMEST
Match: SMESG000064752.1 (SMESG000064752.1)

HSP 1 Score: 2668.26 bits (6915), Expect = 0.000e+0
Identity = 1375/1386 (99.21%), Postives = 1380/1386 (99.57%), Query Frame = 1
Query:  148 MKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWASIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLTKSDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCSSSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSDLNSFVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSXXXXXXXXCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLXXXXXXXXXXXXXXXETEFQTPGAPTELNIKAFDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSTDKYNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTLRTITVNNTLALDVKGIEINMKIVDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGKPAKKMVLQKSNITIPXXXXXXXXXXXXXXEIEGIKGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLYKNSKRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIKKVDFTIESDTVYNQXXXXXXXXXFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTANLGPVDKYRKTIEWRRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITVKKSSELIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAKTELLTSKSISLSVDVSKVLAVLDPCTDCPKKSAHTAYNFNFPKSVFISVIAFALFCLF 4305
            MKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWASIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLT+SDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCSSSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSD NSFVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSYQYYGQYQCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLIAGININASIPIANAETEFQTPGAPTELNIKAFDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHS+DKYNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTLRTITVNNTLALDVKGIEINMK VDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGK AKKMVLQKSNITIPVSVVGSSVKLVLKLEIEGIKGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLYKN+KRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIK VDFTIESDTVYNQIKEIKEIEKFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTA+LGPVDKYRKTIEWRRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITVKKSS LIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAKTELLTSKSISLSVDVSKVLAVL+PCTDCPKKSAH AYNFNFPKSVFISVIAFALFCLF
Sbjct:    1 MKVPIKSHEILTKDITDNEVKSIYDNDMSTSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLINGQGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWASIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLTQSDLSIKPKLQKSGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCSSSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNVNENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSDFNSFVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSYQYYGQYQCSCLVADETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLKQKTLKNTYITVKILSYSLIAGININASIPIANAETEFQTPGAPTELNIKAFDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSSDKYNLKQNIFVQFIKINEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVALINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTLRTITVNNTLALDVKGIEINMKYVDFNSDAMKPEMDEVNVEIQFVSKMINDFIPCGKVLVSITAPGKTAKKMVLQKSNITIPVSVVGSSVKLVLKLEIEGIKGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVTTETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLYKNNKRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIKNVDFTIESDTVYNQIKEIKEIEKFKYQINKLVPGRKYKLTAEVNYNFGPSDKISEVYNFETADDISINQAKHIAQVNDHVNITCTADLGPVDKYRKTIEWRRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFGLAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDNLPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMKASQQFTLYGKEKLVITVKKSSGLIGEITSLRCKADLATKNQKVVWETNSVFDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPKTKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETSVKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAKTELLTSKSISLSVDVSKVLAVLNPCTDCPKKSAHAAYNFNFPKSVFISVIAFALFCLF 1386          
The following BLAST results are available for this feature:
BLAST of Surface protein vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 4
Match NameE-valueIdentityDescription
Dscam15.223e-622.15gene:FBgn0033159 transcript:FBtr0111085[more]
Dscam15.312e-622.15gene:FBgn0033159 transcript:FBtr0111084[more]
Dscam16.617e-621.85gene:FBgn0033159 transcript:FBtr0111050[more]
Dscam16.780e-621.85gene:FBgn0033159 transcript:FBtr0111096[more]
back to top
BLAST of Surface protein vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
G7Y9G37.058e-5222.38GPI-anchored surface glycoprotein OS=Clonorchis si... [more]
A0A4S2M2V46.025e-4522.53Uncharacterized protein OS=Opisthorchis felineus O... [more]
Q266076.127e-4423.04Surface protein (Fragment) OS=Schistosoma mansoni ... [more]
A0A3Q0KCT51.091e-4323.04200-kDa GPI-anchored surface glycoprotein OS=Schis... [more]
A0A1S8X8374.471e-3921.92Immunoglobulin domain protein (Fragment) OS=Opisth... [more]
back to top
BLAST of Surface protein vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Surface protein vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 3
Match NameE-valueIdentityDescription
SMESG000064752.10.000e+099.22SMESG000064752.1[more]
SMESG000064752.10.000e+099.21SMESG000064752.1[more]
SMESG000064752.10.000e+099.21SMESG000064752.1[more]
back to top
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30003501 ID=SMED30003501|Name=Surface protein|organism=Schmidtea mediterranea sexual|type=transcript|length=4907bp
TTTAATTTTGAGTTCTTTATTAATTCGAACAGTAACTGCAATAGTGAAAG
TTCAACAAAATATTCTCTTTAATATTTTATCATGAAGAATAATTTCATTA
TTTTTATACTTGGATTTTGTAATATTGGAATGTTAATTGGAGAAACAATG
AAAGTTCCAATCAAATCACACGAGATCTTGACAAAAGACATTACAGACAA
TGAAGTGAAAAGTATTTATGATAATGATATGAGCACTTCTGGATCTTTTT
ATCATGTATCATCAAAATATGTCGACGTTGATTTTGATTTGGGCTCAAAT
TTTTTAATAGAAAAAATCGTGGCAACAATCAAATCTGAATCACCAGATAT
TCGATTTGTTGTAGGGAATCTCATTAATGGTCAAGGAGGATTGAATTATA
TTTGCTCAAAGAAAACCGATAACAATAATGCTTACGAATGCATTCCTAAA
TGTCAGAAAGACGATTTAACACGAAACGGAATTATCTCCAGAAAAATAAA
AATGTCATTTTTGGTTAAAAGTAGCTGGGCATCTATTTATGAAATTGAAG
TTTATGCCACAAAGAAATTTACTGCATCGAATTTAATTTCGAACAATAAA
TTGGTTTTACCCGATAATTGTGAAGAAAAATCTTGGCTTTTGACCAAATC
CGATCTGTCTATAAAACCAAAATTACAAAAATCTGGAATTATGATTTTCT
TAGAAAAAAGCTATATTATCAATGAAATCGATGTCATATTTCAGGGAAAA
GCACCAGAAAATCTGAAAATTCAATTATCGAATGTTGAGAAAAAACAAAT
CGATATTGTCATAGACATAACCAAGTGCAGTTCTTCAAACTCAAAATTCA
CATGCCAAATTGATCGGCTCAAATTTCCGACAAGAATAGTAACAATTCTT
AGCAATGGTGTCGCTCAATTGTTTTTACGAGGGAAACCCGTTTTCTACCC
CTCATTGCAAGTGAGTGCCTCCAGCAATGTTAATGAAAATCTCAAGCTCA
CTTGCAAAGCTATTCCATGTGACCTATCGGTTCACAGCGATTGCTCGACT
GATGCCATGAAGTCCGGTGTGCAATTTCCCTGTTCCAATAAACGATTGGT
GAGATTGCCGAGTGGATCGGATTTGAATTCGTTTGTCATTATTAATCATG
GGAAACTGGAAAAAATCTGGTCGAAAACTAAATTTGTTGTCAATTCTAAT
ACGGATGAAAAAATGATTGTTTCAATGCCTAGAAGTTACCAATACTACGG
CCAATATCAATGCAGTTGTCTTGTTGCAGATGAAACAAACACAGAAATTA
CTTCAAAGTCTACTTCTCTAGCTGAAAGCGATTTTCTTGAAAATCTCCTA
TTCAAAACCGAGATATTGTATAAAACTGACGAGGACTCTCAATTCTTTAA
TAAATCTATCGATAATGTGATCGTGTTGAAACAGAAAACACTGAAAAATA
CATATATCACCGTAAAAATACTATCATATAGTCTCATTGCTGGCATTAAC
ATAAATGCTTCAATTCCAATTGCTAACGCCGAAACGGAATTTCAGACGCC
TGGTGCTCCAACTGAACTCAATATCAAAGCATTTGACGACAACTCCGAGT
CATGGTCCGTGAAAACATACAAGTATGAAATAGAAGAAGCCTTTCCTGAA
TTCGATAAACGAGGAATCAATGTCAGCCACTCCACGGATAAATATAACTT
AAAACAAAATATTTTTGTACAATTTATAAAAATCAACGAGGACGACGTGA
CGGAACCGGAGGTCCTACAATCAATCACTGATAAATCAATCTCGATCAAA
GTCATTTATATGGGTGATAAATCAATAAGACTCACAAAATCCGTAATAAC
ATGCACAAAGTCAGAAAACAAAGAAGTTGCGCTTATAAACCGAACCAAAT
GCATGACATTCAATGAGGAAGCATCGTGTTCGAATCCAGATCAAGTAAAT
ATGATAACAATATCCAAAAACCGAAATATTGCCAATAAATATCTCTATTC
CATTATTGAAAATGCGTCGAACAAAACGTTACGGACAATTACAGTCAATA
ATACATTAGCATTGGATGTAAAAGGTATTGAAATTAACATGAAGATTGTT
GATTTTAATTCTGATGCAATGAAACCGGAAATGGATGAAGTTAATGTAGA
AATTCAATTTGTTTCAAAGATGATTAATGATTTTATTCCATGTGGAAAAG
TGCTGGTATCAATTACTGCGCCCGGAAAACCTGCAAAGAAAATGGTTTTA
CAAAAATCAAATATAACAATTCCCGTAAGTGTCGTCGGGTCTTCTGTGAA
ATTAGTTCTTAAATTGGAAATCGAAGGAATCAAAGGACTTTACGAAACTG
TGCTTAATGTTATCAATCCAACTTCCTTGGCAGTTAAGGATTTACGAATT
CAAGACAAAACCAAAATAATTTTATTATGGAACGGCATTGAAAAGCTAGC
CCAAGATTTGTTGGTAGGATATGAAGTAACTACCGAAACTACGGTAATCG
CGTGTAATTCCGCACAGAAAAAAGTAATAACAATCAAAAAATCAGACAAT
CAAAACTCATATGAAACCAAATTGGATTTGTTGAGTGAAAAGTTTTCTTC
GAAATTGCTTGTAACCGTAGTACCTTTATACAAAAACAGCAAACGAGGAC
AAAAGAAAGAGATAGAAAAACTATCAACTATTTCAAAAAGTACTAAACTT
GTAATTGATGGCGAAGTTCTACCGAAACAACGAATAATCACTCTGATTCC
GGCTCAATCAGATCTTCCATGTTCTATAAAAAAGGTCGATTTCACAATAG
AATCAGATACGGTTTACAATCAAATCAAAGAAATAAAAGAAATCGAAAAA
TTCAAATACCAAATCAATAAATTGGTGCCAGGAAGAAAATACAAATTAAC
CGCTGAAGTGAACTACAATTTTGGGCCTTCGGATAAAATCTCTGAAGTCT
ACAATTTCGAAACTGCTGATGATATCTCTATCAATCAAGCGAAGCACATT
GCTCAGGTTAATGATCATGTCAACATCACATGCACTGCTAATCTGGGACC
TGTGGATAAATATCGAAAAACCATTGAATGGCGAAGAGCCGATAATTCAC
CGCTTCCCGAAGGTGTTCTTTCATATAACAATCCAGATTTTCCACAATCA
GCTTTTTTAATAATGAAGAAGGCCTTGGCTGATTATTCTGGAGACTACGG
CTGTTTCACAAAACCAGCAGCAAGTTTTGGCCTGGCCAAAATTACAATTG
TCATCAACGATTTAATATTAAAACCTAAAAAAGTAAAAGTTGATTTTACA
AAGCCTCTGGAACCAATATCAATAAATTGCTATAGTTCCACTATAGGAAA
TGGCAATCTTGAATGGACAGACTCGGATAACTTACCAATTTCTAATAAAG
AAAATGAGCTTTATAAAATACAACAAAATATTACAATGGAAAAAGCAATC
TCATCTATTTTAACAATAAAAAAACCTTCATTGAACACCTCAGGCACCTA
CACTTGTCTTAAATCAATCGGTGAAATGAAAGCATCTCAGCAGTTCACAC
TTTATGGCAAAGAAAAGCTTGTAATTACTGTGAAAAAGAGTTCAGAATTA
ATTGGTGAAATTACATCATTGAGATGCAAAGCAGATCTTGCAACAAAAAA
TCAAAAAGTAGTTTGGGAAACTAATAGCGTTTTTGATAACAAAAAGAAAC
TCGTTTCTACTTCATATCCCAATTTACTAGATAATAGCATTCAAGACATA
GATATCGTCAATAATTTGAAAGTCCATCAAGTTTCATGCAAAATTGTGCC
GTCTTTAGAAGCTAACGCAAAGCAACCTAAGACTAAGATATATTCACTAA
GCCTTACACCAAAAATCCTATTGGACGGTCCCACTAATGTTACTGCTGGC
TCAAATATCAGTTTGACGTGCACAGGATATCCAACATATGCAACCACAGA
TCTTCAATGGGTGTTTTATAATGAAACATCGGTGAAACCCACAAGTGGCA
ATGTGGTTTTATCATCTAAAAATCAACCGGTCTCCAAAATTTATGAGCCA
GTTAAATTAGTTTTATCATTAAACAATGTTAAAAACTCCGAGTCCGGTCG
GTACGAATGCAAATTTAGAACAAGAGCAAAAACCGAACTTTTGACAAGCA
AAAGTATCAGTCTATCTGTGGACGTAAGCAAGGTTTTAGCTGTGCTCGAT
CCATGTACCGATTGCCCTAAAAAGAGTGCTCACACAGCTTATAATTTTAA
TTTCCCCAAATCAGTTTTCATTTCAGTGATTGCTTTTGCACTATTTTGTT
TATTTTAATAATCAACAAAATGAGTAAATTCGAGTTTTCATTACAATTGT
TATTGCAATATGTAAACGTTGTCTTTTTTTGTACAAAAGCACTTTTCCAA
ATATTGACTTATATTTTTGTATATTTTTCTATGTTTAATTTTTAAAAATG
ATTGCATTTGAAAAGTTAAAAGATTTGTTTGAAGTCGATAAATTGTGAAC
CCTGAAATTTGGTTTTGTTTTTCTGCTTTTTATTTTGATGCATTCTTTAT
GTTTTTGCTGCAGAATCATGGAATGGTGAATAGTTGAAGAATCTTTGATT
TCCGTTGACCAATTCTATTAACAATTTGACTGATATATCCTTTTTGCTGT
ACATTGTGTGTAGATGATTTATTTTAATTGCTGCTGGTGTCATTTAGTTG
ATTTTAAATTAACATTATTCTTATTATTAACTATAATTAATTATTAATAT
ATCTGCTGGTGCCTCTCTATATGCCTATTTCTGATTCTAGATTGTCTAAA
ATATATGGTTAAAATATTTTATGAAGTGTATTAGTGTTTTAATGTCCTGC
CTTTTAATTTTATTTCAACAAATTTATGTAATGTAGAGACATTATAATAT
GCCAAGG
back to top

protein sequence of SMED30003501-orf-1

>SMED30003501-orf-1 ID=SMED30003501-orf-1|Name=SMED30003501-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=1409bp
MKNNFIIFILGFCNIGMLIGETMKVPIKSHEILTKDITDNEVKSIYDNDM
STSGSFYHVSSKYVDVDFDLGSNFLIEKIVATIKSESPDIRFVVGNLING
QGGLNYICSKKTDNNNAYECIPKCQKDDLTRNGIISRKIKMSFLVKSSWA
SIYEIEVYATKKFTASNLISNNKLVLPDNCEEKSWLLTKSDLSIKPKLQK
SGIMIFLEKSYIINEIDVIFQGKAPENLKIQLSNVEKKQIDIVIDITKCS
SSNSKFTCQIDRLKFPTRIVTILSNGVAQLFLRGKPVFYPSLQVSASSNV
NENLKLTCKAIPCDLSVHSDCSTDAMKSGVQFPCSNKRLVRLPSGSDLNS
FVIINHGKLEKIWSKTKFVVNSNTDEKMIVSMPRSYQYYGQYQCSCLVAD
ETNTEITSKSTSLAESDFLENLLFKTEILYKTDEDSQFFNKSIDNVIVLK
QKTLKNTYITVKILSYSLIAGININASIPIANAETEFQTPGAPTELNIKA
FDDNSESWSVKTYKYEIEEAFPEFDKRGINVSHSTDKYNLKQNIFVQFIK
INEDDVTEPEVLQSITDKSISIKVIYMGDKSIRLTKSVITCTKSENKEVA
LINRTKCMTFNEEASCSNPDQVNMITISKNRNIANKYLYSIIENASNKTL
RTITVNNTLALDVKGIEINMKIVDFNSDAMKPEMDEVNVEIQFVSKMIND
FIPCGKVLVSITAPGKPAKKMVLQKSNITIPVSVVGSSVKLVLKLEIEGI
KGLYETVLNVINPTSLAVKDLRIQDKTKIILLWNGIEKLAQDLLVGYEVT
TETTVIACNSAQKKVITIKKSDNQNSYETKLDLLSEKFSSKLLVTVVPLY
KNSKRGQKKEIEKLSTISKSTKLVIDGEVLPKQRIITLIPAQSDLPCSIK
KVDFTIESDTVYNQIKEIKEIEKFKYQINKLVPGRKYKLTAEVNYNFGPS
DKISEVYNFETADDISINQAKHIAQVNDHVNITCTANLGPVDKYRKTIEW
RRADNSPLPEGVLSYNNPDFPQSAFLIMKKALADYSGDYGCFTKPAASFG
LAKITIVINDLILKPKKVKVDFTKPLEPISINCYSSTIGNGNLEWTDSDN
LPISNKENELYKIQQNITMEKAISSILTIKKPSLNTSGTYTCLKSIGEMK
ASQQFTLYGKEKLVITVKKSSELIGEITSLRCKADLATKNQKVVWETNSV
FDNKKKLVSTSYPNLLDNSIQDIDIVNNLKVHQVSCKIVPSLEANAKQPK
TKIYSLSLTPKILLDGPTNVTAGSNISLTCTGYPTYATTDLQWVFYNETS
VKPTSGNVVLSSKNQPVSKIYEPVKLVLSLNNVKNSESGRYECKFRTRAK
TELLTSKSISLSVDVSKVLAVLDPCTDCPKKSAHTAYNFNFPKSVFISVI
AFALFCLF*
back to top
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
TermDefinition
PLANA:0000099neuron
PLANA:0003116parenchymal cell
Vocabulary: INTERPRO
TermDefinition
IPR036179Ig-like_dom_sf
IPR003598Ig_sub2
IPR003599Ig_sub
IPR013783Ig-like_fold
IPR007110Ig-like_dom
Vocabulary: molecular function
TermDefinition
GO:0005515protein binding
Vocabulary: cellular component
TermDefinition
GO:0016020membrane
GO:0016021integral component of membrane
InterPro
Analysis Name: Schmidtea mediteranean smed_20140614 Interproscan
Date Performed: 2020-05-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003599Immunoglobulin subtypeSMARTSM00409IG_3ccoord: 969..1057
e-value: 0.058
score: 22.5
coord: 1265..1364
e-value: 2.9E-4
score: 30.2
coord: 1068..1160
e-value: 12.0
score: 10.7
IPR003598Immunoglobulin subtype 2SMARTSM00408igc2_5coord: 975..1048
e-value: 1.1
score: 6.6
coord: 1074..1149
e-value: 2.3
score: 3.6
coord: 1271..1350
e-value: 0.069
score: 18.2
IPR013783Immunoglobulin-like foldGENE3DG3DSA:2.60.40.10coord: 1057..1159
e-value: 1.9E-5
score: 26.7
IPR013783Immunoglobulin-like foldGENE3DG3DSA:2.60.40.10coord: 1250..1371
e-value: 1.8E-9
score: 39.5
IPR007110Immunoglobulin-like domainPROSITEPS50835IG_LIKEcoord: 1260..1360
score: 10.353
IPR007110Immunoglobulin-like domainPROSITEPS50835IG_LIKEcoord: 1065..1156
score: 7.776
NoneNo IPR availableSIGNALP_EUKSignalP-noTMSignalP-noTMcoord: 1..20
score: 0.636
IPR036179Immunoglobulin-like domain superfamilySUPERFAMILYSSF48726Immunoglobulincoord: 968..1144
IPR036179Immunoglobulin-like domain superfamilySUPERFAMILYSSF48726Immunoglobulincoord: 1266..1363