Protein split ends

Overview
Name: Protein split ends
Smed ID: SMED30021224
Length (bp): 8488
Neoblast Clusters

Zeng et al., 2018





 



 

Overview

 

Single-cell RNA-seq of pluripotent neoblasts and their early progeny


We isolated X1 neoblast cells enriched for high piwi-1 expression (Neoblast Population) and profiled ∼7,614 individual cells by scRNA-seq. Unsupervised analyses uncovered 12 distinct classes among 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them from high (Nb1) to low (Nb12) piwi-1 expression. We further defined the groups of genes that best classified the cells in each of the 12 clusters and used them to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster's gene signature was validated by multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1, which largely confirmed the cell clusters revealed by scRNA-seq.
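The ordering rule described above (clusters ranked from highest to lowest piwi-1 expression, then named Nb1 onward) can be sketched as follows. This is a minimal illustration only: the cluster IDs and expression values are invented toy data, the function names are hypothetical, and ranking by mean expression is an assumption, since the summary statistic used in the study is not stated here.

```python
from statistics import mean

def order_clusters_by_marker(expression_by_cluster):
    """Return cluster IDs sorted from highest to lowest mean marker expression.

    expression_by_cluster maps a cluster ID to the per-cell expression
    values of the marker gene (piwi-1 in the text above).
    """
    means = {cid: mean(values) for cid, values in expression_by_cluster.items()}
    return sorted(means, key=means.get, reverse=True)

def rename_clusters(ordered_ids, prefix="Nb"):
    """Assign rank-based names (Nb1, Nb2, ...) following the given order."""
    return {cid: f"{prefix}{rank}" for rank, cid in enumerate(ordered_ids, start=1)}

# Toy data (hypothetical): three unnamed clusters with per-cell marker expression.
toy_expression = {
    "a": [2.0, 3.0],
    "b": [5.0, 6.0],
    "c": [0.5, 1.0],
}

ordered = order_clusters_by_marker(toy_expression)
labels = rename_clusters(ordered)
```

With the toy values, cluster "b" has the highest mean and receives the first label, and cluster "c" the last.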

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, then isolated 1,200 individual cells derived from the X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations and profiled them by single-cell RNA-seq (Sub-lethal Irradiated Surviving X1 and X2 Cell Population).




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of Protein split ends (SMED30021224) in the t-SNE clustered cells.

Violin plots show distribution of expression levels for Protein split ends (SMED30021224) in cells (dots) of each of the 12 neoblast clusters.

 



 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Protein split ends (SMED30021224) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show the distribution of expression levels for Protein split ends (SMED30021224) in cells (dots) of each of the 10 clusters of sub-lethally irradiated X1 and X2 cells.

 



 

Embryonic Expression

Davies et al., 2017




The graph shows average RPKM values per sample.

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods.
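For readers unfamiliar with the unit, the sketch below applies the standard RPKM definition (reads per kilobase of transcript per million mapped reads) and averages it over replicate samples. The transcript length of 8488 bp is taken from the Overview; the read counts, library sizes, and function names are hypothetical, and the exact quantification pipeline used for this dataset is not described here.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads Per Kilobase of transcript per Million mapped reads."""
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and library size must be positive")
    # reads / (length in kb) / (library size in millions)
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)

def average_rpkm(replicates, transcript_length_bp):
    """Average RPKM over (read_count, total_mapped_reads) replicate pairs."""
    values = [rpkm(reads, transcript_length_bp, total) for reads, total in replicates]
    return sum(values) / len(values)

TRANSCRIPT_LENGTH_BP = 8488  # length listed in the Overview

# Hypothetical replicate samples for one stage: (reads on transcript, mapped reads).
hypothetical_replicates = [(8488, 10_000_000), (16976, 10_000_000)]

stage_average = average_rpkm(hypothetical_replicates, TRANSCRIPT_LENGTH_BP)
```

Because RPKM divides by transcript length, a long transcript such as this one needs proportionally more reads than a short one to reach the same value.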

 

Anatomical Expression

PAGE et al., 2020




SMED30021224 has been reported as being expressed in these anatomical structures and/or regions.



PAGE Curations: 6

  
Expressed In | Reference Transcript | Gene Models | Published Transcript | Transcriptome | Publication | Specimen | Lifecycle | Evidence
nervous system | SMED30021224 | h1SMcG0021544 | dd_Smed_v4_4709_0_1 | dd_Smed_v4 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
neoblast | SMED30021224 | h1SMcG0021544 | dd_Smed_v4_4709_0_1 | dd_Smed_v4 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
parenchymal cell | SMED30021224 | h1SMcG0021544 | dd_Smed_v4_4709_0_1 | dd_Smed_v4 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
nervous system | SMED30021224 | h1SMcG0021544 | dd_Smed_v4_4709_0_1 | dd_Smed_v4 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
neoblast | SMED30021224 | h1SMcG0021544 | dd_Smed_v4_4709_0_1 | dd_Smed_v4 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
parenchymal cell | SMED30021224 | h1SMcG0021544 | dd_Smed_v4_4709_0_1 | dd_Smed_v4 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
Homology
BLAST of Protein split ends vs. Ensembl Human
Match: SPEN (spen family transcriptional repressor [Source:HGNC Symbol;Acc:HGNC:17575])

HSP 1 Score: 137.887 bits (346), Expect = 2.376e-31
Identity = 81/173 (46.82%), Positives = 107/173 (61.85%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGS-LKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            + YP+VW+G L+LKN+ A V LHFV GN  L +          SL  +  G  L+I QRMRL+ +QLEGV RRM    ++C+ LALP G D    V Q++ L+  FI Y++ K AAGIINV +P  S Q  YV+ IFPPC+FS + L   APDL   I  N  P+L++VI +V
Sbjct: 3503 KKYPIVWQGLLALKNDTAAVQLHFVSGNNVLAHR---------SLPLSEGGPPLRIAQRMRLEATQLEGVARRMTVETDYCLLLALPCGRDQEDVVSQTESLKAAFITYLQAKQAAGIINVPNP-GSNQPAYVLQIFPPCEFSESHLSRLAPDLLASIS-NISPHLMIVIASV 3664          
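The Query Frame field, together with query coordinates on a nucleotide scale, suggests a translated search in which the transcript is read in a forward reading frame (an inference from the output format, not something the page states). Under that assumption, the sketch below relates the query coordinates of HSP 1 above to the reported frame and alignment length. The function names are hypothetical.

```python
def forward_frame(query_start):
    """Forward reading frame (1, 2 or 3) implied by a 1-based nucleotide start."""
    return (query_start - 1) % 3 + 1

def codons_spanned(query_start, query_end):
    """Number of whole codons covered by an inclusive nucleotide interval."""
    span = query_end - query_start + 1
    if span % 3 != 0:
        raise ValueError("interval does not cover a whole number of codons")
    return span // 3

def query_gap_columns(query_start, query_end, alignment_length):
    """Alignment columns in which the translated query carries a gap."""
    return alignment_length - codons_spanned(query_start, query_end)

# Values from HSP 1 above: Query 7718..8233, alignment length 173, frame 2.
hsp1_frame = forward_frame(7718)
hsp1_codons = codons_spanned(7718, 8233)
hsp1_query_gaps = query_gap_columns(7718, 8233, 173)
```

The interval covers 516 nucleotides, that is 172 codons, so the alignment length of 173 implies a single gap column in the query, which matches the one "-" visible in the Query line.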

HSP 2 Score: 99.3673 bits (246), Expect = 1.332e-19
Identity = 84/288 (29.17%), Positives = 143/288 (49.65%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-------------NCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSI-TSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GIKV  L ++ +++S++  L  +F    K        + S+ +    G S  R+ +V F    D EKAL  ASK +         LF   ++ +TA                + +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK    +   A +Q+ DI+ V K        +K + G  +  +     FG S PTNC+WL  +S+++++  +T +F R+G V++VV D  +  ALV  +    AQ ++   K +   +G  ++++D+A+ E Q
Sbjct:  336 GIKVQNLPVRSTDTSLKDGLFHEFKKFGK--------VTSVQIH---GTSEERYGLVFFRQQEDQEKALT-ASKGK---------LFFGMQIEVTAWIGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYHDLRNIFQRFGEIVDIDIKKVNGVPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTNCVWLDGLSSNVSDQYLTRHFCRYGPVVKVVFDRLKGMALVLYNEIEYAQAAVKETKGRK--IGGNKIKVDFANRESQ 592          
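Each HSP in this listing carries the same two summary lines (score and E-value, then identity and positives counts), so they are easy to extract programmatically. The following is a minimal sketch that parses that line pair using only the Python standard library. The regular expressions are written against the layout shown on this page and are an assumption about the format, not an official BLAST parser; the function name is hypothetical.

```python
import re

# "HSP 1 Score: 137.887 bits (346), Expect = 2.376e-31"
HSP_RE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s+([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s*=\s*([\d.eE+-]+)"
)
# "Identity = 81/173 (46.82%), Positives = 107/173 (61.85%), Query Frame = 2"
# "Posi?tives" tolerates the label with or without its second "i".
IDENT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s+\([\d.]+%\),\s*Posi?tives\s*=\s*(\d+)/(\d+)"
    r"\s+\([\d.]+%\),\s*Query Frame\s*=\s*([+-]?\d+)"
)

def parse_hsp_summary(score_line, identity_line):
    """Parse the two summary lines of one HSP into a dictionary."""
    hsp = HSP_RE.search(score_line)
    ident = IDENT_RE.search(identity_line)
    if hsp is None or ident is None:
        raise ValueError("lines do not match the expected HSP summary layout")
    identical, length, positives, _ = (int(ident.group(i)) for i in range(1, 5))
    return {
        "hsp": int(hsp.group(1)),
        "bits": float(hsp.group(2)),
        "raw_score": int(hsp.group(3)),
        "evalue": float(hsp.group(4)),
        "alignment_length": length,
        # Percentages are recomputed from the counts rather than read from the text.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
        "query_frame": int(ident.group(5)),
    }

summary = parse_hsp_summary(
    "HSP 1 Score: 137.887 bits (346), Expect = 2.376e-31",
    "Identity = 81/173 (46.82%), Positives = 107/173 (61.85%), Query Frame = 2",
)
```

Recomputing the percentages from the raw counts gives a quick consistency check against the values printed in parentheses.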
BLAST of Protein split ends vs. Ensembl Human
Match: SPEN (spen family transcriptional repressor [Source:HGNC Symbol;Acc:HGNC:17575])

HSP 1 Score: 110.538 bits (275), Expect = 1.558e-23
Identity = 84/288 (29.17%), Positives = 144/288 (50.00%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-------------NCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GIKV  L ++ +++S++  L  +F    K        + S+ +    G S  R+ +V F    D EKAL  ASK +         LF   ++ +TA                + +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK    +   A +Q+ DI+ V K        +K + G  + ++     FG S PTNC+WL  +S+++++  +T +F R+G V++VV D  +  ALV  +    AQ ++   K +   +G  ++++D+A+ E Q
Sbjct:  268 GIKVQNLPVRSTDTSLKDGLFHEFKKFGK--------VTSVQIH---GTSEERYGLVFFRQQEDQEKAL-TASKGK---------LFFGMQIEVTAWIGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYHDLRNIFQRFGEIVDIDIKKVNGVPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTNCVWLDGLSSNVSDQYLTRHFCRYGPVVKVVFDRLKGMALVLYNEIEYAQAAVKETKGRK--IGGNKIKVDFANRESQ 524          
BLAST of Protein split ends vs. Ensembl Human
Match: SPEN (spen family transcriptional repressor [Source:HGNC Symbol;Acc:HGNC:17575])

HSP 1 Score: 98.2117 bits (243), Expect = 3.753e-21
Identity = 76/251 (30.28%), Positives = 125/251 (49.80%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-------------NCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALV 1351
            GIKV  L ++ +++S++  L  +F    K        + S+ +    G S  R+ +V F    D EKAL  ASK +         LF   ++ +TA                + +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK    +   A +Q+ DI+ V K        +K + G  + ++     FG S PTNC+WL  +S+++++  +T +F R+G V++VV D  +  ALV
Sbjct:   76 GIKVQNLPVRSTDTSLKDGLFHEFKKFGK--------VTSVQIH---GTSEERYGLVFFRQQEDQEKAL-TASKGK---------LFFGMQIEVTAWIGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYHDLRNIFQRFGEIVDIDIKKVNGVPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTNCVWLDGLSSNVSDQYLTRHFCRYGPVVKVVFDRLKGMALV 297          
BLAST of Protein split ends vs. Ensembl Human
Match: RBM15 (RNA binding motif protein 15 [Source:HGNC Symbol;Acc:HGNC:14959])

HSP 1 Score: 64.3142 bits (155), Expect = 3.727e-9
Identity = 55/176 (31.25%), Positives = 85/176 (48.30%), Query Frame = 2
Query: 7730 LVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNA--NEFCIGLALPG--------GDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENS-QQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVI 8224
            L W+G L LKN      +H + G+ ++ ++    L+  GS        LKI QR+RLD  +L+ V RR++ A  N + I LA+PG            +     +Q      + Y++ K AAG+I++    N  ++   V+H FPPC+FS   L S A  L    E     + LV+I
Sbjct:  742 LAWQGMLLLKNSNFPSNMHLLQGDLQVASS----LLVEGSTG-GKVAQLKITQRLRLDQPKLDEVTRRIKVAGPNGYAILLAVPGSSDSRSSSSSAASDTATSTQRPLRNLVSYLKQKQAAGVISLPVGGNKDKENTGVLHAFPPCEFSQQFLDSPAKALAKSEE-----DYLVMI 907          
BLAST of Protein split ends vs. Ensembl Human
Match: RBM15 (RNA binding motif protein 15 [Source:HGNC Symbol;Acc:HGNC:14959])

HSP 1 Score: 64.3142 bits (155), Expect = 3.901e-9
Identity = 55/176 (31.25%), Positives = 85/176 (48.30%), Query Frame = 2
Query: 7730 LVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNA--NEFCIGLALPG--------GDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENS-QQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVI 8224
            L W+G L LKN      +H + G+ ++ ++    L+  GS        LKI QR+RLD  +L+ V RR++ A  N + I LA+PG            +     +Q      + Y++ K AAG+I++    N  ++   V+H FPPC+FS   L S A  L    E     + LV+I
Sbjct:  742 LAWQGMLLLKNSNFPSNMHLLQGDLQVASS----LLVEGSTG-GKVAQLKITQRLRLDQPKLDEVTRRIKVAGPNGYAILLAVPGSSDSRSSSSSAASDTATSTQRPLRNLVSYLKQKQAAGVISLPVGGNKDKENTGVLHAFPPCEFSQQFLDSPAKALAKSEE-----DYLVMI 907          
BLAST of Protein split ends vs. Ensembl Celegans
Match: din-1 (Daf-12-interacting protein 1 [Source:UniProtKB/Swiss-Prot;Acc:G5EGK6])

HSP 1 Score: 70.8626 bits (172), Expect = 3.570e-13
Identity = 50/166 (30.12%), Positives = 85/166 (51.20%), Query Frame = 2
Query: 7730 LVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVIT 8227
            +VW G+L+LK+ EA + LH ++G+   LN  +   +   + + +   S+KI+QR+RLD  Q+E + R + N  E+   LAL   ++I    +    L+  FI Y+ +K  AGI ++   E   +    VH+F P +     L   A  L + ++      LL+V T
Sbjct:    1 MVWTGRLALKSTEAMINLHLINGSETFLNDVLGRQVTEENPRRD---SVKILQRLRLDNGQVEHIYRILTNP-EYACCLALSSVNNIENLKENDTNLKSHFIDYLINKKIAGISSLGEVETKFKSAR-VHVFAPGEIVNRYLSELATSLHDYLQNTDTRYLLIVFT 161          
BLAST of Protein split ends vs. Ensembl Celegans
Match: din-1 (Daf-12-interacting protein 1 [Source:UniProtKB/Swiss-Prot;Acc:G5EGK6])

HSP 1 Score: 74.3294 bits (181), Expect = 8.696e-13
Identity = 53/169 (31.36%), Positives = 91/169 (53.85%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLST-APDLQNIIEVNKFPNLLVVIT 8227
            +P+VW G+L+LK+ EA + LH ++G+   LN  +   +   + + +   S+KI+QR+RLD  Q+E + R + N  E+   LAL   ++I    +    L+  FI Y+ +K  AGI ++   E   +    VH+F P +  +N+ LS  A  L + ++      LL+V T
Sbjct:  380 FPMVWTGRLALKSTEAMINLHLINGSETFLNDVLGRQVTEENPRRD---SVKILQRLRLDNGQVEHIYRILTNP-EYACCLALSSVNNIENLKENDTNLKSHFIDYLINKKIAGISSLGEVETKFKSAR-VHVFAPGEI-VNRYLSELATSLHDYLQNTDTRYLLIVFT 542          
BLAST of Protein split ends vs. Ensembl Celegans
Match: din-1 (Daf-12-interacting protein 1 [Source:UniProtKB/Swiss-Prot;Acc:G5EGK6])

HSP 1 Score: 71.633 bits (174), Expect = 1.027e-11
Identity = 53/171 (30.99%), Positives = 92/171 (53.80%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLST-APDLQNIIEVNKFPNLLVVIT 8227
            + +P+VW G+L+LK+ EA + LH ++G+   LN  +   +   + + +   S+KI+QR+RLD  Q+E + R + N  E+   LAL   ++I    +    L+  FI Y+ +K  AGI ++   E   +    VH+F P +  +N+ LS  A  L + ++      LL+V T
Sbjct: 2022 KHFPMVWTGRLALKSTEAMINLHLINGSETFLNDVLGRQVTEENPRRD---SVKILQRLRLDNGQVEHIYRILTNP-EYACCLALSSVNNIENLKENDTNLKSHFIDYLINKKIAGISSLGEVETKFKSAR-VHVFAPGEI-VNRYLSELATSLHDYLQNTDTRYLLIVFT 2186          
BLAST of Protein split ends vs. Ensembl Fly
Match: spen (gene:FBgn0016977 transcript:FBtr0332336)

HSP 1 Score: 152.525 bits (384), Expect = 5.749e-36
Identity = 79/170 (46.47%), Positives = 109/170 (64.12%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP++W+G L+LK ++A V +HFVHGN  +  A +  L+        NT  L+I QRMRL+ +QLEGV ++MQ   E C+ LALP G D    ++ S+ L+ GFI Y++ K AAGI+N+  P  S+Q  YVVHIFP CDF+   L   APDL+N   V +  +LL+VI TV
Sbjct: 5327 YPVMWQGLLALKTDQAAVQMHFVHGNPNVARASLPSLV------ETNTPLLRIAQRMRLEQTQLEGVAKKMQVDKEHCMLLALPCGRDHADVLQHSRNLQTGFITYLQQKMAAGIVNIPIP-GSEQAAYVVHIFPSCDFANENLERAAPDLKN--RVAELAHLLIVIATV 5487          

HSP 2 Score: 126.331 bits (316), Expect = 4.455e-28
Identity = 90/292 (30.82%), Positives = 148/292 (50.68%), Query Frame = 2
Query:  614 PLGHSQG--PRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQ------SVELHPGYDLFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            P+ HS+   P  I+V  L  + S++S++  L  ++    K           +      G +  R+A+V F    D EKAL+++           VE + GYD+ D +     A     LDE+HPK++RTL I NL + +   + L  +F   G++++I+IK K  L+A A  Q+SDI  VVK        ++ + G  + S+     FG S PTNC+W+  +   ++E+ +   F+RFG V +V  D  +Q ALV  D    AQ ++  +  + + L   ++Q+D+AS ECQ
Sbjct:  489 PVVHSEDNRPLAIRVRNLPARSSDTSLKDGLFHEYKKHGK-----------VTWVKVVGQNSERYALVCFKKPDDVEKALEVSHDKHFFGCKIEVEPYQGYDVEDNEFRPYEAE----LDEYHPKSTRTLFIGNLEKDITAGE-LRSHFEAFGEIIEIDIK-KQGLNAYAFCQYSDIVSVVK-------AMRKMDGEHLGSNRIKLGFGKSMPTNCVWIDGVDEKVSESFLQSQFTRFGAVTKVSIDRNRQLALVLYDQVQNAQAAVKDM--RGTILRRKKLQVDFASRECQ 754          
BLAST of Protein split ends vs. Ensembl Fly
Match: spen (gene:FBgn0016977 transcript:FBtr0306341)

HSP 1 Score: 152.525 bits (384), Expect = 5.751e-36
Identity = 79/170 (46.47%), Positives = 109/170 (64.12%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP++W+G L+LK ++A V +HFVHGN  +  A +  L+        NT  L+I QRMRL+ +QLEGV ++MQ   E C+ LALP G D    ++ S+ L+ GFI Y++ K AAGI+N+  P  S+Q  YVVHIFP CDF+   L   APDL+N   V +  +LL+VI TV
Sbjct: 5345 YPVMWQGLLALKTDQAAVQMHFVHGNPNVARASLPSLV------ETNTPLLRIAQRMRLEQTQLEGVAKKMQVDKEHCMLLALPCGRDHADVLQHSRNLQTGFITYLQQKMAAGIVNIPIP-GSEQAAYVVHIFPSCDFANENLERAAPDLKN--RVAELAHLLIVIATV 5505          

HSP 2 Score: 126.331 bits (316), Expect = 4.609e-28
Identity = 90/292 (30.82%), Positives = 148/292 (50.68%), Query Frame = 2
Query:  614 PLGHSQG--PRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQ------SVELHPGYDLFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            P+ HS+   P  I+V  L  + S++S++  L  ++    K           +      G +  R+A+V F    D EKAL+++           VE + GYD+ D +     A     LDE+HPK++RTL I NL + +   + L  +F   G++++I+IK K  L+A A  Q+SDI  VVK        ++ + G  + S+     FG S PTNC+W+  +   ++E+ +   F+RFG V +V  D  +Q ALV  D    AQ ++  +  + + L   ++Q+D+AS ECQ
Sbjct:  489 PVVHSEDNRPLAIRVRNLPARSSDTSLKDGLFHEYKKHGK-----------VTWVKVVGQNSERYALVCFKKPDDVEKALEVSHDKHFFGCKIEVEPYQGYDVEDNEFRPYEAE----LDEYHPKSTRTLFIGNLEKDITAGE-LRSHFEAFGEIIEIDIK-KQGLNAYAFCQYSDIVSVVK-------AMRKMDGEHLGSNRIKLGFGKSMPTNCVWIDGVDEKVSESFLQSQFTRFGAVTKVSIDRNRQLALVLYDQVQNAQAAVKDM--RGTILRRKKLQVDFASRECQ 754          
BLAST of Protein split ends vs. Ensembl Fly
Match: spen (gene:FBgn0016977 transcript:FBtr0330652)

HSP 1 Score: 152.525 bits (384), Expect = 5.751e-36
Identity = 79/170 (46.47%), Positives = 109/170 (64.12%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP++W+G L+LK ++A V +HFVHGN  +  A +  L+        NT  L+I QRMRL+ +QLEGV ++MQ   E C+ LALP G D    ++ S+ L+ GFI Y++ K AAGI+N+  P  S+Q  YVVHIFP CDF+   L   APDL+N   V +  +LL+VI TV
Sbjct: 5345 YPVMWQGLLALKTDQAAVQMHFVHGNPNVARASLPSLV------ETNTPLLRIAQRMRLEQTQLEGVAKKMQVDKEHCMLLALPCGRDHADVLQHSRNLQTGFITYLQQKMAAGIVNIPIP-GSEQAAYVVHIFPSCDFANENLERAAPDLKN--RVAELAHLLIVIATV 5505          

HSP 2 Score: 126.331 bits (316), Expect = 4.609e-28
Identity = 90/292 (30.82%), Positives = 148/292 (50.68%), Query Frame = 2
Query:  614 PLGHSQG--PRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQ------SVELHPGYDLFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            P+ HS+   P  I+V  L  + S++S++  L  ++    K           +      G +  R+A+V F    D EKAL+++           VE + GYD+ D +     A     LDE+HPK++RTL I NL + +   + L  +F   G++++I+IK K  L+A A  Q+SDI  VVK        ++ + G  + S+     FG S PTNC+W+  +   ++E+ +   F+RFG V +V  D  +Q ALV  D    AQ ++  +  + + L   ++Q+D+AS ECQ
Sbjct:  489 PVVHSEDNRPLAIRVRNLPARSSDTSLKDGLFHEYKKHGK-----------VTWVKVVGQNSERYALVCFKKPDDVEKALEVSHDKHFFGCKIEVEPYQGYDVEDNEFRPYEAE----LDEYHPKSTRTLFIGNLEKDITAGE-LRSHFEAFGEIIEIDIK-KQGLNAYAFCQYSDIVSVVK-------AMRKMDGEHLGSNRIKLGFGKSMPTNCVWIDGVDEKVSESFLQSQFTRFGAVTKVSIDRNRQLALVLYDQVQNAQAAVKDM--RGTILRRKKLQVDFASRECQ 754          
BLAST of Protein split ends vs. Ensembl Fly
Match: spen (gene:FBgn0016977 transcript:FBtr0330653)

HSP 1 Score: 152.525 bits (384), Expect = 5.801e-36
Identity = 79/170 (46.47%), Positives = 109/170 (64.12%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP++W+G L+LK ++A V +HFVHGN  +  A +  L+        NT  L+I QRMRL+ +QLEGV ++MQ   E C+ LALP G D    ++ S+ L+ GFI Y++ K AAGI+N+  P  S+Q  YVVHIFP CDF+   L   APDL+N   V +  +LL+VI TV
Sbjct: 5350 YPVMWQGLLALKTDQAAVQMHFVHGNPNVARASLPSLV------ETNTPLLRIAQRMRLEQTQLEGVAKKMQVDKEHCMLLALPCGRDHADVLQHSRNLQTGFITYLQQKMAAGIVNIPIP-GSEQAAYVVHIFPSCDFANENLERAAPDLKN--RVAELAHLLIVIATV 5510          

HSP 2 Score: 126.331 bits (316), Expect = 4.347e-28
Identity = 90/292 (30.82%), Positives = 148/292 (50.68%), Query Frame = 2
Query:  614 PLGHSQG--PRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQ------SVELHPGYDLFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            P+ HS+   P  I+V  L  + S++S++  L  ++    K           +      G +  R+A+V F    D EKAL+++           VE + GYD+ D +     A     LDE+HPK++RTL I NL + +   + L  +F   G++++I+IK K  L+A A  Q+SDI  VVK        ++ + G  + S+     FG S PTNC+W+  +   ++E+ +   F+RFG V +V  D  +Q ALV  D    AQ ++  +  + + L   ++Q+D+AS ECQ
Sbjct:  494 PVVHSEDNRPLAIRVRNLPARSSDTSLKDGLFHEYKKHGK-----------VTWVKVVGQNSERYALVCFKKPDDVEKALEVSHDKHFFGCKIEVEPYQGYDVEDNEFRPYEAE----LDEYHPKSTRTLFIGNLEKDITAGE-LRSHFEAFGEIIEIDIK-KQGLNAYAFCQYSDIVSVVK-------AMRKMDGEHLGSNRIKLGFGKSMPTNCVWIDGVDEKVSESFLQSQFTRFGAVTKVSIDRNRQLALVLYDQVQNAQAAVKDM--RGTILRRKKLQVDFASRECQ 759          
BLAST of Protein split ends vs. Ensembl Fly
Match: spen (gene:FBgn0016977 transcript:FBtr0078121)

HSP 1 Score: 152.14 bits (383), Expect = 5.952e-36
Identity = 79/170 (46.47%), Positives = 109/170 (64.12%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP++W+G L+LK ++A V +HFVHGN  +  A +  L+        NT  L+I QRMRL+ +QLEGV ++MQ   E C+ LALP G D    ++ S+ L+ GFI Y++ K AAGI+N+  P  S+Q  YVVHIFP CDF+   L   APDL+N   V +  +LL+VI TV
Sbjct: 5373 YPVMWQGLLALKTDQAAVQMHFVHGNPNVARASLPSLV------ETNTPLLRIAQRMRLEQTQLEGVAKKMQVDKEHCMLLALPCGRDHADVLQHSRNLQTGFITYLQQKMAAGIVNIPIP-GSEQAAYVVHIFPSCDFANENLERAAPDLKN--RVAELAHLLIVIATV 5533          

HSP 2 Score: 126.331 bits (316), Expect = 4.033e-28
Identity = 90/292 (30.82%), Positives = 148/292 (50.68%), Query Frame = 2
Query:  614 PLGHSQG--PRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQ------SVELHPGYDLFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            P+ HS+   P  I+V  L  + S++S++  L  ++    K           +      G +  R+A+V F    D EKAL+++           VE + GYD+ D +     A     LDE+HPK++RTL I NL + +   + L  +F   G++++I+IK K  L+A A  Q+SDI  VVK        ++ + G  + S+     FG S PTNC+W+  +   ++E+ +   F+RFG V +V  D  +Q ALV  D    AQ ++  +  + + L   ++Q+D+AS ECQ
Sbjct:  544 PVVHSEDNRPLAIRVRNLPARSSDTSLKDGLFHEYKKHGK-----------VTWVKVVGQNSERYALVCFKKPDDVEKALEVSHDKHFFGCKIEVEPYQGYDVEDNEFRPYEAE----LDEYHPKSTRTLFIGNLEKDITAGE-LRSHFEAFGEIIEIDIK-KQGLNAYAFCQYSDIVSVVK-------AMRKMDGEHLGSNRIKLGFGKSMPTNCVWIDGVDEKVSESFLQSQFTRFGAVTKVSIDRNRQLALVLYDQVQNAQAAVKDM--RGTILRRKKLQVDFASRECQ 809          
BLAST of Protein split ends vs. Ensembl Zebrafish
Match: spen (spen family transcriptional repressor [Source:ZFIN;Acc:ZDB-GENE-050309-70])

HSP 1 Score: 150.214 bits (378), Expect = 2.864e-35
Identity = 85/171 (49.71%), Positives = 107/171 (62.57%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGS-LKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP++W+G L+LKN+ A V LHFV GN    N   H      SL     G+ L+I QRMRL+ SQLEGV RRM   NE+C+ LALP G D      Q+  L+ GFI Y++ K AAGIINV +P  S Q  YVV IFPPC+FS + L   APDL N I  +  P+L++VI +V
Sbjct: 2340 YPIIWQGHLALKNDTAAVQLHFVSGN----NVLAHR-----SLPPPEGGAFLRIAQRMRLEASQLEGVARRMTAENEYCLLLALPCGLDQEDVHNQTHALKTGFITYLQAKQAAGIINVPNP-GSNQPAYVVQIFPPCEFSESHLSHLAPDLLNSIS-SISPHLMIVIASV 2499          
BLAST of Protein split ends vs. Ensembl Zebrafish
Match: spen (spen family transcriptional repressor [Source:ZFIN;Acc:ZDB-GENE-050309-70])

HSP 1 Score: 139.428 bits (350), Expect = 6.403e-32
Identity = 84/171 (49.12%), Positives = 107/171 (62.57%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGS-LKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP++W+G L+LKN+ A V LHFV GN  L +          SL     G+ L+I QRMRL+ SQLEGV RRM   NE+C+ LALP G D      Q+  L+ GFI Y++ K AAGIINV +P  S Q  YVV IFPPC+FS + L   APDL N I  +  P+L++VI +V
Sbjct: 3317 YPIIWQGHLALKNDTAAVQLHFVSGNNVLAHR---------SLPPPEGGAFLRIAQRMRLEASQLEGVARRMTAENEYCLLLALPCGLDQEDVHNQTHALKTGFITYLQAKQAAGIINVPNP-GSNQPAYVVQIFPPCEFSESHLSHLAPDLLNSIS-SISPHLMIVIASV 3476          

HSP 2 Score: 116.701 bits (291), Expect = 4.936e-25
Identity = 88/288 (30.56%), Positives = 145/288 (50.35%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-NCPQG------------LDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GIKV  L ++ +++S++  L  +F    K        + S+ +  A   S  R+ +V F    D EKAL  ASK +         LF   ++++TA + P+             +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK        A +Q+ DI+ V K        +K + G  + ++     FG S PT C+WL  +S+SI E  +T +F R+GHV++VV D  +  AL+  +N   AQ ++     K   +G  ++++D+A+ E Q
Sbjct:  384 GIKVQNLPVRSTDTSLKDGLFHEFKKHGK--------VTSVQIHGA---SEERYGLVFFRQQEDQEKALS-ASKGK---------LFFGMQIDVTAWHGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYNDLLNIFQRFGEIVDIDIKKVNGSPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTTCVWLDGLSSSITEQYLTRHFCRYGHVVKVVFDRLKGMALILYNNIEYAQAAVKET--KGWKIGGNKIKVDFANQESQ 640          
BLAST of Protein split ends vs. Ensembl Zebrafish
Match: si:ch1073-335m2.2 (si:ch1073-335m2.2 [Source:ZFIN;Acc:ZDB-GENE-081104-82])

HSP 1 Score: 135.961 bits (341), Expect = 5.543e-31
Identity = 79/170 (46.47%), Positives = 108/170 (63.53%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP+VW+G L+LKN+ A V LHF+ GN  L    + L          + G L+IVQRMRL+  QLEGV RRM   ++FC+ LA+P G D    + Q+Q L+  FI Y++ K AAGIINV +P  S Q  +V+ IFPPC+FS + L   APDL + I  +  P+L++VIT+V
Sbjct: 2980 YPIVWQGLLALKNDTAAVQLHFLCGNKALGLRSLPLP--------ESGGILRIVQRMRLEAQQLEGVARRMTGESDFCLLLAMPCGLDQEDVLNQTQALKSAFINYLQAKLAAGIINVPNP-GSNQPAFVLQIFPPCEFSESHLSRLAPDLLSQIS-SISPHLMIVITSV 3139          

HSP 2 Score: 105.531 bits (262), Expect = 1.119e-21
Identity = 88/288 (30.56%), Positives = 145/288 (50.35%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-NCPQG------------LDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSI-TSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GI+V  L  + +++S++  L  +F    K        + S+ +    G S  R+ +V F    D EKAL ++             LF    + +TA N P+             +DEFHPKA+RTL I NL +   N++ L + F + G+++DI+IK    +   A VQ+SDI+ V K        +K + G  + T+     FG S PT C+WL  ++++I E  +T +F R+G V++VV D  +  ALV  +N   AQ ++   + K   +G  ++++D+AS E Q
Sbjct:  341 GIRVQNLPTRSTDTSLKDGLFHEFKKYGK--------VTSVQIH---GASEERYGLVFFRQQEDQEKALSVSKGK----------LFFGMLIEVTAWNGPETESENEFRPLDGRIDEFHPKATRTLFIGNL-EKTTNYQQLLDVFQRFGEIVDIDIKRVNGVPQYAFVQYSDIASVCK-------AIKKMDGEYLGTNRLKLGFGKSMPTACVWLDGLTSNITEQYLTRHFCRYGPVVKVVFDRLKGMALVLYNNTDFAQAAV--RETKGWKIGGNKIKVDFASQESQ 597          
BLAST of Protein split ends vs. Ensembl Zebrafish
Match: rbm15b (RNA binding motif protein 15B [Source:ZFIN;Acc:ZDB-GENE-080204-91])

HSP 1 Score: 71.633 bits (174), Expect = 1.433e-11
Identity = 57/177 (32.20%), Positives = 86/177 (48.59%), Query Frame = 2
Query: 7733 VWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQ--NANEFCIGLALPGGDDINQWVKQS---QILEEGFIKYMRDKGAAGIIN--VCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV*NS 8242
            VW G L LKN      +H + G    L++ M      GS        LKI QR+RLD  +L+ V RR++  + + + + LA+ G  D +    +    Q L    + Y+R+K AAG+I   +  P+  + G  +++ FPPCDFS  Q L TA     +  V K     +VI  V +S
Sbjct:  649 VWHGSLVLKNSSFPTNMHMLEGGVSFLHSLMRDNQAGGS----KITQLKIAQRLRLDQPKLDEVTRRIKLGSPDGYAVLLAVQGPMDRDAPPPEPGLQQRLLRNLVTYLRNKQAAGVIGLPLGGPKEREMG-GMLYAFPPCDFS-QQYLQTA-----LKTVGKLEEEHLVIVVVRDS 814          
BLAST of Protein split ends vs. Ensembl Zebrafish
Match: rbm15 (RNA binding motif protein 15 [Source:ZFIN;Acc:ZDB-GENE-041008-192])

HSP 1 Score: 68.1662 bits (165), Expect = 1.473e-10
Identity = 53/171 (30.99%), Positives = 83/171 (48.54%), Query Frame = 2
Query: 7730 LVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANE--FCIGLALPGGDD---INQWVKQSQILEEGFIKYMRDKGAAGIINV-CHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVI 8224
            + W G L LKN     ++H + G+   L+    LLI  GS        L+I QR+RLD  +++ V RR++ A    + + LA+PG  +    +     +Q      + Y++ K AAG+I++       +    V+H FPPCDFS   L S+A  L    E     + LV+I
Sbjct:  687 MAWNGMLLLKNSNFPASMHLLEGD---LSVATSLLI-DGSTG-GKVSQLRITQRLRLDQPKIDEVSRRIKVAGPGGYAVLLAVPGSSEETSSSDPAASTQRPLRNLVSYLKQKQAAGVISLPVGGSRDKDNTGVLHAFPPCDFSQQFLDSSAKALAKTEE-----DFLVMI 847          
BLAST of Protein split ends vs. Ensembl Xenopus
Match: fam78b (family with sequence similarity 78 member B [Source:Xenbase;Acc:XB-GENE-6041514])

HSP 1 Score: 99.3673 bits (246), Expect = 9.704e-20
Identity = 86/288 (29.86%), Positives = 142/288 (49.31%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-------------NCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSI-TSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GIKV  L ++ +++S++  L  +F    K        + S+ +    G S  R+ +V F    D EKAL  ASK +         LF   ++ +TA                + +DEFHPKA+RTL I NL +   +   L+  F + G+++DI++K    +   A +Q+SDI  V K        +K + G  +  +     FG S PTNC+W+  +S++I E  +T +F R+G V++VV D  +  ALV  +    AQ ++   K +   +G   V++D+A+ E Q
Sbjct:  334 GIKVQNLPVRSTDTSLKDGLFHEFKKYGK--------VTSVQIH---GVSEERYGLVFFRQQEDQEKALN-ASKGK---------LFFGMQIEVTAWVGPETESENEFRPLDERIDEFHPKATRTLFIGNLEKTTTHLD-LHNLFQRFGEIVDIDVKKVNGVPQYAFLQYSDIGSVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTNCVWIDGLSSNITEQYLTRHFCRYGPVVKVVFDRFKGMALVLYNEIEYAQAAVKETKGRK--IGGNDVKVDFANQESQ 590          

HSP 2 Score: 66.2402 bits (160), Expect = 1.307e-9
Identity = 57/171 (33.33%), Positives = 79/171 (46.20%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTG-SLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP+VW+G L+LKN+ A V LHFV GN  L +          SL     G  L+I QRMRL+ +QL+GV RRM                         ++    F+  + ++GA     VC P       YV+ IFPP +FS +       DL   I  N  P+ ++VI +V
Sbjct: 3312 YPIVWQGLLALKNDTAAVQLHFVSGNNVLAHR---------SLPAPEGGPPLRIAQRMRLEATQLDGVARRM-----------------------MVRVRHPPFLSPI-ERGAP---QVCGPA------YVLQIFPPWEFSESHFSRLVRDLFASIS-NISPHFMIVIASV 3439          
BLAST of Protein split ends vs. Ensembl Xenopus
Match: RBM15 (RNA binding motif protein 15 [Source:NCBI gene;Acc:100490588])

HSP 1 Score: 70.0922 bits (170), Expect = 5.003e-11
Identity = 50/144 (34.72%), Positives = 74/144 (51.39%), Query Frame = 2
Query: 7730 LVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNA--NEFCIGLALPG---GDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQG-LYVVHIFPPCDFS 8143
            L W+G L LKN      +H + G+   L     LL+   S        LKI QR+RLD  +L+ V RR++ A  N + I LA+PG   G  ++Q     + L    + Y++ K AAG+I++    N  +    V+H FPPC+FS
Sbjct:  671 LAWQGMLLLKNSNFPSNMHLLQGD---LGVASSLLVEGPS--GGKVAQLKITQRLRLDQPKLDEVTRRIRVAGPNGYAILLAIPGTSEGSSVDQASSTQRPLRN-LVSYLKQKQAAGVISLPVGSNKDKDHAGVLHAFPPCNFS 808          
BLAST of Protein split ends vs. Ensembl Xenopus
Match: smpd3 (sphingomyelin phosphodiesterase 3 [Source:Xenbase;Acc:XB-GENE-1013642])

HSP 1 Score: 64.6994 bits (156), Expect = 2.496e-9
Identity = 49/154 (31.82%), Positives = 74/154 (48.05%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQ--NANEFCIGL---------ALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIIN--VCHPENSQQGLYVVHIFPPCDF 8140
            R  P VW G L LKN      LHF+ G+ ++  A   LL    S    +   LKI QR+RLD  +LE V R+++   A  + + L         A+PG   +     Q ++L    + Y++ K AAG+I+  V      +    +++ FPPCDF
Sbjct:  614 RTLPPVWCGHLVLKNSCFPTYLHFLEGDRDVPGA---LLKDRSSTSGGSLAQLKIAQRLRLDQPKLEEVTRKVRQGTAGGYAVLLATQAPQNEGAIPGEPGL-----QRRLLRN-LVSYLKQKQAAGVISLPVGGSNKGRDPSGMLYAFPPCDF 758          
BLAST of Protein split ends vs. Ensembl Mouse
Match: Spen (spen family transcription repressor [Source:MGI Symbol;Acc:MGI:1891706])

HSP 1 Score: 140.584 bits (353), Expect = 2.868e-32
Identity = 82/173 (47.40%), Positives = 107/173 (61.85%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGS-LKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            + YP+VW+G L+LKN+ A V LHFV GN  L +          SL  +  G  L+I QRMRL+ SQLEGV RRM    ++C+ LALP G D    V Q++ L+  FI Y++ K AAGIINV +P  S Q  YV+ IFPPC+FS + L   APDL   I  N  P+L++VI +V
Sbjct: 3482 KKYPIVWQGLLALKNDTAAVQLHFVSGNNVLAHR---------SLPLSEGGPPLRIAQRMRLEASQLEGVARRMTVETDYCLLLALPCGRDQEDVVSQTESLKAAFITYLQAKQAAGIINVPNP-GSNQPAYVLQIFPPCEFSESHLSRLAPDLLASIS-NISPHLMIVIASV 3643          

HSP 2 Score: 100.138 bits (248), Expect = 4.954e-20
Identity = 84/288 (29.17%), Positives = 142/288 (49.31%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-------------NCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSI-TSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GIKV  L ++ +++S++  L  +F    K        + S+ +    G S  R+ +V F    D EKAL  ASK +         LF   ++ +TA                + +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK    +   A +Q+ DI+ V K        +K + G  +  +     FG S PTNC+WL  +S+++++  +T +F R+G V++VV D  +  ALV       AQ ++   K +   +G  ++++D+A+ E Q
Sbjct:  338 GIKVQNLPVRSTDTSLKDGLFHEFKKFGK--------VTSVQIH---GASEERYGLVFFRQQEDQEKALT-ASKGK---------LFFGMQIEVTAWVGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYHDLRNIFQRFGEIVDIDIKKVNGVPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTNCVWLDGLSSNVSDQYLTRHFCRYGPVVKVVFDRLKGMALVLYSEIEDAQAAVKETKGRK--IGGNKIKVDFANRESQ 594          
BLAST of Protein split ends vs. Ensembl Mouse
Match: Spen (spen family transcription repressor [Source:MGI Symbol;Acc:MGI:1891706])

HSP 1 Score: 140.584 bits (353), Expect = 2.962e-32
Identity = 82/173 (47.40%), Positives = 107/173 (61.85%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGS-LKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            + YP+VW+G L+LKN+ A V LHFV GN  L +          SL  +  G  L+I QRMRL+ SQLEGV RRM    ++C+ LALP G D    V Q++ L+  FI Y++ K AAGIINV +P  S Q  YV+ IFPPC+FS + L   APDL   I  N  P+L++VI +V
Sbjct: 3459 KKYPIVWQGLLALKNDTAAVQLHFVSGNNVLAHR---------SLPLSEGGPPLRIAQRMRLEASQLEGVARRMTVETDYCLLLALPCGRDQEDVVSQTESLKAAFITYLQAKQAAGIINVPNP-GSNQPAYVLQIFPPCEFSESHLSRLAPDLLASIS-NISPHLMIVIASV 3620          

HSP 2 Score: 100.138 bits (248), Expect = 4.950e-20
Identity = 84/288 (29.17%), Positives = 142/288 (49.31%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-------------NCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSI-TSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GIKV  L ++ +++S++  L  +F    K        + S+ +    G S  R+ +V F    D EKAL  ASK +         LF   ++ +TA                + +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK    +   A +Q+ DI+ V K        +K + G  +  +     FG S PTNC+WL  +S+++++  +T +F R+G V++VV D  +  ALV       AQ ++   K +   +G  ++++D+A+ E Q
Sbjct:  338 GIKVQNLPVRSTDTSLKDGLFHEFKKFGK--------VTSVQIH---GASEERYGLVFFRQQEDQEKALT-ASKGK---------LFFGMQIEVTAWVGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYHDLRNIFQRFGEIVDIDIKKVNGVPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTNCVWLDGLSSNVSDQYLTRHFCRYGPVVKVVFDRLKGMALVLYSEIEDAQAAVKETKGRK--IGGNKIKVDFANRESQ 594          
BLAST of Protein split ends vs. Ensembl Mouse
Match: Rbm15 (RNA binding motif protein 15 [Source:MGI Symbol;Acc:MGI:2443205])

HSP 1 Score: 64.3142 bits (155), Expect = 2.470e-9
Identity = 55/176 (31.25%), Positives = 85/176 (48.30%), Query Frame = 2
Query: 7730 LVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNA--NEFCIGLALPG--------GDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENS-QQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVI 8224
            L W+G L LKN      +H + G+ ++ ++    L+  GS        LKI QR+RLD  +L+ V RR++ A  N + I LA+PG            +     +Q      + Y++ K AAG+I++    N  ++   V+H FPPC+FS   L S A  L    E     + LV+I
Sbjct:  787 LAWQGMLLLKNSNFPSNMHLLQGDLQVASS----LLVEGSTG-GKVAQLKITQRLRLDQPKLDEVTRRIKVAGPNGYAILLAVPGSSDSRSSSSSATSDTAASTQRPLRNLVSYLKQKQAAGVISLPVGGNKDKENTGVLHAFPPCEFSQQFLDSPAKALAKSEE-----DYLVMI 952          
BLAST of Protein split ends vs. Ensembl Mouse
Match: Rbm15b (RNA binding motif protein 15B [Source:MGI Symbol;Acc:MGI:1923598])

HSP 1 Score: 59.6918 bits (143), Expect = 6.207e-8
Identity = 55/182 (30.22%), Positives = 90/182 (49.45%), Query Frame = 2
Query: 7730 LVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGS----LKIVQRMRLDLSQLEGVQRRMQNA--NEFCIGLAL------PG--GDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQG---LYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVI 8224
            L W G L LKN     ++H + G+           + SG L+ + +GS    LKI QR+RLD  +L+ V RR++    N + + LA+      PG  G  + +   Q ++L    + Y++ K AAG+I++  P    +G     +++ FPPCDFS   L S    L  + E     ++++VI
Sbjct:  717 LGWNGLLVLKNSCFPTSMHILEGDQG---------VISGLLKDHPSGSKLTQLKIAQRLRLDQPKLDEVTRRIKQGSPNGYAVLLAIQSTPSGPGAEGMPVVEPGLQRRLLRN-LVSYLKQKQAAGVISL--PVGGSKGRDNTGMLYAFPPCDFSQQYLQSALRTLGKLEEE----HMVIVI 882          
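
The HSP header lines in these listings follow a fixed layout (bit score, raw score, E-value, then identity and positive counts over the alignment length), so they can be read programmatically. The following is a minimal sketch, not part of the database's own tooling: the function names `parse_hsp` and `translated_query_length` are hypothetical, and it assumes that query coordinates are nucleotide positions on the 8488 bp transcript and that "Query Frame" denotes a translated search, so a query span covers one amino acid per three nucleotides.

```python
import re

# Patterns for the two HSP header lines as they appear in this listing.
# "Pos\w+" tolerates spelling variants of the "Positives" label.
HSP_SCORE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s+([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s+=\s+(\S+)"
)
HSP_IDENT = re.compile(
    r"Identity\s+=\s+(\d+)/(\d+)\s+\([\d.]+%\),\s+"
    r"Pos\w+\s+=\s+(\d+)/(\d+)\s+\([\d.]+%\),\s+"
    r"Query Frame\s+=\s+(-?\d+)"
)


def parse_hsp(score_line, ident_line):
    """Parse one HSP header (two lines) into a dictionary."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(ident_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    identities, align_len = int(i.group(1)), int(i.group(2))
    positives = int(i.group(3))
    return {
        "hsp": int(s.group(1)),
        "bit_score": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "expect": float(s.group(4)),
        "identities": identities,
        "positives": positives,
        "align_len": align_len,
        # Percentages are recomputed from the counts rather than copied.
        "pct_identity": 100.0 * identities / align_len,
        "pct_positives": 100.0 * positives / align_len,
        "query_frame": int(i.group(5)),
    }


def translated_query_length(query_start, query_end):
    """Number of codons covered by a nucleotide query span (assumed inclusive)."""
    return (abs(query_end - query_start) + 1) // 3


hsp = parse_hsp(
    "HSP 1 Score: 140.584 bits (353), Expect = 2.868e-32",
    "Identity = 82/173 (47.40%), Positives = 107/173 (61.85%), Query Frame = 2",
)
print(round(hsp["pct_identity"], 2), translated_query_length(7718, 8233))
```

Under these assumptions, the query span 7718 to 8233 of the first mouse Spen HSP covers 172 codons, one fewer than the reported alignment length of 173 because the query row of that alignment contains a single gap column.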
BLAST of Protein split ends vs. UniProt/SwissProt
Match: sp|Q8SX83|SPEN_DROME (Protein split ends OS=Drosophila melanogaster OX=7227 GN=spen PE=1 SV=2)

HSP 1 Score: 152.14 bits (383), Expect = 6.021e-35
Identity = 79/170 (46.47%), Positives = 109/170 (64.12%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP++W+G L+LK ++A V +HFVHGN  +  A +  L+        NT  L+I QRMRL+ +QLEGV ++MQ   E C+ LALP G D    ++ S+ L+ GFI Y++ K AAGI+N+  P  S+Q  YVVHIFP CDF+   L   APDL+N   V +  +LL+VI TV
Sbjct: 5400 YPVMWQGLLALKTDQAAVQMHFVHGNPNVARASLPSLV------ETNTPLLRIAQRMRLEQTQLEGVAKKMQVDKEHCMLLALPCGRDHADVLQHSRNLQTGFITYLQQKMAAGIVNIPIP-GSEQAAYVVHIFPSCDFANENLERAAPDLKN--RVAELAHLLIVIATV 5560          

HSP 2 Score: 126.331 bits (316), Expect = 4.398e-27
Identity = 90/292 (30.82%), Positives = 148/292 (50.68%), Query Frame = 2
Query:  614 PLGHSQG--PRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQ------SVELHPGYDLFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            P+ HS+   P  I+V  L  + S++S++  L  ++    K           +      G +  R+A+V F    D EKAL+++           VE + GYD+ D +     A     LDE+HPK++RTL I NL + +   + L  +F   G++++I+IK K  L+A A  Q+SDI  VVK        ++ + G  + S+     FG S PTNC+W+  +   ++E+ +   F+RFG V +V  D  +Q ALV  D    AQ ++  +  + + L   ++Q+D+AS ECQ
Sbjct:  544 PVVHSEDNRPLAIRVRNLPARSSDTSLKDGLFHEYKKHGK-----------VTWVKVVGQNSERYALVCFKKPDDVEKALEVSHDKHFFGCKIEVEPYQGYDVEDNEFRPYEAE----LDEYHPKSTRTLFIGNLEKDITAGE-LRSHFEAFGEIIEIDIK-KQGLNAYAFCQYSDIVSVVK-------AMRKMDGEHLGSNRIKLGFGKSMPTNCVWIDGVDEKVSESFLQSQFTRFGAVTKVSIDRNRQLALVLYDQVQNAQAAVKDM--RGTILRRKKLQVDFASRECQ 809          
BLAST of Protein split ends vs. UniProt/SwissProt
Match: sp|Q62504|MINT_MOUSE (Msx2-interacting protein OS=Mus musculus OX=10090 GN=Spen PE=1 SV=2)

HSP 1 Score: 140.584 bits (353), Expect = 2.023e-31
Identity = 82/173 (47.40%), Positives = 107/173 (61.85%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGS-LKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            + YP+VW+G L+LKN+ A V LHFV GN  L +          SL  +  G  L+I QRMRL+ SQLEGV RRM    ++C+ LALP G D    V Q++ L+  FI Y++ K AAGIINV +P  S Q  YV+ IFPPC+FS + L   APDL   I  N  P+L++VI +V
Sbjct: 3483 KKYPIVWQGLLALKNDTAAVQLHFVSGNNVLAHR---------SLPLSEGGPPLRIAQRMRLEASQLEGVARRMTVETDYCLLLALPCGRDQEDVVSQTESLKAAFITYLQAKQAAGIINVPNP-GSNQPAYVLQIFPPCEFSESHLSRLAPDLLASIS-NISPHLMIVIASV 3644          

HSP 2 Score: 98.5969 bits (244), Expect = 8.866e-19
Identity = 84/288 (29.17%), Positives = 141/288 (48.96%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-------------NCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSI-TSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GIKV  L ++  ++S++  L  +F    K        + S+ +    G S  R+ +V F    D EKAL  ASK +         LF   ++ +TA                + +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK    +   A +Q+ DI+ V K        +K + G  +  +     FG S PTNC+WL  +S+++++  +T +F R+G V++VV D  +  ALV       AQ ++   K +   +G  ++++D+A+ E Q
Sbjct:  337 GIKVQNLPVRSIDTSLKDGLFHEFKKFGK--------VTSVQIH---GASEERYGLVFFRQQEDQEKALT-ASKGK---------LFFGMQIEVTAWVGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYHDLRNIFQRFGEIVDIDIKKVNGVPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTNCVWLDGLSSNVSDQYLTRHFCRYGPVVKVVFDRLKGMALVLYSEIEDAQAAVKETKGRK--IGGNKIKVDFANRESQ 593          
BLAST of Protein split ends vs. UniProt/SwissProt
Match: sp|Q96T58|MINT_HUMAN (Msx2-interacting protein OS=Homo sapiens OX=9606 GN=SPEN PE=1 SV=1)

HSP 1 Score: 137.887 bits (346), Expect = 1.141e-30
Identity = 81/173 (46.82%), Positives = 107/173 (61.85%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGS-LKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            + YP+VW+G L+LKN+ A V LHFV GN  L +          SL  +  G  L+I QRMRL+ +QLEGV RRM    ++C+ LALP G D    V Q++ L+  FI Y++ K AAGIINV +P  S Q  YV+ IFPPC+FS + L   APDL   I  N  P+L++VI +V
Sbjct: 3503 KKYPIVWQGLLALKNDTAAVQLHFVSGNNVLAHR---------SLPLSEGGPPLRIAQRMRLEATQLEGVARRMTVETDYCLLLALPCGRDQEDVVSQTESLKAAFITYLQAKQAAGIINVPNP-GSNQPAYVLQIFPPCEFSESHLSRLAPDLLASIS-NISPHLMIVIASV 3664          

HSP 2 Score: 99.3673 bits (246), Expect = 6.397e-19
Identity = 84/288 (29.17%), Positives = 143/288 (49.65%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-------------NCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSI-TSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GIKV  L ++ +++S++  L  +F    K        + S+ +    G S  R+ +V F    D EKAL  ASK +         LF   ++ +TA                + +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK    +   A +Q+ DI+ V K        +K + G  +  +     FG S PTNC+WL  +S+++++  +T +F R+G V++VV D  +  ALV  +    AQ ++   K +   +G  ++++D+A+ E Q
Sbjct:  336 GIKVQNLPVRSTDTSLKDGLFHEFKKFGK--------VTSVQIH---GTSEERYGLVFFRQQEDQEKALT-ASKGK---------LFFGMQIEVTAWIGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYHDLRNIFQRFGEIVDIDIKKVNGVPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTNCVWLDGLSSNVSDQYLTRHFCRYGPVVKVVFDRLKGMALVLYNEIEYAQAAVKETKGRK--IGGNKIKVDFANRESQ 592          
BLAST of Protein split ends vs. UniProt/SwissProt
Match: sp|G5EGK6|DIN1_CAEEL (Daf-12-interacting protein 1 OS=Caenorhabditis elegans OX=6239 GN=din-1 PE=1 SV=1)

HSP 1 Score: 71.2478 bits (173), Expect = 1.759e-10
Identity = 53/171 (30.99%), Positives = 92/171 (53.80%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLST-APDLQNIIEVNKFPNLLVVIT 8227
            + +P+VW G+L+LK+ EA + LH ++G+   LN  +   +   + + +   S+KI+QR+RLD  Q+E + R + N  E+   LAL   ++I    +    L+  FI Y+ +K  AGI ++   E   +    VH+F P +  +N+ LS  A  L + ++      LL+V T
Sbjct: 2218 KHFPMVWTGRLALKSTEAMINLHLINGSETFLNDVLGRQVTEENPRRD---SVKILQRLRLDNGQVEHIYRILTNP-EYACCLALSSVNNIENLKENDTNLKSHFIDYLINKKIAGISSLGEVETKFKSAR-VHVFAPGEI-VNRYLSELATSLHDYLQNTDTRYLLIVFT 2382          
BLAST of Protein split ends vs. UniProt/SwissProt
Match: sp|Q7KMJ6|NITO_DROME (RNA-binding protein spenito OS=Drosophila melanogaster OX=7227 GN=nito PE=1 SV=1)

HSP 1 Score: 68.5514 bits (166), Expect = 8.802e-10
Identity = 45/150 (30.00%), Positives = 74/150 (49.33%), Query Frame = 2
Query: 7733 VWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGG------DDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLST 8164
            VW G L LK+       H   G+T+++ + M               +L+I QR+RLD  +L+ VQ+R+ +++   I + L G       DD +   +  + L    + Y++ K AAG+I++ + E    G  V++ FPPCDFS   L  T
Sbjct:  634 VWTGALILKSSLFPAKFHLTDGDTDIVESLMR--------DEEGKHNLRITQRLRLDPPKLDDVQKRIASSSSHAIFMGLAGSTNDTNCDDASVQTRPLRNL----VSYLKQKEAAGVISLLNKETEATG--VLYAFPPCDFSTELLKRT 769          
BLAST of Protein split ends vs. TrEMBL
Match: A0A430QDF6 (Uncharacterized protein OS=Schistosoma bovis OX=6184 GN=DC041_0003284 PE=4 SV=1)

HSP 1 Score: 200.675 bits (509), Expect = 3.511e-47
Identity = 102/176 (57.95%), Positives = 124/176 (70.45%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQW-----NNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDL-QNIIEVNKFPNLLVVITTV 8233
            YPLVW+G+LSLKN E +VALHFV GN  LLNACM LL   G  Q       + G L+IVQRMRL+ +QLEGVQR++      C  LALP G    +  +Q+QIL E FI+YM++K AAGIINV HP+  QQGLYVVHIFPPCDFS  QL   APDL + +++ N+  +LLVVITTV
Sbjct: 2902 YPLVWQGRLSLKNMETRVALHFVQGNHNLLNACMTLLASGGGGQPQFSLITSGGPLRIVQRMRLEPAQLEGVQRKLNQEGASCACLALPAGSSPVELAQQTQILNENFIRYMQEKMAAGIINVGHPD-YQQGLYVVHIFPPCDFSHAQLGLAAPDLHRRVVQANQS-HLLVVITTV 3075          
BLAST of Protein split ends vs. TrEMBL
Match: A0A3Q0KR58 (Platelet binding protein-related OS=Schistosoma mansoni OX=6183 PE=4 SV=1)

HSP 1 Score: 199.904 bits (507), Expect = 5.550e-47
Identity = 102/176 (57.95%), Positives = 124/176 (70.45%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQ-----WNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDL-QNIIEVNKFPNLLVVITTV 8233
            YPLVW+G+LSLKN E +VALHFV GN  LLNACM LL   G  Q       + G L+IVQRMRL+ +QLEGVQR++      C  LALP G    +  +Q+QIL E FI+YM++K AAGIINV HP+  QQGLYVVHIFPPCDFS  QL   APDL + +++ N+  +LLVVITTV
Sbjct: 3282 YPLVWQGRLSLKNMETRVALHFVQGNHNLLNACMTLLASGGGGQPQFSLITSGGPLRIVQRMRLEPAQLEGVQRKLNQEGASCACLALPAGSSPVELAQQTQILNENFIRYMQEKMAAGIINVGHPD-YQQGLYVVHIFPPCDFSHAQLGLAAPDLHRRVVQANQS-HLLVVITTV 3455          
BLAST of Protein split ends vs. TrEMBL
Match: A0A183QRL0 (Uncharacterized protein OS=Schistosoma rodhaini OX=6188 PE=4 SV=1)

HSP 1 Score: 199.904 bits (507), Expect = 5.607e-47
Identity = 102/176 (57.95%), Positives = 124/176 (70.45%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQW-----NNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDL-QNIIEVNKFPNLLVVITTV 8233
            YPLVW+G+LSLKN E +VALHFV GN  LLNACM LL   G  Q       + G L+IVQRMRL+ +QLEGVQR++      C  LALP G    +  +Q+QIL E FI+YM++K AAGIINV HP+  QQGLYVVHIFPPCDFS  QL   APDL + +++ N+  +LLVVITTV
Sbjct: 3147 YPLVWQGRLSLKNMETRVALHFVQGNHNLLNACMTLLASGGGGQPQFSLITSGGPLRIVQRMRLEPAQLEGVQRKLNQEGASCACLALPAGSSPVELAQQTQILNENFIRYMQEKMAAGIINVGHPD-YQQGLYVVHIFPPCDFSHAQLGLAAPDLHRRVVQANQS-HLLVVITTV 3320          
BLAST of Protein split ends vs. TrEMBL
Match: G4VE99 (Platelet binding protein-related OS=Schistosoma mansoni OX=6183 GN=Smp_159350 PE=4 SV=1)

HSP 1 Score: 199.904 bits (507), Expect = 6.628e-47
Identity = 102/176 (57.95%), Positives = 124/176 (70.45%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQ-----WNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDL-QNIIEVNKFPNLLVVITTV 8233
            YPLVW+G+LSLKN E +VALHFV GN  LLNACM LL   G  Q       + G L+IVQRMRL+ +QLEGVQR++      C  LALP G    +  +Q+QIL E FI+YM++K AAGIINV HP+  QQGLYVVHIFPPCDFS  QL   APDL + +++ N+  +LLVVITTV
Sbjct: 3071 YPLVWQGRLSLKNMETRVALHFVQGNHNLLNACMTLLASGGGGQPQFSLITSGGPLRIVQRMRLEPAQLEGVQRKLNQEGASCACLALPAGSSPVELAQQTQILNENFIRYMQEKMAAGIINVGHPD-YQQGLYVVHIFPPCDFSHAQLGLAAPDLHRRVVQANQS-HLLVVITTV 3244          
BLAST of Protein split ends vs. TrEMBL
Match: A0A5K4ETF3 (Platelet binding protein-related OS=Schistosoma mansoni OX=6183 PE=4 SV=1)

HSP 1 Score: 199.904 bits (507), Expect = 6.820e-47
Identity = 102/176 (57.95%), Positives = 124/176 (70.45%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQ-----WNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDL-QNIIEVNKFPNLLVVITTV 8233
            YPLVW+G+LSLKN E +VALHFV GN  LLNACM LL   G  Q       + G L+IVQRMRL+ +QLEGVQR++      C  LALP G    +  +Q+QIL E FI+YM++K AAGIINV HP+  QQGLYVVHIFPPCDFS  QL   APDL + +++ N+  +LLVVITTV
Sbjct: 3325 YPLVWQGRLSLKNMETRVALHFVQGNHNLLNACMTLLASGGGGQPQFSLITSGGPLRIVQRMRLEPAQLEGVQRKLNQEGASCACLALPAGSSPVELAQQTQILNENFIRYMQEKMAAGIINVGHPD-YQQGLYVVHIFPPCDFSHAQLGLAAPDLHRRVVQANQS-HLLVVITTV 3498          
BLAST of Protein split ends vs. Ensembl Cavefish
Match: spen (spen family transcriptional repressor [Source:NCBI gene;Acc:103025061])

HSP 1 Score: 137.502 bits (345), Expect = 1.652e-31
Identity = 80/172 (46.51%), Positives = 105/172 (61.05%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            + YP+VW+G L+LKN+ A V LHFV GN        ++L             L+I QRMRL+ SQLEGV RRM   +++C+ LALP G D      Q+  L+ GFI Y++ K AAGIINV +P  S Q  YVV IFPPC+FS + L   APDL N I  +  P+L++VI +V
Sbjct: 3402 KKYPIVWQGHLALKNDSAAVQLHFVSGN--------NVLAHRSLPPPEGGPPLRIAQRMRLEASQLEGVARRMTVESDYCLLLALPCGLDQEDVFNQTHALKTGFITYLQAKQAAGIINVPNP-GSNQPAYVVQIFPPCEFSESHLSHLAPDLLNSIS-SISPHLMIVIASV 3563          

HSP 2 Score: 106.301 bits (264), Expect = 5.203e-22
Identity = 77/241 (31.95%), Positives = 123/241 (51.04%), Query Frame = 2
Query:  782 GPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-NCPQG------------LDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            G S  R+ +V F    D EKAL  ASK +         LF   ++++TA   P+             +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK        A +Q+ DI+ V K        +K + G  + ++     FG S PT C+WL  ++++I E  +T +F R+GHV++VV D  +  ALV  +N   AQ ++     K   +G  ++++D+A+ E Q
Sbjct:  468 GASEERYGLVFFRQQEDQEKALS-ASKGK---------LFFGMQIDVTAWTGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYHDLLNIFQRFGEIVDIDIKKLNGAPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTTCVWLDGLASNITEQYLTRHFCRYGHVVKVVFDRLKGMALVLYNNIECAQAAVKET--KGWKIGGNKIKVDFANHESQ 688          
BLAST of Protein split ends vs. Ensembl Cavefish
Match: spen (spen family transcriptional repressor [Source:NCBI gene;Acc:103025061])

HSP 1 Score: 137.502 bits (345), Expect = 1.981e-31
Identity = 80/172 (46.51%), Positives = 105/172 (61.05%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            + YP+VW+G L+LKN+ A V LHFV GN        ++L             L+I QRMRL+ SQLEGV RRM   +++C+ LALP G D      Q+  L+ GFI Y++ K AAGIINV +P  S Q  YVV IFPPC+FS + L   APDL N I  +  P+L++VI +V
Sbjct: 3355 KKYPIVWQGHLALKNDSAAVQLHFVSGN--------NVLAHRSLPPPEGGPPLRIAQRMRLEASQLEGVARRMTVESDYCLLLALPCGLDQEDVFNQTHALKTGFITYLQAKQAAGIINVPNP-GSNQPAYVVQIFPPCEFSESHLSHLAPDLLNSIS-SISPHLMIVIASV 3516          

HSP 2 Score: 115.161 bits (287), Expect = 1.028e-24
Identity = 92/302 (30.46%), Positives = 150/302 (49.67%), Query Frame = 2
Query:  608 LSPLGHSQGPR---GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-NCPQG------------LDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            L PL   + PR   GIKV  L ++ +++S++  L  +F    K        + S+ +  A   S  R+ +V F    D EKAL  ASK +         LF   ++++TA   P+             +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK        A +Q+ DI+ V K        +K + G  + ++     FG S PT C+WL  ++++I E  +T +F R+GHV++VV D  +  ALV  +N   AQ ++     K   +G  ++++D+A+ E Q
Sbjct:  372 LPPLDKDE-PRKSFGIKVQNLPVRSTDTSLKDGLFHEFKKHGK--------VTSVQIHGA---SEERYGLVFFRQQEDQEKALS-ASKGK---------LFFGMQIDVTAWTGPETESENEFRPLDERIDEFHPKATRTLFIGNL-EKTTTYHDLLNIFQRFGEIVDIDIKKLNGAPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTTCVWLDGLASNITEQYLTRHFCRYGHVVKVVFDRLKGMALVLYNNIECAQAAVKET--KGWKIGGNKIKVDFANHESQ 641          
BLAST of Protein split ends vs. Ensembl Cavefish
Match: rbm15 (RNA binding motif protein 15 [Source:NCBI gene;Acc:103043033])

HSP 1 Score: 61.6178 bits (148), Expect = 1.214e-8
Identity = 52/176 (29.55%), Positives = 83/176 (47.16%), Query Frame = 2
Query: 7730 LVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANE--FCIGLALPGGDDINQWVKQS--------QILEEGFIKYMRDKGAAGIINV-CHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVI 8224
            + W G L LKN     ++H + G+  + ++    L+  GS     T  L+I QR+RLD  +++ V RR++ A    + + LA+PG  +       S        Q      + Y++ K AAG+I++       +    V+H FPPCDFS   L S+A  L    E     + LV+I
Sbjct:  691 MAWNGMLLLKNSNFPASMHILDGDLSVASS----LLVDGSTGGKVT-QLRITQRLRLDQPKMDEVSRRIKVAGPGGYAVLLAVPGSSEDTSSSSSSSADPAASTQRPLRNLVSYLKQKQAAGVISLPVGGSRDKDNTGVLHAFPPCDFSQQFLDSSAKALAKTEE-----DYLVMI 856          
BLAST of Protein split ends vs. Ensembl Nematostella
Match: EDO40890 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7S5Y8])

HSP 1 Score: 132.494 bits (332), Expect = 5.293e-35
Identity = 73/170 (42.94%), Positives = 104/170 (61.18%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP+VW+G ++LKN+ A V +H++ GN+ L  A +     + ++       L+I QRM+L+ SQLEGV RRMQ   + C+ LALP G D      Q++ L+ GFI Y++ K AAGIINV  P  S Q  YV+HIFPPCDF+   L   +PDL +        +L+VV+ +V
Sbjct:    8 YPVVWQGLVALKNDTAAVQMHYLSGNSRLAEASLPQAPAAQAMA-TVPPPLRIAQRMKLEQSQLEGVVRRMQCEADHCLLLALPCGRDPLDVHAQTRALKNGFINYLQQKQAAGIINVARP-GSVQPSYVLHIFPPCDFAQEHLARVSPDLLD--SAADSGHLMVVVASV 173          
BLAST of Protein split ends vs. Ensembl Nematostella
Match: EDO40891 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7S5Y9])

HSP 1 Score: 84.7297 bits (208), Expect = 2.083e-17
Identity = 60/225 (26.67%), Positives = 108/225 (48.00%), Query Frame = 2
Query:  677 ESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALV 1351
            + SIR  L  +F         ++  +  I +    G   +R AI+ F    +AE+AL++      +        +  +   +T       DEF P  +RTL + N+ +    +  L E F + G+V+D++IK +   +  A VQF+++S  ++   + R + +  VG +        FG   P N IW+G ++NS++E Q+  +F R+G V +VV +    +ALV
Sbjct:   70 DGSIREGLYHEF--------KKHGEVTRIYIHGTGG---SRSAIIYFRRPEEAERALEVCQGKMFLGAEMQIQHWHGNSYALTYISNYSEDEFDPGCTRTLFVGNIEK-TTTYGDLKEAFERYGEVIDVDIKKQPGNNPYAFVQFAELSSAIQ---ARRKMDREYVGRN---RVKVGFGKVNPINTIWVGGVTNSLSEQQVERHFGRYGRVTKVVINRVTNQALV 276          
BLAST of Protein split ends vs. Ensembl Nematostella
Match: EDO49695 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7RG09])

HSP 1 Score: 85.5001 bits (210), Expect = 1.562e-16
Identity = 63/182 (34.62%), Positives = 96/182 (52.75%), Query Frame = 2
Query: 7718 RMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRM--QNANEFCIGLALPGGDDINQWVKQSQILEE-------GFIKYMRDKGAAGIINV-CHPENS---QQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVI 8224
            + + + W+G L LKN    V +H V GN E+ ++   L+   GS +     SL+I QR+RLD  +LE V RR+    A+  C+ LALPG       ++  +  EE         + Y++ K AAG++N+   P NS   +  + V+H FPPC FS   LL  AP L    E +K  +L++V+
Sbjct:  427 KRFAVAWKGSLILKNSAFPVRMHLVGGNPEIADS---LIRNDGSTK----SSLRITQRLRLDQPKLEEVSRRINTAGASGHCLLLALPGTQ-----IQLPESTEELQHRPLRNLMTYLKQKQAAGVVNLPVSPSNSTNAKDDVGVLHAFPPCQFSHEHLLRIAPHLS--AEPSKEDHLVIVV 594          

HSP 2 Score: 54.6842 bits (130), Expect = 5.458e-7
Identity = 46/173 (26.59%), Positives = 77/173 (44.51%), Query Frame = 2
Query:  950 EFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHAS--ALVQFSDISKVVKILVSPRSVLKCLVGSSITSSTSFQ--FGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLE 1456
            E  PKA+RTL + NL   ++  + L   F K G VLD++IK       +  A V+F+D+    K         KC +          +  +G S  T  +W+G +   I+  ++   F RFG +  +        A +  D+  AA  S+   + +   +G  R++ID+A  E
Sbjct:  127 EEDPKATRTLFVGNLETGIS-CQDLRLSFEKFGVVLDVDIKRPARGQGNTYAFVKFADLDVAAKA--------KCAMQGQCIGRNHIKIGYGRSQQTTRLWVGGLGPWISIPELEREFDRFGAIRRIDYRKGGDHAYILYDSLDAA--SVAAREMRGFQMGDRRLRIDFADKE 288          
BLAST of Protein split ends vs. Ensembl Medaka
Match: spen (msx2-interacting protein-like [Source:NCBI gene;Acc:101159000])

HSP 1 Score: 155.992 bits (393), Expect = 4.459e-37
Identity = 87/170 (51.18%), Positives = 112/170 (65.88%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP+VW+G L+LKN++A V LHFV GNT L    +  L   GSL       L+IVQRMRL++SQL+ V RRM   N+FC+ LALP G D    + Q+Q L+ GFI Y++ K AAGIINV +P  S Q  YVV IFPPC+FS + L   APDL N I  +  P+L++VI +V
Sbjct: 2878 YPIVWQGLLALKNDQAAVQLHFVSGNTLLAQRSLPPL-EGGSL-------LRIVQRMRLEVSQLDSVTRRMTVENDFCLLLALPCGRDQEDVLSQTQALKSGFITYLQAKQAAGIINVPNP-GSNQPAYVVQIFPPCEFSESHLSRLAPDLLNSIS-SISPHLMIVIASV 3037          

HSP 2 Score: 96.2857 bits (238), Expect = 4.902e-19
Identity = 57/174 (32.76%), Positives = 95/174 (54.60%), Query Frame = 2
Query:  944 LDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK        A +Q+ DI+ V K        +K + G  + ++     FG S PT C+WL  ++++  E  +T +F R+GHV++VV D  +  AL+  +N   AQ ++     K   L   ++++D+A+ E Q
Sbjct:   26 IDEFHPKATRTLFIGNL-EKTTTYHDLLNIFQRFGEIVDIDIKKVNGAPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTTCVWLDGLASNTTEQFLTRHFCRYGHVVKVVLDRMKGMALILYNNIEYAQAAVKDT--KGWKLSGSKIKVDFANQESQ 189          
BLAST of Protein split ends vs. Ensembl Medaka
Match: si:ch1073-335m2.2 (spen family transcriptional repressor [Source:NCBI gene;Acc:101160467])

HSP 1 Score: 142.51 bits (358), Expect = 5.708e-33
Identity = 83/171 (48.54%), Positives = 111/171 (64.91%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSL-KIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP+VW+G L+LKN+ A V LHFV GN  L +          SL     G+L +IVQRMRL+ SQLE V RRM   +++C+ LALP G D +  +KQ+Q L+  FI Y++ K AAGIIN+ +P  S Q  YV+ IFPPC+FS + L   APDL N I  +  P+L++VIT+V
Sbjct: 3014 YPIVWQGLLALKNDTAAVQLHFVCGNKALGHR---------SLPMQEGGALLRIVQRMRLEASQLESVARRMTGDSDYCLLLALPCGRDQDDVLKQTQALKNAFINYLQKKLAAGIINIPNP-GSNQPAYVLQIFPPCEFSESHLSQLAPDLLNRIS-SISPHLMIVITSV 3173          

HSP 2 Score: 120.168 bits (300), Expect = 3.496e-26
Identity = 90/288 (31.25%), Positives = 147/288 (51.04%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITA-NCPQG------------LDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            GIKV  L ++ +++S++  L  +F    K        + S+ +  A   S  R+ +V F    D EKAL + SK +         LF    + +TA N P+             +DEFHPKA+RTL I NL +   +++ L + F + G+++DI+IK    +   A VQ+SDI+ V K        +K + G  + S+     FG S PT C+WL  ++ +I E  +T +F R+GHV++VV D  +  AL+  +N   AQ ++     K   +G  ++++D+AS E Q
Sbjct:  331 GIKVQNLPVRSTDTSLKDGLFHEFKKYGK--------VTSVQIHGA---SEDRYGLVFFRQQEDQEKALSV-SKGK---------LFFGMLIEVTAWNGPETESENEFRPLDGRIDEFHPKATRTLFIGNL-EKTTSYQQLLDIFQRFGEIVDIDIKKVNGVPQYAFVQYSDIASVCK-------AIKKMDGEYLGSNRLKLGFGKSMPTTCVWLDGLAPNITEQYLTRHFCRYGHVVKVVFDRLKGMALILYNNTDFAQAAVRET--KGWKIGGNKIKVDFASQESQ 587          
BLAST of Protein split ends vs. Ensembl Medaka
Match: spen (msx2-interacting protein-like [Source:NCBI gene;Acc:101159000])

HSP 1 Score: 119.398 bits (298), Expect = 5.083e-26
Identity = 64/125 (51.20%), Positives = 84/125 (67.20%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQ 8098
            YP+VW+G L+LKN++A V LHFV GNT L    +  L   GSL       L+IVQRMRL++SQL+ V RRM   N+FC+ LALP G D    + Q+Q L+ GFI Y++ K AAGIINV +P ++Q
Sbjct: 2878 YPIVWQGLLALKNDQAAVQLHFVSGNTLLAQRSLPPL-EGGSL-------LRIVQRMRLEVSQLDSVTRRMTVENDFCLLLALPCGRDQEDVLSQTQALKSGFITYLQAKQAAGIINVPNPGSNQ 2994          

HSP 2 Score: 96.6709 bits (239), Expect = 4.067e-19
Identity = 57/174 (32.76%), Positives = 95/174 (54.60%), Query Frame = 2
Query:  944 LDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSST-SFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQ 1462
            +DEFHPKA+RTL I NL +    +  L   F + G+++DI+IK        A +Q+ DI+ V K        +K + G  + ++     FG S PT C+WL  ++++  E  +T +F R+GHV++VV D  +  AL+  +N   AQ ++     K   L   ++++D+A+ E Q
Sbjct:   26 IDEFHPKATRTLFIGNL-EKTTTYHDLLNIFQRFGEIVDIDIKKVNGAPQYAFLQYCDIASVCK-------AIKKMDGEYLGNNRLKLGFGKSMPTTCVWLDGLASNTTEQFLTRHFCRYGHVVKVVLDRMKGMALILYNNIEYAQAAVKDT--KGWKLSGSKIKVDFANQESQ 189          
BLAST of Protein split ends vs. Ensembl Medaka
Match: ENSORLT00000000534.2 (pep primary_assembly:ASM223467v1:5:2402391:2404865:1 gene:ENSORLG00000000433.2 transcript:ENSORLT00000000534.2 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 65.855 bits (159), Expect = 7.287e-10
Identity = 48/151 (31.79%), Positives = 76/151 (50.33%), Query Frame = 2
Query: 7733 VWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANE--FCIGLALPGGDDINQWVKQSQILE---EGFIKYMRDKGAAGIINV-CHPENSQQGLYVVHIFPPCDFSINQLLSTA 8167
             WEG L LKN     +LH + G+  L ++    L+  G         LKI QR+R+D  +L+ V RR++ A+   + + LA+PG     +    S   E   +  + Y++ K AAGII++       +    V+H FPPC+FS   L ++A
Sbjct:  658 AWEGVLLLKNSTFPTSLHLLGGDMNLASS----LLVEGPTG-GKVSQLKISQRLRMDQPKLDEVSRRIKAASSSGYSVLLAVPGRSADGEGQDSSNSTERPLKNLVSYLKHKEAAGIISLPVGGGRDRDHAGVLHAFPPCEFSQQFLDASA 803          
BLAST of Protein split ends vs. Ensembl Medaka
Match: rbm15 (RNA binding motif protein 15 [Source:NCBI gene;Acc:101159584])

HSP 1 Score: 65.4698 bits (158), Expect = 7.827e-10
Identity = 53/171 (30.99%), Positives = 82/171 (47.95%), Query Frame = 2
Query: 7730 LVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANE--FCIGLALPGGDD---INQWVKQSQILEEGFIKYMRDKGAAGIINV-CHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVI 8224
            + W G L LKN     ++H + G+  + +     L+  GS        LKI QR+RLD  +L+ V RR++ A    + I LA+PG  +    +  +  +Q      + Y+  K AAG+I++       +    V+H FPPCDFS   L S+A  L    E     + LV+I
Sbjct:  679 VAWRGMLLLKNSNFPASMHLLEGDYGVASD----LLIDGST-GRQVSELKITQRLRLDQPKLDEVSRRIKVAGPGGYAILLAVPGSTEDSSSSDPMASTQRPLRNLVSYLNQKQAAGVISLPVGGSRDKDNTGVLHAFPPCDFSQQFLDSSAKALAKSEE-----DYLVMI 839          
BLAST of Protein split ends vs. Planmine SMEST
Match: SMESG000078393.1 (SMESG000078393.1)

HSP 1 Score: 4306.52 bits (11168), Expect = 0.000e+0
Identity = 2576/2596 (99.23%), Positives = 2579/2596 (99.35%), Query Frame = 2
Query:  446 MEFCSNHSKTSCDSDKELTNIYVNKFLKESIDINDNLKDQSFKISLDKNNNKHVLSPLGHSQGPRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQATISYWLSNNFLTLPVPPDNVSLKQSQSDXXXXXXXXXXXXXXXXXXXQEKIVVNINKSTSVNDQPVQIHSNGIQKPSDIQTSLNSLSNFESINYTQPTISVKNYQSSIGNSGVNISDMVITDRIQTDNATRVTPICNKKSSNIRVTNPYDSSCDTASNNRFSQNYDQLINSNSLHLSQTQIDHQNPGLCSIKKDKNYTEINDGKLVNNIRLSCSPVTWSTNTCTPQSSEINKSDLYHRQFSPWSAPVRNRVVARVDPFXXXXXXXXXXXXXXXXXXXXXXXXLPFVNRNTSSITSHNANRTIQRSLTLYPEKDINFNIKPENLISISKNTLPNDREFGNVKHRTNYNQPLTYRVAKNPENVQKCFDARAPLFTGVHKLKSSKTLNYKLSDISENPTKQITAAEVFGEXXXXXXXXXXXXXXXXXXRKRKQNLXXXXXXXXXXXXXXXXXXXXXQNRFIRISNVHYHENKKSIHLKPKRIKNRXXXXXXXXXXXXXDEPLSEIYKSKMNRHKMKISSKEKSETMNSSTKPLTVNVSLSNFEDNPLDSEDISSMATDNSSVELSTKCLNIAKKQSQLSSNKRKLLTDKSCIYRXXXXXXXXXXXXEVTDFRKYQGTSNRKSKSFINNHDDCKRSNSYIIDRKFKENKFLSKITTNRNHKTKFENVHHSKNKTIYNSHNFVSSKCKDNDKSSYLSRNRRDTDSPPFKRKKLRHEDNNKANMYRSEKKINDSFNIDLHTSYKRQEFQQISNGITSDAVFQKEFDEFKRDQAGDQGFVHRKGNXXXXXXXXXXXXXXXXXXEKYKTQKPSKSSKINNIYTTSSDCKNIFQIDMNNINMFESMYDKVKRRAQKQAETKNIHNSPETNEPHNKRKCXXXXXXXXXXXXXXXXXXXXXXXXXLVTSPISTAYXXXXXXXXXXXQSNYSSTMYVTSEKDLIHKNSNRKFFKPTKNSTISSFESPNSFSATSWKRSKNKDEMARIFSYSNKSEMKYCNSKKXXXXXXXXXXXXXXVSEFSFKTNISNINFSNKPIKEYRKTALEEVDNISDATASTEPMEYAEVDLESLKMSDDDWXXXXXXXXXXXXXXXXXXIEIHNVDISLENPVQIINEKTLICDQIQSLNYKIESIADCEDNTIIDHEDKNTNDIKTEIDKSPHLVEHESQLNEIKDNDRTSDEVENHECVKEEQDIAIASIMLKEESDSKSPIDWKXXXXXXXXXXXXXXXXXFENSCDEINTNCNNYDELALXXXXXXXXXXXXXEFQETVMPHTXXXXXXXXXXXEITDINATDQENIICSDLCVVNPTEAIIEDTGSETEKSVPISQNSKTLINTNMSEPKPIIDNTVCXXXXXXXXXXXXXXXXXTRNNKVQSQNKNINSPIEKLDLTSYVKGVIEQVKREQSAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQKSPTLSLQASIIENSDSQIPVSSYTQSVTPNIPGVNVCSPVTSVNLSICKPTVXXXXXXXXXXXXXKTIKTRLXXXXXXXXXXXXXXXXDRVSSNSPLSELRINTELNQFSNAELNSSVSDPMSTQLSSVSLKSPLSEDHTKQRICTRLRRADPSKSDSNYKIVTTTCITRXXXXXXXXXXXXXXXXXXXXXVSANKSLSKHLEN
IPISESATSQSEKVSDPYEPNFDESPLEFSRYRTKIKNTFFPQEKASKNVQISEENQSITPVTMQNITYSSINSVVTSGIDSVSQIIEDVVHGNFNYSEYLTSFKNGKLSSFQVSPSTTPLTFXXXXXXXXXXXXXXXXXXTNLETSVTLRKSNLSKPVVTPVFSDDNKKDVFVTXXXXXXXXKSTVTLLTSMSTDDIGAVCNIYTNNNKLNLQKEITNQNFVSKSSTSNANVSHVFTTSSVPQLNTEFLALINLCKPKDANFQNSNAISSNENVQQKSLSISSVHLQPCDNSSASFDKNTVPNSKLTLNIDSNLSQTISKNNIFNIAFNXXXXXXXXXXXXXXXNQNENAVKQLYFNLLQRSHFSGVSVDSLTKLQTTITESSLNDINSHTVSNSYKNIQNNVQVESIATLPLKXXXXXXXXXXXXXXIADRIIPRMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            MEFCSNHSKTSCDSDKELTNIYVNKFLKESIDINDNLKDQSFKISLDKNNNKHVLSPLGHSQGPRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITAN PQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQATISYWLSNNFLTLPVPPDNVSLKQSQSDNSINS               EKIVVNINKSTSVNDQPVQIHSNGIQKPSDIQTSLNSLSNFESINYTQPTISVKNYQSSIGNSGVNISDMVITDRIQTDNATRVTPICNKKSSNIRVTNPYDSSCDTASNNRFSQNYDQLINSNSLHLSQTQIDHQNPGLCSIKKDKNYTEINDGKLVNNIRLSCSPVTWSTNTCTPQSSEINKSDLYHRQFSPWSAPVRNRVVARVDPFTHTSSITHNSSQSISNIQSKTSQSLPFVNRNTSSITSHNANRTIQRSLTLYPEKDINFNIKPENLISISKNTLPNDREFGNVKHRTNYNQPLTYRVAKNPENVQKCFDARAPLFTGVHKLKSSKTLNYKLSDISENPTKQITAAEVFGEDFSDLSDSDSPSSYISNSRKRKQNLSKCKSPSVDSSSTVSVLSSEKQNRFIRISNVHYHENKKSIHLKPKRIKNRINSSKFSSGNISSDEPLSEIYKSKMNRHKMKISSKEKSETMNSSTKPLTVNVSLSNFEDNPLDSEDISSMATDNSSVELSTKCLNIAKKQSQLSSNKRKLLTDKS IYRNDNMNLSSSSDNEVTDFRKYQGTSNRKSKSFINNHDDCKRSNSYIIDRKFKENKFLSKITTNRNHKTKFENVHHSKNKTIYNSHNFVSSKCKDNDKSSYLSRNRRDTDSPPFKRKKLRHEDNNKANMYRSEKKINDSFNIDLHTSYKRQEFQQISNGITSDAVFQKEFDEFKRDQAGDQGFVHRKGNSLKVSSVSSTSMKPFSSFEKYKTQKPSKSSKINNIYTTSSDCKNIFQIDMNNINMFESMYDKVKRRAQKQAETKNIHNSPETNEPHNKRKCKISEHKHKSLKESKRYYKRHKKSRELVTSPISTAYSDDSSLLVDDSQSNYSSTMYVTSEKDLIHKN+NRKFFKPTKNSTISSFESPNSFSATSWKRSKNKDEMARIFSYSNKSEMKYCNSKKNDNEFENDNEDHENVSEFSFKTNISNINFSNKPIKEYRKTALEEVDNISDATASTEPMEYAEVDLESLKMSDDDWKSGLSSQSSNESSNKASKIEIHNVDISLENPVQIINEKTLICDQIQSLNYKIESIADCEDNTIIDHEDKNTNDIKTEIDKSPHLVEHESQLNEIKDNDRTSDEVENHECVKEEQDIAIASIMLKEESDSKSPIDWKSNVISDSNNLINSINHIFENSCDEINTNCNNYDELALINSSNINVDISNNEFQETVMPHTSLSNSDLLLNSEITDINATDQENIICSDLCVVNPTEAIIEDTGSETEKSVPISQNSKTLINTNMSEPKPIIDNTVCSNNEIISESISSVISASTRNNKVQSQNKNINSPIEKLDLTSYVKGVIEQVKREQSAENQNKLKLEKNNKRIKKIGIGSNVSSKNSQSSNNSATSNLSVSVLTTLQKSPTLSLQASIIENSDSQIPVSSY+QSVTPNIPGVNVCSPVTSV+LSICKPTVESSISSLPEIIIPKTIKTRLNAISIINMSTNSSMKSDRVSSNSPLSELRINTELNQFSNAELNSSVSDPMSTQLSSVSLKSPLSEDHTKQRICTRLRRADPSKSDSNYKIVTTTCITRSDANPNNNNNQSNSNISDNQIVSANKSLSKHLEN
IPISESATSQSEKVSDPYEPNFDESPLEFSRYRTKIKNTFFPQEKASKNVQISEENQSITPVTMQNITYSSINSVVTSGIDSVSQIIEDVVHGNFNYSEYLTSFKNGKLSSFQVSPSTTPLTFVSNDSRDCSSVSVDNSNSTNLETSVTLRKSNLSKPVVTPVFSDDNKKDVFVTSNSISNINKSTVTLLTSMSTDDIGAVCNIYTNNNKLNLQKEITNQNFVSKSSTSNANVSHVFTTSSVPQLNTEFLALINLCKPKDANFQNSNAISSNENVQQKSLSISSVHLQPCDNSSASFDKNTVPNSKLTLNIDSNLSQTISKNNIFNIAFNSPQSSVSSLLPSVTQNQNENAVKQLYFNLLQRSHFSGVSVDSLTKLQTTITESSLNDINSHTVSNSYKNIQNNVQVESIATLPLKSESNQESQNISSRIIADRIIPRMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV
Sbjct:    1 MEFCSNHSKTSCDSDKELTNIYVNKFLKESIDINDNLKDQSFKISLDKNNNKHVLSPLGHSQGPRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYDLFDFDELNITAN-PQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQATISYWLSNNFLTLPVPPDNVSLKQSQSDNSINS---------------EKIVVNINKSTSVNDQPVQIHSNGIQKPSDIQTSLNSLSNFESINYTQPTISVKNYQSSIGNSGVNISDMVITDRIQTDNATRVTPICNKKSSNIRVTNPYDSSCDTASNNRFSQNYDQLINSNSLHLSQTQIDHQNPGLCSIKKDKNYTEINDGKLVNNIRLSCSPVTWSTNTCTPQSSEINKSDLYHRQFSPWSAPVRNRVVARVDPFTHTSSITHNSSQSISNIQSKTSQSLPFVNRNTSSITSHNANRTIQRSLTLYPEKDINFNIKPENLISISKNTLPNDREFGNVKHRTNYNQPLTYRVAKNPENVQKCFDARAPLFTGVHKLKSSKTLNYKLSDISENPTKQITAAEVFGEDFSDLSDSDSPSSYISNSRKRKQNLSKCKSPSVDSSSTVSVLSSEKQNRFIRISNVHYHENKKSIHLKPKRIKNRINSSKFSSGNISSDEPLSEIYKSKMNRHKMKISSKEKSETMNSSTKPLTVNVSLSNFEDNPLDSEDISSMATDNSSVELSTKCLNIAKKQSQLSSNKRKLLTDKSGIYRNDNMNLSSSSDNEVTDFRKYQGTSNRKSKSFINNHDDCKRSNSYIIDRKFKENKFLSKITTNRNHKTKFENVHHSKNKTIYNSHNFVSSKCKDNDKSSYLSRNRRDTDSPPFKRKKLRHEDNNKANMYRSEKKINDSFNIDLHTSYKRQEFQQISNGITSDAVFQKEFDEFKRDQAGDQGFVHRKGNSLKVSSVSSTSMKPFSSFEKYKTQKPSKSSKINNIYTTSSDCKNIFQIDMNNINMFESMYDKVKRRAQKQAETKNIHNSPETNEPHNKRKCKISEHKHKSLKESKRYYKRHKKSRELVTSPISTAYSDDSSLLVDDSQSNYSSTMYVTSEKDLIHKNNNRKFFKPTKNSTISSFESPNSFSATSWKRSKNKDEMARIFSYSNKSEMKYCNSKKNDNEFENDNEDHENVSEFSFKTNISNINFSNKPIKEYRKTALEEVDNISDATASTEPMEYAEVDLESLKMSDDDWKSGLSSQSSNESSNKASKIEIHNVDISLENPVQIINEKTLICDQIQSLNYKIESIADCEDNTIIDHEDKNTNDIKTEIDKSPHLVEHESQLNEIKDNDRTSDEVENHECVKEEQDIAIASIMLKEESDSKSPIDWKSNVISDSNNLINSINHIFENSCDEINTNCNNYDELALINSSNINVDISNNEFQETVMPHTSLSNSDLLLNSEITDINATDQENIICSDLCVVNPTEAIIEDTGSETEKSVPISQNSKTLINTNMSEPKPIIDNTVCSNNEIISESISSVISASTRNNKVQSQNKNINSPIEKLDLTSYVKGVIEQVKREQSAENQNKLKLEKNNKRIKKIGIGSNVSSKNSQSSNNSATSNLSVSVLTTLQKSPTLSLQASIIENSDSQIPVSSYSQSVTPNIPGVNVCSPVTSVHLSICKPTVESSISSLPEIIIPKTIKTRLNAISIINMSTNSSMKSDRVSSNSPLSELRINTELNQFSNAELNSSVSDPMSTQLSSVSLKSPLSEDHTKQRICTRLRRADPSKSDSNYKIVTTTCITRSDANPNNNNNQSNSNISDNQIVSANKSLSKHLEN
IPISESATSQSEKVSDPYEPNFDESPLEFSRYRTKIKNTFFPQEKASKNVQISEENQSITPVTMQNITYSSINSVVTSGIDSVSQIIEDVVHGNFNYSEYLTSFKNGKLSSFQVSPSTTPLTFVSNDSRDCSSVSVDNSNSTNLETSVTLRKSNLSKPVVTPVFSDDNKKDVFVTSNSISNINKSTVTLLTSMSTDDIGAVCNIYTNNNKLNLQKEITNQNFVSKSSTSNANVSHVFTTSSVPQLNTEFLALINLCKPKDANFQNSNAISSNENVQQKSLSISSVHLQPCDNSSASFDKNTVPNSKLTLNIDSNLSQTISKNNIFNIAFNSPQSSVSSLLPSVTQNQNENAVKQLYFNLLQRSHFSGVSVDSLTKLQTTITESSLNDINSHTVSNSYKNIQNNVQVESIATLPLKSESNQESQNISSRIIADRIIPRMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 2580          
BLAST of Protein split ends vs. Planmine SMEST
Match: SMESG000078393.1 (SMESG000078393.1)

HSP 1 Score: 4027.63 bits (10444), Expect = 0.000e+0
Identity = 2441/2461 (99.19%), Positives = 2444/2461 (99.31%), Query Frame = 2
Query:  851 MASKSQSVELHPGYDLFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQATISYWLSNNFLTLPVPPDNVSLKQSQSDXXXXXXXXXXXXXXXXXXXQEKIVVNINKSTSVNDQPVQIHSNGIQKPSDIQTSLNSLSNFESINYTQPTISVKNYQSSIGNSGVNISDMVITDRIQTDNATRVTPICNKKSSNIRVTNPYDSSCDTASNNRFSQNYDQLINSNSLHLSQTQIDHQNPGLCSIKKDKNYTEINDGKLVNNIRLSCSPVTWSTNTCTPQSSEINKSDLYHRQFSPWSAPVRNRVVARVDPFXXXXXXXXXXXXXXXXXXXXXXXXLPFVNRNTSSITSHNANRTIQRSLTLYPEKDINFNIKPENLISISKNTLPNDREFGNVKHRTNYNQPLTYRVAKNPENVQKCFDARAPLFTGVHKLKSSKTLNYKLSDISENPTKQITAAEVFGEXXXXXXXXXXXXXXXXXXRKRKQNLXXXXXXXXXXXXXXXXXXXXXQNRFIRISNVHYHENKKSIHLKPKRIKNRXXXXXXXXXXXXXDEPLSEIYKSKMNRHKMKISSKEKSETMNSSTKPLTVNVSLSNFEDNPLDSEDISSMATDNSSVELSTKCLNIAKKQSQLSSNKRKLLTDKSCIYRXXXXXXXXXXXXEVTDFRKYQGTSNRKSKSFINNHDDCKRSNSYIIDRKFKENKFLSKITTNRNHKTKFENVHHSKNKTIYNSHNFVSSKCKDNDKSSYLSRNRRDTDSPPFKRKKLRHEDNNKANMYRSEKKINDSFNIDLHTSYKRQEFQQISNGITSDAVFQKEFDEFKRDQAGDQGFVHRKGNXXXXXXXXXXXXXXXXXXEKYKTQKPSKSSKINNIYTTSSDCKNIFQIDMNNINMFESMYDKVKRRAQKQAETKNIHNSPETNEPHNKRKCXXXXXXXXXXXXXXXXXXXXXXXXXLVTSPISTAYXXXXXXXXXXXQSNYSSTMYVTSEKDLIHKNSNRKFFKPTKNSTISSFESPNSFSATSWKRSKNKDEMARIFSYSNKSEMKYCNSKKXXXXXXXXXXXXXXVSEFSFKTNISNINFSNKPIKEYRKTALEEVDNISDATASTEPMEYAEVDLESLKMSDDDWXXXXXXXXXXXXXXXXXXIEIHNVDISLENPVQIINEKTLICDQIQSLNYKIESIADCEDNTIIDHEDKNTNDIKTEIDKSPHLVEHESQLNEIKDNDRTSDEVENHECVKEEQDIAIASIMLKEESDSKSPIDWKXXXXXXXXXXXXXXXXXFENSCDEINTNCNNYDELALXXXXXXXXXXXXXEFQETVMPHTXXXXXXXXXXXEITDINATDQENIICSDLCVVNPTEAIIEDTGSETEKSVPISQNSKTLINTNMSEPKPIIDNTVCXXXXXXXXXXXXXXXXXTRNNKVQSQNKNINSPIEKLDLTSYVKGVIEQVKREQSAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQKSPTLSLQASIIENSDSQIPVSSYTQSVTPNIPGVNVCSPVTSVNLSICKPTVXXXXXXXXXXXXXKTIKTRLXXXXXXXXXXXXXXXXDRVSSNSPLSELRINTELNQFSNAELNSSVSDPMSTQLSSVSLKSPLSEDHTKQRICTRLRRADPSKSDSNYKIVTTTCITRXXXXXXXXXXXXXXXXXXXXXVSANKSLSKHLENIPISESATSQSEKVSDPYEPNFDESPLEFSRYRTKIKNTFFPQEKASKNVQISEENQSITPVTMQNITYSSINSVVTSGIDSVSQIIEDVVHGNFNYSEYLTSFKNGKLSSFQVSPSTTPLTFXXXXXXXXXXXX
XXXXXXTNLETSVTLRKSNLSKPVVTPVFSDDNKKDVFVTXXXXXXXXKSTVTLLTSMSTDDIGAVCNIYTNNNKLNLQKEITNQNFVSKSSTSNANVSHVFTTSSVPQLNTEFLALINLCKPKDANFQNSNAISSNENVQQKSLSISSVHLQPCDNSSASFDKNTVPNSKLTLNIDSNLSQTISKNNIFNIAFNXXXXXXXXXXXXXXXNQNENAVKQLYFNLLQRSHFSGVSVDSLTKLQTTITESSLNDINSHTVSNSYKNIQNNVQVESIATLPLKXXXXXXXXXXXXXXIADRIIPRMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            MASKSQSVELHPGYDLFDFDELNITAN PQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQATISYWLSNNFLTLPVPPDNVSLKQSQSDNSINS               EKIVVNINKSTSVNDQPVQIHSNGIQKPSDIQTSLNSLSNFESINYTQPTISVKNYQSSIGNSGVNISDMVITDRIQTDNATRVTPICNKKSSNIRVTNPYDSSCDTASNNRFSQNYDQLINSNSLHLSQTQIDHQNPGLCSIKKDKNYTEINDGKLVNNIRLSCSPVTWSTNTCTPQSSEINKSDLYHRQFSPWSAPVRNRVVARVDPFTHTSSITHNSSQSISNIQSKTSQSLPFVNRNTSSITSHNANRTIQRSLTLYPEKDINFNIKPENLISISKNTLPNDREFGNVKHRTNYNQPLTYRVAKNPENVQKCFDARAPLFTGVHKLKSSKTLNYKLSDISENPTKQITAAEVFGEDFSDLSDSDSPSSYISNSRKRKQNLSKCKSPSVDSSSTVSVLSSEKQNRFIRISNVHYHENKKSIHLKPKRIKNRINSSKFSSGNISSDEPLSEIYKSKMNRHKMKISSKEKSETMNSSTKPLTVNVSLSNFEDNPLDSEDISSMATDNSSVELSTKCLNIAKKQSQLSSNKRKLLTDKS IYRNDNMNLSSSSDNEVTDFRKYQGTSNRKSKSFINNHDDCKRSNSYIIDRKFKENKFLSKITTNRNHKTKFENVHHSKNKTIYNSHNFVSSKCKDNDKSSYLSRNRRDTDSPPFKRKKLRHEDNNKANMYRSEKKINDSFNIDLHTSYKRQEFQQISNGITSDAVFQKEFDEFKRDQAGDQGFVHRKGNSLKVSSVSSTSMKPFSSFEKYKTQKPSKSSKINNIYTTSSDCKNIFQIDMNNINMFESMYDKVKRRAQKQAETKNIHNSPETNEPHNKRKCKISEHKHKSLKESKRYYKRHKKSRELVTSPISTAYSDDSSLLVDDSQSNYSSTMYVTSEKDLIHKN+NRKFFKPTKNSTISSFESPNSFSATSWKRSKNKDEMARIFSYSNKSEMKYCNSKKNDNEFENDNEDHENVSEFSFKTNISNINFSNKPIKEYRKTALEEVDNISDATASTEPMEYAEVDLESLKMSDDDWKSGLSSQSSNESSNKASKIEIHNVDISLENPVQIINEKTLICDQIQSLNYKIESIADCEDNTIIDHEDKNTNDIKTEIDKSPHLVEHESQLNEIKDNDRTSDEVENHECVKEEQDIAIASIMLKEESDSKSPIDWKSNVISDSNNLINSINHIFENSCDEINTNCNNYDELALINSSNINVDISNNEFQETVMPHTSLSNSDLLLNSEITDINATDQENIICSDLCVVNPTEAIIEDTGSETEKSVPISQNSKTLINTNMSEPKPIIDNTVCSNNEIISESISSVISASTRNNKVQSQNKNINSPIEKLDLTSYVKGVIEQVKREQSAENQNKLKLEKNNKRIKKIGIGSNVSSKNSQSSNNSATSNLSVSVLTTLQKSPTLSLQASIIENSDSQIPVSSY+QSVTPNIPGVNVCSPVTSV+LSICKPTVESSISSLPEIIIPKTIKTRLNAISIINMSTNSSMKSDRVSSNSPLSELRINTELNQFSNAELNSSVSDPMSTQLSSVSLKSPLSEDHTKQRICTRLRRADPSKSDSNYKIVTTTCITRSDANPNNNNNQSNSNISDNQIVSANKSLSKHLENIPISESATSQSEKVSDPYEPNFDESPLEFSRYRTKIKNTFFPQEKASKNVQISEENQSITPVTMQNITYSSINSVVTSGIDSVSQIIEDVVHGNFNYSEYLTSFKNGKLSSFQVSPSTTPLTFVSNDSRDCSSVS
VDNSNSTNLETSVTLRKSNLSKPVVTPVFSDDNKKDVFVTSNSISNINKSTVTLLTSMSTDDIGAVCNIYTNNNKLNLQKEITNQNFVSKSSTSNANVSHVFTTSSVPQLNTEFLALINLCKPKDANFQNSNAISSNENVQQKSLSISSVHLQPCDNSSASFDKNTVPNSKLTLNIDSNLSQTISKNNIFNIAFNSPQSSVSSLLPSVTQNQNENAVKQLYFNLLQRSHFSGVSVDSLTKLQTTITESSLNDINSHTVSNSYKNIQNNVQVESIATLPLKSESNQESQNISSRIIADRIIPRMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV
Sbjct:    1 MASKSQSVELHPGYDLFDFDELNITAN-PQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQATISYWLSNNFLTLPVPPDNVSLKQSQSDNSINS---------------EKIVVNINKSTSVNDQPVQIHSNGIQKPSDIQTSLNSLSNFESINYTQPTISVKNYQSSIGNSGVNISDMVITDRIQTDNATRVTPICNKKSSNIRVTNPYDSSCDTASNNRFSQNYDQLINSNSLHLSQTQIDHQNPGLCSIKKDKNYTEINDGKLVNNIRLSCSPVTWSTNTCTPQSSEINKSDLYHRQFSPWSAPVRNRVVARVDPFTHTSSITHNSSQSISNIQSKTSQSLPFVNRNTSSITSHNANRTIQRSLTLYPEKDINFNIKPENLISISKNTLPNDREFGNVKHRTNYNQPLTYRVAKNPENVQKCFDARAPLFTGVHKLKSSKTLNYKLSDISENPTKQITAAEVFGEDFSDLSDSDSPSSYISNSRKRKQNLSKCKSPSVDSSSTVSVLSSEKQNRFIRISNVHYHENKKSIHLKPKRIKNRINSSKFSSGNISSDEPLSEIYKSKMNRHKMKISSKEKSETMNSSTKPLTVNVSLSNFEDNPLDSEDISSMATDNSSVELSTKCLNIAKKQSQLSSNKRKLLTDKSGIYRNDNMNLSSSSDNEVTDFRKYQGTSNRKSKSFINNHDDCKRSNSYIIDRKFKENKFLSKITTNRNHKTKFENVHHSKNKTIYNSHNFVSSKCKDNDKSSYLSRNRRDTDSPPFKRKKLRHEDNNKANMYRSEKKINDSFNIDLHTSYKRQEFQQISNGITSDAVFQKEFDEFKRDQAGDQGFVHRKGNSLKVSSVSSTSMKPFSSFEKYKTQKPSKSSKINNIYTTSSDCKNIFQIDMNNINMFESMYDKVKRRAQKQAETKNIHNSPETNEPHNKRKCKISEHKHKSLKESKRYYKRHKKSRELVTSPISTAYSDDSSLLVDDSQSNYSSTMYVTSEKDLIHKNNNRKFFKPTKNSTISSFESPNSFSATSWKRSKNKDEMARIFSYSNKSEMKYCNSKKNDNEFENDNEDHENVSEFSFKTNISNINFSNKPIKEYRKTALEEVDNISDATASTEPMEYAEVDLESLKMSDDDWKSGLSSQSSNESSNKASKIEIHNVDISLENPVQIINEKTLICDQIQSLNYKIESIADCEDNTIIDHEDKNTNDIKTEIDKSPHLVEHESQLNEIKDNDRTSDEVENHECVKEEQDIAIASIMLKEESDSKSPIDWKSNVISDSNNLINSINHIFENSCDEINTNCNNYDELALINSSNINVDISNNEFQETVMPHTSLSNSDLLLNSEITDINATDQENIICSDLCVVNPTEAIIEDTGSETEKSVPISQNSKTLINTNMSEPKPIIDNTVCSNNEIISESISSVISASTRNNKVQSQNKNINSPIEKLDLTSYVKGVIEQVKREQSAENQNKLKLEKNNKRIKKIGIGSNVSSKNSQSSNNSATSNLSVSVLTTLQKSPTLSLQASIIENSDSQIPVSSYSQSVTPNIPGVNVCSPVTSVHLSICKPTVESSISSLPEIIIPKTIKTRLNAISIINMSTNSSMKSDRVSSNSPLSELRINTELNQFSNAELNSSVSDPMSTQLSSVSLKSPLSEDHTKQRICTRLRRADPSKSDSNYKIVTTTCITRSDANPNNNNNQSNSNISDNQIVSANKSLSKHLENIPISESATSQSEKVSDPYEPNFDESPLEFSRYRTKIKNTFFPQEKASKNVQISEENQSITPVTMQNITYSSINSVVTSGIDSVSQIIEDVVHGNFNYSEYLTSFKNGKLSSFQVSPSTTPLTFVSNDSRDCSSVS
VDNSNSTNLETSVTLRKSNLSKPVVTPVFSDDNKKDVFVTSNSISNINKSTVTLLTSMSTDDIGAVCNIYTNNNKLNLQKEITNQNFVSKSSTSNANVSHVFTTSSVPQLNTEFLALINLCKPKDANFQNSNAISSNENVQQKSLSISSVHLQPCDNSSASFDKNTVPNSKLTLNIDSNLSQTISKNNIFNIAFNSPQSSVSSLLPSVTQNQNENAVKQLYFNLLQRSHFSGVSVDSLTKLQTTITESSLNDINSHTVSNSYKNIQNNVQVESIATLPLKSESNQESQNISSRIIADRIIPRMYPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 2445          
BLAST of Protein split ends vs. Planmine SMEST
Match: SMESG000016300.1 (SMESG000016300.1)

HSP 1 Score: 171.014 bits (432), Expect = 9.721e-42
Identity = 85/170 (50.00%), Positives = 113/170 (66.47%), Query Frame = 2
Query: 7724 YPLVWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENSQQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV 8233
            YP+ W GKL LKN+E  V +H + GN +L+   M+L+    SL    +  L+IVQRMRL+ +QLEG+QRR+    EFC  L L GGD + +  + S IL + FIKY+++K AAGIIN+C PE SQQ  YV+HIFPPC+FS NQL +    L + ++    P LLVVITTV
Sbjct: 2607 YPVRWTGKLCLKNDEVFVQMHMIAGNEDLIKNSMNLISNGQSL----SQPLRIVQRMRLEPNQLEGLQRRLTRPLEFCTCLTLAGGDKLEELTQTSSILNDNFIKYLQEKSAAGIINICIPE-SQQNAYVIHIFPPCEFSDNQLKNCCSKLYDKLKEESVPYLLVVITTV 2771          

HSP 2 Score: 116.701 bits (291), Expect = 2.861e-25
Identity = 89/291 (30.58%), Positives = 145/291 (49.83%), Query Frame = 2
Query:  641 GIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRNYNIKSI-VLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQ-------SVELHPGYDLFDFDELNITANCP--QGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKLGKVLDIEIK-----PKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSSTSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKALVYLDNAFAAQQSIIHLKRKNSFLG--TCRVQIDYASLECQ 1462
            G+ +  LS + S++S+R  L  ++    K        I S+ V+  A+G    R+AI++F  S D EKA + AS+ +         + H G++  D D       CP    +DE+HPKA++TL +  L         L + F K G+++DI+IK     P       A VQ+ DIS VV+ L   R+     + S    S    FG S PTNC+W+G +  + N+  +  +F R+G ++ V+ D +   AL+  D    AQ+++   + +++  G    R+Q+DYAS E Q
Sbjct:  186 GLLIENLSQRSSDTSLREGLFHEYKKHGK--------ITSVTVMGQASG----RYAIISFKRSEDMEKAFE-ASQGKVFFGNLIKAKQHNGFNYIDPDL------CPPEHAMDEYHPKATKTLFVGKLTTGSVTESDLKKSFKKFGEIIDIDIKTQSNQPDTSF---AFVQYYDISSVVRAL---RNQDNIKIDSK---SVKLGFGKSQPTNCLWIGNLQRNANDAFLRRHFRRYGPIMIVIPDPEFSCALIQFDCVEDAQRALTDTRERSNGNGNNNKRMQVDYASNEYQ 448          
BLAST of Protein split ends vs. Planmine SMEST
Match: SMESG000077958.1 (SMESG000077958.1)

HSP 1 Score: 77.411 bits (189), Expect = 7.571e-14
Identity = 54/175 (30.86%), Positives = 88/175 (50.29%), Query Frame = 2
Query: 7733 VWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANE--FCIGLALPGGDDINQWVKQSQILEEG---------FIKYMRDKGAAGIINVCHPENSQQGLY-VVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVV 8221
            VW G + LKN      L+ + G   L+N  M   +   +   NN  +L I QRM+LD+ +L  + +R++ A E  FC+ LA       + ++  ++++ +           I Y+R K +AG+I      NS + L  V+H FPPCDFS + LL  AP+L      + F  +L++
Sbjct:  308 VWTGMIVLKNNNFGCRLYMLKGEQSLINEFM---LTQNNDSKNNIDNLTITQRMKLDIQKLAEMSQRIEFAGENGFCLMLA-------SSYIVDTKLVSDSNQPQRPLKNLITYLRTKESAGVI-FLKSNNSDKNLSGVLHAFPPCDFSFDILLQKAPNLTKDFSKDDFLVILLI 471          
BLAST of Protein split ends vs. Planmine SMEST
Match: SMESG000077958.1 (SMESG000077958.1)

HSP 1 Score: 77.411 bits (189), Expect = 1.190e-13
Identity = 54/175 (30.86%), Positives = 88/175 (50.29%), Query Frame = 2
Query: 7733 VWEGKLSLKNEEAKVALHFVHGNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNANE--FCIGLALPGGDDINQWVKQSQILEEG---------FIKYMRDKGAAGIINVCHPENSQQGLY-VVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVV 8221
            VW G + LKN      L+ + G   L+N  M   +   +   NN  +L I QRM+LD+ +L  + +R++ A E  FC+ LA       + ++  ++++ +           I Y+R K +AG+I      NS + L  V+H FPPCDFS + LL  AP+L      + F  +L++
Sbjct:  437 VWTGMIVLKNNNFGCRLYMLKGEQSLINEFM---LTQNNDSKNNIDNLTITQRMKLDIQKLAEMSQRIEFAGENGFCLMLA-------SSYIVDTKLVSDSNQPQRPLKNLITYLRTKESAGVI-FLKSNNSDKNLSGVLHAFPPCDFSFDILLQKAPNLTKDFSKDDFLVILLI 600          
The following BLAST results are available for this feature:
BLAST of Protein split ends vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 5
Match Name    E-value      Identity (%)    Description
SPEN          2.376e-31    46.82           spen family transcriptional repressor [Source:HGNC...
SPEN          1.558e-23    29.17           spen family transcriptional repressor [Source:HGNC...
SPEN          3.753e-21    30.28           spen family transcriptional repressor [Source:HGNC...
RBM15         3.727e-9     31.25           RNA binding motif protein 15 [Source:HGNC Symbol;A...
RBM15         3.901e-9     31.25           RNA binding motif protein 15 [Source:HGNC Symbol;A...
BLAST of Protein split ends vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 5
Match Name    E-value      Identity (%)    Description
din-1         3.570e-13    30.12           Daf-12-interacting protein 1 [Source:UniProtKB/Sw...
din-1         3.570e-13    30.12           Daf-12-interacting protein 1 [Source:UniProtKB/Sw...
din-1         8.696e-13    31.36           Daf-12-interacting protein 1 [Source:UniProtKB/Sw...
din-1         8.696e-13    31.36           Daf-12-interacting protein 1 [Source:UniProtKB/Sw...
din-1         1.027e-11    30.99           Daf-12-interacting protein 1 [Source:UniProtKB/Sw...
BLAST of Protein split ends vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 5
Match Name    E-value      Identity (%)    Description
spen          5.749e-36    46.47           gene:FBgn0016977 transcript:FBtr0332336
spen          5.751e-36    46.47           gene:FBgn0016977 transcript:FBtr0306341
spen          5.751e-36    46.47           gene:FBgn0016977 transcript:FBtr0330652
spen          5.801e-36    46.47           gene:FBgn0016977 transcript:FBtr0330653
spen          5.952e-36    46.47           gene:FBgn0016977 transcript:FBtr0078121
BLAST of Protein split ends vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 5
Match Name           E-value      Identity (%)    Description
spen                 2.864e-35    49.71           spen family transcriptional repressor [Source:ZFIN...
spen                 6.403e-32    49.12           spen family transcriptional repressor [Source:ZFIN...
si:ch1073-335m2.2    5.543e-31    46.47           si:ch1073-335m2.2 [Source:ZFIN;Acc:ZDB-GENE-081104...
rbm15b               1.433e-11    32.20           RNA binding motif protein 15B [Source:ZFIN;Acc:ZDB...
rbm15                1.473e-10    30.99           RNA binding motif protein 15 [Source:ZFIN;Acc:ZDB-...
BLAST of Protein split ends vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 3
Match Name    E-value      Identity (%)    Description
fam78b        9.704e-20    29.86           family with sequence similarity 78 member B [Sourc...
RBM15         5.003e-11    34.72           RNA binding motif protein 15 [Source:NCBI gene;Acc...
smpd3         2.496e-9     31.82           sphingomyelin phosphodiesterase 3 [Source:Xenbase;...
BLAST of Protein split ends vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 4
Match Name    E-value      Identity (%)    Description
Spen          2.868e-32    47.40           spen family transcription repressor [Source:MGI Sy...
Spen          2.962e-32    47.40           spen family transcription repressor [Source:MGI Sy...
Rbm15         2.470e-9     31.25           RNA binding motif protein 15 [Source:MGI Symbol;Ac...
Rbm15b        6.207e-8     30.22           RNA binding motif protein 15B [Source:MGI Symbol;A...
BLAST of Protein split ends vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 5
Match Name              E-value      Identity (%)    Description
sp|Q8SX83|SPEN_DROME    6.021e-35    46.47           Protein split ends OS=Drosophila melanogaster OX=7...
sp|Q62504|MINT_MOUSE    2.023e-31    47.40           Msx2-interacting protein OS=Mus musculus OX=10090 ...
sp|Q96T58|MINT_HUMAN    1.141e-30    46.82           Msx2-interacting protein OS=Homo sapiens OX=9606 G...
sp|G5EGK6|DIN1_CAEEL    1.759e-10    30.99           Daf-12-interacting protein 1 OS=Caenorhabditis ele...
sp|Q7KMJ6|NITO_DROME    8.802e-10    30.00           RNA-binding protein spenito OS=Drosophila melanoga...
BLAST of Protein split ends vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match Name    E-value      Identity (%)    Description
A0A430QDF6    3.511e-47    57.95           Uncharacterized protein OS=Schistosoma bovis OX=61...
A0A3Q0KR58    5.550e-47    57.95           Platelet binding protein-related OS=Schistosoma ma...
A0A183QRL0    5.607e-47    57.95           Uncharacterized protein OS=Schistosoma rodhaini OX...
G4VE99        6.628e-47    57.95           Platelet binding protein-related OS=Schistosoma ma...
A0A5K4ETF3    6.820e-47    57.95           Platelet binding protein-related OS=Schistosoma ma...
BLAST of Protein split ends vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 3
Match Name    E-value      Identity (%)    Description
spen          1.652e-31    46.51           spen family transcriptional repressor [Source:NCBI...
spen          1.981e-31    46.51           spen family transcriptional repressor [Source:NCBI...
rbm15         1.214e-8     29.55           RNA binding motif protein 15 [Source:NCBI gene;Acc...
BLAST of Protein split ends vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 0
Match Name    E-value      Identity (%)    Description
BLAST of Protein split ends vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
Match Name    E-value      Identity (%)    Description
BLAST of Protein split ends vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 3
Match Name    E-value      Identity (%)    Description
EDO40890      5.293e-35    42.94           Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
EDO40891      2.083e-17    26.67           Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
EDO49695      1.562e-16    34.62           Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
BLAST of Protein split ends vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 5
Match Name              E-value      Identity (%)    Description
spen                    4.459e-37    51.18           msx2-interacting protein-like [Source:NCBI gene;Ac...
si:ch1073-335m2.2       5.708e-33    48.54           spen family transcriptional repressor [Source:NCBI...
spen                    5.083e-26    51.20           msx2-interacting protein-like [Source:NCBI gene;Ac...
ENSORLT00000000534.2    7.287e-10    31.79           pep primary_assembly:ASM223467v1:5:2402391:2404865...
rbm15                   7.827e-10    30.99           RNA binding motif protein 15 [Source:NCBI gene;Acc...
BLAST of Protein split ends vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match Name          E-value      Identity (%)    Description
SMESG000078393.1    0.000e+0     99.23           SMESG000078393.1
SMESG000078393.1    0.000e+0     99.19           SMESG000078393.1
SMESG000016300.1    9.721e-42    50.00           SMESG000016300.1
SMESG000077958.1    7.571e-14    30.86           SMESG000077958.1
SMESG000077958.1    1.190e-13    30.86           SMESG000077958.1
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30021224 ID=SMED30021224|Name=Protein split ends|organism=Schmidtea mediterranea sexual|type=transcript|length=8488bp
ATAATTTGGATCAACATTGTAAAATTTATAAAATTTTAAATAAATTACTC
ATTGTATAAAATATATTTCTTAATTTTACAATTATATATTACTTTACAAA
AGCTATTACAGAAATTAATTTAGGCAAACTAGCGCTGGAATCATTAAGCA
AGTGAACTTTCGTTAAGCATTTATACACACATTCTGGTAAGAGAACATAC
TTGCTTTTGTGATTGTACAGGTTTATTGCCATATTTATTTTGTTGTTCAA
GATTGTGATCTATTTTAGTCAACTAACCAAATGTGGTTATTACTATTTTC
TTGCATTCATTGAGAATTATATTGTAAATGTGGCTTCGTTGAAACTAGAT
AATATAATTTTTTAATTCTAATTTCATCATTTAGCTATTCATAATGGAGT
TTTTTGCGATCAAAACTGTATGTTTTAGGAATAATTAAACTTTTCATGGA
ATTTTGTAGCAATCATTCAAAAACTTCATGTGACAGTGATAAAGAATTAA
CAAATATTTATGTGAATAAATTTTTGAAAGAATCTATTGATATTAACGAC
AATTTAAAAGACCAATCATTTAAAATTTCACTAGATAAAAACAATAACAA
GCACGTTCTCTCTCCCTTGGGTCATTCTCAAGGTCCTAGAGGGATCAAAG
TCAGTCGCTTATCACTACAATGTTCCGAATCCAGTATTCGGCACAGTTTA
ATACAACAATTTGGAATTCAAAATAAAAACAGTCAAAATCGCAATTATAA
TATTAAAAGTATAGTTCTTGAGTCTGCCGCGGGTCCCAGTCCAACAAGAT
GGGCAATTGTAACTTTTGTCACTTCTTATGATGCTGAAAAGGCTCTGCAA
ATGGCATCTAAATCCCAATCTGTTGAACTCCATCCTGGATATGATTTATT
TGACTTTGATGAGTTAAACATAACTGCCAACTGCCCTCAAGGTCTAGATG
AGTTTCATCCAAAAGCATCTAGAACTCTACAAATTTCAAACTTGCCTCAG
CATCTGAATAATTTTAAAATACTATACGAATATTTCAGTAAATTAGGAAA
AGTTTTGGATATAGAAATTAAGCCAAAATTGGTATTACATGCATCGGCAT
TAGTGCAATTTAGTGATATTTCCAAAGTAGTTAAAATATTGGTTAGTCCA
AGATCAGTACTGAAGTGTTTAGTTGGATCTTCAATAACTTCCTCTACGTC
ATTTCAATTCGGTCCGAGTTTTCCAACCAATTGTATTTGGTTAGGACAAA
TATCCAATTCAATTAACGAAAATCAAATAACAGAATATTTTTCTAGATTT
GGACATGTTTTAGAGGTTGTTCAAGATGTCAAACAACAAAAGGCTTTAGT
ATATTTGGATAACGCATTTGCAGCTCAACAGTCTATTATTCATCTGAAAC
GTAAAAATTCTTTCCTAGGAACTTGCAGAGTTCAAATCGACTACGCAAGT
TTGGAGTGTCAAGCAACTATATCCTATTGGCTTTCAAATAATTTTCTTAC
ACTACCAGTTCCACCAGATAACGTGTCGTTGAAACAATCTCAGTCAGATA
ATTCAATAAATAGTGGTATTATTTATAATAGATCTATTTCCCGGATAAAT
TCTAATCAAGAAAAAATAGTGGTTAATATCAATAAGTCAACATCTGTAAA
TGATCAACCAGTACAAATACATTCTAATGGAATTCAAAAACCTTCCGATA
TTCAGACTTCCTTAAATTCACTATCGAATTTTGAATCAATAAATTACACT
CAACCGACAATTTCAGTAAAAAACTATCAATCATCTATTGGAAATTCTGG
AGTAAATATTTCGGATATGGTAATTACTGATAGAATTCAAACAGATAATG
CTACACGAGTTACGCCAATATGCAACAAAAAATCTAGTAATATTCGAGTA
ACAAATCCTTACGATTCTAGCTGTGATACTGCATCAAACAATAGATTTTC
CCAAAATTATGATCAATTAATAAATTCGAATTCTTTACATTTGAGCCAGA
CACAAATTGATCATCAAAATCCTGGTTTATGTAGCATAAAGAAGGACAAA
AATTACACTGAAATTAATGATGGAAAACTTGTAAACAACATCAGGTTATC
TTGCAGTCCAGTCACGTGGTCTACAAATACTTGTACTCCACAATCTAGTG
AAATCAACAAATCAGATTTATATCATCGGCAATTTTCACCTTGGAGTGCT
CCTGTTCGAAACCGAGTTGTTGCTAGAGTAGATCCTTTTACCCACACAAG
TTCAATTACACATAATTCGTCTCAGTCGATTTCCAATATTCAATCAAAAA
CTAGCCAGAGTTTACCCTTTGTAAATCGTAATACTAGTTCCATAACATCT
CACAATGCAAATAGAACTATTCAGAGATCTTTAACATTATATCCAGAAAA
AGATATAAATTTTAATATCAAACCTGAAAATCTTATATCTATTAGTAAAA
ATACATTGCCTAATGATCGGGAATTTGGAAATGTAAAGCACAGAACGAAT
TATAATCAACCTCTTACTTACCGAGTTGCTAAAAATCCTGAAAATGTTCA
AAAGTGTTTCGATGCAAGGGCACCATTATTTACAGGCGTTCATAAATTAA
AGTCATCTAAAACATTAAATTATAAATTATCTGACATATCGGAAAATCCA
ACTAAACAAATTACAGCTGCCGAAGTCTTTGGAGAAGACTTTTCTGATTT
AAGTGATTCCGATTCTCCTTCGTCCTACATTTCAAATAGTCGAAAGAGGA
AACAGAATTTATCTAAATGTAAATCTCCATCTGTTGATTCTTCATCCACA
GTATCAGTTTTATCTTCTGAAAAGCAAAATCGTTTTATTCGCATTTCAAA
TGTCCATTATCATGAAAATAAGAAATCCATTCACTTAAAACCCAAAAGAA
TAAAAAATCGCATAAACTCTAGCAAATTTTCTAGTGGAAATATTAGTTCA
GATGAACCTCTTTCTGAAATTTACAAATCAAAAATGAATCGACATAAAAT
GAAAATTTCATCCAAAGAAAAATCTGAAACAATGAACAGTTCAACGAAGC
CGTTGACTGTTAATGTTTCACTTTCAAATTTTGAAGATAATCCATTAGAT
TCAGAAGATATTAGCAGTATGGCAACTGATAATAGTTCAGTAGAATTGTC
CACAAAATGCTTAAATATTGCTAAAAAGCAAAGCCAGTTGTCAAGTAATA
AACGAAAATTATTGACAGATAAATCATGCATTTATCGGAATGATAACATG
AATCTATCATCTTCATCAGATAATGAAGTAACAGATTTTAGAAAATACCA
AGGTACTTCTAATCGTAAATCAAAATCTTTTATTAATAATCACGATGATT
GCAAGAGATCAAACTCTTATATCATTGATAGAAAATTTAAAGAAAATAAA
TTTTTATCAAAAATTACTACAAATAGAAATCACAAAACAAAATTTGAAAA
TGTTCATCATTCTAAAAATAAAACTATTTATAATTCTCATAATTTTGTTA
GTAGTAAGTGTAAAGATAATGATAAAAGTTCTTATTTGTCTCGAAATCGA
AGAGATACTGATTCTCCGCCTTTCAAACGTAAAAAGTTAAGACACGAAGA
TAACAATAAAGCAAATATGTATCGTTCTGAAAAAAAAATAAATGATTCAT
TTAATATAGATCTTCATACAAGTTACAAGCGTCAAGAGTTTCAACAAATT
TCTAATGGAATCACATCAGATGCCGTTTTTCAAAAAGAATTTGATGAATT
CAAAAGAGATCAAGCTGGTGATCAAGGTTTTGTTCATAGGAAAGGAAACT
CTTTGAAAGTTTCATCTGTCTCAAGTACTAGCATGAAACCATTTTCATCA
TTTGAGAAGTATAAAACACAAAAACCATCTAAATCTTCTAAAATAAATAA
TATCTACACAACATCTTCAGACTGTAAAAATATTTTTCAAATTGACATGA
ATAATATTAACATGTTTGAATCAATGTATGACAAAGTAAAAAGGCGGGCT
CAGAAACAAGCGGAAACTAAGAATATTCATAATTCTCCTGAAACTAATGA
ACCGCACAATAAGCGAAAATGTAAAATTTCTGAGCATAAACATAAGTCTT
TAAAAGAGAGTAAAAGATATTATAAAAGACATAAAAAATCACGTGAACTT
GTTACATCACCGATTTCTACTGCTTATAGTGATGATAGTAGTTTATTGGT
TGATGATTCTCAATCAAATTATTCTTCAACTATGTATGTAACTTCTGAGA
AAGATTTAATTCATAAAAATAGTAATAGGAAATTTTTCAAACCTACAAAA
AATAGTACGATTTCTTCTTTTGAATCACCCAACTCATTTTCAGCTACAAG
TTGGAAGCGTTCAAAAAATAAAGATGAAATGGCTCGAATATTTAGTTATT
CTAATAAATCAGAGATGAAGTACTGTAATTCTAAAAAAAATGATAATGAA
TTTGAAAATGATAATGAAGATCATGAAAATGTATCAGAATTCAGTTTTAA
AACAAATATTAGTAATATTAATTTTTCAAATAAACCAATTAAGGAATATA
GAAAAACTGCATTAGAAGAAGTGGATAATATTAGTGATGCTACTGCTAGT
ACAGAACCAATGGAGTATGCTGAAGTTGATTTAGAAAGTTTAAAAATGTC
AGATGATGATTGGAAATCTGGACTTAGTAGTCAGAGTAGCAATGAAAGTA
GTAATAAAGCTTCTAAGATAGAAATTCATAATGTGGATATATCACTGGAA
AATCCTGTTCAGATTATCAATGAAAAAACGTTAATTTGTGATCAAATTCA
ATCACTGAATTACAAAATTGAAAGTATTGCTGATTGTGAAGATAATACTA
TTATTGATCACGAAGATAAAAACACTAATGATATTAAAACTGAAATAGAT
AAGTCTCCACACTTGGTTGAACATGAATCACAACTAAATGAGATTAAAGA
TAATGATCGTACTTCTGATGAAGTGGAAAATCATGAATGTGTTAAAGAAG
AACAAGATATTGCCATAGCAAGCATAATGTTGAAAGAAGAGAGTGATTCT
AAGTCCCCGATTGATTGGAAATCTAACGTTATCAGTGATTCAAATAACTT
AATCAATTCTATTAATCATATTTTTGAAAATTCTTGTGATGAAATCAATA
CAAATTGTAATAATTATGACGAATTAGCGCTCATTAATTCTTCCAATATT
AATGTAGATATTAGTAATAATGAATTTCAGGAAACAGTAATGCCACATAC
TTCACTCTCAAATAGTGATCTATTACTTAATTCAGAAATTACTGATATAA
ATGCTACAGATCAAGAAAATATTATATGTTCTGATCTGTGCGTAGTTAAC
CCCACTGAAGCAATAATTGAAGACACAGGATCAGAAACTGAGAAGTCAGT
ACCTATCTCTCAAAATAGTAAAACTCTAATTAACACCAACATGTCAGAAC
CAAAACCAATTATTGATAATACCGTTTGTTCAAATAATGAAATAATATCT
GAAAGCATTTCTTCGGTAATTTCTGCTTCTACAAGAAACAATAAAGTTCA
ATCGCAAAATAAAAATATAAACAGTCCCATTGAAAAACTGGATTTAACTT
CTTATGTAAAGGGCGTTATTGAACAAGTTAAAAGAGAACAAAGTGCTGAA
AATCAAAATAAATTAAAATTAGAAAAAAATAATAAAAGAATTAAGAAAAT
AGGAATAGGTTCTAATGTATCAAGCAAAAATTCGCAGAGTTCAAATAATT
CTGCTACAAGCAACTTATCTGTATCAGTGTTAACAACGTTACAAAAATCT
CCAACACTAAGTTTGCAGGCTTCTATTATTGAAAATTCTGATTCTCAAAT
TCCTGTTAGTTCATATACTCAATCTGTAACACCCAATATTCCAGGAGTGA
ACGTTTGTTCTCCAGTCACTAGTGTTAATTTATCAATCTGCAAGCCAACT
GTAGAATCTTCAATTTCTTCTTTACCAGAAATTATAATACCTAAAACTAT
TAAAACACGATTGAATGCCATTTCCATTATAAATATGTCAACAAACTCAT
CAATGAAAAGCGATCGAGTAAGTAGTAATTCTCCATTATCTGAACTCAGA
ATTAATACGGAATTGAATCAATTTTCAAATGCTGAATTAAATTCGTCAGT
TTCTGATCCAATGTCGACTCAGTTATCTAGTGTTAGTTTAAAATCTCCAC
TATCAGAGGACCATACCAAACAGAGAATATGTACAAGACTTCGTCGAGCA
GATCCAAGCAAATCTGATAGTAATTATAAAATTGTTACTACTACATGTAT
TACTCGTTCCGATGCTAATCCTAATAACAACAATAATCAGTCTAATTCGA
ATATTTCTGATAATCAAATAGTTAGTGCAAATAAATCGCTTTCAAAGCAT
CTTGAAAATATACCGATATCAGAATCAGCAACATCGCAGTCCGAAAAAGT
TTCAGACCCATACGAACCAAATTTTGATGAATCTCCATTGGAATTTTCAA
GATATAGAACCAAAATTAAAAATACATTTTTCCCACAAGAAAAGGCATCA
AAAAATGTACAAATATCGGAGGAAAATCAAAGTATAACGCCAGTGACAAT
GCAAAACATAACTTATTCATCGATTAATTCTGTTGTTACAAGTGGAATAG
ACAGTGTTAGTCAGATAATAGAAGACGTAGTACACGGAAATTTTAATTAT
TCAGAATATTTGACTTCATTCAAAAATGGTAAACTATCTTCATTTCAAGT
TAGTCCATCTACCACTCCATTAACATTCGTTTCAAATGACAGTAGAGATT
GTTCATCAGTTAGTGTAGATAATTCCAATAGTACAAATTTAGAGACTTCG
GTTACACTTCGAAAATCTAATTTATCTAAGCCTGTTGTAACACCAGTTTT
TTCAGATGACAATAAAAAAGATGTATTTGTTACAAGTAATTCAATTTCGA
ATATTAATAAATCAACTGTAACATTATTAACATCAATGTCCACTGATGAT
ATAGGAGCGGTTTGTAATATTTACACAAACAATAACAAATTAAATCTTCA
GAAAGAAATTACTAATCAAAATTTTGTATCAAAATCAAGTACTTCAAATG
CTAATGTTTCGCATGTGTTTACCACTTCTAGTGTACCACAATTGAATACA
GAATTTCTAGCTCTAATTAATCTTTGCAAGCCAAAAGATGCGAATTTTCA
AAATTCAAATGCGATTAGTTCAAATGAAAATGTTCAGCAAAAAAGTTTGT
CAATATCATCTGTTCACTTACAGCCCTGTGATAACTCATCTGCTTCATTT
GATAAAAATACTGTTCCTAATTCTAAATTGACTCTGAATATTGATTCAAA
CTTGTCTCAGACGATTTCGAAAAACAATATATTTAATATAGCATTTAATT
CTCCACAATCTAGTGTCAGCTCTTTGTTACCTTCTGTTACTCAGAATCAG
AATGAGAATGCCGTTAAGCAATTATATTTTAATCTTTTACAAAGATCTCA
TTTTTCAGGCGTTAGTGTTGATTCGCTAACAAAGTTACAAACTACCATAA
CTGAATCATCATTAAACGATATAAATTCACACACAGTGTCAAATTCTTAT
AAAAATATTCAAAATAATGTTCAAGTAGAATCAATAGCCACTCTACCATT
AAAGAGTGAATCAAATCAAGAATCACAAAATATCTCTTCTAGAATAATAG
CTGATCGGATCATACCAAGAATGTATCCATTAGTATGGGAAGGAAAATTA
TCTTTAAAAAATGAGGAAGCAAAAGTAGCCTTGCATTTTGTTCATGGAAA
TACAGAATTGCTTAATGCTTGTATGCATCTTCTTATATTTAGTGGTTCAT
TACAGTGGAATAATACTGGATCGCTAAAAATTGTCCAGAGAATGAGACTT
GATTTAAGTCAGTTAGAAGGTGTTCAAAGGAGAATGCAGAATGCTAATGA
ATTTTGTATTGGTTTGGCATTACCAGGAGGTGACGATATAAATCAGTGGG
TTAAACAATCTCAAATACTTGAAGAAGGTTTCATCAAATATATGCGTGAT
AAAGGAGCTGCTGGAATAATAAATGTTTGCCATCCAGAAAACAGTCAACA
AGGTTTATATGTGGTGCATATCTTTCCACCTTGTGATTTCTCAATTAATC
AATTGCTATCAACTGCTCCAGATCTACAGAATATTATTGAAGTAAACAAA
TTTCCTAATCTACTTGTTGTTATTACTACCGTGTAAAATTCATTGTTTAT
ATCATTAACGTCTTTCCAGAGAATAACAGTAATTGATTGAAGATATATTT
CCTTCCTACCTCAAAAGCAATTTTGTATATATCGATTCAATCTTTTATAG
TCCCTTTTCCACTTAAGAACACTGCTGAATGTAAAATTTACTCAGATATT
CTATCCTTGTGGCTAGAAATATTCGCAGCATATTTTTATTATGTACTTAT
GTAAATATTCAAAGAATGCTAACATTTTTGTAAATAGT

protein sequence of SMED30021224-orf-1

>SMED30021224-orf-1 ID=SMED30021224-orf-1|Name=SMED30021224-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=2597bp
MEFCSNHSKTSCDSDKELTNIYVNKFLKESIDINDNLKDQSFKISLDKNN
NKHVLSPLGHSQGPRGIKVSRLSLQCSESSIRHSLIQQFGIQNKNSQNRN
YNIKSIVLESAAGPSPTRWAIVTFVTSYDAEKALQMASKSQSVELHPGYD
LFDFDELNITANCPQGLDEFHPKASRTLQISNLPQHLNNFKILYEYFSKL
GKVLDIEIKPKLVLHASALVQFSDISKVVKILVSPRSVLKCLVGSSITSS
TSFQFGPSFPTNCIWLGQISNSINENQITEYFSRFGHVLEVVQDVKQQKA
LVYLDNAFAAQQSIIHLKRKNSFLGTCRVQIDYASLECQATISYWLSNNF
LTLPVPPDNVSLKQSQSDNSINSGIIYNRSISRINSNQEKIVVNINKSTS
VNDQPVQIHSNGIQKPSDIQTSLNSLSNFESINYTQPTISVKNYQSSIGN
SGVNISDMVITDRIQTDNATRVTPICNKKSSNIRVTNPYDSSCDTASNNR
FSQNYDQLINSNSLHLSQTQIDHQNPGLCSIKKDKNYTEINDGKLVNNIR
LSCSPVTWSTNTCTPQSSEINKSDLYHRQFSPWSAPVRNRVVARVDPFTH
TSSITHNSSQSISNIQSKTSQSLPFVNRNTSSITSHNANRTIQRSLTLYP
EKDINFNIKPENLISISKNTLPNDREFGNVKHRTNYNQPLTYRVAKNPEN
VQKCFDARAPLFTGVHKLKSSKTLNYKLSDISENPTKQITAAEVFGEDFS
DLSDSDSPSSYISNSRKRKQNLSKCKSPSVDSSSTVSVLSSEKQNRFIRI
SNVHYHENKKSIHLKPKRIKNRINSSKFSSGNISSDEPLSEIYKSKMNRH
KMKISSKEKSETMNSSTKPLTVNVSLSNFEDNPLDSEDISSMATDNSSVE
LSTKCLNIAKKQSQLSSNKRKLLTDKSCIYRNDNMNLSSSSDNEVTDFRK
YQGTSNRKSKSFINNHDDCKRSNSYIIDRKFKENKFLSKITTNRNHKTKF
ENVHHSKNKTIYNSHNFVSSKCKDNDKSSYLSRNRRDTDSPPFKRKKLRH
EDNNKANMYRSEKKINDSFNIDLHTSYKRQEFQQISNGITSDAVFQKEFD
EFKRDQAGDQGFVHRKGNSLKVSSVSSTSMKPFSSFEKYKTQKPSKSSKI
NNIYTTSSDCKNIFQIDMNNINMFESMYDKVKRRAQKQAETKNIHNSPET
NEPHNKRKCKISEHKHKSLKESKRYYKRHKKSRELVTSPISTAYSDDSSL
LVDDSQSNYSSTMYVTSEKDLIHKNSNRKFFKPTKNSTISSFESPNSFSA
TSWKRSKNKDEMARIFSYSNKSEMKYCNSKKNDNEFENDNEDHENVSEFS
FKTNISNINFSNKPIKEYRKTALEEVDNISDATASTEPMEYAEVDLESLK
MSDDDWKSGLSSQSSNESSNKASKIEIHNVDISLENPVQIINEKTLICDQ
IQSLNYKIESIADCEDNTIIDHEDKNTNDIKTEIDKSPHLVEHESQLNEI
KDNDRTSDEVENHECVKEEQDIAIASIMLKEESDSKSPIDWKSNVISDSN
NLINSINHIFENSCDEINTNCNNYDELALINSSNINVDISNNEFQETVMP
HTSLSNSDLLLNSEITDINATDQENIICSDLCVVNPTEAIIEDTGSETEK
SVPISQNSKTLINTNMSEPKPIIDNTVCSNNEIISESISSVISASTRNNK
VQSQNKNINSPIEKLDLTSYVKGVIEQVKREQSAENQNKLKLEKNNKRIK
KIGIGSNVSSKNSQSSNNSATSNLSVSVLTTLQKSPTLSLQASIIENSDS
QIPVSSYTQSVTPNIPGVNVCSPVTSVNLSICKPTVESSISSLPEIIIPK
TIKTRLNAISIINMSTNSSMKSDRVSSNSPLSELRINTELNQFSNAELNS
SVSDPMSTQLSSVSLKSPLSEDHTKQRICTRLRRADPSKSDSNYKIVTTT
CITRSDANPNNNNNQSNSNISDNQIVSANKSLSKHLENIPISESATSQSE
KVSDPYEPNFDESPLEFSRYRTKIKNTFFPQEKASKNVQISEENQSITPV
TMQNITYSSINSVVTSGIDSVSQIIEDVVHGNFNYSEYLTSFKNGKLSSF
QVSPSTTPLTFVSNDSRDCSSVSVDNSNSTNLETSVTLRKSNLSKPVVTP
VFSDDNKKDVFVTSNSISNINKSTVTLLTSMSTDDIGAVCNIYTNNNKLN
LQKEITNQNFVSKSSTSNANVSHVFTTSSVPQLNTEFLALINLCKPKDAN
FQNSNAISSNENVQQKSLSISSVHLQPCDNSSASFDKNTVPNSKLTLNID
SNLSQTISKNNIFNIAFNSPQSSVSSLLPSVTQNQNENAVKQLYFNLLQR
SHFSGVSVDSLTKLQTTITESSLNDINSHTVSNSYKNIQNNVQVESIATL
PLKSESNQESQNISSRIIADRIIPRMYPLVWEGKLSLKNEEAKVALHFVH
GNTELLNACMHLLIFSGSLQWNNTGSLKIVQRMRLDLSQLEGVQRRMQNA
NEFCIGLALPGGDDINQWVKQSQILEEGFIKYMRDKGAAGIINVCHPENS
QQGLYVVHIFPPCDFSINQLLSTAPDLQNIIEVNKFPNLLVVITTV*
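
The record above uses a FASTA header with `key=value` attributes separated by `|`, and the sequence ends in a `*` stop marker. The following is a minimal sketch, not part of the original page, of how such a record could be read in Python, for example to check whether the header's `length` field counts the terminal `*`. The function names `parse_fasta_record` and `residue_count` and the toy record are hypothetical and used only for illustration.

```python
def parse_fasta_record(text):
    """Split a single FASTA record into (record_id, attributes, sequence)."""
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    header = lines[0][1:]
    # The ID is the first whitespace-delimited token; the rest holds attributes.
    record_id, _, attr_text = header.partition(" ")
    attributes = {}
    for field in attr_text.split("|"):
        key, sep, value = field.partition("=")
        if sep:
            attributes[key] = value
    # Sequence lines are wrapped, so join them back into one string.
    sequence = "".join(lines[1:])
    return record_id, attributes, sequence


def residue_count(sequence):
    """Count residues, ignoring a trailing '*' stop marker if present."""
    return len(sequence.rstrip("*"))


# Toy record in the same header layout (assumption: not the real sequence).
toy = """>toy-orf-1 ID=toy-orf-1|type=polypeptide|length=8bp
MEFCS
NH*
"""
record_id, attributes, sequence = parse_fasta_record(toy)
print(record_id, attributes["length"], len(sequence), residue_count(sequence))
```

Comparing `len(sequence)` with `residue_count(sequence)` against the header's `length` value shows whether the stop marker was included in the reported length.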
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: molecular function
Term | Definition
GO:0003723 | RNA binding
GO:0003676 | nucleic acid binding
GO:0000166 | nucleotide binding
Vocabulary: Planarian Anatomy
Term | Definition
PLANA:0000099 | neuron
PLANA:0003116 | parenchymal cell
Vocabulary: INTERPRO
Term | Definition
IPR035979 | RBD_domain_sf
IPR000504 | RRM_dom
IPR016194 | SPOC-like_C_dom_sf
IPR012921 | SPOC_C
IPR012677 | Nucleotide-bd_a/b_plait_sf
IPR010912 | SPOC_met
InterPro
Analysis Name: Schmidtea mediteranean smed_20140614 Interproscan
Date Performed: 2020-05-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 1724..1744
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 755..787
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1898..1924
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1181..1221
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1988..2003
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1988..2013
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1022..1041
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1898..1918
None | No IPR available | PANTHER | PTHR23189:SF48 | MSX2-INTERACTING PROTEIN | coord: 117..339; coord: 2426..2595
None | No IPR available | PANTHER | PTHR23189 | RNA RECOGNITION MOTIF-CONTAINING | coord: 117..339; coord: 2426..2595
None | No IPR available | CDD | cd00590 | RRM_SF | coord: 67..136, e-value: 1.29839E-4, score: 40.3661
IPR000504 | RNA recognition motif domain | SMART | SM00360 | rrm1_1 | coord: 262..331, e-value: 0.0028, score: 26.9; coord: 176..249, e-value: 0.39, score: 10.2; coord: 66..146, e-value: 4.1, score: 1.1
IPR016194 | SPOC-like, C-terminal domain superfamily | GENE3D | G3DSA:2.40.290.10 |  | coord: 2419..2595, e-value: 1.3E-52, score: 179.7
IPR016194 | SPOC-like, C-terminal domain superfamily | SUPERFAMILY | SSF100939 | SPOC domain-like | coord: 2424..2592
IPR012677 | Nucleotide-binding alpha-beta plait domain superfamily | GENE3D | G3DSA:3.30.70.330 |  | coord: 255..339, e-value: 3.6E-8, score: 35.2
IPR012921 | Spen paralogue and orthologue SPOC, C-terminal | PFAM | PF07744 | SPOC | coord: 2425..2594, e-value: 7.0E-30, score: 103.9
IPR010912 | Spen paralogue/orthologue C-terminal, metazoa | PROSITE | PS50917 | SPOC | coord: 2419..2594, score: 36.1
IPR035979 | RNA-binding domain superfamily | SUPERFAMILY | SSF54928 | RNA-binding domain, RBD | coord: 172..316
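
The alignment entries above give each hit as `coord: start..end` on the protein sequence. As a rough sketch that is not part of the original page, the coordinates can be pulled out with a regular expression and turned into span lengths. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_coords` and `span_length` are hypothetical helper names, and the two example strings are copied from the PFAM and SMART rows.

```python
import re

# Matches entries such as "coord: 2425..2594".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(alignment_text):
    """Return every (start, end) pair found in an alignment cell."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(alignment_text)]


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Alignment cells copied from the table (PFAM SPOC and SMART RRM rows).
hits = {
    "PF07744 (SPOC)": "coord: 2425..2594",
    "SM00360 (rrm1_1)": "coord: 262..331 coord: 176..249 coord: 66..146",
}

for name, text in hits.items():
    for start, end in parse_coords(text):
        print(name, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the PFAM SPOC hit at 2425..2594 spans 170 residues near the C-terminus, while the three SMART RRM hits fall within the first 331 residues.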