Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2

Overview
NameRetrovirus-related Pol polyprotein from type-1 retrotransposable element R2
Smed IDSMED30015424
Length (bp)8216
Neoblast Clusters

Zeng et. al., 2018




▻ Overview

▻ Neoblast Population

▻ Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 



 

Overview

 

Single cell RNA-seq of pluripotent neoblasts and its early progenies


We isolated X1 neoblasts cells enriched in high piwi-1 expression (Neoblast Population), and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them based on high (Nb1) to low (Nb12) piwi-1 expression levels. We further defined groups of genes that best classified the cells parsed into 12 distinct cell clusters to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signatures was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1 and largely confirmed the cell clusters revealed by scRNA-seq.

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, followed by isolation and single-cell RNA-seq of 1,200 individual cells derived from X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population)




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (SMED30015424) t-SNE clustered cells

Violin plots show distribution of expression levels for Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (SMED30015424) in cells (dots) of each of the 12 neoblast clusters.

 

back to top


 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (SMED30015424) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (SMED30015424) in cells (dots) of each of the 10 clusters of sub-leathally irradiated X1 and X2 cells.

 

back to top


 

Embryonic Expression

Davies et. al., 2017




Hover the mouse over a column in the graph to view average RPKM values per sample.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods

 

back to top
Anatomical Expression

PAGE et. al., 2020




SMED30015424

has been reported as being expressed in these anatomical structures and/or regions. Read more about PAGE



PAGE Curations: 8

  
Expressed InReference TranscriptGene ModelsPublished TranscriptTranscriptomePublicationSpecimenLifecycleEvidence
central nervous systemSMED30015424SMESG000062936.1 SMESG000024338.1 SMESG000024319.1 SMESG000024308.1 dd_Smed_v4_18000_0_1dd_Smed_v4PMID:27063937
Scimone et al., 2016
whole organism asexual adult colorimetric in situ hybridization evidence
posterior region of the whole animalSMED30015424SMESG000062936.1 SMESG000024338.1 SMESG000024319.1 SMESG000024308.1 dd_Smed_v4_18000_0_1dd_Smed_v4PMID:27063937
Scimone et al., 2016
whole organism asexual adult colorimetric in situ hybridization evidence
nervous systemSMED30015424SMESG000062936.1 SMESG000024338.1 SMESG000024319.1 SMESG000024308.1 dd_Smed_v4_18000_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
cephalic gangliaSMED30015424SMESG000062936.1 SMESG000024338.1 SMESG000024319.1 SMESG000024308.1 dd_Smed_v4_18000_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
muscle cellSMED30015424SMESG000062936.1 SMESG000024338.1 SMESG000024319.1 SMESG000024308.1 dd_Smed_v4_18000_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
neoblastSMED30015424SMESG000062936.1 SMESG000024338.1 SMESG000024319.1 SMESG000024308.1 dd_Smed_v4_18000_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
non-ciliated neuronSMED30015424SMESG000062936.1 SMESG000024338.1 SMESG000024319.1 SMESG000024308.1 dd_Smed_v4_18000_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
prepharyngeal regionSMED30015424 dd_Smed_v6_97128_0_1dd_Smed_v6PMID:28171748
Stückemann et al., 2017
whole organism asexual adult RNA-sequencing evidence
Note: Hover over icons to view figure legend
Homology
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Zebrafish
Match: CU462878.3 (pep chromosome:GRCz11:17:8542203:8545189:1 gene:ENSDARG00000103562.2 transcript:ENSDART00000158873.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CU462878.3)

HSP 1 Score: 103.219 bits (256), Expect = 2.582e-21
Identity = 70/227 (30.84%), Postives = 113/227 (49.78%), Query Frame = 1
Query: 4684 KLASASSPGDDNITYRDLRLLDPE-GKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAK 5361
            KL +  +PG D       +   P+   +L + FN ++  G+IP +W+     +IPK GK  +  + SS+RPI++L+  YK+F SILS+RL   I++   L    + G +     + N   T  +     ++     +  LD + AF SV   YL+ +LA  G +  F+  ++ LY +  +        +  I ++RG RQGCP+S  LF L I P+  AI   P  K
Sbjct:  278 KLKANKTPGTDGYPSEMYKTFKPQITPMLLKCFNYVLRGGEIPQSWQEAVISIIPKQGKDKN--ECSSYRPISVLNVDYKLFASILSKRLEVVISE---LVDLDQTGFIQNRQTQDNLRRTLQIMSHITTENISAMLLSLDAEKAFDSVGWDYLYQVLARFGFNNKFIECIKGLYLSPKAKIKINGHLSKTIYLERGTRQGCPLSPTLFALFIEPLAQAIREEPEIK 499          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Zebrafish
Match: BX649434.3 (pep chromosome:GRCz11:3:18270043:18274691:-1 gene:ENSDARG00000105157.2 transcript:ENSDART00000161140.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX649434.3)

HSP 1 Score: 96.2857 bits (238), Expect = 5.081e-19
Identity = 78/265 (29.43%), Postives = 130/265 (49.06%), Query Frame = 1
Query: 4567 QFTQSDPGEFVF--NIQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPE-GKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCC-GPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSS 5349
            +  Q D    VF  N+ +    SN  LEL           + + +  SPG D I     ++L P  G+ L    N  ++ G +P + R     L+PK    G  +   +WRP++LL + YKI +  L+ RL++ +  ++++ P Q          ++  ++  ALE +K        I+ LD + AF  V H YL  +L   G   HF+ ++ ++Y+N  S     G L+ P  D++RGVRQGC +S +L+++AI P+L  I + 
Sbjct:  452 EVLQRDVENSVFFENLSKVDDESNAMLELPISQDELYTVMMSMENGKSPGIDGIPVDFYKVLWPVIGEDLFLVLNDSLNKGSLPLSCRRAVITLLPK---KGDLQKIGNWRPVSLLCSDYKILSRALAVRLSKVL--DQIIQPNQNYSIPNRSIFDNIFLIRDALEVAKLFGINCGLIS-LDQEKAFDRVEHEYLKKVLKVFGFSTHFIKMIEVMYSNIQSVLKINGGLSAP-FDVQRGVRQGCAMSGMLYSIAIEPLLHRIRAE 709          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Zebrafish
Match: CABZ01033394.1 (pep chromosome:GRCz11:13:6032844:6038477:-1 gene:ENSDARG00000101795.2 transcript:ENSDART00000166130.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CABZ01033394.1)

HSP 1 Score: 93.9745 bits (232), Expect = 2.180e-18
Identity = 125/501 (24.95%), Postives = 220/501 (43.91%), Query Frame = 1
Query: 4492 RKKVVTKIINEINIETRPNMENILDQFTQSDPGEFVFNI-QEFCIRSNKPLELCKISARDVIFELK-LASASSPGDDNITYRDLRLL-DPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCC-GPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGL--EFRPEKCAYLTTSNSTDXXXXXXXXXX----XXXXXDREFYQYLGVPVGESP-NQTPYD-TLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRT----REIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDL 5946
            R++ V   ++    E + + EN+   F ++ P     NI QE+  +  +PL     S  +++  LK +    +PG D +     ++  D  G+      N+ +  G +P + R     L+PK    G  ++  +WRP++LL + +KI +  L+ RL   I    ++HP Q         +++ +++   L+ S Y     L +  +D + AF  V H YLW+ L   G    F+ ++++LY +  S     G L+ P   I RG+RQGC +S +L++LAI P+L  + S  I  + + N  +Q    A         A D+   +N   ++ + + +  EF     A +    S  L N   NN         +  +   +YLGV +GES   Q  +D  +EK+     KLK      W+       + H +L+F  R       I  +   R    +  +  L      ++  F++  H  P++ +Y P+  GG   V L
Sbjct:  434 RRRAVQFYVDLYRSEYKED-ENLSASFYENLP-----NISQEYKEKIERPL-----SELELLTALKAMQPGKAPGIDGLPIEFYKIFWDVLGEDFLAVLNESLTEGSLPLSCRRAVVTLLPK---KGDLQEIKNWRPVSLLCSDFKILSKTLANRLKNVI--GHIIHPDQTYCVPGRSIMDNISLVRDILDLSSYFN-LDLGLISIDQEKAFDRVEHQYLWNTLEAFGFGSGFIGMIKVLYQDIESVLKINGGLSAP-FKINRGIRQGCALSGMLYSLAIEPMLRKLRSR-IEGFMLANKSVQHQLSAY--------ADDVIVFVNGQKDVNTLVSIISEFDKLSAARVNWGKSDALINGKWNNGTPILPGGLIWKKGGLKYLGVYLGESTMQQKNWDGIIEKI---EGKLKK-----WR-------WIHPQLSFRGRVLIINNLIASTLWHRLFCTEPPSGLLLFLQAKLVNFFWDRMHWVPQSELYLPLEEGGQGLVSL 892          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Zebrafish
Match: si:dkey-187k19.2 (si:dkey-187k19.2 [Source:ZFIN;Acc:ZDB-GENE-131121-68])

HSP 1 Score: 92.8189 bits (229), Expect = 2.226e-18
Identity = 84/282 (29.79%), Postives = 134/282 (47.52%), Query Frame = 1
Query: 4702 SPGDDNITYR-DLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPV-LVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALIN--NSVEIASRIGLEFRPEKCAYLTTSNS 5535
            SPG D ++    L   +     L   + + I+  ++P + +     L+PKP K     D  +WRPI LL+  YK+ + I ++RL E +  N+++   Q G         +  ++   L++S   +   L + +LD   AF ++ H +++  L Y G    F+SI+ L + + NS     P T+ +  I RGVRQGCPI+  LF L +  + +  + SS I    I N +I+I   ADD  L       L A +   N   IAS  GL     KC  ++  NS
Sbjct:  187 SPGSDGLSVEFYLHFWNIIENPLFEMYKESIELKELPTSLKQGLITLLPKPNKDLMILD--NWRPITLLNVDYKLLSLIYAKRLKEGL--NEIISEYQTGFMAGRHISWNIRLILDLLDYSNLIESEALLL-FLDFHKAFDTIEHQFMFMALKYFGFGDRFISIMELFHRDINSSINLYPNTSKRFPICRGVRQGCPIAPFLFLLVVEFLSIYVLKSSAIQGITIFNKEIRISQLADDTVLFLKDKDQLPAALQLVNDFSIAS--GLTLNVTKCEIISLYNS 461          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Zebrafish
Match: BX323564.1 (pep chromosome:GRCz11:2:10597810:10600950:-1 gene:ENSDARG00000098557.2 transcript:ENSDART00000160216.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX323564.1)

HSP 1 Score: 92.0485 bits (227), Expect = 8.095e-18
Identity = 88/303 (29.04%), Postives = 142/303 (46.86%), Query Frame = 1
Query: 4648 CKISARDVIFELKLASAS---SPGDDNITYR-DLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPV-LVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALIN--NSVEIASRIGLEFRPEKCAYLTTSNS 5535
            C+ +   +  E+ L S     SPG D ++    L   +     L   + + I+  ++P + +     L+PKP K     D  +WRPI LL+  YK+ + I ++RL E +  N+++   Q G         +  ++   L++S   +   L + +LD   AF ++ H +++  L Y G    F+SI+ L + + NS     P T+ +  I RGVRQGCPI+  LF L +  + +  + SS I    I N +I+I   ADD  L       L A +   N   IAS  GL     KC  ++  NS
Sbjct:  166 CEENLNKLEMEIALRSMKQNKSPGSDGLSVEFYLHFWNIIENPLFEMYKESIELKELPTSLKQGLITLLPKPNKDLMILD--NWRPITLLNVDYKLLSLIYAKRLKEGL--NEIISEYQTGFMAGRHISWNIRLILDLLDYSNLIESEALLL-FLDFHKAFDTIEHQFMFMALKYFGFGDRFISIMELFHRDINSSINLYPNTSKRFPICRGVRQGCPIAPFLFLLVVEFLSIYVLKSSAIQGITIFNKEIRISQLADDTVLFLKDKDQLPAALQLVNDFSIAS--GLTLNVTKCEIISLYNS 461          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Xenopus
Match: ENSXETT00000032317.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:1:130520927:130524992:-1 gene:ENSXETG00000008594.1 transcript:ENSXETT00000032317.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 118.242 bits (295), Expect = 1.224e-25
Identity = 80/239 (33.47%), Postives = 122/239 (51.05%), Query Frame = 1
Query: 4642 ELCKISARDVI--FELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSP 5352
            EL  +  ++ +  F L  A+ +      I  R  +LL P   +L + F + +  GQ P++      +L+PK GK        S+RPI+LL+   KIF  +L+RRL+  IT  K++HP Q G    +    +   L   L H+   + A  AIA LDI  AF +V   YLW +L + G    ++ +++LLY    +      LTTP   + RG RQGCP+S +LF +AI P   A+ + P
Sbjct:  448 ELTAVEIQEAVNAFPLGKAAGADGLPIEIYKRHSKLLTP---MLLKLFKEAVQLGQFPESLYEAAIVLLPKQGKDPHL--CESFRPISLLTADVKIFAKVLARRLSRVIT--KIIHPDQIGFIPAKTAALNTRRLYLNLAHASAGQGAK-AIAALDITKAFDTVEWPYLWQVLTHFGFGKKYIQMVQLLYKYPVASIRINSLTTPAFALSRGTRQGCPLSPLLFAIAIEPFAQAVRAHP 678          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Xenopus
Match: ENSXETT00000040456.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:8:87387561:87390644:-1 gene:ENSXETG00000021299.1 transcript:ENSXETT00000040456.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 117.857 bits (294), Expect = 1.349e-25
Identity = 80/239 (33.47%), Postives = 122/239 (51.05%), Query Frame = 1
Query: 4642 ELCKISARDVI--FELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSP 5352
            EL  +  ++ +  F L  A+ +      I  R  +LL P   +L + F + +  GQ P++      +L+PK GK        S+RPI+LL+   KIF  +L+RRL+  IT  K++HP Q G    +    +   L   L H+   + A  AIA LDI  AF +V   YLW +L + G    ++ +++LLY    +      LTTP   + RG RQGCP+S +LF +AI P   A+ + P
Sbjct:   78 ELTAVEIQEAVNAFPLGKAAGADGLPIEIYKRHSKLLTP---MLLKLFKEAVQLGQFPESLYEAAIVLLPKQGKDPHL--CESFRPISLLTADVKIFAKVLARRLSRVIT--KIIHPDQIGFIPAKTAALNTRRLYLNLAHASAGQGAK-AIAALDITKAFDTVEWPYLWQVLTHFGFGKKYIQMVQLLYKYPVASIRINSLTTPAFALSRGTRQGCPLSPLLFAIAIEPFAQAVRAHP 308          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Xenopus
Match: efcab14 (EF-hand calcium binding domain 14 [Source:Xenbase;Acc:XB-GENE-5791088])

HSP 1 Score: 102.834 bits (255), Expect = 6.456e-22
Identity = 77/227 (33.92%), Postives = 115/227 (50.66%), Query Frame = 1
Query: 4687 LASASSPGDDNIT---YRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGC-VEHNAILT-AALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSP 5352
            L S  +PG D +    Y+ L    P  KLL        DN  +P ++     ++IPKPGK  S    SS+RPI+L++T  KI   IL+ RL   + D  L+HP Q G        +    + T   + H +   +    +A LD   AF S+   YLW++L+  G    F+  ++LLY +  +      +T+P   ++RG RQGCP+S ILF LAI P+ + I +SP
Sbjct:  116 LPSNKTPGPDGLPSNWYKVLAEYIP-SKLL-ETLQYAYDNQALPASFAEALIVVIPKPGKDPSL--CSSYRPISLINTDAKILAKILATRLQRVVPD--LVHPDQSGFMPGRSTDINLRRLFTNLQIPHLETDTRV---VASLDSAKAFDSIEWGYLWEVLSGFGFGQTFLKWIKLLYQHPTARVRVNGITSPPFSLERGTRQGCPLSPILFALAIEPLAILIRNSP 333          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Xenopus
Match: ENSXETT00000045854.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:1:73083463:73084749:1 gene:ENSXETG00000020482.1 transcript:ENSXETT00000045854.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 102.064 bits (253), Expect = 7.413e-22
Identity = 66/189 (34.92%), Postives = 101/189 (53.44%), Query Frame = 1
Query: 4792 DNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGC-VEHNAILT-AALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSP 5352
            DN  +P ++     ++IPKPGK  S    SS+RPI+L++T  KI   IL+ RL   + D  L+HP Q G        +    + T   + H +   +    +A LD   AF S+   YLW++L+  G    F+  ++LLY +  +      +T+P   ++RG RQGCP+S ILF LAI P+ + I +SP
Sbjct:  202 DNQALPASFAEALIVVIPKPGKDPSL--CSSYRPISLINTDAKILAKILATRLQRVVPD--LVHPDQSGFMPGRSTDINLRRLFTNLQIPHLETDTRV---VASLDSAKAFDSIEWGYLWEVLSGFGFGQTFLKWIKLLYQHPTARVRVNGITSPPFSLERGTRQGCPLSPILFALAIEPLAILIRNSP 383          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Xenopus
Match: ENSXETT00000002428.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:3:15907829:15913358:1 gene:ENSXETG00000006437.1 transcript:ENSXETT00000002428.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 102.834 bits (255), Expect = 6.864e-21
Identity = 69/200 (34.50%), Postives = 104/200 (52.00%), Query Frame = 1
Query: 4762 LLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAK 5361
            LL + + + I  G +P++      +L+PKPGK    +   S+RPI+LL+T  KI+  IL+RRL   I    L++P Q G    +    +   L   L  +  S     AIA LDI  AF +V   YLW +L  +G    F+++++LLY +  +      L T    + RG RQGCP+S +LF LAI P   A+   P+ +
Sbjct:  487 LLTKMYGEAISTGILPESMYEAAIILLPKPGKDP--QLCESFRPISLLTTDVKIYAKILARRLARVI--KTLVNPDQIGFIPTKTTALNTRRLYLNLSLTP-SNTGNRAIAALDIAKAFDTVEWSYLWCVLKQLGFGPTFINMVQLLYKSPRATLRINSLCTSNFTLSRGTRQGCPLSPLLFALAIEPFAQAVRKHPLIQ 681          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. UniProt/SwissProt
Match: sp|Q03278|PO21_NASVI (Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) OS=Nasonia vitripennis OX=7425 PE=4 SV=2)

HSP 1 Score: 112.079 bits (279), Expect = 5.079e-23
Identity = 116/430 (26.98%), Postives = 190/430 (44.19%), Query Frame = 1
Query: 4666 DVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPK-PGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQ--------YLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGG 5928
            D I E++    ++ G D +T      +D   + +   FN I+ +GQ P  +   +T+LIPK PG      D + +RP+++ S   + F  IL+ R+ E    + LL   Q+   V +G  E+ ++L+A ++ ++   K  L IA LD+K AF SV H  + D L    +     + +  +Y N+ +           I   RGVRQG P+S +LFN  ++ VL  +  +    + +G  +I  L +ADD+ L+A +   LQA ++         GLE  P KC  L    S     I +   K   + ++E  Q        YLGV      +  P      +  D  ++  + L P Q++     F   R              +GR S  D    +    +R  + G+  LPH+ P  Y ++PI  GG
Sbjct:  322 DEIKEVEACKRTAAGPDGMTTTAWNSID---ECIKSLFNMIMYHGQCPRRYLDSRTVLIPKEPGTM----DPACFRPLSIASVALRHFHRILANRIGE----HGLLDTRQRAFIVADGVAENTSLLSAMIKEARMKIKG-LYIAILDVKKAFDSVEHRSILDALRRKKLPLEMRNYIMWVYRNSKTRLEVVKTKGRWIRPARGVRQGDPLSPLLFNCVMDAVLRRLPEN--TGFLMGAEKIGALVFADDLVLLAETREGLQASLSRIEAGLQEQGLEMMPRKCHTLALVPSGKEKKIKVETHKPFTVGNQEITQLGHADQWKYLGVVYN---SYGPIQVKINIAGDLQRVTAAPLKPQQRMAILGMFLIPRFIHKL--------VLGRTSNADVRKGD--KIIRKTVRGWLRLPHDTPIGYFHAPIKEGG 724          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. UniProt/SwissProt
Match: sp|P16423|POLR_DROME (Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 109.383 bits (272), Expect = 3.607e-22
Identity = 109/428 (25.47%), Postives = 193/428 (45.09%), Query Frame = 1
Query: 4684 KLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDR-------EFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDL 5946
            +++ +SSPG D IT +  R + P G +L R  N I+  G +P + R  +T+ IPK   +   +D   +RPI++ S   +   +IL+ RL   I       P Q+G    +GC ++  I+   L HS    ++   IA LD+  AF S+ H  ++D L   G    FV  ++  Y    +       ++ +    RGV+QG P+S ILFNL ++ +L  + S   A  K+GN+     A+ADD+ L A +   LQ L++ +++  S +GL+   +KC  +           +L                  + ++YLG+    +  +   +  E +     +L  + L P Q++ A +T    +L        + I  +          +     +R  +  + NLP + P A++++P  +GG     L
Sbjct:  353 RVSLSSSPGPDGITPKSAREV-PSGIML-RIMNLILWCGNLPHSIRLARTVFIPKTVTAKRPQD---FRPISVPSVLVRQLNAILATRLNSSIN----WDPRQRGFLPTDGCADNATIVDLVLRHSHKHFRS-CYIANLDVSKAFDSLSHASIYDTLRAYGAPKGFVDYVQNTYEGGGTSLNGDGWSSEEFVPARGVKQGDPLSPILFNLVMDRLLRTLPSEIGA--KVGNAITNAAAFADDLVLFAETRMGLQVLLDKTLDFLSIVGLKLNADKCFTVGIKGQPKQKCTVLEAQSFYVGSSEIPSLKRTDEWKYLGINFTAT-GRVRCNPAEDIGPKLQRLTKAPLKPQQRLFALRTVLIPQLYHKLALGSVAIGVL----------RKTDKLIRYYVRRWLNLPLDVPIAFVHAPPKSGGLGIPSL 757          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. UniProt/SwissProt
Match: sp|Q03274|PO22_POPJA (Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) OS=Popillia japonica OX=7064 PE=4 SV=1)

HSP 1 Score: 100.523 bits (249), Expect = 1.217e-19
Identity = 69/242 (28.51%), Postives = 121/242 (50.00%), Query Frame = 1
Query: 4654 ISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLT-TPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGN 5376
            I+  ++   +K    S+PG D +T + +       +L   F    +  G +P  W + +T LIPK    G  E+ S+WRPI + S   ++   IL++RL   +     LHP QKG +  +G +  N++L      S+  ++    +  LD++ AF +V H  +   L  +G+D    + +    +++ +    GP + T KI I+RGV+QG P+S  LFN  ++ +L ++ S+P     IG 
Sbjct:    6 IAREEIQCAIKGWKPSAPGSDGLTVQAI----TRTRLPRNFVQLHLLRGHVPTPWTAMRTTLIPK---DGDLENPSNWRPITIASALQRLLHRILAKRLEAAVE----LHPAQKGYARIDGTLV-NSLLLDTYISSRREQRKTYNVVSLDVRKAFDTVSHSSICRALQRLGIDEGTSNYITGSLSDSTTTIRVGPGSQTRKICIRRGVKQGDPLSPFLFNAVLDELLCSLQSTPGIGGTIGE 235          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. UniProt/SwissProt
Match: sp|P14381|YTX2_XENLA (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 87.8113 bits (216), Expect = 1.303e-15
Identity = 68/218 (31.19%), Postives = 95/218 (43.58%), Query Frame = 1
Query: 4702 SPGDDNITYRDLRLL-DPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQK----GGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAI 5340
            SPG D +T    +   D  G    R   +    G++P + R     L+PK    G      +WRP++LLST YKI    +S RL   + +  ++HP Q     G ++F+     N  L   L H        LA   LD + AF  V H YL   L        FV  L+ +Y +           T  +   RGVRQGCP+S  L++LAI P L  +
Sbjct:  466 SPGLDGLTIEFFQFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSLLPK---KGDLRLIKNWRPVSLLSTDYKIVAKAISLRLKSVLAE--VIHPDQSYTVPGRTIFD-----NVFLIRDLLHFARRTGLSLAFLSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKINWSLTAPLAFGRGVRQGCPLSGQLYSLAIEPFLCLL 673          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. UniProt/SwissProt
Match: sp|O00370|LORF2_HUMAN (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 85.5001 bits (210), Expect = 7.025e-15
Identity = 60/226 (26.55%), Postives = 104/226 (46.02%), Query Frame = 1
Query: 4687 LASASSPGDDNITYRDLRLLDPE-GKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLH-------PGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAI 5340
            L +  SPG D  T    +    E    L + F  I   G +P+++     +LIPKPG+  + ++  ++RPI+L++   KI   IL+ R+ + I   KL+H       PG +G       +         ++H   +K     I  +D + AF  +   ++   L  +G+DG ++ I+R +Y    +             +K G RQGCP+S +LFN+ +  +  AI
Sbjct:  465 LPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKE--NFRPISLMNIDAKILNKILANRIQQHI--KKLIHHDQVGFIPGMQGWFNIRKSIN-------VIQHINRAKDKNHVIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAI 679          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. TrEMBL
Match: A0A159X4Q6 (ORF2-like protein (Fragment) OS=Schmidtea mediterranea OX=79327 PE=2 SV=1)

HSP 1 Score: 457.603 bits (1176), Expect = 1.799e-142
Identity = 233/319 (73.04%), Postives = 271/319 (84.95%), Query Frame = 1
Query: 4138 ICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQNTDNETRKIIPSGQTLIQRRHISNCLQVQIVNCNVNEAYXXXXXXXXXXXXXXXFKPYLGDNFFSPLNNTTKMIKMKGGYFHNRKKVVTKIINEINIETRPNMENILDQFTQSDPGEFVFNIQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIA 5094
            I SYI   LTDRS FKMDLT VGNKL+SILLPP QNTD    K+    +T+ +RR I N L  +I  CN NEAY+KII++ NK+NIKP  KPYLG NFF+ LN+  KMIK+KGGYFHNRKKV T+I++EINI+TRP+MENIL+QFTQ DP EFVF+I ++CI+S + +EL +I+A++VIFE K A+ASSPGDDNITYRDLRLLDPEGKLLAR FN IID+GQIP++W SFKT+LIPKPGKSG Y+DTSSWRPIALLST YKIFTSILSRRLT+WI +NKLLHPGQKGGS FEGCVEHNA+LTA LEHSKYSKKAPLAIA
Sbjct:    1 IXSYIXXXLTDRSXFKMDLTYVGNKLESILLPPKQNTDQNILKLPLICKTINERRKICNILLAEIATCNANEAYSKIISAANKSNIKPEHKPYLGKNFFNSLNSDNKMIKIKGGYFHNRKKVATRILDEINIDTRPSMENILNQFTQMDPDEFVFSISDYCIKSKEIIELDQITAKEVIFEYKQANASSPGDDNITYRDLRLLDPEGKLLARLFNIIIDSGQIPNDWLSFKTILIPKPGKSGDYDDTSSWRPIALLSTSYKIFTSILSRRLTKWIIENKLLHPGQKGGSRFEGCVEHNAVLTAVLEHSKYSKKAPLAIA 319          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. TrEMBL
Match: Q7YXU7 (ORF2 (Fragment) OS=Girardia tigrina OX=6162 PE=4 SV=1)

HSP 1 Score: 291.197 bits (744), Expect = 2.193e-76
Identity = 269/891 (30.19%), Postives = 425/891 (47.70%), Query Frame = 1
Query: 4495 KKVVTKIINEINIETRPNMENILDQF--TQSDPGEFVFNIQE------FCIRSNKPLELCKISARDVIFE-LKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSK--------YSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSP-IAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDR-EFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAI--GRPSAKDSHNKNLSTQLRSIIY---------GFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIR-ISDPSISQCLEWINGKSFNK----NSLSRKTWWIRFRNAIQHLRNTRGI-------------TISLEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKC-DQVETMSHVLQSCKSNGMLINERHNSCLNKIYNSIKSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIW 7020
            K  + ++++     T    EN +  F  + S P E +F  QE      F +      E  +I  RD I + +K AS S+ G D ITY D++L DP G++L   F  I+ N   P   ++ +T++IPKPGKS  Y D SSWRPI + S  Y++    L+  L  WI  N++L   QK    FEGC +HNA+L   ++  +         +K   L I +LD  +AFGSVP   L  +    G+    +++++ LY +  +   CG      + + +GV+QGCP+SM+LFN+ IN ++ AI + P +  Y +G+  I+ILAYADDIALI++S  DLQ ++  +  I   +GL F P KCA +   +       IL N ++ K   + + Y+YLG            + L+ ++ +T  +  S+L+P QK+ AY+TF HS+L F  R   I  S     R + K ++N N S +     Y           F LP    K + Y     GG       DEY   SI+  FRLL  + P  ++ I+  L  ++   ++   + + SQ +   N    ++    + LSR T W R + A + L++T  I             T+SLE +    L I ++ +G       D KK+   L   ++  + + L+    +  + +   +  ++NK I    +    W FIHRAR   L     P     +S  CRKC  + ETMSH L +C     LINERH++    +   + S  +   + QK  +   +  R D+ +  +      LV++KCP DT+ +F+   +   DKY+ +   LE   P  +V L T I G+LGS        LR++G   ++   +     + NI  S   W
Sbjct:  221 KLAIREVLHRKTTATSSPSENAIKAFFSSYSRPAE-LFTGQELLESSWFPVHPEDDFEF-RIPGRDQIAKYIKFASKSAAGLDWITYEDIKLGDPSGEILQPIFEYIVQNNICPSEGKASRTIMIPKPGKS-DYSDPSSWRPITITSAVYRLLMKYLTWELYNWILLNQMLSRSQKSLGKFEGCHDHNAMLNMLIQDVRRQTNPSNPINKNKRLYIVFLDFTNAFGSVPLDTLMYVPQRFGLGTSALTLIKNLYLDNYTNVTCGESKIENVKLNKGVKQGCPLSMLLFNIFINIIIRAIEAMPDVHGYPLGDMDIRILAYADDIALISDSHKDLQEMVYKAEYIGRILGLLFNPSKCALMDIPHDKKRTPPILVNGEMIKCVGKADPYKYLGTFRSWFRKLDIKELLQMMMDETKLITESNLHPHQKIHAYETFIHSQLPFHLRHSRIPFSDFITNRKTNKTTNNSNDSEKSIQKAYDPESGQLFLNTFALPSGCAKDFFYITKDAGGPQLTSGLDEYLIQSIMYIFRLLGSEDPTLNSAIKHDL--ISHLNLKGFVNINFSQAISIFNSNFTDRTDHFSHLSR-TEWARLQLARKKLKSTLAIQTNVCLINGHLVLTLSLENN---VLLIDSKEKG-------DVKKIHASLMGFLRLAHLIRLQKHGWSKLLFSATTHHEILNKRILNGHVPYKIWYFIHRARLGLL-----PTKLFSVSNLCRKCGGKKETMSHALVNCPMMQTLINERHDALEISLVQILSSKFQGTVIRQKTYV---NELRPDITMESDTQYY--LVEVKCPFDTKMSFELRTQQTTDKYNIIIEILEDVHPGKEVRLVTFIVGTLGSWGPQNSDFLRDLGFSKDEIDQVKTRLMLQNINSSCEQW 1085          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. TrEMBL
Match: A0A3L8D8E5 (Reverse transcriptase domain-containing protein OS=Ooceraea biroi OX=2015173 GN=DMN91_010823 PE=4 SV=1)

HSP 1 Score: 289.271 bits (739), Expect = 6.714e-75
Identity = 244/839 (29.08%), Postives = 387/839 (46.13%), Query Frame = 1
Query: 4639 LELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTN-TNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEI-ASRIGLEFRPEKCA--YLTTS--NSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWING------KSFNKNSLSRKTWWIRFRNAIQHLRNTRGITISLEYDGFFSLRITTEHRGT-TIIYSLDRKKLACFLHKL---IQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLN-LVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCDQ-VETMSHVLQSCKSNGMLINERHNSCLNKIYNSIKSSEKIITLDQKCELVPG--DGKRVDLLIRDNKNMTIKLVDIKCPLDTEF-NFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTGNDIDPKYLRHVNN*NPYI 7092
            L +   +  +V   L+  + S+PG D I Y DLR  DP  +LL   FN       +P +W++  T+L+ K G   S E   SWRP+AL  T  K+F ++ + RLT+W+  N  L   QKG    EGC EHN +L   L  ++ +++  + +AWLD+ +AFGSVPH  +   L   GV    ++I   +Y   T         T P I I+ GVRQGCP+S I+F+L I+ V+ A +      Y I      ILAYADD+ALIA +   ++ L+  +VE+ A R+GL F P KCA  ++TT            +    +  L + E Y++LGVP G S +QTP  T+E +L D   +  S L PWQ+V+    F   RL F  R   ++   +    A D H       +R I+  + NLP  A    +Y P   GG   + L D    L+I  AFRLLT    + S +   SLR V + RI  + PS  +   +++G      +   + SL  +      R A +     R   ++ E        +  E RG    I  +     A  +H+L   + E Y   L        +       P+ N  +          W F+HRAR + L +N   +    +  +CR+C +  ET+ HV+  C  +   I  RH++ L+++  + +   + + ++Q+ E V G  +G R DL++R   + ++ + D+    +     F+ +    + +Y  L   L      Y+V ++  + G+LGS     D +LR + V  +    +        I +S  I+  H +G    P   R   +  P +
Sbjct:  641 LLMAPFTEGEVDRRLRRMTNSAPGPDGIAYNDLRAADPGARLLTALFNACYRLEAVPPSWKTSNTVLVYKKGDRDSLE---SWRPLALGDTMPKLFAAVAADRLTDWVIANDKLCRAQKGFLRDEGCYEHNFVLQEILTDARRTRRHAV-VAWLDLSNAFGSVPHAAIRSALVRAGVPSGLINIWGSMYDGCTTRVRAVDGFTAP-IPIRSGVRQGCPLSPIIFDLVIDSVVRAAAELADVGYDILGQTFNILAYADDLALIARTPEGMRQLL-TAVELEAGRVGLHFNPAKCASLHITTGRIGRVLPTVFEIEGRPMPTLAEGEPYRHLGVPTGFSVDQTPLTTIEGVLKDIWAVDASLLAPWQRVEVLAAFILPRLDFLLRGAAVEKRPL---RAVDLH-------VRRIVKSWLNLPQRASAEVVYLPPSRGGCGLLPLSDLADVLTIAHAFRLLTASDAVVSGLAWGSLRGVVARRIGHA-PSDEEIASFLSGSLEGRLRGGGEASLWSRARNAARRQANRAAVRWRWSEVTGE--------MIVECRGPGERIVRVPGSARAQVIHRLRSAVMEYYAGTLLRKPDQGKVFEVSSRVPVSNHFMRGGSFTRFADWRFVHRARLDVLPLNGA-RRWGTVDERCRRCGRTAETLPHVIGHCGVHAAAIQLRHDAVLHRLRRACRLPGE-VRVNQRVEGVDGELEGLRPDLVVRHEPSKSVVICDVTVAFENRLVAFEEARGRKVARYTPLAEALRAQ--GYRVVVTALVVGALGSWCPRNDAVLRLLRVGSKYGSMMRRLIVSDTIRWSRDIYVEHVSGVRQYPAPPRPSGDGGPLV 1450          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. TrEMBL
Match: A0A3L8DAI7 (Reverse transcriptase domain-containing protein OS=Ooceraea biroi OX=2015173 GN=DMN91_010948 PE=4 SV=1)

HSP 1 Score: 287.73 bits (735), Expect = 5.103e-74
Identity = 231/778 (29.69%), Postives = 366/778 (47.04%), Query Frame = 1
Query: 4666 DVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTN-TNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEI-ASRIGLEFRPEKCA--YLTTS--NSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWING------KSFNKNSLSRKTWWIRFRNAIQHLRNTRGITISLEYDGFFSLRITTEHRGT-TIIYSLDRKKLACFLHKL---IQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLN-LVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCDQ-VETMSHVLQSCKSNGMLINERHNSCLNKIYNSIKSSEKIITLDQKCELVPG--DGKRVDLLIRDNKNMTIKLVDIKCPLDTEF-NFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILR 6936
            +V   L+  + S+PG D I Y DLR  DP  +LL   FN       +P +W++  T+L+ K G   S E   SWRP+AL  T  K+F ++ + RLT+W+  N  L   QKG    EGC EHN +L   L  ++ +++  + +AWLD+ +AFGSVPH  +   L   GV    ++I   +Y   T         T P I I+ GVRQGCP+S I+F+L I+ V+ A +      Y I      ILAYADD+ALIA +   ++ L+  +VE+ A R+GL F P KCA  ++TT            +    +  L + E Y++LGVP G S +QTP  T+E +L D   +  S L PWQ+V+    F   RL F  R   ++   +          + +   +R I+  + NLP  A    +Y P   GG   + L D    L+I  AFRLLT    + S +   SLR V + RI  + PS  +   +++G      +   + SL  +      R A +     R   ++ E        +  E RG    I  +     A  +H+L   + E Y   L        +       P+ N  +          W F+HRAR + L +N   +    +  +CR+C +  ET+ HV+  C  +   I  RH++ L+++  + +   + + ++Q+ E V G  +G R DL++R   + ++ + D+    +     F+ +    + +Y  L   L      Y+V ++  + G+LGS     D +LR
Sbjct:  875 EVDRRLRRMTNSAPGPDGIAYNDLRAADPGARLLTALFNACYRLEAVPPSWKTSNTVLVYKKGDRDSLE---SWRPLALGDTMPKLFAAVAADRLTDWVIANDKLCRAQKGFLRDEGCYEHNFVLQEILTDARRTRRHAV-VAWLDLSNAFGSVPHAAIRSALVRAGVPSGLINIWGSMYDGCTTRVRAVDGFTAP-IPIRSGVRQGCPLSPIIFDLVIDSVVRAAAELTDVGYDILGQTFNILAYADDLALIARTPEGMRQLL-TAVELEAGRVGLHFNPAKCASLHITTGRIGRVLPTVFEIEGRPMPTLAEGEPYRHLGVPTGFSVDQTPLTTIEGVLKDIWAVDASLLAPWQRVEVLAAFILPRLDFLLRGAAVEKRPL----------RAVDLHVRRIVKSWLNLPQRASAEVVYLPPSRGGCGLLPLSDLADVLTIAHAFRLLTASDAVVSGLAWGSLRGVVARRIGHA-PSDEEIASFLSGSLEGRLRGGGEASLWSRARNAARRQANRAAVRWRWSEVTGE--------MIVECRGPGERIVRVPGSARAQVIHRLRSAVMEYYAGTLLRKPDQGKVFEVSSRVPVSNHFMRGGSFTRFADWRFVHRARLDVLPLNGA-RRWGTVDERCRRCGRTAETLPHVIGHCGVHAAAIQLRHDAVLHRLRRACRLPGE-VRVNQRVEGVDGELEGLRPDLVVRHEPSKSVVICDVTVAFENRLVAFEEARGRKVARYTPLAEALRAQ--GYRVVVTALVVGALGSWCPRNDAVLR 1623          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. TrEMBL
Match: W4Y7A7 (Uncharacterized protein OS=Strongylocentrotus purpuratus OX=7668 PE=4 SV=1)

HSP 1 Score: 287.73 bits (735), Expect = 6.281e-74
Identity = 249/886 (28.10%), Postives = 412/886 (46.50%), Query Frame = 1
Query: 4480 YFHNRKKVVTKIINEINIETRPNMENILDQFTQ--SDP----------GEFVFNIQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAK---YKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTS----NSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGK-----SFNKNSLSRKTWWIRFRNAIQHLRNTRGITISLEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMD-LRNGKINNFIANTYVNCPLINKAIFKSK---LNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCDQVETMSHVLQSCKSNGMLINERHNSCLNKIYNSIKSSEKIITLDQKCELVPGDG---KRVDLLIRDNKNMTIKLVDIKCPLDTEFNF-QNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTGN 7041
            Y  ++K+ V  I+ +       + E +LD F +  + P           E +F   E    S   L +  IS +++   L   S S+PG D + YR +R  D   ++    FN+ +   +IP  W+   T+LI    KSG+ +D +++RPIAL S  YK+F  ILS R+T+W  ++ LL P QK     EGC EH  +L++ ++ +K ++K    IAWLD+++AFGS+PH  +  +L  IG     V +L+  YT  ++ +      T  I I+ GV+QGCP+S ILFNL I  ++ A+            +   ++ I+AYADD+ L++ +   L A+++ + E A  + L F+P KCA L+ S     S       +    +  L + E Y+YLGVP G        D + KL  +   + +S L PWQK+DA KTF    L+F  R  +   S +          ++L + + + +     LP  A  AYI++   +GG AF+D + +     I QA R+L+    +   I    L+ V    I  + P+      +++G      + + NS    + W R R+A + L  T   T S       + +   +H    +  S+ R        +LIQ T + + L++      +A +  N P  N + + +    +    W FIHRAR N L  N   +   + +   +   Q ET+ HVL  C  N + I  RH++   ++  +I+  +  +      + VPGD    +R D+ + +   +T+  +DI  P D   N    + +  ++KY  L+  L        V +   I G+LG+     +  L  +GV    R  +   C I  I+ S  IW  H TG+
Sbjct:  941 YKRSKKRAVRHILRDDAPSFSGSNEQLLDYFKEIYAPPEIDENRAQQLAESLFTDLEEAKESAAAL-MSPISQQEISTRLSRMSNSAPGKDRLEYRHIRQADGACRVTHIMFNRCLQEHRIPSAWKEATTILI---HKSGTTDDPANFRPIALQSCLYKLFMGILSDRMTQWACNHNLLSPEQKSARPCEGCHEHTFLLSSVIKDTKRNQKT-ANIAWLDLRNAFGSIPHQAIHAVLTTIGAPVSLVMLLKDTYTGASTSFLSTSGETDPIQIQSGVKQGCPMSAILFNLTIELIIRAVKKKATDDGLGLVVHGQRLSIMAYADDLVLMSKTPEGLDAILSVASEQAETLRLAFKPTKCASLSLSCRHGTSVLPREYTVQGHLMPALDEEEQYRYLGVPFGLPRFTNLKDLIGKLKGNIETIASSLLAPWQKLDAIKTFVQPGLSFVLRAADYLKSDL----------RSLKSAITTNVKKICQLPLRAANAYIFAAKESGGLAFIDPNVDADIQVITQAVRVLSSDDEVVQTIATSQLKSVVHRTIH-AVPTEEDIDNYLSGSNEGLLANSGNSGQASSLWSRTRSAARRLHLTLRATTSGTV--VVNQQADIDHTRDILPASITRGL------RLIQRTTNAEKLKSLPDQGKVARSLSNDPFANGSSWHATGKFIRFCDWRFIHRARLNCLPTNVATKRW-KANANGKNGHQQETLPHVLNHCLPNMVPIRRRHDNIQQRLVTAIRHGDVFVN-----QHVPGDPNPRERPDITVIEGNKVTV--IDISVPFDNGPNACTTAAQAKVEKYSALRQALRDM--GRDVEVHGFIVGALGTWHQGNERALGRLGVSRWYRTLMRKLCCIDAIQASRDIWVEHVTGH 1792          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Cavefish
Match: ENSAMXT00000034560.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02000712.1:195631:202616:-1 gene:ENSAMXG00000042811.1 transcript:ENSAMXT00000034560.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 143.28 bits (360), Expect = 1.528e-33
Identity = 120/449 (26.73%), Postives = 212/449 (47.22%), Query Frame = 1
Query: 4699 SSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKC-AYLTTSNST-----DXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAII 6027
            ++PG D IT + L   DP G+ L R +   +  G +P  ++  +T LIPK     + E+  +WRPI + S   ++F+ I++ RL      N    P Q+G     GC E+  +L   + H K  +++ LA+ ++D   AF S+ H ++   L    +D H + +++  Y +  +        +P I +K GV+QG  +S ILFNLA++P++ ++ +   + YKIG S +  LA+ADD+ L+++S   +++ I    E   R GL  +  KC  +L    +T     D +   L    +  ++  E  +YLGV V      T  D   +L     ++  + L P+QKV     F   RL +     ++K   +            L   +R  + G+ +LP +     +YS +  GG   + L     ++   + +RLL  +  I + I+
Sbjct:  442 TAPGPDRITRQMLSDWDPSGEKLERLYTAWMVAGVVPKAFKECRTTLIPKSTSKEALEEVGNWRPITIGSLILRLFSKIMAERLARACPIN----PRQRGFISAPGCAENLKVLQGLIGHCK-KERSQLAVVFVDFARAFDSISHEHILSALGQRQLDQHVIKLIQSAYVDCVTRIIMDGAKSPDILMKVGVKQGDSMSPILFNLAMDPLIQSLENLG-SGYKIGGSSVTTLAFADDLVLLSSSWDGMRSNIALLDEFCERTGLRVQSRKCHGFLIRRGATSYTVNDCSPWDLGGEPIHLIEPHEVEKYLGVKVNPWVGITKPDLNSQLGEWVTRIGGAPLKPYQKVGLLNDFAIPRLIYLADHCDLKGVTLA----------TLDGTIRRAVKGWLHLPLSTCGGLLYSRVQDGGLGILKLEALVPSIQARRLYRLLGSEDEIMNQIM 874          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Cavefish
Match: ENSAMXT00000034831.1 (pep primary_assembly:Astyanax_mexicanus-2.0:3:16211297:16215948:1 gene:ENSAMXG00000034228.1 transcript:ENSAMXT00000034831.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 108.612 bits (270), Expect = 6.881e-23
Identity = 64/217 (29.49%), Postives = 112/217 (51.61%), Query Frame = 1
Query: 4693 SASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPG-QKGG-SVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVA 5337
            + S+PG + + YR  +      K L +    +   G IP  W     +LIPK   S + +    +RPI+LL+   KIF S+++RRL  ++  N L+    QK G + F GC+EHN+++   ++ ++  KK  L + +LD+ +AFGSVPH  LW    +  V      +++  + +    +  G   T    ++ G+  GC IS++ F +A+  ++ A
Sbjct:  386 AGSAPGPNGVPYRSYKSAPDVLKFLWKLMAIVWKKGIIPKEWHRAGGVLIPKEKDSVAIDQ---FRPISLLNVEGKIFFSVVARRLATYLKRNNLIDTSVQKAGIAGFSGCIEHNSMIWHQIQTARAEKK-DLHVVFLDLANAFGSVPHALLWKAFNFFRVPEEISRLVKAYFEDIQFCFSVGECVTSWQRLEIGIMAGCTISLLAFTMAMEVIIRA 598          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Cavefish
Match: ENSAMXT00000052982.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02000409.1:21:5030:-1 gene:ENSAMXG00000036080.1 transcript:ENSAMXT00000052982.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 102.834 bits (255), Expect = 3.738e-21
Identity = 101/345 (29.28%), Postives = 158/345 (45.80%), Query Frame = 1
Query: 4762 LLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKG---GSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSP-IAKYKI-GNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTP----YDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFR 5769
            LL R FN  I N + PD+       +I K GK  +  D +S+RPIALL+   KI + +L+ RL   I+D  ++HP Q G   G    G    N  L     HS      P AI  LD + AF  V   Y++  L+  G    F+++++ LY +  S        +    ++RGVRQG P+S +LFNLA+ P+ + I + P I    I G   +  L   D +  I+N AT +  L++         G      KC ++  +N+ D N +      +      E + YLG+ + ++P  T      + L+KL  +  + K   L    K++A K     R  + F+
Sbjct:  497 LLLRMFNDSIANNRFPDSLYQANISVILKKGKIKT--DPASYRPIALLNVDQKIISKVLANRLAHHISD--IIHPDQTGFIPGRFSFG----NVRLLLNTIHSAQQGSVPAAILSLDAQKAFDQVEWPYMFYTLSKFGFGTPFINLVKALYLHPCSSILTNSNRSLPFPLQRGVRQGDPLSPLLFNLALEPLAIGIRNHPDIHGITINGLETLVNLYADDLLLSISNPATSVPKLLDYINLFGRLSGYTINWNKCEFMPLTNNFDQNFLSALPFNIT----NEHFTYLGLQISKNPKTTIKLNYENALDKLKKEIARWKLLPLSMIGKINAIKMIILPRYLYLFQ 829          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Cavefish
Match: ENSAMXT00000033573.1 (pep primary_assembly:Astyanax_mexicanus-2.0:3:31407009:31410218:-1 gene:ENSAMXG00000030324.1 transcript:ENSAMXT00000033573.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 101.679 bits (252), Expect = 8.124e-21
Identity = 68/215 (31.63%), Postives = 111/215 (51.63%), Query Frame = 1
Query: 4684 KLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPG-QKGG-SVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAIN 5322
            K  S+S+PG   + Y+  +      + L +    I   G++   WR  + + IPK   S   ++   +R I+LLS   KIF SIL+RRLT++++ N  +    QKGG     GC+EH  ++T  +  ++   K  LA+ WLD+ +A+GS+PH  +   L    V G    ++   Y   +     G +T+    ++ G+  GC IS+ILF LA+N
Sbjct:  268 KARSSSAPGPSGVPYKVYKNCPLLLERLWKILKVIWRRGKVARQWRFAEGVWIPKEEVS---KNIDQFRIISLLSVEGKIFFSILARRLTDFLSSNHYIDTSVQKGGIPGVPGCLEHTGVVTQLIREAR-ENKGDLAVLWLDLANAYGSIPHKLVQTTLERHHVPGQVAELIMNYYDQFSMRVSTGSITSEWHRVEVGIITGCTISVILFVLAMN 478          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Cavefish
Match: ENSAMXT00000029797.1 (pep primary_assembly:Astyanax_mexicanus-2.0:20:11908834:11912265:1 gene:ENSAMXG00000039541.1 transcript:ENSAMXT00000029797.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 98.2117 bits (243), Expect = 1.056e-19
Identity = 63/220 (28.64%), Postives = 110/220 (50.00%), Query Frame = 1
Query: 4684 KLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPG-QKGG-SVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVA 5337
            K  S S+PG + I Y+  +      K++      +  N  +P  W+    + IPK   S +   TS +R IALL+   KIF ++L+RR+ +++  N  +    QK G   F GC+EH  ++   ++ +K  +K+ L + WLD+ +A+GSVPH  +   L +  +    ++I+   ++N    +      T    ++ G+  GC IS ILF  A   +L+ 
Sbjct:  348 KARSGSAPGPNGIPYKLYKHCPQVTKIVWNLMRVVWKNKSVPAEWQEAVGIFIPKEQNSTT---TSQFRSIALLNVEGKIFFTVLARRIAQFLKSNSYIDTSCQKAGLPGFPGCIEHATMIWEQIQRAK-REKSDLHVVWLDMANAYGSVPHKLVEFALEFFYIPDCIITIIGKYFSNLRMSFVMDGFATGWQQLEVGIAMGCSISPILFVAAFEVILIG 563          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Nematostella
Match: EDO47846 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7RKZ5])

HSP 1 Score: 52.7582 bits (125), Expect = 6.965e-8
Identity = 38/109 (34.86%), Postives = 54/109 (49.54%), Query Frame = 1
Query: 4684 KLASASSPGDDNITYRDLRL----LDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKG 4998
            KL +  +PG D I    L+     L P    +++ FN I+D G  PD W      +I    KSG   D S +R I + S   K+F  IL+ RL  ++T N++LH  Q G
Sbjct:    2 KLKNNKAPGIDRIRNEMLKCGQQTLVP---CISKIFNLILDAGVYPDVWTRG---IISAIYKSGDKSDPSDYRGICVTSCLGKLFCFILNNRLQTFLTSNQILHQSQIG 104          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Medaka
Match: ENSORLT00000039772.1 (pep primary_assembly:ASM223467v1:15:12930252:12933724:1 gene:ENSORLG00000026021.1 transcript:ENSORLT00000039772.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 101.679 bits (252), Expect = 6.711e-21
Identity = 79/280 (28.21%), Postives = 131/280 (46.79%), Query Frame = 1
Query: 4687 LASASSPGDDNITYRDLRLLDPEGK-LLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAI-LTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYL 5520
            L+   +PG D +T    +      K L+   F +II NG +    +     LIPKPGK  ++ D  + RPI LL+T YK+ T I S R    IT  +++   Q G    +G   HN + L   +   ++       I +LD   AF  + H +++  L + G    F++ ++  Y  T+S  C    T+ + +I +G++QGCPIS +LF  A   + + I  +   K  + N++  I   ADD  +  N+  D+  ++      +   GL+    KC  L
Sbjct:   61 LSKDKAPGCDGLTSNFYKFFWEHIKDLIFEMFKEIIQNGSLTHTMKQGVITLIPKPGKDPTFLD--NLRPITLLNTDYKLLTHIFSNRFKSDIT--QIISETQSG--FIKGRSIHNNLRLVLDMIDYEHLIDTDGFILFLDFYKAFDMIEHNFMFQTLQFFGFGNKFINTVKTFYYETSSSVCLPQGTSHRFNISKGIKQGCPISPLLFIAAAEMLSLLIKHTDFGKLTVANAEFSISQLADDTTIFLNNLHDIPKILKTIDFFSKASGLKLNLNKCEIL 334          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Medaka
Match: ENSORLT00000042085.1 (pep primary_assembly:ASM223467v1:19:3759893:3763624:1 gene:ENSORLG00000027203.1 transcript:ENSORLT00000042085.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 102.064 bits (253), Expect = 7.507e-21
Identity = 81/281 (28.83%), Postives = 137/281 (48.75%), Query Frame = 1
Query: 4687 LASASSPGDDNITYRDLRLLDPEGK-LLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAI-LTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRI-GLEFRPEKCAYL 5520
            L++  +PG D +T    +      K L+   F +II NG +    +     LIPKPGK  ++ D  + RPI LL+T YK+ T I S R    IT  +++   Q G    +G   HN + L   +   ++       I +LD   AF  + H +++  L + G    F++ ++  Y  T+S  C    T+ + +I +G++QGCPIS +LF  A   + + I  +   K  + N++  I   ADD  +  N+  D+  ++  +++I S+  GL+    KC  L
Sbjct:  464 LSNDKAPGCDGLTSNFYKFFWQHIKDLIFEMFKEIIQNGSLTHTMKQGVITLIPKPGKDPTFLD--NLRPITLLNTDYKLLTHIFSNRFKSDIT--QIISETQSG--FIKGRSIHNNLRLVLDMIDYEHLIDTDGFILFLDFYKAFDMIEHNFMFQTLQFFGFGNKFINTVKTFYYETSSCVCLPQGTSHRFNINKGIKQGCPISPLLFIAAAEMLSLLIKHTDFGKLTVANAEFSISQLADDTTIFLNNLHDIPKIL-KTIDIFSKASGLKLNLNKCEIL 737          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Medaka
Match: ENSORLT00000045648.1 (pep primary_assembly:ASM223467v1:16:7714608:7718339:-1 gene:ENSORLG00000027038.1 transcript:ENSORLT00000045648.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 102.064 bits (253), Expect = 7.831e-21
Identity = 81/281 (28.83%), Postives = 137/281 (48.75%), Query Frame = 1
Query: 4687 LASASSPGDDNITYRDLRLLDPEGK-LLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAI-LTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRI-GLEFRPEKCAYL 5520
            L++  +PG D +T    +      K L+   F +II NG +    +     LIPKPGK  ++ D  + RPI LL+T YK+ T I S R    IT  +++   Q G    +G   HN + L   +   ++       I +LD   AF  + H +++  L + G    F++ ++  Y  T+S  C    T+ + +I +G++QGCPIS +LF  A   + + I  +   K  + N++  I   ADD  +  N+  D+  ++  +++I S+  GL+    KC  L
Sbjct:  464 LSNDKAPGCDGLTSNFYKFFWQHIKDLIFEMFKEIIQNGSLTHTMKQGVITLIPKPGKDPTFLD--NLRPITLLNTDYKLLTHIFSNRFKSDIT--QIISETQSG--FIKGRSIHNNLRLVLDMIDYEHLIDTDGFILFLDFYKAFDMIEHNFMFQTLQFFGFGNKFINTVKTFYYETSSSVCLPQGTSHRFNINKGIKQGCPISPLLFIAAAEMLSLLIKHTDFGKLTVANAEFSISQLADDTTIFLNNLHDIPKIL-KTIDIFSKASGLKLNLNKCEIL 737          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Medaka
Match: ENSORLT00000045921.1 (pep primary_assembly:ASM223467v1:18:29376025:29377426:1 gene:ENSORLG00000028454.1 transcript:ENSORLT00000045921.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 96.2857 bits (238), Expect = 1.342e-20
Identity = 67/206 (32.52%), Postives = 104/206 (50.49%), Query Frame = 1
Query: 4738 RLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPI 5355
            ++L P   +L   FN    NG +P +      +L+PKPGK  +     + RPI+LL++  KI   IL+ RL   +    ++H  Q G  +    + HN      + H      A  A+   D + AF  V   +L+++LA +G    F   ++LLYTN  +        +  I+IKRG RQGCP+S +LF LAI P+ +A+ + PI
Sbjct:   90 KILKPMLDMLQESFN----NGALPKSMTCALIILLPKPGKPSN--KCENMRPISLLNSDLKIICKILAMRLQNTLP--HVVHRDQNGFILGRQGL-HNVRRVLNIIHG-LEDTADQALLSFDAEKAFDKVEWPFLFNVLAKLGFGETFRKWIQLLYTNPTAEILTNKNISAPIEIKRGCRQGCPLSPLLFTLAIEPLAIAVRAHPI 285          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Medaka
Match: ENSORLT00000029493.1 (pep primary_assembly:ASM223467v1:9:31095018:31097924:1 gene:ENSORLG00000028372.1 transcript:ENSORLT00000029493.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 100.138 bits (248), Expect = 1.997e-20
Identity = 78/280 (27.86%), Postives = 131/280 (46.79%), Query Frame = 1
Query: 4687 LASASSPGDDNITYRDLRLLDPEGK-LLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAI-LTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYL 5520
            L+   +PG D +T    +      K L+   F +II NG +    +     LIPKPGK  ++ D  + RPI LL+T YK+ T I S R    IT  +++   Q G    +G   HN + L   +   ++       I +LD   AF  + H +++  L + G    F++ ++  Y  T+S  C    T+ + +I +G++QGCPIS +LF  A   + + I  +   K  + N++  I   ADD  +  N+  D+  ++      +   G++    KC  L
Sbjct:  464 LSKDKAPGCDGLTSNFYKFFWEHIKDLIFEMFKEIIQNGSLTHTMKQGVITLIPKPGKDPTFLD--NLRPITLLNTDYKLLTHIFSNRFKSDIT--QIISETQSG--FIKGRSIHNNLRLVLDMIDYEHLIDTDGFILFLDFYKAFDMIEHNFMFQTLQFFGFGNKFINTVKTFYYETSSCVCLPQGTSHRFNISKGIKQGCPISPLLFIAAAEMLSLLIKHTDFGKLTVANAEFSISQLADDTTIFLNNLHDIPKILKTIDFFSKASGIKLNLNKCEIL 737          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Planmine SMEST
Match: SMESG000062936.1 (SMESG000062936.1)

HSP 1 Score: 1802.72 bits (4668), Expect = 0.000e+0
Identity = 936/1270 (73.70%), Postives = 1066/1270 (83.94%), Query Frame = 1
Query: 3256 FDEVASNNLTTINEDFIIENITYKQFKDIAFIICPFRSNDVILTFFYNKLIKSSFIINTASNHHPSTDLTHVGKTLTMLFNQFCRTDIVQLSMDHSNVQTIEYDDLNLCKLSFKFIINSIQNTIVSDEWINCEFERHNPVSLKYMKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLIDNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGKQLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLMASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQNTDNETRKIIPSGQTLIQRRHISNCLQVQIVNCNVNEAYXXXXXXXXXXXXXXXFKPYLGDNFFSPLNNTTKMIKMKGGYFHNRKKVVTKIINEINIETRPNMENILDQFTQSDPGEFVFNIQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITISLEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCDQVETMSHVLQSCKSNGMLINERHNSCLNKIYNSIKSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTGNDIDPKYLR 7065
            FDEVAS+NL  INE FII+NITYKQFK+IA+I+CPFRSNDVI  FFYNK+ K SF+INT SNH  S +L  +G+TLTMLFNQFCR++IVQ+S D+S  Q +EYDDLN+CK SFK IINSI N  +SD+WI+CEF R+   S+K  K   I    +N+IGINAKLD  AID+YF+SL+DN+KY LVNETLSDALI+D WQHV+EFLDKT+L +ASIIF IF PPSSGKQLL+IDYEKCTSFLLNPTSS E PLH +TAFILLN+ISS R+N+GAAPLM SP HDIR  GLFSNILICSYI+NYLTDRSLFKMDLT VGNKL+SILLPP QNTD    K+    +T+ +RR I N L  +I  CN NEAY+KII+++NK+NIKP  KPYLG NFF+ LN+  KMIK+KGGYFHNRKKV T+I++EINI+TRP+MENIL+QFTQ DP EFVF+I ++C +S + +EL +I+A++VIFE K A+ASSPGDDNITYRDLRLLDPEGKLLAR FN IID+GQIP++W SFKT+LIPK                                             PG+             ++LTA LEHSKYSKKAPLAIAWLDIKDAFGSVPH Y+W+LL YIGVD HFVS+LRLLYT TNSYYCCGPL TPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKY++G+S++QILAYADDIALIANSATDLQALI +SVEIASRIGLEFRPEKCAYL+TSNS+D+N + LN++KLKKLKD+EFY YLGVPVGE PNQTPYDTLEKLL+D NK+KNSDLYPWQKVD YKTF HSRL FAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIY FFNLPHNAPKAYIYSPI TGGAAF+DLHDEYS L+IVQAFRLLTCKCP TSA+I DSLRFVASTRIRIS PSISQCLEWINGKS+ KNSLS+KTWWIRFR+AI HL+NTRGI I LEYDGFFSLRIT EHRGTTIIYSLDRKKLA FLHKL+QETY+MDL NGK+NNFIA+TYV CPLINKAIFKS+LN VSWNF+HRARTNTLA+NARPQN S+ +RKCRKCDQVET+SHVLQSC SNG+LINERHN+CL K+YN IKS +KI  LDQKC+LV  D KRVDLLIRDNKN TIKLVD+KCPLDTEFNF N+++LN+ KY++LK  +E A P+YKVTLSTCIFGSLGS+P  T  +L +IGVK  DR SL+V CA+SNIEYSARIW FHSTGNDI+PKYLR
Sbjct:  308 FDEVASDNLIVINETFIIDNITYKQFKEIAYIVCPFRSNDVIHIFFYNKISKISFVINTLSNHLLS-NLKIIGETLTMLFNQFCRSEIVQMSTDYSKAQFMEYDDLNVCKSSFKIIINSINNNNISDDWIDCEFGRYIDNSIKANK---IPLTTLNVIGINAKLDGFAIDKYFQSLLDNDKYFLVNETLSDALISDSWQHVAEFLDKTKLLDASIIFVIFVPPSSGKQLLIIDYEKCTSFLLNPTSSKEIPLHVETAFILLNYISSVRNNVGAAPLMTSPLHDIRTCGLFSNILICSYITNYLTDRSLFKMDLTCVGNKLESILLPPKQNTDQNILKLPLICKTINERRKICNILLAEIATCNANEAYSKIISAVNKSNIKPEHKPYLGKNFFNSLNSDNKMIKIKGGYFHNRKKVATRILDEINIDTRPSMENILNQFTQMDPDEFVFSISDYCTKSKEIIELDQITAKEVIFEYKQANASSPGDDNITYRDLRLLDPEGKLLARLFNIIIDSGQIPNDWLSFKTILIPK---------------------------------------------PGK-------------SVLTAVLEHSKYSKKAPLAIAWLDIKDAFGSVPHAYIWNLLTYIGVDEHFVSVLRLLYTKTNSYYCCGPLITPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYRMGDSEVQILAYADDIALIANSATDLQALIESSVEIASRIGLEFRPEKCAYLSTSNSSDINGVNLNDVKLKKLKDKEFYHYLGVPVGECPNQTPYDTLEKLLSDANKIKNSDLYPWQKVDMYKTFLHSRLHFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYSFFNLPHNAPKAYIYSPIETGGAAFIDLHDEYSILTIVQAFRLLTCKCPNTSAVIHDSLRFVASTRIRISGPSISQCLEWINGKSYKKNSLSKKTWWIRFRDAILHLKNTRGIDILLEYDGFFSLRITAEHRGTTIIYSLDRKKLASFLHKLVQETYYMDLHNGKLNNFIADTYVKCPLINKAIFKSRLNTVSWNFVHRARTNTLAINARPQNKSDEARKCRKCDQVETLSHVLQSCNSNGILINERHNACLRKLYNLIKSPDKITVLDQKCDLVENDSKRVDLLIRDNKNKTIKLVDVKCPLDTEFNFINTHELNMSKYNDLKLNIEKALPNYKVTLSTCIFGSLGSIPDKTFDVLDDIGVKFTDRNSLSVECALSNIEYSARIWQFHSTGNDINPKYLR 1515          

HSP 2 Score: 330.487 bits (846), Expect = 2.836e-91
Identity = 159/203 (78.33%), Postives = 175/203 (86.21%), Query Frame = 1
Query: 2440 EIVRRTVDENSENDVNSPFYKTRSHIKAKGKPKECPS--DSPSLSSVLIDKYGPHDKAKWFTDLDISCYLEWKINGDKHFALNAGIVDAIASKMEDYKPPLLERLVKCDTVLCPLNIKNKHWVLFVYEKSTAESFVMDPLPVPFEPEETALRASAVNDCFNKLFKINSKIIDNKYPTAKKQTNSNDCGPIICGYAKKISFGKT 3042
            EI+     ++   +V+SPF KTRS IK K + KE        SLS+ LIDKYGPHDK+KWFTDLDIS YLEWKIN DKHFALNAGIVDAIASK++DYKPPLLERLVKCDT+LCPLNIKN+HWVLFVY KS++ESFVMDPLPVPFEPEETALRA+AVNDCFNKLFKINSKIIDNKYPTAKKQ NSNDCGPIICGYAKKISF + 
Sbjct:  109 EILNDPSGDSCAGEVSSPFCKTRSQIKFKDRVKETERVPMPQSLSATLIDKYGPHDKSKWFTDLDISSYLEWKINDDKHFALNAGIVDAIASKLDDYKPPLLERLVKCDTILCPLNIKNQHWVLFVYIKSSSESFVMDPLPVPFEPEETALRAAAVNDCFNKLFKINSKIIDNKYPTAKKQANSNDCGPIICGYAKKISFDEV 311          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Planmine SMEST
Match: SMESG000021486.1 (SMESG000021486.1)

HSP 1 Score: 987.638 bits (2552), Expect = 0.000e+0
Identity = 626/1522 (41.13%), Postives = 860/1522 (56.50%), Query Frame = 1
Query: 2599 DKAKWFTDLDISCYLEWKINGDKHFALNAGIVDAIASKMEDYKPPLLERLVKCDTVLCPLNIKNKHWVLFVYEKSTAESFVMDPLPVPFEPEETALRASAVNDCFNKLFKINSKIIDNKYPTAKKQTNSNDCGPIICGYAKKISFGKTDLNKIMAQEIRKETHHRRCTIKLHEDTTHEXXXXXXXXXXXXLVGRGPAXXXXXXXXXXXXXXKENISVLVFDEVASNNLTTI------------NEDFIIENITYKQFKDIAFIICPFRSNDVILTFFYNKLIKSSFIINTASNHHPSTDLTHVGKTLTMLFNQFCRTDIVQLSMDHSNVQ--TIEYDDLNLCKLSFKFIINSIQNTIVSDEWINCEFERHNPVSLKYMKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLIDNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGKQLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLMASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQNTDNETRKIIPSGQTLI-----------QRRHISNCLQVQIVNCNVNEAYXXXXXXXXXXXXXXXFKPYLGDNFFSPLNNTTKMIKMKGG-----YFHNRKKVVTKIINEINIETRPNMENILDQFTQSDPGEFVF-NIQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITIS-LEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCD-QVETMSHVLQSCKSNGMLINERHNSCLNKIYNSI-KSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTGNDIDPKYL 7062
            D   WFTD DI  YL+  I+  +  ++N  IV+ + +   +   P+ E++ K   +LCPLNI   HW+LFVY K++  S+ +DP+            A  V    N+LF + + +  + +     Q NS DCGP IC YA  IS             IRKE H     ++L      +        +   ++ R     +      F       IS L+ +   +   TT+            N  FI EN+T++QF  I  II    ++     F ++ ++ +S+I N   +   +  L  +G+ +T   N+F  ++   + +   N++  +I  DD  L   +F  +I S+  +         EF       LK +K       C+NL  IN K++     +YF +L  N+ ++L+NET+  A+++DC +H+++FL+  +L  A +IFAI  PP   + LL+IDY     +++NPT+            I  N I+  ++  G    M   P+DIR SGL S +LIC++I NY  D  L  ++L  +G  + S LLP      +E++KI+ S  T I           QR+   + L   + +  VNE    II       +K   KPYLG        N T+  K+  G     +  + K  V K+ N  N+  RP  + I+  F    P    + N   FC RS + L L   S  +++F+LK A  +SPG D I YRDL LLDPEGKLL   FNKII N  IPD+W+SFKTLL PKP K G Y+D SSWRPIALLS  YK+F S L+ RLT WI  N LLH GQKGGS  +GCVEHN+IL+AALEHSKYSK  PLAIAWLDIKDAFGSVPH Y+W LL +IGV   F SIL+LLY++T+++Y CGP+ TP I IK+GV+QGCPISMILF LAINPVL A+S S    + IG S ++ILAYADDIALIA S  DLQ + N +V  A+ IG E+RPEKC YL        + I++NN K+KKL  +EFYQYLGVPVGE P+Q+PYD L+K ++DT KL NS+L  WQK+ AYK F HSRL F FRTREIK  A+ + +  ++ + N+S+QLR        LP+ +  +Y Y+    GGA  VDL DEY T +I   FRLLT +C     I  DSL+FV   R+ I  PS+ + L+W NGK    +   RKT + R R A+     T  I++S +  D   SL ITT  RGT I+ S  RK  +  LH  + ++Y        ++NFIA+     P INKAIF+S+L+  +WNFIHRARTNTL++ A+  N  +  R CR C  + ETMSH +QSCK +  L  ERHN CL  I +++ K+S  I+ +D  C LVP   +RVDL+I D     I LVD+KCP D+  NF+  + LNL KYD LK++++   P +KV L TC+FGSLGS P  T  IL +IGVK  +   L   CA+SNI +SAR WHFH TG  +DP+ +
Sbjct: 4722 DPTIWFTDDDIDLYLKNHISSLEFASINCFIVEILRTNPGEDLFPIPEKVFKAQIILCPLNIDKTHWILFVYCKNSLTSYFIDPILQYRHRFVNKTYALQVTMALNQLFNLKAAMTFHPHENFLYQENSYDCGPYICAYAMMISGVWQSFPDHFIDIIRKEVHQ----LQLFCQQKDQKVSGGGLPKFMEIIKRNGVRKTSSATQAFLLKQNSCISSLLDEYFKTRTNTTVLSPELTMNIVAQNLTFIRENVTFRQFTGIDHIISIVPNDSNWSLFVFSLVLGNSYIFN-FHDRIVTERLIAIGENMTEYLNKFIISNRKVIFISEHNIEHASITSDDSLLFASAFIILIESLIFSKQVSHTRTSEF-------LKSVKIKQANEICLNLNSINCKINEEITAKYFSNLKLNSNFMLLNETMCTAILDDCTRHLTKFLNYEDLIKAQVIFAIIAPPKHREILLIIDYITDEYYIINPTTLCGTESSQSVCRIFRNKINEIKETPGRVINMGDIPNDIRGSGLVSKLLICAFIKNYANDLPLINLNLREIGTNISS-LLP----IISESKKILMSDLTKITKIKKSNLNLEQRKLKIDELLHDVSHLTVNEI-IDIIVRQFPDRVKSKHKPYLG--------NKTRPQKINKGILVRKFMTDMKWTVDKVFNNTNVSIRPTFQKIVLNFLIKSPFHGNWNNPLNFCKRSKQALNLELFSNFEIMFQLKRADNTSPGIDGIQYRDLLLLDPEGKLLTFLFNKIIVNKIIPDSWKSFKTLLTPKPNKDGKYDDVSSWRPIALLSVIYKVFASCLASRLTYWINTNNLLHIGQKGGSRHDGCVEHNSILSAALEHSKYSKNCPLAIAWLDIKDAFGSVPHDYMWSLLRFIGVGEDFTSILQLLYSDTSTFYSCGPILTPNIPIKQGVKQGCPISMILFALAINPVLEAVSRSDCEPFMIGESPVKILAYADDIALIAKSVEDLQKITNIAVRTAAEIGFEYRPEKCGYLQLPKVHIDSEILINNTKIKKLLSKEFYQYLGVPVGEEPDQSPYDILDKAVSDTRKLANSELLGWQKLKAYKIFVHSRLIFPFRTREIKTGALSKSNGNNNRSVNISSQLRGCFRKMLCLPNQSEVSYFYNATEKGGANCVDLLDEYHTQTITHFFRLLTSECDYARQINIDSLKFVTGPRLGIKTPSLQESLDWFNGKESKPHHSGRKTRFQRARIAVAFFEKTHNISVSFIVKDHKPSLYITTAMRGTIILTSDLRKTTSKVLHMALCDSYLSKWEKSCVSNFIASAVKLSPKINKAIFRSELSEFAWNFIHRARTNTLSIFAKHHNKGD-KRLCRLCHAEDETMSHAIQSCKVHLTLGAERHNDCLKLIASNLTKNSNLIVVVDHVCSLVPNSKERVDLMITDLSRKKIFLVDMKCPCDSVNNFEQVDLLNLRKYDSLKNQIQAVKPDFKVELDTCVFGSLGSFPLKTSEILVKIGVKRTELKPLLKECALSNIAHSARRWHFHKTGILVDPEKM 6216          

HSP 2 Score: 980.319 bits (2533), Expect = 0.000e+0
Identity = 625/1516 (41.23%), Postives = 858/1516 (56.60%), Query Frame = 1
Query: 2599 DKAKWFTDLDISCYLEWKINGDKHFALNAGIVDAIASKMEDYKPPLLERLVKCDTVLCPLNIKNKHWVLFVYEKSTAESFVMDPLPVPFEPEETALRASAVNDCFNKLFKINSKIIDNKYPTAKKQTNSNDCGPIICGYAKKISFGKTDLNKIMAQEIRKETHHRRCTIKLHEDTTHEXXXXXXXXXXXXLVGRGPAXXXXXXXXXXXXXXKENISVLVFDEVASNNLTTINEDFIIENITYKQFKDIAFIICPFRSNDVILTFFYNKLIKSSFIINTASNHHPSTDLTHVGKTLTMLFNQFCRTDIVQLSMDHSNVQ--TIEYDDLNLCKLSFKFIINSIQNTIVSDEWINCEFERHNPVSLKYMKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLIDNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGKQLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLMASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQNTDNETRKIIPSGQTLI-----------QRRHISNCLQVQIVNCNVNEAYXXXXXXXXXXXXXXXFKPYLGDNFFSPLNNTTKMIKMKGG-----YFHNRKKVVTKIINEINIETRPNMENILDQFTQSDPGEFVFN-IQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITIS-LEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCD-QVETMSHVLQSCKSNGMLINERHNSCLNKIYNSI-KSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTGNDIDPK--YLRHVN 7074
            D   WFTD DI  YL+  I+  +  ++N  IV+ + +   +   P+ E++ K   +LCPLNI   HW+LFVY K++  S+ +DP+            A  V    N+LF + + +  + +     Q NS DCGP IC YA  IS             IRKE H     ++L      +        +   ++ R     +      F       IS L+ +   +   TT+    +  NI  +    +  II P  SN  +  F ++ ++ +S+I N   +   +  L  +G+ +T   N+F  ++   + +   N++  +I  DD  L   +F  +I S+  +         EF       LK +K       C+NL  IN K++     +YF +L  N+ ++L+NET+  A+++DC +H+++FL+  +L  A +IFAI  PP   + LL+IDY     +++NPT+            I  N I+  ++  G    M   P+DIR SGL S +LIC++I NY  D  L  ++L  +G  + S LLP      +E++KI+ S  T I           QR+   + L   + +  VNE    II       +K   KPYLG        N T+  K+  G     +  + K  V K+ N  N+  RP  + I+  F    P +  +N    FC RS + L L   S  +++F+LK A  +SPG D I YRDL LLDPEGKLL   FNKII N  IPD+W+SFKTLL PKP K G Y+D SSWRPIALLS  YK+F S L+ RLT WI  N LLH GQKGGS  +GCVEHN+IL+AALEHSKYSK  PLAIAWLDIKDAFGSVPH Y+W LL +IGV   F SIL+LLY++T+++Y CGP+ TP I IK+GV+QGCPISMILF LAINPVL A+S S    + IG S ++ILAYADDIALIA S  DLQ + N +V  A+ IG E+RPEKC YL        + I++NN K+KKL  +EFYQYLGVPVGE P+Q+PYD L+K ++DT KL NS+L  WQK+ AYK F HSRL F FRTREIK  A+ + +  ++ + N+S+QLR        LP+ +  +Y Y+    GGA  VDL DEY T +I   FRLLT +C     I  DSL+FV   R+ I  PS+ + L+W NGK    +   RKT + R R A+     T  I++S +  D   SL ITT  RGT I+ S  RK  +  LH  + ++Y        ++NFIA+     P INKAIF+S+L+  +WNFIHRARTNTL++ A+  N  +  R CR C  + ETMSH +QSCK +  L  ERHN CL  I +++ K+S  I+ +D  C LVP   +RVDL+I D     I LVD+KCP D+  NF+  + LNL KYD LK++++   P +KV L TC+FGSLGS P  T  IL +IGVK  +   L   CA+SNI +SAR WHFH TG  +DP+  Y   VN
Sbjct: 2380 DPTIWFTDDDIDLYLKNHISSLEFASINCFIVEILRTNPGEDLFPIPEKVFKAQIILCPLNIDKTHWILFVYCKNSLTSYFIDPILQYRHRFVNKTYALQVTMALNQLFNLKAAMTFHPHENFLYQENSYDCGPYICAYAMMISGVWQSFPDHFIDIIRKEVHQ----LQLFCQQKDQKVSGGGLPKFMEIIKRNGVRKTSSATQAFLLKQNSCISSLLDEYFKTRTNTTVLSPELTMNIVAQISHSLEKIIVPNDSNWSL--FVFSLVLGNSYIFN-FHDRIVTERLIAIGENMTEYLNKFIISNRKVIFISEHNIEHASITSDDSLLFASAFIILIESLIFSKQVSHTRTSEF-------LKSVKIKQANEICLNLNSINCKINEEITAKYFSNLKLNSNFMLLNETMCTAILDDCTRHLTKFLNYEDLIKAQVIFAIIAPPKHREILLIIDYITDEYYIINPTTLCGTESSQSVCRIFRNKINEIKETPGRVINMGDIPNDIRGSGLVSKLLICAFIKNYANDLPLINLNLREIGTNISS-LLP----IISESKKILMSDLTKITKIKKSNLNLEQRKLKIDELLHDVSHLTVNEI-IDIIVRQFPDRVKSKHKPYLG--------NKTRPQKINKGILVRKFMTDMKWTVDKVFNNTNVSIRPTFQKIVLNFLIKSPFQGNWNNPLNFCKRSKQALNLELFSNFEIMFQLKRADNTSPGIDGIQYRDLLLLDPEGKLLTFLFNKIIVNKIIPDSWKSFKTLLTPKPNKDGKYDDVSSWRPIALLSVIYKVFASCLASRLTYWINTNNLLHIGQKGGSRHDGCVEHNSILSAALEHSKYSKNCPLAIAWLDIKDAFGSVPHDYMWSLLRFIGVGEDFTSILQLLYSDTSTFYSCGPILTPNIPIKQGVKQGCPISMILFALAINPVLEAVSRSDCEPFMIGESPVKILAYADDIALIAKSVEDLQKITNIAVRTAAEIGFEYRPEKCGYLQLPKVHIDSEILINNTKIKKLLSKEFYQYLGVPVGEEPDQSPYDILDKAVSDTRKLANSELLGWQKLKAYKIFVHSRLIFPFRTREIKTGALSKSNGNNNRSVNISSQLRGCFRKMLCLPNQSEVSYFYNATEKGGANCVDLLDEYHTQTITHFFRLLTSECDYARQINIDSLKFVTGPRLGIKTPSLQESLDWFNGKESKPHHSGRKTRFQRARIAVAFFEKTHNISVSFIVKDHKPSLYITTAMRGTIILTSDLRKTTSKVLHMALCDSYLSKWEKSCVSNFIASAVKLSPKINKAIFRSELSEFAWNFIHRARTNTLSIFAKHHNKGD-KRLCRLCHAEDETMSHAIQSCKVHLTLGAERHNDCLKLIASNLTKNSNLIVVVDHVCSLVPNSKERVDLMITDLSRKKIFLVDMKCPCDSVNNFEQVDLLNLRKYDSLKNQIQAVKPDFKVELDTCVFGSLGSFPLKTSEILVKIGVKRTELKPLLKECALSNIAHSARRWHFHKTGILVDPEKIYKNEVN 3866          

HSP 3 Score: 511.531 bits (1316), Expect = 3.611e-145
Identity = 296/627 (47.21%), Postives = 402/627 (64.11%), Query Frame = 1
Query: 5149 LLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITISLE-YDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKC-DQVETMSHVLQSCKSNGMLINERHNSCLNKIYNSIKSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWH 7023
            LL ++ V   FV+IL+LLYT+T+S+Y C  + TP I IK+GV+QGCPISMILF +AINPVL AIS S    + IG S +Q+LAYADDIALIA +++DLQ L + +V  A+ IG E+RPEKCAY+       ++ II+NN+ +KKL   EFYQYLGVP+G++ +Q+PY+ + K++ DT K+ +S+LY WQK+ AYK F HSRL F FRTREIK S++  PS     N N ++QLR  +    NLP+++  +Y Y+    GGA  +DL DEY   +IV  FRLLT  C  +  I   SL  V S R+ I +P++SQ  EWINGK    N   RKT + R R +I++      I +  E +    S+RITT   GT II    RK L+  LH  +Q++Y         +N + +    CPL+NK IF+S L    W FIHRARTNTL+  A+P    E  R CR+C  + ET+ HVLQ+CK N  L   RHN+CL KI +SIK  E ++ +D KC LVP   +RVDL+I + +  TI LVD+KCP+D+  NFQ  + LNL+KY +LK  ++   P YKV L TCI GSLGS+ S T+ +L ++GV+    + L   CAISNI +SA+I +
Sbjct:  902 LLKHMNVSAEFVNILQLLYTDTSSFYQCNQMQTPDIPIKKGVKQGCPISMILFAIAINPVLEAISLSNTTPFMIGTSSVQVLAYADDIALIATNSSDLQTLFDLAVNTANMIGFEYRPEKCAYIQYPKVETVSEIIVNNVLIKKLASNEFYQYLGVPIGDNNDQSPYEIVSKVIEDTKKIAHSNLYGWQKLKAYKIFLHSRLIFPFRTREIKTSSLADPSRNSRINVNNTSQLRGHLRKILNLPNHSEISYFYNSTENGGAGCIDLLDEYHAQTIVYFFRLLTSTCGYSKTINTFSLLSVTSPRLGIKNPTLSQSFEWINGKESKLNHSGRKTRFHRLRTSIEYFSRVHNIKLLFEIFKQNPSIRITTSLTGTRIILPKLRKTLSKILHSALQDSYLSKWEMSNHSNLLVSAIKLCPLVNKLIFRSDLTDFEWKFIHRARTNTLSTFAKPHKAGE-DRLCRRCKSEDETILHVLQTCKINQSLATNRHNACLIKIKDSIKDPELLVVVDHKCSLVPTSSERVDLIITNTEKKTILLVDMKCPIDSVSNFQLVDNLNLEKYAKLKLDIQAVKPDYKVELHTCIMGSLGSIESKTEDLLLKMGVQPNRIIGLMKECAISNIAHSAKICY 1527          

HSP 4 Score: 233.032 bits (593), Expect = 1.964e-60
Identity = 215/766 (28.07%), Postives = 356/766 (46.48%), Query Frame = 1
Query: 2470 SENDVNSPFYKTRSHIKAKGKPKECPSDSPSLSSVLIDKYGPHDKAKWFTDLDISCYLEWKINGDKHFALNAGIVDAIASKMEDYKPPLLERLVKCDTVLCPLNIKNKHWVLFVYEKSTAESFVMDPLPVPFEPEETALRASAVNDCFNKLFKINSKIIDNKYPTAKKQTNSNDCGPIICGYAKKISFGKTDLNKIMAQEIRKETHHRRCTIKLHEDTTHEXXXXXXXXXXXXLVGRGPAXXXXXXXXXXXXXX------------KENISVLVFDEVASNNLTTINEDFIIENITYKQFKDIAFIICPF-RSNDVILTFFYNKLIKSSFIINTASNHHPSTD-LTHVGKTLTMLFNQFCRTDIVQLSMDH--SNVQTIEYDDLNLCKLSFKFIINSIQNTIVSDEWINCEFERHNPVSLKYMKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLIDNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGKQLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLMASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQNTDN---ETRKIIPSGQT----LIQRRHISNCLQVQIVNCNVNE---AYXXXXXXXXXXXXXXXFKPYLGDNFFSPLNNTTKMIKMKGGYFHNRKKVVTKIINEINIETRPNMENILDQFTQSD-PGEFVFNIQEFCIRSNKPLELCKISARDVIFELK 4686
            +EN +N P  +T      K +P + P  +     + ID   P     WF+D DI  YLE  I+   H  +   IV+ + S +++    +   +   + V CPLNI N HW+LFVY K +  S+ +DP+            A  +N   N+ F +   +  N +   + Q N+ DCGP  C YA  IS    +       EIR++ H  +  I  +     +   D+        + +   +++                       + N  V V     + NL   N  FI +N+T+KQFK I  I+C   + ND  L F Y+  +K S I +   N+    D L  +G  +T   NQFC    V+   +H  S+   + YD          FI+  +   +V +  ++C+ + ++ +S    ++ I  SP +NLI  N K+    I  YF  L  ++ Y+L+N+TL  A+ +DC Q++ +FL+   +  A++IFAI +PP   + LLVIDY     + L+P ++  N +     + +L  I+  R++ G A       HDIR SGL S ILIC++I NY  D SL  ++L  +G K+ + L+P    +DN   E R+ + +  T    L QR+   N L +++ + +V+E        I  +     K   +PYLG N     +N  K       +  N K  V +I+N+ +++  P  + I++ FT  D   E   N+ +F   S++ LEL  IS+ +++  L+
Sbjct:  155 TENIINPPTKRTGQRNTVKKQPSK-PKSTICNKYLTIDNRDPE---IWFSDNDIDWYLETHIDNPMHGYIKCFIVNILCSNIQEEVVAIPRSISNAEMVFCPLNINNSHWILFVYCKKSHSSYFIDPILKNRNNLINKQMALKINIALNRFFNLKIAVSYNPHKNLQYQNNNFDCGPFTCAYAVIISKDLDNFPDGFIDEIRRDVHRNQMEIVGNLIGGAKTKMDMSSFNFKKAIAKTKYSNNNPNTSNILKSQMPCVSSIICKYYENNCKVSVLSAELTLNLIAQNNSFIADNVTFKQFKGIDHILCVIPKFNDWCL-FIYSIKLKHSHIFDF--NYRTIDDRLIEIGVKITDYLNQFCLNRNVKFLANHHISHSTFLSYDTQYFAS---HFIV--LFEKLVYEIHLDCD-KVYDSLSKIKTETCIEISPGLNLI--NGKITDKIIFEYFNKLKLSDDYILLNKTLCVAISDDCTQYMKDFLNIECMMKANVIFAIISPPKVHELLLVIDYSTEEHYYLSPVTTDLNVVSQHFCYNILIKINELRESKGRAMKTGHCRHDIRGSGLLSKILICAFIRNYALDYSLENINLHEIG-KIVNSLVPIVDGSDNVKYEKREKLKAANTLKLNLKQRKEKINQLLIKLSDSDVDEIICTILTNIPRLENLQKKEFREPYLGKN-----HNKPKKFNNPAEFLTNMKTTVYRILNDQSVKIEPPFQEIINNFTHPDVVPECWDNVLKFSKISDELLELTYISSEEILSNLQ 899          

HSP 5 Score: 109.768 bits (273), Expect = 3.807e-23
Identity = 87/320 (27.19%), Postives = 148/320 (46.25%), Query Frame = 2
Query: 1319 DTAFDLSSGGTDILFNLGHVNFRKLSKSNDELRETLQIANPKEKLIKLIFLLNCLKIEGKISKTSKISNNNFAYNYILNCGILHSVAPVDELLNFRIENFEMWGPKIIRAINNQSHYHNACSEMLLVLRKHKLQLEYDSLNNVIDPAVISKINSLYKFIEKNSDDIKCQYNYANNLLEQSKLFKSKNHDKELTSNAVSFTLNILRFKINLLNRGLAKCFYSKSTKTTCPYDQGIVITELIDFLTAFLLKKFKDEIPKFALPLKPETVKSAKVGDR---VTLVSSVKLKKTNPNKITLVPTGDYVSEPVVESNMNLYSETK 2269
            D+    + G +DI+FN+G +NF ++   +  LR         EK+I +IF LN L+++G++  TS  S  ++  NY+ NC  L+S++ ++      +++ E W P +++ I   S++ ++C E LL LR+ KL  E D L+N+I P  I  INS    I+K       Q     NL       KS++ D E  + A  F   ++R K     + L    YS      C   + I  T +   + + +           +L L      S ++ DR   +TLV++ +  + NP       +G  V   VVE+    ++  K
Sbjct: 2027 DSDHSQAEGESDIMFNIGLLNFHEMCLKDTTLRSLFSQPKSSEKIIDIIFHLNYLQLKGELKHTSNGSYGDYVCNYVRNCLSLYSISDIETYNTLSLDDIEKWAPSLVKMIPPLSYHSSSCKERLLRLRQWKLSSEIDRLHNIISPETIKHINSFLSEIDKVYPQKATQLKALQNLSNNINAIKSES-DDETIAKAKRFLFFVIRDK----KKFLLGSKYSYRIPLHC---RKICTTNMTTHIFSIIQ----------SLLLTLTVSDSEQIADRSDSITLVNTTQSVEINPG----TQSGSIVDISVVETGSTEHARVK 2324          

HSP 6 Score: 109.768 bits (273), Expect = 3.807e-23
Identity = 87/320 (27.19%), Postives = 148/320 (46.25%), Query Frame = 2
Query: 1319 DTAFDLSSGGTDILFNLGHVNFRKLSKSNDELRETLQIANPKEKLIKLIFLLNCLKIEGKISKTSKISNNNFAYNYILNCGILHSVAPVDELLNFRIENFEMWGPKIIRAINNQSHYHNACSEMLLVLRKHKLQLEYDSLNNVIDPAVISKINSLYKFIEKNSDDIKCQYNYANNLLEQSKLFKSKNHDKELTSNAVSFTLNILRFKINLLNRGLAKCFYSKSTKTTCPYDQGIVITELIDFLTAFLLKKFKDEIPKFALPLKPETVKSAKVGDR---VTLVSSVKLKKTNPNKITLVPTGDYVSEPVVESNMNLYSETK 2269
            D+    + G +DI+FN+G +NF ++   +  LR         EK+I +IF LN L+++G++  TS  S  ++  NY+ NC  L+S++ ++      +++ E W P +++ I   S++ ++C E LL LR+ KL  E D L+N+I P  I  INS    I+K       Q     NL       KS++ D E  + A  F   ++R K     + L    YS      C   + I  T +   + + +           +L L      S ++ DR   +TLV++ +  + NP       +G  V   VVE+    ++  K
Sbjct: 4376 DSDHSQAEGESDIMFNIGLLNFHEMCLKDTTLRSLFSQPKSSEKIIDIIFHLNYLQLKGELKHTSNGSYGDYVCNYVRNCLSLYSISDIETYNTLSLDDIEKWAPSLVKMIPPLSYHSSSCKERLLRLRQWKLSSEIDRLHNIISPETIKHINSFLSEIDKVYPQKATQLKALQNLSNNINAIKSES-DDETIAKAKRFLFFVIRDK----KKFLLGSKYSYRIPLHC---RKICTTNMTTHIFSIIQ----------SLLLTLTVSDSEQIADRSDSITLVNTTQSVEINPG----TQSGSIVDISVVETGSTEHARVK 4673          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Planmine SMEST
Match: SMESG000021486.1 (SMESG000021486.1)

HSP 1 Score: 987.252 bits (2551), Expect = 0.000e+0
Identity = 626/1522 (41.13%), Postives = 860/1522 (56.50%), Query Frame = 1
Query: 2599 DKAKWFTDLDISCYLEWKINGDKHFALNAGIVDAIASKMEDYKPPLLERLVKCDTVLCPLNIKNKHWVLFVYEKSTAESFVMDPLPVPFEPEETALRASAVNDCFNKLFKINSKIIDNKYPTAKKQTNSNDCGPIICGYAKKISFGKTDLNKIMAQEIRKETHHRRCTIKLHEDTTHEXXXXXXXXXXXXLVGRGPAXXXXXXXXXXXXXXKENISVLVFDEVASNNLTTI------------NEDFIIENITYKQFKDIAFIICPFRSNDVILTFFYNKLIKSSFIINTASNHHPSTDLTHVGKTLTMLFNQFCRTDIVQLSMDHSNVQ--TIEYDDLNLCKLSFKFIINSIQNTIVSDEWINCEFERHNPVSLKYMKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLIDNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGKQLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLMASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQNTDNETRKIIPSGQTLI-----------QRRHISNCLQVQIVNCNVNEAYXXXXXXXXXXXXXXXFKPYLGDNFFSPLNNTTKMIKMKGG-----YFHNRKKVVTKIINEINIETRPNMENILDQFTQSDPGEFVF-NIQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITIS-LEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCD-QVETMSHVLQSCKSNGMLINERHNSCLNKIYNSI-KSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTGNDIDPKYL 7062
            D   WFTD DI  YL+  I+  +  ++N  IV+ + +   +   P+ E++ K   +LCPLNI   HW+LFVY K++  S+ +DP+            A  V    N+LF + + +  + +     Q NS DCGP IC YA  IS             IRKE H     ++L      +        +   ++ R     +      F       IS L+ +   +   TT+            N  FI EN+T++QF  I  II    ++     F ++ ++ +S+I N   +   +  L  +G+ +T   N+F  ++   + +   N++  +I  DD  L   +F  +I S+  +         EF       LK +K       C+NL  IN K++     +YF +L  N+ ++L+NET+  A+++DC +H+++FL+  +L  A +IFAI  PP   + LL+IDY     +++NPT+            I  N I+  ++  G    M   P+DIR SGL S +LIC++I NY  D  L  ++L  +G  + S LLP      +E++KI+ S  T I           QR+   + L   + +  VNE    II       +K   KPYLG        N T+  K+  G     +  + K  V K+ N  N+  RP  + I+  F    P    + N   FC RS + L L   S  +++F+LK A  +SPG D I YRDL LLDPEGKLL   FNKII N  IPD+W+SFKTLL PKP K G Y+D SSWRPIALLS  YK+F S L+ RLT WI  N LLH GQKGGS  +GCVEHN+IL+AALEHSKYSK  PLAIAWLDIKDAFGSVPH Y+W LL +IGV   F SIL+LLY++T+++Y CGP+ TP I IK+GV+QGCPISMILF LAINPVL A+S S    + IG S ++ILAYADDIALIA S  DLQ + N +V  A+ IG E+RPEKC YL        + I++NN K+KKL  +EFYQYLGVPVGE P+Q+PYD L+K ++DT KL NS+L  WQK+ AYK F HSRL F FRTREIK  A+ + +  ++ + N+S+QLR        LP+ +  +Y Y+    GGA  VDL DEY T +I   FRLLT +C     I  DSL+FV   R+ I  PS+ + L+W NGK    +   RKT + R R A+     T  I++S +  D   SL ITT  RGT I+ S  RK  +  LH  + ++Y        ++NFIA+     P INKAIF+S+L+  +WNFIHRARTNTL++ A+  N  +  R CR C  + ETMSH +QSCK +  L  ERHN CL  I +++ K+S  I+ +D  C LVP   +RVDL+I D     I LVD+KCP D+  NF+  + LNL KYD LK++++   P +KV L TC+FGSLGS P  T  IL +IGVK  +   L   CA+SNI +SAR WHFH TG  +DP+ +
Sbjct: 4664 DPTIWFTDDDIDLYLKNHISSLEFASINCFIVEILRTNPGEDLFPIPEKVFKAQIILCPLNIDKTHWILFVYCKNSLTSYFIDPILQYRHRFVNKTYALQVTMALNQLFNLKAAMTFHPHENFLYQENSYDCGPYICAYAMMISGVWQSFPDHFIDIIRKEVHQ----LQLFCQQKDQKVSGGGLPKFMEIIKRNGVRKTSSATQAFLLKQNSCISSLLDEYFKTRTNTTVLSPELTMNIVAQNLTFIRENVTFRQFTGIDHIISIVPNDSNWSLFVFSLVLGNSYIFN-FHDRIVTERLIAIGENMTEYLNKFIISNRKVIFISEHNIEHASITSDDSLLFASAFIILIESLIFSKQVSHTRTSEF-------LKSVKIKQANEICLNLNSINCKINEEITAKYFSNLKLNSNFMLLNETMCTAILDDCTRHLTKFLNYEDLIKAQVIFAIIAPPKHREILLIIDYITDEYYIINPTTLCGTESSQSVCRIFRNKINEIKETPGRVINMGDIPNDIRGSGLVSKLLICAFIKNYANDLPLINLNLREIGTNISS-LLP----IISESKKILMSDLTKITKIKKSNLNLEQRKLKIDELLHDVSHLTVNEI-IDIIVRQFPDRVKSKHKPYLG--------NKTRPQKINKGILVRKFMTDMKWTVDKVFNNTNVSIRPTFQKIVLNFLIKSPFHGNWNNPLNFCKRSKQALNLELFSNFEIMFQLKRADNTSPGIDGIQYRDLLLLDPEGKLLTFLFNKIIVNKIIPDSWKSFKTLLTPKPNKDGKYDDVSSWRPIALLSVIYKVFASCLASRLTYWINTNNLLHIGQKGGSRHDGCVEHNSILSAALEHSKYSKNCPLAIAWLDIKDAFGSVPHDYMWSLLRFIGVGEDFTSILQLLYSDTSTFYSCGPILTPNIPIKQGVKQGCPISMILFALAINPVLEAVSRSDCEPFMIGESPVKILAYADDIALIAKSVEDLQKITNIAVRTAAEIGFEYRPEKCGYLQLPKVHIDSEILINNTKIKKLLSKEFYQYLGVPVGEEPDQSPYDILDKAVSDTRKLANSELLGWQKLKAYKIFVHSRLIFPFRTREIKTGALSKSNGNNNRSVNISSQLRGCFRKMLCLPNQSEVSYFYNATEKGGANCVDLLDEYHTQTITHFFRLLTSECDYARQINIDSLKFVTGPRLGIKTPSLQESLDWFNGKESKPHHSGRKTRFQRARIAVAFFEKTHNISVSFIVKDHKPSLYITTAMRGTIILTSDLRKTTSKVLHMALCDSYLSKWEKSCVSNFIASAVKLSPKINKAIFRSELSEFAWNFIHRARTNTLSIFAKHHNKGD-KRLCRLCHAEDETMSHAIQSCKVHLTLGAERHNDCLKLIASNLTKNSNLIVVVDHVCSLVPNSKERVDLMITDLSRKKIFLVDMKCPCDSVNNFEQVDLLNLRKYDSLKNQIQAVKPDFKVELDTCVFGSLGSFPLKTSEILVKIGVKRTELKPLLKECALSNIAHSARRWHFHKTGILVDPEKM 6158          

HSP 2 Score: 982.245 bits (2538), Expect = 0.000e+0
Identity = 632/1548 (40.83%), Postives = 873/1548 (56.40%), Query Frame = 1
Query: 2512 HIKAKGKPKECPSDSPSLSSVLIDK---YGPHDKAKWFTDLDISCYLEWKINGDKHFALNAGIVDAIASKMEDYKPPLLERLVKCDTVLCPLNIKNKHWVLFVYEKSTAESFVMDPLPVPFEPEETALRASAVNDCFNKLFKINSKIIDNKYPTAKKQTNSNDCGPIICGYAKKISFGKTDLNKIMAQEIRKETHHRRCTIKLHEDTTHEXXXXXXXXXXXXLVGRGPAXXXXXXXXXXXXXXKENISVLVFDEVASNNLTTINEDFIIENITYKQFKDIAFIICPFRSNDVILTFFYNKLIKSSFIINTASNHHPSTDLTHVGKTLTMLFNQFCRTDIVQLSMDHSNVQ--TIEYDDLNLCKLSFKFIINSIQNTIVSDEWINCEFERHNPVSLKYMKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLIDNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGKQLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLMASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQNTDNETRKIIPSGQTLI-----------QRRHISNCLQVQIVNCNVNEAYXXXXXXXXXXXXXXXFKPYLGDNFFSPLNNTTKMIKMKGG-----YFHNRKKVVTKIINEINIETRPNMENILDQFTQSDPGEFVFN-IQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITIS-LEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCD-QVETMSHVLQSCKSNGMLINERHNSCLNKIYNSI-KSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTGNDIDPK--YLRHVN 7074
            H + K + +E  +DS ++  + I+K       D   WFTD DI  YL+  I+  +  ++N  IV+ + +   +   P+ E++ K   +LCPLNI   HW+LFVY K++  S+ +DP+            A  V    N+LF + + +  + +     Q NS DCGP IC YA  IS             IRKE H     ++L      +        +   ++ R     +      F       IS L+ +   +   TT+    +  NI  +    +  II P  SN  +  F ++ ++ +S+I N   +   +  L  +G+ +T   N+F  ++   + +   N++  +I  DD  L   +F  +I S+  +         EF       LK +K       C+NL  IN K++     +YF +L  N+ ++L+NET+  A+++DC +H+++FL+  +L  A +IFAI  PP   + LL+IDY     +++NPT+            I  N I+  ++  G    M   P+DIR SGL S +LIC++I NY  D  L  ++L  +G  + S LLP      +E++KI+ S  T I           QR+   + L   + +  VNE    II       +K   KPYLG        N T+  K+  G     +  + K  V K+ N  N+  RP  + I+  F    P +  +N    FC RS + L L   S  +++F+LK A  +SPG D I YRDL LLDPEGKLL   FNKII N  IPD+W+SFKTLL PKP K G Y+D SSWRPIALLS  YK+F S L+ RLT WI  N LLH GQKGGS  +GCVEHN+IL+AALEHSKYSK  PLAIAWLDIKDAFGSVPH Y+W LL +IGV   F SIL+LLY++T+++Y CGP+ TP I IK+GV+QGCPISMILF LAINPVL A+S S    + IG S ++ILAYADDIALIA S  DLQ + N +V  A+ IG E+RPEKC YL        + I++NN K+KKL  +EFYQYLGVPVGE P+Q+PYD L+K ++DT KL NS+L  WQK+ AYK F HSRL F FRTREIK  A+ + +  ++ + N+S+QLR        LP+ +  +Y Y+    GGA  VDL DEY T +I   FRLLT +C     I  DSL+FV   R+ I  PS+ + L+W NGK    +   RKT + R R A+     T  I++S +  D   SL ITT  RGT I+ S  RK  +  LH  + ++Y        ++NFIA+     P INKAIF+S+L+  +WNFIHRARTNTL++ A+  N  +  R CR C  + ETMSH +QSCK +  L  ERHN CL  I +++ K+S  I+ +D  C LVP   +RVDL+I D     I LVD+KCP D+  NF+  + LNL KYD LK++++   P +KV L TC+FGSLGS P  T  IL +IGVK  +   L   CA+SNI +SAR WHFH TG  +DP+  Y   VN
Sbjct: 2320 HARVKRRLEETDADS-TIKIIEINKPLNESDRDPTIWFTDDDIDLYLKNHISSLEFASINCFIVEILRTNPGEDLFPIPEKVFKAQIILCPLNIDKTHWILFVYCKNSLTSYFIDPILQYRHRFVNKTYALQVTMALNQLFNLKAAMTFHPHENFLYQENSYDCGPYICAYAMMISGVWQSFPDHFIDIIRKEVHQ----LQLFCQQKDQKVSGGGLPKFMEIIKRNGVRKTSSATQAFLLKQNSCISSLLDEYFKTRTNTTVLSPELTMNIVAQISHSLEKIIVPNDSNWSL--FVFSLVLGNSYIFN-FHDRIVTERLIAIGENMTEYLNKFIISNRKVIFISEHNIEHASITSDDSLLFASAFIILIESLIFSKQVSHTRTSEF-------LKSVKIKQANEICLNLNSINCKINEEITAKYFSNLKLNSNFMLLNETMCTAILDDCTRHLTKFLNYEDLIKAQVIFAIIAPPKHREILLIIDYITDEYYIINPTTLCGTESSQSVCRIFRNKINEIKETPGRVINMGDIPNDIRGSGLVSKLLICAFIKNYANDLPLINLNLREIGTNISS-LLP----IISESKKILMSDLTKITKIKKSNLNLEQRKLKIDELLHDVSHLTVNEI-IDIIVRQFPDRVKSKHKPYLG--------NKTRPQKINKGILVRKFMTDMKWTVDKVFNNTNVSIRPTFQKIVLNFLIKSPFQGNWNNPLNFCKRSKQALNLELFSNFEIMFQLKRADNTSPGIDGIQYRDLLLLDPEGKLLTFLFNKIIVNKIIPDSWKSFKTLLTPKPNKDGKYDDVSSWRPIALLSVIYKVFASCLASRLTYWINTNNLLHIGQKGGSRHDGCVEHNSILSAALEHSKYSKNCPLAIAWLDIKDAFGSVPHDYMWSLLRFIGVGEDFTSILQLLYSDTSTFYSCGPILTPNIPIKQGVKQGCPISMILFALAINPVLEAVSRSDCEPFMIGESPVKILAYADDIALIAKSVEDLQKITNIAVRTAAEIGFEYRPEKCGYLQLPKVHIDSEILINNTKIKKLLSKEFYQYLGVPVGEEPDQSPYDILDKAVSDTRKLANSELLGWQKLKAYKIFVHSRLIFPFRTREIKTGALSKSNGNNNRSVNISSQLRGCFRKMLCLPNQSEVSYFYNATEKGGANCVDLLDEYHTQTITHFFRLLTSECDYARQINIDSLKFVTGPRLGIKTPSLQESLDWFNGKESKPHHSGRKTRFQRARIAVAFFEKTHNISVSFIVKDHKPSLYITTAMRGTIILTSDLRKTTSKVLHMALCDSYLSKWEKSCVSNFIASAVKLSPKINKAIFRSELSEFAWNFIHRARTNTLSIFAKHHNKGD-KRLCRLCHAEDETMSHAIQSCKVHLTLGAERHNDCLKLIASNLTKNSNLIVVVDHVCSLVPNSKERVDLMITDLSRKKIFLVDMKCPCDSVNNFEQVDLLNLRKYDSLKNQIQAVKPDFKVELDTCVFGSLGSFPLKTSEILVKIGVKRTELKPLLKECALSNIAHSARRWHFHKTGILVDPEKIYKNEVN 3837          

HSP 3 Score: 511.146 bits (1315), Expect = 4.296e-145
Identity = 296/627 (47.21%), Postives = 402/627 (64.11%), Query Frame = 1
Query: 5149 LLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITISLE-YDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKC-DQVETMSHVLQSCKSNGMLINERHNSCLNKIYNSIKSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWH 7023
            LL ++ V   FV+IL+LLYT+T+S+Y C  + TP I IK+GV+QGCPISMILF +AINPVL AIS S    + IG S +Q+LAYADDIALIA +++DLQ L + +V  A+ IG E+RPEKCAY+       ++ II+NN+ +KKL   EFYQYLGVP+G++ +Q+PY+ + K++ DT K+ +S+LY WQK+ AYK F HSRL F FRTREIK S++  PS     N N ++QLR  +    NLP+++  +Y Y+    GGA  +DL DEY   +IV  FRLLT  C  +  I   SL  V S R+ I +P++SQ  EWINGK    N   RKT + R R +I++      I +  E +    S+RITT   GT II    RK L+  LH  +Q++Y         +N + +    CPL+NK IF+S L    W FIHRARTNTL+  A+P    E  R CR+C  + ET+ HVLQ+CK N  L   RHN+CL KI +SIK  E ++ +D KC LVP   +RVDL+I + +  TI LVD+KCP+D+  NFQ  + LNL+KY +LK  ++   P YKV L TCI GSLGS+ S T+ +L ++GV+    + L   CAISNI +SA+I +
Sbjct:  902 LLKHMNVSAEFVNILQLLYTDTSSFYQCNQMQTPDIPIKKGVKQGCPISMILFAIAINPVLEAISLSNTTPFMIGTSSVQVLAYADDIALIATNSSDLQTLFDLAVNTANMIGFEYRPEKCAYIQYPKVETVSEIIVNNVLIKKLASNEFYQYLGVPIGDNNDQSPYEIVSKVIEDTKKIAHSNLYGWQKLKAYKIFLHSRLIFPFRTREIKTSSLADPSRNSRINVNNTSQLRGHLRKILNLPNHSEISYFYNSTENGGAGCIDLLDEYHAQTIVYFFRLLTSTCGYSKTINTFSLLSVTSPRLGIKNPTLSQSFEWINGKESKLNHSGRKTRFHRLRTSIEYFSRVHNIKLLFEIFKQNPSIRITTSLTGTRIILPKLRKTLSKILHSALQDSYLSKWEMSNHSNLLVSAIKLCPLVNKLIFRSDLTDFEWKFIHRARTNTLSTFAKPHKAGE-DRLCRRCKSEDETILHVLQTCKINQSLATNRHNACLIKIKDSIKDPELLVVVDHKCSLVPTSSERVDLIITNTEKKTILLVDMKCPIDSVSNFQLVDNLNLEKYAKLKLDIQAVKPDYKVELHTCIMGSLGSIESKTEDLLLKMGVQPNRIIGLMKECAISNIAHSAKICY 1527          

HSP 4 Score: 232.646 bits (592), Expect = 2.062e-60
Identity = 215/766 (28.07%), Postives = 356/766 (46.48%), Query Frame = 1
Query: 2470 SENDVNSPFYKTRSHIKAKGKPKECPSDSPSLSSVLIDKYGPHDKAKWFTDLDISCYLEWKINGDKHFALNAGIVDAIASKMEDYKPPLLERLVKCDTVLCPLNIKNKHWVLFVYEKSTAESFVMDPLPVPFEPEETALRASAVNDCFNKLFKINSKIIDNKYPTAKKQTNSNDCGPIICGYAKKISFGKTDLNKIMAQEIRKETHHRRCTIKLHEDTTHEXXXXXXXXXXXXLVGRGPAXXXXXXXXXXXXXX------------KENISVLVFDEVASNNLTTINEDFIIENITYKQFKDIAFIICPF-RSNDVILTFFYNKLIKSSFIINTASNHHPSTD-LTHVGKTLTMLFNQFCRTDIVQLSMDH--SNVQTIEYDDLNLCKLSFKFIINSIQNTIVSDEWINCEFERHNPVSLKYMKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLIDNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGKQLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLMASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQNTDN---ETRKIIPSGQT----LIQRRHISNCLQVQIVNCNVNE---AYXXXXXXXXXXXXXXXFKPYLGDNFFSPLNNTTKMIKMKGGYFHNRKKVVTKIINEINIETRPNMENILDQFTQSD-PGEFVFNIQEFCIRSNKPLELCKISARDVIFELK 4686
            +EN +N P  +T      K +P + P  +     + ID   P     WF+D DI  YLE  I+   H  +   IV+ + S +++    +   +   + V CPLNI N HW+LFVY K +  S+ +DP+            A  +N   N+ F +   +  N +   + Q N+ DCGP  C YA  IS    +       EIR++ H  +  I  +     +   D+        + +   +++                       + N  V V     + NL   N  FI +N+T+KQFK I  I+C   + ND  L F Y+  +K S I +   N+    D L  +G  +T   NQFC    V+   +H  S+   + YD          FI+  +   +V +  ++C+ + ++ +S    ++ I  SP +NLI  N K+    I  YF  L  ++ Y+L+N+TL  A+ +DC Q++ +FL+   +  A++IFAI +PP   + LLVIDY     + L+P ++  N +     + +L  I+  R++ G A       HDIR SGL S ILIC++I NY  D SL  ++L  +G K+ + L+P    +DN   E R+ + +  T    L QR+   N L +++ + +V+E        I  +     K   +PYLG N     +N  K       +  N K  V +I+N+ +++  P  + I++ FT  D   E   N+ +F   S++ LEL  IS+ +++  L+
Sbjct:  155 TENIINPPTKRTGQRNTVKKQPSK-PKSTICNKYLTIDNRDPE---IWFSDNDIDWYLETHIDNPMHGYIKCFIVNILCSNIQEEVVAIPRSISNAEMVFCPLNINNSHWILFVYCKKSHSSYFIDPILKNRNNLINKQMALKINIALNRFFNLKIAVSYNPHKNLQYQNNNFDCGPFTCAYAVIISKDLDNFPDGFIDEIRRDVHRNQMEIVGNLIGGAKTKMDMSSFNFKKAIAKTKYSNNNPNTSNILKSQMPCVSSIICKYYENNCKVSVLSAELTLNLIAQNNSFIADNVTFKQFKGIDHILCVIPKFNDWCL-FIYSIKLKHSHIFDF--NYRTIDDRLIEIGVKITDYLNQFCLNRNVKFLANHHISHSTFLSYDTQYFAS---HFIV--LFEKLVYEIHLDCD-KVYDSLSKIKTETCIEISPGLNLI--NGKITDKIIFEYFNKLKLSDDYILLNKTLCVAISDDCTQYMKDFLNIECMMKANVIFAIISPPKVHELLLVIDYSTEEHYYLSPVTTDLNVVSQHFCYNILIKINELRESKGRAMKTGHCRHDIRGSGLLSKILICAFIRNYALDYSLENINLHEIG-KIVNSLVPIVDGSDNVKYEKREKLKAANTLKLNLKQRKEKINQLLIKLSDSDVDEIICTILTNIPRLENLQKKEFREPYLGKN-----HNKPKKFNNPAEFLTNMKTTVYRILNDQSVKIEPPFQEIINNFTHPDVVPECWDNVLKFSKISDELLELTYISSEEILSNLQ 899          

HSP 5 Score: 112.079 bits (279), Expect = 6.436e-24
Identity = 94/341 (27.57%), Postives = 158/341 (46.33%), Query Frame = 2
Query: 1319 DTAFDLSSGGTDILFNLGHVNFRKLSKSNDELRETLQIANPKEKLIKLIFLLNCLKIEGKISKTSKISNNNFAYNYILNCGILHSVAPVDELLNFRIENFEMWGPKIIRAINNQSHYHNACSEMLLVLRKHKLQLEYDSLNNVIDPAVISKINSLYKFIEKNSDDIKCQYNYANNLLEQSKLFKSKNHDKELTSNAVSFTLNILRFKINLLNRGLAKCFYSKSTKTTCPYDQGIVITELIDFLTAFLLKKFKDEIPKFALPLKPETVKSAKVGDR---VTLVSSVKLKKTNPNKITLVPTGDYVSEPVVESNMNLYSETKVVSNKTFDARTHKHIIQILPP 2332
            D+    + G +DI+FN+G +NF ++   +  LR         EK+I +IF LN L+++G++  TS  S  ++  NY+ NC  L+S++ ++      +++ E W P +++ I   S++ ++C E LL LR+ KL  E D L+N+I P  I  INS    I+K       Q     NL       KS++ D E  + A  F   ++R K     + L    YS      C   + I  T +   + + +           +L L      S ++ DR   +TLV++ +  + NP       +G  V   VVE+    ++  K    +T DA +   II+I  P
Sbjct: 2027 DSDHSQAEGESDIMFNIGLLNFHEMCLKDTTLRSLFSQPKSSEKIIDIIFHLNYLQLKGELKHTSNGSYGDYVCNYVRNCLSLYSISDIETYNTLSLDDIEKWAPSLVKMIPPLSYHSSSCKERLLRLRQWKLSSEIDRLHNIISPETIKHINSFLSEIDKVYPQKATQLKALQNLSNNINAIKSES-DDETIAKAKRFLFFVIRDK----KKFLLGSKYSYRIPLHC---RKICTTNMTTHIFSIIQ----------SLLLTLTVSDSEQIADRSDSITLVNTTQSVEINPG----TQSGSIVDISVVETGSTEHARVKRRLEET-DADSTIKIIEINKP 2344          

HSP 6 Score: 109.768 bits (273), Expect = 3.617e-23
Identity = 87/320 (27.19%), Postives = 148/320 (46.25%), Query Frame = 2
Query: 1319 DTAFDLSSGGTDILFNLGHVNFRKLSKSNDELRETLQIANPKEKLIKLIFLLNCLKIEGKISKTSKISNNNFAYNYILNCGILHSVAPVDELLNFRIENFEMWGPKIIRAINNQSHYHNACSEMLLVLRKHKLQLEYDSLNNVIDPAVISKINSLYKFIEKNSDDIKCQYNYANNLLEQSKLFKSKNHDKELTSNAVSFTLNILRFKINLLNRGLAKCFYSKSTKTTCPYDQGIVITELIDFLTAFLLKKFKDEIPKFALPLKPETVKSAKVGDR---VTLVSSVKLKKTNPNKITLVPTGDYVSEPVVESNMNLYSETK 2269
            D+    + G +DI+FN+G +NF ++   +  LR         EK+I +IF LN L+++G++  TS  S  ++  NY+ NC  L+S++ ++      +++ E W P +++ I   S++ ++C E LL LR+ KL  E D L+N+I P  I  INS    I+K       Q     NL       KS++ D E  + A  F   ++R K     + L    YS      C   + I  T +   + + +           +L L      S ++ DR   +TLV++ +  + NP       +G  V   VVE+    ++  K
Sbjct: 4347 DSDHSQAEGESDIMFNIGLLNFHEMCLKDTTLRSLFSQPKSSEKIIDIIFHLNYLQLKGELKHTSNGSYGDYVCNYVRNCLSLYSISDIETYNTLSLDDIEKWAPSLVKMIPPLSYHSSSCKERLLRLRQWKLSSEIDRLHNIISPETIKHINSFLSEIDKVYPQKATQLKALQNLSNNINAIKSES-DDETIAKAKRFLFFVIRDK----KKFLLGSKYSYRIPLHC---RKICTTNMTTHIFSIIQ----------SLLLTLTVSDSEQIADRSDSITLVNTTQSVEINPG----TQSGSIVDISVVETGSTEHARVK 4644          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Planmine SMEST
Match: SMESG000044750.1 (SMESG000044750.1)

HSP 1 Score: 837.025 bits (2161), Expect = 0.000e+0
Identity = 567/1510 (37.55%), Postives = 809/1510 (53.58%), Query Frame = 1
Query: 2569 SVLIDKYGPHDKAK-WFTDLDISCYLEWKINGDKHFA-LNAGIVDAIASKMEDYKPPLLERLVKCDTVLCPLNIKNKHWVLFVYEKSTAESFVMDPLPVPFEPEETALRASAVNDCFNKLFKINSKIIDNKYPTAKKQTNSNDCGPIICGYAKKISFGKTDLNKIMAQEIRKETHHRRCT-------IKLHEDTTHEXXXXXXXXXXXXLVGRGPAXXXXXXXXXXXXXXKENIS----VLVFDEVASNNLTTINEDFIIENITYKQFKDIAFIICPFRSNDVILTFFYNKLIKSSFIINTASNHHPSTDLTHVGKTLTMLFNQF-CRTDIVQLSMDHSNVQ-TIEYDDLNLCKLSF-KFIINSIQNTIVSDEWINCEFERHNPVSLKYMKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLIDNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGKQLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLMASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQ---------NTDNETRKIIPSGQTLIQRRHISNCLQVQIVNCNVNEAYXXXXXXXXXXXXXXXFKPYLGD-NFFSPLNNTTKMIKMKGGYFHNRKKVVTKIINEINIETRPNMENILDQFTQSDPGEFVFN-IQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITISLEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCD-QVETMSHVLQSCKSNGMLINERHNSCLNKIYNSI-KSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSA 7011
            S +I+ + P   +K WFTD DI  YLE  I G+ +FA +   IV+ ++S  +D   P+ + L K   +LCPLNI   HW+LFVY K T  S+ +DP+            A  +    N LF + + +  + +     Q N+ DCGP IC YA  IS           + IRK  H  + T       +     TT  I     ++     V      S  +      +  KE  S    + V     + NL   N  FI EN+T++QF  + ++I    +N     + ++ + + S+I+N       +  L  +G+++T   N+F   T  V   ++H+    +   +D +    +F +F+       I+S   IN         +LK  K  I    C NL  IN  ++   I +YFK+L   + ++ +N  +  A+++DC +++++FL+   +  A +IFAI  PP   + L +IDY     +LLNPT+   N  +     IL N I+  R++      +   PHDIR SGL S IL CSY+SNY  D  L  ++L  +GN + S L   ++         N  NE  K     + L +R++  + L +++   NV+E  + I+  I+   +  + KPYLG+        N + +IK    +  + K  V  IIN+++++ RP  ENI+  F    P +  ++ + +FC RS+K L+L  +S  +V+FELK A  +SPG D I YRD+ +LDPEGKLL   FNKII    IPD+W+SFKTLLIPKPGK   Y+  SSWRPIALLS  YK+F S L+ RLT WI  N LLH GQKGGS  +GCVEHN+IL++ALEHSKYSK + LAIAWLDIKDAFGSVPH Y+W LL ++G+  +F                              +R  C            P ++  S+  I  Y            ADDIAL++ SA DLQ + N +V IAS IG E+RPEKC Y+        + I++N+IK+K+L   EFYQYLGVPVGE  +Q+PYD L+K+++DT K+ +SDL  WQK+ AYK F HSRL F FRTREIK SA+ + +A ++   N+++QLR+      +LP+++   Y Y+    GGA  VDL DEY T +I   FRL T  C     +  DSL FVA  R+ I +P++ +  +WINGK      L      I F                +   G  SL IT+E RGT I+    RK  +  LH  + ++Y      G  +NFIA+     P INKAI++  ++  +W+FIHRARTNTL++NA+P N     R CR C  + ETMSH++QSCK +  L +ERHN CL+ I + + K+S  I+ +D  C LVP   +RVDL+I D     I LVD+KCP DT  NF   + LNL KYD LK +++ A P +KV L TCI GSLGS P  T  +L +IGV       L   CA+SNI +SA
Sbjct:  575 STIINYFFPERDSKIWFTDDDIDYYLESHI-GNPNFAPVKCFIVEILSSNCKDEIFPMPDNLFKAQIILCPLNINESHWILFVYCKLTKISYFVDPILRKRHNLINKSSALKITMALNNLFDLKAAVTFHPHDNLVYQDNNYDCGPYICAYAMLISQEWQTFPNNFIETIRKNVHRFQTTKFQNPPKVSGGGITTLPIFKRFQNVTRPPKVALPYKASLQNENSCVPSLIKEWFSNKNYITVLSPELTLNLIAHNNQFIRENVTFRQFTRVKYVIAVIPNNPHWCLYIFSPVFERSYILN-FQGRVLNERLVSIGESMTDYLNKFFISTAKVNFIVNHNQTHFSFRSEDPHFFASAFIEFLERFFFGNILSFTKIN--------ETLKSFKLQITNEICKNLNLINFSINDEIITKYFKNLNLPSNFIFLNSAMCTAIMDDCTRYLNDFLNYDNIIKAQVIFAIVAPPKFREILFIIDYNTDEYYLLNPTTLLGNDNYLSACKILRNKINEIRESPNRVLNLGECPHDIRGSGLVSKILFCSYMSNYANDLPLVGLNLRQIGNVVSSFLPIASEIKKVKSLENNIINENHK---ESENLKERKNKIDQLLIKVFKMNVDEIVDVILDQISIKTVPTH-KPYLGNIERVESQINKSNLIK---NFNSDMKTTVNTIINDVDVDVRPKYENIITNFEHKIPLDRTWDKVFKFCNRSDKKLKLKYLSNSEVLFELKKADNTSPGVDGIKYRDMVMLDPEGKLLTFLFNKIISKKVIPDSWKSFKTLLIPKPGKGDKYDSVSSWRPIALLSVIYKLFASCLANRLTFWINKNNLLHIGQKGGSRHDGCVEHNSILSSALEHSKYSKNSQLAIAWLDIKDAFGSVPHAYMWSLLRFVGLMRNF-----------------------------SIRSKC-----------EPFMIGESAVKILAY------------ADDIALVSKSAYDLQRVTNIAVVIASEIGFEYRPEKCGYIELPLIETESQILINDIKIKRLASSEFYQYLGVPVGEETDQSPYDILKKVVSDTRKIADSDLLGWQKLKAYKIFLHSRLVFPFRTREIKTSALAKSNANNNLTVNITSQLRNCFRKMMSLPNHSEVCYFYNSTENGGANCVDLLDEYHTQTITHFFRLFTSNCEFARKVNLDSLNFVAGPRLGIINPTLQETFDWINGKEPKLGILKEHDINIMF----------------IVTKGQPSLFITSEKRGTFILTPDLRKTTSKVLHMALCDSYLSKWEKGCTSNFIASAIKLSPKINKAIYRGDISEFAWHFIHRARTNTLSINAKPHNKGN-KRLCRLCHTEDETMSHIIQSCKIHLTLGSERHNDCLDLISSHLSKNSNLIVVVDHVCSLVPESKERVDLMITDMVRKKIFLVDMKCPCDTINNFALVDLLNLQKYDSLKIQIQDAKPDFKVELDTCIIGSLGSFPPKTPELLLKIGVDRTHLKGLLKDCALSNISHSA 1998          

HSP 2 Score: 245.358 bits (625), Expect = 2.387e-64
Identity = 145/333 (43.54%), Postives = 196/333 (58.86%), Query Frame = 1
Query: 6049 ASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITIS-LEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKCD-QVETMSHVLQSCKSNGMLINERHNSCLNKIYNSI-KSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTG 7038
            A  R+ I +P++ +  +WINGK        +KT + R R AI   +    I I  +   G  SL IT+E RGT I+    RK  +  LH  + ++Y      G  +NFIA+     P INKAI++  ++  +W+FIHRARTNTL++NA+P N     R CR C  + ETMSH++QSCK +  L +ERHN CL+ I + + K+S  I+ +D  C LVP   +RVDL+I D     I LVD+KCP DT  NF   + LNL KYD LK +++ A P +KV L TCI GSLGS P  T  +L +IGV       L   CA+SNI +SAR WH+H TG
Sbjct: 2049 ADRRLGIINPTLQETFDWINGKEPKPWHSGKKTRFQRARIAIAFFKKEHDINIMFIVTKGQPSLFITSEKRGTFILTPDLRKTTSKVLHMALCDSYLSKWEKGCTSNFIASAIKLSPKINKAIYRGDISEFAWHFIHRARTNTLSINAKPHNKGN-KRLCRLCHTEDETMSHIIQSCKIHLTLGSERHNDCLDLISSHLSKNSNLIVVVDHVCSLVPESKERVDLMITDMVRKKIFLVDMKCPCDTINNFALVDLLNLQKYDSLKMQIQDAKPDFKVELDTCIIGSLGSFPPKTPELLLKIGVDRTHLKGLLKDCALSNISHSARRWHYHKTG 2380          

HSP 3 Score: 80.1073 bits (196), Expect = 2.892e-14
Identity = 58/171 (33.92%), Postives = 97/171 (56.73%), Query Frame = 2
Query: 1319 DTAFDLSSGGTDILFNLGHVNFRKLSKSNDELRETLQIANPKEKLIKLIFLLNCLKIEGKISKTSKISNNNFAYNYILNCGILHSVAPVDELLNFRIENFEMWGPKIIRAINNQSHYHNACSEMLLVLRKHKLQLEYDSLNNVIDPAVISKINSLYKFIEKN----SDDIK 1819
            D+ F      +DI+FNLG  +   LS+ ++ L   L      EKL+ +IF LN L ++G    ++  +  N+  NYI+NC ++HS+  ++      ++N E+W PK+I  I + SHY ++C E LL L++ KL  E D ++N+I P  I +IN+    I+++    SD +K
Sbjct:  289 DSQFPQVESESDIMFNLGLFSLFGLSEYDNILNSLLCQPKSPEKLVDIIFHLNYLNLKGNFDTSTNGTYENYVQNYIVNCLLMHSILDLETCAFLNLDNIELWAPKLIELIPSSSHYSSSCKEKLLKLKQWKLSGEIDRIHNIICPETIIEINNFLGVIDQSYTTKSDQLK 459          
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Planmine SMEST
Match: SMESG000079542.1 (SMESG000079542.1)

HSP 1 Score: 731.095 bits (1886), Expect = 0.000e+0
Identity = 454/1129 (40.21%), Postives = 624/1129 (55.27%), Query Frame = 1
Query: 3688 MKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLIDNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGKQLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLMASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPPNQNTDNETRKIIPSGQT-----LIQRRHISNCLQVQIVNCNVNEAYXXXXXXXXXXXXXXXFKPYLGDNFFSPLNNTTKM----IKMKGGYFHNRKKVVTKIINEINIETRPNMENILDQFTQSDPGEFVF-NIQEFCIRSNKPLELCKISARDVIFELKLASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTLLIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHPGQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHVYLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGCPISMILFNLAINPVLVAISSSPIAKYKIGNSXXXXXXXXXXXXXXXNSATDLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDXXXXXXXXXXXXXXXDREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKTFFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHNAPKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDSLRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLRNTRGITISLE-YDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYHMDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVNARPQNTSEISRKCRKC-DQVETMSHVLQSCKSNGMLINERHNSCLNKIYNSIKSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFNFQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILREIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTG 7038
            M+S +     VNL   N K+    +  +F +L  +  ++ +N T++ A++NDC Q+++EFL+   L  A I+FAI  PP + + L++IDY     +LLNP+S   N  +  TA  LL  I+  ++      L+ + PHDIR SGL S ILIC++I +Y  D SL  ++L  +GN +KS LLP  Q+   ++ K +   +T     L QR  + N   + + N NVNE                              N  +K+     K K  +  N K  V KI+    I  RP +++ +  F+  DP    + N+  F ++S   L L  I  ++VIFELK A  +SPG D I Y D+ + D EGKLL   +NKII    +PD W+SFKTL+ PKP K   Y   SSWRPIALLS  YKIF SILSRRLT W+  N LLH GQKGGSV +GCVEHN++L++ LEHSKYSK  P+ IA+LDIKDAFGSVPH Y+W +L +IGV   F + L+LLYT T+SYY CGP+ TP I IK+GV+QGCP+SMILF +AINPVL A++ S +  +K+G S IQ+LAYADDIALIAN+  DLQ ++N + ++A  IG E+RP+KCAY+   N                             + E  NQ+PY+ L K+++D  KL +SDL  WQK+ AYKTF HSRLTF FRTR+ KISA+   SA   +N   +T+LRS +    +LP+N+  +Y Y+ +  GGA+  DL DEY T +I   FRL T  C  T  I  DSL+FV   R+  ++PS+     WIN     +    +K  + R R AIQ    T  ITIS E  D    LR+TT  +GT I+   DR+ ++  LH  + ++Y    +N    N I N                                              R CR+C +Q +T+ HVLQ+CK +  L   RHN CL +I + +KS   I+ +D  C LV    +RVDL+I +N+  TI +VDIKCP D+   F+  NK NL KYD LK +++ A PS+ V + TCI GS+GS       +L ++GV                 E  A  WH+H TG
Sbjct:   38 MQSKMCDEVVVNLNLKNCKITEKLLTNFFINLNLSENFIFLNNTMTTAILNDCTQYLNEFLNIENLMKAKIVFAIIAPPKARQILMIIDYTSDEYYLLNPSSLLPNFPYLCTANSLLTKINEIKETPELCKLLGNCPHDIRGSGLVSRILICAFIKHYALDLSLENINLRDIGNVIKS-LLPTVQDIIKDSIKNLSDNKTNVKLKLDQRILLINENLLTLQNANVNEMR----------------------------NKISKVPHPKTKFKSSFNQNMKTTVYKILVNDEINERPTIKDHIGNFSHQDPLIKSYDNVLNFGLKSQNTLLLRPIIDKEVIFELKRADNTSPGIDGIKYNDIAIHDAEGKLLTLLYNKIITENTMPDVWKSFKTLMTPKPDKGDKYNLISSWRPIALLSVVYKIFASILSRRLTSWVNRNNLLHIGQKGGSVHDGCVEHNSVLSSCLEHSKYSKDTPIIIAFLDIKDAFGSVPHDYMWKILRHIGVGEKFTNTLKLLYTETSSYYTCGPIVTPNIPIKQGVKQGCPLSMILFAIAINPVLQAVTLSKVKPFKVGESSIQVLAYADDIALIANNTHDLQNILNIAFDVAREIGFEYRPKKCAYIQLPNVE--------------------------TISEHTNQSPYEILNKVVSDAKKLADSDLCGWQKLKAYKTFLHSRLTFPFRTRDFKISAL---SANVCNN---TTKLRSHLRKMMSLPNNSEISYFYNSVENGGASCTDLLDEYHTQTISYFFRLFTANCEFTKRINIDSLKFVTGPRLGNNNPSLQDSFNWINKADIIQKHSGKKPRFFRPRTAIQFFEKTHNITISFEVLDDKPILRLTTALKGTIILTEFDRRLVSKVLHLALYDSYFTKWKNS---NIIEN----------------------------------------------RLCRRCHNQDKTLPHVLQNCKVHLTLALNRHNDCLQQIVHHLKSPSLIVVVDHTCSLVSNSKERVDLIITNNEKKTILMVDIKCPFDSLVTFETVNKENLAKYDSLKKQIQAAKPSFTVDIFTCIIGSMGSYD-----LLGKMGVP---------------FEKVALSWHYHVTG 1036          
The following BLAST results are available for this feature:
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 5
Match NameE-valueIdentityDescription
CU462878.32.582e-2130.84pep chromosome:GRCz11:17:8542203:8545189:1 gene:EN... [more]
BX649434.35.081e-1929.43pep chromosome:GRCz11:3:18270043:18274691:-1 gene:... [more]
CABZ01033394.12.180e-1824.95pep chromosome:GRCz11:13:6032844:6038477:-1 gene:E... [more]
si:dkey-187k19.22.226e-1829.79si:dkey-187k19.2 [Source:ZFIN;Acc:ZDB-GENE-131121-... [more]
BX323564.18.095e-1829.04pep chromosome:GRCz11:2:10597810:10600950:-1 gene:... [more]
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSXETT00000032317.11.224e-2533.47pep primary_assembly:Xenopus_tropicalis_v9.1:1:130... [more]
ENSXETT00000040456.11.349e-2533.47pep primary_assembly:Xenopus_tropicalis_v9.1:8:873... [more]
efcab146.456e-2233.92EF-hand calcium binding domain 14 [Source:Xenbase;... [more]
ENSXETT00000045854.17.413e-2234.92pep primary_assembly:Xenopus_tropicalis_v9.1:1:730... [more]
ENSXETT00000002428.16.864e-2134.50pep primary_assembly:Xenopus_tropicalis_v9.1:3:159... [more]
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 5
Match NameE-valueIdentityDescription
sp|Q03278|PO21_NASVI5.079e-2326.98Retrovirus-related Pol polyprotein from type-1 ret... [more]
sp|P16423|POLR_DROME3.607e-2225.47Retrovirus-related Pol polyprotein from type-2 ret... [more]
sp|Q03274|PO22_POPJA1.217e-1928.51Retrovirus-related Pol polyprotein from type-1 ret... [more]
sp|P14381|YTX2_XENLA1.303e-1531.19Transposon TX1 uncharacterized 149 kDa protein OS=... [more]
sp|O00370|LORF2_HUMAN7.025e-1526.55LINE-1 retrotransposable element ORF2 protein OS=H... [more]
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A159X4Q61.799e-14273.04ORF2-like protein (Fragment) OS=Schmidtea mediterr... [more]
Q7YXU72.193e-7630.19ORF2 (Fragment) OS=Girardia tigrina OX=6162 PE=4 S... [more]
A0A3L8D8E56.714e-7529.08Reverse transcriptase domain-containing protein OS... [more]
A0A3L8DAI75.103e-7429.69Reverse transcriptase domain-containing protein OS... [more]
W4Y7A76.281e-7428.10Uncharacterized protein OS=Strongylocentrotus purp... [more]
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSAMXT00000034560.11.528e-3326.73pep primary_assembly:Astyanax_mexicanus-2.0:APWO02... [more]
ENSAMXT00000034831.16.881e-2329.49pep primary_assembly:Astyanax_mexicanus-2.0:3:1621... [more]
ENSAMXT00000052982.13.738e-2129.28pep primary_assembly:Astyanax_mexicanus-2.0:APWO02... [more]
ENSAMXT00000033573.18.124e-2131.63pep primary_assembly:Astyanax_mexicanus-2.0:3:3140... [more]
ENSAMXT00000029797.11.056e-1928.64pep primary_assembly:Astyanax_mexicanus-2.0:20:119... [more]
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 1
Match NameE-valueIdentityDescription
EDO478466.965e-834.86Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7... [more]
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSORLT00000039772.16.711e-2128.21pep primary_assembly:ASM223467v1:15:12930252:12933... [more]
ENSORLT00000042085.17.507e-2128.83pep primary_assembly:ASM223467v1:19:3759893:376362... [more]
ENSORLT00000045648.17.831e-2128.83pep primary_assembly:ASM223467v1:16:7714608:771833... [more]
ENSORLT00000045921.11.342e-2032.52pep primary_assembly:ASM223467v1:18:29376025:29377... [more]
ENSORLT00000029493.11.997e-2027.86pep primary_assembly:ASM223467v1:9:31095018:310979... [more]
back to top
BLAST of Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match NameE-valueIdentityDescription
SMESG000062936.12.836e-9173.70SMESG000062936.1[more]
SMESG000021486.13.611e-14541.13SMESG000021486.1[more]
SMESG000021486.14.296e-14541.13SMESG000021486.1[more]
SMESG000044750.12.387e-6437.55SMESG000044750.1[more]
SMESG000079542.10.000e+040.21SMESG000079542.1[more]
back to top
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30015424 ID=SMED30015424|Name=Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2|organism=Schmidtea mediterranea sexual|type=transcript|length=8216bp
GTTCTAATATCAACCGAAATGAATCAACCTTTGAAATTAATACTGATTAC
ATGAAATTATTCCTCGATTTTCTGAAAGATGGCAAGATTTCTTCCTCCTT
GGAATTTAAATGCAGAAATTCAAATACGGAAAATCCCAAATGCAAATTGC
AAAATAATTGTGATGCTTCTTCAATTTTTAAGCAGCTGTTACAGATGAGG
AGCTCGCTCTTGGAGCTTCATAGTAATTTTAATAAGGATTTTAACGAGCT
TAGTAATTTATTTATTTCCTTAAGCTTAAAATTTAATGATGCATTTAAAA
TTTTACCAAATAAGGATAATAATCCTTGCGATTTAAAATGTAATGCGAAT
AATTATGGAAGTTCTAGGTGCATTAACTTAAATAAATTTAAAGATAAAAT
TTTTGAATCCCAGGCAAATGATCAGGACAGCCCCATATCTAATATAGGAA
TTAATAATTTAAACAAGCAGGATGCTAATTTTGATTCAAATAGAGTTCCT
GAGGCAGATAAAAATGGGGTTCATGGTAGTTTTGACTTGTGTGAATGTAA
TGGTGAAATTGATGATACTTGTCTGACGATTTAGGACAGTCCTAGTTATA
ATGCTGTAATGAAAACTGATGTTTCTAGTAATAAGAATTCTTACGCTGAT
GATTTTAAACAGAGCTCTTCAACGGGTGACTTTTTCAATGGCAAAACGAT
TAAATCTGCACCCAAATCTACATTAAACAAAAATAGCATTAAAATGATGC
CTAATAGGATAAATGTTATTATTAATAATAGAAATAAACATGTCAGTAAC
GTCAGGAAACCCACTATTAATGTCGTGAATGAGGCCATTTCACAATGTAA
ACTAAATATTATTAAAAATAAAATAGTTAATTGTTATTTTAATGATAATT
ACATTGCGTCCCCGCCAGTTTTAAGAGGCTTCTATTTAATATATCTCCAG
AGCTTAAAAGCACAGTGTTCCTCAACGACATTGTTAAGAAGACTCTGGAG
ATAAATAAGATAATTAATAAATTAAAGCGGCAAAATAGATTGCCTAACTT
TTTGGTCAACTCCAACATCAAATTCGAGAACAATGAATTTTTGTAAGAAA
CTTGTTCTGTGAAAGCTCCGATTCAGTTAGCGGGGAAACGAAGATAAGAC
CGATAACTGATTCCGAACGACATGAATTGATGATTGGTATACCTCAGCTG
CATGACAAATTGATGAGTGAGGGTGGGGACAAACTGCATAAGTCTGTTAA
TAACATTCTAAAGCATAAAAGCCTAAACCATATCTCTTTCTAACGGCTTT
TTCTATACCGCGAAATCTGACACGGCGTTTGATTTATCATCAGGAGGTAC
CGACATATTGTTCAATCTGGGTCACGTAAACTTTCGCAAACTATCAAAAT
CAAATGATGAGCTTCGAGAGACTCTACAAATTGCGAACCCAAAAGAAAAA
TTGATAAAATTAATATTTTTATTAAATTGCCTAAAAATCGAGGGTAAAAT
ATCTAAAACCAGTAAAATATCAAACAACAATTTTGCATACAATTATATTT
TAAATTGTGGCATTCTACACTCTGTGGCGCCAGTTGATGAACTCCTCAAC
TTTCGCATCGAGAACTTTGAAATGTGGGGGCCCAAGATCATCAGAGCGAT
CAATAACCAGTCACACTACCACAATGCCTGTAGTGAAATGCTCCTTGTAC
TCAGGAAACACAAGCTCCAATTAGAATACGACTCCCTTAACAACGTTATT
GACCCTGCGGTAATTTCTAAAATCAACTCACTATATAAATTTATTGAAAA
AAATAGTGATGATATAAAATGCCAATATAACTATGCAAATAATCTACTTG
AACAAAGTAAATTATTTAAATCAAAAAATCATGATAAAGAATTGACCTCT
AATGCAGTCTCTTTTACCTTAAATATCCTCCGATTTAAGATAAATCTATT
AAATAGAGGTCTTGCAAAATGCTTCTATTCAAAGTCCACCAAAACAACTT
GTCCGTATGACCAGGGTATTGTCATCACTGAGCTGATTGATTTTTTAACA
GCTTTTCTATTGAAGAAATTTAAAGATGAGATACCAAAATTTGCTCTGCC
TCTAAAACCAGAAACTGTAAAGTCTGCTAAAGTTGGGGATCGAGTAACGC
TAGTCTCATCTGTGAAACTTAAGAAAACGAACCCAAATAAAATCACTTTG
GTCCCTACTGGTGATTATGTTTCTGAACCGGTAGTCGAATCAAATATGAA
TTTATATAGTGAAACAAAAGTTGTTTCAAATAAAACTTTTGACGCACGCA
CACACAAACATATTATTCAGATTCTGCCACCAAAATCGACTCTATCTCCT
AATGTGATGGAAGACATTAACGCTTCGTTGCCGGTAGTGAATAGTGAAAA
ATTTGAAAACAATGTACCGTTGAACAACCCGCGCCTGCTGAAATTGTGCG
AAGAACTGTGGATGAAAACAGCGAGAACGATGTGAATTCCCCATTCTATA
AGACACGCAGCCATATAAAAGCTAAGGGGAAACCGAAGGAATGTCCAAGC
GACTCGCCATCCCTATCTTCTGTTCTGATTGACAAGTATGGCCCACATGA
CAAGGCAAAATGGTTCACTGACCTTGACATTAGTTGCTATCTTGAATGGA
AGATAAATGGTGACAAACACTTCGCGCTCAACGCTGGAATAGTTGATGCG
ATAGCCAGTAAAATGGAAGACTACAAACCACCTCTTTTGGAGAGGCTTGT
TAAGTGCGACACGGTACTTTGTCCACTTAACATTAAAAACAAACATTGGG
TTCTATTTGTCTATGAAAAATCCACTGCTGAATCCTTTGTGATGGACCCA
CTTCCTGTTCCTTTTGAACCTGAAGAGACTGCATTGAGAGCGTCAGCAGT
AAATGACTGCTTTAATAAATTATTTAAAATCAATTCTAAAATCATTGATA
ATAAATATCCAACCGCCAAGAAACAGACCAATAGCAATGACTGCGGTCCT
ATTATTTGCGGATATGCCAAAAAGATCAGTTTTGGCAAAACTGACTTAAA
TAAAATTATGGCCCAAGAAATCCGCAAAGAGACACATCATCGCAGATGTA
CAATCAAACTTCATGAGGATACAACTCATGAAATCTCTGACGACATCATT
GATATCCAAACTGACTGACTCGTTGGGAGGGGGCCCGCTACTTCTTCGTT
TGATACATTTTACTTCTTTTCAACATACTTTAAAGAAAATATTAGTGTAC
TTGTTTTTGATGAAGTAGCTAGCAATAACTTGACTACCATAAATGAGGAT
TTTATAATTGAAAACATCACTTATAAACAGTTCAAGGACATTGCTTTTAT
AATATGCCCCTTTAGATCAAATGATGTCATCCTCACCTTTTTCTATAACA
AATTAATTAAGTCTAGTTTTATTATAAATACAGCGTCTAATCATCACCCC
TCAACAGACCTGACTCATGTTGGTAAGACATTGACAATGCTCTTCAACCA
ATTTTGCCGTACTGACATAGTCCAACTGTCAATGGACCATAGTAATGTTC
AGACTATAGAATATGATGATTTGAACCTCTGTAAATTATCCTTCAAATTC
ATCATTAATTCCATCCAAAATACAATTGTTTCTGATGAGTGGATTAACTG
TGAATTTGAGCGACACAATCCAGTGTCCTTAAAATATATGAAATCCCACA
TAATTGGGTCACCGTGTGTAAATTTAATTGGAATCAACGCAAAACTCGAT
GTTCCGGCCATAGATAGGTATTTCAAAAGTCTTATAGATAATAACAAATA
CGTGTTGGTGAATGAAACATTGAGTGATGCCTTGATTAATGACTGTTGGC
AACATGTATCCGAGTTTCTTGATAAGACTGAGCTTTCAAATGCGTCCATC
ATCTTTGCGATCTTTACTCCTCCATCATCTGGAAAGCAACTGCTAGTTAT
TGATTACGAAAAATGCACGTCTTTTTTGCTCAATCCAACGTCTTCTACTG
AGAATCCTCTGCATACTGACACTGCTTTCATCCTCTTGAATTTCATCTCA
TCTTTTCGTGACAACATGGGAGCTGCACCATTAATGGCTAGCCCACCTCA
TGACATCCGTGCATCTGGTCTCTTTTCTAACATCTTGATTTGCAGCTATA
TCTCAAATTACCTGACTGACAGAAGCCTCTTTAAAATGGATCTCACATCT
GTTGGAAACAAATTGAAGAGCATACTTCTGCCACCTAATCAAAATACAGA
CAACGAAACCCGTAAAATTATTCCGTCAGGGCAAACTCTAATCCAACGAC
GACATATAAGCAATTGTTTACAGGTTCAGATTGTAAATTGTAATGTGAAC
GAGGCTTATAATAAAATAATTACATCGATAAATAAAACTAATATTAAACC
TAATTTTAAACCATATTTGGGTGATAATTTCTTTTCCCCTCTAAATAATA
CTACTAAAATGATAAAAATGAAGGGGGGGTATTTTCATAACCGTAAAAAA
GTGGTCACTAAAATAATCAACGAAATTAACATTGAAACGCGACCAAATAT
GGAAAATATATTAGACCAATTCACTCAATCGGACCCAGGGGAATTTGTTT
TCAACATTCAGGAGTTCTGTATAAGATCAAATAAGCCTCTTGAACTTTGT
AAGATATCTGCTCGTGATGTCATCTTTGAACTGAAGCTAGCGAGCGCCTC
ATCTCCTGGAGACGACAATATAACTTATAGGGACCTTCGTCTTTTAGACC
CCGAGGGTAAACTTTTGGCGCGGTTTTTCAATAAAATTATTGACAATGGC
CAGATTCCGGATAACTGGCGAAGTTTTAAAACTCTTTTAATACCAAAACC
GGGTAAATCGGGAAGTTATGAGGACACTTCCTCGTGGAGGCCAATCGCCT
TATTATCCACGTGCTATAAAATATTCACTTCAATCCTCTCCAGGCGACTA
ACTGAATGGATCACTGACAATAAACTTCTTCATCCAGGCCAAAAGGGAGG
GTCCGTGTTTGAGGGCTGTGTCGAACATAATGCTATTTTAACTGCTGCCC
TTGAGCACTCCAAATATTCAAAGAAAGCCCCATTAGCAATCGCTTGGCTG
GATATTAAGGACGCGTTTGGGAGTGTGCCACATGTTTACCTATGGGATCT
CCTGGCATATATAGGGGTCGATGGACATTTTGTTTCCATTCTTCGCCTCT
TGTATACAAATACTAACTCCTATTACTGTTGCGGCCCTTTGACTACCCCT
AAAATTGACATCAAACGGGGAGTGCGCCAAGGTTGTCCAATATCTATGAT
CCTCTTTAACTTGGCCATAAACCCAGTTTTAGTCGCAATCTCATCATCGC
CTATAGCGAAATATAAAATAGGTAATTCCCAAATACAAATTCTGGCATAC
GCAGACGACATTGCGCTAATAGCCAACAGTGCCACGGATTTACAAGCACT
TATCAACAATTCGGTGGAGATTGCAAGTAGGATTGGCCTGGAATTCAGGC
CTGAAAAATGCGCCTACTTAACAACCTCAAATTCCACAGACTTAAATAAT
ATCATTTTAAATAATATAAAATTAAAGAAACTGAAAGACAGAGAGTTTTA
CCAGTACCTCGGAGTCCCTGTTGGTGAATCTCCTAACCAAACTCCTTATG
ACACTTTAGAGAAACTCTTGGCTGACACTAATAAACTCAAGAATTCCGAT
CTGTACCCTTGGCAGAAAGTCGATGCATACAAGACCTTTTTTCATTCGCG
TCTTACGTTTGCTTTTAGGACCAGGGAAATTAAAATCAGCGCCATTGGCA
GACCATCAGCCAAAGACAGCCACAATAAGAATCTGAGCACACAACTCAGA
TCTATAATATATGGCTTTTTCAACTTACCGCATAATGCTCCCAAAGCATA
CATTTACAGCCCAATACTAACTGGGGGAGCGGCATTTGTTGACCTGCACG
ATGAATACAGTACTTTGTCGATCGTGCAAGCGTTTAGACTCTTGACCTGT
AAATGTCCAATTACATCTGCCATCATACAAGATAGCCTTCGATTTGTCGC
TTCAACACGTATCAGGATTTCTGACCCCTCCATTTCGCAGTGCCTTGAAT
GGATAAATGGCAAATCTTTCAACAAAAATAGCCTATCCCGAAAGACTTGG
TGGATACGATTTCGCAATGCCATTCAACATCTGAGAAATACACGTGGTAT
CACAATTTCTCTAGAATATGATGGGTTCTTTTCGCTGAGAATTACAACTG
AACATAGGGGTACGACAATAATCTACAGCCTTGATCGTAAGAAACTTGCC
TGCTTCCTTCATAAACTTATCCAAGAGACTTATCACATGGACCTTCGCAA
TGGGAAAATAAACAACTTCATTGCCAACACATACGTAAATTGTCCATTGA
TAAATAAAGCAATTTTTAAAAGTAAGTTAAATCTAGTCTCTTGGAACTTC
ATCCATCGAGCAAGGACAAACACCTTAGCAGTAAACGCAAGACCCCAGAA
CACTAGCGAAATTTCACGCAAATGTCGTAAATGCGACCAGGTTGAAACTA
TGTCCCATGTTTTGCAGTCGTGTAAATCTAATGGGATGTTAATAAACGAG
CGCCACAATTCATGCCTAAATAAAATATATAACTCAATAAAATCGAGCGA
AAAAATAATAACATTAGACCAGAAATGTGAGCTTGTTCCAGGTGATGGGA
AAAGAGTCGATCTTCTGATTCGTGATAATAAAAATATGACGATTAAACTT
GTTGATATAAAATGCCCTTTAGATACGGAGTTTAATTTTCAAAATTCAAA
CAAATTAAACCTAGATAAATATGATGAACTAAAATCAAAGCTTGAAATTG
CGTTTCCTTCCTATAAAGTAACTCTGTCTACTTGTATTTTCGGGTCTTTA
GGTAGTGTGCCTTCTGCAACTGATTTGATTTTACGGGAGATAGGTGTCAA
GGATGAGGACAGACTTTCACTCACTGTTGGGTGCGCAATCTCTAATATAG
AATATTCGGCTAGGATCTGGCACTTTCACTCAACTGGAAACGACATTGAT
CCTAAATATCTTCGTCATGTAAATAACTAAAATCCGTATATATTTATAAA
TATTTTGATGTTTTTTTTTTCTATATATATTAATGTATATATATATATAA
AATATATGTGTATAAGTATCTGGATATCTTGAGAATAAGTTACTGAGCTC
GTTAGGGTCCTTATTTTAAAATATATGTATACATGTATATTTTGAGGTAA
CCCCACACTTTGTGGGAACCTTGTTTTTTATAGTATTTAGACTGGGATGA
CTTTGAACGCGATTGACCTGCTGCAACTGTCCTGCTAGTTGTTCTGGATC
GTGCTGCAGAGTGCGGTGCTTGGGCTGCTGTTCTATGGGCTGTGGAGTCT
CTATTTCATTTTAGATTGCTATATCTGCTCTGAAATACCGCCGATTTGCA
CGGTGACTTGCTGTGGTGGAGCTTTTTGCTTCGCTATTCTCATCTGGCGA
CTGACGGTGGATTACGGCAGATGGTAACTGACCTGGTTGGGAATTCGGCA
GCTTGAGAACTCAGCTACCGACTCAAAAGCATATCACACTGTTATATGAT
TTGCTAAAGTGGATCAGTCAGATTGATTGTACACACCTAGTGACTTGTCA
CTGTGTATCTGCACTGTCCGCTTTGCATTTCTGGCACTGCAGTTGATAGT
GAATGACTGGCAGGATGACTCGTTGGGATCGAGAGCTTCAAACAATCCAA
TATGCTGAGCTGCTCCTGTTAACTGTGATCTCACCATATGTTCTGCGGAG
ATGAAACAGTCAGATTGATTGTATGGATATGGTGACTTGTCACTGTATTT
CTGCACTGTTTCGTCTCGCGGTTCCGGTGATGCGGTTGATGCAGTGAAGA
ACAAACTCGAGATATCCCGATAACATTGTGTCTATCTATCCCATGCTGGA
TGACATTGAAAACAATTATTTTTCATTTTTGTTGTATCAAATAAATTTTT
ATCACGCAAAATCGTGAAGCGTTCCATATTTTTTGCCATTATCTACTTGC
GAAATGAGTGACAAGCGACGATTGTGAGTCCTGAAAAAAAAAATAAAATC
AACAACTCCAAAACAGTAAACAACTAAGTCACATTGGAATATATATAAAA
TATATTTTACAAATGAAAAAAGTTTAGATGAAAATGAAGAAACGAGAAAA
AAATACGCCCAACGTG
back to top

protein sequence of SMED30015424-orf-1

>SMED30015424-orf-1 ID=SMED30015424-orf-1|Name=SMED30015424-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=288bp
MWGPKIIRAINNQSHYHNACSEMLLVLRKHKLQLEYDSLNNVIDPAVISK
INSLYKFIEKNSDDIKCQYNYANNLLEQSKLFKSKNHDKELTSNAVSFTL
NILRFKINLLNRGLAKCFYSKSTKTTCPYDQGIVITELIDFLTAFLLKKF
KDEIPKFALPLKPETVKSAKVGDRVTLVSSVKLKKTNPNKITLVPTGDYV
SEPVVESNMNLYSETKVVSNKTFDARTHKHIIQILPPKSTLSPNVMEDIN
ASLPVVNSEKFENNVPLNNPRLLKLCEELWMKTARTM*
back to top

protein sequence of SMED30015424-orf-2

>SMED30015424-orf-2 ID=SMED30015424-orf-2|Name=SMED30015424-orf-2|organism=Schmidtea mediterranea sexual|type=polypeptide|length=194bp
SNINRNESTFEINTDYMKLFLDFLKDGKISSSLEFKCRNSNTENPKCKLQ
NNCDASSIFKQLLQMRSSLLELHSNFNKDFNELSNLFISLSLKFNDAFKI
LPNKDNNPCDLKCNANNYGSSRCINLNKFKDKIFESQANDQDSPISNIGI
NNLNKQDANFDSNRVPEADKNGVHGSFDLCECNGEIDDTCLTI*
back to top

protein sequence of SMED30015424-orf-3

>SMED30015424-orf-3 ID=SMED30015424-orf-3|Name=SMED30015424-orf-3|organism=Schmidtea mediterranea sexual|type=polypeptide|length=132bp
MKTDVSSNKNSYADDFKQSSSTGDFFNGKTIKSAPKSTLNKNSIKMMPNR
INVIINNRNKHVSNVRKPTINVVNEAISQCKLNIIKNKIVNCYFNDNYIA
SPPVLRGFYLIYLQSLKAQCSSTTLLRRLWR*
back to top

protein sequence of SMED30015424-orf-4

>SMED30015424-orf-4 ID=SMED30015424-orf-4|Name=SMED30015424-orf-4|organism=Schmidtea mediterranea sexual|type=polypeptide|length=1198bp
MLFNQFCRTDIVQLSMDHSNVQTIEYDDLNLCKLSFKFIINSIQNTIVSD
EWINCEFERHNPVSLKYMKSHIIGSPCVNLIGINAKLDVPAIDRYFKSLI
DNNKYVLVNETLSDALINDCWQHVSEFLDKTELSNASIIFAIFTPPSSGK
QLLVIDYEKCTSFLLNPTSSTENPLHTDTAFILLNFISSFRDNMGAAPLM
ASPPHDIRASGLFSNILICSYISNYLTDRSLFKMDLTSVGNKLKSILLPP
NQNTDNETRKIIPSGQTLIQRRHISNCLQVQIVNCNVNEAYNKIITSINK
TNIKPNFKPYLGDNFFSPLNNTTKMIKMKGGYFHNRKKVVTKIINEINIE
TRPNMENILDQFTQSDPGEFVFNIQEFCIRSNKPLELCKISARDVIFELK
LASASSPGDDNITYRDLRLLDPEGKLLARFFNKIIDNGQIPDNWRSFKTL
LIPKPGKSGSYEDTSSWRPIALLSTCYKIFTSILSRRLTEWITDNKLLHP
GQKGGSVFEGCVEHNAILTAALEHSKYSKKAPLAIAWLDIKDAFGSVPHV
YLWDLLAYIGVDGHFVSILRLLYTNTNSYYCCGPLTTPKIDIKRGVRQGC
PISMILFNLAINPVLVAISSSPIAKYKIGNSQIQILAYADDIALIANSAT
DLQALINNSVEIASRIGLEFRPEKCAYLTTSNSTDLNNIILNNIKLKKLK
DREFYQYLGVPVGESPNQTPYDTLEKLLADTNKLKNSDLYPWQKVDAYKT
FFHSRLTFAFRTREIKISAIGRPSAKDSHNKNLSTQLRSIIYGFFNLPHN
APKAYIYSPILTGGAAFVDLHDEYSTLSIVQAFRLLTCKCPITSAIIQDS
LRFVASTRIRISDPSISQCLEWINGKSFNKNSLSRKTWWIRFRNAIQHLR
NTRGITISLEYDGFFSLRITTEHRGTTIIYSLDRKKLACFLHKLIQETYH
MDLRNGKINNFIANTYVNCPLINKAIFKSKLNLVSWNFIHRARTNTLAVN
ARPQNTSEISRKCRKCDQVETMSHVLQSCKSNGMLINERHNSCLNKIYNS
IKSSEKIITLDQKCELVPGDGKRVDLLIRDNKNMTIKLVDIKCPLDTEFN
FQNSNKLNLDKYDELKSKLEIAFPSYKVTLSTCIFGSLGSVPSATDLILR
EIGVKDEDRLSLTVGCAISNIEYSARIWHFHSTGNDIDPKYLRHVNN*
back to top

protein sequence of SMED30015424-orf-5

>SMED30015424-orf-5 ID=SMED30015424-orf-5|Name=SMED30015424-orf-5|organism=Schmidtea mediterranea sexual|type=polypeptide|length=152bp
MEDYKPPLLERLVKCDTVLCPLNIKNKHWVLFVYEKSTAESFVMDPLPVP
FEPEETALRASAVNDCFNKLFKINSKIIDNKYPTAKKQTNSNDCGPIICG
YAKKISFGKTDLNKIMAQEIRKETHHRRCTIKLHEDTTHEISDDIIDIQT
D*
back to top
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: molecular function
TermDefinition
GO:0003676nucleic acid binding
GO:0008234cysteine-type peptidase activity
Vocabulary: Planarian Anatomy
TermDefinition
PLANA:0000099neuron
PLANA:0000101muscle cell
PLANA:0000103central nervous system
PLANA:0000142posterior region of the whole animal
PLANA:0000419prepharyngeal region
Vocabulary: INTERPRO
TermDefinition
IPR017448SRCR-like_dom
IPR003653Peptidase_C48_C
IPR000477RT_dom
Vocabulary: cellular component
TermDefinition
GO:0016020membrane
Vocabulary: biological process
TermDefinition
GO:0006508proteolysis