Transposon Ty3-I Gag-Pol polyprotein

Overview
NameTransposon Ty3-I Gag-Pol polyprotein
Smed IDSMED30000559
Length (bp)10607
Neoblast Clusters

Zeng et. al., 2018




▻ Overview

▻ Neoblast Population

▻ Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 



 

Overview

 

Single cell RNA-seq of pluripotent neoblasts and its early progenies


We isolated X1 neoblasts cells enriched in high piwi-1 expression (Neoblast Population), and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them based on high (Nb1) to low (Nb12) piwi-1 expression levels. We further defined groups of genes that best classified the cells parsed into 12 distinct cell clusters to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signatures was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1 and largely confirmed the cell clusters revealed by scRNA-seq.

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, followed by isolation and single-cell RNA-seq of 1,200 individual cells derived from X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population)




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of Transposon Ty3-I Gag-Pol polyprotein (SMED30000559) t-SNE clustered cells

Violin plots show distribution of expression levels for Transposon Ty3-I Gag-Pol polyprotein (SMED30000559) in cells (dots) of each of the 12 neoblast clusters.

 

back to top


 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Transposon Ty3-I Gag-Pol polyprotein (SMED30000559) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Transposon Ty3-I Gag-Pol polyprotein (SMED30000559) in cells (dots) of each of the 10 clusters of sub-leathally irradiated X1 and X2 cells.

 

back to top


 

Embryonic Expression

Davies et. al., 2017




Hover the mouse over a column in the graph to view average RPKM values per sample.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods

 

back to top
Anatomical Expression

PAGE et. al., 2020




SMED30000559

has been reported as being expressed in these anatomical structures and/or regions. Read more about PAGE



PAGE Curations: 14

  
Expressed InReference TranscriptGene ModelsPublished TranscriptTranscriptomePublicationSpecimenLifecycleEvidence
nervous systemSMED30000559SMESG000063504.1 SMESG000063503.1 SMESG000026061.1 dd_Smed_v4_3587_1_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
cephalic gangliaSMED30000559SMESG000063504.1 SMESG000063503.1 SMESG000026061.1 dd_Smed_v4_3587_1_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
neuronSMED30000559SMESG000063504.1 SMESG000063503.1 SMESG000026061.1 dd_Smed_v4_3587_1_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
whole organism asexual adult colorimetric in situ hybridization evidence
nervous systemSMED30000559SMESG000013684.1 dd_Smed_v4_14881_1_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
cephalic gangliaSMED30000559SMESG000013684.1 dd_Smed_v4_14881_1_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
non-ciliated neuronSMED30000559SMESG000013684.1 dd_Smed_v4_14881_1_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
nervous systemSMED30000559 dd_Smed_v4_14472_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
epidermisSMED30000559 dd_Smed_v4_14472_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
cephalic gangliaSMED30000559 dd_Smed_v4_14472_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
muscle cellSMED30000559 dd_Smed_v4_14472_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
parenchymal cellSMED30000559 dd_Smed_v4_14472_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
head regionSMED30000559SMESG000063504.1 SMESG000063503.1 SMESG000026061.1 dd_Smed_v6_3587_1_1dd_Smed_v6PMID:28171748
Stückemann et al., 2017
whole organism asexual adult RNA-sequencing evidence
head regionSMED30000559SMESG000070298.1 OX_Smed_1.0.11165ox_Smed_v2PMID:24238224
Kao et al., 2013
whole organism asexual adult RNA-sequencing evidence
non-ciliated epidermisSMED30000559SMESG000013684.1 dd_Smed_v4_14881_1_1dd_Smed_v4PMID:28292427
Wurtzel et al., 2017
whole organism asexual adult single-cell RNA-sequencing evidence
Note: Hover over icons to view figure legend
Homology
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: GIN1 (gypsy retrotransposon integrase 1 [Source:HGNC Symbol;Acc:HGNC:25959])

HSP 1 Score: 97.4413 bits (241), Expect = 1.371e-19
Identity = 62/248 (25.00%), Postives = 119/248 (47.98%), Query Frame = 1
Query: 7582 REEIRKGTNQKY*ISKKLVVI-EDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNITQ 8322
            R  IR+   +     KKL  + +D    +L++V ++ ++ +L + H+   G H    +T + +   YYW ++  DVK+W  AC  C   KN  T   AP   +    +P  L  +D +    T+   + + ++ +D FTKW    P  D +A  V+K ++  I F  G P+K++ D+   F+ +    +  +F + +I   ++   T    E    T+ + L+     + N+WD+++S  ++  N+T 
Sbjct:   33 RSGIRRAAKKFVFKEKKLFYVGKDRKQNRLVIVSEEEKKKVLRECHENDSGAHHGISRTLTLVESNYYWTSVTNDVKQWVYACQHCQVAKN--TVIVAPKQHLLKVENPWSLVTVDLMGPFHTSNRSHVYAIIMTDLFTKWIVILPLCDVSASEVSKAII-NIFFLYGPPQKIIMDQRDEFIQQINIELYRLFGIKQI-VISHTSGTVNPTESTPNTIKAFLSKHCADHPNNWDDHLSAVSFAFNVTH 276          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: RTL1 (retrotransposon Gag like 1 [Source:HGNC Symbol;Acc:HGNC:14665])

HSP 1 Score: 97.8265 bits (242), Expect = 3.591e-19
Identity = 80/318 (25.16%), Postives = 146/318 (45.91%), Query Frame = 1
Query: 6403 EQYDKLRDTKYFTVIDANKGYMQIQMEEG----DAEKTAFVIE-DGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAV--YMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQ-EEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVI 7332
            E +D+L   ++FT ++  +G +  +   G    D  K AF +E + +  Y   PF L+  P   Q +++ IL D+    V  Y +++LI S + EEHL  +  VL R +   +     K ++   +  FLG +VT +G+  N + + ++  +P P +   ++ F+     YR F++ ++ IA P+                     +   ++  Q  W  E Q+AF+ LK     AP+L +P  +  F +    +G A+ A L Q +++ GK     + S+ +   ++ YS  E +   I  A   +  Y+  TE  I
Sbjct:  652 ELFDQLHGAEWFTKLEL-RGTIVEESVNGHRTEDVWKAAFGLELEEMKSYQ--PFALSPDPIIPQNVIHFILKDMLGFFVLSYGQEVLIYSMSQEEHLHHVRQVLVRFRHHNVYCSLDKSQFHRQTVEFLGFVVTPKGVKLNKNVMTIITGYPTPGSKLSLRNFIEFVFPYRHFVERFSIIAEPL---------------------VRQLLSSYQFYWGVEEQEAFECLKRAFRKAPLLHHPKPQNPFYLETGVTGTALHASLIQIDDQTGKRACCAFYSRNISPIEVEYSQAEMKILPIRAAFMVWCRYLENTEEPI 945          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: RTL1 (retrotransposon Gag like 1 [Source:HGNC Symbol;Acc:HGNC:14665])

HSP 1 Score: 97.8265 bits (242), Expect = 3.591e-19
Identity = 80/318 (25.16%), Postives = 146/318 (45.91%), Query Frame = 1
Query: 6403 EQYDKLRDTKYFTVIDANKGYMQIQMEEG----DAEKTAFVIE-DGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAV--YMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQ-EEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVI 7332
            E +D+L   ++FT ++  +G +  +   G    D  K AF +E + +  Y   PF L+  P   Q +++ IL D+    V  Y +++LI S + EEHL  +  VL R +   +     K ++   +  FLG +VT +G+  N + + ++  +P P +   ++ F+     YR F++ ++ IA P+                     +   ++  Q  W  E Q+AF+ LK     AP+L +P  +  F +    +G A+ A L Q +++ GK     + S+ +   ++ YS  E +   I  A   +  Y+  TE  I
Sbjct:  652 ELFDQLHGAEWFTKLEL-RGTIVEESVNGHRTEDVWKAAFGLELEEMKSYQ--PFALSPDPIIPQNVIHFILKDMLGFFVLSYGQEVLIYSMSQEEHLHHVRQVLVRFRHHNVYCSLDKSQFHRQTVEFLGFVVTPKGVKLNKNVMTIITGYPTPGSKLSLRNFIEFVFPYRHFVERFSIIAEPL---------------------VRQLLSSYQFYWGVEEQEAFECLKRAFRKAPLLHHPKPQNPFYLETGVTGTALHASLIQIDDQTGKRACCAFYSRNISPIEVEYSQAEMKILPIRAAFMVWCRYLENTEEPI 945          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: GIN1 (gypsy retrotransposon integrase 1 [Source:HGNC Symbol;Acc:HGNC:25959])

HSP 1 Score: 83.5741 bits (205), Expect = 9.902e-17
Identity = 50/186 (26.88%), Postives = 91/186 (48.92%), Query Frame = 1
Query: 7582 REEIRKGTNQKY*ISKKLVVI-EDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETI 8136
            R  IR+   +     KKL  + +D    +L++V ++ ++ +L + H+   G H    +T + +   YYW ++  DVK+W  AC  C   KN  T   AP   +    +P  L  +D +    T+   + + ++ +D FTKW    P  D +A  V+K ++  I F  G P+K++ D+   F+ + +
Sbjct:   33 RSGIRRAAKKFVFKEKKLFYVGKDRKQNRLVIVSEEEKKKVLRECHENDSGAHHGISRTLTLVESNYYWTSVTNDVKQWVYACQHCQVAKN--TVIVAPKQHLLKVENPWSLVTVDLMGPFHTSNRSHVYAIIMTDLFTKWIVILPLCDVSASEVSKAII-NIFFLYGPPQKIIMDQRDEFIQQKV 215          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: NYNRIN (NYN domain and retroviral integrase containing [Source:HGNC Symbol;Acc:HGNC:20165])

HSP 1 Score: 87.4261 bits (215), Expect = 5.224e-16
Identity = 46/151 (30.46%), Postives = 78/151 (51.66%), Query Frame = 1
Query: 7672 VVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIP-ATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANF 8121
            VVP +LR+ L+   HD  LG H    +T  +L    +W  +++ VK +C +C  C  R   G++ K    P P  +T+P     ++ +  +  +  G+KH+L+ +D  T+W EAFP    T   VA++L+  +  + G P +L   +G  F
Sbjct: 1535 VVPTQLRRDLIFSVHDIPLGAHQRPEETYKKLRLLGWWPGMQEHVKDYCRSCLFCIPRNLIGSELKVIESPWPLRSTAPWSNLQIEVVGPVTISEEGHKHVLIVADPNTRWVEAFPLKPYTHTAVAQVLLQHVFARWGVPVRLEAAQGPQF 1685          

HSP 2 Score: 64.6994 bits (156), Expect = 4.420e-9
Identity = 56/205 (27.32%), Postives = 89/205 (43.41%), Query Frame = 1
Query: 6841 VPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTL------KGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVI------RTDHKPLEGLWKHKDTSSRLLKWAMKIQD 7419
            +P+    +  F+G    +R  I DY  +  P+  L K                      KP  +W  EH++AF  LK  L++A  L  P+ +  F + +  S  A+ A+L Q E  G+ + I Y SK L      +G Q    +     YA+ +ALK F   I  T VV+      RT   P E     + + + L++W++ +QD
Sbjct: 1087 IPSNFTALSFFMGFMDSHRDAIPDYEALVGPLHSLLK---------------------QKPDWQWDQEHEEAFLALKRALVSALCLMAPNSQLPFRLEVTVSHVALTAILHQ-EHSGRKHPIAYTSKPLLPDEESQGPQ----SGGDSPYAVAWALKHFSRCIGDTPVVLDLSYASRTTADP-EVREGRRVSKAWLIRWSLLVQD 1264          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: CR855320.1 (pep chromosome:GRCz11:1:7956030:7961696:1 gene:ENSDARG00000099359.2 transcript:ENSDART00000159655.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CR855320.1)

HSP 1 Score: 372.089 bits (954), Expect = 1.207e-104
Identity = 215/670 (32.09%), Postives = 347/670 (51.79%), Query Frame = 1
Query: 6322 NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEK-GKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEV--VIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKI-GTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHN 8313
              +R CIDYR LN++T+K++YP P     +++L+   +FT +D    Y  +++  GD  K+AF    G +EY  LPFGL+NAPA FQ  +N +L D+  Q   VY++DILI S + +EH++ I  VLQRL +  + +K  KC +   S  FLGHIV+VEG+  +P+KI+ V N+P P + + +Q FLG   +YR+FI++++++AAP+  LT                      +K   +W+S  + AF +LK   ++APIL  PD  +QF+V +DAS   V A+L Q     GK +   Y S  L  A+  Y    +E  A+  AL++++ ++ G+ V  ++ TDHK LE +   K  +SR  +WA+     +  I Y+PG  N   DALSR+ +          +E+   ++    K + IS I  EI      K   +   V          + VP+KLR  ++   H   +  H    +T+  + Q+++W  L +DV+ +  AC++CA  K +       L P+   + P    ++DF+  LP + NGN  IL   D F+K A   P     +     + V   +F+I G P  +++D+G  F+S+  +    +       ++ +HPQ++G  ER N  L   L   V +N + W + +S   Y HN
Sbjct:  574 GSLRPCIDYRGLNNITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRAGDEWKSAFNTPRGHFEYCVLPFGLSNAPAVFQAFVNDVLRDMIDQFIYVYLDDILIFSHSLQEHVQHIRRVLQRLLENGLYVKAEKCVFHAQSVPFLGHIVSVEGLRMDPEKIKAVVNWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTALTS---------------------SKTPFRWSSAAEAAFSKLKGCFVSAPILITPDPSRQFVVEVDASEVGVGAILSQRSSSDGKIHPCAYYSHRLSAAESNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIKSAKRLNSRQARWALFFGRFNFTISYRPGSKNIKPDALSRLFD---------SSERTSSLEPVVPKRIVISNITWEI----ESKVRAALDGVTPPIGCPPSRLFVPEKLRSDVIRWGHSSKVACHPGVSRTSFVIKQRFWWPALARDVRDFVLACSVCAVSKTSNRPPAGLLQPLSVPSRPWSHISLDFVTGLPPS-NGNTVILTVVDRFSKAAHFVPLPKLPSARETAVAVINHVFRIHGLPTDVVSDRGPQFISKFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVSQNPSSWSQQLSWVEYAHN 1208          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX511082.1 (pep chromosome:GRCz11:9:14291932:14297132:1 gene:ENSDARG00000113678.1 transcript:ENSDART00000183119.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX511082.1)

HSP 1 Score: 371.703 bits (953), Expect = 1.953e-104
Identity = 214/670 (31.94%), Postives = 348/670 (51.94%), Query Frame = 1
Query: 6322 NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEK-GKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEV--VIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKI-GTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHN 8313
              +R CIDYR LN +T+K++YP P     +++L+   +FT +D    Y  +++ EGD  KTAF    G +EY  LPFGL+NAPA FQ L+N +L D+  Q   VY++DILI S + +EH++ +  VLQRL +  + +K  KC +   S  FLGHIV+VEG+  +P+K++ V ++P P + + +Q FLG   +YR+FI++++++AAP+  LT  +KT                      +W++  Q AFD+LK   ++APIL  PD  +QF+V +DAS   V A+L Q     GK +   Y S  L  A+  Y    +E  A+  AL++++ ++ G+ V  ++ TDHK LE +   K  +SR  +WA+        I Y+PG  N   DALSRI +          +E+    +    + L IS +  EI      +  ++ + V          + VP++LR  ++   H   L  H    +T   + Q+++W  + +D++ +  AC++CA  K +       L P+   + P    A+DF+  LP + NGN  IL   D F+K     P     +       V   +F+I G P  +++D+G  F+S+  +   ++       ++ +HPQ++G  ER N  L   L   V +N + W + +S   Y HN
Sbjct:  572 GSLRPCIDYRGLNAITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRIREGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMLDQFIYVYLDDILIFSHSLQEHVQHVRRVLQRLLENGLYVKAEKCVFHAQSVPFLGHIVSVEGMRMDPEKVQAVVDWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTALT-SLKTPF--------------------RWSNAAQVAFDRLKSCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSSDGKMHPCAYFSHRLNNAEQNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIQSAKRLNSRQARWALFFGRFDFSISYRPGSKNVKPDALSRIFDH---------SERASSPETIVPRRLFISAVTWEI----ESRVRMALEGVTPPPGCPPSRLFVPEELRSDVIRWGHSSKLACHPGVSRTLYLIKQRFWWPVMARDIRNFVLACSVCAVSKTSNRPPAGLLQPLSVPSRPWSHIALDFVTGLPPS-NGNTVILTVVDRFSKATHFIPLPKLPSARETAAAVIDHVFRIHGLPTDVVSDRGPQFISKFWREFCHLMGATVSLSSGFHPQSNGQTERANQDLERMLRCLVSQNPSSWSQQLSWVEYAHN 1206          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX546500.1 (pep chromosome:GRCz11:23:12926092:12931693:-1 gene:ENSDARG00000086495.3 transcript:ENSDART00000122176.3 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX546500.1)

HSP 1 Score: 363.614 bits (932), Expect = 2.765e-102
Identity = 213/675 (31.56%), Postives = 348/675 (51.56%), Query Frame = 1
Query: 6322 NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEK-GKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEV--VIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAE-----AFPT*DETAKTVAKLLVAKIIFKI-GTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHN 8313
              +R CIDYR LN +T+K++YP P     +++L+   +FT +D    Y  +++  GD  KTAF    G +EY  LPFGL+NAPA FQ L+N +L D+  Q   VY++DILI S + +EH++ +  VLQRL +  + +K  KC +   S  FLGHIV+VEG+  +P+KI+ V ++P P + + +Q FLG   +YR+FI++++++AAP+  LT                      +K   +W+S  + AF +LK   ++APIL  PD  +QF+V +DAS   V A+L Q     GK +   Y S  L  A+  Y    +E  A+  AL++++ ++ G+ V  ++ TDHK LE +   K  +SR  +WA+     +  I Y+PG  N   DALSR+ +        L +  P+  Q     ++   +I   +R   N         V          + VP++LR  ++   H   +  H    +T   + Q+++W  + +DV+ +  AC++CA  K++       L P+   + P    ++DF+  LP++ NGN  +L   D F+K A        P+  ETA     + V   +F+I G P  +++D+G  F+S   +    +       ++ +HPQ++G  ER N  L   L   V +N + W + +S   Y HN
Sbjct:  546 GSLRPCIDYRGLNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQRLLENGLYVKAEKCVFHAQSVQFLGHIVSVEGMRMDPEKIQAVVDWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTSLTS---------------------SKMPFRWSSAAEAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSSDGKIHPCAYFSHRLSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIRSAKRLNSRQARWALFFGRFNFTISYRPGSKNIKPDALSRLFDPSDR----LSSPDPVLPQGIVVANISW-EIESRVRTALNG--------VTPPIGCPPSRLFVPEELRSDVVRWGHSSKVACHPGVSRTLFVIKQRFWWPTMARDVRDFVLACSVCAVSKSSNRPPAGLLQPLSVPSRPWSHISLDFVTGLPSS-NGNTVVLTVVDRFSKAAHFISLPKLPSARETA-----VAVIDHVFRIHGLPTDVVSDRGPQFVSRFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVSQNPSSWSQQLSWVEYAHN 1180          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX511224.1 (pep chromosome:GRCz11:2:18017000:18022765:1 gene:ENSDARG00000113243.1 transcript:ENSDART00000186877.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX511224.1)

HSP 1 Score: 363.614 bits (932), Expect = 7.436e-102
Identity = 215/676 (31.80%), Postives = 347/676 (51.33%), Query Frame = 1
Query: 6322 NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEK-GKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEV--VIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEIS-KIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAE-----AFPT*DETAKTVAKLLVAKIIFKI-GTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHN 8313
              +R CIDYR LN +T+K++YP P     +++L+   +FT +D    Y  + +  GD  KTAF    G +EY  LPFGL+NAPA FQ L+N +L D+  Q   VY++DILI S + +EH++ +  VLQRL +  + +K  KC +   S  FLGHIV+VEG+  +P+KI+ V N+P P + + +Q FLG   +YR+FI +++++AAP+  LT                      +K   +W+S  + AF +LK   ++APIL  PD  +QF+V +DAS   V A+L Q     GK +   Y S  L  A+  Y    +E  A+  AL++++ ++ G+ V  ++ TDHK LE +   K  +SR  +WA+     +  I Y+PG  N   DALSR+ +          T  P  +  ++     IS +I   +R   +         V          + VP++LR  ++   H   +  H    +T   + Q+++W  + +DV+ +  AC++CA  K++       L P+   + P    ++DF+  LP++ NGN  IL   D F+K A        P+  ETA     + V   +F+I G P  +++D+G  F+S+  +    +       ++ +HPQ++G  ER N  L   L   V +N + W + +S   Y HN
Sbjct:  572 GSLRPCIDYRGLNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVCIRPGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQRLLENGLYVKAEKCVFHAQSVQFLGHIVSVEGMRMDPEKIQAVVNWPTPDSRKALQRFLGFANFYRRFIHNFSQLAAPLTSLTS---------------------SKTPFRWSSAAEAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSSAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIKSAKRLNSRQARWALFFGRFNFTISYRPGSKNIKPDALSRLFDPSDR------TSSPDPVLPQRIVVANISWEIESRVRTALDG--------VTPPIGCPPNRLFVPEELRSDVVRWGHSSKVACHPGVSRTLFVVKQRFWWPAMARDVRDFVLACSVCAVSKSSNRPPAGLLQPLSVPSRPWSHISLDFVTGLPSS-NGNTVILTVVDRFSKAAHFISLPKLPSARETA-----VAVIDHVFRIHGLPTDVVSDRGPQFVSKFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVSQNPSSWSQQLSWVEYAHN 1206          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: CR925755.2 (pep chromosome:GRCz11:17:42486740:42492668:-1 gene:ENSDARG00000116402.1 transcript:ENSDART00000183946.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CR925755.2)

HSP 1 Score: 360.918 bits (925), Expect = 4.676e-101
Identity = 208/671 (31.00%), Postives = 342/671 (50.97%), Query Frame = 1
Query: 6322 NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEE-KGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEV--VIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEIS-KIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKI-GTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHN 8313
              +R CIDYR LN +T+K++YP P     +++L+   +FT +D    Y  +++  GD  KTAF    G +EY  LPFGL+NAPA FQ L+N +L D+  Q   VY++DILI S + +EH++ +  VLQRL +  + +K  KC +   S  FLGH V+VEG+  +P+KI+ V N+P P + + +Q FLG   +YR+FI+++ ++AAP+  LT                      +K   +W++  + AF +LK   ++APIL  PD  +QF+V +D S   V A+L Q     GK +   Y S  L  A+  Y    +E  A+  AL++++ ++ G+ V  ++ TDHK LE +   K  +SR  +WA+     +  I Y+PG  N   DALSR+ +          T  P  +  ++     IS +I   +R   +         V          + VP+ LR  ++   H   +  H    +T   + Q+++W  + +DV+ +  AC++CA  K++       L P+   + P    ++DF+  LP++ NGN  +L   D F+K A   P     +     + V   +F+I G P  +++D+G  F+S+  +    +       ++ +HPQ++G  ER N  L   L   V +N + W + +S   Y HN
Sbjct:  545 GSLRPCIDYRGLNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQRLLENGLYVKAEKCVFHAQSVQFLGHTVSVEGMRMDPEKIQAVVNWPTPDSRKALQRFLGFANFYRRFIRNFRQLAAPLTNLTS---------------------SKTPFRWSNAAEAAFSKLKGCFVSAPILIAPDPSRQFVVEVDVSEVGVGAILSQRSALDGKIHPCAYFSHRLSAAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVSTDHKNLEYIKSAKRLNSRQARWALFFGRFNFSISYRPGSKNIKPDALSRLFDRSDR------TSSPDPVLPQRVFVANISWEIESRVRTALDG--------VTPPIGCPPSRLFVPEDLRSDVIWWGHSSKVACHPGVSRTLFVIKQRFWWPVMARDVRDFVLACSVCAASKSSNRPPAGLLQPLSVPSRPWSHISLDFVTGLPSS-NGNTVVLTVVDRFSKAAHFVPLPKLPSARETAVAVIDHVFRIHGLPTDVVSDRGPQFVSKFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVSQNPSSWSQQLSWVEYAHN 1179          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000035398.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:3:9869556:9871940:-1 gene:ENSXETG00000011182.1 transcript:ENSXETT00000035398.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 356.681 bits (914), Expect = 4.632e-105
Identity = 216/675 (32.00%), Postives = 335/675 (49.63%), Query Frame = 1
Query: 6328 IRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEK-GKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTE--VVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*D-ETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNITQKVGT 8334
            +R CIDYR LN +T+K+ YP P   E +D+L+  K F+ +D    Y  I++ EGD  KTAF   DG YEY  +PFGL NAPA FQ  +N I  D+  +   VY++DILI S+  E H   +   L RL++  +  K  KC +  P   FLG+I++  G   +P K+  ++ +P+P + + +Q F+G   YYR+FI+ ++   AP++ L +                   K  +P   W     +AF  LK+  I+A +LR+P+    F + +DAS     A+L Q     GK +   Y SK    A+  Y    +E  A+  AL++++  + G    V I TDHK LE L   K  + R  +W++     H  + Y+PG  N+ ADALSR         F      PI    E+   +  S+I   +     ++  +S+     +   G  +  VP +LR  +L Q H     GH  S KT   L +  +W  ++KDV+ +  ACT+CAT K + ++    LHP+P  + P     MDF+ +LP +  GN  I V  D F+K A   P     +A  +A+L +  I    G P ++++D+G+ F+S   + +     +    ++AYHPQT+G  ER N  L   L   V   Q+DW + +    + HN  +   T
Sbjct:   52 LRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENSLFAKLEKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKGFSSRIAPILSLIR-------------------KGGRP-NCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQARWSLFFSRFHFVLTYRPGTKNRKADALSR--------SFSPEDRLPI----EREPIIPPSRIIASVLPQFAEQILLSQSAAPPDTPIG--MAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLQRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPVPSRPWTHLGMDFIVELPPSC-GNTVIWVVIDRFSKMAHFVPLRKLPSAVELAQLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQDDWSDLLPWAEFAHNNARHSST 691          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: castor1 (cytosolic arginine sensor for mTORC1 subunit 1 [Source:Xenbase;Acc:XB-GENE-960689])

HSP 1 Score: 357.451 bits (916), Expect = 2.017e-99
Identity = 216/675 (32.00%), Postives = 335/675 (49.63%), Query Frame = 1
Query: 6328 IRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEK-GKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTE--VVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*D-ETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNITQKVGT 8334
            +R CIDYR LN +T+K+ YP P   E +D+L+  K F+ +D    Y  I++ EGD  KTAF   DG YEY  +PFGL NAPA FQ  +N I  D+  +   VY++DILI S+  E H   +   L RL++  +  K  KC +  P   FLG+I++  G   +P K+  ++ +P+P + + +Q F+G   YYR+FI+ ++   AP++ L +                   K  +P   W     +AF  LK+  I+A +LR+P+    F + +DAS     A+L Q     GK +   Y SK    A+  Y    +E  A+  AL++++  + G    V I TDHK LE L   K  + R ++W++     H  + Y+PG  N+ ADALSR         F      PI    E+   +  S+I   +     ++  +S+     +   G  +  VP +LR  +L Q H     GH  S KT   L +  +W  ++KDV+ +  ACT+CAT K + ++    LHP+P  + P     MDF+ +LP +  GN  I V  D F+K A   P     +A  +A+L +  I    G P ++++D+G+ F+S   + +     +    ++AYHPQT+G  ER N  L   L   V   Q+DW + +    + HN      T
Sbjct:  570 LRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENSLFAKLDKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKGFSSRIAPILSLIR-------------------KGGRP-NCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQVRWSLFFSRFHFVLTYRPGTKNRKADALSR--------SFSPEDRLPI----EREPIIPPSRIIASVLPQFAEQILLSQSAAPPDTPIG--MAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLQRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPVPSRPWTHLGMDFIVELPPSC-GNTVIWVVIDRFSKMAHFVPLRKLPSAVELAQLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQDDWSDLLPWAEFAHNNASHSST 1209          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: anxa6 (annexin A6 [Source:Xenbase;Acc:XB-GENE-989741])

HSP 1 Score: 341.273 bits (874), Expect = 2.356e-99
Identity = 185/521 (35.51%), Postives = 295/521 (56.62%), Query Frame = 1
Query: 5947 MCETMVLRDDSEQMVEL-SKTNNVKFYDKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGRRNKTKFFTMGITHCTS*-------EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCA-VYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSR 7482
            +C   V   D   +  L  +  + +  D V I++ +L+ +QQ +L+ +L  YS +F+ +    G T   +H +DTG   PI+   Y +   +R E+K QI           F  +  +H           +     R C+DYR+LNDVT  D+YP P+  E  D+L + KY T +D ++GY QI +     EK+AF+   GLY++T +PFG+ NAPATFQRL+N +L  +Q  A  Y++DI + S+T+EEHL+ +  V  +++ A + +KP KC  A     +LGH V    + P+P K+E +  +P+P T +QV  FLG +GYYRKFI +Y+ +A P+ +LT    ++  S+ ++               WT E + A + LK+ L ++P+L  PDF ++FI+  DAS   + AVL Q    G+++ + Y S+ L   +  Y+TIEKEC AIV+AL++ +PY+YG E  + TDH PL  L +    + +LL+W++ +Q  +  I ++ G+ + NAD LSR
Sbjct:  262 ICSAPVEGTDDPPIPNLIEEATSARGIDAVTISD-HLHLSQQDQLRKILRSYSPMFSANP---GRTHWAEHKVDTGTQLPIRSPAYRVAEAVRPEMKSQID------EMLAFGVITPSHSPWASPVVLVPKKDGSTRFCVDYRRLNDVTTTDAYPMPRVDELLDRLGNAKYLTTLDLSRGYWQIPLAPSAQEKSAFLTPFGLYQFTVMPFGMRNAPATFQRLVNRLLEGMQDFAQAYLDDIAVFSQTWEEHLQHLQRVFAQIQDAGLTLKPEKCHLAMAEVQYLGHRVGGGQLRPDPAKVEAICQWPIPKTQKQVLAFLGTSGYYRKFIPNYSTVAKPLTDLT----SRQRSRTIV---------------WTPECESAMNALKQALASSPVLAAPDFSRRFILQTDASNFGLGAVLSQVNTYGEEHPVAYLSRKLLPREAAYATIEKECLAIVWALQKLQPYLYGREFTVVTDHNPLSWLQRVSGDNGKLLRWSLLLQQYNFTIQHRKGKEHHNADGLSR 753          

HSP 2 Score: 82.0333 bits (201), Expect = 1.418e-14
Identity = 39/109 (35.78%), Postives = 64/109 (58.72%), Query Frame = 1
Query: 7984 DYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIH 8310
            DY T++ EA       A TVA  L+ +I  ++G P ++L+D+G  F S+ ++ +     +  I+++ YHPQT+GL ERFNGTL + L  FV   + DW+ Y+    + +
Sbjct:    2 DYATRYPEAVALRKIDAPTVADALI-QIFSRVGFPSEILSDQGPQFTSQLLQCLWQRCGVRAIHSSPYHPQTNGLCERFNGTLKTMLRTFVESGEKDWERYLPHLLFAY 109          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000020886.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:3:49613601:49618752:-1 gene:ENSXETG00000011904.1 transcript:ENSXETT00000020886.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 356.681 bits (914), Expect = 3.761e-99
Identity = 216/675 (32.00%), Postives = 334/675 (49.48%), Query Frame = 1
Query: 6328 IRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEK-GKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTE--VVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*D-ETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNITQKVGT 8334
            +R CIDYR LN +T+K+ YP P   E +D+L+  K F+ +D    Y  I++ EGD  KTAF   DG YEY  +PFGL NAPA FQ  +N I  D+  +   VY++DILI S+  E H   +   L RL++  +  K  KC +  P   FLG+I++  G   +P K+  ++ +P+P + + +Q F+G   YYR+FI+ ++   AP++ L +                   K  +P   W     +AF  LK+  I+A +LR+P+    F + +DAS     A+L Q     GK +   Y SK    A+  Y    +E  A+  AL++++  + G    V I TDHK LE L   K  + R  +W++     H  + Y+PG  N+ ADALSR         F      PI    E+   +  S+I   +     ++  +S+     +   G  +  VP +LR  +L Q H     GH  S KT   L +  +W  ++KDV+ +  ACT+CAT K + ++    LHP+P  + P     MDF+ +LP +  GN  I V  D F+K A   P     +A  +A+L +  I    G P ++++D+G+ F+S   + +     +    ++AYHPQT+G  ER N  L   L   V   Q+DW + +    + HN      T
Sbjct:  570 LRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENSLFAKLDKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKGFSSRIAPILSLIR-------------------KGGRP-NCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQARWSLFFSRFHFVLTYRPGTKNRKADALSR--------SFSPEDRLPI----EREPIIPPSRIIASVLPQFAEQILLSQSAAPPDTPIG--MAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLQRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPVPSRPWTHLGMDFIVELPPSC-GNTVIWVVIDRFSKMAHFVPLRKLPSAVELAQLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQDDWSDLLPWAEFAHNNASHSST 1209          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: lin54 (lin-54 DREAM MuvB core complex component [Source:Xenbase;Acc:XB-GENE-5752559])

HSP 1 Score: 356.295 bits (913), Expect = 6.513e-99
Identity = 216/675 (32.00%), Postives = 334/675 (49.48%), Query Frame = 1
Query: 6328 IRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEK-GKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTE--VVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*D-ETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNITQKVGT 8334
            +R CIDYR LN +T+K+ YP P   E +D+L+  K F+ +D    Y  I++ EGD  KTAF   DG YEY  +PFGL NAPA FQ  +N I  D+  +   VY++DILI S+  E H   +   L RL++  +  K  KC +  P   FLG+I++  G   +P K+  ++ +P+P + + +Q F+G   YYR+FI+ ++   AP++ L +                   K  +P   W     +AF  LK+  I+A +LR+P+    F + +DAS     A+L Q     GK +   Y SK    A+  Y    +E  A+  AL++++  + G    V I TDHK LE L   K  + R  +W++     H  + Y+PG  N+ ADALSR         F      PI    E+   +  S+I   +     ++  +S+     +   G  +  VP +LR  +L Q H     GH  S KT   L +  +W  ++KDV+ +  ACT+CAT K + ++    LHP+P  + P     MDF+ +LP +  GN  I V  D F+K A   P     +A  +A+L +  I    G P ++++D+G+ F+S   + +     +    ++AYHPQT+G  ER N  L   L   V   Q+DW + +    + HN      T
Sbjct:  570 LRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENSLFAKLDKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKGFSSRIAPILSLIR-------------------KGGRP-NCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQARWSLFFSRFHFVLTYRPGTKNRKADALSR--------SFSPEDRLPI----EREPIIPPSRIIASVLPQFAEQILLSQSAAPPDTPIG--MAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLQRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPVPSRPWTHLGMDFIVELPPSC-GNTVIWVVIDRFSKMAHFVPLRKLPSAVELAQLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQDDWSDLLPWAEFAHNNASHSST 1209          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Gin1 (gypsy retrotransposon integrase 1 [Source:MGI Symbol;Acc:MGI:2182036])

HSP 1 Score: 96.2857 bits (238), Expect = 2.259e-19
Identity = 61/248 (24.60%), Postives = 116/248 (46.77%), Query Frame = 1
Query: 7582 REEIRKGTNQKY*ISKKLVVI-EDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNITQ 8322
            R  IR+   +     KKL  + +D    +L+VV ++ ++ +L + H+   G H    +T + +   YYW ++  DVK+W  AC  C   KN  T   AP   +P   +P  +  +D +    T+   + + ++ +D FTKW    P  D +A  ++K ++  I F  G P+K++ D+   F+ +    +  +F   +I  +      +   E    T+ + L+     + N WDE++   ++  N+T 
Sbjct:   33 RSGIRRAAKKFVFKEKKLFYVGKDRKQNRLVVVSEEEKKKVLRECHENGPGVHHGISRTLTLVESGYYWTSVTNDVKQWVYACQHCQVAKN--TVIVAPQQHLPMVGNPWSVVTVDLMGPFHTSNRSHVYAIIMTDLFTKWVMILPLCDVSASEISKAII-NIFFLYGPPQKIIMDQRDEFIEQINVELYRLFGAKEIVISRASGSVNP-AENTPSTIKTFLSKHCADHPNSWDEHLPALSFAFNVTH 276          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Nynrin (NYN domain and retroviral integrase containing [Source:MGI Symbol;Acc:MGI:2652872])

HSP 1 Score: 82.4185 bits (202), Expect = 1.125e-14
Identity = 43/151 (28.48%), Postives = 75/151 (49.67%), Query Frame = 1
Query: 7672 VVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIP-ATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANF 8121
            VVP++LR+ L+   HD  +G H     T   +    +W  ++  V+ +C +C  C  R   G + K    P P  +T+P     ++ +  +  +  G+KH+L+ +D  T+W EAFP    T   VA++L+  +  + G P +L   +G  F
Sbjct: 1478 VVPRQLRRDLIFSVHDSPIGEHQGLEDTYKTVRLLGWWPGMQDHVRDYCRSCLFCIPRNLIGGELKVIESPWPLRSTAPWSSLQIEVVGPVTVSEEGHKHVLIVADANTRWVEAFPLKPYTHVAVAQVLLQHVFARWGVPIRLEAAQGPQF 1628          

HSP 2 Score: 60.4622 bits (145), Expect = 6.475e-8
Identity = 55/224 (24.55%), Postives = 91/224 (40.62%), Query Frame = 1
Query: 6781 VTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTL------KGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGL----WKHKDTS-SRLLKWAMKIQD 7419
            V  +G  P    +  +    +P+    +  F+G    +R  I DY  +  P+  L K                      KP  +W  EH+K+F  LK  L+ A  L  P+    F + +  S  ++ A L Q E  G+ + I Y SK L      +G Q    +     YA+ +ALK F   +    VV+R  +     +    W  +  S + L++W++ +QD
Sbjct: 1010 VPWDGKAPCQQVLAQLAQLNIPSNFTALSFFMGFMDSHRDVISDYEDLVGPLHGLLK---------------------QKPDWQWNQEHEKSFLALKRALVCALCLSTPNPNLPFYLEVTVSQVSLTASLHQ-EHSGRKHPIAYTSKPLLPDEDSEGPQ----SGGDSPYAVAWALKHFARCVGDNPVVLRLSYASRTTVDNEAWDSRRASKAWLIRWSLLLQD 1207          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Nynrin (NYN domain and retroviral integrase containing [Source:MGI Symbol;Acc:MGI:2652872])

HSP 1 Score: 82.4185 bits (202), Expect = 1.125e-14
Identity = 43/151 (28.48%), Postives = 75/151 (49.67%), Query Frame = 1
Query: 7672 VVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIP-ATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANF 8121
            VVP++LR+ L+   HD  +G H     T   +    +W  ++  V+ +C +C  C  R   G + K    P P  +T+P     ++ +  +  +  G+KH+L+ +D  T+W EAFP    T   VA++L+  +  + G P +L   +G  F
Sbjct: 1478 VVPRQLRRDLIFSVHDSPIGEHQGLEDTYKTVRLLGWWPGMQDHVRDYCRSCLFCIPRNLIGGELKVIESPWPLRSTAPWSSLQIEVVGPVTVSEEGHKHVLIVADANTRWVEAFPLKPYTHVAVAQVLLQHVFARWGVPIRLEAAQGPQF 1628          

HSP 2 Score: 60.4622 bits (145), Expect = 6.475e-8
Identity = 55/224 (24.55%), Postives = 91/224 (40.62%), Query Frame = 1
Query: 6781 VTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTL------KGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGL----WKHKDTS-SRLLKWAMKIQD 7419
            V  +G  P    +  +    +P+    +  F+G    +R  I DY  +  P+  L K                      KP  +W  EH+K+F  LK  L+ A  L  P+    F + +  S  ++ A L Q E  G+ + I Y SK L      +G Q    +     YA+ +ALK F   +    VV+R  +     +    W  +  S + L++W++ +QD
Sbjct: 1010 VPWDGKAPCQQVLAQLAQLNIPSNFTALSFFMGFMDSHRDVISDYEDLVGPLHGLLK---------------------QKPDWQWNQEHEKSFLALKRALVCALCLSTPNPNLPFYLEVTVSQVSLTASLHQ-EHSGRKHPIAYTSKPLLPDEDSEGPQ----SGGDSPYAVAWALKHFARCVGDNPVVLRLSYASRTTVDNEAWDSRRASKAWLIRWSLLLQD 1207          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Rtl1 (retrotransposon Gaglike 1 [Source:MGI Symbol;Acc:MGI:2656842])

HSP 1 Score: 82.4185 bits (202), Expect = 1.187e-14
Identity = 73/322 (22.67%), Postives = 139/322 (43.17%), Query Frame = 1
Query: 6403 EQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIED--------GLYEY-TRLPFGLTNAPATFQRLMNTILVDVQHCAV--YMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQ-EEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVI 7332
            E +D+L    +FT ++       + ++E +   T    ED        GL++     PF + +       +++ IL D+    V  +  ++L+ S + EEH + +  VL R +   I     K ++   +A  LG  ++ +G+  N + + ++   PVP + + +Q  + L   YR F++++A IAAP+                     +   ++     W  E Q+A + LK     +P+L +P  +  F +  D +G  + A L Q ++E GK     + S+ L   ++ Y  +E     I  A   +  Y+  TE  I
Sbjct:  912 ELFDQLHGAAWFTKLEL------LGIKESEMRHTVTHTEDTWRASFGFGLHQMRCYRPFTMNSYSDEGNNIVHFILKDILGLFVICHGREVLVYSMSQEEHSQHVRQVLVRFRYHNIYCSLDKTQFHRQTAEILGFNISPKGVKLNKNLMNLIVGCPVPGSRRCLQSVIDLVYPYRHFVENFAVIAAPL---------------------VRQLLSSEPYYWGEEEQEALESLKRAFRKSPVLYHPKPQNPFYLETDITGSFLSASLVQTDDETGKKSTCAFYSRPLSTMEVEYPRVEMRILPIRAAFMVWCRYLENTEEPI 1206          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Rtl1 (retrotransposon Gaglike 1 [Source:MGI Symbol;Acc:MGI:2656842])

HSP 1 Score: 82.4185 bits (202), Expect = 1.187e-14
Identity = 73/322 (22.67%), Postives = 139/322 (43.17%), Query Frame = 1
Query: 6403 EQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIED--------GLYEY-TRLPFGLTNAPATFQRLMNTILVDVQHCAV--YMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQ-EEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVI 7332
            E +D+L    +FT ++       + ++E +   T    ED        GL++     PF + +       +++ IL D+    V  +  ++L+ S + EEH + +  VL R +   I     K ++   +A  LG  ++ +G+  N + + ++   PVP + + +Q  + L   YR F++++A IAAP+                     +   ++     W  E Q+A + LK     +P+L +P  +  F +  D +G  + A L Q ++E GK     + S+ L   ++ Y  +E     I  A   +  Y+  TE  I
Sbjct:  912 ELFDQLHGAAWFTKLEL------LGIKESEMRHTVTHTEDTWRASFGFGLHQMRCYRPFTMNSYSDEGNNIVHFILKDILGLFVICHGREVLVYSMSQEEHSQHVRQVLVRFRYHNIYCSLDKTQFHRQTAEILGFNISPKGVKLNKNLMNLIVGCPVPGSRRCLQSVIDLVYPYRHFVENFAVIAAPL---------------------VRQLLSSEPYYWGEEEQEALESLKRAFRKSPVLYHPKPQNPFYLETDITGSFLSASLVQTDDETGKKSTCAFYSRPLSTMEVEYPRVEMRILPIRAAFMVWCRYLENTEEPI 1206          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P20825|POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 355.525 bits (911), Expect = 1.073e-100
Identity = 275/877 (31.36%), Postives = 438/877 (49.94%), Query Frame = 1
Query: 5785 ITKNHNIMLTDECVIPKNGII----PIRIANYERSNVKIHKGTRLGKL------FKGE----LEQTLEMCETMVLRDD------SEQMVELSKTNNVKFYDKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPI--KQRPYGIPYKLREEIKRQIRVKRG--RRNKTKFF--TMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRI-----HELGALE--------VFVLVTEKPID--------IQAEQNKDLEISKIREEIRKGTNQKY*I----SKKLVVIEDFSGKQLIVVPQK----------------------LRQILL--------------LQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWA--EAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKR 8142
            +T N  I L D  ++P+N I     P  +  +  +N  +  G +L K       +K +     +QT ++  +   R+       + + +  S   ++K  D  +   ++LN+ + FKL+ LL+++ ++  K    L  T+ IKHV++T +  PI  KQ P    +++  E + Q  + +G  R + + +   T  +         N+ R+ IDYRKLN++TI D YP P   E   KL   +YFT ID  KG+ QI+M+E    KTAF  + G YEY R+PFGL NAPATFQR MN IL  +  +HC VY++DI+I S +  EHL  I  V  +L  A +K++  KC++ +  A FLGHIVT +GI PNP K++ + ++P+PT  ++++ FLGLTGYYRKFI +YA IA PM    K  +TK +++ +                   E+ +AF++LK  +I  PIL+ PDF+K+F++  DAS  A+ AVL Q       + I++ S+TL   +L YS IEKE  AIV+A K F+ Y+ G + +I +DH+PL  L   K+  ++L +W +++ +   KI Y  G+ N  ADALSRI     H   A +          + +TEKPI+        I++++NK +E SKI       T  +Y +      K ++++ F  + + +  +                       +R + L              LQ H+  L  H   +K      + +++ N +  ++   N C IC   K     TK PL   P      E C   F+  + ++    KH +   D ++K+A  E   T D      A   + +I  ++G P+ L  D+   F S  +KR
Sbjct:   55 LTSNGPITLNDLIMLPRNSIFKKTEPFYVHRFS-NNYDMLIGRKLLKNAQSVINYKNDTVTLFDQTYKLITSESERNQNLYIQRTPESIASSDQESIKKLDFSQFRLDHLNQEETFKLKGLLNKFRNLEYKEGEKLTFTNTIKHVLNTTHNSPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKK-RTKIDTQKL-------------------EYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQN-----GHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIEENHHSEATQHSAEEDNSNLIHLTEKPINYFKKQIIFIKSDKNK-VEHSKIFGN--SITTIQYDVMTLEKAKQILLDHFIHRNITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSLFLLKNVGSYAEFKEIILQSHEKLL--HPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKTEHRNTKMPLKITPNP----EHCREKFVVDIYSS--EGKHYISCIDIYSKFATLEQIKTKDWIECRNA---LMRIFNQLGKPKLLKADRDGAFSSLALKR 891          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P04323|POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 337.035 bits (863), Expect = 1.644e-94
Identity = 235/719 (32.68%), Postives = 368/719 (51.18%), Query Frame = 1
Query: 6049 YLNETQQFKLQTLLDEYSDVFAKHEFD-LGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIR--VKRG--RRNKTKFFT--MGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIE-LTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIH----------ELGALE---VFVLVTEKPID------IQAEQNKDLEISK--------------IREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLR------------------------QILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWA 8004
            +LN  ++ +L  LL +Y D+   HE D L  T+  KH I+T +  P+  + Y  P    +E++ QI+  + +G  R + + + +    +          + R+ IDYRKLN++T+ D +P P   E   KL    YFT ID  KG+ QI+M+     KTAF  + G YEY R+PFGL NAPATFQR MN IL  +  +HC VY++DI++ S + +EHL+ +  V ++L KA +K++  KC++ +    FLGH++T +GI PNP+KIE ++ +P+PT  ++++ FLGLTGYYRKFI ++A IA PM + L K +K          + T N            E+  AF +LK  +   PIL+ PDF K+F +  DAS  A+ AVL Q+      + ++Y S+TL   ++ YSTIEKE  AIV+A K F+ Y+ G    I +DH+PL  L++ KD +S+L +W +K+ +    I Y  G+ N  ADALSRI           +  A E     + +TE+P++      I ++   D++++K               RE+  +     +   K  + IE  +  ++I    KL                         + L+L  H+  L  H   +KT     + YY+ N +  ++   N C+IC   K     T  P      TT   E C   F+  + ++    KH +   D ++K+A
Sbjct:  164 HLNNEEKQRLCALLQKYHDI-QYHEGDKLTFTNQTKHTINTKHNLPLYSK-YSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMK----------IDTTN-----------PEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQD-----GHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEETYLSEQTQHSAEEDNSDLIFITERPLNTFNRQVIFSKGPPDIKVTKYFKKHITQIFYDIMTREKAEQYLIDHFCGKKSALYIESDADFEVIQAAHKLAINTKYTKILRSTILLKNITTYAEFKELILTAHEKLL--HPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKTEHRNTDMPT----KTTPKPEHCREKFMIDIYSS--EGKHYVSCIDIYSKFA 846          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|Q99315|YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 335.109 bits (858), Expect = 2.379e-91
Identity = 232/761 (30.49%), Postives = 360/761 (47.31%), Query Frame = 1
Query: 6148 IKHVIDT-GNARPIKQRPYGIPYKLREEIKRQIRVKRGRRNKTKFFTMGITHCTS*-----EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDY-VITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQN---------------KDL--------EISKIRE-----EIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGAL-GGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKI-GTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            +KH I+    AR  + +PY +  K  +EI + ++    +    KF     + C+S      +     R+C+DYR LN  TI D +P P+      ++ + + FT +D + GY QI ME  D  KTAFV   G YEYT +PFGL NAP+TF R M     D++   VY++DILI S++ EEH K +  VL+RLK   + +K  KCK+A     FLG+ + ++ I P   K   +++FP P T++Q Q FLG+  YYR+FI + +KIA P                  I   I DK      +WT +  KA D+LK+ L  +P+L   + K  + +  DAS   + AVL + + K K   V+ Y SK+L+ AQ  Y   E E   I+ AL  F+  ++G    +RTDH  L  L    + + R+ +W   +      + Y  G  N  ADA+SR     A+      T +PID ++ ++               K+L        ++S  R      E+ +   + Y +  +++  +D      +VVP K +  ++  YHD  L GGH     T +++   YYW  L+  + ++   C  C   K+   +    L P+P         +MDF+  LP T N    ILV  D F+K A    T      T    L+ + IF   G PR + +D+     ++  + +     +    ++A HPQTDG  ER   TL   L  +   N  +W  Y+ Q  +++N T
Sbjct:  584 VKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQ----KLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQP------------------IQLFICDK-----SQWTEKQDKAIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISR-----AVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQD-----RLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYASTNIQNWHVYLPQIEFVYNST 1307          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|Q7LHG5|YI31B_YEAST (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 333.954 bits (855), Expect = 3.438e-91
Identity = 231/760 (30.39%), Postives = 359/760 (47.24%), Query Frame = 1
Query: 6148 IKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGRRNKTKFFTMGITHCTS*-----EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDY-VITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQN---------------KDL--------EISKIRE-----EIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGAL-GGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKI-GTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            +KH I+    +P  + P   PY + E+ +++I     +    KF     + C+S      +     R+C+DYR LN  TI D +P P+      ++ + + FT +D + GY QI ME  D  KTAFV   G YEYT +PFGL NAP+TF R M     D++   VY++DILI S++ EEH K +  VL+RLK   + +K  KCK+A     FLG+ + ++ I P   K   +++FP P T++Q Q FLG+  YYR+FI + +KIA P                  I   I DK      +WT +  KA ++LK  L  +P+L   + K  + +  DAS   + AVL + + K K   V+ Y SK+L+ AQ  Y   E E   I+ AL  F+  ++G    +RTDH  L  L    + + R+ +W   +      + Y  G  N  ADA+SR     A+      T +PID ++ ++               K+L        ++S  R      E+ +   + Y +  +++  +D      +VVP K +  ++  YHD  L GGH     T +++   YYW  L+  + ++   C  C   K+   +    L P+P         +MDF+  LP T N    ILV  D F+K A    T      T    L+ + IF   G PR + +D+     ++  + +     +    ++A HPQTDG  ER   TL   L  +V  N  +W  Y+ Q  +++N T
Sbjct:  610 VKHDIEI---KPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQP------------------IQLFICDK-----SQWTEKQDKAIEKLKAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISR-----AIYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQD-----RLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYVSTNIQNWHVYLPQIEFVYNST 1333          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P0CT39|TF26_SCHPO (Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-6 PE=3 SV=1)

HSP 1 Score: 320.857 bits (821), Expect = 1.386e-87
Identity = 217/698 (31.09%), Postives = 354/698 (50.72%), Query Frame = 1
Query: 6328 IRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQ--HCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGT--EVVIRTDHKPLEGLWKHKD--TSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEISKIR--EEIRKGTNQKY*ISKKLVVI---ED--------------FSGKQLIVVPQ--KLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFP-T*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVS--QCTY---IHNITQ 8322
            +RM +DY+ LN     + YP P  ++   K++ +  FT +D    Y  I++ +GD  K AF    G++EY  +P+G++ APA FQ  +NTIL + +  H   YM+DILI SK+  EH+K + +VLQ+LK A + I   KC++ +    F+G+ ++ +G TP  + I+ V  +  P   ++++ FLG   Y RKFI   +++  P+  L K        K V             + KWT    +A + +K+ L++ P+LR+ DF K+ ++  DAS  AV AVL Q+ +  K Y + Y S  +  AQL YS  +KE  AI+ +LK ++ Y+  T     I TDH+ L G   ++    + RL +W + +QD + +I Y+PG  N  ADALSRI         V  TE PI   +E N    +++I   ++ +     +Y    KL+ +   ED               + K  I++P   +L + ++ +YH+     H       + +L+++ W  ++K ++ +   C  C   K+   K   PL PIP +  P E  +MDF+  LP + +G   + V  D F+K A   P T   TA+  A++   ++I   G P++++ D    F S+T K  A+ +N     +  Y PQTDG  ER N T+   L      + N W +++S  Q +Y   IH+ TQ
Sbjct:  462 LRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLK--------KDV-------------RWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRI---------VDETE-PIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQ 1127          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A355ABF2 (Uncharacterized protein OS=Flavobacteriaceae bacterium OX=1871037 GN=DDZ39_05755 PE=4 SV=1)

HSP 1 Score: 713.761 bits (1841), Expect = 0.000e+0
Identity = 370/873 (42.38%), Postives = 527/873 (60.37%), Query Frame = 1
Query: 5722 IIPVKVNVKSEDTLIFEPDKSITKNHNIMLTDECVIPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGELEQTLEMCETMVLRDDSEQMVELSKTNNVKFYDKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIR-VKRGRRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDIQAEQNKDLEISKIREEI-----RKGTNQKY*ISKKLVVIEDFSG-KQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            ++P++++   + T+IFEP+      + + +        N  IP+R  NY   N+++ +G  +G +            E  V+  +  ++ E     N  F        +YLN  Q   ++ +  +YS VFA  +FDLG T+IIKH I      PIKQRPY   Y L+ EIK+Q+  +K  +  +  F           +    +R C+DYRKLN VT KD+YP P+  E  DKL ++K+FT +D   GY QI++ E D  KTAF   +GLYE+  LPFGLT APATFQR M+ IL+D  H  VY++DILI S  F+EHL DI  VLQRL+ A +KIKP KC+WA+    FLGH+V+ EGI P+P  +E +KNF  P  ++ +Q FLG+ GYYRKFI ++AKIA+P+ +LTK                 ND        WT +HQ++F++LK +LI  P+LR+PD  K FI+M DASG A+ AVLGQ++E  KD+VI YAS+ LK  ++ YS IEKE  AIV+++KQF  YI+  ++++ TD +PL+ L  HKD+SSRL++W++ +Q+  I I Y+ G+ N NAD LSR+ E     V +L      ++   Q  D  +  I E+          N +Y +  +  +++  +G K L V+P KL++ ++ +YHDG +GGHLS++KT S +  +YYW NLK DVK WC  C IC  RK     ++A L PI + ++PME  AMD +  LP T  GNK+ILVFSDYFTKW EA    D+ A+T+AK+ +  IIFK G P KL+TD+G NF+SE +  V   F + K  T+AYHPQ+DGLVERFN TL   L+ +  + Q DWDEY++ C + +  T
Sbjct: 1217 VMPMQIHCNDDKTVIFEPNDIFADTNCLSVATTVTNTDNSTIPVRFVNYSNDNIQLQQGQHIGTIH-----------EIDVVEVNVAKVFEGGMPPNFPF--------DYLNNEQNQAMENIFQKYSKVFATDDFDLGKTNIIKHFIPLDKDNPIKQRPYKAAYALKGEIKKQVEDMKHNKVIRNSFSPWASPIVMVKKKDGTMRFCVDYRKLNTVTRKDTYPLPRIDEMLDKLNNSKFFTSLDLQSGYWQIEISEEDKYKTAFTTGEGLYEFNVLPFGLTGAPATFQRCMSHILMDASHAMVYIDDILIYSIDFDEHLHDITMVLQRLELAGLKIKPRKCEWAKSMVTFLGHLVSSEGIKPDPKNVEKIKNFEKPVKVKGIQRFLGMAGYYRKFIPNFAKIASPLFDLTKK----------------ND-----NTLWTEKHQESFEELKNRLIHFPVLRFPDMNKDFIIMTDASGYAIGAVLGQKDEMLKDHVIAYASRILKSHEVNYSVIEKEALAIVYSVKQFHHYIWSRKIILYTDQRPLQWLMTHKDSSSRLIRWSLLLQEYDIDIKYRQGKANANADFLSRMDEPVQCMVSMLANFDKNELLQAQRADKGLFYIIEDTANIQNNSFANPRYELQSR--ILKYVTGNKVLTVIPDKLQEKIIKEYHDGPMGGHLSAKKTISAIGNRYYWKNLKDDVKEWCKTCNICLRRKGRAP-SRALLKPISSPSTPMETTAMDIMGPLPETTKGNKYILVFSDYFTKWPEAKALPDQKAQTIAKVFIEDIIFKYGAPSKLITDQGTNFLSEVMSEVNEFFKIDKHTTSAYHPQSDGLVERFNRTLEQMLSAYTNERQTDWDEYIAPCLFAYRNT 2046          

HSP 2 Score: 403.675 bits (1036), Expect = 9.670e-109
Identity = 255/739 (34.51%), Postives = 406/739 (54.94%), Query Frame = 3
Query: 3282 AIIAYDCQNPKLGQKYSLLDVENCPEVNPTKLIVTEPKIFHVYQESDFIHTEAKECIIKFSEESFVCNQVARTMLV--PTWAPEALIITPRECEEAFXXXXXXXXXXXXXXAQAGARVKQSVNVVEWTSAGGYCMGGKYKIWGQIADNIVVRRQYYVELRXXXXXXXXXXXXMMTHKFCMLDESSCDTGESMIVYKIDKYECQLTKLKSLKFRTIRGK-----QFTGAESKRIKDQKRKTVIVQEKDTPTTYMADSEAEAMRFVEKGEAIKCGKAVVKTNYEGIYISSNEIKDAKLKIDKFDVKLSSYFNNKIDYFYHHQLVQLDKVYQATITNDCKLNREILRTKMAVAVTNPDLMAPILFAEKGTFARVVGEVLQTFQCKPVSVSLATNNQCTNELPVISKGETVYLQPITRILTDKTYIPRKIDKCTNLLDPLYQLNDEMWITMSDRKEAXXXXXXXXXXXXXXXXXQEINDMNNNGMYTRDAIESARKHMLFPNEKEKILSIMVSKVMEGSHGGDYNFDVLLSKEHFKKVVYKVLYSIWGYFAVLGNMFSTILGIYYTVALLKMICSSLVSLRQLRQVFGNSYKMLACLCPFVAKYLITAK-HDK--------EIRLIKNRRVEEEQLMENDKNDDNPSGSENVEQPQRGLYDSQNQQLRELSNNLDNKITNCHCVTRKTYGCYGIEANEILRELNEARPSIQVLIDNIIIKALVDTGATASMIREDQLSENRKK 5450
            AI AYDC N KLG ++SLLD E CPE  P +      +I+H+YQE   I++ AKEC +        C   + + ++  P+   E L I  R CE+AF+ K+I+  D + LKA+ G  +++ +  V   +  G C GG+Y   G+     +V   Y + ++K + +FD  + +MM+  +C    + C  G++ I+Y I K EC L  LK++ F  IRGK      F    + R K   R    +    TP   +A+  ++A+R V K + +KCG++V  TNY+ I +S+  +K+A +KI K ++ L+ YFNNK DY YHHQL Q++ +Y+  I NDCKLNREILRTKMA+ VTNP++ AP++   KG F+RV+GEVLQTF+CK V V +  ++ CT+ELPVI K + +YL+P+TR+L       +KI KC+ L  P Y+L+D  WI          P + +LT L   ++F  + D+  +G+Y + A+E ARK++L+P  +++IL+ +V   +    G   N+++LLS  HF+K    V+  +WG F + G   + ++GIYY V  LK     + S  +L ++ G S+K++  + P +A+  I+ K HDK          R   + R EE  +  +D      +G +N E  +R   +   +Q+  L N+     +  + V         +    +    N   P I V I+     AL+DTG+ A+++  + L    ++
Sbjct:  390 AIKAYDCDNAKLGTQFSLLDAEECPEAYPNQFEKRFNRIYHIYQERGLIYSAAKECTVTRRRSIRWCGHHSHSAIIKQPSMI-EYLNIGHRNCEDAFRLKEIRLADKLILKAKPGILIQEDIIKVGSIAFDGTCEGGEYWYQGKKIKQALVIETYQITIKKSEVAFDPDTNEMMSRSYCSAKSNFCFDGKTSIIYDIRKQECNLVFLKTVNFDVIRGKIFDNNLFIDKLNSRPKKIYRTEKTINSHITPVVLIANKTSDAIRLVRKDQVVKCGQSVYLTNYDRIVVSTVRVKEAIIKISKKEINLAVYFNNKADYLYHHQLRQIEDLYREMIINDCKLNREILRTKMALIVTNPNIAAPVIPLGKGVFSRVLGEVLQTFKCKQVEVKINISSDCTHELPVIYKNKFMYLEPVTRLLLPDVIKVKKI-KCSPLFSPAYKLSDNTWIVTPSLTPIAPPKRFQLTNLRNAVKFSRLKDLMKSGLYDKKAMEDARKYLLYPQVRDRILTEIVETSL--GDGNRPNYELLLSPNHFEKAAKNVMKKVWGRFLIFGQAIAGLMGIYYVVIFLKTFIEQVTSTYELYKIMGFSWKLILGIIPCLARPFISKKLHDKINDVEKKYANRHTYSNRDEEINVHVDDTETTVLNGDQNEE--RRRTVNVAGEQIYPLLNSKKKDESGANPVKGDNITFVRLYVGLV---DNVESPKILVQINGRTRIALIDTGSGATLLNYEGLKPQYRR 1119          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A147BAZ1 (Putative retrovirus-related pol polyprotein from transposon (Fragment) OS=Ixodes ricinus OX=34613 PE=4 SV=1)

HSP 1 Score: 556.984 bits (1434), Expect = 3.385e-165
Identity = 294/772 (38.08%), Postives = 453/772 (58.68%), Query Frame = 1
Query: 6046 NYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIR--VKRGRRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHE----LGALEVFVLVTEKPIDIQAEQNKDLEISKIREEI--RKGTNQKY*ISKKLVVIEDFSG------KQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHN 8313
             ++   ++ +L++LL+ YS VFA+ + DLG  D+IKH I+T +  P+ QR Y IPY  R+E+ +QIR   + G    +K    G       +     R+ +DYR+LN VT  D YP P  QE   +L   +YF+V+D   G+ QI+M+  D  KTAF    G YE+ R+P GL N+PA +QR  + IL  +  Q C VYM+DI+I S+ F+ HL+DI  VL+RL+ A +K+KP+KC++      +LGHIV+ +G+ P+P+K E ++NFP PT +++++ FLGL GYYR+FI ++AKIA P+  LT    T                      +W    + A+ +L++ L+TAP+L YPDF   FI+  DAS  A+ AVL Q +  G +  + +AS+ L  A++ YS  EKEC A+++A + F+ Y+YG    I TD +PL  L   +D  SRL +W +++Q+   +II+KPG  + NADALSR  +    +GA+  FV V E    I+ EQ +D E+ KI + +      +Q Y   +  ++ +   G      ++  V+P+ L + +L  YH+    GH   +KT  ++  KY W  + +DVK +C +C  C  RK +G + +APL        P E  A+D +  LP T  GNK++LVF D+FTK+AEA P  D+ A+TVA+  V +I+ + G P++LLTD+G NF+S  +K    +  + K+ TTAYHP+++G VER N T+   L+  V ++Q DWD ++    + +N
Sbjct:  198 GHIRAEERGRLESLLNGYSAVFARSKLDLGRCDVIKHRINTTSDAPVYQRAYRIPYSQRDEMAKQIRDLEEEGIVEPSKS-PWGAPALLVEKPDGSSRLVVDYRRLNAVTRVDPYPIPDIQETLAQLGSARYFSVVDLAAGFWQIEMDPRDKNKTAFNTPSGHYEWNRMPMGLVNSPAVWQRTADVILEGLIGQSCHVYMDDIVIYSRDFDSHLRDIERVLKRLRAAGLKLKPSKCQFLRSEVKYLGHIVSADGVRPDPEKEEAIRNFPRPTKVREIREFLGLVGYYRRFIDNFAKIAKPLTTLTSKYATF---------------------RWGDSEEGAYCKLRDMLLTAPVLGYPDFGSPFILATDASQYALGAVLSQVK-NGIERPMAFASRQLNKAEVNYSATEKECLAVIWATRHFRCYLYGRRFKIITDCRPLRWLMNVRDPGSRLARWNLQLQEYDYEIIHKPGSGHTNADALSRARDPPSTVGAVLSFVPVFEDGT-IKDEQERDPELRKISDRLAGNDSDDQGYFRDRMGLLRKGGPGQREGRARERTVIPKSLVEKVLYAYHNAPYSGHFGFKKTLRKITAKYVWRGMHRDVKSYCASCESCQLRK-SGQRRRAPLELFGEVKEPFERTALDIVGPLPITTEGNKYVLVFVDHFTKFAEALPLRDQKAETVARAFVERIVLRHGVPQQLLTDQGTNFVSGLMKETCRLLGVKKLTTTAYHPESNGAVERLNKTIKGLLSHLVARDQRDWDLWMPYVLFSYN 944          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A0V1MMY1 (Transposon Ty3-G Gag-Pol polyprotein OS=Trichinella papuae OX=268474 GN=TY3B-G PE=4 SV=1)

HSP 1 Score: 529.635 bits (1363), Expect = 2.021e-155
Identity = 301/884 (34.05%), Postives = 478/884 (54.07%), Query Frame = 1
Query: 5803 IMLTDECVIPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGELEQTLEMCETMVLRDDSEQMVELSKTNNVKFYDKV--KINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGR---RNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQ--HCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKPIDI-----------------QAEQNKDLE-ISKIREEIRKGTNQKY*ISKK------------------LVVIEDFSGKQLIVVPQK-LRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQ-NDWDEYVSQCTYIHNIT 8319
            + +    V+P+ G +P+++ N     V IH+G         ++  T+E  + + L    +    L  T   K  D+V  +  +  L+      L+ LL +++DVF+ ++ D+G T   +H I+TG+A+PI+  P  IP+  RE++   +     +      T  +T  +      +    +R C+D+RKLN VT KDSYP P+  E  D L   ++F+ +D   GY Q+ + + D EKTAF    GLY++  +PFGL NAPATFQR+M+  L  ++   C VY++D++I  +TF+EHL ++A VLQR++++ +K+KP KC+      LFLGHIV+ +G+  +P K E V ++P PT+  +V+ FLGL  YYR+F++ +A IA P+  LT+  +                     Q  W++E ++AF +LK  L TAPIL +P F   FI+  DAS   + AVL Q+ +   + VI YAS+TL   + +YST  KE  +IV+  K F+PY+ G   ++RTDH  L  L   K+   ++ +W   +Q+  ++++++ GR + NADA+SR  E+         TE P                    +A Q++++E + +     R+ TN ++    K                  L V+   +GK    VP K  R  +L Q H+G  GGHL  +KTA ++  +Y+W    +DVK WC  C  CA RK      KAP+  I    +PME+ A+D L  +P + NGN +I+V +DYFT+W EA+   ++ A+TVA+ LV + + + GTP KLL+D+G  F    +  +  +  + KI TTAYHPQ DG+VERFN TL   L   + + Q +DW+  + Q  + +N +
Sbjct:  162 VAIARAVVLPREGCVPVQLLNPSGDRVTIHRG---------KVVATIEAVDPLPLGRSPQTHAALP-TAVEKLLDEVAQRTTSEELD-----NLRALLTDFADVFSTYDGDIGRTTQAEHHINTGDAQPIRLCPRRIPWHFREQMNELLTDMLNKDIIEPSTSPWTAPVVLVKKKDG--NVRFCVDFRKLNLVTKKDSYPLPRIDETIDTLAGAEWFSTLDLTSGYWQVPVAKEDREKTAFCTPKGLYQFKVMPFGLCNAPATFQRVMDLTLTGLKWNKCLVYLDDVVIFGRTFQEHLNNLAEVLQRIRQSGLKLKPAKCRLCAKEILFLGHIVSRDGVRTDPSKTEKVASWPTPTSTSEVRTFLGLASYYRRFVKSFASIARPLHRLTEQGR---------------------QFSWSNEAEEAFQRLKRALTTAPILAFPRFDIPFIIDTDASETGIGAVLSQKHDPEGERVIAYASRTLSKTERKYSTTRKELLSIVYFTKLFRPYLVGQRFILRTDHDSLTWLRNFKEPEGQVARWLEHLQEYDMEVVHRRGRQHNNADAMSRRPEVTNDNGDHRTTEMPSAAVGAAAVSLSLKEGTEPSEAPQDENIECVIRFLRRGRRPTNSEWRELNKDSRDLVKQWKHLRLTPAGLAVV--VTGKPTRWVPPKHARLSILEQLHNGIGGGHLGVKKTAEKVKIRYFWPGWYRDVKAWCERCEACARRKTPPIVNKAPMESI-VVGNPMEIIAVDILGPVPRSRNGNSYIMVVTDYFTRWVEAYALPNQQAETVARKLVQQFVCRFGTPMKLLSDQGTQFQGRLVTELCKLLGIEKIRTTAYHPQCDGMVERFNRTLAMMLTTAMEEAQDDDWETQIPQVCFAYNAS 1004          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A0P5WAS7 (Retrovirus-related Pol polyprotein from transposon OS=Daphnia magna OX=35525 PE=4 SV=1)

HSP 1 Score: 524.628 bits (1350), Expect = 3.174e-154
Identity = 292/787 (37.10%), Postives = 457/787 (58.07%), Query Frame = 1
Query: 6067 QFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYK----LREEIKRQIRVKRGRRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEV----FVLVTEKP------IDIQAEQNKDLEISKIRE------------EIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKL-PTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLS--RLAVFVGKNQNDWDEYVSQCTYIHNITQKVGT 8334
            Q  L  LL+++ D+FA  + +LG+T++IKH IDT    PI+QRPY +       L ++++  +  K  R +++ + +  +      +N  EIR C+DYRKLN +T KDS+P P+  E  DKL   K+FT +D   GY QIQ+ + D EKTAFV+E+ LYE+ R+ FGL NAPATFQRLMN +L DV      VY++D++I S TFE HL DI  +   LK A +K+K  KC++A+ S  +LGH+++ +GI P+P KI+ + N+  PT++ +V+ FLGL GYYR+FI+D+  IA P+  LT                   D   KP   W +E Q AF++L+  L+T P+L YP+F ++F +  DA    + AVL Q ++ G+++ I Y+S+ L  A+++YST EKE  A+V A+K F+ Y+      I +DH+PL+ L   KD + RL +WA+ +   + ++ Y+PGRI++NAD LSR+ ++ +++       L+ EK       IDI+    K +   K  +            EI +GT  ++ +  K     + + +  +V+P  LR ++L + HD  +GGHL+  KT  ++   YYW  ++KD+  +C AC IC    NT +  +A LHP     +P ++  MDFL  + P + NGN  ILV +DYF++W EA    D+TA+T A+ +   II + G P+ +++D+G NF S+  +       + +  TTAY+P ++G  ERFN TL S  R  +  GK+ N W+E +    + +  +    T
Sbjct:  122 QAPLSQLLNDFHDLFASKDSELGNTNLIKHTIDTEGRGPIRQRPYRVTNNQRKLLEDKVQEMLDAKVIRYSQSPWASPVV--LVEKKN-GEIRFCVDYRKLNSITKKDSFPMPRIDETLDKLYGKKFFTTLDLASGYWQIQVHDPDIEKTAFVVENNLYEFERMAFGLCNAPATFQRLMNYVLRDVLGNKALVYLDDVVIFSDTFESHLNDIREIFNLLKAANLKLKLNKCQFAKRSVNYLGHVISTDGIKPDPSKIDKIVNYKTPTSVDEVRSFLGLAGYYRRFIKDFGSIAKPLTRLTH-----------------KDLSRKP-FAWGTEEQVAFEKLRNSLVTPPVLAYPNFNEKFFLFTDACEYGIGAVLSQIQD-GQEHPIAYSSRQLTKAEMKYSTTEKEALAVVDAIKYFRHYLLDKPFEIISDHRPLQWLKNQKDNNGRLGRWAILLAATNYELKYRPGRIHQNADCLSRL-KIASIQTVPNNITLICEKQLEDELCIDIRNYLEKGVLDEKFSQSKPDWAKEIEYFEIIEGTLYRHELPSKDSKRNEINHQ--LVLPYSLRHLVLKELHDAPMGGHLAFYKTYLKVKNNYYWPTMRKDILEYCQACEICTA--NTASTYRALLHPHELAKAPFQVIGMDFLGPITPVSPNGNSCILVITDYFSRWVEAVALKDQTAQTTAECVYKTIIVRHGMPKAIVSDRGTNFTSKLFRYFCKKLKVDQRLTTAYNPASNGETERFNRTLTSMLRKELKDGKHAN-WEEMLDDVLFAYRSSTHSST 880          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A0P4ZTQ7 (Retrovirus-related Pol polyprotein from transposon OS=Daphnia magna OX=35525 PE=4 SV=1)

HSP 1 Score: 523.857 bits (1348), Expect = 6.441e-154
Identity = 293/786 (37.28%), Postives = 453/786 (57.63%), Query Frame = 1
Query: 6067 QFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYK----LREEIKRQIRVKRGRRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEV---FVLVTEKP------IDIQAEQNKDLEISKIRE------------EIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKL-PTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLS--RLAVFVGKNQNDWDEYVSQCTYIHNITQKVGT 8334
            Q  L  LL+++ D+FA  + +LG+T++IKH IDT    PI+QRPY +       L ++++  +  K  R +++ + +  +      +N  EIR C+DYRKLN +T KDS+P P+  E  DKL   K+FT +D   GY QIQ+ + D EKTAFV+E+ LYE+ R+ FGL NAPATFQRLMN +L DV      VY++D++I S TFE HL DI  +   LK A +K+K  KC++A+ S  +LGH+++ +GI P+P KI+ + N+  PT++ +V+ FLGL GYYR+FI+D+  IA P+  LT                   D   KP   W +E Q AF++L+  L+T P+L YP+F ++F +  DA    + AVL Q ++ G+++ I Y+S+ L  A+++YST EKE  A+V A+K F+ Y+      I +DH+PL+ L   KD + RL +WA+ +   + ++ Y+PGRI++NAD LSR+       V     L+ EK       IDI+    K +   K  +            EI +GT  ++ +  K     + + +  +V+P  LR ++L + HD  +GGHL+  KT  ++   YYW  ++KD+  +C AC IC    NT +  +A LHP     +P ++  MDFL  + P + NGN  ILV +DYF++W EA    D+TA+T A+ +   II + G P+ +++D+G NF S+  +       + +  TTAY+P ++G  ERFN TL S  R  +  GK+ N W+E +    + +  +    T
Sbjct:  122 QAPLSQLLNDFHDLFASKDSELGNTNLIKHTIDTEGRGPIRQRPYRVTNNQRKLLEDKVQEMLDAKVIRYSQSPWASPVV--LVEKKN-GEIRFCVDYRKLNSITKKDSFPMPRIDETLDKLYGKKFFTTLDLASGYWQIQVHDPDIEKTAFVVENNLYEFERMAFGLCNAPATFQRLMNYVLRDVLGNKALVYLDDVIIFSDTFESHLNDIREIFNLLKAANLKLKLNKCQFAKRSVNYLGHVISTDGIKPDPSKIDKIVNYKTPTSVDEVRSFLGLAGYYRRFIKDFGSIAKPLTRLTH-----------------KDLSRKP-FAWGTEEQVAFEKLRNSLVTPPVLAYPNFNEKFFLFTDACEYGIGAVLSQIQD-GQEHPIAYSSRQLTKAEMKYSTTEKEALAVVDAIKYFRHYLLDKPFEIISDHRPLQWLKNQKDNNGRLGRWAILLAATNYELKYRPGRIHQNADCLSRLKVASIQPVPNNIKLICEKQLEDELCIDIRNYLEKGVLDEKFSQSKPDWAKEIEYFEIIEGTLYRHELPSKDSKRNEINHQ--LVLPYSLRHLVLKELHDAPMGGHLAFYKTYLKVKNNYYWPTMRKDILEYCQACEICTA--NTASTYRALLHPHELAKAPFQVIGMDFLGPITPVSPNGNSCILVITDYFSRWVEAVALKDQTAQTTAECVYKTIIVRHGMPKAIVSDRGTNFTSKLFRYFCKKLKVDQRLTTAYNPASNGETERFNRTLTSMLRKELKDGKHAN-WEEMLDDVLFAYRSSTHSST 880          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000037150.1 (pep primary_assembly:Astyanax_mexicanus-2.0:10:22265216:22268869:1 gene:ENSAMXG00000033912.1 transcript:ENSAMXT00000037150.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 451.44 bits (1160), Expect = 1.903e-132
Identity = 271/819 (33.09%), Postives = 434/819 (52.99%), Query Frame = 1
Query: 6052 LNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIK---RQIRVKRGRRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAV--YMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELT---KGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQL---RYSTIEKECYAIVFAL-KQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSR-------------------------IHELG-ALEVFVLVTEKPIDIQAEQNKDLEISKI--------------REEIRKGTNQKY*ISKKLV---------VIEDFSGKQL--IVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            L+E Q   +  L D+Y D+FA+ E DLG T ++ H I   +  P++Q    IP    E +K   +Q+   +  RN    ++  I   T  +    +R+C+DYR+LN  T +D+YP P+ +E  D L   K+F+ +D   GY Q+ + E D  KTAF    GL+E+ R+PFGL NAPATFQRLM  +  D ++ +V  Y++D+++ S + E+HL+ +A V  RL+K  +K+K +KC + +P   +LGH+V+ EG+  +P KI+ V+ +  P+ L +++ FLG   YYR+F++ ++K+AAP+ +L     G + KG++  V +  +           W    +KAF  LKE+L +AP+L Y DF K FIV +DAS   + AVL QE+E GK   I +AS+ L+ A+     YS+++ E  A+ +A+ ++++ Y+ G EV I TD+ PL  L   K  ++   +WA ++   + KI Y+PG+ N+NADALSR                         +H+ G   E+  L     +D+   Q  D  I  +              RE +   T   +    +L+         V     G +   +++P+ L++ +L   HD    GH    +T  +L  + +W  + K V+RWC  C  C   K    K +A    + A T P E+ A+DF    P + +G +++LV +D FTK+ +A PT D+ A TVA++LV     + G P ++ +D+G NF S  I+++ NM+ + K   TAY PQ +G  ERFN TL   L       +  W  Y+S   + +N T
Sbjct:  280 LSEEQMAAVSLLFDQYQDIFAQTEGDLGCTTLLTHEIPLLDEVPVRQPYRRIPPSQYEAVKLHIQQLLDSKVIRNSASPYSSPIVLVTKKDG--SLRLCVDYRQLNAKTRRDAYPLPRIEESLDALAGAKWFSTLDLASGYNQVPVSEKDRYKTAFCTPFGLFEFNRMPFGLCNAPATFQRLMERMFGDCRYQSVLLYLDDVIVFSSSVEQHLERLAEVFSRLQKQGLKVKLSKCHFFQPQVNYLGHVVSREGVATDPAKIDAVRGWRRPSHLAELRSFLGFASYYRRFVEGFSKLAAPLHQLVGKLGGARRKGKTLPVPLAAS-----------WDERCEKAFQSLKERLTSAPVLAYADFSKPFIVEVDASHGGLGAVLSQEQE-GKVRPIAFASRGLRPAERNMDNYSSMKLELLAVKWAVTEKYREYLLGNEVTILTDNNPLSHLQTAKLGATE-QRWASQLASFNFKIKYRPGKSNQNADALSRQYVDRFAIGTKVPPLRMELLESEPMVHKTGQCTEMVALAGRSALDLHLLQEADPVIGPVCKFRKEGRYPRAEEREALSSPTKALFRQWDRLLEKDGVLYRAVQPSGGGPETCQLLLPKHLQEEVLSSVHDDH--GHQGVERTLKQLQSRCFWPGMAKHVERWCQQCRRCVLSKAVQPKIRAFQGTLQA-TRPHEILAIDFTLLEPAS-DGRENVLVLTDVFTKYTQAIPTRDQRASTVAQVLVQHWFHRFGLPSRIHSDQGRNFESMLIQQLCNMYGIQKSRATAYRPQGNGQCERFNRTLHDLLRTLPQSEKRRWPHYLSPMVFAYNTT 1079          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000052546.1 (pep primary_assembly:Astyanax_mexicanus-2.0:2:13031363:13036570:1 gene:ENSAMXG00000033629.1 transcript:ENSAMXT00000052546.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 449.899 bits (1156), Expect = 1.827e-128
Identity = 281/950 (29.58%), Postives = 483/950 (50.84%), Query Frame = 1
Query: 5725 IPVKVNVKSEDTLIFEPDKSITKNHNIMLTDECVIPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGE---LEQTLEMCET-------MVLRDDSEQMVELSKTNNVKFYDKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIR--VKRG--RRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQH--CAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQ---LRYSTIEKECYAIVFAL-KQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLL---KWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEV------FVLVTEKPIDI---QAEQNKDLEIS---KIREEIRKGTN----------QKY*ISKKLVVIEDFSGKQLIVVPQ-----KLR----QILLLQYHDGAL---------------------------------GGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHN 8313
            +P KV  +    ++ EP   +     +M+ +      +G +P+R+ N     +++   ++L  + K +     QT+ + E         + RDD      + + N  +    V++N + L  +Q  +L  LL+++ D+F+K + D G T  + H I TG+A P+KQR   +P ++ +E K+ ++  V RG    + + + +  +      +    +R C DYR+LN VT KD+YP P+ +E  D L + ++F+ +D   GY Q+ +++ D EKTA     GL++++R+PFGL NAPATFQRLM  +L D+      VY++DI+I S+ F+ H + +  V  RL+   +K+KP+KC   +P   FLGHI++ +G+  + +K+ V++ +P+P T++ ++  LG   YYR+F+  +A++A P+  L  G   +G + G +               W+ E Q AFD+LK  L++ P+L YPDF + FI+  D S   + AVL Q ++ G + VI YAS+ L+G++     YS  + E  A+ +A+ ++F+ ++   +  + TDH PL    ++ D+++  +   +W  ++ + +  I YKPGR N NAD LSR+      E       F+L+  + +      A + +++E +    ++  IRKG +          QK     + V      GK L    Q     KLR    Q   L+   GAL                                 GGH   R T   + Q YYW ++ +DV+ W   C  CA  K+   + +AP+    + ++P+E+ AMD+   L  +  G +++LV +D FT++  A PT ++TA T AK L+       G P +L +D+G NF S  IK +  ++ + K  T+ YHPQ +   ERFN TL   L     + + +W EY+ +   ++N
Sbjct:  569 VPPKVTCQ----VLVEPAPKVNMPKGLMVANVLAKAADGKVPVRVLNVSGCPIRLMPRSKLASVHKPQEVLSRQTVVLEEGDGVLHVRAIHRDDL-----VPECNEGQLPVPVQVNADGLTSSQYQELMDLLEKHKDIFSKSDSDFGYTTAVTHSIPTGDAPPVKQRHRRVPPQVFQEFKKHVQSLVDRGILEESCSPWASPAVIVI---KKDGTVRFCCDYRRLNQVTCKDAYPLPRVEESLDALGNAQWFSTLDLTAGYFQVAVKDSDREKTAVTTPFGLFQWSRMPFGLCNAPATFQRLMGVVLGDLAFDILLVYLDDIIIFSRDFKSHCERLELVFNRLRHHGLKLKPSKCFLLKPEVKFLGHIISAKGVQVDAEKVRVLETWPIPKTVKDIRQVLGFMSYYRRFVPKFAQLAKPLHALVGGKMNRGRATGPIT--------------WSGECQTAFDKLKGCLMSPPVLAYPDFTQPFILTTDGSLHGLGAVLSQ-KQGGTERVIAYASRGLRGSEKNDKNYSAFKLELLALKWAVTEKFRDFLMFAKFSVVTDHNPL----RYLDSANLGVVEQRWVAQMAEYNFDISYKPGRQNANADVLSRLPAHEEPEAKDTGKDFILINTEEVRACLWPAAERREVEPAVHVAVQASIRKGVSGYSWGEIEEQQKCDPDIRPVYSAVLRGKNLSPAEQRAMTPKLRKLSKQFDRLKLRHGALFRCICHPRDGEGVWQLIVPESLQRKVYESQHEHGGHFGERSTLEMMRQSYYWPSMSRDVQDWIKQCKRCALAKDVLPRNRAPM-TCSSVSAPLEVLAMDYTL-LERSSGGYENVLVLTDMFTRFTVAVPTKNQTAHTTAKALLKHWFVHYGCPARLHSDQGRNFESHVIKELCKLYGIAKSRTSPYHPQGNANCERFNRTLHDMLRTLPPEKKRNWKEYLPELVMVYN 1485          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000041682.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02002320.1:24977:29035:1 gene:ENSAMXG00000038531.1 transcript:ENSAMXT00000041682.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 442.58 bits (1137), Expect = 2.613e-128
Identity = 278/920 (30.22%), Postives = 485/920 (52.72%), Query Frame = 1
Query: 5755 DTLIFEPDKSITKNHNIMLTDECVIPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGELEQTLEMCETMVLRDDSEQMVELSKTNNVKFYDKVKINNNYLNETQQFKL-QTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGR---RNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVD--VQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAP---MIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQ---LRYSTIEKECYAIVFAL-KQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIH-------------------ELGALEVFV----LVTEKP-IDIQAEQNKD----------------------------LEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
             +  FEP  S T    ++++   +   +G + + + N E  ++ +     LG+LF  E++ TL   +   L   SEQ+V +      + +  + + +      +Q +L + LL  YS VF++ E +LG T +I+H I   +  P+KQR   +P    + +K  I+    R   R     ++  +      +    IR+C+DYR+LN  T KD+YP P+ +E  D L   ++F+ +D   GY Q+ M E D  KTAF    GL+E+ R+PFGL NAP+TFQRLM  I  D   Q   +Y++DI+I S TF+ HL+ +  VL+RL++  +K+K +KC + +    +LGH+++  G+  +P+KI+ V  +  P T+ Q++ FLG   YYR+F++ +AK A+P   ++ + +G K + ++K V             +  W+   ++AF+ LK KL++AP+L Y DF K FI+ +DAS   + AVL Q++E G+   I YAS+ L+ ++     YS+++ E   + +A+ ++F+ Y+ G +  + TD+ PL  L   K  +    +W  ++   +  I Y+PG  N+NADALSR+                    ++GA +  +     V  +P  D+Q  Q+ D                            LE+ +   ++R+     Y +++    +E+F     +V+P+ L++ +L   HD    GH  + +TAS + Q+ +W ++ K ++RWC  C+ C   K    K +  +  + A + P+E+ A+DF   +    +G +++LV +D F+K+ +AFPT D+ A TVA +LV K  +  G P+++ +D+G NF S+ +K +  ++++ K  TT YHPQ +G  ERFN T+   L     + +  W +Y+ Q  + +N T
Sbjct:  182 SSAFFEP-HSHTLPDGLLMSRALLSIDSGSVAVPVVNVEHRDIWLPPRVTLGQLFAVEMQPTLSTGKVEELFHCSEQVVAVQSLAVAEDFSDLTVGSWPTLTPEQSQLGKDLLQRYSSVFSQDEGELGCTHLIEHEIPLIDDTPVKQRYRRLPPSQYDLVKGHIQELLDRKVIRASCSPYSSPVVVVQKRDG--TIRLCVDYRQLNSKTRKDAYPLPRIEESLDALGGARFFSTLDLASGYNQVPMAEHDKSKTAFCTPFGLFEFNRMPFGLCNAPSTFQRLMERIFGDERFQSLLLYLDDIVIFSSTFDLHLQRLEVVLKRLQQNNLKLKLSKCHFFQSQVKYLGHVISSAGVATDPEKIKAVSEWERPQTVTQLRSFLGFASYYRRFVEGFAKHASPLHRLVAVLQGGKRRVKTKPV-------------EGHWSDACEEAFETLKLKLVSAPVLGYADFSKPFILEIDASHGGLGAVLSQDQEGGRR-PIAYASRGLRDSERNMSNYSSMKLELLGLKWAVTEKFREYLLGAQFTVYTDNNPLSYLQTAKLGAVE-QRWVSQLALFNFNIKYRPGLSNRNADALSRLPACPTPQSFQETVSGISIPLQVGATKATISTIDAVPLRPKADLQRLQSADPVIGPFLQYWHRQKFPTAGERAQESKEVLELVRQWNKLRECDGVLYRLTRTPDGVEEF---LQLVLPECLQKEVLTALHDNH--GHQGAERTASLVRQRCFWPHMWKKIERWCKECSRCVVAKMGQPKIRTFMGNLSA-SKPLEIIAIDFTL-MDRASDGRENVLVITDVFSKFTQAFPTQDQRASTVAHILVEKWFYVYGVPQRIHSDQGRNFESDLLKSLCKIYDVKKSRTTPYHPQGNGQCERFNRTMHDLLRTLPSEQKRRWPKYLPQLLFAYNTT 1076          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000041345.1 (pep primary_assembly:Astyanax_mexicanus-2.0:25:32334536:32339002:1 gene:ENSAMXG00000029230.1 transcript:ENSAMXT00000041345.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 443.351 bits (1139), Expect = 1.159e-127
Identity = 258/768 (33.59%), Postives = 407/768 (52.99%), Query Frame = 1
Query: 6076 LQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGR---RNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHE------LGALEVFVLVTEKPID---IQAEQNKDLEISKIREEIRKGTNQK---Y*ISKKLVVIEDFSGKQL-----IVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHN 8313
            L  LL ++  V       LG T I+ H I T +  PI+Q+PY +  + ++ IK  I   + R   R  T  +   +      E    +R C+DYR++N  T  D+YP P+ QE  + L     F+ +D   GY Q+ +E     KTAF+   GLYE+T LPFG+ NA ATFQRLM+++LV++  + C VY+ DI++ S T E+HL  +  V + L +A + +   KC   + S +FLGH+++ EGI   P K+E ++ FP P +++++Q FLG+ G+Y +FI  +++ AA +  L K                     N P   WT E QKAF+ +K+ LITAP+L  P+F + F +  DAS + + AVL Q  + G ++V+ YAS+ L+GA+  YST EKEC A+V+A+++++ Y+ G    + TDH  L  ++ H   +SRL +WA+++Q     + Y+ G+ N   D LSRI +      +   +V       P+D   I   Q  D  +   R+E      +K   + ++K  ++      +QL     +VVP + R+  L   HD  L GHL   KT  RLL   YW ++++DV  +C +C IC   K   +K    L   P    P  +  +D +  LP +   N+H+LV  DY +KW E FP  +     + ++L+  I  + GTP  L++D+GA F S  +      + + +  TTAYHPQT+ L ER N TL + +A +V      WD+++ +  +  N
Sbjct:  596 LHKLLSQWPSVCTNQ---LGRTMIVLHHITTNDNLPIRQKPYKVSIEKQQLIKEAIEDMQRRGIVRPSTSPWASPVVLVPKKEG--GVRFCVDYRRMNSKTHLDAYPMPQVQEILESLHGAAIFSTLDLKSGYWQVGLEPDSIPKTAFITCQGLYEFTVLPFGIKNAAATFQRLMDSVLVNLKGKSCFVYINDIVVYSSTIEQHLGHLEEVFRCLHQAGLTLNLRKCNLLQRSLIFLGHVISGEGICTEPGKVEAIQAFPEPRSIKELQRFLGMAGWYHRFITHFSERAAILNALKKK--------------------NAPW-IWTQECQKAFEDIKQALITAPVLTPPNFSEPFQIQTDASDQGLGAVLSQGTD-GLEHVVAYASRLLQGAERNYSTAEKECLAVVWAVEKWRVYLEGRHFTVITDHSALSWVFNHPKPTSRLTRWAIRLQTFDFSVQYRKGKCNIVPDTLSRIPDRMTEGVMAPCQVTGSSDGLPVDWAEIARAQEVDGTLQPQRDETGNQETRKDRIHFVTKNDILFRAVPNQQLGHTLQVVVPVQHREAFLQYAHDNPLSGHLGQMKTLLRLLNIAYWPSIRRDVWTYCKSCEICQKYKPRISKLSGRLQSTP-VVEPGYMLGVDLMGPLPKSPRQNEHLLVIVDYCSKWVEMFPLREAKTSQIVQILIKDIFTRWGTPAYLVSDRGAQFTSRLLHATCRQWGVVQKLTTAYHPQTN-LTERVNRTLKTMIASYVKDKHRLWDQWIPEFRFAIN 1334          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000041754.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02001938.1:55959:59870:1 gene:ENSAMXG00000043385.1 transcript:ENSAMXT00000041754.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 439.499 bits (1129), Expect = 1.163e-127
Identity = 275/818 (33.62%), Postives = 433/818 (52.93%), Query Frame = 1
Query: 6052 LNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIK---RQIRVKRGRRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAV--YMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGV---KTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQL---RYSTIEKECYAIVFAL-KQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSR-------------------IH-ELGALE-----VFVLVTEKPIDIQAEQNKDLEIS---KIREEIRKGTNQK----Y*ISKKLVVIEDF--------------SGKQL----IVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            L + Q  + + L  +YS++FAK E DLG T +I H I   +  P++Q    IP      +K   +Q+   R  R  +  F+  I   T  +    +R+C+DYR+LN  T +DSYP P+ +E  D L   K+F+ +D   GY Q+ +EE D  KTAF    GL+E+ R+PFGL NAP TFQRLM  I  D ++ +V  Y++D+++ S+T EEHL+ +  V  RL+K  +K+K +KC++ +    +LGH+V+ EG+T +P KIEVVK +  P+ L +++ FLG   YYR+F++ ++K+AAP+  L  G+   + KG++    +              W +E ++AF  LK++L + P+L Y DF K FIV +DAS   + AVL QE+E GK   I +AS+ L+  +     YS+++ E  A+ +A+ ++F+ Y+ G +  I TD+ PL  L   K  +    +WA ++   +  I Y+PG+ N+NADALSR                   +H E  ALE     V       P D+   Q  D  I    K R E R+   ++      +SK L+   D               SG+      +++PQ L++ +L   HD    GH  + +T   L  + +W N+ +DV+RWC  C  C   K    K +A          P E+ A+DF    P + +G +++L+ +D FTK+ +A  T D+ A TVA  LV     + G P ++ +D+G NF S  IK++  ++++ K  TT+Y PQ +G  ERFN TL   L     + +  W  ++ Q T+ +N T
Sbjct:  264 LTDLQAQRARALFQKYSNIFAKSEGDLGCTSLISHEIPLLDEVPVRQPYRRIPPSQYSTVKAHIQQLLDSRVIRESSSPFSSPIVLVTKKDG--SLRLCVDYRQLNAKTRRDSYPLPRIEESLDALCGAKWFSTLDLASGYNQVPVEEKDKSKTAFCTPFGLFEFNRMPFGLCNAPGTFQRLMERIFGDCRYQSVLLYLDDVIVFSQTVEEHLERLEEVFSRLQKQNLKVKLSKCQFFQHQVSYLGHVVSAEGVTTDPAKIEVVKEWKSPSHLAELRSFLGFASYYRRFVEGFSKLAAPLHRLVGGLSGPRRKGKTPKTSLAAF-----------WDAECEQAFQSLKDRLTSTPVLAYADFNKPFIVEVDASHGGLGAVLSQEQE-GKVRPIAFASRGLRPTERNMENYSSMKLELLAVKWAVTEKFREYLLGHQFTIYTDNNPLSHLQTAKLGAVE-QRWASQLASFNFTIKYRPGKHNQNADALSRQYLERFAVGTKVPPLVMEAVHEERSALENQCRQVVAFPGRSPSDLGVLQRADPVIGPVWKFRSEGRRPRTEERDTLCNLSKVLIRQWDRLVEREGVLYRRAYPSGRGSEYFQLLLPQCLQKEVLHSVHDDH--GHQGTERTLQLLRDRCFWPNMTQDVERWCQQCQRCTLGKAVQPKVRA-FQGTLQAAHPNEILAIDFTILEPAS-DGKENVLILTDIFTKYTQAIATKDQRASTVAWALVQHWFHRFGPPVRIHSDQGRNFESLLIKQLCKVYSIQKSRTTSYRPQGNGQCERFNRTLHDLLRTLPVEEKRHWPRHLPQLTFAYNTT 1062          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000004121.1 (pep scaffold:Pmarinus_7.0:GL477387:93825:107330:1 gene:ENSPMAG00000003764.1 transcript:ENSPMAT00000004121.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 93.9745 bits (232), Expect = 1.638e-20
Identity = 72/277 (25.99%), Postives = 132/277 (47.65%), Query Frame = 1
Query: 7510 FVLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLV--VIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKA-PLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFM-----SETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNI 8316
            F+ ++E P  I   +NK       +  +RK  N K+ +  K++  V      ++++V+ +  ++ +L+  H GA  GH   +KT  +L   YYW  +  DVK    +C +C   +N G++  A P   +   + P E+  +D L  LP T   N+++L+  DYF+KWAEA P  +++ + VA  L   +  + G PRK+ +  G  F+     S T+ R     +  +    A+      L +  N  L   + +   ++ +DW+  V Q  + + +
Sbjct:   24 FLQMSEFPEHILHNKNK-------KRALRKCAN-KFVLKGKVLYYVGRKRERRRMVVMDEDEKRNILMSVH-GA--GHFGQKKTILKLEADYYWLGMISDVKNLIASCGVC---RNKGSRRVAMPSMKLLKASGPWEVLGLDVLGPLPVTSRANRYLLLLIDYFSKWAEAVPLIEKSQEHVASALTV-VFCRYGFPRKVFSSLGKEFVTQVNKSSTLSRAYQRCSPSQTQVHAHVSVRVALNKATNQALKGCVNLVASQHPSDWESRVEQSLFEYRV 285          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000004123.1 (pep scaffold:Pmarinus_7.0:GL477387:93825:107321:1 gene:ENSPMAG00000003764.1 transcript:ENSPMAT00000004123.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 91.2781 bits (225), Expect = 1.211e-19
Identity = 71/279 (25.45%), Postives = 133/279 (47.67%), Query Frame = 1
Query: 7510 FVLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLV--VIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKA-PLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNM-HKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNITQKVGT 8334
            F+ ++E P  I   +NK       +  +RK  N K+ +  K++  V      ++++V+ +  ++ +L+  H GA  GH   +KT  +L   YYW  +  DVK    +C +C   +N G++  A P   +   + P E+  +D L  LP T   N+++L+  DYF+KWAEA P  +++ + VA  L   +  + G PRK+ +  G  F+++  K     + + H +          G  +  N  L   + +   ++ +DW+  V Q  + + + +   T
Sbjct:   24 FLQMSEFPEHILHNKNK-------KRALRKCAN-KFVLKGKVLYYVGRKRERRRMVVMDEDEKRNILMSVH-GA--GHFGQKKTILKLEADYYWLGMISDVKNLIASCGVC---RNKGSRRVAMPSMKLLKASGPWEVLGLDVLGPLPVTSRANRYLLLLIDYFSKWAEAVPLIEKSQEHVASALTV-VFCRYGFPRKVFSSLGKEFVTQVNKIQHKKWCISHTLQNLCETRGPAGWHKATNQALKGCVNLVASQHPSDWESRVEQSLFEYRVGKHSTT 287          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000009777.1 (pep scaffold:Pmarinus_7.0:GL476990:135790:139231:-1 gene:ENSPMAG00000008837.1 transcript:ENSPMAT00000009777.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 82.4185 bits (202), Expect = 1.243e-16
Identity = 53/180 (29.44%), Postives = 84/180 (46.67%), Query Frame = 1
Query: 7627 KKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLH-PIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNM 8163
            KKL      SG++ +VV  +  +  +LQ   GA   H    +T   L + YYW  +  D++ + NAC IC   K    K  +  H  +   + P E+  MD L   P T   ++ +L+  DYFTKWAE  P  D++A  V   L      + G P+KL  +    ++++  + +   F M
Sbjct:   48 KKLYYTGQKSGRKRLVVMNEEDKRSILQRVHGA--DHCGQTRTRKLLEEHYYWKGMVNDIRDYINACEIC---KQKSYKRSSISHVKLLKASYPWEVLGMDLLGPFPATSRAHRFVLLIVDYFTKWAELTPMTDQSAAHVVAALTT-AFHRFGFPKKLFCNVSEEYVAQINEEMFRHFPM 221          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Match: EDO33875 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7SR01])

HSP 1 Score: 54.299 bits (129), Expect = 1.102e-8
Identity = 25/86 (29.07%), Postives = 52/86 (60.47%), Query Frame = 1
Query: 6634 EDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGY 6891
            +D++    +FEEHL+ I  +LQ ++ +  K    K ++A  S  FLGH++   G+ P P+K++ ++ +  PT  ++++ F+G+  +
Sbjct:    1 DDVICFHSSFEEHLRGIERMLQAVRASGFK-SIKKSQFATRSVKFLGHVIDQNGVRPQPEKLD-IRQWETPTNEEELRKFIGVCTF 84          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000036827.1 (pep primary_assembly:ASM223467v1:16:28613823:28617539:1 gene:ENSORLG00000023550.1 transcript:ENSORLT00000036827.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 456.062 bits (1172), Expect = 8.145e-134
Identity = 275/835 (32.93%), Postives = 421/835 (50.42%), Query Frame = 1
Query: 5971 DDSEQMVELSKTNNVKFYDKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGRRN-----KTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSR----------IHELGALEVFVLVTEK--------PI----------DIQAEQNKDLEISKIREEIRKGTNQKY-------*ISKKLVVIEDFSGKQL--------------------IVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYV 8289
            D    +VE  +   VK  +  + + + L   Q+ +L  +L EY D+FA  E ++G T ++ H IDTG+ARPIK RP  +P  L  ++     ++   R          +  G+      +   ++R C+DYR LN VT KDSYP P+  E  D +  + +F+ +D   GY Q+ +      KTAF    GL+++  L FGL NAPATF+RLM  +L  +  Q C VY++DIL+   +F+  L+ +  VLQR+  A +K+ P KC +      FLGH +  EGI+   +K++ V+++P PTTL+ ++ FLGL  YYR+F++ ++ IAAP+  L +        K    V             WT E ++AF  LK+ L  +PIL  PD K  FI+  DAS   + AVL Q    G + V+ Y SKTL  A+ RY    +E  A+V A+  F+ Y+ G    +RTDH  L+ L   K+   ++ +W  ++      + ++PG  + NADA+SR            +  A E  +   E+        P+          + +A Q +D+++  + + +  G   ++         SK L   E F   +L                    +VVP+ LR  +L   H  A  GH    KT  R+ Q +YW  L++DV+ +C  C IC   K    +++A L  + A  +PME  A+D +   P T  GN+ +LV  DYFTKW EA+   D+ A TVA  LV  +  + G    + +D+G NF S     +     M K  TT  HPQ+DGLVERFN TL+ +LA+    +Q+DWDE++
Sbjct:  204 DQRNAVVEADRVTAVK--EIWRRSCDGLQPGQKDELWKVLLEYRDIFALSEDEVGLTHLVHHEIDTGDARPIKTRPRRLP--LAHQVAADSAIEEMLRGGIIEPSDSPWASGVVMVKKKKG-PKMRFCVDYRPLNGVTKKDSYPLPRIDESLDLVSGSSWFSSLDLRSGYWQVPLSPAARPKTAFCTGRGLWQFRVLSFGLCNAPATFERLMEKVLASIPRQECLVYLDDILVHGGSFKAALESLRKVLQRIAAAGLKLHPDKCCFMRRELEFLGHKIGGEGISTLEEKVQAVRDWPTPTTLRDLKSFLGLASYYRRFVRGFSCIAAPLFHLQR--------KDCDFV-------------WTQECEQAFSSLKKALTNSPILTPPDPKLPFILDTDASDVGMGAVLSQMGSAG-ERVVAYFSKTLSKAERRYCVTRRELLAVVKAIGHFRYYLCGLPFTVRTDHSALQWLMTFKEPEGQIARWLEELASFSFTVEHRPGSRHANADAMSRRPCALAGCQYCEKREAREAVISREEQSHTGESSWPVCRLVQGVDSTEWRAHQEQDVDLQPVLQWVEAGRKPEWGEVAGCSPGSKGL--FEKFDALRLKDGVLQRAWKEPATGEERWQVVVPRTLRNSVLQGCHGAAGSGHFGVSKTLRRIRQGFYWGQLRRDVEDFCRRCDICTAHKGPPDRSRAELQQL-AAGAPMERVAVDIMGPFPRTNRGNRFVLVAMDYFTKWPEAYAIPDQEAVTVADALVEGMFSRFGAAEVIHSDQGRNFESAVFSAMCERLGMRKTRTTPLHPQSDGLVERFNRTLVKQLAILTSAHQSDWDEHL 1008          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000032171.1 (pep primary_assembly:ASM223467v1:18:2114932:2117958:1 gene:ENSORLG00000025397.1 transcript:ENSORLT00000032171.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 432.95 bits (1112), Expect = 3.461e-128
Identity = 271/816 (33.21%), Postives = 431/816 (52.82%), Query Frame = 1
Query: 6073 KLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIR--VKRG--RRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVD--VQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQL---RYSTIEKECYAIVFAL-KQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLV--TEKPIDIQAEQ------------------------NKDLEISK---------IREEIRKGTNQKY*ISK--------------------KLVVIEDFSGKQL--IVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            K Q LL +Y++VFA HE DLG T ++ H I   +  P++QR   IP    E +K  I   +  G  R + + + +  +    + +    +RMC+DYR+LN  T +D++P P+  E  D L   ++FT +D   GY Q+ + EGD  KTAF    GL+E+ R+PFGL NAP TFQRLM  I  D   Q   +Y++DI++ S T +EHL+ +  VL RL++ ++++K  KC + +    +LGH+++ +G++ +P KIE V  +  P+T+ +++ FLG   YYR+F++ +A++AAP+  L       GE  G    ++   K +  Q  WT+E Q+ FD LK+KL +AP+L Y DF   F + +DAS   + AV  Q E+ G    I YAS+ LK  +     YS+++ E  A+ +A+ ++F+ Y+ G   V+ TD+ PL  L   K       +W  ++     +I Y+ GR+N+NADALSR     + EV  +   +  P D+Q  Q                         +DL++S           R++     +++  +SK                    + V++ D SG+++  +++P+ L   +L Q H     GH    +T + L  + YW  + KDV  WC AC  C   K+      APL  + A + P EL AMDF    P+   G +++LV +D F+K+  A PT D+ A TVAK+LVA+   K G P +L +D+G +F S+ I+++  ++ + K  TT YHP  +G  ERFN TL + L       + DW   + Q T+ +N T
Sbjct:  185 KAQALLQKYANVFAAHEGDLGCTTLMTHEIPLLDDAPVRQRHRRIPPSEYEAVKDHINQLLASGVIRESNSPYASPIVL---ARKKDGSLRMCVDYRQLNSKTRRDAFPLPRIDESLDALSGARWFTTLDLASGYNQVPVTEGDRAKTAFCTPFGLFEWNRMPFGLCNAPGTFQRLMQRIFGDQQCQSVLLYLDDIVVFSSTIDEHLERLELVLGRLQQEKLRVKLPKCAFFQQEVRYLGHVISDQGVSTDPHKIEAVAGWQPPSTVSELRTFLGFASYYRRFVEGFARLAAPLHRLV------GELDG---TKSRRRKASSLQGHWTTECQQNFDALKQKLTSAPVLAYADFTLPFFLEVDASHNGLGAVFPQ-EQGGSVRPIAYASRGLKATERNMQNYSSMKLEFLALKWAMTEKFREYLLGHHCVVFTDNNPLSYLSTAK-LGEMEQRWVAQLAAFDYEIKYRSGRVNRNADALSRHPNHSSAEVGNMAPGSSLPRDLQQVQVQPVLAHCEMEQLMPVFPQRTTAEVQDLQVSDPVLASVLPFWRDQRYPNYSEREGLSKVALTLLRQWKNLAEVDGLVYRRVLMPD-SGQEVFQLLLPEILIPEVLEQVHQHH--GHQGIERTLALLRARCYWPGMSKDVAHWCQACERCQLAKDNSRSHSAPLGHLIA-SRPNELVAMDFTILEPSR-TGVENVLVLTDVFSKYTVAIPTRDQRAATVAKVLVAEWFSKFGVPARLHSDQGRSFESQLIQQLCGLYGIEKSRTTPYHPAGNGQCERFNRTLHNLLRALQVTRKRDWHSCLPQVTFCYNTT 981          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000041674.1 (pep primary_assembly:ASM223467v1:5:30088905:30092924:1 gene:ENSORLG00000022054.1 transcript:ENSORLT00000041674.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 439.884 bits (1130), Expect = 1.666e-127
Identity = 274/815 (33.62%), Postives = 430/815 (52.76%), Query Frame = 1
Query: 6073 KLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIK---RQIRVKRGRRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVD--VQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQL---RYSTIEKECYAIVFAL-KQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLV--TEKPIDIQAEQ------------------------NKDLEISK---------IREEIRKGTNQKY*ISK--------------------KLVVIEDFSGKQL--IVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            K Q LL +Y++VFA HE DLG T ++ H I   +  P++QR   IP    E +K    Q+      R     +   I      +    +RMC+DYR+LN  T +D++P P+  E  D L   ++FT +D   GY Q+ + EGD  KTAF I  GL+E+ R+PFGL NAP TFQRLM  I  D   Q   +Y++DI++ S T +EHL+ +  VL RL++ ++++K  KC + +    +LGH+++ +G++ +P KIE V  +  P+T+ +++ FLG   YYR+F++ +A++AAP+  L       GE  G    ++   K +  Q  WT+E Q+ FD LK+KL +AP+L Y DF   FI+ +DAS   + AVL Q E+ G    I YAS+ LK  +     YS+++ E  A+ +A+ ++F+ Y+ G   V+ TD+ PL  L   K   +   +W  ++     +I Y+ GR+N+NADALSR     + EV  +   +  P D+Q  Q                         +DL++S           R++     +++  +SK                    + V++ D SG+++  +++P+ L   +L Q H     GH    +T + L  + YW  + KDV  WC AC  C   K+      APL  + A + P EL AMDF    P+   G +++LV +D F+K+  A PT D+ A TVAK+LVA+   K G P +L +D+G +F S+ I+++  ++ + K  TT YHP  +G  ERFN TL + L       + DW   + Q T+ +N T
Sbjct:  277 KAQALLQKYANVFAAHEGDLGCTTLMTHEIPLLDDAPVRQRHRRIPPSEYEAVKDHINQLLASGVIRESNSPYASPIVLARKKDG--SLRMCVDYRQLNSKTRRDAFPLPRIDESLDALSGARWFTTLDLASGYNQVPVTEGDRAKTAFCIPFGLFEWNRMPFGLCNAPGTFQRLMQRIFGDQQCQSVLLYLDDIVVFSSTIDEHLERLELVLGRLQQEKLRVKLPKCAFFQQEVRYLGHVISDQGVSTDPHKIEAVAGWQPPSTVSELRTFLGFASYYRRFVEGFARLAAPLHRLV------GELDG---TKSRRRKASSLQGHWTTECQQNFDALKQKLTSAPVLAYADFTLPFILEVDASHNGLGAVLSQ-EQGGSVRPIAYASRGLKATERNMQNYSSMKLEFLALKWAMTEKFREYLLGHHCVVFTDNNPLSYLSTAK-LGAMEQRWVAQLAAFDYEIKYRSGRVNRNADALSRHPNHSSAEVGNMAPGSSLPRDLQQVQVQPVLAHCEMEQLMPVFPQRTTAEVQDLQVSDPVLASVLPFWRDQRYPNYSEREGLSKVALTLLRQWKNLAEVDGLVYRRVLMPD-SGQEVFQLLLPEILIPEVLEQVHQHH--GHQGIERTLALLRARCYWPGMSKDVAHWCQACERCQLAKDNSRSHSAPLGHLIA-SRPNELVAMDFTILEPSR-TGVENVLVLTDVFSKYTVAIPTRDQRAATVAKVLVAEWFSKFGVPARLHSDQGRSFESQLIQQLCGLYGIEKSRTTPYHPAGNGQCERFNRTLHNLLRALPVTRKRDWHSCLPQVTFCYNTT 1073          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000040501.1 (pep primary_assembly:ASM223467v1:1:1251188:1255180:-1 gene:ENSORLG00000023024.1 transcript:ENSORLT00000040501.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 433.721 bits (1114), Expect = 1.337e-125
Identity = 265/815 (32.52%), Postives = 430/815 (52.76%), Query Frame = 1
Query: 6025 DKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIP---YKLREEIKRQIRVKRGRRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQL---RYSTIEKECYAIVFAL-KQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALE-VFVLVTEKPIDIQAE---------------QNKDLEIS----------------------KIREEIR--KGTNQKY*ISKKLVVIEDFSGK-QLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            D +     +L+E  + +++ LL +Y+ VFAK + D+G T+++ H I   +  P++Q    IP   Y+L     +Q+   +  R  +  +   I      +    +RMC+DYR+LN  T KD++P P+  E  D L   ++FT +D   GY Q+++ E D  KTAF    GL+E+ R+PFGL NAP+TFQRLM  +  D   Q   +Y++D++I S + E+HL+ +  V  RL    +K+K +KC + +    +LGH+V+ EG++ +P+K  VV+++  P  L  ++ FLG   YYR+FI  +AKIA+P+  L   +   G  KG    + +++        W +E +++F +LK  LITAP+L Y DF+K F++ +DAS   + AVL QE E GK   + +AS++LK  +     YS+++ E  A+ +A+ ++F+ Y+ G    + TD+ PL  L   K  ++   +WA ++    + I Y+PG  N NADALSR H +  L     L TE+ + +QAE               Q +D  +                        +RE IR  K   +   +  + +   D  G+   +V+PQ L++ +L   HD    GH  + +T   +  +YYW N+  DV++WC  C  C   K    K K  +  + A + P E+ A+DF    P T +G +++LV +D F+K+ +  PT D+ A TVA+ LV       G P ++ +D+G NF S  + R+   + + K  TT YHPQ +G  ERFN TL   L     + +  W  ++SQ T+ +N T
Sbjct:  288 DSILPQLEHLDERDKERVKALLSKYNRVFAKDDLDVGCTNLMTHEIPLLDETPVRQPYRRIPPSQYELARSHIQQLLQSQVIRESSSPYASPIVLVQKKDG--GLRMCVDYRQLNARTRKDAFPSPRIDESLDALAGAQWFTTLDLASGYSQVEVAEKDKAKTAFCTPFGLFEFNRMPFGLCNAPSTFQRLMERLFGDCRFQSVLLYLDDVIIFSSSVEQHLQRLEQVFSRLDAQGLKVKLSKCHFFQKQVKYLGHVVSAEGVSTDPEKAAVVRDWRRPANLADLRSFLGFASYYRRFIAGFAKIASPLNSLVARLLPPGR-KGKTPKKPVDE-------FWDAECEESFQELKTALITAPVLAYADFQKPFVLEVDASHGGLGAVLSQEHE-GKRRPVAFASRSLKPTERNMNNYSSMKLELVALKWAVTEKFREYLLGNACTVFTDNNPLSHLATAKLGATE-QRWASELAAFDLTIKYRPGSQNANADALSRQHPVLELSGARPLETERVLGVQAEMSTLPGLSPSDLYTLQRQDPVLGPFIKYWERGKMPDARERHGLSKPVRELIRQWKRLREHESVLYRCIYGSDGHGETNQLVLPQSLQESILHSLHDDH--GHQGTERTLQLIRSRYYWPNMYSDVEKWCRTCERCVLSKALQPKVKTYMGSVKA-SRPHEILAIDFTVLDPAT-DGRENVLVMTDVFSKYTQTVPTKDQRASTVAEALVKHWFQLFGPPARIHSDQGRNFESNLVHRLCKFYQIDKSRTTPYHPQGNGQCERFNRTLHDLLRTLPPEQKRRWPRHLSQVTFAYNTT 1086          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000045600.1 (pep primary_assembly:ASM223467v1:4:26167161:26171153:-1 gene:ENSORLG00000023514.1 transcript:ENSORLT00000045600.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 431.024 bits (1107), Expect = 1.018e-124
Identity = 261/815 (32.02%), Postives = 432/815 (53.01%), Query Frame = 1
Query: 6025 DKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIP---YKLREEIKRQIRVKRGRRNKTKFFTMGITHCTS*EN*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDV--QHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQL---RYSTIEKECYAIVFAL-KQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALE-VFVLVTEKPIDIQAE---------------QNKDLEISKIREEIRKG----TNQKY*ISK--------------------KLVVIEDFSGK-QLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            D +     +L+E  + +++ LL +Y+ VFAK + D+G T+++ H I   +  P++Q    IP   Y+L     +Q+   +  R  +  +   I      +    +RMC+DYR+LN  T KD++P P+  E  D L   ++FT +D   GY Q+++ E D  KTAF    GL+E+ R+PFGL NAP+TFQRLM  +  D   Q   +Y++D++I S + E+HL+ +  V  RL+   +K+K +KC + +    +LGH+V+ EG++ +P+K  VV+++  P  L  ++ FLG   YYR+FI  +AKIA+P+  L   +   G  KG    + +++        W +E +K+F +LK  LITAP+L Y +F+K F++ +DAS   + AVL QE E G+   + +AS++LK  +     YS+++ E  A+ +A+ ++F+ Y+ G    + TD+ PL  L   K  ++   +WA ++    + I Y+PG  N NADALSR H +  L     L TE+ + +QAE               Q +D  +    +   +G      +++ +SK                    + +   D  G+   +V+PQ L++ +L   HD    GH  + +T   +  +YYW N+  DV++WC  C  C   K    K K  +  + A + P E+ A+DF    P T +G +++LV +D F+K+ +  PT D+ A TVA+ LV       G P ++ +D+G NF S  + ++   + + K  TT YHPQ +G  ERFN TL   L     + +  W  ++SQ T+ +N T
Sbjct:  288 DSILPQLEHLDERDKERVKALLSKYNRVFAKDDLDVGCTNLMTHEIPLLDETPVRQPYRRIPPSQYELAHSHIQQLLQSQVIRESSSPYASPIVLVQKKDG--GLRMCVDYRQLNARTRKDAFPLPRIDESLDALAGAQWFTTLDLASGYSQVEVAEKDKAKTAFCTPFGLFEFNRMPFGLCNAPSTFQRLMERLFGDCRFQSVLLYLDDVIIFSSSVEQHLQRLEQVFSRLEAQGLKVKLSKCHFFQKQVKYLGHVVSAEGVSTDPEKAAVVRDWRRPANLADLRSFLGFASYYRRFIAGFAKIASPLNSLVARLLPPGR-KGKTPKKPVDE-------FWDAECEKSFQELKTALITAPVLAYANFQKPFVLEVDASHGGLGAVLSQEHE-GRRRPVAFASRSLKPTERNMNNYSSMKLELVALKWAVTEKFREYLLGNACTVFTDNNPLSHLATAKLGATE-QRWASELAAFDLTIKYRPGSQNANADALSRQHPVLELSGARPLETERVLGVQAEMSTLPGLSPSDLYTLQQQDPVLGPFIKYWERGKMPDARERHGLSKPVRELIRQWKRLREHESVLYRYIYGSDGHGETNQLVLPQSLQESILHSLHDDH--GHQGTERTLQLIRSRYYWPNMYSDVEKWCRTCERCVLSKALQPKVKTYMGSVKA-SRPHEILAIDFTVLDPAT-DGRENVLVMTDVFSKYTQTVPTKDQRASTVAEALVKHWFQLFGPPARIHSDQGWNFESNLVHQLCKFYQIDKSRTTPYHPQGNGQCERFNLTLHDLLRTLPPEQKRRWPRHLSQVTFAYNTT 1086          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000019863.1 (SMESG000019863.1)

HSP 1 Score: 652.129 bits (1681), Expect = 0.000e+0
Identity = 373/978 (38.14%), Postives = 554/978 (56.65%), Query Frame = 1
Query: 5452 IKSTNDTIISMSSHTMQVRGDIELTVHFPSRTVNHVFKVMTECRSKCILWIDILRKLTDGAIKRKVNYQESRXXXXXXXXXXXXXYHCDHIIP---------VKVNVKSEDTLIFEPDKSITKNHNIMLTDECV-IPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGELEQTLEMCETMVLRDDSEQMVELSKTNNVKFYDKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGRRNKTKFFTMGITH--CTS*E--------N*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVF--VLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            I+  N  +I+ S  ++ V G I   + + +  +N    V+    + CI+  D++ +L    I   +   E R                + I+          VKVN ++   +IFEP+    K   + L  E V +  N  IPI I N++  +  I++  RLGKL          M +  +    +E   E   T          I++  LNET++ +LQ LL E+ D+FA  + DLG +D+ +H I   +  PI  RPY +    +   +++++         K    G+    C+  +             R CIDYR+LN VT +D+YP P   E    L    YFT +D   GY QI+++E D EKTAF I   LY++ ++PFGLTNAPA+FQR MN +   ++ C VY++D+LI S TFE+HLKDI NV  RL++ ++K+KP+KC WA+    FLGHIV+ +G  P+P  IE +KN P P T+ QVQ FLGL GYYRKFI++YA IA P+ ELTK        K    +             W  E Q AF+ L++KLI+APIL +PDFKK F++  DASG A  AVLGQ +++ ++                YS  E+E  AI+ A+K FK  ++G E+ I TDH+PL  L +HK+ SSRL++WAM++Q+    I +K G+ N NAD +SR        VF  ++  +K  D++   N     SK++E + K    +Y + K  +++ D   K+L+ +P+  R+ +L+QYHDG LGGH+S++KT +RL QKYYW N+ +DVK W   C ICATRK+TG+K KAPL P+P    PM + AMD +  LP T +GN +ILV +DY +K+ EAF   D+ A+T+A++LV +I  + GTP+++LTD+G NFMSE I+ V N F + K+ T+ YHPQT+G  ERFNG L+  ++ +V ++Q DWD Y++ C   + ++
Sbjct: 2482 IEEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIII-DLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKL--------TPMMDIELPTAKTEGTEEELWT----------IDSQLLNETEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSVAEKEVQ---------KMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTYPLPDINEMLQMLDGAAYFTSLDLKSGYWQIKVKEEDREKTAFTIGKDLYQFKKMPFGLTNAPASFQRCMNFVTHGIKQCMVYIDDLLIYSSTFEQHLKDIRNVFVRLRQWKLKLKPSKCDWAKEKVTFLGHIVSAKGKEPDPRNIEKIKNCPAPKTVTQVQEFLGLCGYYRKFIKNYATIAKPIQELTK--------KDTPFI-------------WEEEQQIAFETLRDKLISAPILVHPDFKKPFLLATDASGYASGAVLGQWDDEKRER--------------NYSVTEREALAIIQAIKHFKYLLWGHEIYITTDHQPLVWLGQHKEASSRLMRWAMQLQEYSPYIKFKSGKANANADCMSRF-------VFEELMDHDKLNDMKIFDNN----SKLKEHMEKH-RHRYTLEKGWLLLMD-GEKRLMCLPESRRKEVLMQYHDGKLGGHMSAKKTEARLRQKYYWPNIGEDVKGWIKNCLICATRKSTGSKLKAPLKPMPIPPEPMTMIAMDVVGPLPETNDGNIYILVVTDYLSKFPEAFAIPDQKARTIARILVEEICCRYGTPKQILTDQGTNFMSEIIEEVTNYFRIAKLRTSPYHPQTNGQTERFNGILIEMMSNYVSRHQKDWDRYINLCLMAYRMS 3383          

HSP 2 Score: 263.462 bits (672), Expect = 1.417e-69
Identity = 218/778 (28.02%), Postives = 377/778 (48.46%), Query Frame = 3
Query: 2847 KLMEKEVRELSAEDVNEEVVLRILFTTGINRD--KVITAMAYLIARYKKLLRS-NPKEQINEVAKIIYQRLKADVMPWIKLKGGWKIMYVETLKTSESETMENDDTEEIVKILVNKTTKKPNKSEGKXXXXXXXXXXXXXXXXXRGSNAIIAYDCQNPKLGQKYSLLDVENCPEVNPTKLIVTEPKI-FHVYQESDFIHTEAKECIIKFSEESFVCNQVARTMLVPTWAPEALI-ITPRECEEAFXXXXXXXXXXXXXXAQAGARVKQSVNVVEWTSAGGYCMGGKYKIWGQIADNIVVRRQYYVELRXXXXXXXXXXXXMMTHKFCMLDESSCDTGESMIVYKIDKYECQLTKLKSLKFRTIRGKQFTGAESKRIK---DQKRKT---------VIVQEKDTPTTYMADSEAEAMRFVEKGEAIKCGKAVVKTNYEGIYISSNEIKDAKLKIDKFDVKLSSYFNNKIDYFYHHQLVQLDKVYQATITNDCKLNREILRTKMAVAVTNPDLMAPILFAEKGTFARVVGEVLQTFQCKPVSVSLATN-----NQCTNELPVISKGETVYLQPITRILTDKTYIPRKIDKCTNLLDPLYQLNDEMWITMSDRKEAXXXXXXXXXXXXXXXXXQEINDMNNNGMYTRDAIESARKHMLFPNEKEKILSIMVSKVMEGSHGGDYNFDVLLSKEHFKKVVYKVLYSIWGYFAVLGNMFSTILGIYYTVALLKMICSSLVS----LRQLRQVFGNSYKMLACLCPFVAKYLITAKHDKEIRLIK 5102
            K+M+ +VR  + +   EE++L I   T  + D  + +  MAYL+     + R   P+   N     I   ++  + P+I  +GGW+       K  ++E M+    E + +         P  +  +  ++ II  I + +  ++G    + YDC N K+G KYSL + E C   NP KL  T   + ++VYQE DFI TE KEC +     +F C   + + ++       ++ +T  ECEE F T +I     + + A+ G      V              GK   W +   N+  RR + +E         S+S+   +++     + SC+TG S IVY  D   C+LT LKS  F  ++G+  +  E+   K   D+ R+T           +  + TPT  ++      MRF++     KC + V  TNY+GI++S   I +AK +ID            KI                           EIL+TK+A+ +TNPD   P+L  ++G F R+VGEV+ T++C+     L  N     ++C NEL ++ KG+  + QP+TR++  + ++P     C+N+  PL+++ D  W+    R     P    LTEL  + EF+ + D++ +G+Y    +E AR+H+LFP +++ ILS +V+     ++G   N+++LLS +HF+     ++ ++WG F + G M + ILGI   + ++++I + +++     ++ R++   ++KM     PF+AK ++   H K+I  IK
Sbjct: 1571 KIMDNKVRTTTKDP--EELLLIINHLTKRSEDPNRYLVGMAYLMYNTIVIYRKRTPRYNPNGYEFAITDIIRKIIDPFILTQGGWQAFINLVSKDDQTEIMQTKLKELLTEFETTINASDPKTA--RKGMNAIITTILIWMMCVKGVEPFVVYDCDNIKIGDKYSLKETEECKAANPGKLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSVTRNECEEGFTTGRISIAGRVNVAAEEGKIKTTRVYA-----------AGKDNGWRR---NVSRRRVHLIE---------STSQGSSSNRKLPTTDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHSETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIDP-----------KI---------------------------EILKTKLAMGITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFS-CSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPV-DISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANMMKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKI---NWKMAIGFLPFLAKTMVLHGHSKDIHKIK 2278          

HSP 3 Score: 56.6102 bits (135), Expect = 5.185e-7
Identity = 51/177 (28.81%), Postives = 76/177 (42.94%), Query Frame = 1
Query:  661 EPYRGEPEKFDDFFTEITQYADACGWTPDELKRRIPLYLKGYALQAYNNLECRGNNLDQLIENLRAEIVIDSDLQKIN--VQKFRNRYQGSRESVGEYVYTITELAKKAFGAGEHTPKLIEDQFWNGIQKYLYKA-LIMVEYRDLNELVHKAKRVEAIQRDRFNASIDAVEIQRPTM 1182
            EP+ G   K D F  E   Y + C WT  ++K R+PLYLKG A   +         L    E     + I    +K N  +  F +R Q   ES   Y   + +L K+AFG  +     + D F  GI++   +A L  +    L++ V  A R EA     +  ++ A E    T+
Sbjct:   36 EPFDGSLNKLDTFIKEFEVYKNICHWTDQKVKERLPLYLKGSAQDVFVEEARDVTKLTTWTEIKEFLVKIFGIEKKGNRKILDFLHRKQRRDESNAVYACELKKLCKEAFGESDLPEDKMVDVFIRGIKREDIRANLGCLAPETLDKAVAIANRCEAHLGLGYQVAVLATEPTTSTV 212          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000019863.1 (SMESG000019863.1)

HSP 1 Score: 651.744 bits (1680), Expect = 0.000e+0
Identity = 373/978 (38.14%), Postives = 554/978 (56.65%), Query Frame = 1
Query: 5452 IKSTNDTIISMSSHTMQVRGDIELTVHFPSRTVNHVFKVMTECRSKCILWIDILRKLTDGAIKRKVNYQESRXXXXXXXXXXXXXYHCDHIIP---------VKVNVKSEDTLIFEPDKSITKNHNIMLTDECV-IPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGELEQTLEMCETMVLRDDSEQMVELSKTNNVKFYDKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGRRNKTKFFTMGITH--CTS*E--------N*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVF--VLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            I+  N  +I+ S  ++ V G I   + + +  +N    V+    + CI+  D++ +L    I   +   E R                + I+          VKVN ++   +IFEP+    K   + L  E V +  N  IPI I N++  +  I++  RLGKL          M +  +    +E   E   T          I++  LNET++ +LQ LL E+ D+FA  + DLG +D+ +H I   +  PI  RPY +    +   +++++         K    G+    C+  +             R CIDYR+LN VT +D+YP P   E    L    YFT +D   GY QI+++E D EKTAF I   LY++ ++PFGLTNAPA+FQR MN +   ++ C VY++D+LI S TFE+HLKDI NV  RL++ ++K+KP+KC WA+    FLGHIV+ +G  P+P  IE +KN P P T+ QVQ FLGL GYYRKFI++YA IA P+ ELTK        K    +             W  E Q AF+ L++KLI+APIL +PDFKK F++  DASG A  AVLGQ +++ ++                YS  E+E  AI+ A+K FK  ++G E+ I TDH+PL  L +HK+ SSRL++WAM++Q+    I +K G+ N NAD +SR        VF  ++  +K  D++   N     SK++E + K    +Y + K  +++ D   K+L+ +P+  R+ +L+QYHDG LGGH+S++KT +RL QKYYW N+ +DVK W   C ICATRK+TG+K KAPL P+P    PM + AMD +  LP T +GN +ILV +DY +K+ EAF   D+ A+T+A++LV +I  + GTP+++LTD+G NFMSE I+ V N F + K+ T+ YHPQT+G  ERFNG L+  ++ +V ++Q DWD Y++ C   + ++
Sbjct: 2482 IEEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIII-DLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKL--------TPMMDIELPTAKTEGTEEELWT----------IDSQLLNETEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSVAEKEVQ---------KMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTYPLPDINEMLQMLDGAAYFTSLDLKSGYWQIKVKEEDREKTAFTIGKDLYQFKKMPFGLTNAPASFQRCMNFVTHGIKQCMVYIDDLLIYSSTFEQHLKDIRNVFVRLRQWKLKLKPSKCDWAKEKVTFLGHIVSAKGKEPDPRNIEKIKNCPAPKTVTQVQEFLGLCGYYRKFIKNYATIAKPIQELTK--------KDTPFI-------------WEEEQQIAFETLRDKLISAPILVHPDFKKPFLLATDASGYASGAVLGQWDDEKRER--------------NYSVTEREALAIIQAIKHFKYLLWGHEIYITTDHQPLVWLGQHKEASSRLMRWAMQLQEYSPYIKFKSGKANANADCMSRF-------VFEELMDHDKLNDMKIFDNN----SKLKEHMEKH-RHRYTLEKGWLLLMD-GEKRLMCLPESRRKEVLMQYHDGKLGGHMSAKKTEARLRQKYYWPNIGEDVKGWIKNCLICATRKSTGSKLKAPLKPMPIPPEPMTMIAMDVVGPLPETNDGNIYILVVTDYLSKFPEAFAIPDQKARTIARILVEEICCRYGTPKQILTDQGTNFMSEIIEEVTNYFRIAKLRTSPYHPQTNGQTERFNGILIEMMSNYVSRHQKDWDRYINLCLMAYRMS 3383          

HSP 2 Score: 263.848 bits (673), Expect = 1.223e-69
Identity = 218/778 (28.02%), Postives = 377/778 (48.46%), Query Frame = 3
Query: 2847 KLMEKEVRELSAEDVNEEVVLRILFTTGINRD--KVITAMAYLIARYKKLLRS-NPKEQINEVAKIIYQRLKADVMPWIKLKGGWKIMYVETLKTSESETMENDDTEEIVKILVNKTTKKPNKSEGKXXXXXXXXXXXXXXXXXRGSNAIIAYDCQNPKLGQKYSLLDVENCPEVNPTKLIVTEPKI-FHVYQESDFIHTEAKECIIKFSEESFVCNQVARTMLVPTWAPEALI-ITPRECEEAFXXXXXXXXXXXXXXAQAGARVKQSVNVVEWTSAGGYCMGGKYKIWGQIADNIVVRRQYYVELRXXXXXXXXXXXXMMTHKFCMLDESSCDTGESMIVYKIDKYECQLTKLKSLKFRTIRGKQFTGAES---KRIKDQKRKT---------VIVQEKDTPTTYMADSEAEAMRFVEKGEAIKCGKAVVKTNYEGIYISSNEIKDAKLKIDKFDVKLSSYFNNKIDYFYHHQLVQLDKVYQATITNDCKLNREILRTKMAVAVTNPDLMAPILFAEKGTFARVVGEVLQTFQCKPVSVSLATN-----NQCTNELPVISKGETVYLQPITRILTDKTYIPRKIDKCTNLLDPLYQLNDEMWITMSDRKEAXXXXXXXXXXXXXXXXXQEINDMNNNGMYTRDAIESARKHMLFPNEKEKILSIMVSKVMEGSHGGDYNFDVLLSKEHFKKVVYKVLYSIWGYFAVLGNMFSTILGIYYTVALLKMICSSLVS----LRQLRQVFGNSYKMLACLCPFVAKYLITAKHDKEIRLIK 5102
            K+M+ +VR  + +   EE++L I   T  + D  + +  MAYL+     + R   P+   N     I   ++  + P+I  +GGW+       K  ++E M+    E + +         P  +  +  ++ II  I + +  ++G    + YDC N K+G KYSL + E C   NP KL  T   + ++VYQE DFI TE KEC +     +F C   + + ++       ++ +T  ECEE F T +I     + + A+ G      V              GK   W +   N+  RR + +E         S+S+   +++     + SC+TG S IVY  D   C+LT LKS  F  ++G+  +  E+   K   D+ R+T           +  + TPT  ++      MRF++     KC + V  TNY+GI++S   I +AK +ID            KI                           EIL+TK+A+ +TNPD   P+L  ++G F R+VGEV+ T++C+     L  N     ++C NEL ++ KG+  + QP+TR++  + ++P     C+N+  PL+++ D  W+    R     P    LTEL  + EF+ + D++ +G+Y    +E AR+H+LFP +++ ILS +V+     ++G   N+++LLS +HF+     ++ ++WG F + G M + ILGI   + ++++I + +++     ++ R++   ++KM     PF+AK ++   H K+I  IK
Sbjct: 1571 KIMDNKVRTTTKDP--EELLLIINHLTKRSEDPNRYLVGMAYLMYNTIVIYRKRTPRYNPNGYEFAITDIIRKIIDPFILTQGGWQAFINLVSKDDQTEIMQTKLKELLTEFETTINASDPKTA--RKGMNAIITTILIWMMCVKGVEPFVVYDCDNIKIGDKYSLKETEECKAANPGKLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSVTRNECEEGFTTGRISIAGRVNVAAEEGKIKTTRVYA-----------AGKDNGWRR---NVSRRRVHLIE---------STSQGSSSNRKLPTTDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHSETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIDP-----------KI---------------------------EILKTKLAMGITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFS-CSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPV-DISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANMMKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKI---NWKMAIGFLPFLAKTMVLHGHSKDIHKIK 2278          

HSP 3 Score: 56.6102 bits (135), Expect = 5.139e-7
Identity = 51/177 (28.81%), Postives = 76/177 (42.94%), Query Frame = 1
Query:  661 EPYRGEPEKFDDFFTEITQYADACGWTPDELKRRIPLYLKGYALQAYNNLECRGNNLDQLIENLRAEIVIDSDLQKIN--VQKFRNRYQGSRESVGEYVYTITELAKKAFGAGEHTPKLIEDQFWNGIQKYLYKA-LIMVEYRDLNELVHKAKRVEAIQRDRFNASIDAVEIQRPTM 1182
            EP+ G   K D F  E   Y + C WT  ++K R+PLYLKG A   +         L    E     + I    +K N  +  F +R Q   ES   Y   + +L K+AFG  +     + D F  GI++   +A L  +    L++ V  A R EA     +  ++ A E    T+
Sbjct:   36 EPFDGSLNKLDTFIKEFEVYKNICHWTDQKVKERLPLYLKGSAQDVFVEEARDVTKLTTWTEIKEFLVKIFGIEKKGNRKILDFLHRKQRRDESNAVYACELKKLCKEAFGESDLPEDKMVDVFIRGIKREDIRANLGCLAPETLDKAVAIANRCEAHLGLGYQVAVLATEPTTSTV 212          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000019863.1 (SMESG000019863.1)

HSP 1 Score: 651.358 bits (1679), Expect = 0.000e+0
Identity = 373/978 (38.14%), Postives = 554/978 (56.65%), Query Frame = 1
Query: 5452 IKSTNDTIISMSSHTMQVRGDIELTVHFPSRTVNHVFKVMTECRSKCILWIDILRKLTDGAIKRKVNYQESRXXXXXXXXXXXXXYHCDHIIP---------VKVNVKSEDTLIFEPDKSITKNHNIMLTDECV-IPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGELEQTLEMCETMVLRDDSEQMVELSKTNNVKFYDKVKINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGRRNKTKFFTMGITH--CTS*E--------N*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVF--VLVTEKPIDIQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            I+  N  +I+ S  ++ V G I   + + +  +N    V+    + CI+  D++ +L    I   +   E R                + I+          VKVN ++   +IFEP+    K   + L  E V +  N  IPI I N++  +  I++  RLGKL          M +  +    +E   E   T          I++  LNET++ +LQ LL E+ D+FA  + DLG +D+ +H I   +  PI  RPY +    +   +++++         K    G+    C+  +             R CIDYR+LN VT +D+YP P   E    L    YFT +D   GY QI+++E D EKTAF I   LY++ ++PFGLTNAPA+FQR MN +   ++ C VY++D+LI S TFE+HLKDI NV  RL++ ++K+KP+KC WA+    FLGHIV+ +G  P+P  IE +KN P P T+ QVQ FLGL GYYRKFI++YA IA P+ ELTK        K    +             W  E Q AF+ L++KLI+APIL +PDFKK F++  DASG A  AVLGQ +++ ++                YS  E+E  AI+ A+K FK  ++G E+ I TDH+PL  L +HK+ SSRL++WAM++Q+    I +K G+ N NAD +SR        VF  ++  +K  D++   N     SK++E + K    +Y + K  +++ D   K+L+ +P+  R+ +L+QYHDG LGGH+S++KT +RL QKYYW N+ +DVK W   C ICATRK+TG+K KAPL P+P    PM + AMD +  LP T +GN +ILV +DY +K+ EAF   D+ A+T+A++LV +I  + GTP+++LTD+G NFMSE I+ V N F + K+ T+ YHPQT+G  ERFNG L+  ++ +V ++Q DWD Y++ C   + ++
Sbjct: 2374 IEEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIII-DLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKL--------TPMMDIELPTAKTEGTEEELWT----------IDSQLLNETEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSVAEKEVQ---------KMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTYPLPDINEMLQMLDGAAYFTSLDLKSGYWQIKVKEEDREKTAFTIGKDLYQFKKMPFGLTNAPASFQRCMNFVTHGIKQCMVYIDDLLIYSSTFEQHLKDIRNVFVRLRQWKLKLKPSKCDWAKEKVTFLGHIVSAKGKEPDPRNIEKIKNCPAPKTVTQVQEFLGLCGYYRKFIKNYATIAKPIQELTK--------KDTPFI-------------WEEEQQIAFETLRDKLISAPILVHPDFKKPFLLATDASGYASGAVLGQWDDEKRER--------------NYSVTEREALAIIQAIKHFKYLLWGHEIYITTDHQPLVWLGQHKEASSRLMRWAMQLQEYSPYIKFKSGKANANADCMSRF-------VFEELMDHDKLNDMKIFDNN----SKLKEHMEKH-RHRYTLEKGWLLLMD-GEKRLMCLPESRRKEVLMQYHDGKLGGHMSAKKTEARLRQKYYWPNIGEDVKGWIKNCLICATRKSTGSKLKAPLKPMPIPPEPMTMIAMDVVGPLPETNDGNIYILVVTDYLSKFPEAFAIPDQKARTIARILVEEICCRYGTPKQILTDQGTNFMSEIIEEVTNYFRIAKLRTSPYHPQTNGQTERFNGILIEMMSNYVSRHQKDWDRYINLCLMAYRMS 3275          

HSP 2 Score: 178.333 bits (451), Expect = 7.622e-44
Identity = 100/304 (32.89%), Postives = 181/304 (59.54%), Query Frame = 3
Query: 4218 QLVQLDKVYQATITNDCKLNREILRTKMAVAVTNPDLMAPILFAEKGTFARVVGEVLQTFQCKPVSVSLATN-----NQCTNELPVISKGETVYLQPITRILTDKTYIPRKIDKCTNLLDPLYQLNDEMWITMSDRKEAXXXXXXXXXXXXXXXXXQEINDMNNNGMYTRDAIESARKHMLFPNEKEKILSIMVSKVMEGSHGGDYNFDVLLSKEHFKKVVYKVLYSIWGYFAVLGNMFSTILGIYYTVALLKMICSSLVS----LRQLRQVFGNSYKMLACLCPFVAKYLITAKHDKEIRLIK 5102
            +L   +K+Y   + NDC LNREIL+TK+A+ +TNPD   P+L  ++G F R+VGEV+ T++C+     L  N     ++C NEL ++ KG+  + QP+TR++  + ++P     C+N+  PL+++ D  W+    R     P    LTEL  + EF+ + D++ +G+Y    +E AR+H+LFP +++ ILS +V+     ++G   N+++LLS +HF+     ++ ++WG F + G M + ILGI   + ++++I + +++     ++ R++   ++KM     PF+AK ++   H K+I  IK
Sbjct: 1872 RLASTEKIYYDLVKNDCILNREILKTKLAMGITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFS-CSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPV-DISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANMMKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKI---NWKMAIGFLPFLAKTMVLHGHSKDIHKIK 2170          

HSP 3 Score: 81.2629 bits (199), Expect = 1.858e-14
Identity = 69/245 (28.16%), Postives = 116/245 (47.35%), Query Frame = 3
Query: 2847 KLMEKEVRELSAEDVNEEVVLRILFTTGINRD--KVITAMAYLIARYKKLLRS-NPKEQINEVAKIIYQRLKADVMPWIKLKGGWKIMYVETLKTSESETMENDDTEEIVKILVNKTTKKPNKSEGKXXXXXXXXXXXXXXXXXRGSNAIIAYDCQNPKLGQKYSLLDVENCPEVNPTKLIVTEPKI-FHVYQESDFIHTEAKECIIKFSEESFVCNQVARTMLVPTWAPEALI-ITPRECEEAF 3566
            K+M+ +VR  + +   EE++L I   T  + D  + +  MAYL+     + R   P+   N     I   ++  + P+I  +GGW+       K  ++E M+    E + +         P  +  +  ++ II  I + +  ++G    + YDC N K+G KYSL + E C   NP KL  T   + ++VYQE DFI TE KEC +     +F C   + + ++       ++ +T  ECEE F
Sbjct: 1571 KIMDNKVRTTTKDP--EELLLIINHLTKRSEDPNRYLVGMAYLMYNTIVIYRKRTPRYNPNGYEFAITDIIRKIIDPFILTQGGWQAFINLVSKDDQTEIMQTKLKELLTEFETTINASDPKTA--RKGMNAIITTILIWMMCVKGVEPFVVYDCDNIKIGDKYSLKETEECKAANPGKLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSVTRNECEEGF 1811          

HSP 4 Score: 56.6102 bits (135), Expect = 5.221e-7
Identity = 51/177 (28.81%), Postives = 76/177 (42.94%), Query Frame = 1
Query:  661 EPYRGEPEKFDDFFTEITQYADACGWTPDELKRRIPLYLKGYALQAYNNLECRGNNLDQLIENLRAEIVIDSDLQKIN--VQKFRNRYQGSRESVGEYVYTITELAKKAFGAGEHTPKLIEDQFWNGIQKYLYKA-LIMVEYRDLNELVHKAKRVEAIQRDRFNASIDAVEIQRPTM 1182
            EP+ G   K D F  E   Y + C WT  ++K R+PLYLKG A   +         L    E     + I    +K N  +  F +R Q   ES   Y   + +L K+AFG  +     + D F  GI++   +A L  +    L++ V  A R EA     +  ++ A E    T+
Sbjct:   36 EPFDGSLNKLDTFIKEFEVYKNICHWTDQKVKERLPLYLKGSAQDVFVEEARDVTKLTTWTEIKEFLVKIFGIEKKGNRKILDFLHRKQRRDESNAVYACELKKLCKEAFGESDLPEDKMVDVFIRGIKREDIRANLGCLAPETLDKAVAIANRCEAHLGLGYQVAVLATEPTTSTV 212          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000012882.1 (SMESG000012882.1)

HSP 1 Score: 628.632 bits (1620), Expect = 1.309e-180
Identity = 371/988 (37.55%), Postives = 546/988 (55.26%), Query Frame = 1
Query: 5452 IKSTNDTIISMSSHTMQVRGDIELTVHFPSRTVNHVFKVMTECRSKCILWIDILRKLTDGAIKRKVNYQESRXXXXXXXXXXXXXYHCDHIIP---------VKVNVKSEDTLIFEPDKSITKNHNIMLTDECV-IPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGELEQTLEMCETMVLRDDSEQMVELSKTNNVKFYDKV-KINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGRRNKTKFFTMGITH--CTS*E--------N*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRI--HELGALEV--FVLVTEKPID---IQAEQNKDLEISKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKL----RQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            I+  N  +I+ S  ++ V G I   + + +  +N    V+    + CI+  D++ +L    I   +   E R                + I+          VKVN ++   +IFEP+    K   + L  E V +  N  IPI I N++  +  I++  RLGKL                + D     +EL  T      +++  I++  LNE ++ +LQ LL E+ D+FA  + DLG +D+ +H I   +  PI  RPY +    + E +++++         K    G+    C+  +             R CIDYR+LN VT +D+YP P   E    L    YFT +D   GY QI+++E D EKTAF I   LY++ ++PFGLTNAPA+FQR MN +   ++ C VY++D+LI S TFE+HLKDI N                         FLGHIV+ +G  P+P  IE +KN P P T+ QVQ FLGL GYYRKFI++YA IA P+ ELTK        K    +             W  E Q AF+ L++KLI+APIL +PDFKK F++  DASG A  AVLGQ +++ ++ VI Y S+T K  +  YS  E+E  AI+ A+K FK  ++G E+ I TDH+PL  L +HK+ SSRL++WAM++Q+    I +K G+ N NAD +SR    EL   +V    ++  + ID   ++ EQ +D E+  I E +     + +  + KL  I   +    I +P K     R+ +L+QYHDG LGGH+S++KT +RL QKYYW N+ +DVK W   C ICATRK+TG+K KAPL P+P    PM + AMD +  LP T +GN +ILV +DY +K+ EAF   D+ A+T+A++LV +I  + GTP+++LTD+G NFMSE I+ V N F + K+ T+ YHPQT+G  ERFNG L+  ++ +V ++Q DWD Y++ C   + ++
Sbjct:  474 IEEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIII-DLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKL--------------TPMMD-----IELPTTKTEGTEEELWTIDSQLLNEVEKEQLQKLLTEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQ---------KMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTYPLPDINEMLQTLHGAAYFTSLDLKSGYWQIKVKEEDREKTAFTIGKDLYQFKKMPFGLTNAPASFQRCMNFVTHGIKQCMVYIDDLLIYSSTFEQHLKDIRN---------------------EKVTFLGHIVSAKGKEPDPRNIEKIKNCPAPKTVTQVQEFLGLCGYYRKFIKNYATIAKPIQELTK--------KDTPFI-------------WEKEQQTAFETLRDKLISAPILVHPDFKKPFLLATDASGYASGAVLGQWDDEKRERVIGYYSRTFKKHEKNYSVTEREALAIIQAIKHFKYLLWGHEIYITTDHQPLVWLGQHKEASSRLMRWAMQLQEYSPYIKFKSGKANANADCMSRFVFEELMDHDVRRICMIIAEDIDFTKLRNEQKEDEELKTIIEFMETNDMKIFDNNSKLKSIWRNTD---IDIPWKRDESRRKEVLMQYHDGKLGGHMSAKKTEARLRQKYYWPNIGEDVKGWIKNCLICATRKSTGSKLKAPLKPMPIPPEPMTMIAMDVVGPLPETNDGNIYILVVTDYLSKFPEAFAIPDQKARTIARILVEEICCRYGTPKQILTDQGTNFMSEIIEEVTNYFRIAKLRTSPYHPQTNGQTERFNGILIEMMSNYVSRHQKDWDRYINLCLMAYRMS 1387          

HSP 2 Score: 142.895 bits (359), Expect = 3.944e-33
Identity = 86/275 (31.27%), Postives = 161/275 (58.55%), Query Frame = 3
Query: 4305 VAVTNPDLMAPILFAEKGTFARVVGEVLQTFQCKPVSVSLATN-----NQCTNELPVISKGETVYLQPITRILTDKTYIPRKIDKCTNLLDPLYQLNDEMWITMSDRKEAXXXXXXXXXXXXXXXXXQEINDMNNNGMYTRDAIESARKHMLFPNEKEKILSIMVSKVMEGSHGGDYNFDVLLSKEHFKKVVYKVLYSIWGYFAVLGNMFSTILGIYYTVALLKMICSSLVS----LRQLRQVFGNSYKMLACLCPFVAKYLITAKHDKEIRLIK 5102
            +A+TNPD   P+L  ++G F R+VGEV+ T++C+     L  N     ++C NEL ++ KG+  + QP+TR++  + ++P     C+N+  PL+++ D  W+    R     P    LTEL  + EF+ + D++ +G+Y    +E AR+H+LFP +++ ILS +V+     ++G   N+++LLS +HF+     ++ ++WG F + G M + ILGI   + ++++I + +++     ++ R++   ++KM     PF+AK ++   H K+I  IK
Sbjct:    1 MAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFS-CSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPV-DISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANMMKAMWGRFLIFGQMMAGILGIILIIQIIRVILTQMLACFDIYKRERKI---NWKMAIGFLPFLAKTMVLHGHSKDIHKIK 270          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000069191.1 (SMESG000069191.1)

HSP 1 Score: 625.165 bits (1611), Expect = 3.062e-179
Identity = 370/1002 (36.93%), Postives = 550/1002 (54.89%), Query Frame = 1
Query: 5452 IKSTNDTIISMSSHTMQVRGDIELTVHFPSRTVNHVFKVMTECRSKCILWIDILRKLTDGAIKRKVNYQESRXXXXXXXXXXXXXYHCDHIIP---------VKVNVKSEDTLIFEPDKSITKNHNIMLTDECV-IPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGELEQTLEMCETMVLRDDSEQMVELSKTNNVKFYDKV-KINNNYLNETQQFKLQTLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEIKRQIRVKRGRRNKTKFFTMGITH--CTS*E--------N*NEIRMCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQMEEGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAVYMEDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIVTVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAPMIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLITAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKGAQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTSSRLLKWAMKIQDMHIKIIYKPGRINKNADALSRI--HELGALEV--FVLVTEKPID---IQAEQNKDLEI------------------SKIREEIRKGTNQKY*ISKKLVVIEDFSGKQLIVVPQKLRQILLLQYHDGALGGHLSSRKTASRLLQKYYWDNLKKDVKRWCNACTICATRKNTGTKTKAPLHPIPATTSPMELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEAFPT*DETAKTVAKLLVAKIIFKIGTPRKLLTDKGANFMSETIKRVANMFNMHKINTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNIT 8319
            IK  N  +I+ S  ++ V G I   + + +  +N    V+    + CI+  D++ +L    I   +   E R                + I+          VKVN ++   +IFEP+    K   + L  E V +  N  IPI I N++  +  I++  RLGKL                + D     +EL  T      +++  I++  LNET++ +L     E+ D+FA  + DLG +D+ +H I   +  PI  RPY +    +   +++++         K    G+    C+  +             R CIDYR+LN VT +D+YP P   E    L    YFT +D   GY QI+++E D EKTAF I   LY++ ++PF LTNAPA+FQR MN +  D++ C V++ D+LI S TFE+HLKDI NV  RL++ ++K+K +KC W +    FL HIV+ +G  P+P  IE +KN+PVP T+ QV+ FLGL GYYRKFI+ YA IA P+ ELTK        K    +             W  E Q AF+ L++KLI+APIL +PDFKK F++  DASG   E                   +T K  +  YS  E+E  AI+ A+K FK  ++G E+ I TDH+PL  L +HK+ SSRL++WAM++Q+    I +K  + N NAD LSR    EL   +V    ++  + ID   ++ EQ +D E+                  SK+RE + K    +Y I K  +++ D   K+L+ +P+  R+ +L+QYHDG LGGH+S++KT +RL QKYYW N+ +DVK W   C ICATRK+TG+K KAPL P+P    PM + AMD +  LP T +GN +ILV +DY +K+ EAF   D+ A+T+A++LV +I  + GTP+++LTD+G NFMSE I+ V N F + K+ T+ YHPQT+G  ERFNG L+  ++ +V ++Q DWD Y++ C   + ++
Sbjct: 2871 IKEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIII-DLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKL--------------TPMMD-----IELPATKTEGTEEELWTIDSQLLNETEKEQLM----EFKDIFAASDLDLGTSDVTQHTISLTDDTPITLRPYRLAEAQKSVAEKEVQ---------KMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTYPLPDINEMLQTLHGAAYFTSLDLKSGYWQIKVKEEDREKTAFTIGKDLYQFKKMPFRLTNAPASFQRCMNFVTHDIKQCMVFI-DLLIYSSTFEQHLKDIRNVFVRLRQWKLKLKRSKCDWTKEKVTFLEHIVSAKGKEPDPRNIEKIKNYPVPKTVTQVKEFLGLCGYYRKFIKSYATIAKPIQELTK--------KDTPFI-------------WEKEQQTAFETLRDKLISAPILVHPDFKKPFLLATDASGYDRE------------------HRTFKKHEKNYSVTEREALAIIQAIKHFKYLLWGHEIYIITDHQPLVWLGQHKEASSRLMRWAMQLQEYSPYIKFKSRKANANADCLSRFVFEELMDQDVRRICMIIAEDIDFTKLRNEQKEDEELKTIIEFMETNDMKIFDNNSKLREHMEKH-RHRYTIEKGWLLLMD-GEKRLMCLPESRRKEVLMQYHDGKLGGHMSAKKTEARLRQKYYWPNIGEDVKGWIKNCLICATRKSTGSKLKAPLKPMPIPPEPMTMIAMDVVGPLPETNDGNIYILVVTDYLSKFPEAFAIPDQKARTIARILVDEICCRYGTPKQILTDQGTNFMSEIIEEVTNYFRIAKLRTSPYHPQTNGQTERFNGILIEMMSNYVSRHQKDWDRYINLCLMAYRMS 3797          

HSP 2 Score: 298.13 bits (762), Expect = 4.911e-80
Identity = 229/782 (29.28%), Postives = 390/782 (49.87%), Query Frame = 3
Query: 2847 KLMEKEVRELSAEDVNEEVVLRILFTTGINRD--KVITAMAYL----IARYKKLLRSNPKEQINEVAKIIYQRLKADVMPWIKLKGGWKIMYVETLKTSESETMENDDTEEIVKILVNKTTKKPNKSEGKXXXXXXXXXXXXXXXXXRGSNAIIAYDCQNPKLGQKYSLLDVENCPEVNPTKLIVTEPKI-FHVYQESDFIHTEAKECIIKFSEESFVCNQVARTMLVPTWAPEALI-ITPRECEEAFXXXXXXXXXXXXXXAQAGARVKQSVNVVEWTSAG-GYCMGGKYKIWGQIADNIVVRRQYYVELRXXXXXXXXXXXXMMTHKFCMLDESSCDTGESMIVYKIDKYECQLTKLKSLKFRTIRGKQFTGAESK---RIKDQKRKT---------VIVQEKDTPTTYMADSEAEAMRFVEKGEAIKCGKAVVKTNYEGIYISSNEIKDAKLKIDKFDVKLSSYFNNKIDYFYHHQLVQLDKVYQATITNDCKLNREILRTKMAVAVTNPDLMAPILFAEKGTFARVVGEVLQTFQCKPVSVSLATN-----NQCTNELPVISKGETVYLQPITRILTDKTYIPRKIDKCTNLLDPLYQLNDEMWITMSDRKEAXXXXXXXXXXXXXXXXXQEINDMNNNGMYTRDAIESARKHMLFPNEKEKILSIMVSKVMEGSHGGDYNFDVLLSKEHFKKVVYKVLYSIWGYFAVLGNMFSTILGIYYTVALLKMICSSLVS----LRQLRQVFGNSYKMLACLCPFVAKYLITAKHDKEIRLIK 5102
            K+M+ +VR  + +   EE++L I   T  + D  + +  MAYL    I  Y+K     P+   N     I   ++  + P+I  +GGW+       K +++E M+    E + +         P  +  +  ++ II  I + +  I+G    + YDC N K+G KYSL + E C   NP +L  T   + ++VYQE DFI TE KEC +     +F C   + + ++       ++ IT  ECEE F T +I     +++ A+ G      V      +AG G C GG+Y +  Q+   +VV   Y V+L K++  FD  ++ M  +  C+  + SC+TG S IVY  D   C+LT LKS  F  ++G   +  E+       D+ R+T           +  + TPT  ++      MRF++     KC +                                      +D+ YH  L   +K+Y   + NDC LNREIL+TK+A+A+TNPD   P+L  ++G F R+VGEV+ T++C+     L  N     ++C NEL ++ KG+  + QP+TR++  + ++P     C+N+  PL+++ D  W+    R     P    LTEL  + EF+ + D++ +G+Y    +E AR+H+LFP +++ ILS +V+     ++G   N+++LLS +HF+ V   ++ ++WG F + G M + ILGI   + ++++I + +++     ++ R++   ++KM     PF+AK ++   H K+I  IK
Sbjct: 1935 KIMDNKVRTTTKDP--EELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRK---RTPRYNPNGYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDNQTEMMQIKLKELLTEFETTINASDPKTA--RKGMNAIITTILIWMMCIKGVEPFVVYDCDNIKIGDKYSLKETEECIAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSITRNECEEGFTTGRISIAGRVQVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGGVHSETETNTQISALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEM-------------------------------------MDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFS-CSNVYGPLFEIRDGSWLQFPARVIVAPPKIFGLTELASEAEFKPV-DISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHVTANMMKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKI---NWKMAIGFLPFLAKTMVLHGHSKDIHKIK 2667          

HSP 3 Score: 56.225 bits (134), Expect = 6.313e-7
Identity = 50/171 (29.24%), Postives = 74/171 (43.27%), Query Frame = 1
Query:  661 EPYRGEPEKFDDFFTEITQYADACGWTPDELKRRIPLYLKGYALQAYNNLECRGNNLDQLIENLRAEIVIDSDLQKIN--VQKFRNRYQGSRESVGEYVYTITELAKKAFGAGEHTPKLIEDQFWNGIQKYLYKA-LIMVEYRDLNELVHKAKRVEAIQRDRFNASIDAVE 1164
            EP+ G   K D F  E   Y + C WT  ++K R+PLYLKG A   +         L    E     + I    +K N  +  F +R Q   ES   Y   + +L K+AFG  +     + D F  GI++   +A L  +    L++ V  A R EA     +  ++ A E
Sbjct:  260 EPFDGSLNKLDTFIKEFEVYKNICHWTDQKVKERLPLYLKGSAQDVFVEEARDVTKLTTWTEIKEFLVKIFGIEKKGNRKILDFLHRKQRRDESNAVYACELKKLCKEAFGEADLPEDKMVDVFIRGIKREDIRANLGCLAPETLDKAVAIANRCEAHLGLGYQVAVLATE 430          
The following BLAST results are available for this feature:
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 5
Match NameE-valueIdentityDescription
GIN11.371e-1925.00gypsy retrotransposon integrase 1 [Source:HGNC Sym... [more]
RTL13.591e-1925.16retrotransposon Gag like 1 [Source:HGNC Symbol;Acc... [more]
RTL13.591e-1925.16retrotransposon Gag like 1 [Source:HGNC Symbol;Acc... [more]
GIN19.902e-1726.88gypsy retrotransposon integrase 1 [Source:HGNC Sym... [more]
NYNRIN5.224e-1630.46NYN domain and retroviral integrase containing [So... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 5
Match NameE-valueIdentityDescription
CR855320.11.207e-10432.09pep chromosome:GRCz11:1:7956030:7961696:1 gene:ENS... [more]
BX511082.11.953e-10431.94pep chromosome:GRCz11:9:14291932:14297132:1 gene:E... [more]
BX546500.12.765e-10231.56pep chromosome:GRCz11:23:12926092:12931693:-1 gene... [more]
BX511224.17.436e-10231.80pep chromosome:GRCz11:2:18017000:18022765:1 gene:E... [more]
CR925755.24.676e-10131.00pep chromosome:GRCz11:17:42486740:42492668:-1 gene... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSXETT00000035398.14.632e-10532.00pep primary_assembly:Xenopus_tropicalis_v9.1:3:986... [more]
castor12.017e-9932.00cytosolic arginine sensor for mTORC1 subunit 1 [So... [more]
anxa62.356e-9935.51annexin A6 [Source:Xenbase;Acc:XB-GENE-989741][more]
ENSXETT00000020886.13.761e-9932.00pep primary_assembly:Xenopus_tropicalis_v9.1:3:496... [more]
lin546.513e-9932.00lin-54 DREAM MuvB core complex component [Source:X... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 5
Match NameE-valueIdentityDescription
Gin12.259e-1924.60gypsy retrotransposon integrase 1 [Source:MGI Symb... [more]
Nynrin1.125e-1428.48NYN domain and retroviral integrase containing [So... [more]
Nynrin1.125e-1428.48NYN domain and retroviral integrase containing [So... [more]
Rtl11.187e-1422.67retrotransposon Gaglike 1 [Source:MGI Symbol;Acc:M... [more]
Rtl11.187e-1422.67retrotransposon Gaglike 1 [Source:MGI Symbol;Acc:M... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 5
Match NameE-valueIdentityDescription
sp|P20825|POL2_DROME1.073e-10031.36Retrovirus-related Pol polyprotein from transposon... [more]
sp|P04323|POL3_DROME1.644e-9432.68Retrovirus-related Pol polyprotein from transposon... [more]
sp|Q99315|YG31B_YEAST2.379e-9130.49Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomy... [more]
sp|Q7LHG5|YI31B_YEAST3.438e-9130.39Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomy... [more]
sp|P0CT39|TF26_SCHPO1.386e-8731.09Transposon Tf2-6 polyprotein OS=Schizosaccharomyce... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A355ABF29.670e-10942.38Uncharacterized protein OS=Flavobacteriaceae bacte... [more]
A0A147BAZ13.385e-16538.08Putative retrovirus-related pol polyprotein from t... [more]
A0A0V1MMY12.021e-15534.05Transposon Ty3-G Gag-Pol polyprotein OS=Trichinell... [more]
A0A0P5WAS73.174e-15437.10Retrovirus-related Pol polyprotein from transposon... [more]
A0A0P4ZTQ76.441e-15437.28Retrovirus-related Pol polyprotein from transposon... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSAMXT00000037150.11.903e-13233.09pep primary_assembly:Astyanax_mexicanus-2.0:10:222... [more]
ENSAMXT00000052546.11.827e-12829.58pep primary_assembly:Astyanax_mexicanus-2.0:2:1303... [more]
ENSAMXT00000041682.12.613e-12830.22pep primary_assembly:Astyanax_mexicanus-2.0:APWO02... [more]
ENSAMXT00000041345.11.159e-12733.59pep primary_assembly:Astyanax_mexicanus-2.0:25:323... [more]
ENSAMXT00000041754.11.163e-12733.62pep primary_assembly:Astyanax_mexicanus-2.0:APWO02... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 3
Match NameE-valueIdentityDescription
ENSPMAT00000004121.11.638e-2025.99pep scaffold:Pmarinus_7.0:GL477387:93825:107330:1 ... [more]
ENSPMAT00000004123.11.211e-1925.45pep scaffold:Pmarinus_7.0:GL477387:93825:107321:1 ... [more]
ENSPMAT00000009777.11.243e-1629.44pep scaffold:Pmarinus_7.0:GL476990:135790:139231:-... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 1
Match NameE-valueIdentityDescription
EDO338751.102e-829.07Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSORLT00000036827.18.145e-13432.93pep primary_assembly:ASM223467v1:16:28613823:28617... [more]
ENSORLT00000032171.13.461e-12833.21pep primary_assembly:ASM223467v1:18:2114932:211795... [more]
ENSORLT00000041674.11.666e-12733.62pep primary_assembly:ASM223467v1:5:30088905:300929... [more]
ENSORLT00000040501.11.337e-12532.52pep primary_assembly:ASM223467v1:1:1251188:1255180... [more]
ENSORLT00000045600.11.018e-12432.02pep primary_assembly:ASM223467v1:4:26167161:261711... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match NameE-valueIdentityDescription
SMESG000019863.11.417e-6938.14SMESG000019863.1[more]
SMESG000019863.11.223e-6938.14SMESG000019863.1[more]
SMESG000019863.17.622e-4438.14SMESG000019863.1[more]
SMESG000012882.11.309e-18037.55SMESG000012882.1[more]
SMESG000069191.13.062e-17936.93SMESG000069191.1[more]
back to top
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30000559 ID=SMED30000559|Name=Transposon Ty3-I Gag-Pol polyprotein|organism=Schmidtea mediterranea sexual|type=transcript|length=10607bp
AGAGACAATCATTTTGAGAATAATAGAGATAATCGCTCCGAATACTGAAG
AATTTTCATTCAAATTGCCCATAAAATTTTACCAAAAGAAGTGGTCTTAA
AAAGGAAATCTTTTTTGAGATCTTAAATTCGAACAGATAACGTTCTACAC
TATTTTTGCACGCATTCACATAACACGCACCGACATACATTTTAACAATT
ACACAACACGCACCCAATTTTTATTTATCAATTGCTTCAAAATAAATTGT
ATCTATCTGAAAGAATTTGAATCTTGGAGTGTATATCTGGTAAATTTTAT
TTATTAATAAAGTATATACATATATTTGACTTTGTCTTCTGATTTATATA
AGTAACCAGTGCAAGTTGAGACTCACTAAAATCGAGACCTCAACATTTGG
TGCCGCGAACCGGGACTTTAAAATCAATTATTGTTATCAACAATTTGGGG
GATAGGAATTGCGATCAGAGGGTTAGTGTAGAATATATCGGGAAAATATA
TTTATACTAAATAATATTATTGACCATAAATTAAAATCATAATTTTGTAA
TCTAAAAAATAGATAATCAGCCTGAAATAATTTATAATTTGAGAAATAGA
ATCGTATATAGAATGAATAATAATCAAAATAATAATAACTTTATTAAAAT
TTTACCGATTGAACCTTATAGAGGAGAACCTGAAAAGTTCGACGATTTTT
TCACTGAAATCACTCAGTATGCAGACGCTTGTGGTTGGACACCTGATGAA
TTAAAGAGGAGAATTCCGTTGTATTTAAAAGGATATGCGTTACAGGCATA
TAATAATCTAGAGTGTAGAGGAAACAATCTTGATCAACTTATTGAAAATT
TGAGAGCAGAGATAGTAATTGACAGCGATTTACAGAAAATAAATGTGCAA
AAGTTTAGAAATCGATATCAAGGATCTAGAGAATCAGTTGGAGAATACGT
GTATACGATAACAGAGTTAGCAAAAAAGGCATTTGGTGCCGGAGAGCACA
CACCGAAATTAATTGAGGATCAGTTTTGGAATGGAATCCAGAAATATTTA
TATAAAGCATTAATAATGGTTGAATATAGAGATTTAAATGAATTAGTTCA
TAAAGCGAAAAGAGTAGAGGCAATACAGAGAGATAGATTCAATGCATCTA
TAGATGCAGTTGAGATTCAAAGGCCAACGATGGCGAAACCTAAGTTGGAA
AACGATGACGATGTTCCATGGGAAGAAATGAAACAGTTATATTTAAATTT
TGTAAAAAATAGACGAAAAAATAGTCCGAATTCGAGCAGAAGTCAATCAC
CTGAAAACAGGAAAAGATTTGACGCATTGAATTTTACAGAACAAAATCAA
AATCGATTTAGAGATGATAATAGAGACAGTCGTTCTGACTATAACAGAAA
TAATTTTAGAGATAATCGTTATAACAAAAGTAGTGGAGACAATTATTTCG
GGAATAATAGAGATAATCGTTCTGAATGCCGCCAGTATGATAATAGAAAT
AATAGAAATAGCTCAAGAGGCAGACAATCGAGTCCATATAGACAAAATCA
ATACCGAGAAAGGCGAGCAAGCCCGTATAATAATGATGGAAGAGATAAAG
TGAATTTCAATGGACAAATCAAATGTTACAACTGCCAGAGATATGGTCAT
TTAGAAAAAGTTTGTACCAATCAGAGATGCGAGTCACCAAATAGAGTAAA
ATTTAAAAGTGAACCAGAAGTGAAACAAATAAATGGAGTAGAAATTACTA
ATACGAATAATGAAATAAATACTTTGAGAGAGGAATTGAAAGAGTTTAAA
ATTCAATATAGCCGACTAATGGAGAGAAAGTCGGAGGTATCACTTAATGA
GTATGAGTGTTTTATAAATGAAGTTGATATTGATAATGAGGAAACTAAGG
GACAAATAAAAATGAAAGAGGAATTAAAAGAAGTTTTGAAAAAACGACCT
TATGACGATAATGAAACATTGAGAGTAATAAATAAAGTTCAAATAAGCAG
AATTGATCCGGAATATTTAACCAGTTTAATTCCATTAATGGTAATGAGAC
AGAGAGAAACACGAGTATTTGAATTAGAAATAGAGTTAAAAGAGATGATT
AAAAGTTATGGAGAATTTGTGAGAAAAAGTCCGAAATTTTATGAACCTTT
AAAAATAACATTGGATACTAAAGACGAATATTTTCTTCAAATGGTATTAT
CGAAAACAGAAGTCCATGTGTGTACAATGGTAAACAAAATACGACAATTG
GCAAGAGATAAATGGCCTCGAAAAATAGGGCCAAACGATGAAATAAAATT
ATATTTTAACGATATTGAGTTGAAAGATGGCGAATATCAGATTGACTATG
GAGTAAATACAATAGATGAGAATACTATATCTGCAGTGATAAATAGAGAA
ACTAGCTCAAGGGTAGCAACACCTTCTGTTGAACTGAATGAGGAAAATGA
TTATGATATGCTGTTTAAAATGTGGGATCAACCATCCGAAGGGAGTGTAA
AGTGACCACTTGAAAAAGAAATAATTTTTCAACCATTGGATAATAAATAT
GGTAATTTGCAAGAGACGATAGAAAGATCTCCAAACAGGCAATAATGCAA
GACGCTGAAACAATGAAACGAGGTAGAGGACGACTCCGAAAATCAGAAAT
AGCGGGAAAATCAGATGATCGAAAAACTGCAGAGCGTTTGAGAGAAATAG
CCAACAAAAAGAGACATGAAAATCCAAGAGTAGTAGACGATAATAACGAT
TTAAATGTGATAATTGAACTTATAAAAGCAATGAACTCAAAACGATATCC
GGAAACAAAACCATCAGTGTTAATCGAAAATATGGTACAAAAAATCAAGT
TAATGGAAAAGGAGGTCAGAGAGTTGAGTGCAGAGGATGTAAATGAGGAG
GTGGTTTTAAGGATACTCTTCACAACTGGTATAAATAGAGATAAAGTAAT
TACGGCAATGGCATATCTAATTGCGAGATATAAAAAGCTCTTAAGATCAA
ATCCAAAGGAGCAAATAAATGAAGTAGCTAAAATAATATATCAAAGATTA
AAGGCGGATGTGATGCCATGGATAAAATTGAAAGGAGGTTGGAAAATAAT
GTATGTTGAAACATTAAAAACGAGTGAAAGTGAAACGATGGAAAATGATG
ATACCGAGGAAATTGTAAAAATTCTTGTGAACAAAACGACGAAAAAACCA
AACAAAAGTGAAGGAAAATCAAGTATCTCAATAATAATAGTAATAATAAC
ACTATTATTAGATTTAATCAGAGGAAGTAATGCGATTATAGCATATGACT
GTCAAAACCCAAAATTGGGGCAAAAGTATTCATTATTGGATGTGGAAAAT
TGTCCAGAAGTGAATCCAACCAAATTAATCGTAACTGAACCCAAGATTTT
TCATGTGTATCAAGAATCTGACTTTATACACACAGAGGCAAAAGAGTGTA
TAATCAAATTTTCAGAAGAAAGTTTTGTATGTAATCAAGTGGCAAGGACC
ATGTTAGTACCGACATGGGCGCCAGAGGCATTAATAATAACACCAAGAGA
GTGTGAAGAAGCATTTAAAACGAAAAAGATTAAAACAACGGATGGAATTA
AATTGAAAGCTCAAGCAGGAGCAAGAGTAAAACAATCAGTAAATGTGGTC
GAATGGACATCAGCCGGCGGATACTGTATGGGAGGAAAGTATAAAATATG
GGGTCAGATTGCAGATAACATTGTAGTTCGGAGACAATATTATGTTGAAT
TGAGAAAATTTAAAGCATCATTTGATAGTTCATCAAAGAAAATGATGACA
CACAAGTTCTGTATGCTTGATGAATCTAGTTGTGACACTGGTGAATCGAT
GATCGTGTATAAAATAGATAAATATGAATGCCAATTAACTAAATTGAAGT
CACTAAAATTCCGAACAATCAGAGGAAAACAGTTTACTGGTGCAGAATCA
AAAAGAATAAAAGATCAAAAAAGGAAAACAGTAATCGTACAAGAAAAGGA
TACCCCAACAACATATATGGCAGACTCAGAAGCAGAGGCCATGAGATTTG
TAGAAAAAGGAGAAGCTATCAAATGCGGAAAAGCAGTTGTAAAGACTAAT
TATGAAGGTATCTATATCAGTAGTAATGAAATTAAAGACGCAAAATTGAA
AATCGATAAATTTGATGTCAAACTATCTTCGTATTTTAATAATAAAATTG
ATTACTTTTATCATCATCAGTTAGTTCAATTAGATAAAGTATATCAAGCA
ACAATCACTAACGACTGCAAACTCAATAGAGAGATATTGAGAACAAAGAT
GGCTGTAGCGGTTACAAACCCAGATTTAATGGCACCAATATTGTTTGCGG
AAAAAGGAACTTTTGCGAGAGTAGTAGGAGAAGTGTTACAGACTTTTCAA
TGTAAACCGGTCAGTGTATCACTAGCAACGAATAACCAATGTACCAACGA
ATTACCCGTTATTTCTAAAGGAGAAACAGTGTATTTACAGCCTATAACCA
GAATACTAACAGATAAAACTTACATTCCAAGGAAAATCGATAAATGTACC
AATTTGCTTGATCCATTATACCAGCTAAATGATGAAATGTGGATTACAAT
GTCTGATAGAAAAGAAGCAACTAAACCTTTTAAACTAGAATTAACAGAAC
TTGAGAAAAAGTTAGAATTTCAAGAGATAAATGATATGAATAACAACGGT
ATGTACACGCGAGATGCCATAGAAAGTGCAAGAAAGCACATGTTGTTTCC
TAACGAGAAAGAGAAAATATTATCAATAATGGTGAGTAAAGTCATGGAGG
GTAGCCACGGTGGTGATTATAATTTCGATGTGTTGTTATCAAAGGAACAT
TTCAAAAAGGTAGTATACAAAGTATTGTATAGCATATGGGGATATTTTGC
AGTTTTGGGAAATATGTTCTCAACTATACTGGGAATATATTATACAGTGG
CGTTGCTCAAAATGATCTGTTCATCATTAGTATCATTACGACAACTTCGA
CAAGTATTTGGCAACTCATATAAAATGTTGGCATGTTTATGCCCCTTCGT
TGCAAAATATCTGATAACAGCAAAACATGATAAAGAAATACGATTAATTA
AAAACCGGCGAGTAGAAGAGGAACAATTGATGGAAAACGATAAAAACGAT
GATAATCCAAGTGGATCTGAAAATGTGGAGCAACCACAAAGGGGATTGTA
CGACAGTCAAAACCAGCAACTGAGAGAATTGAGTAATAATCTGGATAACA
AAATAACAAACTGTCATTGCGTAACAAGAAAGACCTATGGATGTTACGGT
ATTGAAGCAAATGAGATATTACGAGAACTCAATGAAGCCAGACCAAGCAT
CCAAGTATTAATCGATAATATAATCATCAAAGCACTAGTAGATACCGGAG
CAACAGCCTCCATGATAAGGGAGGATCAGTTATCAGAAAATCGAAAAAAA
GATAAAAAGCACAAATGATACGATAATATCTATGTCATCGCATACTATGC
AAGTGAGAGGAGATATTGAGCTCACTGTTCATTTTCCGAGTAGAACAGTT
AATCACGTATTCAAAGTAATGACTGAATGCAGAAGTAAATGCATCCTATG
GATAGATATCCTTCGCAAGTTAACTGATGGTGCAATCAAAAGGAAAGTAA
ACTATCAGGAATCACGATTTGCGGAAGTAGTAGCATTGGAAGCCTTCGAA
TTAGAATATCACTGTGATCATATAATTCCAGTAAAAGTTAACGTCAAAAG
TGAGGACACTCTTATTTTTGAGCCGGATAAGTCAATTACAAAAAATCATA
ATATAATGTTAACCGACGAATGTGTTATACCAAAAAACGGAATAATTCCA
ATTAGAATTGCTAATTACGAGAGAAGCAATGTGAAAATTCACAAAGGAAC
AAGATTGGGAAAGTTGTTCAAAGGTGAATTAGAACAAACATTGGAAATGT
GTGAAACGATGGTGTTGAGAGATGATAGTGAGCAAATGGTCGAACTAAGT
AAGACAAATAACGTTAAATTTTATGATAAAGTAAAAATTAATAATAACTA
TTTAAATGAAACCCAACAATTTAAACTACAAACTCTACTCGATGAATATA
GTGATGTATTTGCAAAACATGAGTTTGATTTGGGAGATACTGACATCATT
AAACATGTAATAGATACGGGAAACGCCAGACCAATAAAACAACGACCTTA
TGGGATACCTTATAAATTAAGGGAAGAGATAAAGCGACAAATAAGAGTTA
AAAGAGGCAGGCGTAATAAAACAAAGTTTTTCACAATGGGCATCACCCAT
TGTACCAGTTAGGAAAATTAGAATGAAATTAGAATGTGCATTGACTACCG
TAAATTAAATGATGTAACAATTAAAGATTCATACCCATTCCCGAAAGCAC
AAGAGCAATATGATAAACTCAGAGATACTAAGTATTTTACAGTTATTGAC
GCAAATAAGGGGTATATGCAAATTCAAATGGAAGAGGGGGATGCGGAAAA
AACTGCTTTTGTCATTGAAGACGGATTGTATGAATACACGAGATTACCTT
TTGGGTTAACTAATGCACCCGCAACATTTCAGCGCCTAATGAATACAATA
CTAGTAGATGTACAACATTGCGCAGTGTATATGGAAGATATATTAATTGC
ATCAAAAACTTTTGAGGAACATTTAAAAGATATAGCAAATGTGTTACAAA
GACTAAAAAAGGCTAGGATTAAAATCAAACCGACAAAATGTAAATGGGCA
GAGCCATCAGCGCTATTTTTAGGCCACATAGTCACGGTAGAGGGCATAAC
ACCTAACCCAGATAAAATTGAAGTTGTTAAAAATTTCCCAGTCCCAACAA
CCTTACAACAAGTACAAGGATTTTTGGGATTAACCGGGTATTACCGTAAA
TTTATACAAGATTACGCAAAAATAGCAGCACCCATGATAGAACTAACAAA
AGGAGTCAAAACAAAAGGAGAGAGTAAAGGAGTATTAATAGTACAAACAA
TAAATGATAAAATCAATAAACCACAAAAGAAATGGACATCCGAACATCAA
AAGGCATTTGATCAGTTAAAAGAGAAACTAATAACAGCACCAATATTACG
ATACCCAGACTTTAAGAAACAATTTATCGTCATGATGGACGCCAGCGGGA
AAGCCGTAGAGGCAGTCTTAGGACAAGAAGAAGAAAAGGGTAAAGATTAC
GTAATAACATATGCAAGTAAAACATTAAAAGGAGCACAACTTAGGTACTC
AACCATAGAAAAAGAATGCTACGCCATAGTATTTGCATTAAAACAATTCA
AACCATACATATACGGAACGGAAGTGGTAATAAGAACGGATCATAAACCA
TTAGAAGGCCTTTGGAAACATAAAGATACATCAAGTAGACTTTTAAAATG
GGCAATGAAAATACAAGACATGCATATCAAGATTATTTATAAACCAGGCA
GGATAAACAAAAATGCAGATGCACTATCACGCATTCATGAGCTAGGAGCA
TTAGAAGTTTTTGTTTTAGTAACAGAAAAGCCCATAGACATACAAGCGGA
ACAAAATAAAGATCTAGAAATTTCTAAGATTAGAGAGGAAATTAGAAAAG
GAACAAATCAAAAATATTAAATTTCAAAAAAATTAGTTGTTATTGAGGAT
TTTAGTGGAAAACAGTTAATAGTAGTACCACAAAAATTAAGGCAAATTCT
TTTGTTACAATATCATGATGGTGCTCTAGGGGGTCATTTATCGAGCAGAA
AAACAGCAAGCAGATTATTGCAAAAATATTATTGGGATAACTTAAAAAAA
GATGTTAAAAGATGGTGTAATGCATGTACAATATGCGCAACTAGAAAAAA
TACGGGGACAAAGACAAAAGCACCCTTACACCCAATCCCAGCTACAACAT
CGCCAATGGAGCTATGTGCTATGGATTTTTTATGCAAACTACCTACAACT
ATTAATGGGAATAAACATATATTAGTTTTTAGCGACTATTTTACTAAATG
GGCAGAGGCATTTCCAACCTAAGATGAGACAGCAAAAACAGTAGCCAAAT
TACTTGTAGCGAAAATTATTTTTAAAATCGGCACACCTAGAAAACTACTA
ACAGATAAGGGAGCAAATTTCATGAGCGAAACAATAAAAAGGGTAGCTAA
TATGTTCAACATGCACAAAATTAACACAACAGCTTATCATCCTCAAACTG
ATGGGTTAGTAGAAAGATTTAATGGCACACTTCTCAGTAGATTAGCGGTT
TTTGTGGGAAAAAATCAAAATGATTGGGATGAATACGTAAGTCAATGCAC
ATACATCCATAATATCACACAAAAAGTTGGAACGAACTACAAGCATTTCA
AAAACAAACATTAAAACCACTATGGCCTTTTTACGTAATATCAGCCAGCA
TTTTAGCAATAGCGATATTAGCAATAGTTAGTAGTGTACAAATCTACAAA
AAGTACAAGTGGAAAAGGCAGCGGTATCGGCACCGACTGGTCAGAAGTGT
ACCAGGGCATTTTCACTTGGATAAATATGTACCAGGTCAATATACAATGT
CTACTCTCGTCTAGTAAAATCAATAATTATCAATGTAAAATAATATGGTC
AATAATATTAATAAAATTTATACATTTATATAAAAAAGCGTATAAGGAAA
AACCTCAAATGCGCACCAAATATGTATATTTATATTTTATTTTTAGAAAA
AGAACCAATCCAAACATACAATCCAAACCCAACCAGTATCGCCAATCAAA
CCAATTCAAACAATACAATCAATCCAATCAATCCAATCAATCCCAACGAA
ATCAATTAACAATGCAATCCAGATCCCAAAAAATTCAATACTATATCTTT
CTAAACAAACATTATAGCAGCTTTTATAAGAACCTGCAAAACAAGATATG
TTCAGACGGAACCCCTATCACAGTCCGCAGCTTCCCAAACCAAGTATGCA
TCCTAAATAAAATGAAGGTGGAATTAGAGGAAAGGGAATTACACCTAATA
AAGTTTAATGAGGCTCGAAGAGTCAAACTACATGAGCTCATTGGTGCATA
TGACAGATTCCCGAATGCAGATGAAGCAGCAAAACGAGGAAAAAAAGTTC
CACATCGAATATTGTTCCATATGACGAAAACAGAAGTATCAATATGTCAG
CAAAAGATACCCCTCATTACGACAATCATGTGCACAGCGGTTATTAAAAC
AATAAAAACCGACAGAAGCGTCCCAAATGATTTCAATGAATTAACAGTAC
AGATGAACAATCTACCTTGTGTCATCGCAATAGAAGTACTATACGTTCCA
ACAATCCCAAATCCGTTCCAGTCATTACCAACAATACCTTCAAAATTGCA
AATACACGGTGAATTCATCAGAAGGAAAATATCGTTCACTCACAAGGAAT
TATCGACCACCGAGAAACAAGATCTAATTATGTTTGTGGACGTAAAGCTG
GTGTGCGACTCAAATGATAACATGAGGCCAATAGATGTAAACATCAGAAA
CTCTAAAGGGATAGCCATATACAATAAAAAAATCACACCCAGAACAAAGA
TCATGCATTATTATCCCGAAAAAACAATGGTGGTACAAAGATAAAATAAT
CAATGATCTTCTTAAAAAACCCGAACCTTTCTCCACAGCATCGACAACAA
TAATACTAAAAGCCATATACGGAATTGTAGAAAAACAATGGGAAGAAGTC
ATCAAATGGAACTCAGATAATTTTAACAGTGACAAGGAACAAATATTATC
ACAAGCCCAAGATGACAAGGAGGAAGATATATATATCGAACCTTCCCAAA
TAGAGTTAAAATTTAATAGAAATAGCAACCGGAGAACATTCGAGGACGAA
TTACGGGACAGCGATGATCAAGTCATTTTTATAAAGGAAGACAAAAAACA
AAAACTTACGGAAACAGTCAAACTAGAACCGCAAAAGCCAACAACATCCC
AACAAATGGAATGCGATATAGATTCAGACGATGAGTATTTGTCATGCGAG
GAAATAGAAGAGTCATACTTCATCATAGACAGAGAAAAAGAAAAAGTTAC
AGAAAGAGATTTATGCTACCACTTCGGAAGGAGAATCAACGAAGGAAGCG
TTCTGAACCTGGCATTAGGAAAAGGGTTCAAGATAACGAAGATGATGATT
CAAGACGTAGAAGTGGAGATGGTACGAATAGAGAAAGATAATGCGGACAT
TATCTTAACGGCCCAGGAAGGAATGTTGGATAATGAGGGTAGACCCATCA
CCACATTTATCTTGAACAAATATTTTAAAGCAGGGCCAGGAAAGTATAGA
GTTAACATATATGCCGAGGAGTGGGTATAAAATAAGTCAGGATGCAGAAC
CTCAATAAATTAGTCCAAGCAGAACAATCAATTCCCTATATCAAATCAAT
ATTCGAACAATCAAATCCATCCTCTGTGCGTCGAACAATTATTATTTTTA
TAAACAATATTGTGAACAATTGTTATTATTTAATTGATGAACAACAATTA
TTTATTGGTGTTGATATTTATTTTGTATTTGTAAATAATTTTTAAACACT
ATTGTTTAGTAACAATTTGATGTGGATTATTTAATTTTTATTAATTGGTG
TTGGTAG
back to top

protein sequence of SMED30000559-orf-1

>SMED30000559-orf-1 ID=SMED30000559-orf-1|Name=SMED30000559-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=277bp
MSSHTMQVRGDIELTVHFPSRTVNHVFKVMTECRSKCILWIDILRKLTDG
AIKRKVNYQESRFAEVVALEAFELEYHCDHIIPVKVNVKSEDTLIFEPDK
SITKNHNIMLTDECVIPKNGIIPIRIANYERSNVKIHKGTRLGKLFKGEL
EQTLEMCETMVLRDDSEQMVELSKTNNVKFYDKVKINNNYLNETQQFKLQ
TLLDEYSDVFAKHEFDLGDTDIIKHVIDTGNARPIKQRPYGIPYKLREEI
KRQIRVKRGRRNKTKFFTMGITHCTS*
back to top

protein sequence of SMED30000559-orf-2

>SMED30000559-orf-2 ID=SMED30000559-orf-2|Name=SMED30000559-orf-2|organism=Schmidtea mediterranea sexual|type=polypeptide|length=631bp
MNNNQNNNNFIKILPIEPYRGEPEKFDDFFTEITQYADACGWTPDELKRR
IPLYLKGYALQAYNNLECRGNNLDQLIENLRAEIVIDSDLQKINVQKFRN
RYQGSRESVGEYVYTITELAKKAFGAGEHTPKLIEDQFWNGIQKYLYKAL
IMVEYRDLNELVHKAKRVEAIQRDRFNASIDAVEIQRPTMAKPKLENDDD
VPWEEMKQLYLNFVKNRRKNSPNSSRSQSPENRKRFDALNFTEQNQNRFR
DDNRDSRSDYNRNNFRDNRYNKSSGDNYFGNNRDNRSECRQYDNRNNRNS
SRGRQSSPYRQNQYRERRASPYNNDGRDKVNFNGQIKCYNCQRYGHLEKV
CTNQRCESPNRVKFKSEPEVKQINGVEITNTNNEINTLREELKEFKIQYS
RLMERKSEVSLNEYECFINEVDIDNEETKGQIKMKEELKEVLKKRPYDDN
ETLRVINKVQISRIDPEYLTSLIPLMVMRQRETRVFELEIELKEMIKSYG
EFVRKSPKFYEPLKITLDTKDEYFLQMVLSKTEVHVCTMVNKIRQLARDK
WPRKIGPNDEIKLYFNDIELKDGEYQIDYGVNTIDENTISAVINRETSSR
VATPSVELNEENDYDMLFKMWDQPSEGSVK*
back to top

protein sequence of SMED30000559-orf-3

>SMED30000559-orf-3 ID=SMED30000559-orf-3|Name=SMED30000559-orf-3|organism=Schmidtea mediterranea sexual|type=polypeptide|length=958bp
MQDAETMKRGRGRLRKSEIAGKSDDRKTAERLREIANKKRHENPRVVDDN
NDLNVIIELIKAMNSKRYPETKPSVLIENMVQKIKLMEKEVRELSAEDVN
EEVVLRILFTTGINRDKVITAMAYLIARYKKLLRSNPKEQINEVAKIIYQ
RLKADVMPWIKLKGGWKIMYVETLKTSESETMENDDTEEIVKILVNKTTK
KPNKSEGKSSISIIIVIITLLLDLIRGSNAIIAYDCQNPKLGQKYSLLDV
ENCPEVNPTKLIVTEPKIFHVYQESDFIHTEAKECIIKFSEESFVCNQVA
RTMLVPTWAPEALIITPRECEEAFKTKKIKTTDGIKLKAQAGARVKQSVN
VVEWTSAGGYCMGGKYKIWGQIADNIVVRRQYYVELRKFKASFDSSSKKM
MTHKFCMLDESSCDTGESMIVYKIDKYECQLTKLKSLKFRTIRGKQFTGA
ESKRIKDQKRKTVIVQEKDTPTTYMADSEAEAMRFVEKGEAIKCGKAVVK
TNYEGIYISSNEIKDAKLKIDKFDVKLSSYFNNKIDYFYHHQLVQLDKVY
QATITNDCKLNREILRTKMAVAVTNPDLMAPILFAEKGTFARVVGEVLQT
FQCKPVSVSLATNNQCTNELPVISKGETVYLQPITRILTDKTYIPRKIDK
CTNLLDPLYQLNDEMWITMSDRKEATKPFKLELTELEKKLEFQEINDMNN
NGMYTRDAIESARKHMLFPNEKEKILSIMVSKVMEGSHGGDYNFDVLLSK
EHFKKVVYKVLYSIWGYFAVLGNMFSTILGIYYTVALLKMICSSLVSLRQ
LRQVFGNSYKMLACLCPFVAKYLITAKHDKEIRLIKNRRVEEEQLMENDK
NDDNPSGSENVEQPQRGLYDSQNQQLRELSNNLDNKITNCHCVTRKTYGC
YGIEANEILRELNEARPSIQVLIDNIIIKALVDTGATASMIREDQLSENR
KKDKKHK*
back to top

protein sequence of SMED30000559-orf-4

>SMED30000559-orf-4 ID=SMED30000559-orf-4|Name=SMED30000559-orf-4|organism=Schmidtea mediterranea sexual|type=polypeptide|length=356bp
MYQVNIQCLLSSSKINNYQCKIIWSIILIKFIHLYKKAYKEKPQMRTKYV
YLYFIFRKRTNPNIQSKPNQYRQSNQFKQYNQSNQSNQSQRNQLTMQSRS
QKIQYYIFLNKHYSSFYKNLQNKICSDGTPITVRSFPNQVCILNKMKVEL
EERELHLIKFNEARRVKLHELIGAYDRFPNADEAAKRGKKVPHRILFHMT
KTEVSICQQKIPLITTIMCTAVIKTIKTDRSVPNDFNELTVQMNNLPCVI
AIEVLYVPTIPNPFQSLPTIPSKLQIHGEFIRRKISFTHKELSTTEKQDL
IMFVDVKLVCDSNDNMRPIDVNIRNSKGIAIYNKKITPRTKIMHYYPEKT
MVVQR*
back to top

protein sequence of SMED30000559-orf-5

>SMED30000559-orf-5 ID=SMED30000559-orf-5|Name=SMED30000559-orf-5|organism=Schmidtea mediterranea sexual|type=polypeptide|length=429bp
MCIDYRKLNDVTIKDSYPFPKAQEQYDKLRDTKYFTVIDANKGYMQIQME
EGDAEKTAFVIEDGLYEYTRLPFGLTNAPATFQRLMNTILVDVQHCAVYM
EDILIASKTFEEHLKDIANVLQRLKKARIKIKPTKCKWAEPSALFLGHIV
TVEGITPNPDKIEVVKNFPVPTTLQQVQGFLGLTGYYRKFIQDYAKIAAP
MIELTKGVKTKGESKGVLIVQTINDKINKPQKKWTSEHQKAFDQLKEKLI
TAPILRYPDFKKQFIVMMDASGKAVEAVLGQEEEKGKDYVITYASKTLKG
AQLRYSTIEKECYAIVFALKQFKPYIYGTEVVIRTDHKPLEGLWKHKDTS
SRLLKWAMKIQDMHIKIIYKPGRINKNADALSRIHELGALEVFVLVTEKP
IDIQAEQNKDLEISKIREEIRKGTNQKY*
back to top

protein sequence of SMED30000559-orf-6

>SMED30000559-orf-6 ID=SMED30000559-orf-6|Name=SMED30000559-orf-6|organism=Schmidtea mediterranea sexual|type=polypeptide|length=125bp
MECDIDSDDEYLSCEEIEESYFIIDREKEKVTERDLCYHFGRRINEGSVL
NLALGKGFKITKMMIQDVEVEMVRIEKDNADIILTAQEGMLDNEGRPITT
FILNKYFKAGPGKYRVNIYAEEWV*
back to top
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
TermDefinition
PLANA:0000099neuron
PLANA:0000418head
PLANA:0002056medial muscle cell
PLANA:0002063non-ciliated epidermis
Vocabulary: INTERPRO
TermDefinition
IPR018061Retropepsins
IPR021109Peptidase_aspartic_dom_sf
IPR026298Blc2_fam
IPR001969Aspartic_peptidase_AS
IPR002475Bcl2-like
IPR001995Peptidase_A2_cat
IPR000477RT_dom
IPR015416Znf_H2C2_histone_UAS-bd
IPR012337RNaseH-like_sf
IPR001584Integrase_cat-core
IPR001878Znf_CCHC
IPR001641Spumavirus_A9
Vocabulary: biological process
TermDefinition
GO:0042981regulation of apoptotic process
GO:0006508proteolysis
GO:0015074DNA integration
Vocabulary: molecular function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
GO:0003676nucleic acid binding
GO:0008270zinc ion binding