Transposon Ty3-I Gag-Pol polyprotein

Overview
Name: Transposon Ty3-I Gag-Pol polyprotein
Smed ID: SMED30006762
Length (bp): 4786
Neoblast Clusters

Zeng et al., 2018





 



 

Overview

 

Single-cell RNA-seq of pluripotent neoblasts and their early progeny


We isolated X1 neoblast cells enriched for high piwi-1 expression (Neoblast Population) and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them from high (Nb1) to low (Nb12) piwi-1 expression levels. We further defined the groups of genes that best classified the cells in each of the 12 clusters and used them to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signatures was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1, which largely confirmed the cell clusters revealed by scRNA-seq.
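
As a minimal sketch of the naming step described above, the snippet below ranks clusters by piwi-1 expression and labels them Nb1, Nb2, and so on. The use of the per-cluster mean as the ranking statistic, the cluster labels `c0` to `c2`, and the expression values are all assumptions made for illustration; they are not taken from the dataset.

```python
from statistics import mean

# Hypothetical toy data: cluster label -> piwi-1 expression of its cells.
piwi1_by_cluster = {
    "c0": [2.1, 2.4, 2.0],
    "c1": [4.8, 5.1, 5.0],
    "c2": [3.3, 3.0, 3.6],
}


def name_clusters_by_expression(expr_by_cluster, prefix="Nb"):
    """Rank clusters from highest to lowest mean expression.

    Returns a mapping from the original cluster label to a new name
    (prefix1 for the highest mean, prefix2 for the next, ...).
    Using the mean is an assumption; the source only states that
    clusters were ordered from high to low piwi-1 expression.
    """
    ranked = sorted(
        expr_by_cluster,
        key=lambda cluster: mean(expr_by_cluster[cluster]),
        reverse=True,
    )
    return {cluster: f"{prefix}{rank}" for rank, cluster in enumerate(ranked, start=1)}


names = name_clusters_by_expression(piwi1_by_cluster)
print(names)
```

With real data the same function would be applied to the 12 unsupervised clusters, so the labels reflect relative piwi-1 levels rather than the arbitrary order in which a clustering tool numbers them.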

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 days post-irradiation, DPI) with minimal piwi-1+ cells, followed by isolation and single-cell RNA-seq of 1,200 individual cells derived from X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population).




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows a two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filtering). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Nb, neoblast groups.


Expression of Transposon Ty3-I Gag-Pol polyprotein (SMED30006762) in the t-SNE clustered cells.

Violin plots show distribution of expression levels for Transposon Ty3-I Gag-Pol polyprotein (SMED30006762) in cells (dots) of each of the 12 neoblast clusters.

 



 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Transposon Ty3-I Gag-Pol polyprotein (SMED30006762) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Transposon Ty3-I Gag-Pol polyprotein (SMED30006762) in cells (dots) of each of the 10 clusters of sub-lethally irradiated X1 and X2 cells.

 



 

Embryonic Expression

Davies et al., 2017




[Chart: average RPKM values per sample across embryonic stages]

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods.
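
For readers unfamiliar with the unit, the sketch below shows the standard RPKM calculation (reads per kilobase of transcript per million mapped reads) and a per-sample average. The read counts and library sizes are hypothetical, and using the 4,786 bp transcript length listed for SMED30006762 as the normalizing length is an assumption; the pipeline actually used is described in the Materials and Methods.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads Per Kilobase of transcript per Million mapped reads.

    RPKM = reads * 1e9 / (transcript length in bp * total mapped reads)
    """
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and library size must be positive")
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


def average_rpkm(values):
    """Arithmetic mean of replicate RPKM values for one sample group."""
    if not values:
        raise ValueError("need at least one value")
    return sum(values) / len(values)


# Hypothetical replicates for one stage: (reads on transcript, library size).
replicates = [(500, 20_000_000), (650, 25_000_000)]
per_replicate = [rpkm(reads, 4786, total) for reads, total in replicates]
print(per_replicate, average_rpkm(per_replicate))
```

Because RPKM divides by both transcript length and library size, values are comparable across stages with different sequencing depth, which is why a chart of averages per sample is a reasonable summary.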

 

Anatomical Expression

PAGE et al., 2020




SMED30006762 has been reported as being expressed in these anatomical structures and/or regions.



PAGE Curations: 7

  
Expressed In | Reference Transcript | Gene Models | Published Transcript | Transcriptome | Publication | Specimen | Lifecycle | Evidence
gut | SMED30006762 | SMESG000051654.1 SMESG000029259.1 | dd_Smed_v4_1871_0_1 | dd_Smed_v4 | PMID:29674431 (Fincher et al., 2018) | whole organism | asexual adult | single-cell RNA-sequencing evidence
neuron | SMED30006762 | SMESG000051654.1 SMESG000029259.1 | dd_Smed_v4_1871_0_1 | dd_Smed_v4 | PMID:29674431 (Fincher et al., 2018) | whole organism | asexual adult | single-cell RNA-sequencing evidence
parenchymal cell | SMED30006762 | SMESG000051654.1 SMESG000029259.1 | dd_Smed_v4_1871_0_1 | dd_Smed_v4 | PMID:29674431 (Fincher et al., 2018) | whole organism | asexual adult | single-cell RNA-sequencing evidence
reproductive organ | SMED30006762 | SMESG000047424.1 SMESG000029259.1 | Contig14663 | uc_Smed_v2 | PMID:28434803 (Rouhana et al., 2017) | whole organism | adult hermaphrodite | RNA-sequencing evidence
reproductive organ | SMED30006762 | SMESG000047424.1 SMESG000029259.1 | Contig14663 | newmark_ests | PMID:28434803 (Rouhana et al., 2017) | whole organism | adult hermaphrodite | RNA-sequencing evidence
reproductive organ | SMED30006762 | SMESG000046166.1 SMESG000029259.1 | Contig14995 | uc_Smed_v2 | PMID:28434803 (Rouhana et al., 2017) | whole organism | adult hermaphrodite | RNA-sequencing evidence
reproductive organ | SMED30006762 | SMESG000046166.1 SMESG000029259.1 | Contig14995 | newmark_ests | PMID:28434803 (Rouhana et al., 2017) | whole organism | adult hermaphrodite | RNA-sequencing evidence
Homology
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: RTL1 (retrotransposon Gag like 1 [Source:HGNC Symbol;Acc:HGNC:14665])

HSP 1 Score: 93.2041 bits (230), Expect = 3.817e-18
Identity = 78/320 (24.38%), Positives = 151/320 (47.19%), Query Frame = 1
Query: 2248 PWNTPLVCVWKKEKKDIRLCLDFRQL-NKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQE---KTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQ-KDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAI 3192
            P   P   V  + ++  RL  ++  L + +T RQ + +  + E+ D LHG+ +F+ ++L     +  ++    E   K AF   E +   +  PF ++  P   Q ++  +LKD+    V+ Y  ++LI++ ++E+H +   +VL +     +  + +K Q  R+ V+FLG ++   G++ +   +  I  +  P    +LR+F+     YR F++ ++  A  L      ++ +  W    ++AF  +K A   AP+L  P  +  F L+T  +   + A L Q  D+ G     A+ S  +S  E  Y     ++L I
Sbjct:  611 PSTAPWEPVGARMQERARLQEEYWDLQDMLTNRQDY-IQMIPELFDQLHGAEWFTKLELRGTIVEESVNGHRTEDVWKAAFGL-ELEEMKSYQPFALSPDPIIPQNVIHFILKDMLGFFVLSYGQEVLIYSMSQEEHLHHVRQVLVRFRHHNVYCSLDKSQFHRQTVEFLGFVVTPKGVKLNKNVMTIITGYPTPGSKLSLRNFIEFVFPYRHFVERFSIIAEPLVRQL-LSSYQFYWGVEEQEAFECLKRAFRKAPLLHHPKPQNPFYLETGVTGTALHASLIQIDDQTGKRACCAFYSRNISPIEVEYSQAEMKILPI 927          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: CNBP (CCHC-type zinc finger nucleic acid binding protein [Source:HGNC Symbol;Acc:HGNC:13164])

HSP 1 Score: 54.299 bits (129), Expect = 2.084e-7
Identity = 22/48 (45.83%), Positives = 33/48 (68.75%), Query Frame = 2
Query:  803 CWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGH 946
            C+ C KPGH +R+C+     +CY+CG  GHI+++C  +KC+RC   GH
Sbjct:  100 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCTKVKCYRCGETGH 147          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: CNBP (CCHC-type zinc finger nucleic acid binding protein [Source:HGNC Symbol;Acc:HGNC:13164])

HSP 1 Score: 54.299 bits (129), Expect = 2.294e-7
Identity = 22/48 (45.83%), Positives = 33/48 (68.75%), Query Frame = 2
Query:  803 CWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGH 946
            C+ C KPGH +R+C+     +CY+CG  GHI+++C  +KC+RC   GH
Sbjct:   99 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCTKVKCYRCGETGH 146          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: CNBP (CCHC-type zinc finger nucleic acid binding protein [Source:HGNC Symbol;Acc:HGNC:13164])

HSP 1 Score: 53.9138 bits (128), Expect = 2.350e-7
Identity = 22/48 (45.83%), Positives = 33/48 (68.75%), Query Frame = 2
Query:  803 CWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGH 946
            C+ C KPGH +R+C+     +CY+CG  GHI+++C  +KC+RC   GH
Sbjct:   93 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCTKVKCYRCGETGH 140          
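
The query coordinates in these alignments appear to be nucleotide positions on the transcript, translated in the stated reading frame and aligned against protein subjects. Under that assumption, the sketch below checks two things for a forward-strand hit: that the reported frame matches the start position, and how many codons the query span covers. The function names are illustrative and are not part of any BLAST tool.

```python
def forward_frame(query_start):
    """Reading frame (1, 2 or 3) of a forward-strand translated query.

    Uses the usual convention that 1-based position 1 starts frame 1,
    position 2 starts frame 2, and position 3 starts frame 3.
    """
    if query_start < 1:
        raise ValueError("coordinates are 1-based")
    return (query_start - 1) % 3 + 1


def codons_spanned(query_start, query_end):
    """Number of whole codons between two inclusive nucleotide positions."""
    span = query_end - query_start + 1
    if span <= 0 or span % 3 != 0:
        raise ValueError("span is not a positive multiple of three")
    return span // 3


# Coordinates copied from the CNBP and RTL1 hits listed above.
for label, start, end in [("CNBP", 803, 946), ("RTL1", 2248, 3192)]:
    print(label, "frame", forward_frame(start), "codons", codons_spanned(start, end))
```

The codon count can be smaller than the alignment length in the Identity line, because the alignment length also counts gap columns inserted into the query.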
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX511082.1 (pep chromosome:GRCz11:9:14291932:14297132:1 gene:ENSDARG00000113678.1 transcript:ENSDART00000183119.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX511082.1)

HSP 1 Score: 318.546 bits (815), Expect = 3.350e-88
Identity = 194/715 (27.13%), Positives = 348/715 (48.67%), Query Frame = 1
Query: 2185 DKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEK-GHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGK--RFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGK--IKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVHV--LLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXX-XXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINK-QDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFG 4302
            + +E+ I +     IIR  +SP       V KK+   +R C+D+R LN IT +  +P+P +    + L G+ +F+ +DL NAY+ V++ +  + KTAF+T  G F +  +PFG++ AP  FQ L+  VL+D+    + VYLDDILIF+ + ++H     +VL ++   GL +  EKC    + V FLGHI++ +G++ D  K++A+  +  P   K L+ FLG  N+YRRFI+++++ A  L ++        RW+   + AF  +K   ++AP+L+ PD  ++F+++ DAS   +GA+LSQ+    G  H  AY SH +++ E+ Y I  +ELLA+    + + H+L G    F++ TDHK + ++ + K+ + ++   W  +    D  + YR G+ +   D LSR         + +H +  +    I  R L ++A      W+ ++     ++       C              ++++P + R  +I+  H   L CH G  +    I+       +A +++  +  C  C   KT               + P+  I +D    L  + N    I  ++D +SK      + K    R  +  +++      G P ++  D G  F ++  +E     G  +  SS +H  +NG  ER  + +   +   +++    +W+  +  +EY  N+     TG+SP E   G
Sbjct:  538 EAMEKYISDSLAAKIIRPSSSPAGAGFFFV-KKKDGSLRPCIDYRGLNAITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRIREGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMLDQFIYVYLDDILIFSHSLQEHVQHVRRVLQRLLENGLYVKAEKCVFHAQSVPFLGHIVSVEGMRMDPEKVQAVVDWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTALTSLKT-PFRWSNAAQVAFDRLKSCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSSDGKMHPCAYFSHRLNNAEQNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIQSAKR-LNSRQARWALFFGRFDFSISYRPGSKNVKPDALSR---------IFDHSERASSPETIVPRRLFISAV----TWEIESRVRMALEGVTPPPGC-----------PPSRLFVPEELRSDVIRWGHSSKLACHPGVSRTLYLIKQRFWWPVMARDIRNFVLACSVCAVSKTSNRPPAGLLQPLSVPSRPWSHIALDFVTGLPPS-NGNTVILTVVDRFSKATHFIPLPKLPSARETAAAVIDHVFRIHGLPTDVVSDRGPQFISKFWREFCHLMGATVSLSSGFHPQSNGQTERANQDLERMLRCLVSQNP-SSWSQQLSWVEYAHNSLPVSATGLSPFECSLG 1223          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX546500.1 (pep chromosome:GRCz11:23:12926092:12931693:-1 gene:ENSDARG00000086495.3 transcript:ENSDART00000122176.3 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX546500.1)

HSP 1 Score: 316.62 bits (810), Expect = 7.142e-88
Identity = 226/908 (24.89%), Positives = 427/908 (47.03%), Query Frame = 1
Query: 1651 CLLDTGARINVMAKSVIDRLE-NIEILETRESLRCANNSRLETMGKLNINVKM---GSMERNVTFIIVKNLIPEIIGGVELQRL------FGIELKYILEEHEKRSDFICEIEARFGRIITDEERLRHAIDVLKVTGNKR-LLEIFQANKNVFMADKWDIGCT-NLIKHKIITKGEPIMIK-PRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEK-GHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGK--RFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNT--VLTKQGKIWIPSDNRQRMIKEVHV--LLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXX-XXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSK---YISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFG 4302
             L+D+GA  N +   +++ L+     L +  +++  +   L T+  +   +++   G+   N++F + K++   +I G     L      +G    +   E   +S  +         +  +E+     +D+  V      L  +F  ++   +       C  +L+      KG+   +  P R      + +E+ I +     IIR  +SP       V KK+   +R C+D+R LN IT +  +P+P +    + L G+ +F+ +DL NAY+ V++    + KTAF+T  G F +  +PFG++ AP  FQ L+  VL+D+    + VYLDDILIF+ + ++H     +VL ++   GL +  EKC    + V+FLGHI++ +G++ D  KI+A+  +  P   K L+ FLG  N+YRRFI+++++ A  L S+   +    RW+   E AF ++K   ++AP+L+ PD  ++F+++ DAS   +GA+LSQ+    G  H  AY SH +S  E+ Y I  +ELLA+    + + H+L G    F++ TDHK + ++ + K+ + ++   W  +    +  + YR G+ +   D LSR         + +  D  +         V  +G         + V  I  ++E++  +  +   T  +     ++++P + R  +++  H   + CH G  +    I+       +A +V+  +  C  C   K+               + P+  I +D    L  + N    +  ++D +SK   +ISL  +    E  ++  +++      G P ++  D G  F +R  +E  +  G  +  SS +H  +NG  ER  + +   +   +++    +W+  +  +EY  N+    +TG+SP +   G
Sbjct:  332 ALIDSGAEGNFIDSDLVNELKIPFSPLSSPIAVQALSGLSLPTITHITAPIRLVTSGNHTENISFFLTKSVNNPVILGHPWLVLHKPHINWGHSAVFSWSESCHKSCLLSACSTVPCSVFQEEQ-----VDLSNVPREYHDLKRVFSKSRAASLPPHRPYDCAIDLLPGTSPPKGKLYSLSVPER------EAMEKYISDSLAAKIIRPSSSPAGAGFFFV-KKKDGSLRPCIDYRGLNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQRLLENGLYVKAEKCVFHAQSVQFLGHIVSVEGMRMDPEKIQAVVDWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTSLT-SSKMPFRWSSAAEAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSSDGKIHPCAYFSHRLSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIRSAKR-LNSRQARWALFFGRFNFTISYRPGSKNIKPDALSR---------LFDPSDRLSSPDP-----VLPQG---------IVVANISWEIESR-VRTALNGVTPPIGCPPSRLFVPEELRSDVVRWGHSSKVACHPGVSRTLFVIKQRFWWPTMARDVRDFVLACSVCAVSKSSNRPPAGLLQPLSVPSRPWSHISLDFVTGLPSS-NGNTVVLTVVDRFSKAAHFISLPKLPSARETAVA--VIDHVFRIHGLPTDVVSDRGPQFVSRFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVSQNP-SSWSQQLSWVEYAHNSLPVSSTGLSPFQCSLG 1197          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: FO704673.1 (pep chromosome:GRCz11:12:14545475:14551077:-1 gene:ENSDARG00000112601.1 transcript:ENSDART00000188717.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:FO704673.1)

HSP 1 Score: 313.153 bits (801), Expect = 1.558e-86
Identity = 255/1014 (25.15%), Positives = 465/1014 (45.86%), Query Frame = 1
Query: 1645 VDCLLDTGARINVMAKSVIDRLENIEILETRESLRCANNSRLE--TMGK------LNINVKMGSM-ERNVTFIIVKNLIPEIIGGVELQRLFGIELKYILEEHEKRSDFICEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADK---------WDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKD--IRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQK-DEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGK--RFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVHVLLC--HAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXX-XXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILK-FGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKEI--NREDMEKRIED--KTLKPKISKTVR--------------NFEM-EDVVLIKQEIR-----NKDDARWEGPYKVIKKIHERSYLLK 4533
            V  L+D+G+ +N++++ + D+L+    + T   +   N +R++  T+G         +++ +G   E  +TF ++ +   E+I G        I    I     + + +    + R    I     L  +I+  +V     L + ++    VF   K         WD  C   +   +      I    R +   +E  IEEA+    ++G IR   SP       +   EKKD  +R C+D+R LN IT +  +P+P V   L+ L  +R ++ +DL +AY  +++ +  + KTAF T  G + +  MP+G+A +P  FQ  + +V +DL    V+ Y+DDILI++   E H      VL ++    L    EKC+    +  FLG+II+  G++ +NTK++A+  +  PK VK L+ FLG  N+YRRFI++Y+  +  L S+      K++W     K+F ++K +  TAP+L  P+    F+++ DAS   IGAVLSQ+    G  H  AY S  +++ E+ Y +  KELL++    + + H+L G    F + TDHK + ++ + ++ +  +   W  + +  +  + YR GT +  AD LSR+           ++  +  +    IL  +       W  D ME  EI+   ++         N       + ++P   R R+++ VH  L   H G  +    ++N      +  ++   +++C  C + KT                 P+  + +D    L  + N    I  IID +SK   L  +        +   L + + + +G P+++  D G  F ++  K   K   I +  +S YH  +NG +ER  + I  Y+    +    K W++ +P  EY  N+    +TG++P + I G +     W     +  + +D  +R E+   +   ++ + +R              N++  + V L  +++R      K   R+ GP+K++K+I+  +Y L+
Sbjct:  332 VSALVDSGSAVNIISQELTDKLK----IPTSPCVPVINITRIDNGTIGSGIKAITQPVSLSIGLFHEETITFYVIPSCKYEVILG---HPWLTIHDPTISWNQGELTHWSTHCQQRCFSKILSLPCLSTSIESPEVHSQVTLPQPYREFAEVFNKSKAAQLPPHRSWD--CAIELLPNMSPPKSKIYPLSRPETQAMETYIEEAL----SSGYIRPSTSPAAAGFFFI---EKKDGGLRPCIDYRGLNNITVKYRYPLPLVPPALEQLREARIYTKLDLRSAYNLIRIREGDEWKTAFLTTRGHYEYLVMPYGLANSPAVFQSFINEVFRDLLNKCVIAYIDDILIYSPNLEQHIKDVRTVLTRLQENQLYAKLEKCEFHMSKTSFLGYIISHHGVEMNNTKVQAVTGWPLPKTVKELQRFLGFANFYRRFIRNYSLISAPLTSLLKGKPSKLKWNPETVKSFEKLKTSFTTAPILKHPNPELPFVVEVDASDYGIGAVLSQRHGNPGKLHPCAYFSRKLTAAERNYDVGNKELLSMKAALEEWRHWLEGAVHPFQIITDHKNLEYIKSARR-LNPRQARWSLFFTRFNFTVTYRPGTKNHKADALSRR-----------YDQGQLDQTPVSILPPSVVIAQISW--DIME--EIQRGQQDDPPPPECPPN-------RQYVPQTLRLRIMQWVHNSLSSGHPGISRTLNLVRNAFWWPKMNQDITTFVKSCAVCAQSKTPRELPSGLLQPLPIPHRPWSHLSIDFVTDLPNS-NNYTTILVIIDRFSKACRLIPLKGLPTAMETALELFQHVFRVYGIPEDIVSDRGPQFTSKVWKAFCKQLDINVSLTSGYHPESNGQVERLNQEIGRYLRTYCSREQDK-WSNFLPWAEYAQNSLTHSSTGLTPFQCILGYQPPMFPWSGEPSMVPSVDDWVQRSEEVWNSAHVRLQRAIRTQRINADQRRRPNPNYQPGQRVWLSTRDLRLRLPSRKLSPRYVGPFKILKRINNVTYRLE 1304          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: CR855320.1 (pep chromosome:GRCz11:1:7956030:7961696:1 gene:ENSDARG00000099359.2 transcript:ENSDART00000159655.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CR855320.1)

HSP 1 Score: 310.842 bits (795), Expect = 9.844e-86
Identity = 193/701 (27.53%), Positives = 344/701 (49.07%), Query Frame = 1
Query: 2227 IIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEK-GHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGK--RFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNT--VLTKQGKIWIPSDNRQRMIKEVHV--LLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXX-XXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINK-QDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFG 4302
            IIR  +SP       V KK+   +R C+D+R LN IT +  +P+P +    + L G+ +F+ +DL NAY+ V++    + K+AF+T  G F +  +PFG++ AP  FQ  +  VL+D+    + VYLDDILIF+ + ++H     +VL ++   GL +  EKC    + V FLGHI++ +G++ D  KI+A+ ++  P   K L+ FLG  N+YRRFI+++++ A  L ++   +    RW+   E AF ++K   ++AP+L+ PD  ++F+++ DAS   +GA+LSQ+    G  H  AY SH +S+ E  Y I  +ELLA+    + + H+L G    F++ TDHK + ++ + K+ + ++   W  +    +  + YR G+ +   D LSR    +      E   +    +  RI+                 +  I  ++E+K  +  ++  T  +     ++++P   R  +I+  H   + CH G  + +  I+       LA +V+  +  C  C   KT               + P+  I +D    L  + N    I  ++D +SK      + K    R  +  ++N      G P ++  D G  F ++  +E  +  G  +  SS +H  +NG  ER  + +   +   +++    +W+  +  +EY  N+     TG+SP +   G
Sbjct:  554 IIRPSSSPAGAGFFFV-KKKDGSLRPCIDYRGLNNITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRAGDEWKSAFNTPRGHFEYCVLPFGLSNAPAVFQAFVNDVLRDMIDQFIYVYLDDILIFSHSLQEHVQHIRRVLQRLLENGLYVKAEKCVFHAQSVPFLGHIVSVEGLRMDPEKIKAVVNWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTALTS-SKTPFRWSSAAEAAFSKLKGCFVSAPILITPDPSRQFVVEVDASEVGVGAILSQRSSSDGKIHPCAYYSHRLSAAESNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIKSAKR-LNSRQARWALFFGRFNFTISYRPGSKNIKPDALSRLFDSS------ERTSSLEPVVPKRIV-----------------ISNITWEIESK-VRAALDGVTPPIGCPPSRLFVPEKLRSDVIRWGHSSKVACHPGVSRTSFVIKQRFWWPALARDVRDFVLACSVCAVSKTSNRPPAGLLQPLSVPSRPWSHISLDFVTGLPPS-NGNTVILTVVDRFSKAAHFVPLPKLPSARETAVAVINHVFRIHGLPTDVVSDRGPQFISKFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVSQNP-SSWSQQLSWVEYAHNSLPVSATGLSPFQCSLG 1225          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX511224.1 (pep chromosome:GRCz11:2:18017000:18022765:1 gene:ENSDARG00000113243.1 transcript:ENSDART00000186877.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX511224.1)

HSP 1 Score: 309.686 bits (792), Expect = 1.931e-85
Identity = 195/719 (27.12%), Positives = 353/719 (49.10%), Query Frame = 1
Query: 2185 DKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEK-GHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGK--RFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGK---IKTRILTVTAEGGYNKWQNDNMEVQE-IKNKLENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVHV--LLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXX-XXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSK---YISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFG 4302
            + +E+ I +     IIR  +SP       V KK+   +R C+D+R LN IT +  +P+P +    + L G+ +F+ +DL NAY+ V +    + KTAF+T  G F +  +PFG++ AP  FQ L+  VL+D+    + VYLDDILIF+ + ++H     +VL ++   GL +  EKC    + V+FLGHI++ +G++ D  KI+A+ ++  P   K L+ FLG  N+YRRFI ++++ A  L S+   +    RW+   E AF ++K   ++AP+L+ PD  ++F+++ DAS   +GA+LSQ+    G  H  AY SH +SS E+ Y I  +ELLA+    + + H+L G    F++ TDHK + ++ + K+ + ++   W  +    +  + YR G+ +   D LSR         + +  D  +     +  RI+            N + E++  ++  L+          N       ++++P + R  +++  H   + CH G  +    ++       +A +V+  +  C  C   K+               + P+  I +D    L  + N    I  ++D +SK   +ISL  +    E  ++  +++      G P ++  D G  F ++  +E  +  G  +  SS +H  +NG  ER  + +   +   +++    +W+  +  +EY  N+     TG+S  +   G
Sbjct:  538 EAMEKYISDSLAAKIIRPSSSPAGAGFFFV-KKKDGSLRPCIDYRGLNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVCIRPGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQRLLENGLYVKAEKCVFHAQSVQFLGHIVSVEGMRMDPEKIQAVVNWPTPDSRKALQRFLGFANFYRRFIHNFSQLAAPLTSLT-SSKTPFRWSSAAEAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSSAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIKSAKR-LNSRQARWALFFGRFNFTISYRPGSKNIKPDALSR---------LFDPSDRTSSPDPVLPQRIVVA----------NISWEIESRVRTALDGVTPPIGCPPN-------RLFVPEELRSDVVRWGHSSKVACHPGVSRTLFVVKQRFWWPAMARDVRDFVLACSVCAVSKSSNRPPAGLLQPLSVPSRPWSHISLDFVTGLPSS-NGNTVILTVVDRFSKAAHFISLPKLPSARETAVA--VIDHVFRIHGLPTDVVSDRGPQFVSKFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVSQNP-SSWSQQLSWVEYAHNSLPVSATGLSSFQCSLG 1223          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000006041.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:6:37317867:37322724:1 gene:ENSXETG00000003070.1 transcript:ENSXETT00000006041.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 370.933 bits (951), Expect = 3.403e-106
Identity = 237/846 (28.01%), Positives = 417/846 (49.29%), Query Frame = 1
Query: 2038 RLLEIFQANKNVFMADKWDIGCTNLIKHKI-ITKGEPIMIKPRR-QPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNE-----------KIR------------WTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAI-YYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLS---------------------RKTC-GTCVQCMMEHEDAKTG--------------KIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVL---------------------------TKQGKIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK 4308
            RL +    +  +F  +  D+GC+   KHKI + + +P   + RR  P +LED + + ++ L+  GII++  SP+ +P+V V +K+   IR+C+D+R LN+ T    +  P +++ L+ L GS++FS +DL + YYQ+ +  E +EKTAF    G F FNRMP G+  AP TFQ LM + + D+    V+VYLDD+++F +T E+H     KV  ++   GL+L+PEKCQ  +  V ++GH+++ +GI TD +KIEA+ S+ KP+ +  LRSFLG C YYRRF+K ++K A+ L  +   N E           ++R            WTE C+KAF ++K  L  APVL + D  K + L  DAS + +G +L Q+  K     +A+ S ++S  E+ Y   + E LA+ +      + YLYG  F ++TD+  +T+++TT K + A    W+  L+     + YR G  + +AD LS                     R  C G   +C   +   K G               + T  +    +G   + Q+++   Q     L  K  + +  ++  L                           +++ ++ +P  +R  ++  +H    H G  K    +++      +  +V+    +C RC + KT+ ++    +       P + + +D    ++        +  + DHY++Y        Q   T+++ L+ ++ + +G PK +H D G++FE++ + EL    G+    ++PYH   +   ER  RT+ + +  +L +  + +W+  +  + +  N+T    TG SP  ++FGR+
Sbjct:  290 RLRKHLNEHSALFSRNDLDMGCSTSTKHKIRLREDKPFRERSRRIAPGDLED-LRKHLEELKAAGIIKESRSPYASPIVVV-RKKNGSIRMCVDYRTLNQRTIPDQYTTPRIEDALNCLVGSKWFSVLDLRSGYYQLPMHPEDKEKTAFICPLGFFEFNRMPQGLCGAPATFQRLMERTVGDMHLLEVLVYLDDLIVFGRTLEEHEQRLLKVFDRLEKEGLKLSPEKCQFCQPSVNYVGHVVSAEGIATDPSKIEAVSSWPKPRTITELRSFLGFCGYYRRFVKGFSKVAQPLNQLLQSNTEVEGSDRDILAKRLRGQGWTRESIEDFWTEECDKAFDQLKYCLTHAPVLAYADATKPYTLHIDASREGLGGILYQEYNK-ELRPVAFISRSLSPSERNYPAHKLEFLALKWAVVDKLHEYLYGAEFEVQTDNNPLTYILTTAK-LDATGHRWLAALAHYKFSLRYRPGRDNRDADGLSRRPHGDLHPDDEWIEIPAPGVRTMCQGITGRCQSGNFAEKIGMTVRGIPKLYCNVTSVGTSTMPALNKGDIKRDQSEDPLCQMAMEALRKKQVQILKTDSHPLASLLVKEWDRLRLRDGLVYRRAPSATDSEKWQLMLPQKHRDSVLMALHDEHGHLGYDKTLGLVRDRFYWPCMKQDVEDYCRSCLRCIQRKTLPSRAAPLSHMESH-SPLDLVCIDFLS-IEPDEGGTSNVLVVTDHYTRYAQAFPTKDQRAITVAKVLVERFFIHYGLPKRIHSDQGRDFESKLVHELMSMLGVLKSRTTPYHPQGDPQPERFNRTLLDML-GTLPKEKKTHWSRHIATVVHAYNSTKNGATGYSPYFLMFGRE 1128          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: anxa6 (annexin A6 [Source:Xenbase;Acc:XB-GENE-989741])

HSP 1 Score: 327.791 bits (839), Expect = 1.738e-95
Identity = 172/466 (36.91%), Positives = 263/466 (56.44%), Query Frame = 1
Query: 2011 DVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGE-PIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEK-IRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRK 3402
            D L ++   +L +I ++   +F A+    G T+  +HK+ T  + PI     R    +  +++  I  +   G+I   +SPW +P+V V KK+    R C+D+R+LN +T   A+PMP VDE+LD L  ++Y +++DL   Y+Q+ L   +QEK+AF T  G + F  MPFG+  AP TFQ L+ ++L+ + +D    YLDDI +F++T E+H     +V  +I  AGL L PEKC +   EV++LGH +    ++ D  K+EAI  +  PK  K + +FLG   YYR+FI +Y+  A+ L  +  +   + I WT  CE A   +K+AL ++PVL  PDF + FIL TDAS   +GAVLSQ +  G EH +AY S  +   E  Y    KE LAI +  +    YLYG+ F + TDH  ++++         +   W   L   +  +++RKG  H NAD LSR+
Sbjct:  295 DHLHLSQQDQLRKILRSYSPMFSANP---GRTHWAEHKVDTGTQLPIRSPAYRVAEAVRPEMKSQIDEMLAFGVITPSHSPWASPVVLVPKKDGS-TRFCVDYRRLNDVTTTDAYPMPRVDELLDRLGNAKYLTTLDLSRGYWQIPLAPSAQEKSAFLTPFGLYQFTVMPFGMRNAPATFQRLVNRLLEGM-QDFAQAYLDDIAVFSQTWEEHLQHLQRVFAQIQDAGLTLKPEKCHLAMAEVQYLGHRVGGGQLRPDPAKVEAICQWPIPKTQKQVLAFLGTSGYYRKFIPNYSTVAKPLTDLTSRQRSRTIVWTPECESAMNALKQALASSPVLAAPDFSRRFILQTDASNFGLGAVLSQVNTYGEEHPVAYLSRKLLPREAAYATIEKECLAIVWALQKLQPYLYGREFTVVTDHNPLSWLQRVSG-DNGKLLRWSLLLQQYNFTIQHRKGKEHHNADGLSRQ 754          

HSP 2 Score: 91.2781 bits (225), Expect = 8.376e-18
Identity = 65/253 (25.69%), Positives = 116/253 (45.85%), Query Frame = 1
Query: 3916 IDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKI----DRMKWYSNKEINREDME---------KRIE----------------DKTLKPKISKTVRNFEMEDVVLIKQEIRNKDDARWEGPYKVIKKIHERSYLLKDQNGKMVVRNVEKIKHFK 4587
            +D+ ++Y    A+ K D  T+++ L+  +  + G P E+  D G  F ++ ++ L +  G+  I SSPYH  TNG+ ER   T++  +  +  E G K+W   +P + +      Q++TG SP E+++GR++    D +  Y       +++          +R+E                 K    + ++  R  E + V+L+     +K  A WEGPY V  K+H+ +Y         VV   E   H+K
Sbjct:    1 MDYATRYPEAVALRKIDAPTVADALIQIFS-RVGFPSEILSDQGPQFTSQLLQCLWQRCGVRAIHSSPYHPQTNGLCERFNGTLKTMLR-TFVESGEKDWERYLPHLLFAYREVPQESTGFSPFELLYGRRVRGPLDLLCEYWEGAPQSQEVPIIPYVLKFRQRLEQMTSLAHDHLSAAQQRQKVWYDRKARERRFMEGDKVLLLVPTRHDKLQAAWEGPYVVTHKLHDTTY---------VVTPPEDPSHYK 242          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000035398.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:3:9869556:9871940:-1 gene:ENSXETG00000011182.1 transcript:ENSXETT00000035398.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 300.827 bits (769), Expect = 1.447e-86
Identity = 207/718 (28.83%), Positives = 335/718 (46.66%), Query Frame = 1
Query: 2191 IEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKD--IRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEK-GHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKR--FVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVL--TKQGKIWIPSDNRQRMIKEVHVL--LCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXX-XXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKF-GAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYI--NASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGR 4305
            ++E I      G IR   SP       V   EKKD  +R C+D+R LNKIT +  +P+P + E+ D L G++ FS +DL  AY  +++ +  + KTAF+T++G + +  MPFG+  AP  FQE +  + +DL    V+VYLDDILIF++  E H +   + L ++    L    EKC     ++ FLG+II+  G + D  K+ AIQ +  P+  K ++ F+G  NYYR+FIK ++ +   + S+  K      W  +  +AF  +K+A I+A VL  P+    F ++ DAS    GA+LSQ+    G  H  AY S   SS E+ Y I  +ELLA+    + + H L G      + TDHK + F+ + K+    Q   W  + S     + YR GT +  AD LSR         +         +I   +L   AE                         + ++  +     T  G  ++P + R  ++++ H      H G++K  + +Q       +  +V+  +  C  C   K   ++           + P+  + MD    L  +      I  +ID +SK      + K         L  + I +  G P E+  D G  F +R  + L K+ G+ L FSS YH  TNG  ER  + + +++  + SL +    +W+D++P  E+  N     +TG SP   ++G+
Sbjct:   18 MKEYISENLQRGFIRPSTSPAGAGFFFV---EKKDGGLRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENSLFAKLEKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKGFSSRIAPILSLIRKGGRPNCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQ-ARWSLFFSRFHFVLTYRPGTKNRKADALSRSFSPEDRLPIEREPIIPPSRIIASVLPQFAE-------------------------QILLSQSAAPPDTPIGMAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLQRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPVPSRPWTHLGMDFIVELPPSCG-NTVIWVVIDRFSKMAHFVPLRKLPSAVELAQLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQ---DDWSDLLPWAEFAHNNARHSSTGRSPFLSVYGQ 702          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000034712.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:9:33567219:33572531:-1 gene:ENSXETG00000011772.1 transcript:ENSXETT00000034712.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 311.227 bits (796), Expect = 1.345e-85
Identity = 241/919 (26.22%), Positives = 412/919 (44.83%), Query Frame = 1
Query: 1624 LDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLRC--------ANNSRLETMGKLNINVKMGSMERNVTFIIVKNLIPEIIGGVELQRLFGIELKYILEEHEKRSDFI---CEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCT-NLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKD--IRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEK-GHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKR--FVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVL--TKQGKIWIPSDNRQRMIKEVHVL--LCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXX-XXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKF-GAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYI--NASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGR 4305
            L  Q   V   LD+GA  N M  +   ++  I +     S+R         + ++   T G+L++ +    +E+ ++F+I+      ++ G+   RL    + +   +  + S +    C I     R+            V +        ++F      F+       C  +L+   +  +G    + P      +++ I E   NL+  G IR   SP       V   EKKD  +R C+D+R LNKIT +  +P+P + E+ D L G++ FS +DL  AY  +++ +  + KTAF+T++G + +  MPFG+  AP  FQE +  + +DL    V+VYLDDILIF++  E H +   + L ++    L    EKC     ++ FLG+II+  G + D  K+ AIQ +  P+  K ++ F+G  NYYR+FIKD++ +   + S+  K      W  +  +AF  +K+A I+A VL  P+    F ++ DAS    GA+LSQ+    G  H  AY S   SS E+ Y I  +ELLA+    + + H L G      + TDHK + F+ + K+    Q + W  + S  +  + YR GT +  AD LSR         + +       +I   +L   AE                         + ++  +     T  G  ++P + R  ++++ H      H G++K  + ++       +  +V+  +  C  C   K   ++           + P+  + MD    L  +      I  +ID +SK      + K         L  + I +  G P E+  D G  F +R  + L K+ G+ L FSS YH  TNG  ER  + + +++  + SL +    +W+D++P  E+  N     +TG SP   ++G+
Sbjct:  347 LSSQAIPVSAFLDSGAAGNFMDLAFAKKV-GISLFPVTPSIRVFAIDDRPLSTDTITLTTGELSVQIGALHLEK-MSFLIIPCPSSPVVLGLPWLRLHNPSIDWSSGQISRWSQYCQRHCLIPQPLQRVTVSSTSFSALPSVYR-----DFSDVFCKKSAEFLPPHRRYDCPIDLLPGTMPPRGRTYPLSPAET-AAMKEYISE---NLQR-GFIRPSTSPAGAGFFFV---EKKDGGLRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENFLFAKLEKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKDFSSRIAPILSLIRKGGRPNCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQAR-WSLFFSRFNFVLTYRPGTKNRKADALSRSFSPEDRLPIEQEPIIPPFRIIASVLPQFAE-------------------------QILLSQSAAPSDTPIGMAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLRRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPIPSRPWTHLGMDFIVELPPSCG-NTVIWVVIDRFSKMAHFIPLRKLPSAVELAHLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQ---DDWSDLLPWAEFAHNNASHSSTGRSPFLSVYGQ 1220          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000042189.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:KV460667.1:82848:87648:1 gene:ENSXETG00000017765.1 transcript:ENSXETT00000042189.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 309.301 bits (791), Expect = 4.854e-85
Identity = 240/919 (26.12%), Positives = 411/919 (44.72%), Query Frame = 1
Query: 1624 LDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLRC--------ANNSRLETMGKLNINVKMGSMERNVTFIIVKNLIPEIIGGVELQRLFGIELKYILEEHEKRSDFI---CEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCT-NLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKD--IRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEK-GHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKR--FVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVL--TKQGKIWIPSDNRQRMIKEVHVL--LCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXX-XXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKF-GAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYI--NASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGR 4305
            L  Q   V   LD+GA  N M  +   ++  I +     S+R         + ++   T G+L++ +    +E+ ++F+I+      ++ G+   RL    + +   +  + S +    C I     R+            V +        ++F      F+       C  +L+   +  +G    + P      +++ I E   NL+  G IR   SP       V   EKKD  +R C+D+R LNKIT +  +P+P + E+ D L G++ FS +DL  AY  +++ +  + KTAF+T++G + +  MPFG+  AP  FQE +  + +DL    V+VYLDDILIF++  E H +   + L ++    L    EKC     ++ FLG+II+  G + D  K+ AIQ +  P+  K ++ F+G  NYYR+FIKD++     + S+  K      W  +  +AF  +K+A I+A VL  P+    F ++ DAS    GA+LSQ+    G  H  AY S   SS E+ Y I  +ELLA+    + + H L G      + TDHK + F+ + K+    Q + W  + +  +  + YR GT +  AD LSR         + +       +I   +L   AE                         + ++  +     T  G  ++P + R  ++++ H      H G++K  + ++       +  +V+  +  C  C   K   ++           + P+  + MD    L  +      I  +ID +SK      + K         L  + I +  G P E+  D G  F +R  + L K+ G+ L FSS YH  TNG  ER  + + +++  + SL +    +W+D++P  E+  N     +TG SP   ++G+
Sbjct:  347 LSSQAIPVSAFLDSGAAGNFMDLAFAKKV-GISLFPVTPSIRVFAIDDRPLSTDTITLTTGELSVQIGALHLEK-MSFLIIPCPSSPVVLGLPWLRLHNPSIDWSSGQISRWSQYCQRHCLIPQPLQRVTVSSTSFSALPSVYR-----DFSDVFCKKSAEFLPPHRRYDCPIDLLPGTMPPRGRTYPLSPAET-AAMKEYISE---NLQR-GFIRPSTSPAGAGFFFV---EKKDGGLRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENFLFAKLEKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKDFSSHIAPILSLIRKGGRPNCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQAR-WSLFFTRFNFVLTYRPGTKNRKADALSRSFSPKDRLPIEQEPIIPPFRIIASVLPQFAE-------------------------QILLSQSAAPSDTPIGMAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLRRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPIPSRPWTHLGMDFIVELPPSCG-NTVIWVVIDRFSKMAHFIPLRKLPSAVELAHLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQ---DDWSDLLPWAEFAHNNASHSSTGRSPFLSVYGQ 1220          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Rtl1 (retrotransposon Gaglike 1 [Source:MGI Symbol;Acc:MGI:2656842])

HSP 1 Score: 80.4925 bits (197), Expect = 1.742e-14
Identity = 72/300 (24.00%), Positives = 138/300 (46.00%), Query Frame = 1
Query: 2326 NKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQ------FCFNRM----PFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQ-KDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAI 3192
            + +T+RQ +    V E+ D LHG+ +F+ ++L          KES+ +   +  E        F  ++M    PF + +       ++  +LKD+    V+ +  ++L+++ ++E+H     +VL +     +  + +K Q  R+  + LG  I+  G++ +   +  I     P   + L+S + +   YR F++++A  A  L      ++E   W E  ++A   +K A   +PVL  P  +  F L+TD +   + A L Q  DE G +   A+ S  +S+ E  Y      +L I
Sbjct:  898 DMLTDRQDYTQ-MVPELFDQLHGAAWFTKLELLGI-------KESEMRHTVTHTEDTWRASFGFGLHQMRCYRPFTMNSYSDEGNNIVHFILKDILGLFVICHGREVLVYSMSQEEHSQHVRQVLVRFRYHNIYCSLDKTQFHRQTAEILGFNISPKGVKLNKNLMNLIVGCPVPGSRRCLQSVIDLVYPYRHFVENFAVIAAPLVRQL-LSSEPYYWGEEEQEALESLKRAFRKSPVLYHPKPQNPFYLETDITGSFLSASLVQTDDETGKKSTCAFYSRPLSTMEVEYPRVEMRILPI 1188          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Zcchc13 (zinc finger, CCHC domain containing 13 [Source:MGI Symbol;Acc:MGI:1922314])

HSP 1 Score: 55.0694 bits (131), Expect = 7.190e-8
Identity = 25/58 (43.10%), Positives = 35/58 (60.34%), Query Frame = 2
Query:  803 CWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGHKERECYTNME 976
            C+ C +PGH +R+CN +   +CY CG  GHI+++C  IKC+RC   GH    C    E
Sbjct:   91 CYICSQPGHLARDCNRQEEQKCYTCGEFGHIQKDCTQIKCYRCGENGHMAVNCSKTSE 148          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Cnbp (cellular nucleic acid binding protein [Source:MGI Symbol;Acc:MGI:88431])

HSP 1 Score: 54.299 bits (129), Expect = 1.575e-7
Identity = 22/48 (45.83%), Positives = 33/48 (68.75%), Query Frame = 2
Query:  803 CWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGH 946
            C+ C KPGH +R+C+     +CY+CG  GHI+++C  +KC+RC   GH
Sbjct:   99 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCTKVKCYRCGETGH 146          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Cnbp (cellular nucleic acid binding protein [Source:MGI Symbol;Acc:MGI:88431])

HSP 1 Score: 53.9138 bits (128), Expect = 1.638e-7
Identity = 22/48 (45.83%), Positives = 33/48 (68.75%), Query Frame = 2
Query:  803 CWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGH 946
            C+ C KPGH +R+C+     +CY+CG  GHI+++C  +KC+RC   GH
Sbjct:   98 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCTKVKCYRCGETGH 145          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P20825|POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 340.117 bits (871), Expect = 2.138e-96
Identity = 291/979 (29.72%), Positives = 460/979 (46.99%), Query Frame = 1
Query: 1633 QGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLRCANNSRLETMGKLNINVKMGSMERNVTF-----------------IIVKNLIPEIIGGVELQR----LFGIELKYILEEHEKRSDFICEIEARFGRIITDEERLRHA------IDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKI-ITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKD----IRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCE--KAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSR-----KTCGTCVQCMMEHEDAK--------TGKIKTRILTVTAE-----------GGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTV---------LTKQGKIWIPSDNRQRMIKEVHVL-------------------LCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEP-FEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGI--IERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQK-TTGVSPAEIIF 4299
            +GR   CLLDTG+ IN++        ENI  L  + S RC     L + G + +N  +  + RN  F                 +I + L+      +  +     LF    K I  E E+  +   +         +D+E ++        +D L      +L  +    +N+   +   +  TN IKH +  T   PI  K        E ++E  ++ + N G+IR+ NSP+N+P   V KK         R+ +D+R+LN+IT    +P+PN+DEIL  L   +YF++IDL   ++Q+++D+ES  KTAFSTK G + + RMPFG+  AP TFQ  M  +L+ L     +VYLDDI+IF+ +  +H N    V  K+A A L+L  +KC+  +KE  FLGHI+  DGI+ +  K++AI S+  P   K +R+FLG+  YYR+FI +YA  A+ + S C K   KI  T+  E  +AF ++K  +I  P+L  PDF K+F+L TDAS   +GAVLSQ       H I++ S  ++ HE  Y    KELLAI +  K F HYL G++F++ +DH+ + ++   K+P  A+ + W   LS    K++Y KG  ++ AD LSR            Q   E +++             K +I+ + ++                 Q D M +++ K  L +    FI  N T+         + ++  I I +    ++I+ + +L                   L H G QK+TK  + N    N    ++ +I  C  C   KT    TK   +   + E   EK  +DI        ++ K+    ID YSK+ +L  I  +D       L+ +   + G PK L  D    F + ++K   +   +EL  ++      NG+  +ER  +TI E I    +    +     +  I YT N  ++  TTG  PA+I  
Sbjct:   21 KGRSYKCLLDTGSTINMIN-------ENIFCLPIQNS-RC---EVLTSNGPITLN-DLIMLPRNSIFKKTEPFYVHRFSNNYDMLIGRKLLKNAQSVINYKNDTVTLFDQTYKLITSESERNQNLYIQRTPE-SIASSDQESIKKLDFSQFRLDHLNQEETFKLKGLLNKFRNLEYKEGEKLTFTNTIKHVLNTTHNSPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTS-CLKKRTKID-TQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNG-----HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEP-GAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIEENHHSEATQHSAEEDNSNLIHLTEKPINYFKKQIIFIKSDKNKVEHSKIFGNSITTIQYDVMTLEKAKQILLD---HFIHRNITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSLFLLKNVGSYAEFKEIILQSHEKLLHPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKTEHRNTKMPLKITPNPEHCREKFVVDIYS------SEGKHYISCIDIYSKFATLEQIKTKDWIECRNALM-RIFNQLGKPKLLKADRDGAFSSLALKRWLEEEEVELQLNTA----KNGVADVERLHKTINEKIRIINSSDDEEVKLSKIETILYTYNQKIKHDTTGQRPAQIFL 964          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P04323|POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 332.798 bits (852), Expect = 7.612e-94
Identity = 209/628 (33.28%), Positives = 329/628 (52.39%), Query Frame = 1
Query: 1606 GRPS-TTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLRCAN-----NSRLETMGKLNINVK----MGSMERNVTFIIVKNLIPEIIGGVELQ----RLFGIELKYI--LEEHEKR--------SDFICEIEARFGRII-TDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKE----KKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEM-CEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSR 3399
            G+P   T+  +   + CL+DTG+ +N+ +K++ D    + I  T   +  +N     N  +    K+         +     N   ++ + L+ E    +  +     L+  + K I  +  HE+          D +     +   I+ +D  RL H    L     +RL  + Q   ++   +   +  TN  KH I TK    +      P   E ++E  I+++ N GIIR  NSP+N+P+  V KK+    K+  R+ +D+R+LN+IT     P+PN+DEIL  L    YF++IDL   ++Q+++D ES  KTAFSTK G + + RMPFG+  AP TFQ  M  +L+ L     +VYLDDI++F+ + ++H    G V  K+A A L+L  +KC+  ++E  FLGH++  DGI+ +  KIEAIQ +  P   K +++FLG+  YYR+FI ++A  A+ + + C K N KI  T    + AF ++K  +   P+L  PDF K+F L TDAS   +GAVLSQ       H ++Y S  ++ HE  Y    KELLAI +  K F HYL G+ F + +DH+ ++++   K P  ++   W   LS  D  ++Y KG  +  AD LSR
Sbjct:   11 GKPQYITIKYKENNLKCLIDTGSTVNMTSKNIFD----LPIQNTSTFIHTSNGPLIVNKSIIIPSKILFPTTNEFLLHPFSENYDLLLGRKLLAEAKATISYRDQEVTLYNNKYKLIEGIATHEQSHFQNVNMIPDTMLRQPNKISPILESDLYRLEH----LNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTINTKHNLPLYSKYSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPM-TKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQDG-----HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDP-NSKLTRWRVKLSEFDFDIKYIKGKENCVADALSR 623          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|Q99315|YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 326.635 bits (836), Expect = 1.143e-89
Identity = 264/873 (30.24%), Positives = 420/873 (48.11%), Query Frame = 1
Query: 2113 IKHKIITKGEPIMIKPRRQPINLEDKIEEAI----KNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALE-SICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGH-EHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRK----TCGTCVQCMMEHEDA--KTGKIKTRILTVTAE-GGYNKWQNDNMEVQEIKNKLENKDC---KFIMENNTVLTKQGKIWIPSDNRQRMIKEVH---VLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEP-FEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKF-GAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKEINRE-----DMEKRIEDKTLKPKISKTVRNFEME---------------DVVLIKQEIRNKDDAR------WEGPYKVIKKIHERSYL--LKDQNGKMVVRNVEKIKHF 4584
            +KH I  K  P    PR QP ++ +K E+ I    + L +N  I    SP ++P+V V KK+    RLC+D+R LNK T    FP+P +D +L  +  ++ F+++DL + Y+Q+ ++ + + KTAF T  G++ +  MPFG+  AP TF   M    +DL    V VYLDDILIF+++ E+H+     VL ++    L +  +KC+   +E +FLG+ I    I     K  AI+ F  PK VK  + FLG+ NYYRRFI + +K A+ ++  IC K+    +WTE  +KA  ++K+AL  +PVLV  + +  + L TDAS D IGAVL + D K     V+ Y S ++ S +K Y     ELL I     HF + L+GK F LRTDH ++  +    +P   + Q W++ L++ D  +EY  G  +  AD +SR     T  T      E   +  K+  + + +L    E   +N    D    +  + KLE  +     + +E+  +   Q ++ +P   +  +++  H   +   H G       I        L   + + I  C +CQ +K+   +     Q +   E  +  I MD    L  T N    I  ++D +SK     A  K  + T    LL ++I  + G P+ +  D      A   +EL K  GI+   SS  H  T+G  ER  +T+   + A  +    +NW   +P+IE+  N+T  +T G SP EI  G   +     S+ E+N       ++ K ++  T++ K        EME               D VL+ ++   K  A       + GP++V+KKI++ +Y   L     K  V NV+ +K F
Sbjct:  584 VKHDIEIK--PGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKDGT-FRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKS----QWTEKQDKAIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEP-ARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYY-QDRLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYASTNI-QNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEVNARSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVHRDAYFKKGAYMKVQQIYVGPFRVVKKINDNAYELDLNSHKKKHRVINVQFLKKF 1444          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|Q7LHG5|YI31B_YEAST (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 323.939 bits (829), Expect = 5.623e-89
Identity = 258/853 (30.25%), Positives = 412/853 (48.30%), Query Frame = 1
Query: 2113 IKHKIITKGEPIMIKPRRQPINLEDKIEEAI----KNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALE-SICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGH-EHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRK----TCGTCVQCMMEHEDA--KTGKIKTRILTVTAE-GGYNKWQNDNMEVQEIKNKLENKDC---KFIMENNTVLTKQGKIWIPSDNRQRMIKEVH---VLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEP-FEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKF-GAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKEINRE-----DMEKRIEDKTLKPKISKTVRNFEME---------------DVVLIKQEIRNKDDAR------WEGPYKVIKKIHERSYLL 4530
            +KH I  K  P    PR QP ++ +K E+ I    + L +N  I    SP ++P+V V KK+    RLC+D+R LNK T    FP+P +D +L  +  ++ F+++DL + Y+Q+ ++ + + KTAF T  G++ +  MPFG+  AP TF   M    +DL    V VYLDDILIF+++ E+H+     VL ++    L +  +KC+   +E +FLG+ I    I     K  AI+ F  PK VK  + FLG+ NYYRRFI + +K A+ ++  IC K+    +WTE  +KA  ++K AL  +PVLV  + +  + L TDAS D IGAVL + D K     V+ Y S ++ S +K Y     ELL I     HF + L+GK F LRTDH ++  +    +P   + Q W++ L++ D  +EY  G  +  AD +SR     T  T      E   +  K+  + + +L    E   +N    D    +  + KLE  +     + +E+  +   Q ++ +P   +  +++  H   +   H G       I        L   + + I  C +CQ +K+   +     Q +   E  +  I MD    L  T N    I  ++D +SK     A  K  + T    LL ++I  + G P+ +  D      A   +EL K  GI+   SS  H  T+G  ER  +T+   + A ++    +NW   +P+IE+  N+T  +T G SP EI  G   +     S+ E+N       ++ K ++  T++ K        EME               D VL+ ++   K  A       + GP++V+KKI++ +Y L
Sbjct:  610 VKHDIEIK--PGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKDGT-FRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKS----QWTEKQDKAIEKLKAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEP-ARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAIYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYY-QDRLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYVSTNI-QNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEVNARSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVHRDAYFKKGAYMKVQQIYVGPFRVVKKINDNAYEL 1450          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P10394|POL4_DROME (Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster OX=7227 GN=POL PE=4 SV=1)

HSP 1 Score: 312.768 bits (800), Expect = 3.829e-86
Identity = 165/452 (36.50%), Positives = 257/452 (56.86%), Query Frame = 1
Query: 2068 NVFMADKWDIGCTNLIKHKI-ITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKK-----EKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKT 3405
            ++F  +   I   NL K ++ +   EP+  K  R P +  ++I+  ++ L  + I+    S +N+PL+ V KK     +KK  RL +D+RQ+NK      FP+P +D+ILD L  ++YFS +DL + ++Q++LD+ S++ T+FST  G + F R+PFG+  AP +FQ +MT     +      +Y+DD+++   +E+       +V GK     L+L+PEKC  F  EV FLGH     GI  D+ K + IQ++  P    + R F+  CNYYRRFIK++A  +R +  +C K N    WT+ C+KAF  +K  LI   +L +PDF KEF + TDAS    GAVL+Q +  GH+  +AY S A +  E     T +EL AI++   HF  Y+YGK F ++TDH+ +T++ +   P +   +  +  L   +  +EY KG  +  AD LSR T
Sbjct:  288 DIFALESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHITRLC-KKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQ-NHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDHRPLTYLFSMVNPSSKLTRIRLE-LEEYNFTVEYLKGKDNHVADALSRIT 736          

HSP 2 Score: 106.301 bits (264), Expect = 1.696e-21
Identity = 87/330 (26.36%), Positives = 158/330 (47.88%), Query Frame = 1
Query: 3673 HAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKE-----INREDMEK----RIE------DKTLKPKISKTVRNFEME---------DVVLIKQEIRNKDDARWEGPYKVIKKIHERS--YLLKDQNGKMVVRNVEKIKHF 4584
            H G  K    ++ +   +N++  +K+ +  C++CQK KT        T T      F+++ +D  GPL ++ N  +Y   +I   +KY+    I  +  +T+++ +   +ILK+G  K    D G  ++   I +L K   I+ I S+ +HH T G++ER  RT+ EYI + ++   + +W   +    Y  N T        P E++FGR  +  K ++         N +D  K    R+E       K L+    K   N++++         D VL++ E+ +K D ++ GPYK I+ I + +   LL ++N K +V   +++K F
Sbjct:  909 HTGITKTLAKVKRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYIS-TDKTDWDVWLQYFVYCFNTTQSMVHNYCPYELVFGRTSNLPKHFNKLHSIEPIYNIDDYAKESKYRLEVAYARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLLRNEVGHKLDFKYTGPYK-IESIGDNNNITLLTNKNKKQIVHK-DRLKKF 1235          
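
The HSP summary lines in this report follow a fixed pattern: bit score, raw score in parentheses, E-value, then identical and positive residue counts over the alignment length. As a minimal sketch, assuming that pattern holds for every hit, the following Python reads one such pair of lines and recomputes the percentages from the counts. The helper name `parse_hsp_summary` and the regular expressions are illustrative assumptions, not part of the database's own tooling.

```python
import re

# Hypothetical helper for the two summary lines printed above each alignment.
SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
# The second label is matched loosely so a minor spelling variant still parses.
COUNT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp_summary(text):
    """Return bit score, raw score, E-value and recomputed percentages."""
    score = SCORE_RE.search(text)
    if score is None:
        raise ValueError("no HSP score line found")
    result = {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(score.group(3)),
    }
    for label, matched, length in COUNT_RE.findall(text):
        key = "identity" if label == "Identity" else "positives"
        # Percentage = matched residues / alignment length (gaps included).
        result[key + "_pct"] = round(100.0 * int(matched) / int(length), 2)
        result["length"] = int(length)
    return result


# Values copied from the second HSP of the transposon 412 hit.
example = (
    "HSP 2 Score: 106.301 bits (264), Expect = 1.696e-21\n"
    "Identity = 87/330 (26.36%), Positives = 158/330 (47.88%), Query Frame = 1"
)
summary = parse_hsp_summary(example)
print(summary)
```

Recomputing the percentages from the raw counts gives a quick consistency check on a hit before comparing it with others in the list.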
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A0V0RP87 (Transposon Ty3-G Gag-Pol polyprotein OS=Trichinella nelsoni OX=6336 GN=TY3B-G PE=4 SV=1)

HSP 1 Score: 554.673 bits (1428), Expect = 7.783e-167
Identity = 334/1089 (30.67%), Positives = 567/1089 (52.07%), Query Frame = 1
Query: 1612 PSTTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRE-SLRCANNSRLETMGKLNINVKMGSMERNVTFIIVKNLI-PEIIG--------------GVELQRLFGIELKYILE-EHEKRSDFICEIEARFGRIITDEERL---------------RHAIDVLKVTGNKR--LLEIFQANKNVFMADKWDIGCTNLIKHKIITKG-EPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDL-WKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKN--------KLENKDCKFI----MENNTVLTKQGKI---WIPSDN----------RQR---MIKEVH--VLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYI-NASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKI----------------DRMKWYSNKEINREDMEKRIEDKTLKPKISKTVRN-------------FEMEDVVLIKQEIRNKDDARWEGPYKVIKKIHERSYLLKDQNGK--MVVRNVEKIKHF 4584
            PS    + G ++  LLD+GA ++V+  S   +    E L+    S+   +  R+   G+  + +++GS    +   IV++L+ P I+G                E+    G  ++ + E  H  R    C       R ++ EE +               R  +D  + +   R  L  + +         + D+G T+L++H+I T G +P+ + PRR P    + ++  I+ + + G+I   + PW++P+V V KK+    R C+D+R+LN +T   A P+P +D+ LD L G+++FS++DL + Y+QV++ +E +EKTAFST  G F F  MPFG+  AP TFQ LM K L+ L WK   +VYLDDI++F KTEE+H      VL ++ + GL++ PEKCQ+ R+ V +LGH++ + GI TD  K  A+Q +  P+CV+ +R FLG+ +YYRRF++++A  A  L ++  K  EK  W    E+AF  +K+AL++ P+L  PDF + F+LD DAS D +GAVLSQ+ E+    VIAY S ++S  E+GYC TR+E+LA+ +  +HF  YLYG+RF+ RTDH ++ ++ + ++P   Q   W+  L+  D ++ +R G  H NAD LSR+    C QC ME   A   ++    + + A     +WQ  + E+Q+I+         +   +  + +     + + ++ ++G I   W   D           RQR   ++  VH      H G  K    ++          +V+     CE C      T K +   Q      PF+++ MD+ GPL+ET N  +YI    D++SK+    A+   + RT++  L+N    ++GAP+ LH D G+NFE+  +KE+ +  G+    ++ YH  ++G++ER  RT+ + +  AS++     +W   +  +     ++V  TTG +P+ +IFGR++                + +  Y+ +   R+D+E+  E  T++ K  +  R              +E  D V ++   + K  A W+GPY+V KK+   +Y +K   G+   +V + +++K +
Sbjct:  386 PSVAGKLNGLEIPLLLDSGAVVSVVPLSTWHKSTGGEPLKAAGGSILLGDGRRVRLCGQGTLPLQLGSWRGRLHVAIVESLVVPGILGTNFLDQYVKLIDWQAGEMTMTDGSSVRIVHEPSHATRPGIGCTWITASPREVSREETVGERPGNNSGELGACERALVDRAECSAQNRRALRSLIRRYGKAISCGEGDLGRTSLVQHRIETGGAQPVKLPPRRLPQAQRETVDRLIREMLHAGVIEPASGPWSSPVVLVRKKDGSP-RFCVDYRRLNAVTRVDAQPIPRIDDTLDALAGAKWFSTLDLASGYWQVEVAEEDREKTAFSTPLGLFQFRVMPFGLCNAPATFQRLMEKALRGLTWKT-CLVYLDDIIVFGKTEEEHLERLEGVLSRLQSVGLKIKPEKCQLMRQSVHYLGHVVTQQGIGTDPEKTAAVQKWPTPRCVREVRQFLGLASYYRRFVRNFAGVANPLHALT-KKGEKWHWGPKEEEAFTLLKKALVSPPILGHPDFDRTFLLDVDASEDAVGAVLSQQGERDPPSVIAYASRSLSRAERGYCATRREMLALVWATQHFRPYLYGRRFIARTDHNSLRWLRSFREP-EGQVARWLERLAEFDFEVVHRAGRKHQNADALSRR---VCKQCGMEGSPA---EVPVGAVKLDAASPIKQWQESDKELQQIREWSTQRTWPRAAPEGSRLLRSLWAQRDRIVVREGTICRKWETPDTGETRLLQVIPRQRIPEILAAVHNGQSGAHLGVAKTLAKLRQRYYWPQQREDVEDWCRACETCAARAVPTKKPQAPMQLQPVGYPFQRVGMDLLGPLEETRNGNRYILVACDYFSKWPEAFALPNAEARTVAAALVNGLFCRYGAPETLHSDQGRNFESALVKEVCQLFGVAKTRTTAYHPQSDGLVERMNRTLLDLLAKASIDHP--DDWDAHLDRVLLAYRSSVHHTTGATPSRVIFGREMRLPVDLVYGLPENAPEESVGEYTRR--LRQDLEQLYE--TVRGKAGREQRRQKFWKDRKAHGPVYEPGDQVWMQVPEKTKLGAYWDGPYEVQKKLDWNTYRVKQMKGRRQRLVVHFDRLKPY 1458          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A0V0X0B7 (Transposon Ty3-G Gag-Pol polyprotein OS=Trichinella sp. T6 OX=92179 GN=TY3B-G PE=4 SV=1)

HSP 1 Score: 554.288 bits (1427), Expect = 2.191e-166
Identity = 334/1089 (30.67%), Positives = 567/1089 (52.07%), Query Frame = 1
Query: 1612 PSTTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRE-SLRCANNSRLETMGKLNINVKMGSMERNVTFIIVKNLI-PEIIG--------------GVELQRLFGIELKYILE-EHEKRSDFICEIEARFGRIITDEERL---------------RHAIDVLKVTGNKR--LLEIFQANKNVFMADKWDIGCTNLIKHKIITKG-EPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDL-WKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKN--------KLENKDCKFI----MENNTVLTKQGKI---WIPSDN----------RQR---MIKEVHVLLC--HAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYI-NASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKI----------------DRMKWYSNKEINREDMEKRIEDKTLKPKISKTVRN-------------FEMEDVVLIKQEIRNKDDARWEGPYKVIKKIHERSYLLKDQNGK--MVVRNVEKIKHF 4584
            PS    + G ++  LLD+GA ++V+  S+  +    E LE    S+   +  R+   G+  + +++GS    +   IV++L+ P I+G                E+    G  ++   E  H  R    C       R ++ EE +               R  +D  + +   R  L  + +         + D+G T+L++H+I T G +P+ + PRR P    + ++  I+ + + G+I   + PW++P+V V KK+    R C+D+R+LN +T   A P+P +D+ LD L G+++FS++DL + Y+QV++ +E +EKTAFST  G F F  MPFG+  AP TFQ LM K L+ L WK   +VYLDDI++F KTEE+H      VL ++ + GL++ PEKCQ+ R+ V +LGH++ + GI TD  K  A+Q +  P+CV+ +R FLG+ +YYRRF++++A  A  L ++  K  EK  W    E+AF  +K+AL++ P+L  PDF + F+LD DAS D +GAVLSQ+  +    VIAY S ++S  E+GYC TR+E+LA+ +  +HF  YLYG+RF+ RTDH ++ ++   ++P   Q   W+  L+  D ++ +R G  H NAD LSR+    C QC +E     T ++    + + A     +WQ  + E+Q+I+         +   +  + +     + + ++ ++G I   W   D           RQR   ++  VH      H G  K    ++          +V+     CE C      T K +   Q      PF+++ MD+ GPL+ET N  +YI  + D++SK+    A+   + RT++  L++    ++GAP+ LH D G+NFE+  +KE+ +  G+    ++ YH  ++G++ER  RT+ + +  AS++     +W   +  +     ++V  TTG +P+ IIFGR++                + +  Y+ +   R+D+E+  E  T++ K  +  R              +E  D V I+   + K  A W+GPY+V KK+   +Y +K+  G+   +V + +++K +
Sbjct:  346 PSVAGKLNGFEISLLLDSGAVVSVVPLSIWHKSTGGEPLEAAGGSILLGDGRRVRLCGQGTLPLQLGSWRGRLHVAIVESLVVPGILGTNFLDQYVKLIDWQAGEMTMTDGSSVRIAHEPSHATRPGIGCTWITAGPREVSREETVGERPGNDSGELGACERALVDRAECSAQNRRTLRSLIRRYGKAISCGEGDLGRTSLVQHRIETGGAQPVKLPPRRLPQAQRETVDRLIREMLHAGVIEPASGPWSSPVVLVQKKDGSP-RFCVDYRRLNAVTRVDAQPIPRIDDTLDALAGAKWFSTLDLASGYWQVEVAEEDREKTAFSTPLGLFQFRVMPFGLCNAPATFQRLMEKALRGLTWKT-CLVYLDDIIVFGKTEEEHLERLEGVLSRLQSVGLKIKPEKCQLMRQSVHYLGHVVTQQGIGTDPEKTAAVQKWPTPRCVREVRQFLGLASYYRRFVRNFAGVANPLHALT-KKGEKWHWGPKEEEAFTLLKKALVSPPILGHPDFDRTFLLDVDASEDAVGAVLSQQGGQDPPSVIAYASRSLSRAERGYCATRREMLALVWATQHFRPYLYGRRFIARTDHNSLRWLRNFREP-EGQVARWLERLAEFDFEVVHRAGRKHQNADALSRR---VCKQCGLE---GSTAEVPVGAVRLDAANPIKQWQESDKELQQIREWSTQRTWPRAAPEGSRLLRSLWAQRDRIVVREGTICRKWETPDTGETRLLQVIPRQRIPEILAAVHNGQSGGHLGVAKTLAKVRQRYYWPQQREDVEDWCRACETCAARAVPTKKPQAPMQLQPVGYPFQRVGMDLVGPLEETRNGNRYILVVCDYFSKWPEAFALPNAEARTVAAALVDGLFCRYGAPETLHSDQGRNFESALVKEVCQLFGVAKTRTTAYHPQSDGLVERMNRTLLDLLAKASIDHP--DDWDAHLNRVLLAYRSSVHHTTGATPSRIIFGREMRLPVDLVYGLPENTPEESVGEYTRR--LRQDLEQLYE--TVRGKAGREQRRQKFWKDRKAHGPVYEPGDQVWIQVPEKTKLGAYWDGPYEVQKKLDWNTYRVKEMKGRRQRLVVHFDRLKPY 1418          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A087SUZ9 (Retrovirus-related Pol polyprotein from transposon 412 (Fragment) OS=Stegodyphus mimosarum OX=407821 GN=X975_16630 PE=4 SV=1)

HSP 1 Score: 549.666 bits (1415), Expect = 1.844e-163
Identity = 313/930 (33.66%), Positives = 509/930 (54.73%), Query Frame = 1
Query: 2065 KNVFMADKWDIGCTNLIKHKIITKGEP-IMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCG-TCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIME---------------------------------NNTVLTKQ-----GKIW-----IPSDNRQRMIKEVHVLLC--HAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKIDRMK----------WYSNKEINREDMEKRIEDKTLKPKISKTVRNFEMEDVVLIKQEIRNKDDAR------------------------------WEGPYKVIKKIHERSY-LLKDQNGKMVVRNVEKIKHFKK 4590
            ++VF     D+G T+L +H+I T   P I   PRR PI  ++++   +K+++ + +I    SPW +P+V V KK+    R C+D+R+LN++T++ ++P+P +D+ILD L GS++FS++DL + Y+QV++  + +EKTAF+T +G + F  MPFG+  AP TF+ LM  VLK L  +  ++YLDDI+I  K+ E+H     KVL K+  A L+L+P KC++FR+EV +LGH+I+ +G++TD  K+ A++ +++P+ V  LRSFLG+C YYRRF+KD++  AR L  +  ++ +K  WT+ CE AF  +KEAL +AP+L +P   + FILDTDAS +++GAVLSQ+ E G E V+AY S  +S  E+ YC+TRKELLAI    +HF+HYLYG++F+LRTDH ++ +++  K P   Q   WI  L   DI + +RKG SH NAD LSR+ C   C  C       +      R +T +     + W +  +     K++L+++D K I+E                                  N VL ++     GK +     +P      ++KE+H      H G  K  + ++      N   +V+K    C+ C   K    +++ + Q      PFE+I  DI GPL  T +  KYI   ID+++K+     I  Q+  T++E ++  WI +FG P +LH D G+NF +  +KEL K  GI+   ++P H  ++G++ER  RTI   ++  ++   +++W   +P       + V +TTG SP++++FGR + R+             S+ E   +D++ R E           + NF  E V L  ++++ + D R                              W+GPY V+ ++++    + K  N K  V + ++++  KK
Sbjct:  643 QDVFSRSSSDVGRTSLTQHRIDTGDHPPIKQHPRRLPIAKQEEVRALLKDMQESNVIEPSASPWASPIVLVRKKDGS-TRFCVDYRRLNEVTKKDSYPLPRIDDILDTLSGSKWFSTLDLKSGYWQVEIHPDDREKTAFTTGQGLWQFKVMPFGLCNAPATFERLMETVLKGLSYEACLIYLDDIIIVGKSFEEHLENLRKVLQKLKEANLKLSPAKCKLFRQEVTYLGHVISAEGVRTDPEKVSAVKDWRRPENVHQLRSFLGLCTYYRRFVKDFSSIARPLHKLT-ESKQKFVWTKECENAFKNLKEALTSAPILTYPQLDRPFILDTDASNESVGAVLSQEIE-GQERVVAYWSKCLSKPERNYCVTRKELLAIVKAVEHFHHYLYGRKFLLRTDHASLAWLLNFKNP-EGQIARWIQRLQEYDITIRHRKGQSHGNADALSRRPCPENCRYCSRVEAKYQLVNPVARQITASTLADPDPWTDKEIR----KDQLQDRDIKPIIELMETSNRKPTWQDISSYSPTTKQYWALWDSLHIRNGVLYRKFESDDGKTFRWQLVLPRSRIPDVLKELHSSPAGGHFGIMKTLQRVRERFYWNNAKDDVQKWCRTCDACVSRKGPKKRSRGKLQRYNVGAPFERIAFDILGPLPRTADGNKYILVAIDYFTKWPEAYPIPDQEAVTVAEMMIQHWISRFGVPLQLHSDQGRNFTSAVVKELCKLLGIDKTQTTPLHPQSDGMVERFNRTILNNLSLVVSR-NQQDWDKKLPLFLLAYRSAVHETTGYSPSQMLFGRDL-RLPCDLLFGRPPDAPSSPEEYIQDLQARFE----------VMHNFARERVNLATEKMKTRYDTRATGHRFNEGDKVWLWNPTRRKGLSPKLQSPWDGPYTVLNRLNDVVVRIRKSSNSKPKVVHYDRLQARKK 1552          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A5S6QWV2 (Uncharacterized protein OS=Trichuris muris OX=70415 PE=4 SV=1)

HSP 1 Score: 530.406 bits (1365), Expect = 4.140e-163
Identity = 307/976 (31.45%), Positives = 536/976 (54.92%), Query Frame = 1
Query: 1630 IQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRE-SLRCANNSRLETMGKLNINVKMG-SMERNVTFIIVKNLI-PEIIGGVELQR------LFGIELKYILEEHEKRSDFICEIEARFGRIITDEER-LRHAIDVLKVTG--------NKRLLEIFQANKNVFMADKWDIGCTNLIKHKIIT-KGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCK--------------FIMENNTVLTKQG---KIW-------------IPSDNRQRMIKEVHVLLC--HAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKEIN-REDMEKRIEDKTLKPKIS 4401
            I GR  +  +DTGA +++M+  + + ++  ++L  +   L  A    L  +G   +N+ +G S   + T ++  +L  P +IG   L++           L+    E + ++D + E       + T++ R     ID +  +G         ++L ++F   K V      D+G T++++H+I T   +PI + PRR P      I++ +  +   G+I + +SPW+ P+V   KK+    R C+D+R+LN  TE+   P+P +D+ LDIL GSR+FS++DL + Y+QV+++ E + KTAFST  G + F  MPFG+  AP TFQ LM +VL+ L     +VYLDDI+IF +T  +      +VL ++  AGL++ P KC++  +EV FLGH+++  GI TD  K   +++++ P+C   LR F+G+ +YYR+F+K++A  A  L  +  K +++  W + CE+AF EMK  L+TAP+L FPDF K F+LD DAS D +GAVLSQ  + G EH IAY S A++  E+ YC TR+ELLA+ +  KHF  YLYG RF +RTDH  + ++   ++P   Q   W+  L+  D+++ +R G  H NAD LSR+TCG C + ++    A+    + +    TA       QN + +++ +K  +  +DC               F  + N ++   G   ++W             +PS  R+ +++E+H      H G ++  +  +       +A ++K     C+ C   K  + ++++  Q+I++  PF+++ +D  GPL +T   K+Y+  ++D+++K+    A    + +T+++ L++ +I +FG P+ +H D G++FEA  + E  +  GI+   ++ YH  +NG++ER  RT+ + + A L++   + W +++P   +  N +   TTG SP   +FGR+       +   ++ R D+ +R E +T+   +S
Sbjct:    4 ICGRGTELTIDTGAAVSLMSHELFNDIQPADLLPIQNVRLITAAGDELRVVGTSTVNISIGKSPPTSHTLLVAHSLTCPCLIGADFLRQHKCIIDFTSGTLQMNGYEVKLKTDRLSEHSIATASLATEDLRTYEEIIDSMCASGATENDYETREQLKKLFWKYKGVLPTTDDDLGRTSVVRHRIRTGNAKPIKMGPRRMPHRDRPIIQDLVDRMLQQGVIERASSPWSFPVVLA-KKKNGSFRFCVDYRKLNDRTEKDVQPLPRIDDALDILSGSRWFSTLDLASGYWQVEVEPEDRPKTAFSTPTGSYQFRVMPFGLCNAPATFQRLMEQVLEGLQWKTCLVYLDDIIIFGRTTTEQMERLEEVLNRLQAAGLKVKPSKCKLMTQEVVFLGHVVSGKGISTDPLKCATVENWEPPRCTNELRQFIGLASYYRKFVKNFASIAAPLHRLLQK-DKRWSWDDDCERAFQEMKRRLLTAPILKFPDFNKTFVLDVDASNDGLGAVLSQVHDDG-EHPIAYASRALTKPERRYCATRRELLALVWASKHFRPYLYGTRFKVRTDHNCLIWLNNFREP-EGQIARWLERLAEFDMEICHRPGKLHDNADALSRQTCGQCGKAVIPVAYAQAEPTQQQDQMNTA-------QNQDGDIKTLKEWIA-QDCWPSSCPEGVSRELRIFWGQRNALILDNGTLFRLWETPNSQRPLKLLVVPSHMRREILEELHDAPAGGHLGERRTLEKARCRFYWPGMAKDIKLWCRTCDACACRKGPSRRSRQPLQSIQTGYPFQRVGIDFLGPLPKTTTGKRYVLVVVDYFTKWTEAYATENMEAKTVAKVLVDNFITRFGPPESVHSDQGRSFEATLLAETFQLLGIKKTHTTSYHPQSNGLVERFNRTLLDTL-ALLSKRSPETWDEMLPWTTFAYNTSSHDTTGTSPFLALFGRE-------ARLPVDLRYDLPQREEPQTVTAYVS 959          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A4Y2G706 (Transposon Ty3-I Gag-Pol polyprotein OS=Araneus ventricosus OX=182803 GN=TY3B-I_1264 PE=4 SV=1)

HSP 1 Score: 531.561 bits (1368), Expect = 1.255e-161
Identity = 314/935 (33.58%), Positives = 499/935 (53.37%), Query Frame = 1
Query: 2026 TGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEP-IMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCG-TCVQCM-MEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENK-------DCKFIMENNTVLTKQGKIW---------------------------IPSDNRQRMIKEVHVLLC--HAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKI-----------DRMKWYSNKEINREDMEKRIEDKTLKPKISKTVRNFEMEDVVLIKQEIRNKDDA------------------------------RWEGPYKVIKKIHERSY-LLKDQNGKMVVRNVEKIKHFK 4587
            T  K LL+ FQ   N+F     D+G  N+ +H+I T   P I   PRR P+  +++ E  +K + +NGII + + PW +P+V V KK+    R C+D+R+LN+IT + ++P+P +D+ LD L+GS++FS++DL + Y+QV++  E +EKTAF+T +G + F  MPFG+  APGTF+ LM  VL+ L  +  +VYLDDI+I  +T ++H N   KV  ++  A L+L+P+KC+ FRKEV +LGHII+ DG++TD  K +A+  + +P+ V +LRSFLG+C YYRRF+++++  AR L  +  +      WTE CEK+F  +K+ALIT+PVL +P   KEFILDTDAS + IGAVLSQK     E VIAY S ++   E+ YC+TRKELLAI    +HF+HYLYG++F+LRTDH ++ +++  ++P   Q   WI  L   D ++++RKGTSH NAD LSR+ C  +C  C   E        I  ++LT       ++ Q   +E   IK  LE K         + I   +    +   +W                           +P    Q +++E H      H G  K     +     + L  +V+K    C  C   K   T+TK   Q      PFE++ +DI GPL  T    +Y+  ++D+++K+     I  Q+  T++E L+  WI ++G P  LH D G NF +    EL K  GI    ++  H  ++G++ER  RTI  +++  +++  + +W   +P       +   + TG +PA+++FGR +                  N+ +N  ++E R+E          +V  F  E + L  + ++ + D+                               WEGPY ++KK+++  Y + +  N K  V ++ ++  ++
Sbjct:  228 TAVKELLQEFQ---NLFSTSDSDVGRCNMTQHRINTGNHPPIKQYPRRLPLAKKEEAERLVKEMVDNGIIEESSGPWASPIVLV-KKKDGSTRFCVDYRKLNEITIKDSYPLPRIDDTLDALNGSQWFSTLDLKSGYWQVEIQPEDKEKTAFTTGQGLWQFKVMPFGLCNAPGTFERLMETVLRGLTSEACLVYLDDIIIVGRTFQEHINNIRKVFQRLQKANLKLSPKKCRFFRKEVSYLGHIISADGVKTDPEKTKAVVDWPRPETVHDLRSFLGLCTYYRRFVRNFSAIARPLHKLT-EARSNFNWTEECEKSFNNLKQALITSPVLTYPRTDKEFILDTDASNEGIGAVLSQKI-GNEECVIAYFSKSLGKPERNYCVTRKELLAIVKSIEHFHHYLYGRKFLLRTDHASLRWLLNFREP-EGQIARWIQRLQEYDFEIQHRKGTSHGNADALSRRPCKESCKHCTNAEKRFGMETDISMKVLTTEDAWSSSEVQKAQLEDPAIKPILERKLNSEDRPSWQEIAPESPATKRYWALWDSLHLKDGVLYRKWESDDGSSGRWQLILPKSRIQEVLRETHESASGGHFGVMKTLSKTRERFYWDRLRADVEKWCRECHACGARKGPKTRTKGRLQRYNVGAPFERMALDILGPLPVTAKGNRYVLVLMDYFTKWPEAIPIPDQEASTVAEELVRAWISRYGVPMILHSDQGTNFNSALFTELCKLLGILKTRTTALHPESDGMVERFNRTILNHLSLFVSK-NQTDWDTHLPLFLLAYRSADDEATGCTPADMLFGRTLRLPCDILFGRPSDTPSSPNEYLN--NLEARLE----------SVHAFARERIKLASERMKTRYDSGATGHHFKEGDQVWMYNPKRWRGLSPKLQQNWEGPYTIVKKLNDVIYRVQRSPNAKPKVIHINRLTPYR 1142          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000045469.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02000976.1:426503:430633:1 gene:ENSAMXG00000030479.1 transcript:ENSAMXT00000045469.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 439.884 bits (1130), Expect = 7.989e-130
Identity = 262/826 (31.72%), Positives = 433/826 (52.42%), Query Frame = 1
Query: 2050 IFQANKNVFMADKWDIGCTNLIKHKI-ITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGK---NNEKIR----------WTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEK---GYCITRKELLAI-YYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTC----------GTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNK---------------------WQNDNME-VQEIKNKLEN-----KDCKFIMENNTVLTKQ----------GKIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK----IDRM 4320
            + + ++N+F     D+GCT L++H I +    P+  + RR P +  D ++  I+ L  + ++R  +SP+ +P+V V KK+   IRLC+D+RQLN  T + AFP+P ++E LD L GS++FS++DL + Y QV + ++ +EKTAF T  G F FNRMPFG+  AP TFQ LM ++  D     +++YLDDI+IF+   + H      VL ++    L+L  EKC  F+  V +LGH+I++ G++TD  KI A+  +++P  VK +RSFLG  +YYRRF++ +A  A  L  + G    + ++ R          W E+CE +F  +K+ L+ APVL F DF + FI++ DAS   +GAVLSQ+ E G    IA+ S  +   E+    Y   + ELLA+ +   + F  YL G +F + TD+  ++++ + K  + A  Q W + L+  + +++YR GTS+ NAD LSR             G  V   +   D +T      +++ T +   ++                     W+       +E++N+ E      +  + I+E   VL +            ++ +P   +  ++  +H    H G ++ T  ++  C    +  ++KK    CERC   K +  K +    ++ +T P E + MD    L+   N ++ +  + D +SK+    A+  Q   T+   L+ KW   +G PK +H D G++FE   +K L +T GIE   ++PYH   NG  ER  RT+ + +     E  RK W  ++P + +  N T+ ++T  SP E++FG+K    IDRM
Sbjct:  324 LLRRHQNLFSEHAGDLGCTTLVQHSIPLLDSVPVRQRYRRLPPSQYDLVKAHIQELVEHQVVRPSSSPYASPIVVVQKKDG-SIRLCVDYRQLNAKTRKDAFPLPRIEESLDALSGSKWFSTLDLASGYNQVPVLEKDKEKTAFCTPFGLFEFNRMPFGLCNAPSTFQRLMERIFGDQSFHTLLLYLDDIVIFSSDLQQHLQRLDMVLSRLQQHNLKLKLEKCHFFQTRVSYLGHVISESGVETDPEKIRAVVDWKRPTTVKEVRSFLGFASYYRRFVEGFAALAAPLHKLVGALQGSKKQPRSKMHSIVIPHWDEVCETSFRALKDRLVRAPVLGFADFTRPFIVEIDASHTGLGAVLSQEQE-GKRRPIAFASRGLRPSERNMSNYSSMKLELLALKWAVTEKFREYLLGTQFTVYTDNNPLSYLQSAK--LAAVEQRWASQLALFNYELKYRPGTSNGNADALSRLPADTDQSSVGVRGISVPPEVPLADKETASRSQVVISSTVDAIPSRSSVDLQILQAADPLIAAFSVYWRRGQAPTARELRNEREGVKELVRQWQRIVERQGVLYRSVQLPPAKHTVFQLVLPKALQTEVLLALHDQHGHQGMERTTDLVRQRCYWPKMWQDIKKWCTECERCSIAKAMHPKVRTFMGSLLATRPLEILAMDFTV-LERASNGQENVLVLTDVFSKFTQAYAVADQKASTVVRVLVEKWFYVYGVPKRIHSDQGRSFEGELLKHLCETYGIEKSRTTPYHPEGNGQCERFNRTLHDLLRTLPPEKKRK-WPQLLPHLLFAYNTTIHQSTQYSPYELLFGQKPQLPIDRM 1143          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000041345.1 (pep primary_assembly:Astyanax_mexicanus-2.0:25:32334536:32339002:1 gene:ENSAMXG00000029230.1 transcript:ENSAMXT00000041345.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 434.491 bits (1116), Expect = 3.279e-127
Identity = 312/1058 (29.49%), Positives = 529/1058 (50.00%), Query Frame = 1
Query: 1624 LDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLR-----CANNSRLETMGKLNINVKMGSMERNVTFIIVKN--LIPEIIGGVELQRLFGIELK-----YILEEH-------EKRSDFICEIEARFGRIITDEERLRHAIDVLKVTGNKRLLE----IFQANKNVF-MADKWDIGCTN------LIKHKIITKGE-PIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLS----RKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNK----LENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVH--VLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK----IDRM-------KWYSNKEINR-EDMEKRIEDKTLKPKISKT--------VRNFEMEDVVLIKQEIRNKDD--------ARWEGPYKVIKKIHERSYLLK--DQNGKMVVRNVEKIKHF 4584
            L ++G++V  ++DTG+  +++ +   DR++  +  E   S R      AN       G + ++ ++G  +    F +++N   +  ++ G++     G+ L      YIL          E  S      + R    I  E+ +   +D  +     +L+E      QA +++  +  +W   CTN      ++ H I T    PI  KP +  I  +  I+EAI++++  GI+R   SPW +P+V V KKE   +R C+D+R++N  T   A+PMP V EIL+ LHG+  FS++DL + Y+QV L+ +S  KTAF T +G + F  +PFGI  A  TFQ LM  VL +L      VY++DI++++ T E H     +V   +  AGL LN  KC + ++ + FLGH+I+ +GI T+  K+EAIQ+F +P+ +K L+ FLG+  +Y RFI  ++++A  L ++  KN   I WT+ C+KAF ++K+ALITAPVL  P+F + F + TDAS   +GAVLSQ  + G EHV+AY S  +   E+ Y    KE LA+ +  + +  YL G+ F + TDH A++++    KP T++   W   L + D  ++YRKG  +   D LS    R T G    C +             I       G  + Q D    QE +      +   D  F    N  L    ++ +P  +R+  ++  H   L  H G  K    + N     ++  +V    ++CE CQK K   +K     Q+    EP   + +D+ GPL ++  + +++  I+D+ SK++ +  + +     I + L+     ++G P  L  D G  F +R +    +  G+    ++ YH  TN + ER  RT++  I AS  +   + W   +PE  + +N+  Q++TG +PAE+  GRK    ++R+          + K + R +D+  +I D   K +  +           NF   D+V ++    ++ D         +W+GP K+I+KI   +Y ++  D + ++   +VE +K++
Sbjct:  430 LSVRGQQVTAMIDTGSTFSLIQEGTWDRMKRTQ--EVWRSGRGQKFILANGQTQTAKGVVELDCELGKCQAKRPFYVMENKDHVFSLVLGLDFLHDTGLILDFQKDVYILPGGDTVPFGGEGESPPFSYADMRL--CIAQEDYV--PLDYEEEQEINKLVEGADITSQAKQDLHKLLSQWPSVCTNQLGRTMIVLHHITTNDNLPIRQKPYKVSIEKQQLIKEAIEDMQRRGIVRPSTSPWASPVVLVPKKEG-GVRFCVDYRRMNSKTHLDAYPMPQVQEILESLHGAAIFSTLDLKSGYWQVGLEPDSIPKTAFITCQGLYEFTVLPFGIKNAAATFQRLMDSVLVNLKGKSCFVYINDIVVYSSTIEQHLGHLEEVFRCLHQAGLTLNLRKCNLLQRSLIFLGHVISGEGICTEPGKVEAIQAFPEPRSIKELQRFLGMAGWYHRFITHFSERAAILNALKKKNAPWI-WTQECQKAFEDIKQALITAPVLTPPNFSEPFQIQTDASDQGLGAVLSQGTD-GLEHVVAYASRLLQGAERNYSTAEKECLAVVWAVEKWRVYLEGRHFTVITDHSALSWVFNHPKP-TSRLTRWAIRLQTFDFSVQYRKGKCNIVPDTLSRIPDRMTEGVMAPCQVTGSSDGLPVDWAEIARAQEVDGTLQPQRDETGNQETRKDRIHFVTKNDILFRAVPNQQLGHTLQVVVPVQHREAFLQYAHDNPLSGHLGQMKTLLRLLNIAYWPSIRRDVWTYCKSCEICQKYKPRISKLSGRLQSTPVVEPGYMLGVDLMGPLPKSPRQNEHLLVIVDYCSKWVEMFPLREAKTSQIVQILIKDIFTRWGTPAYLVSDRGAQFTSRLLHATCRQWGVVQKLTTAYHPQTN-LTERVNRTLKTMI-ASYVKDKHRLWDQWIPEFRFAINSAWQESTGFTPAEVALGRKLKGPLERLLSQPPNPDHDAYKVVQRQQDLFCQIRDNVDKAQAKQAKFYNRRRKQVNFVGGDLVWVETHPLSRADKGFAAKLAEKWKGPAKIIRKISPVNYEIEYLDDSSRVDTIHVENLKYY 1475          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000041682.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02002320.1:24977:29035:1 gene:ENSAMXG00000038531.1 transcript:ENSAMXT00000041682.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 431.409 bits (1108), Expect = 5.899e-127
Identity = 261/810 (32.22%), Positives = 427/810 (52.72%), Query Frame = 1
Query: 2047 EIFQANKNVFMADKWDIGCTNLIKHKI-ITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESIC-----GKNNEKIR-----WTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEK---GYCITRKELLAI-YYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDA-----KTGKIKTRILTVTAE------------------GGY------NKWQNDNMEVQEIKNKLE-----NK--DCKFIMENNTVLTKQG-----KIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK 4308
            ++ Q   +VF  D+ ++GCT+LI+H+I +    P+  + RR P +  D ++  I+ L +  +IR   SP+++P+V V K++   IRLC+D+RQLN  T + A+P+P ++E LD L G+R+FS++DL + Y QV + +  + KTAF T  G F FNRMPFG+  AP TFQ LM ++  D     +++YLDDI+IF+ T + H      VL ++    L+L   KC  F+ +VK+LGH+I+  G+ TD  KI+A+  +++P+ V  LRSFLG  +YYRRF++ +AK A  L  +      GK   K +     W++ CE+AF  +K  L++APVL + DF K FIL+ DAS   +GAVLSQ D++G    IAY S  +   E+    Y   + ELL + +   + F  YL G +F + TD+  ++++ T K  + A  Q W++ L+  +  ++YR G S+ NAD LSR       Q   E         + G  K  I T+ A                   G +       K+       QE K  LE     NK  +C  ++   T  T  G     ++ +P   ++ ++  +H    H GA++    ++  C   ++  ++++  + C RC   K    K +     + +++P E I +D    +    + ++ +  I D +SK+        Q   T++  L+ KW   +G P+ +H D G+NFE+  +K L K   ++   ++PYH   NG  ER  RT+ + +    +E  R+ W   +P++ +  N TV +TT  SP E++FGR+
Sbjct:  291 DLLQRYSSVFSQDEGELGCTHLIEHEIPLIDDTPVKQRYRRLPPSQYDLVKGHIQELLDRKVIRASCSPYSSPVVVVQKRDGT-IRLCVDYRQLNSKTRKDAYPLPRIEESLDALGGARFFSTLDLASGYNQVPMAEHDKSKTAFCTPFGLFEFNRMPFGLCNAPSTFQRLMERIFGDERFQSLLLYLDDIVIFSSTFDLHLQRLEVVLKRLQQNNLKLKLSKCHFFQSQVKYLGHVISSAGVATDPEKIKAVSEWERPQTVTQLRSFLGFASYYRRFVEGFAKHASPLHRLVAVLQGGKRRVKTKPVEGHWSDACEEAFETLKLKLVSAPVLGYADFSKPFILEIDASHGGLGAVLSQ-DQEGGRRPIAYASRGLRDSERNMSNYSSMKLELLGLKWAVTEKFREYLLGAQFTVYTDNNPLSYLQTAK--LGAVEQRWVSQLALFNFNIKYRPGLSNRNADALSRLPACPTPQSFQETVSGISIPLQVGATKATISTIDAVPLRPKADLQRLQSADPVIGPFLQYWHRQKFPTAGERAQESKEVLELVRQWNKLRECDGVLYRLT-RTPDGVEEFLQLVLPECLQKEVLTALHDNHGHQGAERTASLVRQRCFWPHMWKKIERWCKECSRCVVAKMGQPKIRTFMGNLSASKPLEIIAIDFTL-MDRASDGRENVLVITDVFSKFTQAFPTQDQRASTVAHILVEKWFYVYGVPQRIHSDQGRNFESDLLKSLCKIYDVKKSRTTPYHPQGNGQCERFNRTMHDLLRTLPSEQKRR-WPKYLPQLLFAYNTTVHQTTAHSPYELMFGRQ 1093          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000054230.1 (pep primary_assembly:Astyanax_mexicanus-2.0:14:19483082:19487098:1 gene:ENSAMXG00000038338.1 transcript:ENSAMXT00000054230.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 427.172 bits (1097), Expect = 1.683e-125
Identity = 246/828 (29.71%), Positives = 435/828 (52.54%), Query Frame = 1
Query: 2005 AIDVLKVTGNKR--LLEIFQANKNVFMADKWDIGCTNLIKHKI-ITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALE----SICGKNNEK-------IRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEK---GYCITRKELLAI-YYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRK----------TCGTCVQCMMEHEDAKTGKIKT--RILTVTAEGGYNKWQNDNMEVQEI--------------KNKLENKDCKFIMENNTVLTKQGKIW----------------IPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK 4308
            A+D+  +T  ++  +  + Q   +VF A   D+GCTNLI+H+I +    P+  + RR P +  + ++  I  L    ++R+ +SP+ +P+V V KK+   +R+C+D+R LN  T + AFP+P ++E LD L G+R+FS++DL + Y QV + ++ + KTAF T  G F +NRMPFG+  AP TFQ LM ++  D     +++YLDDI++F+ +   H      VLG++   GL+    KC  F++EV +LGH+I+  G+ TD  KIEA+  +Q+P+ V  LRSFLG  +YYRRF++ +A+ A  L     ++ G   +K       + WT  CE++F  +K  L++APVL + DF   FIL+ DAS+  +GAVLSQ+ + G    +AY S  +   E+    Y   + E LA+ +   + F  YL G + V+ TD+  ++++ + K  + A    W   L++ D +++YR G S+ NAD LSR+            GT V  +++         +    +L  ++    +  Q+ +  ++E+              K ++       + + + ++ + G ++                +P+  +  ++K++H    H G ++ ++ ++  C    +  ++K+    CERCQ  K            + ++ P E + +D    L+ + N  + +  + D +SKY        Q   T+++ LL +W  KFG P  +H D G++FE+  I++     G+E   ++PYH   NG  ER  RT+   +  +L    +++WA  +P++ +  N T  + TG SP  ++FG++
Sbjct:  250 AVDLSALTDQEQGVVRSLLQKYNSVFSAHDGDLGCTNLIEHEIPLLDNIPVRQRHRRIPPSDYELVKAHIDQLLEAQVVRESSSPYASPIVLV-KKKDGSLRMCVDYRLLNSKTRKDAFPLPRIEESLDALSGARWFSTLDLASGYNQVPVAEQDKPKTAFCTPFGLFEWNRMPFGLCNAPSTFQRLMQRMFGDQQYQSLLLYLDDIIVFSSSVSQHLQRLEVVLGRLQKEGLKAKLSKCVFFKQEVGYLGHVISNKGVSTDPAKIEAVAQWQRPRTVSELRSFLGFASYYRRFVEGFAQLAGPLHKLVAALVGSKTKKGSGQALGMAWTAQCEQSFEALKARLVSAPVLAYADFSLPFILEIDASYSGLGAVLSQEHD-GAVRPVAYASRGLRPPERNMDNYSSMKLEFLALKWAMTEKFREYLLGHKCVVYTDNNPLSYLQSAK--LGAMEHRWAAQLAAFDFEIKYRSGRSNKNADALSRQYQLGPSVTEYVPGTPVPELVQQASPVVSATQALVSVLPGSSPVDIHSLQDADPLLREVLVFWRKQVQPTSAEKRQVSRPAMALLRQWDRLVERDGVLYRRVFRPDGGEESFQLLLPAVLKPEVLKQLHQDHGHQGIERTSELVRQRCYWPGMFADIKQWCRECERCQVAKDAGPVPHSFMGHLLASRPNEIVAIDFTV-LEPSRNGLENVLVMTDVFSKYTIAVPTRDQRASTVAQVLLREWFYKFGVPGRIHSDQGRSFESALIQQFCHLYGVERSRTTPYHPAGNGQCERFNRTLHNLLR-TLPTSRKQDWASNLPQVIFCYNTTPHQGTGESPYYLMFGQE 1071          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000034038.1 (pep primary_assembly:Astyanax_mexicanus-2.0:14:3120146:3124090:1 gene:ENSAMXG00000040885.1 transcript:ENSAMXT00000034038.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 424.476 bits (1090), Expect = 8.175e-125
Identity = 291/951 (30.60%), Positives = 456/951 (47.95%), Query Frame = 1
Query: 2062 NKNVFMADKWDIGCTNLIKHKI-ITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIR--------------WTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAI-YYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSR----------KTCGTCVQCMMEHEDAKTGKIKTRILTVT-AEGGYNKWQN------------DNMEVQEIK---------------------------------NKLENKDCKFIMENNTVLTK------QGKIW---IPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGR--------------KIDRMKWYSNKEINREDMEKRIEDKTLKPKISK-----------TVRNFEMEDVVLIK---QEIRNKDDARWEG-PYKVIKKIHE-RSYLLKDQNGKMVVRNVEKIKH 4581
            N +VF     D G T ++KH+I +    P  +  R+ P +    + +AI  +E  G+IR   SP+ +P+V V KK+   +R+C+D+R+LN  + R AFP+P V+E L+ L  ++YFS++DL + Y+QV++ +  + KTAFST  G F  NRMPFG+  +P TFQ LMT    DL  + +++YLDDI+IF++T ++H      V  ++   GL+L P+KC + R+EV +LGH+++ DGI+TD  KI  +  ++ P+    L  FLG   YYRRFI+ YA  A  L  +   +  K +              WTE CE AF  +K  L TAPVL + D+ + F+L TDAS   +GAVL+Q  + G E VIAY S  ++  E  Y   + E LA+ +     F  +LYG +F + TD+  + +++TT K + A  Q W+  LS  D  + YR+G  + NAD LSR           TC   V    + +  + G    R +  T +E G NK  +             +M  QEI+                                  KL  K+ K ++  + +L +      +G  +   +P   R  +   +H    H G ++    ++       +  +VK   E CERC   KT T   +    +I +  P E I +D    ++++    + I  + DH+S++        Q   T+++ L   +  KFG P +LH D G+NFE+  +KEL +  G +   +SPYH   NG  ER  RT+   +  +L    +  W + +  + +  N T   +TG +P  ++FGR                D  +     +   + ++   E   L  + SK            VR F   D VLIK    E R K   RWE  PY VIKK      Y+++ ++G      VE++ H
Sbjct:  210 NSDVFSKHHLDYGHTTVVKHEIPLVDPRPFRLPYRKIPPSQYQAVRKAITEMEEAGVIRPSKSPYASPIVVVTKKDG-SLRICMDYRKLNSCSTRDAFPLPRVEEALEALGEAKYFSTLDLTSGYWQVEVAEADKHKTAFSTPMGLFEANRMPFGLQNSPSTFQRLMTCCFGDLNFESLLIYLDDIVIFSRTFDEHLERLQMVFDRLRKYGLKLKPQKCHLLRREVLYLGHVVSSDGIKTDPDKISKVAGWKVPENRHELLQFLGFAGYYRRFIEGYASIAAPLYRLTSGDPRKKKRGRKGPPAVGLPFIWTEECENAFQTLKTKLTTAPVLGYADYSQPFVLQTDASLAGLGAVLAQVQD-GRERVIAYASRGLNPAETRYPAHKLEFLALKWAVTDKFYDHLYGHKFSVLTDNNPLKYVMTTAK-LDATGQRWVAQLSMFDFDIRYRQGGCNANADGLSRMPASEVAEALHTCPQLVPTSSQKQSQEKGDSAERAVDGTSSESGSNKEPSADQFLSAGSDALPSMSKQEIRAAQRTDPVIGPVLYYKCQYAKPKRSARTQSNEQEKLLRKEWKKLVVKDDILFRHVRDRQRGSFYQLVLPEKFRGYVKSSLHDESGHFGVERTFALVRERFYWPRMFNDVKTWCEQCERCCLRKTPTAGLRAPLVSIHTNAPLELICIDFLT-VEKSKGGFENILVVTDHFSRWAQAYPTKDQKAETVAKVLWKNFFCKFGFPAKLHADQGRNFESTVVKELCRLTGTQKSHTSPYHPQGNGTTERFNRTLMNLM-GTLPPQSKARWHEYIDALTHAYNCTRHDSTGYTPYYLMFGRHPRLPIDLVFGLAPATDYCEHSDYAKTLHDSLKYACEQANLTSRHSKDTQKKHYDVKAKVRQFTPGDRVLIKVCHTETRQKLGDRWEPKPYLVIKKQPGIPVYVVRSEDG-----TVERVIH 1150          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000009777.1 (pep scaffold:Pmarinus_7.0:GL476990:135790:139231:-1 gene:ENSPMAG00000008837.1 transcript:ENSPMAT00000009777.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 53.9138 bits (128), Expect = 1.265e-7
Identity = 43/231 (18.61%), Positives = 103/231 (44.59%), Query Frame = 1
Query: 3628 DNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAK----TAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK 4308
            ++++ +++ VH    H G  +  K ++ +   + +  +++  I  CE C++     +          S  P+E + MD+ GP   T    +++  I+D+++K+  LT +  Q    +   L      +FG PK+L  +  + + A+  +E+ +     +G+ +      H   NG      + +R+ ++ +   G  ++W   + +  +  + +    T  SP  ++ GR+
Sbjct:   68 EDKRSILQRVHGA-DHCGQTRTRKLLEEHYYWKGMVNDIRDYINACEICKQKSYKRSSISHVKLLKASY-PWEVLGMDLLGPFPATSRAHRFVLLIVDYFTKWAELTPMTDQSAAHVVAALTTA-FHRFGFPKKLFCNVSEEYVAQINEEMFRHFPMCSGLAIS-----HLWANGAHRGTSQALRDCVSKA--TGRHRDWELQLEQRLFEYHTSKHSATRYSPFYLMLGRE 288          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000010393.1 (pep scaffold:Pmarinus_7.0:GL485791:8073:10868:-1 gene:ENSPMAG00000009411.1 transcript:ENSPMAT00000010393.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 49.6766 bits (117), Expect = 2.248e-7
Identity = 32/100 (32.00%), Positives = 46/100 (46.00%), Query Frame = 2
Query:  710 GGRTYRDVV----QVGATKREINEWKPETRVKVIECWTCQKPGHSSRECNIKRR-----FQCYACGVEGHIRRECPTI-------KCHRCNARGHKEREC 961
            GG+++ D      + G   RE    +         C+ C K GH +REC   R+       CY CG +GH+ REC +        KC+ C  RGH +R+C
Sbjct:    2 GGKSFTDGCYRCGEGGHIARECPLPQDSVSSNTAACYNCGKGGHIARECPEGRQDRGGGPSCYTCGKQGHLARECSSGGGGPGDNKCYGCGQRGHMQRDC 101          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000004121.1 (pep scaffold:Pmarinus_7.0:GL477387:93825:107330:1 gene:ENSPMAG00000003764.1 transcript:ENSPMAT00000004121.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 49.6766 bits (117), Expect = 2.643e-6
Identity = 49/241 (20.33%), Positives = 102/241 (42.32%), Query Frame = 1
Query: 3628 DNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYI-SLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFS--SPYHHNTNGIIERQFRTIREYINASLNEGGR-----------KNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK 4308
            D ++ ++  VH    H G +K    ++ +     + ++VK +I +C  C+   +             S  P+E + +D+ GPL  T    +Y+  +ID++SK+  ++  I K  E   S   L     ++G P+++    GK F    + ++ K++ +   +   SP     +  +     ++R  +N + N+  +            +W   V +  +        TT  SP  ++FGR+
Sbjct:   76 DEKRNILMSVHGA-GHFGQKKTILKLEADYYWLGMISDVKNLIASCGVCRNKGSRRVAMPSMKLLKASG-PWEVLGLDVLGPLPVTSRANRYLLLLIDYFSKWAEAVPLIEKSQEHVASA--LTVVFCRYGFPRKVFSSLGKEF----VTQVNKSSTLSRAYQRCSPSQTQVHAHV-----SVRVALNKATNQALKGCVNLVASQHPSDWESRVEQSLFEYRVGKHSTTQYSPFYLMFGRE 303          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000004123.1 (pep scaffold:Pmarinus_7.0:GL477387:93825:107321:1 gene:ENSPMAG00000003764.1 transcript:ENSPMAT00000004123.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 48.1358 bits (113), Expect = 8.148e-6
Identity = 50/241 (20.75%), Positives = 101/241 (41.91%), Query Frame = 1
Query: 3628 DNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYI-SLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFE-------------ARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK 4308
            D ++ ++  VH    H G +K    ++ +     + ++VK +I +C  C+   +             S  P+E + +D+ GPL  T    +Y+  +ID++SK+  ++  I K  E   S   L     ++G P+++    GK F              + +++ L +T G      + +H  TN       + ++  +N   ++    +W   V +  +        TT  SP  ++FGR+
Sbjct:   76 DEKRNILMSVHGA-GHFGQKKTILKLEADYYWLGMISDVKNLIASCGVCRNKGSRRVAMPSMKLLKASG-PWEVLGLDVLGPLPVTSRANRYLLLLIDYFSKWAEAVPLIEKSQEHVASA--LTVVFCRYGFPRKVFSSLGKEFVTQVNKIQHKKWCISHTLQNLCETRG-----PAGWHKATN-------QALKGCVNLVASQHP-SDWESRVEQSLFEYRVGKHSTTQYSPFYLMFGRE 299          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Yeast
Match: GIS2 (Translational activator for mRNAs with internal ribosome entry sites; associates with polysomes and binds to a specific subset of mRNAs; localizes to RNA processing bodies (P bodies) and to stress granules; may have a role in translation regulation under stress conditions; ortholog of human ZNF9/CNBP, a gene involved in myotonic dystrophy type 2 [Source:SGD;Acc:S000005199])

HSP 1 Score: 54.6842 bits (130), Expect = 5.569e-9
Identity = 26/56 (46.43%), Positives = 31/56 (55.36%), Query Frame = 2
Query:  803 CWTCQKPGHSSRECNIKRRF---QCYACGVEGHIRRECPTIKCHRCNARGHKEREC 961
            C+ C KPGH   +C + R     QCY CG  GH+R EC   +C  CN  GH  REC
Sbjct:   25 CYNCNKPGHVQTDCTMPRTVEFKQCYNCGETGHVRSECTVQRCFNCNQTGHISREC 80          

HSP 2 Score: 50.0618 bits (118), Expect = 2.821e-7
Identity = 26/79 (32.91%), Positives = 40/79 (50.63%), Query Frame = 2
Query:  737 QVGATKREINEWKPETRVKVIECWTCQKPGHSSREC---NIKRRFQCYACGVEGHIRRECPTIK-CHRCNARGHKEREC 961
            Q G   RE  E K  +R   + C+ C  P H +++C   +     +CY CG  GH+ R+C   + C+ CN  GH  ++C
Sbjct:   72 QTGHISRECPEPKKTSRFSKVSCYKCGGPNHMAKDCMKEDGISGLKCYTCGQAGHMSRDCQNDRLCYNCNETGHISKDC 150          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Match: EDO33875 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7SR01])

HSP 1 Score: 53.5286 bits (127), Expect = 1.012e-8
Identity = 25/86 (29.07%), Positives = 52/86 (60.47%), Query Frame = 1
Query: 2608 DDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNY 2865
            DD++ F  + E+H     ++L  +  +G + + +K Q   + VKFLGH+I+++G++    K++ I+ ++ P   + LR F+G+C +
Sbjct:    1 DDVICFHSSFEEHLRGIERMLQAVRASGFK-SIKKSQFATRSVKFLGHVIDQNGVRPQPEKLD-IRQWETPTNEEELRKFIGVCTF 84          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Match: EDO33090 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7ST86])

HSP 1 Score: 53.9138 bits (128), Expect = 3.002e-7
Identity = 36/141 (25.53%), Positives = 65/141 (46.10%), Query Frame = 1
Query: 2437 QVKLDKESQEKTAFSTK-EGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYL-DDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLG 2853
            ++ L ++S +    +T   G   + R   G+  +    +ELM +VL DL  +GV+V L DD+     T  + ++ + + L  +    LRL+  K  I  K    LG I +   ++    ++  +     P  V  +RSF+G
Sbjct:  186 KIPLSRDSLKYCGVATPFRGVRVYTRCAMGMPGSETALEELMCRVLGDLLAEGVVVKLADDLYCGGNTPHELFSNWKRTLQALHECNLRLSASKTIINPKTTTILGWIWSAGTLKASPHRVATLAQCPTPTTVGLMRSFIG 326          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Match: EDO25785 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A8DVH4])

HSP 1 Score: 45.0542 bits (105), Expect = 5.458e-6
Identity = 24/65 (36.92%), Positives = 35/65 (53.85%), Query Frame = 2
Query:  770 WKPETRVKVIECWTCQKPGHSSREC-NIKRRFQCYACGVEGHIRRECPTIKCHRCNARGHKEREC 961
            +K E R   I C  C + GH + +C + K+  +C  CG +GH +R CP   C  C+  GH+ R C
Sbjct:    4 YKEEKRSMYIRCHNCNERGHMAVDCPDPKKVIKCCLCGGQGHYKRSCPNELCFNCDQPGHQSRVC 68          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Match: EDO43256 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7RZD2])

HSP 1 Score: 47.7506 bits (112), Expect = 6.389e-6
Identity = 25/75 (33.33%), Positives = 42/75 (56.00%), Query Frame = 1
Query: 2419 LGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEED 2643
            + + Y+  KL   +     F+T  G+F F RMPFGI+ A    Q+++ +   D+  +GV+   DDI++  KTE +
Sbjct:    1 MSSCYWHKKLTDAASLLCTFNTPFGRFRFRRMPFGISCASEVAQKMVEEHFGDI--EGVLPVYDDIIVSGKTEAE 73          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000036827.1 (pep primary_assembly:ASM223467v1:16:28613823:28617539:1 gene:ENSORLG00000023550.1 transcript:ENSORLT00000036827.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 461.455 bits (1186), Expect = 1.408e-138
Identity = 281/951 (29.55%), Positives = 491/951 (51.63%), Query Frame = 1
Query: 1972 RIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIIT-KGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTC--GTCVQC---------MMEHEDAKTGKIKTRILTVTAEGGYNKW---QNDNMEVQEIKNKLE-------------NKDCKFIMEN-NTVLTKQG---KIW-------------IPSDNRQRMIKEVHVLL--CHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGV------------SPAEIIFGRKIDRMKWYSNKEINREDMEK----------RIEDKTLKPKISKTVR----NFEMEDVVLIKQEIRNKD-----DARWEGPYKVIKKIHERSYLLK-DQNGKMVVRNVEKIKHFK 4587
            R+   +E  R + D L+      L ++    +++F   + ++G T+L+ H+I T    PI  +PRR P+  +   + AI+ +   GII   +SPW + +V V KK+   +R C+D+R LN +T++ ++P+P +DE LD++ GS +FSS+DL + Y+QV L   ++ KTAF T  G + F  + FG+  AP TF+ LM KVL  + +   +VYLDDIL+   + +       KVL +IA AGL+L+P+KC   R+E++FLGH I  +GI T   K++A++ +  P  +++L+SFLG+ +YYRRF++ ++  A  L  +  K+ + + WT+ CE+AF  +K+AL  +P+L  PD +  FILDTDAS   +GAVLSQ    G E V+AY S  +S  E+ YC+TR+ELLA+     HF +YL G  F +RTDH A+ +++T K+P   Q   W+  L+S    +E+R G+ H NAD +SR+ C    C  C         +   E + TG+    +  +       +W   Q  ++++Q +   +E             +   K + E  + +  K G   + W             +P   R  +++  H      H G  K  + I+       L  +V+     C+ C   K    +++ E Q + +  P E++ +DI GP   T    +++   +D+++K+    AI  Q+  T+++ L+     +FGA + +H D G+NFE+     + +  G+    ++P H  ++G++ER  RT+ + + A L    + +W + +P +     + VQ +T              +PAE+ FG+  D +      E  R+  ++          ++E   ++ K +  +R    +F+  D+V +    R K      D +W GP +V++K+ E  Y ++    G+ V  + +++  ++
Sbjct:  214 RVTAVKEIWRRSCDGLQPGQKDELWKVLLEYRDIFALSEDEVGLTHLVHHEIDTGDARPIKTRPRRLPLAHQVAADSAIEEMLRGGIIEPSDSPWASGVVMVKKKKGPKMRFCVDYRPLNGVTKKDSYPLPRIDESLDLVSGSSWFSSLDLRSGYWQVPLSPAARPKTAFCTGRGLWQFRVLSFGLCNAPATFERLMEKVLASIPRQECLVYLDDILVHGGSFKAALESLRKVLQRIAAAGLKLHPDKCCFMRRELEFLGHKIGGEGISTLEEKVQAVRDWPTPTTLRDLKSFLGLASYYRRFVRGFSCIAAPLFHLQRKDCDFV-WTQECEQAFSSLKKALTNSPILTPPDPKLPFILDTDASDVGMGAVLSQMGSAG-ERVVAYFSKTLSKAERRYCVTRRELLAVVKAIGHFRYYLCGLPFTVRTDHSALQWLMTFKEP-EGQIARWLEELASFSFTVEHRPGSRHANADAMSRRPCALAGCQYCEKREAREAVISREEQSHTGESSWPVCRLVQGVDSTEWRAHQEQDVDLQPVLQWVEAGRKPEWGEVAGCSPGSKGLFEKFDALRLKDGVLQRAWKEPATGEERWQVVVPRTLRNSVLQGCHGAAGSGHFGVSKTLRRIRQGFYWGQLRRDVEDFCRRCDICTAHKGPPDRSRAELQQLAAGAPMERVAVDIMGPFPRTNRGNRFVLVAMDYFTKWPEAYAIPDQEAVTVADALVEGMFSRFGAAEVIHSDQGRNFESAVFSAMCERLGMRKTRTTPLHPQSDGLVERFNRTLVKQL-AILTSAHQSDWDEHLPLVLMAYRSAVQDSTLCTPALLMLGRELRTPAEMSFGKPPDALGAPPGPEYARKLQDRMDTAHAFARNQLEKAGIRQKRNYDLRAKGKDFKAGDLVWVYNPKRKKGRCPKLDCQWVGPCEVLEKLGEVVYRVELPPGGRRVTLHRDRLAPYR 1160          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000039099.1 (pep primary_assembly:ASM223467v1:16:31928608:31932747:-1 gene:ENSORLG00000023909.1 transcript:ENSORLT00000039099.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 434.876 bits (1117), Expect = 5.101e-128
Identity = 254/809 (31.40%), Positives = 422/809 (52.16%), Query Frame = 1
Query: 2068 NVFMADKWDIGCTNLIKHKI-ITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGK-NNEKIR----------WTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEK---GYCITRKELLAI-YYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRK-----------TCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNK-------------------WQNDNMEVQEIKNKLENKDCKFIMENNTVLTKQG----------------KIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK 4308
             VF   + D+GCT+LI H+I +    P+  + RR P +     +E I NL  + +IR+ +SP+ +P+V + KK+   +RLC+D+RQLN  T + AFP+P ++E LD L G+R+FS++DL + Y+QV + +  + KTAF T  G F +NRMPFG+  AP TFQ LM ++  +     +++YLDDI++F+ T E H      VL ++   GL++   KC  F+ +V +LGH+I+  G+ TD  K+EA+ +++ P  V  LRSF+G  +YYRRF++ +AK A  L  +  +    K+R          WTE C+++F  +K  L T PVL + DF + FIL+ DAS   +GAVLSQ+ E G    IAY S  +   E+    Y   R E LA+ +   + F  YL G++ ++ TD+  ++++ T K  + A  Q W   L++ D+++ YR G S+ NAD LSR+             G+C+  M   +  +T  + T   T+ A   ++                    W+       E + +L +     + + N ++ + G                +I +P   R+ ++  VH    H G  +    ++  C    ++ EV +    CERCQ  K      +     + +++P E + MD    L+ T +  + +  I D +SKY    A   Q   T+++ L+ +W  KFG P  +H D G++FE+  I++L     +E   ++PYH   NG  ER  RT+ + +  +L    +++W   +P++ Y+ N T   +TG SP  ++FG++
Sbjct:  321 TVFSLHEGDLGCTSLITHEIPLVDDAPVRQRYRRIPPSDYVAAKEHINNLLQSQVIRESSSPFASPIVLIRKKDG-GLRLCVDYRQLNSRTRKDAFPLPRIEESLDALSGARWFSTLDLASGYHQVAVAEADRPKTAFCTPFGLFEWNRMPFGLCNAPSTFQRLMQRMFGEQQGQSLLLYLDDIIVFSSTIEQHLERLELVLERLQLEGLKVKLAKCAFFQHQVHYLGHVISDQGVSTDPGKVEAVANWEPPTTVFQLRSFIGFASYYRRFVEGFAKLAAPLHRLVAELEGNKVRKKSARGLTNHWTEECQRSFEALKAKLTTTPVLAYADFSRPFILEVDASNGGLGAVLSQEQE-GKVRPIAYASRGLRPTERNPVNYSSMRLEFLALKWAVAEKFREYLLGQKCIVYTDNNPLSYLSTAK--LGAMEQRWAAQLAAFDLEIRYRSGRSNRNADALSRQHFPDMQAWRDVLPGSCLP-MSLQQVQQTETVGTTQATMVALPHHSPSDMASLQGADPVLKEFLPFWERQTRPSPEERRQLSSPTLALLRQWNRLVEQGGVLYRRVFRDDGGEAVLQILLPGSIREEVLTAVHQQHGHQGVDRTLDLLRQRCYWPGMSAEVAEWCSQCERCQVAKVTRPAARAPMGHLLASKPNEILAMDFSV-LEPTTSGIENVLVITDIFSKYTMAVATRDQRAATVAQVLVTEWFSKFGVPARIHSDQGRSFESALIQQLCDLYAVEKSRTTPYHPEGNGQCERFNRTLHDLLR-TLPVSRKRDWNVCLPQLLYSYNTTPHHSTGESPFFLMFGQE 1122          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000029819.1 (pep primary_assembly:ASM223467v1:24:6358775:6362914:-1 gene:ENSORLG00000022361.1 transcript:ENSORLT00000029819.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 434.876 bits (1117), Expect = 5.792e-128
Identity = 254/809 (31.40%), Positives = 422/809 (52.16%), Query Frame = 1
Query: 2068 NVFMADKWDIGCTNLIKHKI-ITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGK-NNEKIR----------WTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEK---GYCITRKELLAI-YYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRK-----------TCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNK-------------------WQNDNMEVQEIKNKLENKDCKFIMENNTVLTKQG----------------KIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK 4308
             VF   + D+GCT+LI H+I +    P+  + RR P +     +E I NL  + +IR+ +SP+ +P+V + KK+   +RLC+D+RQLN  T + AFP+P ++E LD L G+R+FS++DL + Y+QV + +  + KTAF T  G F +NRMPFG+  AP TFQ LM ++  +     +++YLDDI++F+ T E H      VL ++   GL++   KC  F+ +V +LGH+I+  G+ TD  K+EA+ +++ P  V  LRSF+G  +YYRRF++ +AK A  L  +  +    K+R          WTE C+++F  +K  L T PVL + DF + FIL+ DAS   +GAVLSQ+ E G    IAY S  +   E+    Y   R E LA+ +   + F  YL G++ ++ TD+  ++++ T K  + A  Q W   L++ D+++ YR G S+ NAD LSR+             G+C+  M   +  +T  + T   T+ A   ++                    W+       E + +L +     + + N ++ + G                +I +P   R+ ++  VH    H G  +    ++  C    ++ EV +    CERCQ  K      +     + +++P E + MD    L+ T +  + +  I D +SKY    A   Q   T+++ L+ +W  KFG P  +H D G++FE+  I++L     +E   ++PYH   NG  ER  RT+ + +  +L    +++W   +P++ Y+ N T   +TG SP  ++FG++
Sbjct:  321 TVFSLHEGDLGCTSLITHEIPLVDDAPVRQRYRRIPPSDYVAAKEHINNLLQSQVIRESSSPFASPIVLIRKKDG-GLRLCVDYRQLNSRTRKDAFPLPRIEESLDALSGARWFSTLDLASGYHQVAVAEADRPKTAFCTPFGLFEWNRMPFGLCNAPSTFQRLMQRMFGEQQGQSLLLYLDDIIVFSSTIEQHLERLELVLERLQLEGLKVKLAKCAFFQHQVHYLGHVISDQGVSTDPGKVEAVANWEPPTTVFQLRSFIGFASYYRRFVEGFAKLAAPLHRLVAELEGNKVRKKSARGLTNHWTEECQRSFEALKAKLTTTPVLAYADFSRPFILEVDASNGGLGAVLSQEQE-GKVRPIAYASRGLRPTERNPVNYSSMRLEFLALKWAVAEKFREYLLGQKCIVYTDNNPLSYLSTAK--LGAMEQRWAAQLAAFDLEIRYRSGRSNRNADALSRQHFPDMQAWRDVLPGSCLP-MSLQQVQQTETVGTTQATMVALPHHSPSDMASLQGADPVLKEFLPFWERQTRPSPEERRQLSSPTLALLRQWNRLVEQGGVLYRRVFRDDGGEAVLQILLPGSIREEVLTAVHQQHGHQGVDRTLDLLRQRCYWPGMSAEVAEWCSQCERCQVAKVTRPAARAPMGHLLASKPNEILAMDFSV-LEPTTSGIENVLVITDIFSKYTMAVATRDQRAATVAQVLVTEWFSKFGVPARIHSDQGRSFESALIQQLCDLYAVEKSRTTPYHPEGNGQCERFNRTLHDLLR-TLPVSRKRDWNVCLPQLLYSYNTTPHHSTGESPFFLMFGQE 1122          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000040501.1 (pep primary_assembly:ASM223467v1:1:1251188:1255180:-1 gene:ENSORLG00000023024.1 transcript:ENSORLT00000040501.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 426.017 bits (1094), Expect = 3.085e-125
Identity = 274/906 (30.24%), Positives = 468/906 (51.66%), Query Frame = 1
Query: 2035 KRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKP-RRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESIC---------GKNNEKIR---WTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEK---GYCITRKELLAI-YYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAE----------------------GGYNK-WQNDNMEVQEIKNKLEN------KDCKFIMENNTVLTK----------QGKIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK----IDRM------------KWY----SNKEINREDMEKRIED----KTLKPKISKTVRNFEMEDVVLIKQE---IRNKDDARWEG-PYKVI 4500
            +R+  +      VF  D  D+GCTNL+ H+I    E  + +P RR P +  +     I+ L  + +IR+ +SP+ +P+V V KK+   +R+C+D+RQLN  T + AFP P +DE LD L G+++F+++DL + Y QV++ ++ + KTAF T  G F FNRMPFG+  AP TFQ LM ++  D     V++YLDD++IF+ + E H     +V  ++   GL++   KC  F+K+VK+LGH+++ +G+ TD  K   ++ +++P  + +LRSFLG  +YYRRFI  +AK A  L S+          GK  +K     W   CE++F E+K ALITAPVL + DF+K F+L+ DAS   +GAVLSQ+ E G    +A+ S ++   E+    Y   + EL+A+ +   + F  YL G    + TD+  ++ + T K  + A  Q W + L++ D+ ++YR G+ + NAD LSR+        ++E   A+  + + R+L V AE                      G + K W+   M     ++ L        +  K + E+ +VL +            ++ +P   ++ ++  +H    H G ++  + I++     N+ ++V+K    CERC   K +  K K    ++K++ P E + +D    L    + ++ +  + D +SKY        Q   T++E L+  W   FG P  +H D G+NFE+  +  L K   I+   ++PYH   NG  ER  RT+ + +  +L    ++ W   + ++ +  N TV ++TG++P  ++FGR+    +D +            +W     S+  +  E +++R+E+    + LK +      +FE  D V  +      RNK    W+  PY+++
Sbjct:  303 ERVKALLSKYNRVFAKDDLDVGCTNLMTHEIPLLDETPVRQPYRRIPPSQYELARSHIQQLLQSQVIRESSSPYASPIVLVQKKDG-GLRMCVDYRQLNARTRKDAFPSPRIDESLDALAGAQWFTTLDLASGYSQVEVAEKDKAKTAFCTPFGLFEFNRMPFGLCNAPSTFQRLMERLFGDCRFQSVLLYLDDVIIFSSSVEQHLQRLEQVFSRLDAQGLKVKLSKCHFFQKQVKYLGHVVSAEGVSTDPEKAAVVRDWRRPANLADLRSFLGFASYYRRFIAGFAKIASPLNSLVARLLPPGRKGKTPKKPVDEFWDAECEESFQELKTALITAPVLAYADFQKPFVLEVDASHGGLGAVLSQEHE-GKRRPVAFASRSLKPTERNMNNYSSMKLELVALKWAVTEKFREYLLGNACTVFTDNNPLSHLATAK--LGATEQRWASELAAFDLTIKYRPGSQNANADALSRQ------HPVLELSGARPLETE-RVLGVQAEMSTLPGLSPSDLYTLQRQDPVLGPFIKYWERGKMPDARERHGLSKPVRELIRQWKRLREHESVLYRCIYGSDGHGETNQLVLPQSLQESILHSLHDDHGHQGTERTLQLIRSRYYWPNMYSDVEKWCRTCERCVLSKALQPKVKTYMGSVKASRPHEILAIDFTV-LDPATDGRENVLVMTDVFSKYTQTVPTKDQRASTVAEALVKHWFQLFGPPARIHSDQGRNFESNLVHRLCKFYQIDKSRTTPYHPQGNGQCERFNRTLHDLLR-TLPPEQKRRWPRHLSQVTFAYNTTVHQSTGMTPYFLMFGREPRLPVDFLLGSDTEHDVPLDEWLEEHQSSLAVAYEAVQRRMENVRAQRDLKMQDQCLAPDFEEGDFVYTRNHGARGRNKIQDFWDPTPYQIV 1195          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000045600.1 (pep primary_assembly:ASM223467v1:4:26167161:26171153:-1 gene:ENSORLG00000023514.1 transcript:ENSORLT00000045600.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 423.32 bits (1087), Expect = 2.696e-124
Identity = 273/906 (30.13%), Positives = 468/906 (51.66%), Query Frame = 1
Query: 2035 KRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKP-RRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESIC---------GKNNEKIR---WTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEK---GYCITRKELLAI-YYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAE----------------------GGYNK-WQNDNMEVQEIKNKLEN------KDCKFIMENNTVLTK----------QGKIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRK----IDRM------------KWY----SNKEINREDMEKRIED----KTLKPKISKTVRNFEMEDVVLIKQE---IRNKDDARWEG-PYKVI 4500
            +R+  +      VF  D  D+GCTNL+ H+I    E  + +P RR P +  +     I+ L  + +IR+ +SP+ +P+V V KK+   +R+C+D+RQLN  T + AFP+P +DE LD L G+++F+++DL + Y QV++ ++ + KTAF T  G F FNRMPFG+  AP TFQ LM ++  D     V++YLDD++IF+ + E H     +V  ++   GL++   KC  F+K+VK+LGH+++ +G+ TD  K   ++ +++P  + +LRSFLG  +YYRRFI  +AK A  L S+          GK  +K     W   CEK+F E+K ALITAPVL + +F+K F+L+ DAS   +GAVLSQ+ E G    +A+ S ++   E+    Y   + EL+A+ +   + F  YL G    + TD+  ++ + T K  + A  Q W + L++ D+ ++YR G+ + NAD LSR+        ++E   A+  + + R+L V AE                      G + K W+   M     ++ L        +  K + E+ +VL +            ++ +P   ++ ++  +H    H G ++  + I++     N+ ++V+K    CERC   K +  K K    ++K++ P E + +D    L    + ++ +  + D +SKY        Q   T++E L+  W   FG P  +H D G NFE+  + +L K   I+   ++PYH   NG  ER   T+ + +  +L    ++ W   + ++ +  N TV ++TG++P  ++FGR+    +D +            +W     S+  +  E +++R+E+    + LK +      +FE  D V  +      RNK    W+  PY+++
Sbjct:  303 ERVKALLSKYNRVFAKDDLDVGCTNLMTHEIPLLDETPVRQPYRRIPPSQYELAHSHIQQLLQSQVIRESSSPYASPIVLVQKKDG-GLRMCVDYRQLNARTRKDAFPLPRIDESLDALAGAQWFTTLDLASGYSQVEVAEKDKAKTAFCTPFGLFEFNRMPFGLCNAPSTFQRLMERLFGDCRFQSVLLYLDDVIIFSSSVEQHLQRLEQVFSRLEAQGLKVKLSKCHFFQKQVKYLGHVVSAEGVSTDPEKAAVVRDWRRPANLADLRSFLGFASYYRRFIAGFAKIASPLNSLVARLLPPGRKGKTPKKPVDEFWDAECEKSFQELKTALITAPVLAYANFQKPFVLEVDASHGGLGAVLSQEHE-GRRRPVAFASRSLKPTERNMNNYSSMKLELVALKWAVTEKFREYLLGNACTVFTDNNPLSHLATAK--LGATEQRWASELAAFDLTIKYRPGSQNANADALSRQ------HPVLELSGARPLETE-RVLGVQAEMSTLPGLSPSDLYTLQQQDPVLGPFIKYWERGKMPDARERHGLSKPVRELIRQWKRLREHESVLYRYIYGSDGHGETNQLVLPQSLQESILHSLHDDHGHQGTERTLQLIRSRYYWPNMYSDVEKWCRTCERCVLSKALQPKVKTYMGSVKASRPHEILAIDFTV-LDPATDGRENVLVMTDVFSKYTQTVPTKDQRASTVAEALVKHWFQLFGPPARIHSDQGWNFESNLVHQLCKFYQIDKSRTTPYHPQGNGQCERFNLTLHDLLR-TLPPEQKRRWPRHLSQVTFAYNTTVHQSTGMTPYFLMFGREPRLPVDFLLGSDTEHDVPLDEWLEEHQSSLAVAYEAVQRRMENVRAQRDLKMQDQCLAPDFEEGDFVYTRNHGARGRNKIQDFWDPTPYQIV 1195          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000029259.1 (SMESG000029259.1)

HSP 1 Score: 2218.74 bits (5748), Expect = 0.000e+0
Identity = 1093/1096 (99.73%), Positives = 1094/1096 (99.82%), Query Frame = 1
Query: 1309 MDDDCMIDKDCSVFERDINNNSSVEIKKSNDKESQKKFKEECKVNEVVKHEERTIEKIYKFLKKNGDVCINEIVEDTREQNILSVEEQKSPVITYNIMQGRPSTTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLRCANNSRLETMGKLNINVKMGSMERNVTFIIVKNLIPEIIGGVELQRLFGIELKYILEEHEKRSDFICEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKEINREDMEKRIEDKTLKPKISKTVRNFEMEDVVLIKQEIRNKDDARWEGPYKVIKKIHERSYLLKDQNGKMVVRNVEKIKHFKKGG 4596
            MDDDCMIDKDCSVFERDINNNSSVEIKKSNDKESQKKFKEECKVNEVVKHEERTIEKIYKFLKKNGDVCINEIVEDTREQNILSVEEQKSP ITYNIMQGRPSTTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLRCANNSRLETMGKLNINVKMGSMERNVTFIIVKNLIPEIIGGVELQRLFGIELK ILEEHEKRSDFICEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCQKMKTITTKTKEETQTIKSTEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEI+YTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKEINREDMEKRIEDKTLKPKISKTVRNFEMEDVVLIKQEIRNKDDARWEGPYKVIKKIHERSYLLKDQNGKMVVRNVEKIKHFKKGG
Sbjct: 2449 MDDDCMIDKDCSVFERDINNNSSVEIKKSNDKESQKKFKEECKVNEVVKHEERTIEKIYKFLKKNGDVCINEIVEDTREQNILSVEEQKSPAITYNIMQGRPSTTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLRCANNSRLETMGKLNINVKMGSMERNVTFIIVKNLIPEIIGGVELQRLFGIELKCILEEHEKRSDFICEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCQKMKTITTKTKEETQTIKSTEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIKYTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKEINREDMEKRIEDKTLKPKISKTVRNFEMEDVVLIKQEIRNKDDARWEGPYKVIKKIHERSYLLKDQNGKMVVRNVEKIKHFKKGG 3544          

HSP 2 Score: 561.607 bits (1446), Expect = 1.537e-166
Identity = 298/340 (87.65%), Positives = 299/340 (87.94%), Query Frame = 2
Query:  161 ASNMATIPTDLLIGRLLPEKFHRGDDLELFIKECQRFFEITKTPVKTQMVLVITLLDRTLIEEYEAAEGKTVEQKLRAAFHRPTSLIDDLREALNYEQGNDSAEIFIEKISKMTKKLASHTWNEEEIQKCLLTHCVRDKEVRKEIEMKDLXXXXXXXXXXXXXXXXXXXXXXVNTVRSIRPTTGGRTYRDVVQVGATKREINEWKPETRVKVIECWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGHKERECYTNMERRNQGRDRDQRKMSGGRIQRNTYQQREDMYQPRNVHQRQWGNQQYNRKDIAAIESDDEMMNTKQSSQP 1180
            ASNMATIPTDLLIGRLLPEKFHRGDDLELFIKECQRFFEITKTPVKTQMVLVITLLDRTLIEEYEAAEGKTVEQKLRAAFHRPTSLIDDLREALNYEQ                                         EVRKEIEMKDLKTAEQIKETIKKIEKVNKVIEQVNTVRSIRPTTGGRTYRDVVQVGATKREINEWKPETRVK+IECWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGHKERECYTNMERRNQGRDRDQRKMSGGRIQRNTYQQREDMYQPRNVHQRQWGNQQYNRKDIAAIESDDEMMNTKQSSQP
Sbjct: 2148 ASNMATIPTDLLIGRLLPEKFHRGDDLELFIKECQRFFEITKTPVKTQMVLVITLLDRTLIEEYEAAEGKTVEQKLRAAFHRPTSLIDDLREALNYEQ-----------------------------------------EVRKEIEMKDLKTAEQIKETIKKIEKVNKVIEQVNTVRSIRPTTGGRTYRDVVQVGATKREINEWKPETRVKMIECWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGHKERECYTNMERRNQGRDRDQRKMSGGRIQRNTYQQREDMYQPRNVHQRQWGNQQYNRKDIAAIESDDEMMNTKQSSQP 2446          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000081257.1 (SMESG000081257.1)

HSP 1 Score: 578.556 bits (1490), Expect = 0.000e+0
Identity = 337/870 (38.74%), Positives = 495/870 (56.90%), Query Frame = 1
Query: 1609 RPSTTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESL-RCANNSRLETMGKLNINVKMGSMERNVTFIIVKNLIPEIIGGVELQRLFGIELKYILEEHEKRSDFICEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYN------KWQNDNMEVQEIKNKLENKDCKF-----------------IMENNTVLTKQGK--IWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCXXXXXXXXXXXXXXXXXXXXEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNG 4140
            R  + ++I G  V+ L ++GA I+VM++     + +  ++++R  L    +N   + +G + I  K     R +  IIV+ + P  IGGV+  ++FG+ELK   E +   S  +     RF    TD +RL++ +  LK+  N  L  +     N+FMA ++D+G T +I H++ T G PI+  PR QP++LE K+EE ++NL   G++RKC SPWNTPLV V K +   +R+CLDFR LN +TE+ +FPMP++  +LD L  S+ F SIDLG AYY V+L++ SQ KTAFSTKEGQFCFNR+PFG++ AP TFQ+LM ++L+ L   GV+VYLDDILI+ + +E H  +  +V  +I  +GL++NPEKC   +  + F+GH +++ GIQT+  KI  I++  +PK    LRSFLG+  YYRRFIK+Y+  A  L +     ++ I WTE C K F  +K+ L  AP+L +P   + FI+DTD SF+ IG                     ++ HE GYC+TRKELL ++ F  HF  YLYG+RFV RTDHKA+TFM TTKKPI+ QFQTW+   S  D  ++YRKG  H NA   SR     C    MEH+DAK  K +TR +  + +G  N      + QN++    +I + L   +                    I +N  ++   GK  + +P    + ++   H+ LCH G  K   Y+++   + ++   + + I  C+ C   K    +TKE           E+I +DI     ET   KKY+  IID +SK +SL     QDE TI   +LN WI +FG P+ +  D  + FE    ++  +  GI+  FSSPY H +NG
Sbjct:  283 RKWSIMEINGFHVEMLWNSGASISVMSEKCWRLIGSPILMDSRILLSEVFSNEDKKPLGSVKIVAKWNKKFRELNVIIVRKIHPNFIGGVDTMKIFGMELK---EVNNIESSLV---NKRF----TDSDRLKNTLLTLKLDKNSELGTLISQFSNIFMASRFDLGHTKVITHELKTSGPPILQNPRGQPMHLEAKVEELVQNLLEAGVVRKCQSPWNTPLVIVGKPDG-SVRICLDFRLLNSVTEKFSFPMPDMQLLLDCLGKSKIFYSIDLGQAYYLVELNENSQIKTAFSTKEGQFCFNRLPFGLSTAPATFQKLMHQILEGLVFKGVVVYLDDILIYGENQETHDKLLFEVFTRIRDSGLKVNPEKCAFNKSVLNFIGHTVSEKGIQTNKRKISEIENATEPKSTTELRSFLGLTTYYRRFIKNYSMIAAPLYAATTGCDKMIVWTEECRKRFINLKKLLCEAPILEYPRADRLFIIDTDDSFEAIG--------------------HLTKHEIGYCVTRKELLVLHEFIVHFRQYLYGRRFVARTDHKALTFMNTTKKPISPQFQTWMANFSEYDFALQYRKGNEHGNAGGWSRLNNTICSHYQMEHKDAKKAKCRTRCIN-SLQGSSNIMKIIKQKQNEDKVTSQIISHLNGNEAHISYKTISSSIFKYLKILQIQDNVLMINTDGKLAVVVPDSYVKSLVNYFHIELCHLGINKTLFYLKDFFFLPSMNQIITECINKCKICASRKIDQGRTKEILLPRTGERFLEQIVVDI--DYMETKESKKYMIVIIDCFSKLVSLP----QDEATILNVILNNWIYRFGRPESILTDRERIFEGSMFRDWMEKFGIKQEFSSPYQHQSNG 1114          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000016546.1 (SMESG000016546.1)

HSP 1 Score: 529.635 bits (1363), Expect = 4.109e-172
Identity = 266/462 (57.58%), Positives = 327/462 (70.78%), Query Frame = 1
Query: 1945 ICEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYL 3330
            +C I+A+FG++IT  ER   A   LKV     L EI + N+NVFMA+KW+IGCT L+KHKI+T+G  I IKPRRQPI+LE KIEEAI+NL  NGII+                             LN +T+RQA+PM N+ EILD   G ++F+SIDLGNAYYQV+L+++S+EKTAFST  GQ+CFNRMPFGIA  PGT QELM KVL ++  +G +VYL+DILIFT T+E +Y +   V+ +I                                    + EAI+SFQKP+C+KNL+SF GICNYYRRFIKDYAKK R LE +CGK N K+ W E CEKAF +MK+AL  +PVL +PDF ++FI+DTD+SFDTIG VLSQKD  G+E VIAYGSHAMS+HEKGYCI RKELLAIYYFC+HFNH+LYGKRF LRTD KAITFM++TKKPITAQF+TWIN+L
Sbjct:  113 MCNIDAKFGKLITGTERFDRARKELKVNYT-VLAEIIKNNQNVFMANKWEIGCTALLKHKIVTRGSLINIKPRRQPIHLEPKIEEAIQNLFKNGIIK----------------------------NLNLVTDRQAYPMQNIAEILDRFEGEKHFNSIDLGNAYYQVELEEKSKEKTAFSTTTGQYCFNRMPFGIATGPGTSQELMRKVLGNI--NGTVVYLNDILIFTATKEQYYAVLNDVIERIG-----------------------------------RPEAIKSFQKPECIKNLKSFPGICNYYRRFIKDYAKKTRTLEELCGKYNVKLIWAENCEKAFEDMKKALTESPVLGYPDFTRDFIIDTDSSFDTIGDVLSQKDNNGYEKVIAYGSHAMSTHEKGYCIKRKELLAIYYFCQHFNHHLYGKRFTLRTDLKAITFMLSTKKPITAQFKTWINHL 508          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000038955.1 (SMESG000038955.1)

HSP 1 Score: 429.098 bits (1102), Expect = 1.210e-129
Identity = 223/509 (43.81%), Positives = 310/509 (60.90%), Query Frame = 1
Query: 1978 ITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYN 3504
             TD +RL+  +  LK+  N +L  +     N+FMA ++D+G T +I H+I           +++P++LE K+EE ++NL   G++RK  SPWNTPLV V K +   IR+CLDFR LN +TE+ +F  P++  +LD L  S+ FSSIDLG AYYQV+L++ SQ KTAFSTKEGQFCFNR+PFG++ AP TFQ+LM ++L+ L   GV+VYLD+ILI+ + +E H  +  +V  +I  +GL++NPEKC   +  + F+GH                                            +Y+  A  +      N++ I WTE C  +F  +K+ L  AP+L +P   + F++DTDASF  IGAVLSQ  E   E VIAYGS  ++ HE G+C+T KELLA++ F  HF  YLYG+RFV RTDHKA+TFM TTKKPI+ QFQTW+  LS  D  ++YRKG  H NAD  SR     C  C+ME++DAK  K +TR +  + +G  N
Sbjct:   97 FTDSDRLKITLSTLKLDKNSKLGTLISKFSNIFMASRFDLGHTKVITHEI-----------KKKPMHLEGKVEELVQNLLEAGVVRKSISPWNTPLVIVGKLDG-SIRMCLDFRLLNSVTEKFSFYSPDMKLLLDCLGNSKIFSSIDLGQAYYQVELNENSQIKTAFSTKEGQFCFNRLPFGLSTAPATFQKLMHQILEGLVFKGVVVYLDEILIYGENQEIHDKLLFEVFTRIRDSGLKVNPEKCAFNKSVLNFIGHT-------------------------------------------NYSIIAAPMYVATTGNDKMIVWTEECRNSFINLKKLLCEAPILEYPRADRLFVIDTDASFGAIGAVLSQIKEDCTEVVIAYGSRHLTKHEMGHCVTIKELLALHEFIVHFRQYLYGRRFVARTDHKALTFMNTTKKPISPQFQTWMANLSEHDFALQYRKGEEHGNADGKSRLNNTKCSHCLMENKDAKEAKCRTRYIN-SLQGSSN 549          

HSP 2 Score: 89.3521 bits (220), Expect = 1.690e-17
Identity = 52/162 (32.10%), Positives = 91/162 (56.17%), Query Frame = 1
Query: 4018 APKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNE-GGRKNWADIVPEIEYTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKEINREDMEKRIEDKTLKPKISKTVRNFEMEDVVLIKQEIRNKDDARWEGPYKVI 4500
            +P+ +  D G  FE    ++  +  GI+  FSSPY H +N + ER  RT+R+ +  SL E   + NW  ++P IE++ NAT+Q +T  SP EI++GRKI+      + +  RE++E   E KT   K + T++N +++++     + R   +   + PY+ +
Sbjct:  615 SPESILTDRGIIFEGSMFRDWMEKFGIKQEFSSPYQHQSNILAERIIRTVRDMLATSLAEIKTKNNWCRLLPRIEFSFNATIQNSTKFSPFEIVYGRKINLYSGVEHIQKFREEIED--ETKTNLVKAATTMQNRDIDNLGTRSLKNRAYQEGIKQTPYEAL 774          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000035021.1 (SMESG000035021.1)

HSP 1 Score: 372.474 bits (955), Expect = 1.298e-114
Identity = 220/569 (38.66%), Positives = 314/569 (55.18%), Query Frame = 1
Query: 1606 GRPSTTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLRCANNSRLETMGKLNINVKMGSMERNVTFIIVKNLIPEIIGGVELQRLFGIELKYILEEHEKRSDFICEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQ 3312
            GR  + + + G  V+ L D+GA I+V+++     + +  ++++R  L        E +    I  K     R +  IIV+ + P+ IGGV+  ++FG+ELK      E  +     +  RF    TD E L+  +  LK   N +L  +     N+FMA ++D G   +I H+I T   PI+   RRQ ++LE K+EE ++NL   G++RK  SPWNTPLV V K +                               D  + SR+        AYYQV+L++ SQ KTAFSTKEGQ CFNR+P G++ AP TFQ+ + + LK L   G ++YLDDILI+ + +E +  +  +V  +I  +GL+ NPEKC   +  + F+GH +++ GIQT+  KI  I++  +PK    LRSFLG+ N+Y+RFIK+Y+  A  L +                           TAP+L +P   + FI+DTD SF  IGAVLSQ  E G E VIAYGS  ++ HE  YC+TRKEL A++ F  HF  YLYGKRFV R DHKA+TFM TTKKPI++QF 
Sbjct:    5 GRKWSMMRLNGFHVEMLWDSGASISVISEKCWRLIGSPILMDSRIRLSGVFPKEDEKLLGSKIVAKWNKKFRKLNVIIVRKIYPDFIGGVDTIKIFGMELK------EVNNIESLLVNKRF----TDSELLKITLLTLKFDKNTKLGTLIFQFSNIFMASRFDFGHIKVIAHEIKTSDSPILQNSRRQLMHLEGKVEELVQNLLEAGVLRKSQSPWNTPLVIVGKPD-------------------------------DSKNVSRF-------QAYYQVELNENSQIKTAFSTKEGQLCFNRLPIGLSTAPATFQKHINQTLKGLVFKGEVLYLDDILIYAENQETYDKLLFEVFIRIRDSGLKANPEKCAFNKSVLNFIGHTVSEKGIQTNKRKISEIENATEPKSTTELRSFLGLTNFYQRFIKNYSMIAEPLYTAT-------------------------TAPILEYPRADRLFIIDTDDSFGAIGAVLSQIKEDGTEGVIAYGSRHLTEHEMEYCMTRKELFALHEFIVHFRQYLYGKRFVARMDHKALTFMNTTKKPISSQFH 500          
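
Reading the HSP summary lines: the percentages in each "Identity = ..." line are the match counts divided by the alignment length, and because these are BLASTX hits the Query coordinates are nucleotide positions, so the query span covers one third as many residues. The Python sketch below is only an illustration of that arithmetic and is not part of the database's own tooling; the names `parse_hsp_line`, `percent`, and `query_codons` are hypothetical helpers introduced here.

```python
import re

# Pattern for an HSP summary line as printed in this report.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Query Frame = (-?\d+)"
)


def parse_hsp_line(line):
    """Extract counts, reported percentages, and frame from one summary line."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, _pos_len, pos_pct, frame = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),          # alignment columns, gaps included
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
        "frame": int(frame),
    }


def percent(matches, length):
    """Recompute a percentage the way the report rounds it (two decimals)."""
    return round(100.0 * matches / length, 2)


def query_codons(start, end):
    """Number of translated residues covered by a nucleotide query span."""
    return (abs(end - start) + 1) // 3


hsp = parse_hsp_line(
    "Identity = 25/86 (29.07%), Positives = 52/86 (60.47%), Query Frame = 1"
)
print(percent(hsp["identical"], hsp["length"]))   # recomputed identity
print(percent(hsp["positives"], hsp["length"]))   # recomputed positives
print(query_codons(2608, 2865))                   # residues in Query 2608..2865
```

For an ungapped query such as the 2608 to 2865 span of the EDO33875 hit, the codon count equals the alignment length; when the query row contains gap characters, the alignment length is larger by the number of gap columns.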
The following BLAST results are available for this feature:
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 5
Match Name    E-value      Identity (%)    Description
RTL1          3.817e-18    24.38           retrotransposon Gag like 1 [Source:HGNC Symbol;Acc...
RTL1          3.817e-18    24.38           retrotransposon Gag like 1 [Source:HGNC Symbol;Acc...
CNBP          2.084e-7     45.83           CCHC-type zinc finger nucleic acid binding protein...
CNBP          2.294e-7     45.83           CCHC-type zinc finger nucleic acid binding protein...
CNBP          2.350e-7     45.83           CCHC-type zinc finger nucleic acid binding protein...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
Match Name    E-value      Identity (%)    Description
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
Match Name    E-value      Identity (%)    Description
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 5
Match NameE-valueIdentityDescription
BX511082.13.350e-8827.13pep chromosome:GRCz11:9:14291932:14297132:1 gene:E... [more]
BX546500.17.142e-8824.89pep chromosome:GRCz11:23:12926092:12931693:-1 gene... [more]
FO704673.11.558e-8625.15pep chromosome:GRCz11:12:14545475:14551077:-1 gene... [more]
CR855320.19.844e-8627.53pep chromosome:GRCz11:1:7956030:7961696:1 gene:ENS... [more]
BX511224.11.931e-8527.12pep chromosome:GRCz11:2:18017000:18022765:1 gene:E... [more]
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 5
Match Name | E-value | Identity | Description
ENSXETT00000006041.1 | 3.403e-106 | 28.01 | pep primary_assembly:Xenopus_tropicalis_v9.1:6:373...
anxa6 | 1.738e-95 | 36.91 | annexin A6 [Source:Xenbase;Acc:XB-GENE-989741]
ENSXETT00000035398.1 | 1.447e-86 | 28.83 | pep primary_assembly:Xenopus_tropicalis_v9.1:3:986...
ENSXETT00000034712.1 | 1.345e-85 | 26.22 | pep primary_assembly:Xenopus_tropicalis_v9.1:9:335...
ENSXETT00000042189.1 | 4.854e-85 | 26.12 | pep primary_assembly:Xenopus_tropicalis_v9.1:KV460...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 5
Match Name | E-value | Identity | Description
Rtl1 | 1.742e-14 | 24.00 | retrotransposon Gag like 1 [Source:MGI Symbol;Acc:M...
Rtl1 | 1.742e-14 | 24.00 | retrotransposon Gag like 1 [Source:MGI Symbol;Acc:M...
Zcchc13 | 7.190e-8 | 43.10 | zinc finger, CCHC domain containing 13 [Source:MGI...
Cnbp | 1.575e-7 | 45.83 | cellular nucleic acid binding protein [Source:MGI ...
Cnbp | 1.638e-7 | 45.83 | cellular nucleic acid binding protein [Source:MGI ...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 5
Match Name | E-value | Identity | Description
sp|P20825|POL2_DROME | 2.138e-96 | 29.72 | Retrovirus-related Pol polyprotein from transposon...
sp|P04323|POL3_DROME | 7.612e-94 | 33.28 | Retrovirus-related Pol polyprotein from transposon...
sp|Q99315|YG31B_YEAST | 1.143e-89 | 30.24 | Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomy...
sp|Q7LHG5|YI31B_YEAST | 5.623e-89 | 30.25 | Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomy...
sp|P10394|POL4_DROME | 3.829e-86 | 36.50 | Retrovirus-related Pol polyprotein from transposon...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match Name | E-value | Identity | Description
A0A0V0RP87 | 7.783e-167 | 30.67 | Transposon Ty3-G Gag-Pol polyprotein OS=Trichinell...
A0A0V0X0B7 | 2.191e-166 | 30.67 | Transposon Ty3-G Gag-Pol polyprotein OS=Trichinell...
A0A087SUZ9 | 1.844e-163 | 33.66 | Retrovirus-related Pol polyprotein from transposon...
A0A5S6QWV2 | 4.140e-163 | 31.45 | Uncharacterized protein OS=Trichuris muris OX=7041...
A0A4Y2G706 | 1.255e-161 | 33.58 | Transposon Ty3-I Gag-Pol polyprotein OS=Araneus ve...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 5
Match Name | E-value | Identity | Description
ENSAMXT00000045469.1 | 7.989e-130 | 31.72 | pep primary_assembly:Astyanax_mexicanus-2.0:APWO02...
ENSAMXT00000041345.1 | 3.279e-127 | 29.49 | pep primary_assembly:Astyanax_mexicanus-2.0:25:323...
ENSAMXT00000041682.1 | 5.899e-127 | 32.22 | pep primary_assembly:Astyanax_mexicanus-2.0:APWO02...
ENSAMXT00000054230.1 | 1.683e-125 | 29.71 | pep primary_assembly:Astyanax_mexicanus-2.0:14:194...
ENSAMXT00000034038.1 | 8.175e-125 | 30.60 | pep primary_assembly:Astyanax_mexicanus-2.0:14:312...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 4
Match Name | E-value | Identity | Description
ENSPMAT00000009777.1 | 1.265e-7 | 18.61 | pep scaffold:Pmarinus_7.0:GL476990:135790:139231:-...
ENSPMAT00000010393.1 | 2.248e-7 | 32.00 | pep scaffold:Pmarinus_7.0:GL485791:8073:10868:-1 g...
ENSPMAT00000004121.1 | 2.643e-6 | 20.33 | pep scaffold:Pmarinus_7.0:GL477387:93825:107330:1 ...
ENSPMAT00000004123.1 | 8.148e-6 | 20.75 | pep scaffold:Pmarinus_7.0:GL477387:93825:107321:1 ...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 1
Match Name | E-value | Identity | Description
GIS2 | 5.569e-9 | 46.43 | Translational activator for mRNAs with internal ri...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 4
Match Name | E-value | Identity | Description
EDO33875 | 1.012e-8 | 29.07 | Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
EDO33090 | 3.002e-7 | 25.53 | Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
EDO25785 | 5.458e-6 | 36.92 | Predicted protein [Source:UniProtKB/TrEMBL;Acc:A8...
EDO43256 | 6.389e-6 | 33.33 | Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 5
Match Name | E-value | Identity | Description
ENSORLT00000036827.1 | 1.408e-138 | 29.55 | pep primary_assembly:ASM223467v1:16:28613823:28617...
ENSORLT00000039099.1 | 5.101e-128 | 31.40 | pep primary_assembly:ASM223467v1:16:31928608:31932...
ENSORLT00000029819.1 | 5.792e-128 | 31.40 | pep primary_assembly:ASM223467v1:24:6358775:636291...
ENSORLT00000040501.1 | 3.085e-125 | 30.24 | pep primary_assembly:ASM223467v1:1:1251188:1255180...
ENSORLT00000045600.1 | 2.696e-124 | 30.13 | pep primary_assembly:ASM223467v1:4:26167161:261711...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match Name | E-value | Identity | Description
SMESG000029259.1 | 1.537e-166 | 99.73 | SMESG000029259.1
SMESG000081257.1 | 0.000e+0 | 38.74 | SMESG000081257.1
SMESG000016546.1 | 4.109e-172 | 57.58 | SMESG000016546.1
SMESG000038955.1 | 1.210e-129 | 43.81 | SMESG000038955.1
SMESG000035021.1 | 1.298e-114 | 38.66 | SMESG000035021.1
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30006762 ID=SMED30006762|Name=Transposon Ty3-I Gag-Pol polyprotein|organism=Schmidtea mediterranea sexual|type=transcript|length=4786bp
GTGGTATGTTGATGGTTTGTTGATATTTGATTAAAATTACAATTTATGAA
AAGAAAATGTCCCAAGAATATTTGTTTATATAATCAGATTTTTATGAAAT
ATAATTTGGATTTAATAGTCTTAGTAATTTATTTTGATGACTGAAGAATT
AAATGTTATAGCTTCTAACATGGCGACCATTCCAACGGATCTGCTTATTG
GAAGATTACTTCCAGAAAAATTCCACAGAGGAGATGACCTCGAATTATTT
ATCAAAGAATGTCAGAGGTTCTTTGAAATAACGAAGACACCAGTGAAGAC
ACAAATGGTACTAGTTATTACACTTCTGGACAGGACTTTGATTGAAGAAT
ATGAAGCCGCAGAAGGTAAAACCGTTGAACAGAAATTAAGAGCAGCTTTT
CATCGACCAACGTCATTGATTGATGACTTAAGAGAAGCCTTGAACTACGA
GCAAGGTAACGATTCGGCGGAAATATTTATAGAGAAAATATCAAAAATGA
CTAAAAAATTGGCCTCACACACTTGGAATGAGGAAGAAATTCAGAAATGT
TTATTGACACATTGTGTAAGAGATAAAGAAGTCAGGAAAGAAATCGAAAT
GAAAGATCTCAAAACGGCGGAACAAATTAAAGAAACTATAAAGAAGATAG
AAAAAGTTAACAAGGTTATTGAACAAGTAAACACGGTGAGATCAATACGA
CCTACAACTGGGGGAAGAACTTATAGAGATGTAGTACAAGTTGGAGCCAC
GAAGAGAGAAATCAACGAATGGAAACCAGAGACAAGAGTTAAAGTGATAG
AATGCTGGACGTGTCAAAAACCGGGACACAGTAGCAGAGAATGTAATATA
AAAAGAAGATTTCAATGTTATGCATGTGGTGTTGAAGGTCACATACGAAG
AGAATGCCCAACAATCAAATGTCATAGATGCAATGCACGAGGACACAAAG
AAAGAGAGTGCTACACAAATATGGAAAGACGAAATCAAGGAAGAGATCGA
GACCAAAGGAAGATGTCAGGAGGAAGAATTCAAAGAAATACCTATCAACA
AAGGGAAGATATGTATCAACCAAGGAACGTACATCAAAGACAATGGGGAA
ACCAACAATATAACAGAAAAGATATTGCAGCTATCGAGTCTGATGATGAG
ATGATGAATACAAAACAGTCAAGCCAGCCAGACGAGTATAACAGGGAGAA
TGACCCAAACGAGCACGCTCCGTCAAACGGGACATTGATCGGAGCAATTT
ACTAAGTAAGATTGATTTCAACCAAAGAATGAATGCTAACGGTATGTGTA
ATTTAAGTATGGATGATGATTGTATGATTGATAAAGATTGTAGTGTTTTT
GAACGTGATATTAATAATAATAGTAGTGTAGAAATTAAAAAGTCGAATGA
TAAGGAAAGTCAGAAAAAATTTAAAGAGGAGTGTAAAGTAAATGAAGTAG
TAAAACACGAGGAAAGAACGATAGAGAAAATTTACAAATTCCTAAAGAAA
AATGGAGACGTGTGTATTAATGAAATTGTAGAAGATACTAGAGAGCAGAA
TATTTTGTCAGTAGAAGAACAAAAAAGTCCAGTAATAACATATAATATAA
TGCAAGGTAGACCGAGTACAACACTGGATATCCAAGGGCGGAAAGTAGAT
TGTTTATTGGATACAGGAGCGCGAATTAATGTGATGGCTAAATCTGTAAT
TGATCGATTAGAAAATATCGAAATATTAGAAACAAGGGAATCGCTAAGAT
GTGCAAACAACAGTAGATTAGAAACTATGGGTAAACTAAACATCAATGTT
AAAATGGGCAGTATGGAAAGAAATGTAACATTCATTATAGTAAAGAATTT
AATACCAGAAATTATTGGAGGAGTAGAACTACAAAGGTTATTTGGTATAG
AATTGAAATATATACTAGAAGAACATGAAAAACGTAGTGATTTCATTTGC
GAAATAGAAGCTAGATTTGGGCGGATTATAACAGATGAAGAAAGATTACG
TCATGCCATCGACGTTTTAAAAGTTACCGGAAATAAGAGACTGCTAGAAA
TATTTCAGGCAAACAAGAATGTTTTTATGGCAGATAAATGGGACATTGGG
TGTACCAATCTGATAAAACATAAGATCATCACGAAAGGAGAGCCAATAAT
GATTAAACCGAGACGTCAGCCAATAAATTTGGAAGACAAGATTGAGGAAG
CAATAAAAAATCTAGAAAACAACGGAATAATTAGGAAGTGCAATTCACCG
TGGAACACACCTTTAGTTTGTGTATGGAAGAAAGAGAAAAAAGACATCAG
GCTTTGTCTAGACTTCAGACAATTAAACAAGATAACAGAAAGACAAGCAT
TTCCAATGCCAAATGTAGATGAAATTTTAGACATCCTACACGGATCCAGA
TATTTTAGCTCAATCGACTTGGGAAATGCTTATTACCAAGTGAAGTTAGA
TAAAGAATCTCAAGAGAAAACAGCATTCTCAACAAAAGAAGGACAGTTCT
GTTTTAACAGGATGCCGTTCGGTATTGCAGCGGCACCAGGAACATTTCAA
GAATTAATGACGAAAGTATTGAAAGACTTGTGGAAAGATGGAGTGATGGT
ATATTTAGACGACATCCTAATATTCACAAAGACAGAAGAAGACCATTATA
ACATATTTGGAAAAGTCCTAGGGAAGATCGCAACAGCAGGACTAAGATTG
AACCCCGAAAAATGTCAAATATTTAGAAAAGAAGTGAAGTTTCTGGGACA
CATAATAAATAAAGACGGCATACAAACAGATAATACTAAAATAGAAGCAA
TACAATCATTTCAAAAACCAAAATGTGTGAAGAATCTGAGGAGCTTTCTG
GGTATCTGTAACTATTATCGACGGTTCATAAAAGACTATGCAAAGAAGGC
AAGAGCACTAGAAAGTATATGCGGAAAAAACAATGAGAAAATAAGATGGA
CAGAGATGTGTGAAAAGGCTTTCGGGGAAATGAAAGAAGCATTGATAACC
GCCCCAGTATTGGTATTTCCAGATTTCAGAAAAGAATTTATATTAGACAC
AGACGCGAGCTTCGATACTATTGGAGCAGTTCTTTCGCAAAAGGATGAAA
AAGGACATGAACATGTCATCGCATATGGTTCACATGCGATGAGCAGCCAC
GAAAAAGGATACTGCATTACCAGAAAAGAATTATTGGCAATATACTATTT
TTGTAAACATTTCAACCACTACTTATATGGTAAGAGATTCGTACTGAGAA
CGGACCATAAAGCTATTACGTTTATGGTAACAACGAAGAAACCAATAACG
GCTCAATTCCAGACATGGATCAACTATTTAAGCAGTCTGGATATTAAAAT
GGAATACAGGAAAGGAACAAGCCATACAAACGCGGATATGCTATCCAGGA
AAACATGCGGAACATGTGTACAGTGTATGATGGAACACGAAGACGCAAAA
ACCGGCAAAATTAAAACCAGAATATTAACAGTAACAGCAGAAGGCGGATA
TAACAAGTGGCAAAATGACAATATGGAGGTCCAAGAAATAAAGAATAAGT
TAGAAAATAAAGATTGTAAGTTCATAATGGAAAACAACACGGTACTAACT
AAACAAGGTAAAATATGGATACCGTCAGATAATAGACAGAGGATGATAAA
AGAAGTACACGTATTGCTGTGCCATGCAGGTGCACAGAAAGTAACAAAAT
ATATCCAGAACAACTGCGACATGGAAAATCTAGCAACGGAAGTAAAAAAG
GTAATTGAGAACTGCGAAAGATGTCAGAAAATGAAAACGATAACAACCAA
GACAAAGGAGGAAACTCAAACAATAAAGAGTACAGAACCATTCGAGAAGA
TATATATGGATATATGTGGGCCATTGAAAGAAACGTTCAACAAAAAGAAA
TATATTTGCGGAATCATTGACCACTATAGTAAATACATCTCATTAACGGC
CATAAACAAGCAAGACGAAAGAACAATTAGTGAGACGTTATTAAATAAAT
GGATATTAAAGTTTGGAGCGCCAAAAGAACTTCATGTAGATTGTGGGAAG
AACTTTGAAGCAAGAAGCATAAAAGAACTAGCAAAGACAGCTGGCATTGA
ATTAATTTTCTCAAGCCCATATCACCATAACACGAATGGTATTATTGAAA
GACAATTCAGAACAATCAGAGAGTATATTAACGCGTCATTGAATGAAGGA
GGAAGGAAAAACTGGGCTGATATAGTGCCAGAAATAGAATATACGTTAAA
TGCAACAGTTCAAAAAACAACGGGAGTAAGCCCAGCAGAGATTATATTTG
GGAGGAAGATCGACAGAATGAAATGGTACTCGAATAAAGAAATAAATAGA
GAAGATATGGAAAAGAGAATAGAGGATAAAACACTAAAACCAAAAATAAG
CAAAACAGTGAGAAATTTTGAAATGGAAGATGTGGTCTTGATCAAACAAG
AAATTCGAAATAAAGACGATGCAAGGTGGGAAGGGCCGTACAAAGTGATA
AAGAAAATACATGAACGGAGCTACTTACTTAAGGATCAAAATGGGAAGAT
GGTAGTCAGAAATGTTGAGAAGATCAAACATTTTAAAAAAGGGGGATGTG
AGGAGTTAAAATTAAAATACTAAATTAAATTAGTTCACGATTAAAATTTG
TAAAGTTTTAATTGAAACTTAAATTATTTTAAGATATTAAAATTGAAATG
ATAATTGAAATTTTAAAATAAAATAATTTTTAATAAAATTGAAATTCAAG
TATGAATGCAATGGTATGTTGATATTTGGTTGAAAT

protein sequence of SMED30006762-orf-1

>SMED30006762-orf-1 ID=SMED30006762-orf-1|Name=SMED30006762-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=373bp
MTEELNVIASNMATIPTDLLIGRLLPEKFHRGDDLELFIKECQRFFEITK
TPVKTQMVLVITLLDRTLIEEYEAAEGKTVEQKLRAAFHRPTSLIDDLRE
ALNYEQGNDSAEIFIEKISKMTKKLASHTWNEEEIQKCLLTHCVRDKEVR
KEIEMKDLKTAEQIKETIKKIEKVNKVIEQVNTVRSIRPTTGGRTYRDVV
QVGATKREINEWKPETRVKVIECWTCQKPGHSSRECNIKRRFQCYACGVE
GHIRRECPTIKCHRCNARGHKERECYTNMERRNQGRDRDQRKMSGGRIQR
NTYQQREDMYQPRNVHQRQWGNQQYNRKDIAAIESDDEMMNTKQSSQPDE
YNRENDPNEHAPSNGTLIGAIY*

protein sequence of SMED30006762-orf-2

>SMED30006762-orf-2 ID=SMED30006762-orf-2|Name=SMED30006762-orf-2|organism=Schmidtea mediterranea sexual|type=polypeptide|length=1115bp
MNANGMCNLSMDDDCMIDKDCSVFERDINNNSSVEIKKSNDKESQKKFKE
ECKVNEVVKHEERTIEKIYKFLKKNGDVCINEIVEDTREQNILSVEEQKS
PVITYNIMQGRPSTTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEIL
ETRESLRCANNSRLETMGKLNINVKMGSMERNVTFIIVKNLIPEIIGGVE
LQRLFGIELKYILEEHEKRSDFICEIEARFGRIITDEERLRHAIDVLKVT
GNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPIN
LEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLN
KITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAF
STKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFT
KTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQT
DNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESICGK
NNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGA
VLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLY
GKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHT
NADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNME
VQEIKNKLENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVHVLLCHA
GAQKVTKYIQNNCDMENLATEVKKVIENCERCQKMKTITTKTKEETQTIK
STEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTI
SETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHH
NTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIEYTLNATVQKTTGV
SPAEIIFGRKIDRMKWYSNKEINREDMEKRIEDKTLKPKISKTVRNFEME
DVVLIKQEIRNKDDARWEGPYKVIKKIHERSYLLKDQNGKMVVRNVEKIK
HFKKGGCEELKLKY*
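
The FASTA headers above pack their metadata into `key=value` pairs separated by `|`. As a minimal sketch (the function name `parse_fasta_header` is hypothetical and not part of any database tooling), the fields can be pulled out in Python like this, assuming every header follows the layout shown on this page:

```python
def parse_fasta_header(header: str) -> dict:
    """Split a '>record_id key=value|key=value|...' FASTA header into a dict.

    Assumes the layout used by the headers on this page; other FASTA
    sources format their description lines differently.
    """
    if not header.startswith(">"):
        raise ValueError("FASTA header must start with '>'")
    # The first whitespace-delimited token is the record identifier;
    # everything after it is a '|'-separated list of key=value pairs.
    record_id, _, rest = header[1:].strip().partition(" ")
    fields = {"record_id": record_id}
    for pair in rest.split("|"):
        key, sep, value = pair.partition("=")
        if sep:  # skip fragments that carry no '='
            fields[key.strip()] = value.strip()
    return fields


header = (
    ">SMED30006762-orf-2 ID=SMED30006762-orf-2|Name=SMED30006762-orf-2"
    "|organism=Schmidtea mediterranea sexual|type=polypeptide|length=1115bp"
)
fields = parse_fasta_header(header)
```

Note that the `length` field is kept as the raw string from the header (including its `bp` suffix), since the sketch makes no attempt to interpret units.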
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
Term | Definition
PLANA:0000026 | gut
PLANA:0000099 | neuron
PLANA:0002089 | reproductive organ
PLANA:0003116 | parenchymal cell
Vocabulary: INTERPRO
Term | Definition
IPR001878 | Znf_CCHC
IPR000477 | RT_dom
IPR012337 | RNaseH-like_sf
IPR001584 | Integrase_cat-core
IPR021109 | Peptidase_aspartic_dom_sf
Vocabulary: molecular function
Term | Definition
GO:0003676 | nucleic acid binding
GO:0008270 | zinc ion binding
Vocabulary: biological process
Term | Definition
GO:0015074 | DNA integration
InterPro
Analysis Name: Schmidtea mediterranea smed_20140614 Interproscan
Date Performed: 2020-05-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | TMHMM | TMhelix | | coord: 42..64
None | No IPR available | TMHMM | TMhelix | | coord: 132..151
None | No IPR available | TMHMM | TMhelix | | coord: 105..127