Transposon Ty3-I Gag-Pol polyprotein

Overview
Name: Transposon Ty3-I Gag-Pol polyprotein
Smed ID: SMED30035617
Length (bp): 5321
Neoblast Clusters

Zeng et al., 2018





 



 

Overview

 

Single-cell RNA-seq of pluripotent neoblasts and their early progeny


We isolated X1 neoblast cells enriched for high piwi-1 expression (Neoblast Population) and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes among 7,088 high-quality cells. We designated these classes Nb1 to Nb12, ordered from high (Nb1) to low (Nb12) piwi-1 expression. We further defined the groups of genes that best classified the cells in each of the 12 clusters and used them to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signature was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1, which largely confirmed the cell clusters revealed by scRNA-seq.

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, then isolated 1,200 individual cells derived from the X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations and profiled them by single-cell RNA-seq (Sub-lethal Irradiated Surviving X1 and X2 Cell Population).
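The cluster naming described above (Nb1 for the highest piwi-1 expression down to Nb12 for the lowest) can be illustrated with a minimal sketch. It assumes clusters are ranked by their mean marker expression; the page does not say which summary statistic was actually used, and the function names and toy values below are hypothetical.

```python
from collections import defaultdict


def order_clusters_by_marker(cells):
    """Rank cluster labels from highest to lowest mean marker expression.

    cells: iterable of (cluster_label, marker_expression) pairs.
    Using the mean is an assumption made for illustration only.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for label, expression in cells:
        totals[label] += expression
        counts[label] += 1
    means = {label: totals[label] / counts[label] for label in totals}
    return sorted(means, key=means.get, reverse=True)


def rename_clusters(ordered_labels, prefix="Nb"):
    """Map the ranked labels to names such as Nb1, Nb2, ..."""
    return {old: f"{prefix}{rank}" for rank, old in enumerate(ordered_labels, start=1)}


# Toy data: three unnamed clusters with made-up marker expression values.
toy_cells = [("a", 2.0), ("a", 3.0), ("b", 5.0), ("b", 6.0), ("c", 0.5)]
ordered = order_clusters_by_marker(toy_cells)
names = rename_clusters(ordered)
```

With the toy values, cluster "b" has the highest mean and is therefore named first, followed by "a" and then "c".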




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of Transposon Ty3-I Gag-Pol polyprotein (SMED30035617) in the t-SNE clustered cells.

Violin plots show distribution of expression levels for Transposon Ty3-I Gag-Pol polyprotein (SMED30035617) in cells (dots) of each of the 12 neoblast clusters.

 



 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Transposon Ty3-I Gag-Pol polyprotein (SMED30035617) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Transposon Ty3-I Gag-Pol polyprotein (SMED30035617) in cells (dots) of each of the 10 clusters of sub-lethally irradiated X1 and X2 cells.

 



 

Embryonic Expression

Davies et al., 2017




Hover the mouse over a column in the graph to view average RPKM values per sample.

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods.
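The embryonic expression values are reported as RPKM (reads per kilobase of transcript per million mapped reads). As a reminder of what that unit means, here is a minimal sketch of the standard RPKM formula. The function name is hypothetical, and this is not necessarily the exact pipeline used for the dataset.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Standard RPKM: reads per kilobase of transcript per million mapped reads.

    RPKM = read_count * 1e9 / (transcript_length_bp * total_mapped_reads)
    """
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Hypothetical counts; 5321 bp is the transcript length listed on this page.
example_value = rpkm(read_count=500, transcript_length_bp=5321,
                     total_mapped_reads=20_000_000)
```

Dividing by transcript length and by library size makes values comparable across transcripts of different lengths and samples of different sequencing depth.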

 

Anatomical Expression

PAGE et al., 2020




SMED30035617 has been reported as being expressed in these anatomical structures and/or regions. Read more about PAGE



PAGE Curations: 29

  
Expressed In | Reference Transcript | Gene Models | Published Transcript | Transcriptome | Publication | Specimen | Lifecycle | Evidence
Smed sexual biotype | SMED30035617 | h1SMcG0004802 | Contig46169 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0004802 | Contig46169 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0004803 | Contig46169 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0004803 | Contig46169 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0009729 | Contig45798 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0009729 | Contig45798 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0001428 | Contig19051 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0001428 | Contig19051 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0009729 | Contig17060 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0009729 | Contig17060 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0009729 | Contig19051 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0009729 | Contig19051 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0009729 | Contig35011 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 | h1SMcG0009729 | Contig35011 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig18539 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig18539 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig18102 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig18102 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig16918 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig16918 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig16094 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig16094 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig16739 | newmark_ests | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
Smed sexual biotype | SMED30035617 |  | Contig16739 | uc_Smed_v2 | PMID:29674431 (Fincher et al., 2018) | FACS sorted cell population | adult hermaphrodite | single-cell RNA-sequencing evidence
neuron | SMED30035617 | h1SMcG0009729 | dd_Smed_v6_5675_0 | dd_Smed_v6 | PMID:29674432 (Plass et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
cholinergic neuron | SMED30035617 | h1SMcG0009729 | dd_Smed_v6_5675_0 | dd_Smed_v6 | PMID:29674432 (Plass et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
GABAergic neuron | SMED30035617 | h1SMcG0009729 | dd_Smed_v6_5675_0 | dd_Smed_v6 | PMID:29674432 (Plass et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
parenchymal cell | SMED30035617 | h1SMcG0009729 | dd_Smed_v6_5675_0 | dd_Smed_v6 | PMID:29674432 (Plass et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
glial cell | SMED30035617 | h1SMcG0009729 | dd_Smed_v6_5675_0 | dd_Smed_v6 | PMID:29674432 (Plass et al., 2018) | FACS sorted cell population | asexual adult | single-cell RNA-sequencing evidence
Homology
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX511082.1 (pep chromosome:GRCz11:9:14291932:14297132:1 gene:ENSDARG00000113678.1 transcript:ENSDART00000183119.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX511082.1)

HSP 1 Score: 55.4546 bits (132), Expect = 7.444e-7
Identity = 30/74 (40.54%), Positives = 41/74 (55.41%), Query Frame = 1
Query: 5098 LTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            L   +P   + Y L+  ++   EK +   L A +I PS S        VKKKDG+ R CIDYR LNA+T ++TY
Sbjct:  519 LPGTSPPKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAGAGFFFVKKKDGSLRPCIDYRGLNAITVKNTY 592          
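The statistics reported for each HSP can be checked by hand. The sketch below assumes the query coordinates are nucleotide positions on the 5,321 bp transcript, translated in the reported frame (as in a BLASTX-style search), so that three query nucleotides correspond to one aligned residue when the query row contains no gaps. The helper names are hypothetical.

```python
def aligned_codons(query_start, query_end):
    """Number of codons spanned by nucleotide query coordinates (inclusive)."""
    span = abs(query_end - query_start) + 1
    if span % 3 != 0:
        raise ValueError("query span is not a whole number of codons")
    return span // 3


def percent(matches, alignment_length):
    """Percentage as printed in the HSP summary line, rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Values taken from the first zebrafish HSP above (query 5098..5319, 30/74, 41/74).
codons = aligned_codons(5098, 5319)
identity_pct = percent(30, 74)
positives_pct = percent(41, 74)
```

For this HSP the query span of 222 nucleotides corresponds to 74 codons, which matches the alignment length in the summary line, and the two ratios reproduce the reported 40.54% and 55.41%. For gapped alignments the alignment length can exceed the query codon count, so the first check only holds for ungapped query rows.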
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX546500.1 (pep chromosome:GRCz11:23:12926092:12931693:-1 gene:ENSDARG00000086495.3 transcript:ENSDART00000122176.3 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX546500.1)

HSP 1 Score: 54.6842 bits (130), Expect = 1.494e-6
Identity = 29/74 (39.19%), Positives = 41/74 (55.41%), Query Frame = 1
Query: 5098 LTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            L   +P   + Y L+  ++   EK +   L A +I PS S        VKKKDG+ R CIDYR LN++T ++TY
Sbjct:  493 LPGTSPPKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAGAGFFFVKKKDGSLRPCIDYRGLNSITVKNTY 566          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: CR925755.2 (pep chromosome:GRCz11:17:42486740:42492668:-1 gene:ENSDARG00000116402.1 transcript:ENSDART00000183946.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CR925755.2)

HSP 1 Score: 54.6842 bits (130), Expect = 1.506e-6
Identity = 29/74 (39.19%), Positives = 41/74 (55.41%), Query Frame = 1
Query: 5098 LTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            L   +P   + Y L+  ++   EK +   L A +I PS S        VKKKDG+ R CIDYR LN++T ++TY
Sbjct:  492 LPGTSPPKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAGAGFFFVKKKDGSLRPCIDYRGLNSITVKNTY 565          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX511224.1 (pep chromosome:GRCz11:2:18017000:18022765:1 gene:ENSDARG00000113243.1 transcript:ENSDART00000186877.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX511224.1)

HSP 1 Score: 54.6842 bits (130), Expect = 1.550e-6
Identity = 29/74 (39.19%), Positives = 41/74 (55.41%), Query Frame = 1
Query: 5098 LTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            L   +P   + Y L+  ++   EK +   L A +I PS S        VKKKDG+ R CIDYR LN++T ++TY
Sbjct:  519 LPGTSPPKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAGAGFFFVKKKDGSLRPCIDYRGLNSITVKNTY 592          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: CR749164.1 (pep chromosome:GRCz11:21:30355767:30361679:1 gene:ENSDARG00000116832.1 transcript:ENSDART00000189948.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CR749164.1)

HSP 1 Score: 53.1434 bits (126), Expect = 4.245e-6
Identity = 38/124 (30.65%), Positives = 58/124 (46.77%), Query Frame = 1
Query: 4969 IDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGT-------SDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            I    +N+ E  +LQ +   + D+  A +    T       +D     +P T  TP   R + L++ +    +K + + L+ G I PS S        VKKKDG+ R CIDYR LN +T +  Y
Sbjct:  464 IQINTINKTEDSELQHVPDAYHDLTEAFNKQKATKLPPHRDNDCAIELLPGT--TPPRGRIFPLSQPETEAMKKYISEELEKGFIRPSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVKYRY 585          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000006041.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:6:37317867:37322724:1 gene:ENSXETG00000003070.1 transcript:ENSXETT00000006041.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 102.834 bits (255), Expect = 4.259e-21
Identity = 105/400 (26.25%), Positives = 180/400 (45.00%), Query Frame = 1
Query: 4270 AMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAINMDKIPGRLW--PCIEEANMDVITCSKESVAVIGKIWSIIEYKHVKINT-----YLAVI-----RRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRIC------MLKSEVRCSE----------------KIVVPGR-TQMICYVKVNEETRGE-MIFEPNHKFEKKRELPLAREIV--------YVNNNSEIPINITNFDEEDKVIYENERLGK-----LTPMMDIKLPTTKTEGT-EEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            A+   L GP+  VP++I GV  +A++DTGA ++ +  D     L   P  +   +++   S+      G +   +E+      T      LAV+       L    ++GT+   +L+K ++     +EL+ K G         +L S VR  E                +++ PG+ T M   VK+  E  G  ++ E     +    LP   E++           N   + + + N   E   ++ +  LG+     L P+    L   K E T   EL+ + + ++    K +L+K L E   +F+ +DLD+G S  T+H I L +D P   R  R+A     +  K ++++  AG+I+ S S +  P+V+V+KK+G+ R C+DYR LN  T  D Y
Sbjct:    5 ALPNGLIGPSPIVPVQIEGVYSEALLDTGAQVTLLYRDFYKKYLSHIPLEKLEKLEIWGLSETKFPYDGYVSVKLEFSPTVAGTNEAVETLAVVCPRPPGALQNAVVVGTNT--DLIKRMLAP--QLELKVKKGASIHPLLQPVLSSLVRQKEEPSEGIGNVWYLQRKDRVIQPGKITCMRARVKICWENPGHHLVIEGGPGLD----LPFGVELIPEALPADCLKKNCGTVTVGLKNTTNEPVFLHSHSLLGRVYSASLVPVA--ALGRDKAEATVSAELFDLSNSVIPPEWKSRLRKHLNEHSALFSRNDLDMGCSTSTKHKIRLREDKPFRERSRRIAPGDLEDLRKHLEELKAAGIIKESRSPYASPIVVVRKKNGSIRMCVDYRTLNQRTIPDQY 394          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: anxa6 (annexin A6 [Source:Xenbase;Acc:XB-GENE-989741])

HSP 1 Score: 95.5153 bits (236), Expect = 5.152e-19
Identity = 48/115 (41.74%), Positives = 71/115 (61.74%), Query Frame = 1
Query: 4975 SQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            S  L+ ++++QL+K+L  +  +F+A+    G +   +H +      PI    YR+AEA + E + ++ +ML  GVI PS S W  PVV+V KKDG+ RFC+DYRRLN VT  D Y
Sbjct:  294 SDHLHLSQQDQLRKILRSYSPMFSANP---GRTHWAEHKVDTGTQLPIRSPAYRVAEAVRPEMKSQIDEMLAFGVITPSHSPWASPVVLVPKKDGSTRFCVDYRRLNDVTTTDAY 405          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000023941.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:KV463742.1:21:4282:-1 gene:ENSXETG00000015734.1 transcript:ENSXETT00000023941.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 94.3597 bits (233), Expect = 1.376e-18
Identity = 48/115 (41.74%), Positives = 71/115 (61.74%), Query Frame = 1
Query: 4975 SQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            S  L+ ++++QL+K+L  +  +F+A+    G +   +H +      PI    YR+AEA + E + ++ +ML  GVI PS S W  PVV+V KKDG+ RFC+DYRRLN VT  D Y
Sbjct: 1027 SDHLHLSQQDQLRKILRSYSPMFSANP---GRTHWAEHKVDTGTQLPIRSPAYRVAEAVRPEMKSQIDEMLAFGVITPSHSPWASPVVLVPKKDGSTRFCVDYRRLNDVTTTDAY 1138          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000015952.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:8:63280736:63282565:1 gene:ENSXETG00000009808.1 transcript:ENSXETT00000015952.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 89.7373 bits (221), Expect = 2.150e-17
Identity = 43/112 (38.39%), Positives = 70/112 (62.50%), Query Frame = 1
Query: 4984 LNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            ++E  K++L   L+E + +F+  ++D+G +  TQHTI L+D TP   RP R+    + + ++ +Q+M   G+I  S S +  P+V+V+KKDG+ R C+DYR LN  T  D Y
Sbjct:  381 VSEEWKQRLSNGLLERRQVFSTDEMDVGCAKSTQHTIRLSDSTPFRERPRRVPPKDREDLQRTLQEMKRRGIIADSRSPYASPIVIVRKKDGSIRLCVDYRTLNRRTVPDQY 492          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: alkbh5 (alkB homolog 5, RNA demethylase [Source:Xenbase;Acc:XB-GENE-987580])

HSP 1 Score: 83.9593 bits (206), Expect = 9.926e-16
Identity = 40/115 (34.78%), Positives = 71/115 (61.74%), Query Frame = 1
Query: 4975 SQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            ++ L  ++K+++++ L+  + IF+      G +++ +H I       + +RPYR+ EA++     E+++ML+  VIE S S W  P+V+V K DG+ RFC D+R+LN V+K D Y
Sbjct:  192 AETLTNSQKQEMREFLIRNRQIFSDQP---GLTEMIKHDIITGPGVKVNVRPYRIPEARRQAVAGEIKRMLELDVIEESHSEWSSPIVLVPKPDGSIRFCNDFRKLNEVSKFDAY 303          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P20825|POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 70.0922 bits (170), Expect = 1.726e-10
Identity = 59/202 (29.21%), Positives = 102/202 (50.50%), Query Frame = 1
Query: 4762 HKFEKKRELPLAREIVYVNNNSEIPINITN-----FDEEDKVI-YENERLGKLTPMMDIKLPTTKTEGTEEELWTID-SQL----LNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKK---DGTQRF--CIDYRRLNAVTKRDTY 5319
            H+F    ++ + R+++    N++  IN  N     FD+  K+I  E+ER   L      + P +     +E +  +D SQ     LN+ E  +L+ LL +F+++       L  ++  +H +  T ++PI  + Y LA+  + E E +VQ+ML+ G+I  S S +  P  +V KK    G  ++   IDYR+LN +T  D Y
Sbjct:   84 HRFSNNYDMLIGRKLL---KNAQSVINYKNDTVTLFDQTYKLITSESERNQNLYIQ---RTPESIASSDQESIKKLDFSQFRLDHLNQEETFKLKGLLNKFRNLEYKEGEKLTFTNTIKHVLNTTHNSPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRY 279          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P10394|POL4_DROME (Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster OX=7227 GN=POL PE=4 SV=1)

HSP 1 Score: 68.1662 bits (165), Expect = 7.153e-10
Identity = 90/387 (23.26%), Positives = 171/387 (44.19%), Query Frame = 1
Query: 4213 PRTSQRS----RSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAI--NMDKIPGRLWPCIEEAN-MDVITCSKESVAVIGKIWSIIEY-KHVKINTYLAVIRRLSADC--IIGTDLMPELLKEIIIDLGSME----LRDKTGRICMLKSEVRCS--EKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKRELPLAREIVYVNN----NSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQ------RFCIDYRRLN 5295
            PRT Q +     SV++D    TI     +    +       GV++  ++DTGA+IS +  N DK        I+  N +++    ++ +   G+ +  I+  K+V  + +  V +     C  IIG D + +      IDL   E    +R    +  +       S     ++P R+Q++  + V+  ++ + I  PN + +           +YV N    +S   + I N  + D+++  N    K  P+ +  +    +E   + + +   +   E  K QL+ +  E+ DIFA     +  +++ +  + L DD P+  + YR   +Q  E + +VQK++   ++EPS S +  P+++V KK          R  IDYR++N
Sbjct:   17 PRTGQLATAFRYSVEEDRRVYTINYNLNIFSTFIHAK---TGVKLVFLLDTGADISILKENSDKFSN-----IQITNKINIQGIGQQKIQSRGQTFIEIQTGKYVIPHDFHLVDKNFPIPCDGIIGIDFIKKY--NCQIDLNQEEDWFIIRPNNLKFPIYIPIAYSSGINTTLLPARSQVVRRLIVS--SKDDNILIPNQEIQTG---------IYVANTIATSSNTFVRILNTTDSDQLV--NMDTLKYEPLSNYNVVQANSEHRNKTVLSQLKKNFPELFKSQLENICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVEEIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQIN 380          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P04323|POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 65.855 bits (159), Expect = 4.187e-9
Identity = 43/128 (33.59%), Positives = 70/128 (54.69%), Query Frame = 1
Query: 4951 EEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKK---DGTQRF--CIDYRRLNAVTKRDTY 5319
            E +L+ ++   LN  EK++L  LL ++ DI       L  ++ T+HTI    + P+  + Y   +A + E E ++Q ML+ G+I  S S +  P+ +V KK    G Q+F   IDYR+LN +T  D +
Sbjct:  156 ESDLYRLEH--LNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTINTKHNLPLYSK-YSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRH 280          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|Q7LHG5|YI31B_YEAST (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 65.4698 bits (158), Expect = 5.788e-9
Identity = 33/66 (50.00%), Positives = 40/66 (60.61%), Query Frame = 1
Query: 5122 LRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            L+PY + E  + E  K VQK+LD   I PS S    PVV+V KKDGT R C+DYR LN  T  D +
Sbjct:  625 LQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPF 690          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|Q99315|YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 65.0846 bits (157), Expect = 6.341e-9
Identity = 33/66 (50.00%), Positives = 40/66 (60.61%), Query Frame = 1
Query: 5122 LRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            L+PY + E  + E  K VQK+LD   I PS S    PVV+V KKDGT R C+DYR LN  T  D +
Sbjct:  599 LQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPF 664          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A355ABF2 (Uncharacterized protein OS=Flavobacteriaceae bacterium OX=1871037 GN=DDZ39_05755 PE=4 SV=1)

HSP 1 Score: 415.616 bits (1067), Expect = 2.993e-114
Identity = 236/666 (35.44%), Positives = 384/666 (57.66%), Query Frame = 1
Query: 1861 KGMNTIITTILIWMMCIKGVKPFVVYDCDNIKIGDKYSLKETEECKAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGXXXXXXXXXXXXXXXGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHRETETNTQKSALDELR-RTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIDPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPV-DISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANIMKAMWGRFLIFGQMMAXXXXXXXXXXXXXXXXTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKYIHKIKRLVGILTKMNGTDEEVG 3852
            K  N +I  +L++ +  KG K    YDCDN K+G ++SL + EEC  A P + +     + Y++YQE   I +  KEC+VTR+     CGHHSHS II+  +    +++    CE+ F    I +A ++ + A+ G +    +   G I A DGTC+GGEY    + +K  +VIE+YQ+ ++K +  FDPDT  M     C A    C  G ++I+Y    + C L  LK+  FD ++G++            +D+L  R  +I +     TI S +TP V+I+  T+  +R ++   V KC + V  TNY  I +S   +  A  +I  K++ L +Y NNK D+LYH  L   E +Y +++ NDC LNREIL+TK+A+ +TNP+ A P++PL +G F R++GEV+ T+KC++   K+  +S      C +EL +++K K  + +PVTR++ P+        CS ++ P +++ D +W+  P+   +APPK F LT L +  +F  + D+ +SG+Y  K +E AR++LL+P  R  IL+ IV  + G+  G +PNYELLLSP+HF+ A  N+MK +WGRFLIFGQ +AG++GI  ++  ++  + Q+ + +++YK     +WK+ +G +P LA+  +       I+ +++        +  DEE+ 
Sbjct:  371 KVFNFVILMLLLFSLP-KG-KAIKAYDCDNAKLGTQFSLLDAEECPEAYPNQFEKRFNRI-YHIYQERGLIYSAAKECTVTRRRSIRWCGHHSHSAIIKQPSMIEYLNIGHRNCEDAFRLKEIRLADKLILKAKPGILIQEDIIKVGSI-AFDGTCEGGEYWYQGKKIKQALVIETYQITIKKSEVAFDPDTNEMMSRSYCSAKSNFCFDGKTSIIYDIRKQECNLVFLKTVNFDVIRGKIF------DNNLFIDKLNSRPKKIYR--TEKTINSHITPVVLIANKTSDAIRLVRKDQVVKCGQSVYLTNYDRIVVSTVRVKEAIIKISKKEINLAVYFNNKADYLYHHQLRQIEDLYREMIINDCKLNREILRTKMALIVTNPNIAAPVIPLGKGVFSRVLGEVLQTFKCKQVEVKINISS-----DCTHELPVIYKNKFMYLEPVTRLLLPDVIKVKKIKCSPLFSPAYKLSDNTWIVTPSLTPIAPPKRFQLTNLRNAVKFSRLKDLMKSGLYDKKAMEDARKYLLYPQVRDRILTEIVETSLGD--GNRPNYELLLSPNHFEKAAKNVMKKVWGRFLIFGQAIAGLMGIYYVVIFLKTFIEQVTSTYELYKI-MGFSWKLILGIIPCLARPFISKKLHDKINDVEKKYANRHTYSNRDEEIN 1016          

HSP 2 Score: 164.466 bits (415), Expect = 1.612e-36
Identity = 104/352 (29.55%), Positives = 182/352 (51.70%), Query Frame = 1
Query: 4267 IAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAINMDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKT-EGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            + ++  +  P + V  +ING    A+IDTG+  + +N + +  +    I+   + V   S   +  IG++   +   + KI++ L V++ L   CIIG   + +L K I+ D     L+     I +    +  +E   +P   +++  ++++      +IFEPN  F     L +A  +   +N S IP+   N+  ++  + + + +G +  +  +++   K  EG     +  D   LN  + + ++ +  ++  +FA  D DLG +++ +H IPL  D PI  RPY+ A A K E +K+V+ M    VI  S S W  P+VMVKKKDGT RFC+DYR+LN VT++DTY
Sbjct: 1075 VGLVDNVESPKILV--QINGRTRIALIDTGSGATLLNYEGLKPQYRRLIKPTCLKVQGMSGTILNPIGQLLCELTICNQKISSELVVVKNLPYPCIIGMSTLSKL-KGIVFDPSCNRLKTLNNEI-IDNMNIYVTEDTTIPKWNEVVMPMQIHCNDDKTVIFEPNDIFADTNCLSVATTVTNTDN-STIPVRFVNYSNDNIQLQQGQHIGTIHEIDVVEVNVAKVFEGGMPPNFPFD--YLNNEQNQAMENIFQKYSKVFATDDFDLGKTNIIKHFIPLDKDNPIKQRPYKAAYALKGEIKKQVEDMKHNKVIRNSFSPWASPIVMVKKKDGTMRFCVDYRKLNTVTRKDTY 1419          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A4Y2G3W0 (Retrovirus-related Pol polyprotein from transposon 412 OS=Araneus ventricosus OX=182803 GN=POL_952 PE=4 SV=1)

HSP 1 Score: 156.762 bits (395), Expect = 7.253e-38
Identity = 103/338 (30.47%), Positives = 183/338 (54.14%), Query Frame = 1
Query: 4342 MIDTGANISAINMDKIPGRLWPCIEEA-NMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVR----CS----EKIVVPGRTQMICYVKVNEETRGEMIFEPN--HKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDS-QLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            ++DTGAN++ +  D         I  A N+ + T + E   + GK+ + IE    K +  + V   ++  CI+G D + +      +DL   E+R     I +  + V+    CS    ++ ++P R++  C ++   E  G+  +     H    ++ + +A  +V +   + IP+ + N + + K++ + + +    P++DI     +  G +    T+++ Q+LNE ++  ++KLL EF+D+F+  D D+G  ++TQH I   D  PI   P RL  A+K EAE  V++M+D G+IE S   W  P+V+VKKKDG+ RFC+DYR+LN +TK+D+Y
Sbjct:    2 LVDTGANVTLLRTDLAQKLKEQLIYTAPNISLKTATGEKTEIRGKLDASIECGSRKFHHRIYVAD-ITDPCILGLDFLQKF--NFTVDLEKNEIRTGGEEIPLFSASVQHSKSCSVLVKKRTIIPARSE--CLIQGIPEVPGQFRYAVTDFHSQVPQKGVLVAATLVDLEREA-IPVRVLNLNNKPKILDKGDVIATCEPVVDIVARPQEFSGEQHLQSTLENLQILNEEQRIAVRKLLNEFQDLFSICDADVGRCNMTQHRINTGDHPPIKQYPRRLPLARKEEAEHLVKEMVDNGIIEESSGPWASPIVLVKKKDGSTRFCVDYRKLNEITKKDSY 333          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A4Y2CTA5 (Retrovirus-related Pol polyprotein from transposon 412 (Fragment) OS=Araneus ventricosus OX=182803 GN=POL_235 PE=4 SV=1)

HSP 1 Score: 162.925 bits (411), Expect = 1.018e-37
Identity = 116/383 (30.29%), Positives = 205/383 (53.52%), Query Frame = 1
Query: 4219 TSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIE--INGVEIKAMIDTGANISAINMDKIPGRLWPCIEEA-NMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVR----CS----EKIVVPGRTQMICYVKVNEETRGEMIFE----PNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDS-QLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            ++Q + S+++ PE      + +L+G    + +E  I G++   ++DTGANI+ +  D         I  A N+ + T + E   + GK+ + IE    K +  + V   ++  CI+G D + +      +DLG  E+R     I +  + V+    CS    ++ ++P R++  C ++   E  G+  +     PN   +K   + +A  +V +     IPI + N + + K++ + + +    P +DI     +  G +    T+++ Q+LNE ++  ++KLL EF+D+F+  D D+G  ++TQH I   D  PI   P RL  A+K +AE  V++M+D G+IE S   W  P+V+VKKKDG+ RFC+DYR+LN +TK+D+Y
Sbjct:   94 SNQENSSLNRAPEEG--PRISSLSGKKNGLYLEGSICGIQCWMLVDTGANITLLRTDMAQKWKEQLIYTAPNISLKTATGEKTEIRGKLDASIECGSRKFHHRIYVAD-ITDPCILGLDFLQKF--NFTVDLGKNEIRTGGEEIPLFSANVQHSKSCSILVKKRTIIPARSE--CLIQGIPEAPGQFRYAVTNFPNQASQKG--VLVAATLVDLEMEV-IPIRVLNLNNKPKILDKGDVIATCEPAVDIVARPQEFSGAQHLQSTLENLQILNEEQRIAVRKLLNEFQDLFSICDADIGRCNMTQHRINTGDHPPIKQYPKRLPLARKEKAEHLVKEMVDNGIIEESSGPWASPIVLVKKKDGSTRFCVDYRKLNEITKKDSY 466          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A4Y2R4I5 (Retrovirus-related Pol polyprotein from transposon 297 (Fragment) OS=Araneus ventricosus OX=182803 GN=pol_3438 PE=4 SV=1)

HSP 1 Score: 160.999 bits (406), Expect = 3.172e-37
Identity = 105/345 (30.43%), Positives = 183/345 (53.04%), Query Frame = 1
Query: 4318 INGVEIKAMIDTGANISAINMDKIPGRLWPCIEEA-NMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVR----CS----EKIVVPGRTQMICYVKVNEETRGEMIFE-PNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDS-QLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            I G++   ++DTGAN++ +  D         I  A N+ + T + E   + GK+ + IE    K +  + V   ++  CI+G D + +      +DL   E+R     I +  + V+    CS    ++ ++P R++  C ++   E  G+  +   N   +  ++  L    +       IP+ + N + + K++ + + +    P++DI     +  G +    T+++ Q+LNE ++  ++KLL EF+D+F+  D D+G  ++TQH I   D  PI   P RL  A+K EAE  VQ+M+D G+IE S   W  P+V+VKKKDG+ RFC+DYR+LN +TK+D+Y
Sbjct:   38 ICGIQCLMLVDTGANVTLLRTDLAQKLKEQLIYTAPNISLKTATGEKTEIRGKLDASIECGSRKFHHRIYVAD-ITDPCILGLDFLQKF--NFTVDLEKNEIRTGGEEIPLFSASVQHSKSCSVLAKKRTIIPARSE--CLIQGVPEVPGQFRYAVTNFPSQVSQKGVLVAATLVDLEMEAIPVRVLNLNNKPKILDKGDVIATCDPVVDIVARPQEFSGAQHLQSTLENLQILNEEQRTAVKKLLNEFQDLFSTCDADVGRCNMTQHRINTGDHPPIKQYPRRLPLARKEEAEHLVQEMVDNGIIEESSGPWASPIVLVKKKDGSTRFCVDYRKLNEITKKDSY 377          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A4Y2HSA1 (Retrovirus-related Pol polyprotein from transposon 412 OS=Araneus ventricosus OX=182803 GN=POL_851 PE=4 SV=1)

HSP 1 Score: 155.992 bits (393), Expect = 4.353e-37
Identity = 104/340 (30.59%), Positives = 184/340 (54.12%), Query Frame = 1
Query: 4342 MIDTGANISAINMDKIPGRLWPCIEEA-NMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVR----CS----EKIVVPGRTQMICYVKVNEETRGEMIFE----PNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDS-QLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            ++DTGAN++ +  D         I  A N+ + T + E   + GK+ + IE    K +  + V   ++  CI+G D + +      +DL   E+R     I +  + V+    CS    ++ ++P R++  C ++   E  G+  +     P+   +K   + +A  +V +   + IP+ + N + + K++ + + +    P++DI     +  G +    T+++ Q+LNE ++  ++KLL EF+D+F+  D D+G  ++TQH I   D  PI   P RL  A+K EAE  V++M+D G+IE S   W  P+V+VKKKDG+ RFC+DYR+LN +TK+D+Y
Sbjct:    2 LVDTGANVTLLRTDLAQKLKEQLIYTAPNISLKTATGEKTEIRGKLDASIECGSRKFHHRIYVAD-ITDPCILGLDFLQKF--NFTVDLEKNEIRTGGEEIPLFAASVQHSKSCSVLAKKRTIIPARSE--CLIQGVPEVPGQFRYAVTNIPSQVSQKG--VLVAASLVDLEMEA-IPVRVLNLNNKPKILDKGDVIATCDPVVDIVARPQEFSGAQHLQSTLENLQMLNEEQRTAIKKLLNEFQDLFSTCDADVGRCNMTQHRINTGDHPPIKQYPRRLPLARKEEAEHLVKEMVDNGIIEESSGPWASPIVLVKKKDGSTRFCVDYRKLNEITKKDSY 333          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000038088.1 (pep primary_assembly:Astyanax_mexicanus-2.0:23:13869500:13874588:-1 gene:ENSAMXG00000037864.1 transcript:ENSAMXT00000038088.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 107.457 bits (267), Expect = 8.514e-23
Identity = 75/255 (29.41%), Positives = 120/255 (47.06%), Query Frame = 1
Query: 4627 KTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMM--------DIKLPTTK---------------TEGTEEELWTIDSQL-LNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            + G +  + S  RC  ++ VP  +++I + +V     G+           +  + +AR I  V +   +P+ + N    D  I   ++LGKL  +         D+KL ++                T G+  E+  + +   L  +++  L +LL ++  +F+ S+ D G +D  QH IP  D  P   R   L      E    +  MLD GVI  S S W  PVVMV+KKDG+ RFC+DYR+LNAVT +D +
Sbjct:  154 RDGFVGNVWSASRC--RVRVPPGSEIIVWGRVRAGNNGQEYCGLVEPMPDQDAVGVARTIAKVRHG-RVPVRLCNVHPYDVFIGRFQKLGKLYEVQPADVHGQCDLKLSSSSEGVVEVSVIETVPVGTGGSTFEVSQLTTHTDLTTSQQAVLSQLLHKWSSVFSQSEEDFGCTDAIQHCIPTGDALPSRERFRPLPPNMYQEMRVLLADMLDKGVISESSSPWAAPVVMVRKKDGSWRFCVDYRKLNAVTHKDAF 405          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000052546.1 (pep primary_assembly:Astyanax_mexicanus-2.0:2:13031363:13036570:1 gene:ENSAMXG00000033629.1 transcript:ENSAMXT00000052546.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 105.916 bits (263), Expect = 2.836e-22
Identity = 51/117 (43.59%), Positives = 71/117 (60.68%), Query Frame = 1
Query: 4969 IDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            +++  L  ++ ++L  LL + KDIF+ SD D G +    H+IP  D  P+  R  R+      E +K VQ ++D G++E SCS W  P V+V KKDGT RFC DYRRLN VT +D Y
Sbjct:  674 VNADGLTSSQYQELMDLLEKHKDIFSKSDSDFGYTTAVTHSIPTGDAPPVKQRHRRVPPQVFQEFKKHVQSLVDRGILEESCSPWASPAVIVIKKDGTVRFCCDYRRLNQVTCKDAY 790          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000030217.1 (pep primary_assembly:Astyanax_mexicanus-2.0:16:12351169:12363564:1 gene:ENSAMXG00000035335.1 transcript:ENSAMXT00000030217.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 105.145 bits (261), Expect = 5.211e-22
Identity = 53/152 (34.87%), Positives = 92/152 (60.53%), Query Frame = 1
Query: 4864 DKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            ++VI+E ER G    +  ++  +   EG   +L  +    L+  ++++ + LL ++ ++F+  + D+G + + QH IPLTDD P+  R  RL  +Q  + +  +Q +L   V+  SCS +  P+V+V+KKDGT R C+DYR+LNA T++D Y
Sbjct:  253 ERVIFE-EREGNQENVAVMR--SLCAEGQPLDLSNLSWPTLSLAQEQEGKALLRQYSEVFSQGEGDIGCTTLIQHEIPLTDDAPVRQRYRRLPPSQYEQVKAHIQDLLQREVVRVSCSPYSSPIVVVQKKDGTLRLCVDYRQLNAKTRKDAY 401          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000041682.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02002320.1:24977:29035:1 gene:ENSAMXG00000038531.1 transcript:ENSAMXT00000041682.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 104.76 bits (260), Expect = 7.129e-22
Identity = 75/270 (27.78%), Positives = 128/270 (47.41%), Query Frame = 1
Query: 4585 KEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEE---TRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDSQLLNEN---------------------EKEQLQK-LLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            K+ + +   +E    TG  C+ +  VR    + VP  +  +     N           FEP H       L ++R ++ +++ S + + + N +  D  +     LG+L     +++  T + G  EEL+    Q++                        E+ QL K LL  +  +F+  + +LG + + +H IPL DDTP+  R  RL  +Q    +  +Q++LD  VI  SCS +  PVV+V+K+DGT R C+DYR+LN+ T++D Y
Sbjct:  130 KQALAECHRLECLPPTG--CLGQVTVRGRAAVRVPAGSVKLVVATCNNNFGAVLSSAFFEP-HSHTLPDGLLMSRALLSIDSGS-VAVPVVNVEHRDIWLPPRVTLGQL---FAVEMQPTLSTGKVEELFHCSEQVVAVQSLAVAEDFSDLTVGSWPTLTPEQSQLGKDLLQRYSSVFSQDEGELGCTHLIEHEIPLIDDTPVKQRYRRLPPSQYDLVKGHIQELLDRKVIRASCSPYSSPVVVVQKRDGTIRLCVDYRQLNSKTRKDAY 392          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000054058.1 (pep primary_assembly:Astyanax_mexicanus-2.0:19:20158532:20160786:-1 gene:ENSAMXG00000037637.1 transcript:ENSAMXT00000054058.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 103.219 bits (256), Expect = 7.545e-22
Identity = 68/222 (30.63%), Positives = 111/222 (50.00%), Query Frame = 1
Query: 4735 RGEMIFEPNHKFEKKRELPLAREIVYV-----NNNSEIPINITNFDEEDKVIYENERLGKLTPM----------MDIKLPTTKTEGTEEELW-TIDSQLLN-----------ENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            +G  + EP        +LPL   IV +      +  +IP+ + N   ED  +    R+G L+P+          +  +  +  TE    +L  + D Q ++           E E+ +L+ LL ++ D+FA  D +LG ++  +H I L DDTP+ L   R+   Q  E +  + ++L  GVI+ S S +  PVV+V+K DG+ R C+DYR+LN  TKRD +
Sbjct:  133 QGSWLLEPG-------KLPLPGGIVVMPSLVKEHRYQIPVQVVNLSREDVWLNPRTRIGVLSPVQCITNNQHCEVTFQRISANTEQVSVDLKESQDHQKVSDILSKLDIGGTEKEQAELRALLGKYSDVFAVGDDELGYTEKVKHEIVLVDDTPVNLPYRRIPPNQYKEVKDHISQLLRKGVIQESTSSYASPVVVVRKSDGSIRLCVDYRKLNLKTKRDAF 347          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000040305.1 (pep primary_assembly:ASM223467v1:1:25459511:25466649:-1 gene:ENSORLG00000028409.1 transcript:ENSORLT00000040305.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 117.472 bits (293), Expect = 9.887e-26
Identity = 107/430 (24.88%), Positives = 195/430 (45.35%), Query Frame = 1
Query: 4126 VERHVPIPRPATLRNIYPAPYEPNWADLIPRTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAINMDK---IPGRLWPCIEEANMDVITC-SKESVAVIGKIWSIIEY--KHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMI---C-YV-----KVNEETRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMM---DIKLPTTKTEGTEE--------------ELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
             ++H P PR       +P P  P  A+L+P T+  +          T+ +        + V I I+GV + A++D+GA    + +     IP  + P I  + +  +     E +AV+G++   +    + + +N  +A     S + I+G   +  L  +  +D GS E+     ++   K + + +  +V   RT ++   C Y+     +    T  +++  P   F  K  + +AR +V  + +  IPI + N       + +    G L P      ++L   K + + +              +L+      L E  + +L  LL  + D+F+A   DLG ++V QH I  T  + +  +P R++  ++  A+++VQ+ L+AG+   S S W  P+VMVKKKD + R C+DYR LN  T +D Y
Sbjct:  492 FQKHCPSPR-------WPMP-GPRKANLLPSTNGNN----------TVVIYAKSPEEEMCVQIMIHGVRMCALLDSGARRKVLPLHLFHLIPNGIQPPISPSTVHTLQGIGPEGLAVLGEVDLPVNVGCRVISVNFIIADTTE-STEVILGHPFL--LQTQACLDYGSKEITLFGEKVPPFKPDPQPATHLVRVARTTVLEAGCEYIVPGTSQAGFATSEDLMLSPAKAFVSKHHVMVARSVVQPSQSLCIPIRVFNPSTSPVTLKKGVVAGVLQPAQVVGKVELSRPKDQPSHDPGNSFSFSVPQHLTDLYEESCANLPEESRYRLAGLLRSYSDVFSAGSTDLGRTNVVQHDILTTPGSAVKQQPRRMSREKQEAADQQVQQSLEAGLARHSNSSWAAPIVMVKKKDQSPRLCVDYRPLNDRTIKDAY 900          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000036827.1 (pep primary_assembly:ASM223467v1:16:28613823:28617539:1 gene:ENSORLG00000023550.1 transcript:ENSORLT00000036827.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 114.005 bits (284), Expect = 8.124e-25
Identity = 63/162 (38.89%), Positives = 88/162 (54.32%), Query Frame = 1
Query: 4900 LTPMMDIKLPTTKTEGTE---------------------EELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQ-RFCIDYRRLNAVTKRDTY 5319
            L+P   I LPTT     E                     +E+W      L   +K++L K+L+E++DIFA S+ ++G + +  H I   D  PI  RP RL  A +  A+  +++ML  G+IEPS S W   VVMVKKK G + RFC+DYR LN VTK+D+Y
Sbjct:  180 LSPPYSIPLPTTPQPKIETQPLPSDQRNAVVEADRVTAVKEIWRRSCDGLQPGQKDELWKVLLEYRDIFALSEDEVGLTHLVHHEIDTGDARPIKTRPRRLPLAHQVAADSAIEEMLRGGIIEPSDSPWASGVVMVKKKKGPKMRFCVDYRPLNGVTKKDSY 341          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000044989.1 (pep primary_assembly:ASM223467v1:1:17572028:17576089:-1 gene:ENSORLG00000028051.1 transcript:ENSORLT00000044989.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 104.76 bits (260), Expect = 6.969e-22
Identity = 65/188 (34.57%), Positives = 100/188 (53.19%), Query Frame = 1
Query: 4831 IPINITNFDEEDKVIYENERLGKLT------PMMDIKLPTTKTEGTEEELWTIDSQ-----------LLNE-------NEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYR-LAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
             P+ + N  EED  +    RLG L+      P  ++     +     EE+ TI+ +           LL+         ++ +L KL+ ++ D+FA SD DLG +D  QH I LTDD P+T +PYR +   Q  E +  + ++L  GVI+ S S +  P+V+V+K D + R C+DYRRLNA T+RD +
Sbjct:  157 FPVQVVNLSEEDLWLSPKTRLGILSKVECVEPAAEVSF--NRISADHEEV-TIEQKEASVSEDAPHLLLDHLKIGGTTEQQTRLAKLISQYTDVFALSDEDLGYADRIQHEIHLTDDVPVT-QPYRRVPPTQYKEVKDHIAQLLRKGVIQESTSAYASPIVLVRKADNSLRLCVDYRRLNAKTRRDAF 340          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000045600.1 (pep primary_assembly:ASM223467v1:4:26167161:26171153:-1 gene:ENSORLG00000023514.1 transcript:ENSORLT00000045600.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 101.293 bits (251), Expect = 7.064e-21
Identity = 47/113 (41.59%), Positives = 77/113 (68.14%), Query Frame = 1
Query: 4984 LNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYR-LAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            L+E +KE+++ LL ++  +FA  DLD+G +++  H IPL D+TP+  +PYR +  +Q   A   +Q++L + VI  S S +  P+V+V+KKDG  R C+DYR+LNA T++D +
Sbjct:  297 LDERDKERVKALLSKYNRVFAKDDLDVGCTNLMTHEIPLLDETPVR-QPYRRIPPSQYELAHSHIQQLLQSQVIRESSSPYASPIVLVQKKDGGLRMCVDYRQLNARTRKDAF 408          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000040501.1 (pep primary_assembly:ASM223467v1:1:1251188:1255180:-1 gene:ENSORLG00000023024.1 transcript:ENSORLT00000040501.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 100.908 bits (250), Expect = 9.338e-21
Identity = 47/113 (41.59%), Positives = 77/113 (68.14%), Query Frame = 1
Query: 4984 LNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYR-LAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            L+E +KE+++ LL ++  +FA  DLD+G +++  H IPL D+TP+  +PYR +  +Q   A   +Q++L + VI  S S +  P+V+V+KKDG  R C+DYR+LNA T++D +
Sbjct:  297 LDERDKERVKALLSKYNRVFAKDDLDVGCTNLMTHEIPLLDETPVR-QPYRRIPPSQYELARSHIQQLLQSQVIRESSSPYASPIVLVQKKDGGLRMCVDYRQLNARTRKDAF 408          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000069191.1 (SMESG000069191.1)

HSP 1 Score: 2244.54 bits (5815), Expect = 0.000e+0
Identity = 1134/1240 (91.45%), Positives = 1159/1240 (93.47%), Query Frame = 1
Query: 1606 GLSDVQYDSDI--QKANSKIQPKRYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDDQTEMMQAKLKELLTEFETTINASDPKTARKGMNTIITTILIWMMCIKGVKPFVVYDCDNIKIGDKYSLKETEECKAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGXXXXXXXXXXXXXXXGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHRETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIDPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANIMKAMWGRFLIFGQMMAXXXXXXXXXXXXXXXXTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKYIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAGRTRIKRQKTSNWKRWMNLKXXXXXXXXXXENQGGAQMLTQEQLKQWEN*KTRHSEHYKNPPPVYECLKKAAKSVERNTPVERHVPIPRPATLRNIYPAPYEPNWADLIPRTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAINMDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            G++ + Y++ +  +K   +  P  YEFAITDIIRKVIEPFILTQGGWQAFINLVSKD+QTEMMQ KLKELLTEFETTINASDPKTARKGMN IITTILIWMMCIKGV+PFVVYDCDNIKIGDKYSLKETEEC AANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVS+TR+ECEEGFTTGRISIAGRV VAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQG VH ETETNTQ SALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEM                                     MDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFP RVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQH TAN+MKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSK IHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEA +TR+KRQKTSNWKRW+N K+D+SDNSDNDENQGG QMLTQEQLKQWEN KTRHS HY + PPVYECLKK AKSVERNTPVERHVPIPR  TLRNIYP PYEPNWADL+PRTSQRSRSVDQD ETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAI MDKIPGRLWPCI+EANMDVITCSKESVAVIGKIWSIIEYK+VKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKK+ELPLAREIVYVN+NSEIPINITNFDEEDKVIYENERLGKLTPMMDI+LP TKTEGTEEELWTIDSQLLNE EKEQ    LMEFKDIFAASDLDLGTSDVTQHTI LTDDTPITLRPYRLAEAQKS AEKEVQKMLDAGVIEPSCS WQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY
Sbjct: 1951 GMAYLMYNTIVIYRKRTPRYNPNGYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDNQTEMMQIKLKELLTEFETTINASDPKTARKGMNAIITTILIWMMCIKGVEPFVVYDCDNIKIGDKYSLKETEECIAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSITRNECEEGFTTGRISIAGRVQVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGGVHSETETNTQISALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEM-------------------------------------MDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPARVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHVTANMMKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKDIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAEKTRMKRQKTSNWKRWINFKDDSSDNSDNDENQGGTQMLTQEQLKQWENRKTRHSGHYNHRPPVYECLKKTAKSVERNTPVERHVPIPRSTTLRNIYPVPYEPNWADLVPRTSQRSRSVDQDLETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAIKMDKIPGRLWPCIKEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKLTPMMDIELPATKTEGTEEELWTIDSQLLNETEKEQ----LMEFKDIFAASDLDLGTSDVTQHTISLTDDTPITLRPYRLAEAQKSVAEKEVQKMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 3149          

HSP 2 Score: 1019.22 bits (2634), Expect = 0.000e+0
Identity = 527/553 (95.30%), Positives = 539/553 (97.47%), Query Frame = 2
Query:   11 EERHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKKYVTPQFVKDDTIYITIEVETEREMKLRIQQEQARPRISVLEKRIPGTKYLPTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMMPDFEPIEREFYITAKVSEVKKRIPYEPPFHLQYQGRSLDDDLEIGVTQMQLGLINVLTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIEPDTXXXXXXXXXXXXSSTGDEWDNWINERGATTSKETADEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTXXXXXXXXXXXXXXXWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLWAAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 1669
            E RHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKK+VTPQFVKDDTIYITIEVETEREMK+RIQQEQARPRISVLEKRIPGT+YLPTDIDD VEEGLTRLTETQKCRELAAILRTGAKNQRRTVT+LAKWII+SCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIM PDFEPIEREFYITAKVSEVKKR PYE PFHLQY+GRSLDDDLEIGVTQMQLGLIN+LTVAKIAGRELNKREIKTKTP+GRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLF TE+END+ITDE DELVC QRQPVEIIEPDTEEENKKKERRRR+STGDEWDNWINERGA TS+E  DEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLI DMLGLANPEIKRNLKLNNIG EAKQLW AIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN
Sbjct: 1421 EGRHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKKHVTPQFVKDDTIYITIEVETEREMKIRIQQEQARPRISVLEKRIPGTRYLPTDIDDSVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTELAKWIINSCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMTPDFEPIEREFYITAKVSEVKKRKPYELPFHLQYEGRSLDDDLEIGVTQMQLGLINILTVAKIAGRELNKREIKTKTPLGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLFTTENENDMITDELDELVCAQRQPVEIIEPDTEEENKKKERRRRNSTGDEWDNWINERGAITSREMTDEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLIADMLGLANPEIKRNLKLNNIGTEAKQLWVAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 1973          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000066529.1 (SMESG000066529.1)

HSP 1 Score: 2244.16 bits (5814), Expect = 0.000e+0
Identity = 1123/1240 (90.56%), Positives = 1154/1240 (93.06%), Query Frame = 1
Query: 1606 GLSDVQYDSDI--QKANSKIQPKRYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDDQTEMMQAKLKELLTEFETTINASDPKTARKGMNTIITTILIWMMCIKGVKPFVVYDCDNIKIGDKYSLKETEECKAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGXXXXXXXXXXXXXXXGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHRETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIDPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANIMKAMWGRFLIFGQMMAXXXXXXXXXXXXXXXXTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKYIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAGRTRIKRQKTSNWKRWMNLKXXXXXXXXXXENQGGAQMLTQEQLKQWEN*KTRHSEHYKNPPPVYECLKKAAKSVERNTPVERHVPIPRPATLRNIYPAPYEPNWADLIPRTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAINMDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            G++ + Y++ +  +K   +  P  YEFAITDIIRKVIEPFILTQGGWQAFINLVSKDDQTE+MQ KLKELLTEFETTINASDPKTARKGMN IITTILIWMMC+KGV+PFVVYDCDNIKIGDKYSLKETEECKAANPG+LQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSVTR+ECEEGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVH ETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQI PKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVD+SQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGG                        I++      +I  QM                     LACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSK IHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEA +TR+KRQKTSNWKRW+N K+D+SDNSDNDENQGG +MLTQEQLKQWEN KTRHS HY +PPPVYECLKKAA SVERN PV+R VPIPRP TLRNIYP PYEPNWADL+ RTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAI MDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYK+VKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSE+RCSEKIVVPGRTQMICYVKVN ETRGEMIFEPNHKFEKK+ELPLAREIVYVN+NSEIPINITNFDEEDKVIYENERLGKLTPMMDI+LPTTK EGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCS WQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY
Sbjct:  530 GMAYLMYNTIVIYRKRTPRYNPNGYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDDQTEIMQTKLKELLTEFETTINASDPKTARKGMNAIITTILIWMMCVKGVEPFVVYDCDNIKIGDKYSLKETEECKAANPGKLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSVTRNECEEGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHSETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIYPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVDVSQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGI---------------LGIILIIQIVR------VILTQM---------------------LACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKDIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAEKTRMKRQKTSNWKRWINFKDDSSDNSDNDENQGGTRMLTQEQLKQWENRKTRHSGHYNHPPPVYECLKKAASSVERNMPVDRQVPIPRPTTLRNIYPVPYEPNWADLVSRTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAIKMDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSELRCSEKIVVPGRTQMICYVKVNGETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKLTPMMDIELPTTKAEGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 1727          

HSP 2 Score: 1022.69 bits (2643), Expect = 0.000e+0
Identity = 524/547 (95.80%), Positives = 533/547 (97.44%), Query Frame = 2
Query:   29 KLGVDPRSMISQKPKSTENVAGGVPQFLLKKYVTPQFVKDDTIYITIEVETEREMKLRIQQEQARPRISVLEKRIPGTKYLPTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMMPDFEPIEREFYITAKVSEVKKRIPYEPPFHLQYQGRSLDDDLEIGVTQMQLGLINVLTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIEPDTXXXXXXXXXXXXSSTGDEWDNWINERGATTSKETADEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTXXXXXXXXXXXXXXXWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLWAAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 1669
            KLGVDPRSMISQKPKSTENVAGGVP+FLLKKYVTPQFVKDDTIYITIEVETEREMK+RIQQEQARPRI+VLEKRIPG KYLPTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSC+RVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIM PDFEPIEREFYITAKVSEVKKRIPYEPPFHLQY+GRSLDDDLEIGVTQMQLGLIN+LTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEE TRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIEPDT  E +KK + RRSSTGDEWDNWINERGATTSKET DEQPKRKETGT  EKK RGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLW AIKPIINKI+DNKVRTTTKDPEELLLIINHLTKR DDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN
Sbjct:    8 KLGVDPRSMISQKPKSTENVAGGVPRFLLKKYVTPQFVKDDTIYITIEVETEREMKIRIQQEQARPRINVLEKRIPGIKYLPTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSCSRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMTPDFEPIEREFYITAKVSEVKKRIPYEPPFHLQYEGRSLDDDLEIGVTQMQLGLINILTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEEKTRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIEPDT--EEEKKGKGRRSSTGDEWDNWINERGATTSKETTDEQPKRKETGTVTEKKPRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLWTAIKPIINKIIDNKVRTTTKDPEELLLIINHLTKRFDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 552          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000069191.1 (SMESG000069191.1)

HSP 1 Score: 2244.16 bits (5814), Expect = 0.000e+0
Identity = 1134/1240 (91.45%), Positives = 1159/1240 (93.47%), Query Frame = 1
Query: 1606 GLSDVQYDSDI--QKANSKIQPKRYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDDQTEMMQAKLKELLTEFETTINASDPKTARKGMNTIITTILIWMMCIKGVKPFVVYDCDNIKIGDKYSLKETEECKAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGXXXXXXXXXXXXXXXGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHRETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIDPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANIMKAMWGRFLIFGQMMAXXXXXXXXXXXXXXXXTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKYIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAGRTRIKRQKTSNWKRWMNLKXXXXXXXXXXENQGGAQMLTQEQLKQWEN*KTRHSEHYKNPPPVYECLKKAAKSVERNTPVERHVPIPRPATLRNIYPAPYEPNWADLIPRTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAINMDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            G++ + Y++ +  +K   +  P  YEFAITDIIRKVIEPFILTQGGWQAFINLVSKD+QTEMMQ KLKELLTEFETTINASDPKTARKGMN IITTILIWMMCIKGV+PFVVYDCDNIKIGDKYSLKETEEC AANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVS+TR+ECEEGFTTGRISIAGRV VAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQG VH ETETNTQ SALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEM                                     MDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFP RVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQH TAN+MKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSK IHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEA +TR+KRQKTSNWKRW+N K+D+SDNSDNDENQGG QMLTQEQLKQWEN KTRHS HY + PPVYECLKK AKSVERNTPVERHVPIPR  TLRNIYP PYEPNWADL+PRTSQRSRSVDQD ETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAI MDKIPGRLWPCI+EANMDVITCSKESVAVIGKIWSIIEYK+VKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKK+ELPLAREIVYVN+NSEIPINITNFDEEDKVIYENERLGKLTPMMDI+LP TKTEGTEEELWTIDSQLLNE EKEQ    LMEFKDIFAASDLDLGTSDVTQHTI LTDDTPITLRPYRLAEAQKS AEKEVQKMLDAGVIEPSCS WQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY
Sbjct: 1971 GMAYLMYNTIVIYRKRTPRYNPNGYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDNQTEMMQIKLKELLTEFETTINASDPKTARKGMNAIITTILIWMMCIKGVEPFVVYDCDNIKIGDKYSLKETEECIAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSITRNECEEGFTTGRISIAGRVQVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGGVHSETETNTQISALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEM-------------------------------------MDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPARVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHVTANMMKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKDIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAEKTRMKRQKTSNWKRWINFKDDSSDNSDNDENQGGTQMLTQEQLKQWENRKTRHSGHYNHRPPVYECLKKTAKSVERNTPVERHVPIPRSTTLRNIYPVPYEPNWADLVPRTSQRSRSVDQDLETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAIKMDKIPGRLWPCIKEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKLTPMMDIELPATKTEGTEEELWTIDSQLLNETEKEQ----LMEFKDIFAASDLDLGTSDVTQHTISLTDDTPITLRPYRLAEAQKSVAEKEVQKMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 3169          

HSP 2 Score: 1018.84 bits (2633), Expect = 0.000e+0
Identity = 527/553 (95.30%), Positives = 539/553 (97.47%), Query Frame = 2
Query:   11 EERHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKKYVTPQFVKDDTIYITIEVETEREMKLRIQQEQARPRISVLEKRIPGTKYLPTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMMPDFEPIEREFYITAKVSEVKKRIPYEPPFHLQYQGRSLDDDLEIGVTQMQLGLINVLTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIEPDTXXXXXXXXXXXXSSTGDEWDNWINERGATTSKETADEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTXXXXXXXXXXXXXXXWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLWAAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 1669
            E RHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKK+VTPQFVKDDTIYITIEVETEREMK+RIQQEQARPRISVLEKRIPGT+YLPTDIDD VEEGLTRLTETQKCRELAAILRTGAKNQRRTVT+LAKWII+SCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIM PDFEPIEREFYITAKVSEVKKR PYE PFHLQY+GRSLDDDLEIGVTQMQLGLIN+LTVAKIAGRELNKREIKTKTP+GRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLF TE+END+ITDE DELVC QRQPVEIIEPDTEEENKKKERRRR+STGDEWDNWINERGA TS+E  DEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLI DMLGLANPEIKRNLKLNNIG EAKQLW AIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN
Sbjct: 1441 EGRHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKKHVTPQFVKDDTIYITIEVETEREMKIRIQQEQARPRISVLEKRIPGTRYLPTDIDDSVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTELAKWIINSCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMTPDFEPIEREFYITAKVSEVKKRKPYELPFHLQYEGRSLDDDLEIGVTQMQLGLINILTVAKIAGRELNKREIKTKTPLGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLFTTENENDMITDELDELVCAQRQPVEIIEPDTEEENKKKERRRRNSTGDEWDNWINERGAITSREMTDEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLIADMLGLANPEIKRNLKLNNIGTEAKQLWVAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 1993          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000069191.1 (SMESG000069191.1)

HSP 1 Score: 2243.39 bits (5812), Expect = 0.000e+0
Identity = 1134/1240 (91.45%), Positives = 1159/1240 (93.47%), Query Frame = 1
Query: 1606 GLSDVQYDSDI--QKANSKIQPKRYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDDQTEMMQAKLKELLTEFETTINASDPKTARKGMNTIITTILIWMMCIKGVKPFVVYDCDNIKIGDKYSLKETEECKAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGXXXXXXXXXXXXXXXGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHRETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIDPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANIMKAMWGRFLIFGQMMAXXXXXXXXXXXXXXXXTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKYIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAGRTRIKRQKTSNWKRWMNLKXXXXXXXXXXENQGGAQMLTQEQLKQWEN*KTRHSEHYKNPPPVYECLKKAAKSVERNTPVERHVPIPRPATLRNIYPAPYEPNWADLIPRTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAINMDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            G++ + Y++ +  +K   +  P  YEFAITDIIRKVIEPFILTQGGWQAFINLVSKD+QTEMMQ KLKELLTEFETTINASDPKTARKGMN IITTILIWMMCIKGV+PFVVYDCDNIKIGDKYSLKETEEC AANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVS+TR+ECEEGFTTGRISIAGRV VAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQG VH ETETNTQ SALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEM                                     MDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFP RVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQH TAN+MKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSK IHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEA +TR+KRQKTSNWKRW+N K+D+SDNSDNDENQGG QMLTQEQLKQWEN KTRHS HY + PPVYECLKK AKSVERNTPVERHVPIPR  TLRNIYP PYEPNWADL+PRTSQRSRSVDQD ETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAI MDKIPGRLWPCI+EANMDVITCSKESVAVIGKIWSIIEYK+VKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKK+ELPLAREIVYVN+NSEIPINITNFDEEDKVIYENERLGKLTPMMDI+LP TKTEGTEEELWTIDSQLLNE EKEQ    LMEFKDIFAASDLDLGTSDVTQHTI LTDDTPITLRPYRLAEAQKS AEKEVQKMLDAGVIEPSCS WQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY
Sbjct: 1949 GMAYLMYNTIVIYRKRTPRYNPNGYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDNQTEMMQIKLKELLTEFETTINASDPKTARKGMNAIITTILIWMMCIKGVEPFVVYDCDNIKIGDKYSLKETEECIAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSITRNECEEGFTTGRISIAGRVQVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGGVHSETETNTQISALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEM-------------------------------------MDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPARVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHVTANMMKAMWGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKDIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAEKTRMKRQKTSNWKRWINFKDDSSDNSDNDENQGGTQMLTQEQLKQWENRKTRHSGHYNHRPPVYECLKKTAKSVERNTPVERHVPIPRSTTLRNIYPVPYEPNWADLVPRTSQRSRSVDQDLETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAIKMDKIPGRLWPCIKEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKLTPMMDIELPATKTEGTEEELWTIDSQLLNETEKEQ----LMEFKDIFAASDLDLGTSDVTQHTISLTDDTPITLRPYRLAEAQKSVAEKEVQKMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 3147          

HSP 2 Score: 1019.61 bits (2635), Expect = 0.000e+0
Identity = 527/553 (95.30%), Positives = 539/553 (97.47%), Query Frame = 2
Query:   11 EERHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKKYVTPQFVKDDTIYITIEVETEREMKLRIQQEQARPRISVLEKRIPGTKYLPTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMMPDFEPIEREFYITAKVSEVKKRIPYEPPFHLQYQGRSLDDDLEIGVTQMQLGLINVLTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIEPDTXXXXXXXXXXXXSSTGDEWDNWINERGATTSKETADEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTXXXXXXXXXXXXXXXWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLWAAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 1669
            E RHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKK+VTPQFVKDDTIYITIEVETEREMK+RIQQEQARPRISVLEKRIPGT+YLPTDIDD VEEGLTRLTETQKCRELAAILRTGAKNQRRTVT+LAKWII+SCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIM PDFEPIEREFYITAKVSEVKKR PYE PFHLQY+GRSLDDDLEIGVTQMQLGLIN+LTVAKIAGRELNKREIKTKTP+GRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLF TE+END+ITDE DELVC QRQPVEIIEPDTEEENKKKERRRR+STGDEWDNWINERGA TS+E  DEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLI DMLGLANPEIKRNLKLNNIG EAKQLW AIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN
Sbjct: 1419 EGRHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKKHVTPQFVKDDTIYITIEVETEREMKIRIQQEQARPRISVLEKRIPGTRYLPTDIDDSVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTELAKWIINSCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMTPDFEPIEREFYITAKVSEVKKRKPYELPFHLQYEGRSLDDDLEIGVTQMQLGLINILTVAKIAGRELNKREIKTKTPLGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLFTTENENDMITDELDELVCAQRQPVEIIEPDTEEENKKKERRRRNSTGDEWDNWINERGAITSREMTDEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLIADMLGLANPEIKRNLKLNNIGTEAKQLWVAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 1971          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000066529.1 (SMESG000066529.1)

HSP 1 Score: 2242.62 bits (5810), Expect = 0.000e+0
Identity = 1123/1240 (90.56%), Positives = 1154/1240 (93.06%), Query Frame = 1
Query: 1606 GLSDVQYDSDI--QKANSKIQPKRYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDDQTEMMQAKLKELLTEFETTINASDPKTARKGMNTIITTILIWMMCIKGVKPFVVYDCDNIKIGDKYSLKETEECKAANPGRLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGXXXXXXXXXXXXXXXGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHRETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIDPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANIMKAMWGRFLIFGQMMAXXXXXXXXXXXXXXXXTQMLACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKYIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAGRTRIKRQKTSNWKRWMNLKXXXXXXXXXXENQGGAQMLTQEQLKQWEN*KTRHSEHYKNPPPVYECLKKAAKSVERNTPVERHVPIPRPATLRNIYPAPYEPNWADLIPRTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAINMDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKVIYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 5319
            G++ + Y++ +  +K   +  P  YEFAITDIIRKVIEPFILTQGGWQAFINLVSKDDQTE+MQ KLKELLTEFETTINASDPKTARKGMN IITTILIWMMC+KGV+PFVVYDCDNIKIGDKYSLKETEECKAANPG+LQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSVTR+ECEEGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVH ETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQI PKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVD+SQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGG                        I++      +I  QM                     LACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSK IHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEA +TR+KRQKTSNWKRW+N K+D+SDNSDNDENQGG +MLTQEQLKQWEN KTRHS HY +PPPVYECLKKAA SVERN PV+R VPIPRP TLRNIYP PYEPNWADL+ RTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAI MDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYK+VKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSE+RCSEKIVVPGRTQMICYVKVN ETRGEMIFEPNHKFEKK+ELPLAREIVYVN+NSEIPINITNFDEEDKVIYENERLGKLTPMMDI+LPTTK EGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCS WQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY
Sbjct:  530 GMAYLMYNTIVIYRKRTPRYNPNGYEFAITDIIRKVIEPFILTQGGWQAFINLVSKDDQTEIMQTKLKELLTEFETTINASDPKTARKGMNAIITTILIWMMCVKGVEPFVVYDCDNIKIGDKYSLKETEECKAANPGKLQTTATAVSYNVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSVTRNECEEGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTCRLTLLKSAIFDEVQGRVHSETETNTQKSALDELRRTGRIGKPTITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAKAQIYPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVDVSQSGIYQTKDLERAREHLLFPAQRQAILSNIVTITGGI---------------LGIILIIQIVR------VILTQM---------------------LACFDIYKRERKINWKMAIGFLPFLAKTMVLHGHSKDIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLEAEKTRMKRQKTSNWKRWINFKDDSSDNSDNDENQGGTRMLTQEQLKQWENRKTRHSGHYNHPPPVYECLKKAASSVERNMPVDRQVPIPRPTTLRNIYPVPYEPNWADLVSRTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKAMIDTGANISAIKMDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEYKNVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICMLKSELRCSEKIVVPGRTQMICYVKVNGETRGEMIFEPNHKFEKKKELPLAREIVYVNDNSEIPINITNFDEEDKVIYENERLGKLTPMMDIELPTTKAEGTEEELWTIDSQLLNENEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSPWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY 1727          

HSP 2 Score: 1021.92 bits (2641), Expect = 0.000e+0
Identity = 524/547 (95.80%), Positives = 533/547 (97.44%), Query Frame = 2
Query:   29 KLGVDPRSMISQKPKSTENVAGGVPQFLLKKYVTPQFVKDDTIYITIEVETEREMKLRIQQEQARPRISVLEKRIPGTKYLPTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMMPDFEPIEREFYITAKVSEVKKRIPYEPPFHLQYQGRSLDDDLEIGVTQMQLGLINVLTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIEPDTXXXXXXXXXXXXSSTGDEWDNWINERGATTSKETADEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTXXXXXXXXXXXXXXXWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLWAAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 1669
            KLGVDPRSMISQKPKSTENVAGGVP+FLLKKYVTPQFVKDDTIYITIEVETEREMK+RIQQEQARPRI+VLEKRIPG KYLPTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSC+RVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIM PDFEPIEREFYITAKVSEVKKRIPYEPPFHLQY+GRSLDDDLEIGVTQMQLGLIN+LTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEE TRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIEPDT  E +KK + RRSSTGDEWDNWINERGATTSKET DEQPKRKETGT  EKK RGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLW AIKPIINKI+DNKVRTTTKDPEELLLIINHLTKR DDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN
Sbjct:    8 KLGVDPRSMISQKPKSTENVAGGVPRFLLKKYVTPQFVKDDTIYITIEVETEREMKIRIQQEQARPRINVLEKRIPGIKYLPTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSCSRVENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMTPDFEPIEREFYITAKVSEVKKRIPYEPPFHLQYEGRSLDDDLEIGVTQMQLGLINILTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEEKTRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIEPDT--EEEKKGKGRRSSTGDEWDNWINERGATTSKETTDEQPKRKETGTVTEKKPRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLWTAIKPIINKIIDNKVRTTTKDPEELLLIINHLTKRFDDPNRYLVGMAYLMYNTIVIYRKRTPRYNPN 552          
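
The percentages in each HSP summary line follow directly from the match counts over the aligned length. The sketch below is a minimal illustration, not part of the database's own tooling: it recomputes them for the HSP 2 summary line above. The names `FIELD`, `parse_hsp_fractions` and `percent` are hypothetical helpers introduced only for this example.

```python
import re

# Matches fields such as "Identity = 524/547 (95.80%)" in a BLAST HSP summary line.
# Fields without a fraction (e.g. "Query Frame = 2") are ignored.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_fractions(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in FIELD.findall(line)
    }


def percent(matches, aligned_length):
    """Recompute a percentage from the raw counts, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Summary line of HSP 2 against SMESG000066529.1, copied from the record above.
hsp_line = "Identity = 524/547 (95.80%), Positives = 533/547 (97.44%), Query Frame = 2"
fields = parse_hsp_fractions(hsp_line)
for label, (num, den, reported) in fields.items():
    print(label, percent(num, den), reported)
```

Because the pattern captures whatever label precedes the fraction, it does not depend on the exact spelling of the field names in the exported page.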
The following BLAST results are available for this feature:
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 0
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 5
Match Name   E-value    Identity (%)  Description
BX511082.1   7.444e-7   40.54         pep chromosome:GRCz11:9:14291932:14297132:1 gene:E...
BX546500.1   1.494e-6   39.19         pep chromosome:GRCz11:23:12926092:12931693:-1 gene...
CR925755.2   1.506e-6   39.19         pep chromosome:GRCz11:17:42486740:42492668:-1 gene...
BX511224.1   1.550e-6   39.19         pep chromosome:GRCz11:2:18017000:18022765:1 gene:E...
CR749164.1   4.245e-6   30.65         pep chromosome:GRCz11:21:30355767:30361679:1 gene:...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 5
Match Name             E-value     Identity (%)  Description
ENSXETT00000006041.1   4.259e-21   26.25         pep primary_assembly:Xenopus_tropicalis_v9.1:6:373...
anxa6                  5.152e-19   41.74         annexin A6 [Source:Xenbase;Acc:XB-GENE-989741]
ENSXETT00000023941.1   1.376e-18   41.74         pep primary_assembly:Xenopus_tropicalis_v9.1:KV463...
ENSXETT00000015952.1   2.150e-17   38.39         pep primary_assembly:Xenopus_tropicalis_v9.1:8:632...
alkbh5                 9.926e-16   34.78         alkB homolog 5, RNA demethylase [Source:Xenbase;Ac...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 0
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 5
Match Name              E-value     Identity (%)  Description
sp|P20825|POL2_DROME    1.726e-10   29.21         Retrovirus-related Pol polyprotein from transposon...
sp|P10394|POL4_DROME    7.153e-10   23.26         Retrovirus-related Pol polyprotein from transposon...
sp|P04323|POL3_DROME    4.187e-9    33.59         Retrovirus-related Pol polyprotein from transposon...
sp|Q7LHG5|YI31B_YEAST   5.788e-9    50.00         Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomy...
sp|Q99315|YG31B_YEAST   6.341e-9    50.00         Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomy...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match Name   E-value      Identity (%)  Description
A0A355ABF2   2.993e-114   35.44         Uncharacterized protein OS=Flavobacteriaceae bacte...
A0A4Y2G3W0   7.253e-38    30.47         Retrovirus-related Pol polyprotein from transposon...
A0A4Y2CTA5   1.018e-37    30.29         Retrovirus-related Pol polyprotein from transposon...
A0A4Y2R4I5   3.172e-37    30.43         Retrovirus-related Pol polyprotein from transposon...
A0A4Y2HSA1   4.353e-37    30.59         Retrovirus-related Pol polyprotein from transposon...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 5
Match Name             E-value     Identity (%)  Description
ENSAMXT00000038088.1   8.514e-23   29.41         pep primary_assembly:Astyanax_mexicanus-2.0:23:138...
ENSAMXT00000052546.1   2.836e-22   43.59         pep primary_assembly:Astyanax_mexicanus-2.0:2:1303...
ENSAMXT00000030217.1   5.211e-22   34.87         pep primary_assembly:Astyanax_mexicanus-2.0:16:123...
ENSAMXT00000041682.1   7.129e-22   27.78         pep primary_assembly:Astyanax_mexicanus-2.0:APWO02...
ENSAMXT00000054058.1   7.545e-22   30.63         pep primary_assembly:Astyanax_mexicanus-2.0:19:201...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 0
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 0
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 5
Match Name             E-value     Identity (%)  Description
ENSORLT00000040305.1   9.887e-26   24.88         pep primary_assembly:ASM223467v1:1:25459511:254666...
ENSORLT00000036827.1   8.124e-25   38.89         pep primary_assembly:ASM223467v1:16:28613823:28617...
ENSORLT00000044989.1   6.969e-22   34.57         pep primary_assembly:ASM223467v1:1:17572028:175760...
ENSORLT00000045600.1   7.064e-21   41.59         pep primary_assembly:ASM223467v1:4:26167161:261711...
ENSORLT00000040501.1   9.338e-21   41.59         pep primary_assembly:ASM223467v1:1:1251188:1255180...
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match Name         E-value    Identity (%)  Description
SMESG000069191.1   0.000e+0   91.45         SMESG000069191.1
SMESG000066529.1   0.000e+0   90.56         SMESG000066529.1
SMESG000069191.1   0.000e+0   91.45         SMESG000069191.1
SMESG000069191.1   0.000e+0   91.45         SMESG000069191.1
SMESG000066529.1   0.000e+0   90.56         SMESG000066529.1
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30035617 ID=SMED30035617|Name=Transposon Ty3-I Gag-Pol polyprotein|organism=Schmidtea mediterranea sexual|type=transcript|length=5321bp
TTCCGATCTGGAAGAGCGACACAGCGAAAAATTGGGAGTAGACCCAAGAT
CAATGATAAGTCAGAAGCCAAAATCCACGGAAAATGTCGCAGGAGGAGTA
CCACAATTTCTATTGAAAAAATACGTAACACCTCAATTCGTAAAAGATGA
TACGATATACATCACGATTGAAGTAGAAACAGAACGAGAAATGAAATTGC
GAATCCAACAAGAACAAGCGAGACCGCGAATCAGCGTACTCGAGAAACGA
ATACCAGGAACAAAATACTTGCCCACAGATATAGACGACCCTGTGGAGGA
AGGATTAACAAGATTAACGGAAACGCAAAAATGTAGAGAGTTAGCAGCGA
TATTGAGAACCGGAGCCAAGAATCAAAGACGAACTGTAACAGATTTAGCA
AAATGGATAATCGATTCATGTAACAGAGTAGAAAACCTGCGAAATATAGT
AAAGGAAATAGAGGAGTATCGGAGAATGTATGCGTGGTACCCAGAGCAAT
TCCGAGAAATTATCCGAAAATGGAAACATACCGAATGGGCAGCCGAAACG
ATAGAGAAGTATAGTTTAAAAATTATGATGCCAGATTTCGAACCAATCGA
AAGAGAATTCTACATCACTGCCAAAGTGAGTGAAGTGAAAAAGAGAATAC
CGTATGAACCGCCATTTCATTTACAATACCAAGGACGATCATTAGACGAC
GATCTCGAAATCGGAGTTACGCAAATGCAATTGGGATTAATAAATGTATT
AACAGTAGCAAAGATCGCGGGTAGAGAACTCAATAAAAGAGAAATCAAAA
CGAAAACGCCAATAGGGCGACAGGAAAGTCTAAATACGCCAGAAGTTCCA
CCTGCAGCAGATGAAGTAATCGTCATCACAACAGGAGATGAAGAGATACT
AAATGATCTAAATGATATTGTGGACGAGGAAACAACCCGGAGAATACCGG
AATCAAGAGATTTGTTCATGACAGAAGATGAAAACGATCTGATAACAGAT
GAACCAGACGAGCTAGTATGTGGTCAAAGACAACCAGTCGAAATTATAGA
GCCAGATACCGAAGAGGAAAATAAAAAGAAAGAGAGAAGACGCCGAAGCT
CGACAGGAGACGAATGGGATAATTGGATAAATGAACGAGGAGCTACAACA
TCCAAAGAAACGGCAGACGAACAACCCAAACGAAAAGAAACAGGTACGGC
AATGGAAAAGAAATCCAGAGGACGACCAAAAATTGATAAACCAGCTCCAG
AAAGGAAAACTAAAGAACAAATGCAGATGGATCAATTGACAACGAAATTG
AACAAGCTCAGCCAAGTAACGAAGCAGTTACAGGGAAAACTGAAAGGCCA
AGAACAAGGAGCAAAGTGGATCAACAGACTAATAACTGATATGCTGGGAT
TAGCGAATCCAGAAATAAAGAGAAATTTGAAACTTAATAATATCGGAGCA
GAAGCGAAACAGCTTTGGGCAGCAATAAAACCAATAATAAATAAAATAAT
GGATAATAAAGTACGAACCACAACTAAAGATCCAGAGGAATTATTGTTGA
TAATAAATCACTTAACGAAACGATCTGATGACCCAAATAGATATTTGGTA
GGAATGGCCTATCTGATGTACAATACGATAGTGATATACAGAAAGCGAAC
TCCAAGATACAACCCAAACGATATGAATTCGCAATAACGGATATTATCAG
AAAAGTAATCGAACCATTTATCTTAACTCAAGGCGGATGGCAAGCTTTTA
TAAATTTGGTAAGTAAAGATGATCAAACCGAGATGATGCAAGCGAAGTTA
AAAGAGTTATTAACGGAATTCGAAACAACGATTAATGCAAGCGATCCAAA
AACAGCTAGAAAAGGGATGAATACGATAATCACTACAATACTCATATGGA
TGATGTGCATAAAAGGAGTAAAACCGTTTGTAGTATATGACTGTGACAAC
ATAAAGATTGGTGACAAGTACTCACTAAAAGAAACAGAGGAGTGCAAAGC
TGCAAACCCCGGAAGATTACAGACAACAGCAACCGCAGTATCATATAACG
TATATCAAGAAGTAGACTTCATCAAAACAGAAATCAAAGAATGCTCAGTA
ACAAGGAAAATTGTAGCATTTCATTGCGGACATCACTCACACTCAACAAT
TATAGAAGCTGGAACGGAAACAACAGTGGTTTCTGTAACGCGGAGCGAAT
GTGAAGAAGGATTCACGACAGGACGCATCTCAATAGCTGGACGAGTGAAT
GTGGCAGCCGAGGAAGGAAAAATAAAAACTACCCGAGTATACGCTGCGGG
AAGAATAACGGCTGGAGACGGAACGTGTCAAGGAGGAGAGTACACCCTAT
TGAATCAATTAGTCAAAGGAGTAGTAGTGATTGAAAGTTATCAAGTAAAG
TTGGAGAAATACCAAGGATATTTCGATCCAGATACGCAAGCAATGAAAAA
GTACCCTCAATGTCTAGCAACAGATAGATCATGCAATACGGGAATGTCAA
CGATTGTATATCACGCCGATACCAGAACTTGTAGACTCACCTTATTGAAA
AGCGCAATATTTGATGAAGTGCAAGGACGAGTGCACAGAGAAACAGAAAC
TAACACGCAAAAAAGTGCATTAGATGAATTACGGAGAACCGGAAGAATCG
GAAAACCAACGATAACCACAACTATCGGATCAGAGGTCACACCAACAGTA
GTAATATCAACAGATACTACAATCGGAATGAGATTTATAAAAGGACGAAT
GGTGAGCAAATGTGACGAAATGGTAGCTAGTACCAATTATAAAGGAATAT
TCTTATCGCGAACAGCAATACCGAACGCAAAAGCGCAAATAGACCCCAAA
GATGTGAAACTATATCTCTATATAAATAACAAGATGGATTTCCTATATCA
CAAAGGATTGGCATCAACAGAGAAAATATACTACGATTTAGTTAAAAATG
ACTGTATCTTGAACAGAGAAATATTAAAAACAAAGTTAGCAATGGCAATC
ACAAATCCGGATAATGCAATTCCATTATTACCACTCCAGGAAGGATATTT
TGGGAGAATCGTTGGAGAAGTTATGTATACGTACAAATGTGAAAAGACAA
TTGCAAAACTGCCAGAGAATTCAACGATTCCAAGAGATAAATGCATAAAC
GAATTAGAGATAATGCATAAAGGGAAAATCAGATTCGCTCAACCAGTAAC
AAGGATGATAAACCCAGAGAAATTTGTACCCAATATGTTCAGTTGTAGTA
ATGTATATGGACCATTATTCGAAATAAGAGATGGAAGTTGGTTACAATTT
CCAACCCGAGTAATTGTAGCACCACCAAAAATATTTGGATTAACAGAATT
GGCCAGCGAAGCAGAATTCAAACCAGTTGATATATCACAGAGTGGAATAT
ACCAAACAAAAGATCTGGAACGAGCCAGAGAACATTTATTATTTCCAGCA
CAAAGGCAAGCAATATTATCCAACATTGTAACTATAACGGGAGGTAACAA
CTATGGAGAGAAACCCAACTACGAATTGTTATTATCACCAGACCATTTCC
AACACGCTACAGCAAACATAATGAAAGCCATGTGGGGAAGATTCTTAATC
TTCGGACAAATGATGGCAGGAATTTTAGGAATTATACTCATCATCCAAAT
CGTAAGGGTAATCCTAACACAGATGTTAGCATGCTTCGATATTTACAAAA
GAGAAAGGAAAATCAACTGGAAAATGGCAATTGGATTCCTACCGTTCTTA
GCAAAGACAATGGTGTTACATGGACACTCAAAATACATTCACAAGATAAA
GAGACTAGTAGGAATTCTAACAAAAATGAACGGTACCGATGAAGAAGTTG
GACGGAAATTCAAAGCGATAATGAGATTAGAAGCCGGGAGAACCAGAATA
AAACGACAGAAAACGAGCAACTGGAAGCGATGGATGAACCTCAAAAATGA
TAACAGTGACAACAGCGACAATGACGAAAATCAAGGAGGAGCACAGATGC
TAACTCAAGAACAGCTTAAACAATGGGAAAACTGAAAGACAAGACATAGT
GAACATTACAAAAATCCACCACCAGTATATGAATGTCTGAAGAAAGCAGC
AAAATCAGTAGAGAGAAATACGCCGGTGGAAAGACACGTGCCGATACCAA
GACCAGCAACACTACGAAACATTTATCCGGCACCATACGAACCGAATTGG
GCAGACTTGATACCGAGAACCAGCCAAAGATCGAGAAGTGTGGATCAGGA
TCCAGAAACTCGAACCATAGCTATGTTACAAACATTAGCCGGACCAACAC
TAACGGTACCCATAGAAATAAATGGGGTAGAAATAAAAGCAATGATTGAT
ACCGGAGCAAACATCTCAGCAATAAATATGGATAAAATACCCGGAAGATT
GTGGCCATGTATAGAAGAAGCTAACATGGATGTAATAACTTGCAGCAAAG
AATCGGTAGCAGTCATCGGAAAAATTTGGTCGATAATAGAATATAAACAT
GTAAAAATCAATACGTATTTAGCAGTTATCAGGAGATTGAGTGCAGATTG
TATAATTGGAACAGATCTCATGCCAGAATTACTAAAAGAGATAATAATAG
ATCTAGGATCAATGGAGCTGAGAGATAAAACCGGACGAATATGCATGTTA
AAATCAGAAGTGAGATGCTCAGAGAAAATAGTAGTGCCAGGGCGCACACA
AATGATTTGTTACGTAAAAGTGAACGAGGAAACCAGAGGCGAAATGATTT
TCGAACCTAACCACAAATTTGAGAAAAAGAGAGAACTACCATTAGCCAGA
GAGATAGTGTATGTAAACAACAACAGTGAAATTCCAATCAACATTACTAA
TTTCGATGAGGAAGACAAAGTAATTTACGAAAACGAAAGATTGGGGAAAC
TAACACCGATGATGGACATTAAACTTCCAACAACGAAAACCGAGGGAACG
GAGGAAGAATTATGGACAATCGACAGTCAGCTACTAAACGAAAACGAGAA
AGAACAACTACAAAAACTGCTAATGGAATTTAAAGATATATTTGCAGCAT
CTGACTTAGATCTCGGTACAAGCGATGTAACGCAACATACGATACCATTA
ACAGATGATACCCCAATAACACTACGGCCATATCGATTGGCAGAAGCTCA
AAAATCAGAAGCAGAAAAAGAAGTTCAAAAGATGTTAGACGCAGGAGTAA
TCGAACCAAGTTGCTCAGTGTGGCAATTTCCAGTGGTCATGGTAAAGAAA
AAGGACGGAACACAACGGTTCTGTATAGATTATCGACGACTCAATGCGGT
AACAAAACGGGATACATACCC

protein sequence of SMED30035617-orf-1

>SMED30035617-orf-1 ID=SMED30035617-orf-1|Name=SMED30035617-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=349bp
MLQTLAGPTLTVPIEINGVEIKAMIDTGANISAINMDKIPGRLWPCIEEA
NMDVITCSKESVAVIGKIWSIIEYKHVKINTYLAVIRRLSADCIIGTDLM
PELLKEIIIDLGSMELRDKTGRICMLKSEVRCSEKIVVPGRTQMICYVKV
NEETRGEMIFEPNHKFEKKRELPLAREIVYVNNNSEIPINITNFDEEDKV
IYENERLGKLTPMMDIKLPTTKTEGTEEELWTIDSQLLNENEKEQLQKLL
MEFKDIFAASDLDLGTSDVTQHTIPLTDDTPITLRPYRLAEAQKSEAEKE
VQKMLDAGVIEPSCSVWQFPVVMVKKKDGTQRFCIDYRRLNAVTKRDTY

protein sequence of SMED30035617-orf-2

>SMED30035617-orf-2 ID=SMED30035617-orf-2|Name=SMED30035617-orf-2|organism=Schmidtea mediterranea sexual|type=polypeptide|length=751bp
MMQAKLKELLTEFETTINASDPKTARKGMNTIITTILIWMMCIKGVKPFV
VYDCDNIKIGDKYSLKETEECKAANPGRLQTTATAVSYNVYQEVDFIKTE
IKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSVTRSECEEGFTTGRIS
IAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLLNQLVKGVVVI
ESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTIVYHADTRTC
RLTLLKSAIFDEVQGRVHRETETNTQKSALDELRRTGRIGKPTITTTIGS
EVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSRTAIPNAK
AQIDPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILNREILKT
KLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPENSTIP
RDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLFEIRD
GSWLQFPTRVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLERARE
HLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANIMKAM
WGRFLIFGQMMAGILGIILIIQIVRVILTQMLACFDIYKRERKINWKMAI
GFLPFLAKTMVLHGHSKYIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLE
AGRTRIKRQKTSNWKRWMNLKNDNSDNSDNDENQGGAQMLTQEQLKQWEN
*

protein sequence of SMED30035617-orf-3

>SMED30035617-orf-3 ID=SMED30035617-orf-3|Name=SMED30035617-orf-3|organism=Schmidtea mediterranea sexual|type=polypeptide|length=562bp
SDLEERHSEKLGVDPRSMISQKPKSTENVAGGVPQFLLKKYVTPQFVKDD
TIYITIEVETEREMKLRIQQEQARPRISVLEKRIPGTKYLPTDIDDPVEE
GLTRLTETQKCRELAAILRTGAKNQRRTVTDLAKWIIDSCNRVENLRNIV
KEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMMPDFEPIE
REFYITAKVSEVKKRIPYEPPFHLQYQGRSLDDDLEIGVTQMQLGLINVL
TVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEIL
NDLNDIVDEETTRRIPESRDLFMTEDENDLITDEPDELVCGQRQPVEIIE
PDTEEENKKKERRRRSSTGDEWDNWINERGATTSKETADEQPKRKETGTA
MEKKSRGRPKIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKLKGQ
EQGAKWINRLITDMLGLANPEIKRNLKLNNIGAEAKQLWAAIKPIINKIM
DNKVRTTTKDPEELLLIINHLTKRSDDPNRYLVGMAYLMYNTIVIYRKRT
PRYNPNDMNSQ*
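
The ORF sequences above are conceptual translations of the transcript in its forward reading frames. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1) and assuming that frame 2 means translation starts at the second nucleotide. `translate` is a hypothetical helper written for this illustration and is not the pipeline that produced the record. Applied to the first 50 nt of the transcript in frame 2, it reproduces the first residues of SMED30035617-orf-3.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1); codons are ordered
# TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, frame=1):
    """Translate a nucleotide string in forward frame 1, 2 or 3.

    Stop codons are written as '*', codons containing characters other than
    A/C/G/T as 'X', and a trailing incomplete codon is dropped.
    """
    if frame not in (1, 2, 3):
        raise ValueError("frame must be 1, 2 or 3")
    seq = nucleotides.upper().replace("U", "T")
    codons = (seq[i:i + 3] for i in range(frame - 1, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First line (50 nt) of the SMED30035617 transcript sequence.
first_50_nt = "TTCCGATCTGGAAGAGCGACACAGCGAAAAATTGGGAGTAGACCCAAGAT"
print(translate(first_50_nt, frame=2))  # SDLEERHSEKLGVDPR
```

The sketch covers only the three forward frames; reverse-strand frames would require reverse-complementing the input first.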
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
Term            Definition
PLANA:0000099   neuron
PLANA:0000463   cholinergic neuron
PLANA:0000464   GABAergic neuron
PLANA:0003116   parenchymal cell
PLANA:0007528   glial cell
Vocabulary: INTERPRO
Term        Definition
IPR021109   Peptidase_aspartic_dom_sf
IPR018061   Retropepsins
IPR008974   TRAF-like
Vocabulary: molecular function
Term         Definition
GO:0005515   protein binding
GO:0003676   nucleic acid binding
GO:0004190   aspartic-type endopeptidase activity
Vocabulary: biological process
Term         Definition
GO:0006508   proteolysis
GO:0015074   DNA integration