Transposon Ty3-I Gag-Pol polyprotein

Overview
NameTransposon Ty3-I Gag-Pol polyprotein
Smed IDSMED30023331
Length (bp)6213
Neoblast Clusters

Zeng et. al., 2018




▻ Overview

▻ Neoblast Population

▻ Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 



 

Overview

 

Single cell RNA-seq of pluripotent neoblasts and its early progenies


We isolated X1 neoblasts cells enriched in high piwi-1 expression (Neoblast Population), and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them based on high (Nb1) to low (Nb12) piwi-1 expression levels. We further defined groups of genes that best classified the cells parsed into 12 distinct cell clusters to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signatures was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1 and largely confirmed the cell clusters revealed by scRNA-seq.

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, followed by isolation and single-cell RNA-seq of 1,200 individual cells derived from X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population)




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of Transposon Ty3-I Gag-Pol polyprotein (SMED30023331) t-SNE clustered cells

Violin plots show distribution of expression levels for Transposon Ty3-I Gag-Pol polyprotein (SMED30023331) in cells (dots) of each of the 12 neoblast clusters.

 

back to top


 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Transposon Ty3-I Gag-Pol polyprotein (SMED30023331) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Transposon Ty3-I Gag-Pol polyprotein (SMED30023331) in cells (dots) of each of the 10 clusters of sub-leathally irradiated X1 and X2 cells.

 

back to top


 

Embryonic Expression

Davies et. al., 2017




Hover the mouse over a column in the graph to view average RPKM values per sample.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods

 

back to top
Homology
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: ZBED5 (zinc finger BED-type containing 5 [Source:HGNC Symbol;Acc:HGNC:30803])

HSP 1 Score: 284.263 bits (726), Expect = 1.101e-89
Identity = 177/449 (39.42%), Postives = 263/449 (58.57%), Query Frame = 1
Query: 4447 KRIKDMSHDIEEIVNYKLSK-KYFALQIDESVDISSKAQLLALVRFIDENEIVNQFLCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFPS-------IQTNNVTDIIPAIIEHLTILDEIIACYFSSLKLESYDWIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSNEHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEIC-KTRQAQISH 5766
            +RIKD++ DIEE +  +L     F+LQ+DES D+S  A LL  VR+     I    L C  L  + TG++IFNCI ++++K +I W+ CV +C+D   ++ G I   VTL+K   P   S+HC L+   L  K +P +LK+ LD+ VQI+NYIK+RP Q+R+ K L   M A++ +LLL+TE+R LSRGKVL RL EL+ ELL +  +    +      N  W  +LAYLADIF  LN VN S+QGKN  + T  DK+ +  +K+ +W + + + N  D FP+       I +    DI  AI++HL  L   +  YF  +  ++  W+RNPF         ++   E +I L+++  +K  F+++S   FW  + +E+PS++++A+ + L F+T +LCE GFS     KTK R+RL D    MR+ +S I PNI  IC K  Q   SH
Sbjct:  250 RRIKDLAADIEEELVCRLKICDGFSLQLDESADVSGLAVLLVFVRYRFNKSIEEDLLLCESLQSNATGEEIFNCINSFMQKHEIEWEKCVDVCSDASRAVDGKIAEAVTLIKYVAPESTSSHCLLYRHALAVKIMPTSLKNVLDQAVQIINYIKARPHQSRLLKILCEEMGAQHTALLLNTEVRWLSRGKVLVRLFELRRELLVFMDSAF--RLSDCLTNSSWLLRLAYLADIFTKLNEVNLSMQGKNVTVFTVFDKMSSLLRKLEFWASSVEEEN-FDCFPTLSDFLTEINSTVDKDICSAIVQHLRGLRATLLKYF-PVTNDNNAWVRNPFTVTVKPASLVARDYESLIDLTSDSQVKQNFSELSLNDFWSSLIQEYPSIARRAVRVLLPFATMHLCETGFSYYAATKTKYRKRL-DAAPHMRIRLSNITPNIKRICDKKTQKHCSH 693          

HSP 2 Score: 69.3218 bits (168), Expect = 1.101e-89
Identity = 43/119 (36.13%), Postives = 63/119 (52.94%), Query Frame = 2
Query: 4034 RQYSDEYISFGFAWTDEKECPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYFKRLLGM----QAKQFKGAMTISDKAQIASYKEL*LIALKLKPHTIAESLILPSCCEIV 4378
            R+Y + Y+SFGF +   ++ P  +CV+C   LSNS++ P+KL RH    HA    K+  +FK+ L      +    K   T ++ A  ASY     IAL  + HTI E LI P   ++V
Sbjct:  108 RKYDESYLSFGFTYFGNRDAPHAQCVLCKKILSNSSLAPSKLRRHLETKHAAYKDKDISFFKQHLDSPENNKPPTPKIVNTDNESATEASYNVSYHIALSGEAHTIGELLIKPCAKDVV 226          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: ZBED5 (zinc finger BED-type containing 5 [Source:HGNC Symbol;Acc:HGNC:30803])

HSP 1 Score: 284.263 bits (726), Expect = 1.101e-89
Identity = 177/449 (39.42%), Postives = 263/449 (58.57%), Query Frame = 1
Query: 4447 KRIKDMSHDIEEIVNYKLSK-KYFALQIDESVDISSKAQLLALVRFIDENEIVNQFLCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFPS-------IQTNNVTDIIPAIIEHLTILDEIIACYFSSLKLESYDWIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSNEHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEIC-KTRQAQISH 5766
            +RIKD++ DIEE +  +L     F+LQ+DES D+S  A LL  VR+     I    L C  L  + TG++IFNCI ++++K +I W+ CV +C+D   ++ G I   VTL+K   P   S+HC L+   L  K +P +LK+ LD+ VQI+NYIK+RP Q+R+ K L   M A++ +LLL+TE+R LSRGKVL RL EL+ ELL +  +    +      N  W  +LAYLADIF  LN VN S+QGKN  + T  DK+ +  +K+ +W + + + N  D FP+       I +    DI  AI++HL  L   +  YF  +  ++  W+RNPF         ++   E +I L+++  +K  F+++S   FW  + +E+PS++++A+ + L F+T +LCE GFS     KTK R+RL D    MR+ +S I PNI  IC K  Q   SH
Sbjct:  250 RRIKDLAADIEEELVCRLKICDGFSLQLDESADVSGLAVLLVFVRYRFNKSIEEDLLLCESLQSNATGEEIFNCINSFMQKHEIEWEKCVDVCSDASRAVDGKIAEAVTLIKYVAPESTSSHCLLYRHALAVKIMPTSLKNVLDQAVQIINYIKARPHQSRLLKILCEEMGAQHTALLLNTEVRWLSRGKVLVRLFELRRELLVFMDSAF--RLSDCLTNSSWLLRLAYLADIFTKLNEVNLSMQGKNVTVFTVFDKMSSLLRKLEFWASSVEEEN-FDCFPTLSDFLTEINSTVDKDICSAIVQHLRGLRATLLKYF-PVTNDNNAWVRNPFTVTVKPASLVARDYESLIDLTSDSQVKQNFSELSLNDFWSSLIQEYPSIARRAVRVLLPFATMHLCETGFSYYAATKTKYRKRL-DAAPHMRIRLSNITPNIKRICDKKTQKHCSH 693          

HSP 2 Score: 69.3218 bits (168), Expect = 1.101e-89
Identity = 43/119 (36.13%), Postives = 63/119 (52.94%), Query Frame = 2
Query: 4034 RQYSDEYISFGFAWTDEKECPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYFKRLLGM----QAKQFKGAMTISDKAQIASYKEL*LIALKLKPHTIAESLILPSCCEIV 4378
            R+Y + Y+SFGF +   ++ P  +CV+C   LSNS++ P+KL RH    HA    K+  +FK+ L      +    K   T ++ A  ASY     IAL  + HTI E LI P   ++V
Sbjct:  108 RKYDESYLSFGFTYFGNRDAPHAQCVLCKKILSNSSLAPSKLRRHLETKHAAYKDKDISFFKQHLDSPENNKPPTPKIVNTDNESATEASYNVSYHIALSGEAHTIGELLIKPCAKDVV 226          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: ZBED9 (zinc finger BED-type containing 9 [Source:HGNC Symbol;Acc:HGNC:13851])

HSP 1 Score: 261.536 bits (667), Expect = 3.916e-85
Identity = 173/454 (38.11%), Postives = 269/454 (59.25%), Query Frame = 1
Query: 4447 KRIKDMSHDIEE--IVNYKLSKKYFALQIDESVDISSKAQLLALVRFIDENEIVNQFLCCRELTEHTTGKDIFNCITTYL-EKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFPSIQT-----NNVTDII---PAIIEHLTILDEIIACYFSSLKLESYD--WIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSN-EHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEICKTRQAQISH 5766
            +RI+++++D+E+  I   KL+K YF+LQ+DE  DI++   LL  VRF  +++I  +F     L  +TT  +++  +  Y+  K  + + FCVG+C+DG  SM G     VT +KE  P   +THCF+H E L  K +   L S L+ +V+IVNYIKS  L +R+F  L  +M+A ++ LLLH EIR LSRGKVL R+ E+++ELL + Q      + + F++  W A+LAYL+DIF   N +N S+QGKN    +  DK+    +K+  WKNRI+  +  DMF ++ T      N  DI      I EHLT L E    YF S +       WI+NPF + + +       +++++ L+T+  LK+ F   ++   FWI  + ++P L++ A+ + L F ++YLCE GFSTL+ IKTK R  L ++   +RVA+S I+P + ++   +QA +SH
Sbjct:  725 RRIQELANDMEDQLIEQIKLAK-YFSLQLDECRDIANMIILLVYVRFEHDDDIKEEFFFSASLPTNTTSSELYEAVKNYIVNKCGLEFKFCVGVCSDGAASMTGKHSEVVTQIKELAPECKTTHCFIHRESLAMKKISAELNSVLNDIVKIVNYIKSNSLNSRLFSLLCDNMEADHKQLLLHAEIRWLSRGKVLSRMFEIRNELLVFLQG-KKPMWSQLFKDVNWTARLAYLSDIFSIFNDLNASMQGKNATYFSMADKVEGQKQKLEAWKNRIS-TDCYDMFHNLTTIINEVGNDLDIAHLRKVISEHLTNLLECFEFYFPSKEDPRIGNLWIQNPFLSSKDNLNLTVTLQDKLLKLATDEGLKISFENTASLPSFWIKAKNDYPELAEIALKLLLLFPSTYLCETGFSTLSVIKTKHRNSL-NIHYPLRVALSSIQPRLDKLTSKKQAHLSH 1174          

HSP 2 Score: 73.1738 bits (178), Expect = 3.916e-85
Identity = 42/110 (38.18%), Postives = 60/110 (54.55%), Query Frame = 2
Query: 4034 RQYSDEYISFGFAWTDEKECPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYFKRL---LGMQAKQFKGAMTISDKAQIASYKEL*LIALKLKPHTIAESLI 4354
            R+Y   YI FGF    + E   P+C++CG  L+N AM P+KL RH  + H  + S+  ++F+R    L  Q KQ      I+  A  ASYK    +A    P+TIAE+L+
Sbjct:  584 RKYDPSYIEFGFVAVIDGEVLKPQCIICGDVLANEAMKPSKLKRHLYSKHKEISSQPKEFFERKSSELKSQPKQVFNVSHINISALRASYKVALPVAKSKTPYTIAETLV 693          

HSP 3 Score: 25.0238 bits (53), Expect = 3.916e-85
Identity = 9/25 (36.00%), Postives = 18/25 (72.00%), Query Frame = 3
Query: 4377 LKIMFGDAKNEIMKIPLSNDTIKKK 4451
            L+++   A  ++ ++PLSNDTI ++
Sbjct:  702 LEMLGESAAKKVAQVPLSNDTIARR 726          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: ZBED9 (zinc finger BED-type containing 9 [Source:HGNC Symbol;Acc:HGNC:13851])

HSP 1 Score: 261.536 bits (667), Expect = 4.110e-85
Identity = 173/454 (38.11%), Postives = 269/454 (59.25%), Query Frame = 1
Query: 4447 KRIKDMSHDIEE--IVNYKLSKKYFALQIDESVDISSKAQLLALVRFIDENEIVNQFLCCRELTEHTTGKDIFNCITTYL-EKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFPSIQT-----NNVTDII---PAIIEHLTILDEIIACYFSSLKLESYD--WIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSN-EHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEICKTRQAQISH 5766
            +RI+++++D+E+  I   KL+K YF+LQ+DE  DI++   LL  VRF  +++I  +F     L  +TT  +++  +  Y+  K  + + FCVG+C+DG  SM G     VT +KE  P   +THCF+H E L  K +   L S L+ +V+IVNYIKS  L +R+F  L  +M+A ++ LLLH EIR LSRGKVL R+ E+++ELL + Q      + + F++  W A+LAYL+DIF   N +N S+QGKN    +  DK+    +K+  WKNRI+  +  DMF ++ T      N  DI      I EHLT L E    YF S +       WI+NPF + + +       +++++ L+T+  LK+ F   ++   FWI  + ++P L++ A+ + L F ++YLCE GFSTL+ IKTK R  L ++   +RVA+S I+P + ++   +QA +SH
Sbjct:  876 RRIQELANDMEDQLIEQIKLAK-YFSLQLDECRDIANMIILLVYVRFEHDDDIKEEFFFSASLPTNTTSSELYEAVKNYIVNKCGLEFKFCVGVCSDGAASMTGKHSEVVTQIKELAPECKTTHCFIHRESLAMKKISAELNSVLNDIVKIVNYIKSNSLNSRLFSLLCDNMEADHKQLLLHAEIRWLSRGKVLSRMFEIRNELLVFLQG-KKPMWSQLFKDVNWTARLAYLSDIFSIFNDLNASMQGKNATYFSMADKVEGQKQKLEAWKNRIS-TDCYDMFHNLTTIINEVGNDLDIAHLRKVISEHLTNLLECFEFYFPSKEDPRIGNLWIQNPFLSSKDNLNLTVTLQDKLLKLATDEGLKISFENTASLPSFWIKAKNDYPELAEIALKLLLLFPSTYLCETGFSTLSVIKTKHRNSL-NIHYPLRVALSSIQPRLDKLTSKKQAHLSH 1325          

HSP 2 Score: 73.1738 bits (178), Expect = 4.110e-85
Identity = 42/110 (38.18%), Postives = 60/110 (54.55%), Query Frame = 2
Query: 4034 RQYSDEYISFGFAWTDEKECPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYFKR---LLGMQAKQFKGAMTISDKAQIASYKEL*LIALKLKPHTIAESLI 4354
            R+Y   YI FGF    + E   P+C++CG  L+N AM P+KL RH  + H  + S+  ++F+R    L  Q KQ      I+  A  ASYK    +A    P+TIAE+L+
Sbjct:  735 RKYDPSYIEFGFVAVIDGEVLKPQCIICGDVLANEAMKPSKLKRHLYSKHKEISSQPKEFFERKSSELKSQPKQVFNVSHINISALRASYKVALPVAKSKTPYTIAETLV 844          

HSP 3 Score: 25.409 bits (54), Expect = 4.110e-85
Identity = 9/25 (36.00%), Postives = 18/25 (72.00%), Query Frame = 3
Query: 4377 LKIMFGDAKNEIMKIPLSNDTIKKK 4451
            L+++   A  ++ ++PLSNDTI ++
Sbjct:  853 LEMLGESAAKKVAQVPLSNDTIARR 877          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Match: ZBED8 (zinc finger BED-type containing 8 [Source:HGNC Symbol;Acc:HGNC:30804])

HSP 1 Score: 250.366 bits (638), Expect = 7.525e-77
Identity = 163/442 (36.88%), Postives = 246/442 (55.66%), Query Frame = 1
Query: 4450 RIKDMSHDIEEIV--NYKLSKKYFALQIDESVDISSKAQLLALVRFIDENEIVNQFLCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFPSIQTNNVTD-----IIPAIIEHLTILDEIIACYFSSLKL-ESYDWIRNPF-GTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSNEHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEICKTR 5748
            RI +MS DI + V  + K S     +Q+ E+ D+   +QL+A VR+I E EIV +FL C  L     G D+FN    +  K +I+ D C  +CTDG  SM G     V  VK++ P+I+ THC L+   LV KTLP  L+ AL  VV+++N+IK R    R+F+     +  +Y  LL HTE+R LSRG++L  + E+ +E+  +  + + N  V  FEN  +   LAYLAD+FK+LN ++ S+Q    N +++ +K+ AF +K  +W+ RI K N  + FP ++   V+D     I   I  HL  L      YFS   L E+  WI +PF    +F +    ++ +     ++ + L MEF  M  E FW       P+L+K A+ I + F+T+YLCELGFS+L + KTK R    +L +++RVAIS   P   +I + +
Sbjct:  149 RIDEMSQDILQQVLEDIKASPLKVGIQLAETTDMDDCSQLMAFVRYIKEREIVEEFLFCEPLQLSMKGIDVFNLFRDFFLKHKIALDVCGSVCTDGASSMLGENSEFVAYVKKEIPHIVVTHCLLNPHALVIKTLPTKLRDALFTVVRVINFIKGRAPNHRLFQAFFEEIGIEYSVLLFHTEMRWLSRGQILTHIFEMYEEINQFLHHKSSN-LVDGFENKEFKIHLAYLADLFKHLNELSASMQRTGMNTVSAREKLSAFVRKFPFWQKRIEKRNFTN-FPFLEEIIVSDNEGIFIAAEITLHLQQLSNFFHGYFSIGDLNEASKWILDPFLFNIDFVDDSYLMKNDLAELRASGQIL-MEFETMKLEDFWCAQFTAFPNLAKTALEILMPFATTYLCELGFSSLLHFKTKSRSCF-NLSDDIRVAISKKVPRFSDIIEQK 586          

HSP 2 Score: 60.4622 bits (145), Expect = 7.525e-77
Identity = 38/125 (30.40%), Postives = 62/125 (49.60%), Query Frame = 2
Query: 4022 MKTNRQYSDEYISFGFAWTDEKE-CPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYFKRLLG--MQAKQFK--GAMTISDKAQIASYKEL*LIALKLKPHTIAESLILPSCCEIVK 4381
            M   R++ D+Y+ + F  T E +    P+CV+C    SN+ + P+KL+ HF   H  +   + +  K +     Q++  K  G  +  D    ASY+   L A +  PHT+AE L+ P   EI +
Sbjct:    1 MSKKRKWDDDYVRYWFTCTTEVDGTQRPQCVLCNSVFSNADLRPSKLSDHFNRQHGGVAGHDLNSLKHMPAPSDQSETLKAFGVASHEDTLLQASYQFAYLCAKEKNPHTVAEKLVKPCALEIAQ 125          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Fly
Match: CNBP (gene:FBgn0034802 transcript:FBtr0303464)

HSP 1 Score: 60.8474 bits (146), Expect = 6.340e-10
Identity = 29/74 (39.19%), Postives = 45/74 (60.81%), Query Frame = 1
Query:  274 EESIRREMPECWLCHKIGHTKIDCP------IKGKIECWTCHRSGHISRNCPDKKAPRCFGCGKEGHIRRLCQE 477
            ++  + + P C+ C+K GH   +CP          + C+ C+R+GHIS+NCP+  +  C+GCGK GH+RR C E
Sbjct:   88 KDCTQADNPTCYRCNKTGHWVRNCPEAVNERGPTNVSCYKCNRTGHISKNCPE-TSKTCYGCGKSGHLRRECDE 160          

HSP 2 Score: 52.7582 bits (125), Expect = 3.037e-7
Identity = 34/99 (34.34%), Postives = 51/99 (51.52%), Query Frame = 1
Query:  283 IRREMPECWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCPDKKAPRCFGCGKEGHIRRLCQE---------IRCERCSRNGHRSEEC--YTKMRYG 546
            +RR   +C+ C++ GH    CP + +  C+ C+  GHIS++C     P C+ C K GH  R C E         + C +C+R GH S+ C   +K  YG
Sbjct:   50 MRRNREKCYKCNQFGHFARACPEEAE-RCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCPEAVNERGPTNVSCYKCNRTGHISKNCPETSKTCYG 147          

HSP 3 Score: 51.6026 bits (122), Expect = 7.945e-7
Identity = 32/93 (34.41%), Postives = 46/93 (49.46%), Query Frame = 1
Query:  283 IRREMPE----CWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCP---DKKAP---RCFGCGKEGHIRRLCQEIR--CERCSRNGHRSEEC 525
              R  PE    C+ C+ IGH   DC       C+ C+++GH  RNCP   +++ P    C+ C + GHI + C E    C  C ++GH   EC
Sbjct:   66 FARACPEEAERCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCPEAVNERGPTNVSCYKCNRTGHISKNCPETSKTCYGCGKSGHLRREC 158          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Fly
Match: CNBP (gene:FBgn0034802 transcript:FBtr0071994)

HSP 1 Score: 60.8474 bits (146), Expect = 6.340e-10
Identity = 29/74 (39.19%), Postives = 45/74 (60.81%), Query Frame = 1
Query:  274 EESIRREMPECWLCHKIGHTKIDCP------IKGKIECWTCHRSGHISRNCPDKKAPRCFGCGKEGHIRRLCQE 477
            ++  + + P C+ C+K GH   +CP          + C+ C+R+GHIS+NCP+  +  C+GCGK GH+RR C E
Sbjct:   88 KDCTQADNPTCYRCNKTGHWVRNCPEAVNERGPTNVSCYKCNRTGHISKNCPE-TSKTCYGCGKSGHLRRECDE 160          

HSP 2 Score: 52.7582 bits (125), Expect = 3.037e-7
Identity = 34/99 (34.34%), Postives = 51/99 (51.52%), Query Frame = 1
Query:  283 IRREMPECWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCPDKKAPRCFGCGKEGHIRRLCQE---------IRCERCSRNGHRSEEC--YTKMRYG 546
            +RR   +C+ C++ GH    CP + +  C+ C+  GHIS++C     P C+ C K GH  R C E         + C +C+R GH S+ C   +K  YG
Sbjct:   50 MRRNREKCYKCNQFGHFARACPEEAE-RCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCPEAVNERGPTNVSCYKCNRTGHISKNCPETSKTCYG 147          

HSP 3 Score: 51.6026 bits (122), Expect = 7.945e-7
Identity = 32/93 (34.41%), Postives = 46/93 (49.46%), Query Frame = 1
Query:  283 IRREMPE----CWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCP---DKKAP---RCFGCGKEGHIRRLCQEIR--CERCSRNGHRSEEC 525
              R  PE    C+ C+ IGH   DC       C+ C+++GH  RNCP   +++ P    C+ C + GHI + C E    C  C ++GH   EC
Sbjct:   66 FARACPEEAERCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCPEAVNERGPTNVSCYKCNRTGHISKNCPETSKTCYGCGKSGHLRREC 158          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX546500.1 (pep chromosome:GRCz11:23:12926092:12931693:-1 gene:ENSDARG00000086495.3 transcript:ENSDART00000122176.3 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX546500.1)

HSP 1 Score: 344.739 bits (883), Expect = 1.289e-96
Identity = 221/749 (29.51%), Postives = 374/749 (49.93%), Query Frame = 3
Query: 1479 IEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQR-DKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGR--RFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEV---RKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPV----KRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQF----RTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSK-EIPIKEELEEQTRRKFNVGEEVLVKVETRHKGQ-DRY 3662
            +E+ I +     IIR  +SP       V KK+   +R C+D+R LN++T +  +P+P +    + L G+ FF+ +DL NAY+ V +    + KTAF+T  G + +  +PFG++ AP  FQ ++  VL  +  +   VYLDDILI+S S ++H   +  VL+ + E GL +  EKC    + ++FLGHI+S EG++ DP KIQA+ ++  P+  K L+ FLG  N+YRRFI ++++ A  L  L S  S     WS    + F  LK    +APIL  PD  ++F+++ DAS   +GA+LSQR   DGK    AY SH ++  E+ Y I  +ELLAV      ++H+L G    F + TDHK + ++ + K+ + S+   W  +    +  + YR G  +   D +SR   D   +         +G +   +   I    R+ + +G+   +     +  +PE    + ++  H  ++ CH GV +    +K R+    +   +++ V +C +C  +K+   + + P             +  I +D    L  + G    +L ++D++SK     ++ K          +   + R  G P ++  D G  F S   REF ++L   +  SS +H Q+NGQ ER      RT+R L++     +  + W++ L  VE+  N+    +TG SP +   G +        S+  +P      ++ RR +N   + L++V TR K + DR+
Sbjct:  514 MEKYISDSLAAKIIRPSSSPAGAGFFFV-KKKDGSLRPCIDYRGLNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQRLLENGLYVKAEKCVFHAQSVQFLGHIVSVEGMRMDPEKIQAVVDWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTSLTS--SKMPFRWSSAAEAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSSDGKIHPCAYFSHRLSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIRSAKR-LNSRQARWALFFGRFNFTISYRPGSKNIKPDALSRL-FDPSDRLSSPDPVLPQGIVVANISWEIESRVRTAL-NGVTPPIGCPPSRLFVPEELRSDVVRWGHSSKVACHPGVSRTLFVIKQRFWWPTMARDVRDFVLACSVCAVSKS---SNRPPAGLLQPLSVPSRPWSHISLDFVTGLPSSNG-NTVVLTVVDRFSKAAHFISLPKLPSARETAVAVIDHVFRIHGLPTDVVSDRGPQFVSRFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVS-----QNPSSWSQQLSWVEYAHNSLPVSSTGLSPFQCSLGYQPPVFPSLESEVAVPSVHAFVQRCRRTWNRARQTLLRVGTRTKAKADRH 1247          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX511082.1 (pep chromosome:GRCz11:9:14291932:14297132:1 gene:ENSDARG00000113678.1 transcript:ENSDART00000183119.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX511082.1)

HSP 1 Score: 345.51 bits (885), Expect = 1.782e-96
Identity = 264/956 (27.62%), Postives = 444/956 (46.44%), Query Frame = 3
Query:  897 SITVRMNGENFD--CLLDTGARINVMSVNCFNKLRGQQLTKSDD-KLRCANESTIETI----GKTKVQVTIGNVSKEVIFIVAE-KVTPDVIGGIELQETFGFRLLKIKDIEASEKDKNYICNIEA--KFGRKIKDEERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQR-DKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGR--RFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGK---IKTRL-LDSIREEGRSNIQ---HGIVEE---VRKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPV----KRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSK-EIPIKEELEEQTRRKFNVGEEVLVKVETRHKGQ-DRY 3662
            ++ +  +  NF    ++D+GA  N +      KLR   ++ S    +   N S++ +I    G  ++ +T GN S+ + F + E  VTP V+G   L         + + + +  +  +  C + A     R +  EE +   L  +   E   LK + + S    +       R +    +++    P   K Y   +     +E+ I +     IIR  +SP       V KK+   +R C+D+R LNA+T +  +P+P +    + L G+ FF+ +DL NAY+ V + E  + KTAF+T  G + +  +PFG++ AP  FQ ++  VL  +  +   VYLDDILI+S S ++H   +  VL+ + E GL +  EKC    + + FLGHI+S EG++ DP K+QA+ ++  P+  K L+ FLG  N+YRRFI ++++ A  L  L S        WS      F+ LK    +APIL  PD  ++F+++ DAS   +GA+LSQR   DGK    AY SH + + E+ Y I  +ELLAV      ++H+L G    F + TDHK + ++ + K+ + S+   W  +    D  + YR G  +   D +SR            H E        +  RL + ++  E  S ++    G+         +  +PE    + I+  H  +L CH GV +    +K R+    +   I+  V +C +C  +K    + + P             +  I +D    L  + G    IL ++D++SK      + K          +   + R  G P ++  D G  F S   REF  ++   +  SS +H Q+NGQ ER  + +  ++   L  +  + W++ L  VE+  N+     TG SP E   G +        S+  +P      ++ RR +N   + L++V  R K + DR+
Sbjct:  344 NVMLEWSSGNFSTQAIIDSGAEGNFIDSALVKKLRLPVISLSQPISVHALNGSSLPSITHSTGPIRL-ITSGNHSEIIHFFLTEAPVTPVVLGHPWLVIHNPHINWRQESVISWSESCHATCLLSACSSVSRSVFQEEHM--DLSNVP-KEYLDLKRVFSKSRAASLPPH----RPYDCAIELLPGTSPPKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAGAGFFFV-KKKDGSLRPCIDYRGLNAITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRIREGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMLDQFIYVYLDDILIFSHSLQEHVQHVRRVLQRLLENGLYVKAEKCVFHAQSVPFLGHIVSVEGMRMDPEKVQAVVDWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTALTS--LKTPFRWSNAAQVAFDRLKSCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSSDGKMHPCAYFSHRLNNAEQNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIQSAKR-LNSRQARWALFFGRFDFSISYRPGSKNVKPDALSRI---------FDHSERASSPETIVPRRLFISAVTWEIESRVRMALEGVTPPPGCPPSRLFVPEELRSDVIRWGHSSKLACHPGVSRTLYLIKQRFWWPVMARDIRNFVLACSVCAVSKT---SNRPPAGLLQPLSVPSRPWSHIALDFVTGLPPSNG-NTVILTVVDRFSKATHFIPLPKLPSARETAAAVIDHVFRIHGLPTDVVSDRGPQFISKFWREFCHLMGATVSLSSGFHPQSNGQTERANQDLERMLRC-LVSQNPSSWSQQLSWVEYAHNSLPVSATGLSPFECSLGYQPPAFPSLESEVAVPSAHAFVQRCRRTWNRARQTLLQVGLRTKAKADRH 1273          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: BX511224.1 (pep chromosome:GRCz11:2:18017000:18022765:1 gene:ENSDARG00000113243.1 transcript:ENSDART00000186877.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:BX511224.1)

HSP 1 Score: 337.806 bits (865), Expect = 6.351e-94
Identity = 221/749 (29.51%), Postives = 370/749 (49.40%), Query Frame = 3
Query: 1479 IEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRD-KDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGR--RFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEV---RKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPV----KRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQF----RTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSK-EIPIKEELEEQTRRKFNVGEEVLVKVETRHKGQ-DRY 3662
            +E+ I +     IIR  +SP       V KK+   +R C+D+R LN++T +  +P+P +    + L G+ FF+ +DL NAY+ V +    + KTAF+T  G + +  +PFG++ AP  FQ ++  VL  +  +   VYLDDILI+S S ++H   +  VL+ + E GL +  EKC    + ++FLGHI+S EG++ DP KIQA+ N+  P+  K L+ FLG  N+YRRFI ++++ A  L  L S  S     WS    + F  LK    +APIL  PD  ++F+++ DAS   +GA+LSQR   DGK    AY SH ++S E+ Y I  +ELLAV      ++H+L G    F + TDHK + ++ + K+ + S+   W  +    +  + YR G  +   D +SR   D   +         +  +   +   I    R+ +  G+   +     +  +PE    + ++  H  ++ CH GV +    +K R+    +   +++ V +C +C  +K+   + + P             +  I +D    L  + G    IL ++D++SK     ++ K          +   + R  G P ++  D G  F S   REF ++L   +  SS +H Q+NGQ ER      RT+R L++     +  + W++ L  VE+  N+     TG S  +   G +        S+  +P      ++ RR +N   + L++  TR K + DR+
Sbjct:  540 MEKYISDSLAAKIIRPSSSPAGAGFFFV-KKKDGSLRPCIDYRGLNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVCIRPGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQRLLENGLYVKAEKCVFHAQSVQFLGHIVSVEGMRMDPEKIQAVVNWPTPDSRKALQRFLGFANFYRRFIHNFSQLAAPLTSLTS--SKTPFRWSSAAEAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSSAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIKSAKR-LNSRQARWALFFGRFNFTISYRPGSKNIKPDALSRL-FDPSDRTSSPDPVLPQRIVVANISWEIESRVRTALD-GVTPPIGCPPNRLFVPEELRSDVVRWGHSSKVACHPGVSRTLFVVKQRFWWPAMARDVRDFVLACSVCAVSKS---SNRPPAGLLQPLSVPSRPWSHISLDFVTGLPSSNG-NTVILTVVDRFSKAAHFISLPKLPSARETAVAVIDHVFRIHGLPTDVVSDRGPQFVSKFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVS-----QNPSSWSQQLSWVEYAHNSLPVSATGLSSFQCSLGYQPPVFPSLDSEVAVPSVHAFVQRCRRTWNRARQTLLQAGTRTKAKADRH 1273          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: CR855320.1 (pep chromosome:GRCz11:1:7956030:7961696:1 gene:ENSDARG00000099359.2 transcript:ENSDART00000159655.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CR855320.1)

HSP 1 Score: 336.265 bits (861), Expect = 2.017e-93
Identity = 219/737 (29.72%), Postives = 364/737 (49.39%), Query Frame = 3
Query: 1515 IIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQR-DKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGR--RFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEV---RKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPV----KRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQF----RTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSK-EIPIKEELEEQTRRKFNVGEEVLVKVETRHKGQ-DRY 3662
            IIR  +SP       V KK+   +R C+D+R LN +T +  +P+P +    + L G+ FF+ +DL NAY+ V +    + K+AF+T  G + +  +PFG++ AP  FQ  +  VL  +  +   VYLDDILI+S S ++H   +  VL+ + E GL +  EKC    + + FLGHI+S EG++ DP KI+A+ N+  P+  K L+ FLG  N+YRRFI ++++ A  L  L S  S     WS    + F  LK    +APIL  PD  ++F+++ DAS   +GA+LSQR   DGK    AY SH +++ E  Y I  +ELLAV      ++H+L G    F + TDHK + ++ + K+ + S+   W  +    +  + YR G  +   D +SR   D+  +         K  + + +   I  + R+ +  G+   +     +  +PE    + I+  H  ++ CH GV + +  +K R+    L   +++ V +C +C  +K    + + P             +  I +D    L  + G    IL ++D++SK      + K          +   + R  G P ++  D G  F S   REF ++L   +  SS +H Q+NGQ ER      RT+R L++     +  + W++ L  VE+  N+     TG SP +   G +        S+  +P      ++ RR ++   + L++V  R K + DR+
Sbjct:  554 IIRPSSSPAGAGFFFV-KKKDGSLRPCIDYRGLNNITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRAGDEWKSAFNTPRGHFEYCVLPFGLSNAPAVFQAFVNDVLRDMIDQFIYVYLDDILIFSHSLQEHVQHIRRVLQRLLENGLYVKAEKCVFHAQSVPFLGHIVSVEGLRMDPEKIKAVVNWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTALTS--SKTPFRWSSAAEAAFSKLKGCFVSAPILITPDPSRQFVVEVDASEVGVGAILSQRSSSDGKIHPCAYYSHRLSAAESNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIKSAKR-LNSRQARWALFFGRFNFTISYRPGSKNIKPDALSRL-FDSSERTSSLEPVVPKRIVISNITWEIESKVRAALD-GVTPPIGCPPSRLFVPEKLRSDVIRWGHSSKVACHPGVSRTSFVIKQRFWWPALARDVRDFVLACSVCAVSKT---SNRPPAGLLQPLSVPSRPWSHISLDFVTGLPPSNG-NTVILTVVDRFSKAAHFVPLPKLPSARETAVAVINHVFRIHGLPTDVVSDRGPQFISKFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVS-----QNPSSWSQQLSWVEYAHNSLPVSATGLSPFQCSLGYQPPVFPSLESEVAVPSVHAFVQRCRRTWSRARQTLLQVGVRTKAKADRH 1275          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Match: CR925755.2 (pep chromosome:GRCz11:17:42486740:42492668:-1 gene:ENSDARG00000116402.1 transcript:ENSDART00000183946.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CR925755.2)

HSP 1 Score: 335.88 bits (860), Expect = 2.039e-93
Identity = 221/756 (29.23%), Postives = 370/756 (48.94%), Query Frame = 3
Query: 1479 IEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDK-DGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGR--RFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSR-------TKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEV---RKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEM----FEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQF----RTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSK-EIPIKEELEEQTRRKFNVGEEVLVKVETRHKGQ-DRY 3662
            +E+ I +     IIR  +SP       V KK+   +R C+D+R LN++T +  +P+P +    + L G+ FF+ +DL NAY+ V +    + KTAF+T  G + +  +PFG++ AP  FQ ++  VL  +  +   VYLDDILI+S S ++H   +  VL+ + E GL +  EKC    + ++FLGH +S EG++ DP KIQA+ N+  P+  K L+ FLG  N+YRRFI ++ + A  L  L S  S     WS    + F  LK    +APIL  PD  ++F+++ D S   +GA+LSQR   DGK    AY SH +++ E+ Y I  +ELLAV      ++H+L G    F + TDHK + ++ + K+ + S+   W  +    +  + YR G  +   D +SR       T     V  Q         +I++R+  ++          G+   +     +  +PE+   + I   H  ++ CH GV +    +K R+    +   +++ V +C +C  +K+   + + P        +    +  I +D    L  + G    +L ++D++SK      + K          +   + R  G P ++  D G  F S   REF ++L   +  SS +H Q+NGQ ER      RT+R L++     +  + W++ L  VE+  N+     TG SP +   G +        S+  +P      ++ RR +N   + L++V  R K + DR+
Sbjct:  513 MEKYISDSLAAKIIRPSSSPAGAGFFFV-KKKDGSLRPCIDYRGLNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFEYCVLPFGLSNAPAVFQALVNDVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQRLLENGLYVKAEKCVFHAQSVQFLGHTVSVEGMRMDPEKIQAVVNWPTPDSRKALQRFLGFANFYRRFIRNFRQLAAPLTNLTS--SKTPFRWSNAAEAAFSKLKGCFVSAPILIAPDPSRQFVVEVDVSEVGVGAILSQRSALDGKIHPCAYFSHRLSAAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVSTDHKNLEYIKSAKR-LNSRQARWALFFGRFNFSISYRPGSKNIKPDALSRLFDRSDRTSSPDPVLPQRVFVANISWEIESRVRTAL---------DGVTPPIGCPPSRLFVPEDLRSDVIWWGHSSKVACHPGVSRTLFVIKQRFWWPVMARDVRDFVLACSVCAASKS---SNRPPAGLLQPLSVPSRPWSHISLDFVTGLPSSNG-NTVVLTVVDRFSKAAHFVPLPKLPSARETAVAVIDHVFRIHGLPTDVVSDRGPQFVSKFWREFCRLLGATVSLSSGFHPQSNGQTERANQDLERTLRCLVS-----QNPSSWSQQLSWVEYAHNSLPVSATGLSPFQCSLGYQPPVFPSLESEVAVPSIHAFVQRCRRTWNRARQTLLQVGERTKAKADRH 1246          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: anxa6 (annexin A6 [Source:Xenbase;Acc:XB-GENE-989741])

HSP 1 Score: 327.791 bits (839), Expect = 4.475e-95
Identity = 167/464 (35.99%), Postives = 264/464 (56.90%), Query Frame = 3
Query: 1299 EQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGE-PINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVL-GKIKGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSR 2684
            + L +++  +L++I+ +   +F A+    GRTH  +HK+ T  + PI    YR    +  +++  I  +   G+I   +SPW +P++ V KK+    R C+D+R+LN VT   A+PMP +DE+LD LG + + +++DL   Y+Q+ L   +QEK+AF T  G Y F  MPFG+  AP TFQ ++ ++L G    A  YLDDI ++S + E+H   L  V   I++AGL + PEKC +   E+++LGH +    ++ DP+K++AI  +  P   K++ +FLG   YYR+FI +Y+  A+ L  L S   ++ + W+    S    LK AL ++P+L  PD  + FIL TDAS   +GAVLSQ +  G+E  +AY S  +   E  Y    KE LA+ +     + YLYGR FT+ TDH  ++++         +   W   L   +  +++RKG  H NAD +SR
Sbjct:  295 DHLHLSQQDQLRKILRSYSPMFSANP---GRTHWAEHKVDTGTQLPIRSPAYRVAEAVRPEMKSQIDEMLAFGVITPSHSPWASPVVLVPKKDGS-TRFCVDYRRLNDVTTTDAYPMPRVDELLDRLGNAKYLTTLDLSRGYWQIPLAPSAQEKSAFLTPFGLYQFTVMPFGMRNAPATFQRLVNRLLEGMQDFAQAYLDDIAVFSQTWEEHLQHLQRVFAQIQDAGLTLKPEKCHLAMAEVQYLGHRVGGGQLRPDPAKVEAICQWPIPKTQKQVLAFLGTSGYYRKFIPNYSTVAKPLTDLTSRQRSRTIVWTPECESAMNALKQALASSPVLAAPDFSRRFILQTDASNFGLGAVLSQVNTYGEEHPVAYLSRKLLPREAAYATIEKECLAIVWALQKLQPYLYGREFTVVTDHNPLSWLQRVSG-DNGKLLRWSLLLQQYNFTIQHRKGKEHHNADGLSR 753          

HSP 2 Score: 101.293 bits (251), Expect = 1.038e-20
Identity = 70/234 (29.91%), Postives = 114/234 (48.72%), Query Frame = 3
Query: 3132 IDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISR------EKWYG---SKEIPI-------KEELEEQTR-----------------------RKFNVGEEVLVKVETRH-KGQDRYEGPYKVIEKVHDRRYIL 3713
            +D  ++Y    A+ K D  T+   +++ +  R G P EI  D G  F S +++   +   ++   SSPYH QTNG  ER   T++ ++   ++  +K DW   LP + F      Q++TG SP E+++G+++        E W G   S+E+PI       ++ LE+ T                        R+F  G++VL+ V TRH K Q  +EGPY V  K+HD  Y++
Sbjct:    1 MDYATRYPEAVALRKIDAPTVADALIQIFS-RVGFPSEILSDQGPQFTSQLLQCLWQRCGVRAIHSSPYHPQTNGLCERFNGTLKTMLRTFVESGEK-DWERYLPHLLFAYREVPQESTGFSPFELLYGRRVRGPLDLLCEYWEGAPQSQEVPIIPYVLKFRQRLEQMTSLAHDHLSAAQQRQKVWYDRKARERRFMEGDKVLLLVPTRHDKLQAAWEGPYVVTHKLHDTTYVV 232          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000035398.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:3:9869556:9871940:-1 gene:ENSXETG00000011182.1 transcript:ENSXETT00000035398.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 300.827 bits (769), Expect = 2.261e-86
Identity = 214/705 (30.35%), Postives = 326/705 (46.24%), Query Frame = 3
Query: 1473 SKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKD--IRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKK-ARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQR-DKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYG--RRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEVRKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKA-----LTITTKEPVKRQDSKEMFEIIFVDI---CGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLIN---ATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK 3521
            + ++E I    + G IR   SP       V   EKKD  +R C+D+R LN +T +  +P+P I E+ D L G+  FS +DL  AY  + + E  + KTAF+T++G Y +  MPFG+  AP  FQE +  +   +  K  +VYLDDILI+S   E H + + E L  + E  L    EKC     +I FLG+IIS  G + DP+K+ AIQ +  P   K ++ F+G  NYYR+FI  ++ + A +L  +  G   +   W       F+ LK A  +A +L  P+    F ++ DAS    GA+LSQR   DGK    AY S   +S E+ Y I  +ELLAV      ++H L G     T+ TDHK + F+ + K+    Q   W  + S     + YR G  ++ AD +SR+                  +I   +L    E+   +      +       +P       +++ H  +   H G EK  + ++       +   +++ V +C +C  TKA       +    PV  +    +     V++   CG           I  +ID++SK      + K         +  + I R  G P EI  D G  F S   R   K L + L FSS YH QTNG  ER  + +   +    +  QD    DW+++LP  EF  N  +  +TG+SP   V+G+
Sbjct:   16 AAMKEYISENLQRGFIRPSTSPAGAGFFFV---EKKDGGLRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENSLFAKLEKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKGFSSRIAPILSLIRKG--GRPNCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQ-ARWSLFFSRFHFVLTYRPGTKNRKADALSRSFSPEDRLPIEREPIIPPSRIIASVLPQFAEQILLSQSAAPPDTPIGMAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLQRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPVPSRPWTHLGMDFIVELPPSCG--------NTVIWVVIDRFSKMAHFVPLRKLPSAVELAQLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQD----DWSDLLPWAEFAHNNARHSSTGRSPFLSVYGQ 702          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: castor1 (cytosolic arginine sensor for mTORC1 subunit 1 [Source:Xenbase;Acc:XB-GENE-960689])

HSP 1 Score: 308.531 bits (789), Expect = 2.589e-84
Identity = 259/966 (26.81%), Postives = 418/966 (43.27%), Query Frame = 3
Query:  930 DCLLDTGARINVMSVNCFNKLRGQQLTKSDDKLRC--------ANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFGFRLLKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQLKINEDS--KLKEIITNSGNVFMADKWDVGRTH--------LVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKD--IRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKK-ARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQR-DKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYG--RRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEVRKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKA-----LTITTKEPVKRQDSKEMFEIIFVDI---CGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLIN---ATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK-KISREKWYGSKEIPIKEELEEQTRRKFNVGEEVLVKVETRHKG-QDR---YEGPYKVIEKV 3692
               LD+GA  N M +  F K  G  L      +R         + ++   T G+  VQ+   ++ K    I+    +P V+G              +  +       ++     +++ +  +    ++R L+++ ++  S   L  +  +  +VF     +    H        L+   +  RG    + P        + ++E I    + G IR   SP       V   EKKD  +R C+D+R LN +T +  +P+P I E+ D L G+  FS +DL  AY  + + E  + KTAF+T++G Y +  MPFG+  AP  FQE +  +   +  K  +VYLDDILI+S   E H + + E L  + E  L    +KC     +I FLG+IIS  G + DP+K+ AIQ +  P   K ++ F+G  NYYR+FI  ++ + A +L  +  G   +   W       F+ LK A  +A +L  P+    F ++ DAS    GA+LSQR   DGK    AY S   +S E+ Y I  +ELLAV      ++H L G     T+ TDHK + F+ + K+    Q + W  + S     + YR G  ++ AD +SR+                  +I   +L    E+   +      +       +P       +++ H  +   H G EK  + ++       +   +++ V +C +C  TKA       +    PV  +    +     V++   CG           I  +ID++SK      + K         +  + I R  G P EI  D G  F S   R   K L + L FSS YH QTNG  ER  + +   +    +  QD    DW+++LP  EF  N     +TG+SP   V+G+  ++  + +   ++P  ++L       +   +  L K    HK   DR      PYKV EKV
Sbjct:  355 SAFLDSGAAGNFMDL-AFAKKVGISLFPVTPPIRVLAIDDRPLSTDTITLTTGELSVQIGALHLEKMSFLIIPCPSSPVVLG--------------LPWLRLHNPSIDWSSGQISRWSQYCQRHCLILRPLQRVTVSSTSLSALPSVYRDFSDVFCKKSAEFLPPHRRYDCPIDLLPGIMPPRGRTYPLSPAE-----TAAMKEYISENLQRGFIRPSTSPAGAGFFFV---EKKDGGLRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENSLFAKLDKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKGFSSRIAPILSLIRKG--GRPNCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQVR-WSLFFSRFHFVLTYRPGTKNRKADALSRSFSPEDRLPIEREPIIPPSRIIASVLPQFAEQILLSQSAAPPDTPIGMAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLQRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPVPSRPWTHLGMDFIVELPPSCG--------NTVIWVVIDRFSKMAHFVPLRKLPSAVELAQLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQD----DWSDLLPWAEFAHNNASHSSTGRSPFLSVYGQHPLAFPQDFLLSKVPAADDLAAHMSVIWAATKSNLEKSSLVHKTFADRRRKPSPPYKVGEKV 1282          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: lin54 (lin-54 DREAM MuvB core complex component [Source:Xenbase;Acc:XB-GENE-5752559])

HSP 1 Score: 308.916 bits (790), Expect = 2.600e-84
Identity = 263/966 (27.23%), Postives = 420/966 (43.48%), Query Frame = 3
Query:  930 DCLLDTGARINVMSVNCFNKLRGQQLTKSDDKLRC--------ANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFGFRLLKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQLKINEDS--KLKEIITNSGNVFMADKWDVGRTH--------LVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKD--IRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKK-ARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQR-DKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYG--RRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEVRKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKA-----LTITTKEPVKRQDSKEMFEIIFVDI---CGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLIN---ATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK-KISREKWYGSKEIPIKEELEEQTRRKFNVGEEVLVKVETRHKG-QDR---YEGPYKVIEKV 3692
               LD+GA  N M +  F K  G  L      +R         + ++   T G+  VQ+   ++ K    I+    +P V+G   L+       L    I+ S    +       ++ +  +    ++R L+++ ++  S   L  +  +  +VF     +    H        L+   +  RG    + P        + ++E I    + G IR   SP       V   EKKD  +R C+D+R LN +T +  +P+P I E+ D L G+  FS +DL  AY  + + E  + KTAF+T++G Y +  MPFG+  AP  FQE +  +   +  K  +VYLDDILI+S   E H + + E L  + E  L    +KC     +I FLG+IIS  G + DP+K+ AIQ +  P   K ++ F+G  NYYR+FI  ++ + A +L  +  G   +   W       F+ LK A  +A +L  P+    F ++ DAS    GA+LSQR   DGK    AY S   +S E+ Y I  +ELLAV      ++H L G     T+ TDHK + F+ + K+    Q + W  + S     + YR G  ++ AD +SR+                  +I   +L    E+   +      +       +P       +++ H  +   H G EK  + ++       +   +++ V +C +C  TKA       +    PV  +    +     V++   CG           I  +ID++SK      + K         +  + I R  G P EI  D G  F S   R   K L + L FSS YH QTNG  ER  + +   +    +  QD    DW+++LP  EF  N     +TG+SP   V+G+  ++  + +   ++P  ++L       +   +  L K    HK   DR      PYKV EKV
Sbjct:  355 SAFLDSGAAGNFMDL-AFAKKVGISLFPVTPPIRVLAIDDRPLSTDTITLTTGELSVQIGALHLEKMSFLIIPCPSSPVVLGLPWLR-------LHNPSIDWSSGQIS-------RWSQYCQRHCLILRPLQRVTVSSTSLSALPSVYRDFSDVFCKKSAEFLPPHRRYDCPIDLLPGIMPPRGRTYPLSPAE-----TAAMKEYISENLQRGFIRPSTSPAGAGFFFV---EKKDGGLRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENSLFAKLDKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKGFSSRIAPILSLIRKG--GRPNCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQAR-WSLFFSRFHFVLTYRPGTKNRKADALSRSFSPEDRLPIEREPIIPPSRIIASVLPQFAEQILLSQSAAPPDTPIGMAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLQRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPVPSRPWTHLGMDFIVELPPSCG--------NTVIWVVIDRFSKMAHFVPLRKLPSAVELAQLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQD----DWSDLLPWAEFAHNNASHSSTGRSPFLSVYGQHPLAFPQDFLLSKVPAADDLAAHMSVIWAATKSNLEKSSLVHKTFADRRRKPSPPYKVGEKV 1282          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Match: ENSXETT00000034712.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:9:33567219:33572531:-1 gene:ENSXETG00000011772.1 transcript:ENSXETT00000034712.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 308.531 bits (789), Expect = 2.602e-84
Identity = 265/978 (27.10%), Postives = 426/978 (43.56%), Query Frame = 3
Query:  900 ITVRMNGENF--DCLLDTGARINVMSVNCFNKLRGQQLTKSDDKLRC--------ANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFGFRLLKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQLKINEDS--KLKEIITNSGNVFMADKWDVGRTH--------LVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKD--IRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKK-ARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQR-DKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYG--RRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEVRKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKA-----LTITTKEPVKRQDSKEMFEIIFVDI---CGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLIN---ATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK-KISREKWYGSKEIPIKEELEEQTRRKFNVGEEVLVKVETRHKG-QDR---YEGPYKVIEKV 3692
            + +R++ +       LD+GA  N M +  F K  G  L      +R         + ++   T G+  VQ+   ++ K    I+    +P V+G   L+       L    I+ S    +       ++ +  +    + + L+++ ++  S   L  +  +  +VF     +    H        L+   +  RG    + P        + ++E I    + G IR   SP       V   EKKD  +R C+D+R LN +T +  +P+P I E+ D L G+  FS +DL  AY  + + E  + KTAF+T++G Y +  MPFG+  AP  FQE +  +   +  K  +VYLDDILI+S   E H + + E L  + E  L    EKC     +I FLG+IIS  G + DP+K+ AIQ +  P   K ++ F+G  NYYR+FI D++ + A +L  +  G   +   W       F+ LK A  +A +L  P+    F ++ DAS    GA+LSQR   DGK    AY S   +S E+ Y I  +ELLAV      ++H L G     T+ TDHK + F+ + K+    Q + W  + S  +  + YR G  ++ AD +SR+                  +I   +L    E+   +      +       +P       +++ H  +   H G EK  + ++       +   +++ V +C +C  TKA       +    P+  +    +     V++   CG           I  +ID++SK      + K         +  + I R  G P EI  D G  F S   R   K L + L FSS YH QTNG  ER  + +   +    +  QD    DW+++LP  EF  N     +TG+SP   V+G+  ++  +     E+P  ++L       +   +  L K    HK   DR      PYKV EKV
Sbjct:  343 VQIRLSSQAIPVSAFLDSGAAGNFMDL-AFAKKVGISLFPVTPSIRVFAIDDRPLSTDTITLTTGELSVQIGALHLEKMSFLIIPCPSSPVVLGLPWLR-------LHNPSIDWSSGQIS-------RWSQYCQRHCLIPQPLQRVTVSSTSFSALPSVYRDFSDVFCKKSAEFLPPHRRYDCPIDLLPGTMPPRGRTYPLSPAE-----TAAMKEYISENLQRGFIRPSTSPAGAGFFFV---EKKDGGLRPCIDYRGLNKITVKNRYPLPLISELFDQLKGAKIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDLLGKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENFLFAKLEKCTFEVPKISFLGYIISSRGFEMDPAKVSAIQKWPLPQSTKAIQRFIGFANYYRQFIKDFSSRIAPILSLIRKG--GRPNCWPPVALEAFQSLKDAFISASVLRHPEPHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQAR-WSLFFSRFNFVLTYRPGTKNRKADALSRSFSPEDRLPIEQEPIIPPFRIIASVLPQFAEQILLSQSAAPSDTPIGMAFVPPELRLPILQQTHSSKQAGHPGSEKTLELLRRLVWWPTIRKDVRDFVAACTVCATTKASHSRPCGLLHPLPIPSRPWTHLGMDFIVELPPSCG--------NTVIWVVIDRFSKMAHFIPLRKLPSAVELAHLFIQHIFRLHGFPVEIVSDRGSQFVSRFWRSLCKSLGVSLQFSSAYHPQTNGAAERVNQALEQFLRNHVSLCQD----DWSDLLPWAEFAHNNASHSSTGRSPFLSVYGQHPLAFPQDLLLSEVPAADDLAAHMSVIWAATKSNLEKSSLVHKTFADRRRKPSPPYKVGEKV 1282          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Zbed5 (zinc finger, BED type containing 5 [Source:MGI Symbol;Acc:MGI:1919220])

HSP 1 Score: 269.626 bits (688), Expect = 4.948e-87
Identity = 159/454 (35.02%), Postives = 258/454 (56.83%), Query Frame = 1
Query: 4444 KKRIKDMSHDIEEIVNYKLSKKYF---ALQIDESVDISSKAQLLALVRFIDENEIVNQFLCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFPSI----------QTNNVTDIIPAIIEHLTILDEIIACYFSSLKLESYDWIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSNEHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEICKTRQAQISH 5766
            ++RI DMS DI + V  ++        ++Q+DES D+++  QL+  VR+I++ +  ++FLCC+ L    T  D+F  + ++L + +ISW    G+CTDG P+  GC  G   LV  ++P  I  HC LHL+ L  KTLP   +  +  V+  VN++K+  L +R+F QL   +D   ++LLLHTE R LSRGKVL R+ EL+DEL  +F   A+ +F   F ++    K+AYL DIF  LN +N S+QG N   L  ++KI +F  K+  W+ ++ +N    M P++          Q   +T +I ++ EHL +L   I+ YF +L    +   R+PF     +       +E+   L+ + + + +F+ M    FW+   + +P LS+  + + L F T+YLCE GFS+L  IK+K R RL  +E+++R A++   P I ++ + +Q+Q SH
Sbjct:  308 RRRIHDMSADILDQVVQEIKSAPLPICSIQLDESTDVANCLQLMVYVRYINDGDFKDEFLCCKPLERTATALDVFEAVDSFLRQHEISWKSICGVCTDGAPATLGCQSGFQRLVLNESPKAIGAHCMLHLQTLAMKTLPQDFQEVMKSVLSSVNFVKASSLNSRLFLQLCSDLDEPSKTLLLHTEGRWLSRGKVLKRIFELRDELKMFFNQKAIRQFEALFSDNSALQKVAYLVDIFTILNELNLSLQGPNSTCLDLSEKIHSFQMKLQLWQKKLDENK-FYMLPTLSAFFEEHDIEQHKRITVVI-SVKEHLDMLASEISWYFPNLPEIPFALARSPFSV--KAEDVPETAQEDFTRLTNSDAARADFSTMPVTQFWVKCLQSYPVLSEMVLRLLLPFPTTYLCETGFSSLLVIKSKYRSRLV-VEDDLRCALAKTTPRISDLVRKKQSQPSH 756          

HSP 2 Score: 75.0998 bits (183), Expect = 4.948e-87
Identity = 39/113 (34.51%), Postives = 63/113 (55.75%), Query Frame = 2
Query: 4034 RQYSDEYISFGFAWTDEKECPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYFKRLLGMQAKQFKGAMTISDKAQIASYKEL*LIALK----LKPHTIAESLILP 4360
            R+Y++ ++ +GF  T       P+CV+CGV LS  +M P KL RHF + H++   K+ +YF+       K      + S+K  +A+ +   L+AL+    +KPHT AE L+ P
Sbjct:  167 RKYNEGFLQYGFTSTITVGIERPQCVICGVVLSAESMKPNKLKRHFESKHSSFAGKDTNYFRSKADGLKKARPDTGSKSNKQNVAAVETSYLVALRIARDMKPHTFAEHLLFP 279          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Zmym6 (zinc finger, MYM-type 6 [Source:MGI Symbol;Acc:MGI:106505])

HSP 1 Score: 200.29 bits (508), Expect = 1.356e-59
Identity = 146/449 (32.52%), Postives = 239/449 (53.23%), Query Frame = 1
Query: 4450 RIKDMSHDIEEIVNYKLSK-KYFALQIDESVDISSKAQLLALVRFID--ENEIVNQFLCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIIS-THCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFP---------SIQTNNVTDIIPAIIEHLTILDEII-ACYFSSLKLESYD-WIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSNEHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEICKTRQ 5751
            RIK +S DIE+ +  K+ + ++FALQ+DES + +    LL  VRFID    ++  + L C E+   +T  ++F  I  Y++   ++W+ CVG CTDG  SM        + ++E   N ++ THCF+H E L +K L   L   L +  QI++++K+    +++   L   M +++ +L L+ E+R LSRG++L RL EL+ E+   F N   +   + F ++ W AKLAYLADIF  +N +N+S+QG         +K+  F K++  W  R  +N+   MFP          +   N+  I   I EHL  L +I  ACY     L S + W+ +PF T+  SN     +EE++  LS +   +     MS   FW+  +  +P L +KA+ + L FS++ LC+  FS LT     K+  L      +R+A++ + P I ++ K ++
Sbjct:  718 RIKKLSDDIEDQLLQKVRESRWFALQVDESSEATEVPLLLCYVRFIDYDRGDVKEELLFCTEMPSPSTDLEVFELINKYIDSRSLNWNHCVGFCTDGAASMTDRYFRLRSKIQEIAKNTVTFTHCFIHREHLAAKKLSPCLHEILLQSSQILSFVKNSASDSQMLTILCEEMGSEHVNLPLNAEVRWLSRGRILTRLFELRHEIEI-FLNQKHSDLARYFHDEEWIAKLAYLADIFSLINKLNSSLQGTMTTFFNLYNKVDVFQKRLKMWLKRAQEND-YGMFPLFSEFLDSSDVSVKNIASI---IFEHLEGLSQIFHACYPPEEDLRSGNLWLTDPFATYH-SNNLTDSEEEKLAVLSADTGFQSVHKSMSVTQFWVNAKTSYPKLHEKALKLLLPFSSTCLCDATFSALT---ASKQRDLRTCGPTLRLAVTSLVPRIEKLAKEKE 1157          

HSP 2 Score: 52.373 bits (124), Expect = 1.356e-59
Identity = 41/120 (34.17%), Postives = 63/120 (52.50%), Query Frame = 2
Query: 4031 NRQYSDEYISFGF--AWTDEKECPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYF-KRLLGM--QAKQFKGAMTISDKAQIASYKEL*LIALKLKPHTIAESLILPSCCEI 4375
            ++ Y+ EYI FGF      E+  P P+CV+CG  L + ++ P  L+ H    H++L +K  D+F ++ L M  Q    K  + + +    ASY     IA + KP +IAE LI P   E+
Sbjct:  573 SQTYNAEYIRFGFIICSGSEESSPSPQCVICGEVLPSESVMPVSLSNHLKAKHSDLENKPVDFFEEKSLEMECQNSSLKKCLLVEESLVKASYLIAFQIAARNKPFSIAEELIKPYLVEM 692          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Zmym6 (zinc finger, MYM-type 6 [Source:MGI Symbol;Acc:MGI:106505])

HSP 1 Score: 199.904 bits (507), Expect = 1.710e-59
Identity = 145/449 (32.29%), Postives = 239/449 (53.23%), Query Frame = 1
Query: 4450 RIKDMSHDIEEIVNYKLSK-KYFALQIDESVDISSKAQLLALVRFID--ENEIVNQFLCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIIS-THCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFP---------SIQTNNVTDIIPAIIEHLTILDEII-ACYFSSLKLESYD-WIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSNEHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEICKTRQ 5751
            RIK +S DIE+ +  K+ + ++FALQ+DES + +    LL  VRFID    ++  + L C E+   +T  ++F  I  Y++   ++W+ CVG CTDG  SM        + ++E   N ++ THCF+H E L +K L   L   L +  QI++++K+    +++   L   M +++ +L L+ E+R LSRG++L RL EL+ E+  +  N   +   + F ++ W AKLAYLADIF  +N +N+S+QG         +K+  F K++  W  R  +N+   MFP          +   N+  I   I EHL  L +I  ACY     L S + W+ +PF T+  SN     +EE++  LS +   +     MS   FW+  +  +P L +KA+ + L FS++ LC+  FS LT     K+  L      +R+A++ + P I ++ K ++
Sbjct:  810 RIKKLSDDIEDQLLQKVRESRWFALQVDESSEATEVPLLLCYVRFIDYDRGDVKEELLFCTEMPSPSTDLEVFELINKYIDSRSLNWNHCVGFCTDGAASMTDRYFRLRSKIQEIAKNTVTFTHCFIHREHLAAKKLSPCLHEILLQSSQILSFVKNSASDSQMLTILCEEMGSEHVNLPLNAEVRWLSRGRILTRLFELRHEIEIFL-NQKHSDLARYFHDEEWIAKLAYLADIFSLINKLNSSLQGTMTTFFNLYNKVDVFQKRLKMWLKRAQEND-YGMFPLFSEFLDSSDVSVKNIASI---IFEHLEGLSQIFHACYPPEEDLRSGNLWLTDPFATYH-SNNLTDSEEEKLAVLSADTGFQSVHKSMSVTQFWVNAKTSYPKLHEKALKLLLPFSSTCLCDATFSALT---ASKQRDLRTCGPTLRLAVTSLVPRIEKLAKEKE 1249          

HSP 2 Score: 52.373 bits (124), Expect = 1.710e-59
Identity = 41/120 (34.17%), Postives = 63/120 (52.50%), Query Frame = 2
Query: 4031 NRQYSDEYISFGF--AWTDEKECPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYF-KRLLGM--QAKQFKGAMTISDKAQIASYKEL*LIALKLKPHTIAESLILPSCCEI 4375
            ++ Y+ EYI FGF      E+  P P+CV+CG  L + ++ P  L+ H    H++L +K  D+F ++ L M  Q    K  + + +    ASY     IA + KP +IAE LI P   E+
Sbjct:  665 SQTYNAEYIRFGFIICSGSEESSPSPQCVICGEVLPSESVMPVSLSNHLKAKHSDLENKPVDFFEEKSLEMECQNSSLKKCLLVEESLVKASYLIAFQIAARNKPFSIAEELIKPYLVEM 784          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Gtf2ird2 (GTF2I repeat domain containing 2 [Source:MGI Symbol;Acc:MGI:2149780])

HSP 1 Score: 120.553 bits (301), Expect = 9.918e-27
Identity = 110/392 (28.06%), Postives = 188/392 (47.96%), Query Frame = 1
Query: 4513 FALQIDESVDISSKAQLLALVRFIDEN-EIVNQFLCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKE------KNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFPSIQT-----NNVTDIIPAII----EHLTILDEIIACYFSSLKLESYDWIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSNEHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTL 5640
            +++ IDE  DI+   QL   +R +D+N ++  + L    +T   +G +IF  +   L+K  I W   V + + G P+M     G VT ++       K  ++ S  C +H E L ++ L   +   +D VV  VN+I SR L    F  L   +D++Y SLL HT ++ L RG VL R  E  +E+  +    +  K V    +  W   LA+L D+  +LN+++ S+QG ++ +    D I AF  K+  W+  + +NN L  FP++++     ++  + IP I+    E    L +  +C  S L L S     +PF T    +      + E+I L  N  L+ ++ K+    F+  +   +P        +   FS++++CE  FS L
Sbjct:  541 YSIAIDEITDINDTTQLAIFIRGVDDNFDVSEELLDTVPMTGAKSGNEIFLRVEKSLKKFSIDWSKLVSVASTGTPAMMDANSGLVTKLRARAASCCKGADLKSVRCIIHPEWLCAQKL--RMGHVMDVVVDSVNWICSRGLNHGDFTTLLYELDSQYGSLLYHTALKWLGRGLVLRRFFESLEEIDSFMS--SRGKPVPQLSSRDWILDLAFLVDMTTHLNTLDASLQGHSQIVTQMYDFIRAFLAKLCLWETHLARNN-LAHFPTLKSVSRSESDGLNYIPKIVELKAEFQRRLSDFKSCE-SELTLFS-----SPFST--TIDSVREELQMEVIDLQCNTVLRTKYDKVGVPDFYKHLWSSYPKYRSHCARMLSMFSSTHICEQLFSIL 919          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Match: Rtl1 (retrotransposon Gaglike 1 [Source:MGI Symbol;Acc:MGI:2656842])

HSP 1 Score: 75.8702 bits (185), Expect = 6.197e-13
Identity = 70/297 (23.57%), Postives = 135/297 (45.45%), Query Frame = 3
Query: 1614 NAVTERQAFPMPNIDEMLDLLGGSVFFSSIDL-----GNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGAMV--YLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAK-KARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKD-GKERVIAYGSHSMTSHEKGYCITRKELLAV 2477
            + +T+RQ +    + E+ D L G+ +F+ ++L         + V   E++   +     +   C+   PF + +       ++  +L  I G  V  +  ++L+YS S+E+H   + +VL       +  + +K Q  ++    LG  IS +GV+ + + +  I     P   + L+S + +   YR F+ ++A   A ++ QL    S++   W E      E LK A   +P+L  P  +  F L+TD +   + A L Q D + GK+   A+ S  +++ E  Y      +L +
Sbjct:  898 DMLTDRQDYTQ-MVPELFDQLHGAAWFTKLELLGIKESEMRHTVTHTEDTWRASFGFGLHQMRCYR--PFTMNSYSDEGNNIVHFILKDILGLFVICHGREVLVYSMSQEEHSQHVRQVLVRFRYHNIYCSLDKTQFHRQTAEILGFNISPKGVKLNKNLMNLIVGCPVPGSRRCLQSVIDLVYPYRHFVENFAVIAAPLVRQLL---SSEPYYWGEEEQEALESLKRAFRKSPVLYHPKPQNPFYLETDITGSFLSASLVQTDDETGKKSTCAFYSRPLSTMEVEYPRVEMRILPI 1188          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|Q99315|YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 345.125 bits (884), Expect = 5.334e-95
Identity = 266/898 (29.62%), Postives = 419/898 (46.66%), Query Frame = 3
Query: 1326 KLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGK-ERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRT---------------------KCDTCVQCQMAH-EEAKKGKIKTRLLDSIR---------EEGRSNIQHGIVEEV---RKKTMIPENELQETIKEIHRLLC---HAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTK---EPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFG-------------------KKISREKWYGSKEIPIKEELEE---------QTRRK---FNVGEEVLVKVETRHKG------QDRYEGPYKVIEKVHDRRYILRNEDGKRIER--NVEKLKNFLRR 3776
            K +EII N      AD  ++   H ++ K   R     ++PY      E +I + +Q L +N  I    SP ++P++ V KK+    RLC+D+R LN  T    FP+P ID +L  +G +  F+++DL + Y+Q+ +E + + KTAF T +G+Y +  MPFG+  AP TF   M      ++   VYLDDILI+S S E+H+  L  VL+ ++   L +  +KC+   EE  FLG+ I  + +     K  AI++F  P  VK+ + FLG+ NYYRRFI + +K A+ ++         K +W+E  +   + LK AL  +P+L   + K  + L TDAS D IGAVL + D   K   V+ Y S S+ S +K Y     ELL +     HF++ L+G+ FTLRTDH ++  +    +P   + Q W++ L++ D  +EY  G  +  AD +SR                      K D      + H +E  +  +    + + R         E  R N  + + +E+   + + ++P  +    ++  H       H GV      +   Y    L   I + +R+C  CQ  K+         +P+   + + +   I +D    L  T      IL ++D++SK     A  K  + T    +L ++I  + G P+ I  D      +   +E TK L IK   SS  H QT+GQ ER  +T+  L+ A      + +W   LP++EF  N+T  +T GKSP EI  G                     +   K   +  I  KE+LE            RRK    N+G+ VLV  +   K       Q  Y GP++V++K++D  Y L     K+  R  NV+ LK F+ R
Sbjct:  563 KYREIIRNDLPPRPADINNIPVKHDIEIKPGARLP--RLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKDGT-FRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICD----KSQWTEKQDKAIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEP-ARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKN--YSLEDEMIYYQDRLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWL--DISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYASTNIQ-NWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEVNARSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVHRDAYFKKGAYMKVQQIYVGPFRVVKKINDNAYELDLNSHKKKHRVINVQFLKKFVYR 1447          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|Q7LHG5|YI31B_YEAST (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 344.354 bits (882), Expect = 5.594e-95
Identity = 265/895 (29.61%), Postives = 418/895 (46.70%), Query Frame = 3
Query: 1326 KLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGK-ERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRT---------------------KCDTCVQCQMAH-EEAKKGKIKTRLLDSIR---------EEGRSNIQHGIVEEV---RKKTMIPENELQETIKEIHRLLC---HAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTK---EPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRF-GCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFG-------------------KKISREKWYGSKEIPIKEELEE---------QTRRK---FNVGEEVLVKVETRHKG------QDRYEGPYKVIEKVHDRRYILRNEDGKRIER--NVEKLKNF 3767
            K +EII N      AD  ++   H ++ K   R     ++PY      E +I + +Q L +N  I    SP ++P++ V KK+    RLC+D+R LN  T    FP+P ID +L  +G +  F+++DL + Y+Q+ +E + + KTAF T +G+Y +  MPFG+  AP TF   M      ++   VYLDDILI+S S E+H+  L  VL+ ++   L +  +KC+   EE  FLG+ I  + +     K  AI++F  P  VK+ + FLG+ NYYRRFI + +K A+ ++         K +W+E  +   E LK AL  +P+L   + K  + L TDAS D IGAVL + D   K   V+ Y S S+ S +K Y     ELL +     HF++ L+G+ FTLRTDH ++  +    +P   + Q W++ L++ D  +EY  G  +  AD +SR                      K D      + H +E  +  +    + + R         E  R N  + + +E+   + + ++P  +    ++  H       H GV      +   Y    L   I + +R+C  CQ  K+         +P+   + + +   I +D    L  T      IL ++D++SK     A  K  + T    +L ++I  + G P+ I  D      +   +E TK L IK   SS  H QT+GQ ER  +T+  L+ A +    + +W   LP++EF  N+T  +T GKSP EI  G                     +   K   +  I  KE+LE            RRK    N+G+ VLV  +   K       Q  Y GP++V++K++D  Y L     K+  R  NV+ LK+ 
Sbjct:  589 KYREIIRNDLPPRPADINNIPVKHDIEIKPGARLP--RLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKDGT-FRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICD----KSQWTEKQDKAIEKLKAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEP-ARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAIYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKN--YSLEDEMIYYQDRLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWL--DISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYVSTNIQ-NWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEVNARSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVHRDAYFKKGAYMKVQQIYVGPFRVVKKINDNAYELDLNSHKKKHRVINVQFLKSL 1470          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|P04323|POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 328.176 bits (840), Expect = 7.799e-92
Identity = 209/639 (32.71%), Postives = 341/639 (53.36%), Query Frame = 3
Query:  888 GRPS-ITVRMNGENFDCLLDTGARINVMSVNCFNKL--RGQQLTKSDDKLRCANESTI---ETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQET----FGFRLLKIKDIEASEKDK----NYICNIEAKFGRKIKD--EERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKE----KKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCV---QCQMAHEE 2729
            G+P  IT++    N  CL+DTG+ +N+ S N F+           + +     N+S I   + +  T  +  +   S+    ++  K+  +    I  ++     +  +   I+ I   E+      N I +   +   KI    E  L R LE L   E  +L  ++    ++   +   +  T+  KH I T+        Y  P   E ++E  IQ++   GIIR  NSP+N+P+  V KK+    K+  R+ +D+R+LN +T     P+PN+DE+L  LG   +F++IDL   ++Q+E++ ES  KTAFSTK+G Y + RMPFG+  AP TFQ  M  +L  +  K  +VYLDDI+++S+S ++H   LG V + + +A L++  +KC+ +K+E  FLGH+++ +G++ +P KI+AIQ +  P   K++++FLG+  YYR+FI ++A  A+ + + C   + K    +   +S F+ LK  ++  PIL  PD  K+F L TDAS   +GAVLSQ   DG    ++Y S ++  HE  Y    KELLA+ +    F+HYL GR F + +DH+ ++++   K P  S+   W   LS  D  ++Y KG  +  AD +SR K +      Q Q + EE
Sbjct:   11 GKPQYITIKYKENNLKCLIDTGSTVNMTSKNIFDLPIQNTSTFIHTSNGPLIVNKSIIIPSKILFPTTNEFLLHPFSENYDLLLGRKLLAEAKATISYRDQEVTLYNNKYKLIEGIATHEQSHFQNVNMIPDTMLRQPNKISPILESDLYR-LEHLNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTINTKHNLPLYSKYSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTK-CLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQ---DG--HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDP-NSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEETYLSEQTQHSAEE 641          

HSP 2 Score: 62.3882 bits (150), Expect = 4.826e-8
Identity = 71/288 (24.65%), Postives = 120/288 (41.67%), Query Frame = 3
Query: 2847 ELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMF-EIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINAT-LQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSKEIPIKEELEEQTRRKFNVGEEVLVKVETRHKG----QDRYEGPYKVIEKV 3692
            E +E I   H  L H G++K      + Y   +    IQ I+  C IC   K     T  P K     E   E   +DI           K+ +  ID YSK+  L  I  +D    K  ++ +   + G PK ++ D   AF S  ++ + +  E++L  ++         IER  +TI + I      D ++T  +++   +    + TK  TTG++PA I           Y  + I   ++ +E    K N  + V  +V+TR++     + + E P+K  + V
Sbjct:  749 EFKELILTAHEKLLHPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKTEHRNTDMPTKTTPKPEHCREKFMIDIYS------SEGKHYVSCIDIYSKFATLEEIKTKDWIECKNALM-RIFNQLGKPKLLKADRDGAFSSLALKRWLESEEVELQLNTT--KTGVADIERLHKTINEKIRIIKTSDDEETKLSKMETVLNIYNHKTKHDTTGQTPAHIFL---------YAGQPILDTQQNKENKINKIN-NDRVEYEVDTRYRKGPLQKGKLENPFKPTKNV 1017          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|A4Z943|ZBED5_BOVIN (Zinc finger BED domain-containing protein 5 OS=Bos taurus OX=9913 GN=ZBED5 PE=4 SV=1)

HSP 1 Score: 289.271 bits (739), Expect = 1.628e-90
Identity = 179/449 (39.87%), Postives = 266/449 (59.24%), Query Frame = 1
Query: 4447 KRIKDMSHDIEEIVNYKLSK-KYFALQIDESVDISSKAQLLALVRFIDENEIVNQFLCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFPSI-----QTNNVTD--IIPAIIEHLTILDEIIACYFSSLKLESYDWIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSNEHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEIC-KTRQAQISH 5766
            +RIKD++ DIEE +  +L     F+LQ+DES D+S  A LL  VR+     I    L C  L  + TG++IFNCI ++++K +I W+ CV +C+D   +M G I   VTL+K   P   S+HC L+   L  KT+P +LK+ LD+ VQI+NYIK+RP Q+R+ K L   M A++ +LLL+TE+R LSRGKVL RL EL+ ELL +  +    +      N  W  +LAYLADIF  LN VN S+QGKN  + T  DK+ +  +K+ +W + + + N  D FP++     + N+  D  I  AI++HL  L   +  YF  +  + + W+RNPF         ++   E +I L+++  +K  F+++S   FW  + +E+PS++++A+ + L F+T +LCE GFS     KTK R+RL D    MR+ +S I PNI  IC K  Q   SH
Sbjct:  251 RRIKDLAADIEEELVCRLKICDGFSLQLDESADVSGLAVLLVFVRYRFNKSIEEDLLLCESLQSNATGEEIFNCINSFMQKHEIEWEKCVDVCSDASRAMDGKIAEAVTLIKYVAPESTSSHCLLYRHALAVKTMPTSLKNVLDQAVQIINYIKARPHQSRLLKILCEEMGAQHTALLLNTEVRWLSRGKVLVRLFELRRELLVFMDSAF--RLSDCLTNSSWLLRLAYLADIFTKLNEVNLSMQGKNVTVFTVFDKMSSLLRKLEFWASSVEEEN-FDCFPTLSDFLTEINSTVDKNICSAIVQHLRGLRSTLLKYF-PVTNDIHAWVRNPFTVTVKPASLVARDYESLIDLTSDSQVKQSFSELSLNDFWSSLIQEYPSVARRAVRVLLPFATMHLCETGFSYYAATKTKYRKRL-DAAPHMRIRLSNITPNIKRICDKKTQKHCSH 694          

HSP 2 Score: 69.707 bits (169), Expect = 1.628e-90
Identity = 43/119 (36.13%), Postives = 63/119 (52.94%), Query Frame = 2
Query: 4034 RQYSDEYISFGFAWTDEKECPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYFKRLLGM----QAKQFKGAMTISDKAQIASYKEL*LIALKLKPHTIAESLILPSCCEIV 4378
            R+Y + Y+SFGF +   ++ P  +CV+C   LSNS++ P+KL RH    HA    K+  +FK+ L      +    K   T ++ A  ASY     IAL  + HTI E LI P   ++V
Sbjct:  109 RKYDESYLSFGFTYFGNRDAPHAQCVLCKKILSNSSLAPSKLRRHLETKHAAYKDKDISFFKQHLDSPENNKPPTPKIVNTDNESATEASYNVSYHIALSGEAHTIGELLIKPCAKDVV 227          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Match: sp|A4Z944|ZBED5_CANLF (Zinc finger BED domain-containing protein 5 OS=Canis lupus familiaris OX=9615 GN=ZBED5 PE=4 SV=1)

HSP 1 Score: 286.574 bits (732), Expect = 9.086e-90
Identity = 178/449 (39.64%), Postives = 263/449 (58.57%), Query Frame = 1
Query: 4447 KRIKDMSHDIEEIVNYKLSK-KYFALQIDESVDISSKAQLLALVRFIDENEIVNQFLCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKGCVTLVKEKNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSRPLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVXXXXXXXXXXXXXYFQNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTDKILAFNKKILYWKNRITKNNTLDMFPS-------IQTNNVTDIIPAIIEHLTILDEIIACYFSSLKLESYDWIRNPFGTFEFSNXXXXXXXXXXXXXXTNRSLKMEFTKMSNEHFWIFVQEEHPSLSKKAITI*LQFSTSYLCELGFSTLTNIKTKKRERLTDLEEEMRVAISYIRPNIGEIC-KTRQAQISH 5766
            +RIKD++ DIEE +  +L     F+LQ+DES D+S  A LL  VR+     I    L C  L  + TG++IFNCI ++++K +I W+ CV +C+D   +M G I   VTL+K   P   S+HC L+   L  K +P +LK+ LD+ VQI+NYIK+RP Q+R+ K L   M A++ +LLL+TE+R LSRGKVL RL EL+ ELL +  +    +      N  W  +LAYLADIF  LN VN S+QGKN  + T  DK+ +  +K+ +W + + + N  D FP+       I +    DI  AI++HL  L   +  YF  +  ++  W+RNPF         ++   E +I L+++  +K  F+++S   FW  + +E+PS++++A+ + L F+T +LCE GFS     KTK R+RL D    MR+ +S I PNI  IC K  Q   SH
Sbjct:  251 RRIKDLAADIEEELVCRLKICDGFSLQLDESADVSGLAVLLVFVRYRFNKSIEEDLLLCESLQSNATGEEIFNCINSFMQKHEIEWEKCVDVCSDASRAMDGKIAEAVTLIKYVAPESTSSHCLLYRHALAVKIMPTSLKNVLDQAVQIINYIKARPHQSRLLKILCEEMGAQHTALLLNTEVRWLSRGKVLVRLFELRRELLVFMDSAF--RLSDCLTNSSWLLRLAYLADIFTKLNEVNLSMQGKNVTVFTVFDKMSSLLRKLEFWASSVEEEN-FDCFPTLSDFLTEINSTVDKDICSAIVQHLRGLRSTLLKYF-PVTNDNNTWVRNPFTVTVKPASLVARDYESLIDLTSDSQVKQNFSELSLNDFWSSLIQEYPSIARRAVRVLLPFATMHLCETGFSYYAATKTKYRKRL-DAAPHMRIRLSNITPNIKRICDKKTQKHCSH 694          

HSP 2 Score: 69.707 bits (169), Expect = 9.086e-90
Identity = 43/119 (36.13%), Postives = 63/119 (52.94%), Query Frame = 2
Query: 4034 RQYSDEYISFGFAWTDEKECPIPKCVVCGVELSNSAMFPAKLNRHFTNSHANLVSKNNDYFKRLLGM----QAKQFKGAMTISDKAQIASYKEL*LIALKLKPHTIAESLILPSCCEIV 4378
            R+Y + Y+SFGF +   ++ P  +CV+C   LSNS++ P+KL RH    HA    K+  +FK+ L      +    K   T ++ A  ASY     IAL  + HTI E LI P   ++V
Sbjct:  109 RKYDESYLSFGFTYFGNRDAPHAQCVLCKKILSNSSLAPSKLRRHLETKHAAYKDKDISFFKQHLDSPENNKPPTPKIVNTDNESATEASYNVSYHIALSGEAHTIGELLIKPCAKDVV 227          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A5J4NQX1 (Uncharacterized protein OS=Paragonimus westermani OX=34504 GN=DEA37_0008956 PE=4 SV=1)

HSP 1 Score: 540.806 bits (1392), Expect = 3.482e-155
Identity = 306/946 (32.35%), Postives = 499/946 (52.75%), Query Frame = 3
Query:  894 PSITVRMNGENFDCLLDTGARINVMSVNCFNKLRGQQLTKSDDKLRCANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGG----IELQETFGFRLLKIK----DIEASEKDKNYICNIEAKFGRKIKDEE-------RLIRA--LEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVT-RGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLK---KEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTK------CDTCVQCQMA-----HEEAKKGKIKTRLLD------SIREEG-RSNIQH--GIVEEVR-------------------------KKTMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKI 3527
            P +  ++ G +F+CLLD+G+  +++    +   +   +     ++   N   +  +   +  V IG  +  + F V  K+   V+ G    I +Q       L ++            K     +E + G+ +   E       +  R+  +  L  +   +++++I     +F  +    GRTH V+H+I T    PI     R P++L+ ++E+ + ++ E GIIR   SPW +P++ V KK+   +RLC+D+RQLNAVT++ +FP+P ID+ +D L G+ +FS++DL + Y+QVE+E   + KTAF   +G Y F  MPFG+  AP TFQ +M +VLG +  K  +VYLDD++++  + ++H   L  VL  + EAGL++NP KC  ++ E+++LGH+IS+ GV  DP K++ ++ +  P  V+++RSFLG+ +YYRRF+  ++  A+ L  L      +  EWSE     F+ LK AL TAPIL  PD +   K FILDTDAS   IGAVLSQ D + KE VIAYG+ ++   E+ YC TRKE+LA+ +F  HF+HYL GRRF +RTDH ++ ++   K P   Q   W  +L   +   ++R G+ H+NAD +SR        C +C +  MA      E  +   I++   D       I+E G R   Q   G   E R                          + ++PE+ ++  ++E+H  L HAG  K+ +  K R+   H    ++++  SC +C + K        P++   +    EI+ VDI GPL ET    +YIL ++D ++K+     + + D  T  K I + WI R+G P ++  D G  FES MM E  ++L I+   ++ YH Q NG +ER  RT++ L+ A +       W E + +      A+   +T +SP  ++ G+++
Sbjct:  285 PVLKAKVGGRDFECLLDSGSVCSLLDKETYTICQFDAIAGPGSQVLAVNGQPLHILACVRGTVQIGKWTCTLNFFVVPKLPFHVVLGSNYLITMQCILDLPNLAVQVYGERFPLEPVYKTLAGVVEVRTGQHVTKGEVHAEFEWQKFRSTIISHLPAHLAQRVRDVIEKHRAIFWWEGQPPGRTHFVRHRIDTGEAHPIRQAARRLPVHLQGQVEQLLTDMLEKGIIRPSTSPWASPVVLV-KKKDGSLRLCVDYRQLNAVTKKDSFPLPRIDDTIDALSGTEWFSTLDLASGYWQVEVEPSDRLKTAFVVPSGLYEFETMPFGLTNAPATFQRLMQQVLGDLIPKQCLVYLDDVIVHGRTVDEHLHNLSNVLSRLAEAGLKLNPAKCSFMRTEVKYLGHVISQSGVSCDPEKVRKVKEWPTPESVEEIRSFLGLASYYRRFVPRFSHIAKPLTLLTE--KGRVFEWSEDCQQSFDRLKDALCTAPILALPDTRADAKMFILDTDASDVAIGAVLSQLDNNDKEHVIAYGNRNLNRSERNYCTTRKEMLALVHFIKHFRHYLLGRRFLVRTDHSSLQWLQNFKDP-EGQVARWQEFLQEYNFECKHRPGLRHRNADALSRRPIRDHGDCPSCTRYSMAITIRPEENNRWAAIQSSDADLQIIYRRIKEGGPRPTTQEMAGSSWEARCLWSLWNHLYIADNVLYYQYGPTYESQIVVPESAVRSVLQELHENLGHAGQNKLEEAAKRRFWWPHQRRDVKDVCSSCGLCAQIKNPKRQQCAPLQSMLTGYPNEIVEVDIVGPLPETTRGNRYILVMVDHFTKWCEAVPVPRADGGTTAKFIFDHWISRWGAPTQLHTDRGSNFESTMMHELCRILGIRKTRTTAYHPQGNGAVERVNRTLKSLLKAFVSSENTRSWDEAVSQCLLAYRASVHSSTAQSPHYLLTGREM 1226          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A0V1KKR9 (Transposon Ty3-G Gag-Pol polyprotein OS=Trichinella nativa OX=6335 GN=TY3B-G PE=4 SV=1)

HSP 1 Score: 516.538 bits (1329), Expect = 1.008e-154
Identity = 290/932 (31.12%), Postives = 490/932 (52.58%), Query Frame = 3
Query: 1278 ERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVT-RGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGA--MVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTC----------VQCQMAHEEAKKGKIKTRLLDS---------------------------------IREEGRSNI-QHGIV---------EEVRKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKI-------------------------SREKWYGSKEIPIKEELEEQTRR------------KFNVGEEVLVKVETRHKGQDRYEGPYKVIEKVHDRRYILRNEDGKR--IERNVEKLKNFLRRGI 3782
            E+++ + ++      S L  I+    +V      D+GRT LV+H I T   +P+   P R P +  +++E  +  +    ++   +SPW +P++ V KK+    R C+D+RQLN +T + A P+P ID+ LD L G+ +FS++DL + Y+QVE+E + +EKTAF+T  G Y F  MPFG+  AP TFQ +M   L  + G+  +VYLDD++++  + E+H   L EV + + E GL++ PEKC+++K  + +LGHIIS++G+ TDPSK  A++ +  P CV +LR FLG+ +YYR+F++ +A  A  L +L       + +WS+   S F+ LK  LT+AP+L +PD  ++FI+D DAS D +GAVLSQR++   ERV+AY S ++T  E+ YC TR+E+L + +    F+ YLYG+RF +RTDH  + ++ T ++P   Q   W+  L+ LD  VE+R G  H NAD +SRT C  C          VQ      E      K +LL +                                 + ++ R+ + + G++         EE  K+ ++P     E ++ +H  R   H G  +    ++ R+    +   +    R+C  C + K  T   + P++   +    + + +DI GPL +T    +Y+L + D ++K+     +   +  T+ K ++EK++  FG P  +  D G++FE+ ++ E  ++  IK   SSPYH Q NGQ ER  RT+ D+++  + D     W ++LP V    N++  ++TG +PA  + G+++                         +RE+     E+  ++ L+ Q RR            +F   + V + +  R K    +EGPY+V+E +  + Y +R+ + KR  I  + +++K +  R I
Sbjct:   53 EKMLPSEQESSGKHRSALAAILKEFADVLSTSDEDLGRTSLVRHAIHTGDAKPVRCSPRRIPYHQRAQVESLLDEMLRQDVVEPSSSPWASPIVLVRKKDGS-CRFCVDYRQLNNLTRKDAHPLPRIDDTLDALAGAQWFSTLDLASGYWQVEVEPQDREKTAFTTPLGLYQFKVMPFGLCNAPATFQRLMEIALRGLVGSDCLVYLDDVIVFGKTAEEHTARLREVFRRLREVGLKVKPEKCRLMKRRVAYLGHIISEKGIATDPSKTSAVREWPTPTCVSELRQFLGLASYYRKFVNGFANVAAPLHRLLE--KGAEWDWSKACQSAFDTLKYHLTSAPVLAYPDFHRQFIVDVDASGDGLGAVLSQREEKA-ERVVAYASRTLTKAERRYCATRREMLGLVWALREFRPYLYGQRFLVRTDHSCLRWLTTFREP-EGQVARWLESLAELDFEVEHRAGRLHGNADALSRTSCAQCGRLVEGSACAVQAAQLRTEDVAQSFKDQLLAAQQADPEIQLLRQWLVGASWPVECPPECSRDMHVLWQQRRTWVDEDGLIWRHRRGLTAEEGAKQALVPRALRNEVLQSMHDSRYAGHLGERRTLARVRSRFYWPGMSGDVHTWCRTCTQCARRKGPTKNNRAPMQAMAAGYPLQRVGMDILGPLEKTPSGNRYVLVLTDYFTKWTAAFPLANMEASTVAKVLVEKYVAYFGAPDCLHSDQGRSFEASVVLEMCRLFGIKKTRSSPYHPQGNGQAERFNRTLLDMLS-IMVDGNPGQWDDMLPFVMLAYNSSVHESTGVTPAIAMLGRELRLPLDVQIGNPPGGEAQGLPDYIRETRERIDRVHEL-ARDHLKTQQRRQKYLHDRHAKESRFCPNDRVWLAMPRRGKLDRGWEGPYRVVEVMGPQTYRVRHNERKRRTIVVHSDRMKRYHARDI 977          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A0V1KLB2 (Transposon Ty3-G Gag-Pol polyprotein OS=Trichinella nativa OX=6335 GN=TY3B-G PE=4 SV=1)

HSP 1 Score: 518.079 bits (1333), Expect = 2.233e-154
Identity = 310/1072 (28.92%), Postives = 542/1072 (50.56%), Query Frame = 3
Query:  936 LLDTGARINVMSVNCFNKLRG-QQLTKSDDKLRCANESTIETIGKTKVQVTIGN-------------VSKEVI----FIVAEKVTPDVIGG--------IELQETFGFRLLKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVT-RGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGA--MVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTC----------VQCQMAHEEAKKGKIKTRLLDS---------------------------------IREEGRSNI-QHGIV---------EEVRKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKI-------------------------SREKWYGSKEIPIKEELEEQTRR------------KFNVGEEVLVKVETRHKGQDRYEGPYKVIEKVHDRRYILRNEDGKR--IERNVEKLKNFLRRGI 3782
            L+DTG+ + + +      L+  + + K   +L  A  + +E       ++ +GN             +S +++    F+     TPD   G        I  +++    L++++  ++       + +  A+     +  E+++ + ++      S L  I+    +V      D+GRT LV+H I T   +P+   P R P +  +++E  +  +    ++   +SPW +P++ V KK+    R C+D+RQLN +T + A P+P ID+ LD L G+ +FS++DL + Y+QVE+E + +EKTAF+T  G Y F  MPFG+  AP TFQ +M   L  + G+  +VYLDD++++  + E+H   L EV + + E GL++ PEKC+++K  + +LGHIIS++G+ TDPSK  A++ +  P CV +LR FLG+ +YYR+F++ +A  A  L +L       + +WS+   S F+ LK  LT+AP+L +PD  ++FI+D DAS D +GAVLSQR++   ERV+AY S ++T  E+ YC TR+E+L + +    F+ YLYG+RF +RTDH  + ++ T ++P   Q   W+  L+ LD  VE+R G  H NAD +SRT C  C          VQ      E      K +LL +                                 + ++ R+ + + G++         EE  K+ ++P     E ++ +H  R   H G  +    ++ R+    +   +    R+C  C + K  T   + P++   +    + + +DI GPL +T    +Y+L + D ++K+     +   +  T+ K ++EK++  FG P  +  D G++FE+ ++ E  ++  IK   SSPYH Q NGQ ER  RT+ D+++  + D     W ++LP V    N++  ++TG +PA  + G+++                         +RE+     E+  ++ L+ Q RR            +F   + V + +  R K    +EGPY+V+E +  + Y +R+ + KR  I  + +++K +  R I
Sbjct:    2 LVDTGSAVTLANERFIRHLKTLRDVPKPSIRLETATATELEITNACVTEIILGNSVTVQHTVLCVRELSHKILLGWDFMRYHGCTPDPTAGCLRMRQGNIPFRKSHAVALVRVESPQS-----ELMAHHPAQ-----EAMEKMLPSEQESSGKHRSALAAILKEFADVLSTSDEDLGRTSLVRHAIHTGDAKPVRCSPRRIPYHQRAQVESLLDEMLRQDVVEPSSSPWASPIVLVRKKDGS-CRFCVDYRQLNNLTRKDAHPLPRIDDTLDALAGAQWFSTLDLASGYWQVEVEPQDREKTAFTTPLGLYQFKVMPFGLCNAPATFQRLMEIALRGLVGSDCLVYLDDVIVFGKTAEEHTARLREVFRRLREVGLKVKPEKCRLMKRRVAYLGHIISEKGIATDPSKTSAVREWPTPTCVSELRQFLGLASYYRKFVNGFANVAAPLHRLLE--KGAEWDWSKACQSAFDTLKYHLTSAPVLAYPDFHRQFIVDVDASGDGLGAVLSQREEKA-ERVVAYASRTLTKAERRYCATRREMLGLVWALREFRPYLYGQRFLVRTDHSCLRWLTTFREP-EGQVARWLESLAELDFEVEHRAGRLHGNADALSRTSCAQCGRLVEGSACAVQAAQLRTEDVAQSFKDQLLAAQQADPEIQLLRQWLVGASWPVECPPECSRDMHVLWQQRRTWVDEDGLIWRHRRGLTAEEGAKQALVPRALRNEVLQSMHDSRYAGHLGERRTLARVRSRFYWPGMSGDVHTWCRTCTQCARRKGPTKNNRAPMQAMAAGYPLQRVGMDILGPLEKTPSGNRYVLVLTDYFTKWTAAFPLANMEASTVAKVLVEKYVAYFGAPDCLHSDQGRSFEASVVLEMCRLFGIKKTRSSPYHPQGNGQAERFNRTLLDMLS-IMVDGNPGQWDDMLPFVMLAYNSSVHESTGVTPAIAMLGRELRLPLDVQIGNPPGGEAQGLPDYIRETRERIDRVHEL-ARDHLKTQQRRQKYLHDRHAKESRFCPNDRVWLAMPRRGKLDRGWEGPYRVVEVMGPQTYRVRHNERKRRTIVVHSDRMKRYHARDI 1056          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A5S6QHB1 (Uncharacterized protein OS=Trichuris muris OX=70415 PE=4 SV=1)

HSP 1 Score: 526.939 bits (1356), Expect = 8.806e-154
Identity = 309/951 (32.49%), Postives = 509/951 (53.52%), Query Frame = 3
Query:  900 ITVRMNGENFDCLLDTGARINVMSVNCFNKLRGQ-QLTKSDDKLRCANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFGFRL-------------LKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQ-----LKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIE-----EAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIK--GAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQ-----------------------CQMAHEEAK---KGKIKTRLLDSIREEGRSN-------------IQHGIV---------EEVRKKTMIPENELQETIKEIHRLLC--HAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKK 3524
            + V++       L+DTGA   ++  + F ++RGQ +L+  + +L  A  + ++ +G   + + +G+ + ++  IV   +    + GI+  +  GF +             L+I    A     N   ++    G+ +   +R+++ L +     L  ++  KL  +++   N F A ++D+GRT ++KH IVT     NI+P R PL   + +E     + IQ L +N II   NSPW   ++ V KK+   +RLC+D+R+LN V+ R A+P+P IDE L+ L G+ +FS+IDL + Y+QVEL E ++EKTAF T +G + FN MPFG+  AP TFQ +M  VL  +K    +VYLDDI+++S + E+H   L +VL  +++AGL+ N  KC++  +E+R+LGHI+S++G++ DPS  + ++ +  P C+ +++ FLG+ +YYR+FI D+A  A+ L QL      K  +W+    S F+ L+ AL + PIL  PD    FILDTDAS   IGAVLSQRD+ G+E  +AY S ++T  E+ YC+TR+E+LAV  F   F+ YL  ++FTLRTDH ++ ++   K P   Q+  W   L   D  +E+R G  H NAD +SR  C  C +                        Q+  E+     K K    +   IR   RSN             IQ GI+            R + ++P+  ++  + + H+ L   H G+EK  + +++R+      S  ++ V SC  C          + P++   +   +E + +DI GPL  ++   +YIL  ID +SK+     I  Q+ +T+ + ++ ++I R+G P+ I  D G  FE+ + +     L I+   ++PYH   NGQ+ER  RT+  +++  + D     W E+LP+V     A+ Q +T  +P  +VFG++
Sbjct:  356 VPVQVRNRCIRMLVDTGAGRTLLRSDEFARIRGQWKLSPCNVRLLSAEGTALDVMGSVSLPLQVGDRTFDMEVIVVNALQFAGLLGIDFLKQHGFVVDLARGTLNSPKQKLRIPLQSAGRTSGNGAWSVSV--GKPVSLSKRMLQQLVETTGVPLTASQRKKLHGMLSKFRNAFAASEFDIGRTSVLKHDIVTD----NIRPVRHPLRRLAPVERKEVSQLIQRLLDNKIIEPSNSPWAAGIVPVRKKDG-SLRLCVDYRKLNEVSRRDAYPIPRIDETLEALAGARYFSTIDLLSGYWQVELTEAAKEKTAFITHDGLFQFNVMPFGLTGAPATFQRLMEHVLAGLKWNTCLVYLDDIIVFSRTAEEHVEHLSQVLNRLQKAGLKANASKCKLFCKEVRYLGHIVSEKGIEPDPSLTEKMRTYPVPTCLAEVKRFLGLASYYRKFIKDFAAIAKPLHQLTE--KRKPFQWTPECTSAFQKLRTALLSEPILRLPDFDASFILDTDASDTAIGAVLSQRDEHGREHPVAYASRTLTRAEQRYCVTRREMLAVITFTDQFRPYLQ-QKFTLRTDHGSLQWLRDFKNP-DGQWARWQQKLQQYDFDIEHRAGSRHANADTLSRIPCKQCGRSGTEVMGVPVNVVALENLEEMRTSQLDDEDIAPILKAKAAGVVGQEIRCGKRSNSKNLLMLNWHRLAIQKGILVRKWFCDDQSGYRWQVVVPKRMIKPVLDQAHQQLTAGHLGIEKTIERIRERFYWPGYRSDTKKYVASCYECNTRNEPVGKGRAPLQPVVTTRRWEKLAIDILGPLVVSETGNRYILVAIDCFSKFAEAFPIPDQEAKTVTRVLVNEFICRYGVPEAIHTDQGSQFEAAIFQSMCTELGIRKTRTTPYHPSGNGQVERMNRTLGTMLSKVV-DENHRKWDEVLPKVMMAYRASIQSSTRMAPYTMVFGEQ 1294          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Match: A0A5S6Q2I3 (Uncharacterized protein OS=Trichuris muris OX=70415 PE=4 SV=1)

HSP 1 Score: 526.554 bits (1355), Expect = 1.081e-153
Identity = 309/951 (32.49%), Postives = 509/951 (53.52%), Query Frame = 3
Query:  900 ITVRMNGENFDCLLDTGARINVMSVNCFNKLRGQ-QLTKSDDKLRCANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFGFRL-------------LKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQ-----LKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIE-----EAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIK--GAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQ-----------------------CQMAHEEAK---KGKIKTRLLDSIREEGRSN-------------IQHGIV---------EEVRKKTMIPENELQETIKEIHRLLC--HAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKK 3524
            + V++       L+DTGA   ++  + F ++RGQ +L+  + +L  A  + ++ +G   + + +G+ + ++  IV   +    + GI+  +  GF +             L+I    A     N   ++    G+ +   +R+++ L +     L  ++  KL  +++   N F A ++D+GRT ++KH IVT     NI+P R PL   + +E     + IQ L +N II   NSPW   ++ V KK+   +RLC+D+R+LN V+ R A+P+P IDE L+ L G+ +FS+IDL + Y+QVEL E ++EKTAF T +G + FN MPFG+  AP TFQ +M  VL  +K    +VYLDDI+++S + E+H   L +VL  +++AGL+ N  KC++  +E+R+LGHI+S++G++ DPS  + ++ +  P C+ +++ FLG+ +YYR+FI D+A  A+ L QL      K  +W+    S F+ L+ AL + PIL  PD    FILDTDAS   IGAVLSQRD+ G+E  +AY S ++T  E+ YC+TR+E+LAV  F   F+ YL  ++FTLRTDH ++ ++   K P   Q+  W   L   D  +E+R G  H NAD +SR  C  C +                        Q+  E+     K K    +   IR   RSN             IQ GI+            R + ++P+  ++  + + H+ L   H G+EK  + +++R+      S  ++ V SC  C          + P++   +   +E + +DI GPL  ++   +YIL  ID +SK+     I  Q+ +T+ + ++ ++I R+G P+ I  D G  FE+ + +     L I+   ++PYH   NGQ+ER  RT+  +++  + D     W E+LP+V     A+ Q +T  +P  +VFG++
Sbjct:  356 VPVQVRNRCIRMLVDTGAERTLLRSDEFARIRGQWKLSPCNVRLLSAEGTALDVMGSVSLPLQVGDRTFDMEVIVVNALQFAGLLGIDFLKQHGFVVDLARGTLNSPKQKLRIPLQSAGRTSGNGAWSVSV--GKPVSLSKRMLQQLVETTGVPLTASQRKKLHGMLSKFRNAFAASEFDIGRTSVLKHDIVTD----NIRPVRHPLRRLAPVERKEVSQLIQRLLDNKIIEPSNSPWAAGIVPVRKKDG-SLRLCVDYRKLNEVSRRDAYPIPRIDETLEALAGARYFSTIDLLSGYWQVELTEAAKEKTAFITHDGLFQFNVMPFGLTGAPATFQRLMEHVLAGLKWNTCLVYLDDIIVFSRTAEEHVEHLSQVLNRLQKAGLKPNASKCKLFCKEVRYLGHIVSEKGIEPDPSLTEKMRTYPVPTCLAEVKRFLGLASYYRKFIKDFAAIAKPLHQLTE--KRKPFQWTPECTSAFQKLRTALLSEPILRLPDFDASFILDTDASDTAIGAVLSQRDEHGREHPVAYASRTLTRAEQRYCVTRREMLAVITFTDQFRPYLQ-QKFTLRTDHGSLQWLRDFKNP-DGQWARWQQKLQQYDFDIEHRAGSRHANADTLSRIPCKQCGRSGTEVMGVPVNVVALENLEEMRTSQLDDEDIAPILKAKAAGVVGQEIRCGKRSNSKNLLMLNWHRLAIQKGILVRKWFCDDQSGYRWQVVVPKRMIKPVLDQAHQQLTAGHLGIEKTIERIRERFYWPGYRSDTKKYVASCYECNTRNEPVEKGRAPLQPVVATRRWEKLAIDILGPLVVSETGNRYILVAIDCFSKFAEAFPIPDQEAKTVTRVLVNEFICRYGVPEAIHTDQGSQFEAAIFQSMCTELGIRKTRTTPYHPSGNGQVERMNRTLGTMLSKVV-DENHRKWDEVLPKVMMAYRASIQSSTRMAPYTMVFGEQ 1294          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000041345.1 (pep primary_assembly:Astyanax_mexicanus-2.0:25:32334536:32339002:1 gene:ENSAMXG00000029230.1 transcript:ENSAMXT00000041345.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 421.779 bits (1083), Expect = 3.628e-121
Identity = 284/945 (30.05%), Postives = 475/945 (50.26%), Query Frame = 3
Query:  897 SITVRMNGENFDCLLDTGARINVMSVNCFNKL-RGQQLTKSD--DKLRCANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTP--DVIGGIELQETFGFRLLKIKDI-------------EASEKDKNY----ICNIEAKFGRKIKDEERLIRAL-EQLKINEDSK--LKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGE-PINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKG--AMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKC----DTCVQCQ--------------MAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEVRKKTMIPENELQETIKEI----HRLLCHAGVEKIADYMKDRYVGKHL--------------WSKIQEIV----RSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKI 3527
            ++ + + G+    ++DTG+  +++    ++++ R Q++ +S    K   AN  T    G  ++   +G    +  F V E       ++ G++     G  L   KD+             E      +Y    +C  +  +     +EE+ I  L E   I   +K  L ++++   +V       +GRT +V H I T    PI  KPY+  +  +  I+EAI+++   GI+R   SPW +P++ V KKE   +R C+D+R++N+ T   A+PMP + E+L+ L G+  FS++DL + Y+QV LE +S  KTAF T  G Y F  +PFGI  A  TFQ +M  VL  +KG    VY++DI++YSS+ E+H   L EV + + +AGL +N  KC +++  + FLGH+IS EG+ T+P K++AIQ F +P  +K+L+ FLG+  +Y RFI+ ++++A +L  L     N    W++     FE +K AL TAP+L  P+  + F + TDAS   +GAVLSQ   DG E V+AY S  +   E+ Y    KE LAV +    ++ YL GR FT+ TDH A++++    KP TS+   W   L + D  V+YRKG  +   D +SR            CQ              +A  +   G ++ +  ++  +E R +  H + +       +P  +L  T++ +    HR       E    Y  D  +  HL              W  I+  V    +SCEICQK K         ++     E   ++ VD+ GPL ++  + +++L I+D  SK+  +  + +     I + +++    R+G P  +  D G  F S ++    +   +    ++ YH QTN   ER  RT++ +I + ++D+ +  W + +PE  F +N+  Q++TG +PAE+  G+K+
Sbjct:  427 TLPLSVRGQQVTAMIDTGSTFSLIQEGTWDRMKRTQEVWRSGRGQKFILANGQTQTAKGVVELDCELGKCQAKRPFYVMENKDHVFSLVLGLDFLHDTGLILDFQKDVYILPGGDTVPFGGEGESPPFSYADMRLCIAQEDYVPLDYEEEQEINKLVEGADITSQAKQDLHKLLSQWPSVCTNQ---LGRTMIVLHHITTNDNLPIRQKPYKVSIEKQQLIKEAIEDMQRRGIVRPSTSPWASPVVLVPKKEG-GVRFCVDYRRMNSKTHLDAYPMPQVQEILESLHGAAIFSTLDLKSGYWQVGLEPDSIPKTAFITCQGLYEFTVLPFGIKNAAATFQRLMDSVLVNLKGKSCFVYINDIVVYSSTIEQHLGHLEEVFRCLHQAGLTLNLRKCNLLQRSLIFLGHVISGEGICTEPGKVEAIQAFPEPRSIKELQRFLGMAGWYHRFITHFSERAAILNALKK--KNAPWIWTQECQKAFEDIKQALITAPVLTPPNFSEPFQIQTDASDQGLGAVLSQ-GTDGLEHVVAYASRLLQGAERNYSTAEKECLAVVWAVEKWRVYLEGRHFTVITDHSALSWVFNHPKP-TSRLTRWAIRLQTFDFSVQYRKGKCNIVPDTLSRIPDRMTEGVMAPCQVTGSSDGLPVDWAEIARAQEVDGTLQPQRDETGNQETRKDRIHFVTKNDILFRAVPNQQLGHTLQVVVPVQHR-------EAFLQYAHDNPLSGHLGQMKTLLRLLNIAYWPSIRRDVWTYCKSCEICQKYKPRISKLSGRLQSTPVVEPGYMLGVDLMGPLPKSPRQNEHLLVIVDYCSKWVEMFPLREAKTSQIVQILIKDIFTRWGTPAYLVSDRGAQFTSRLLHATCRQWGVVQKLTTAYHPQTN-LTERVNRTLKTMIASYVKDKHRL-WDQWIPEFRFAINSAWQESTGFTPAEVALGRKL 1354          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000041754.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02001938.1:55959:59870:1 gene:ENSAMXG00000043385.1 transcript:ENSAMXT00000041754.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 414.461 bits (1064), Expect = 5.467e-120
Identity = 259/857 (30.22%), Postives = 436/857 (50.88%), Query Frame = 3
Query: 1278 ERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQ-PLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLG--KIKGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLE-----------WSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEK---GYCITRKELLAVYYFCIH-FKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSR---------TKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEV----------RKKTMIPENELQETIKEIHRLLC------------------------------------------------HAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFG--KKISREKWYGSKEIPIK-EELEE 3584
            E +    E L   +  + + +     N+F   + D+G T L+ H+I    E    +PYR+ P +  S ++  IQ L ++ +IR+ +SP+++P++ V KK+   +RLC+D+RQLNA T R ++P+P I+E LD L G+ +FS++DL + Y QV +EE+ + KTAF T  G + FNRMPFG+  APGTFQ +M ++ G  + +  ++YLDD++++S + E+H   L EV   +++  L++   KCQ  + ++ +LGH++S EGV TDP+KI+ ++ +  P+ + +LRSFLG  +YYRRF+  ++K A  L +L  G S  + +           W       F+ LK  LT+ P+L + D  K FI++ DAS   +GAVLSQ +++GK R IA+ S  +   E+    Y   + ELLAV +     F+ YL G +FT+ TD+  ++ + T K  + +  Q W + L+S +  ++YR G ++QNAD +SR         TK    V  +  HEE  +  ++ +    +   GRS    G+++            R +   P  E ++T+  + ++L                                                 H G E+    ++DR    ++   ++   + C+ C   KA+    +       +    EI+ +D    L      K+ +L + D ++KY    A   Q   T+   +++ W  RFG P  I  D G+ FES ++++  K+  I+   ++ Y  Q NGQ ER  RT+ DL+  TL   +K  W   LP++ F  N T  +TTG++P  ++FG   ++  +   G+ E+    E LEE
Sbjct:  255 EYVTSDFEGLTDLQAQRARALFQKYSNIFAKSEGDLGCTSLISHEIPLLDEVPVRQPYRRIPPSQYSTVKAHIQQLLDSRVIRESSSPFSSPIVLVTKKDG-SLRLCVDYRQLNAKTRRDSYPLPRIEESLDALCGAKWFSTLDLASGYNQVPVEEKDKSKTAFCTPFGLFEFNRMPFGLCNAPGTFQRLMERIFGDCRYQSVLLYLDDVIVFSQTVEEHLERLEEVFSRLQKQNLKVKLSKCQFFQHQVSYLGHVVSAEGVTTDPAKIEVVKEWKSPSHLAELRSFLGFASYYRRFVEGFSKLAAPLHRLVGGLSGPRRKGKTPKTSLAAFWDAECEQAFQSLKDRLTSTPVLAYADFNKPFIVEVDASHGGLGAVLSQ-EQEGKVRPIAFASRGLRPTERNMENYSSMKLELLAVKWAVTEKFREYLLGHQFTIYTDNNPLSHLQTAK--LGAVEQRWASQLASFNFTIKYRPGKHNQNADALSRQYLERFAVGTKVPPLVM-EAVHEE--RSALENQCRQVVAFPGRSPSDLGVLQRADPVIGPVWKFRSEGRRPRTEERDTLCNLSKVLIRQWDRLVEREGVLYRRAYPSGRGSEYFQLLLPQCLQKEVLHSVHDDHGHQGTERTLQLLRDRCFWPNMTQDVERWCQQCQRCTLGKAVQPKVRAFQGTLQAAHPNEILAIDFTI-LEPASDGKENVLILTDIFTKYTQAIATKDQRASTVAWALVQHWFHRFGPPVRIHSDQGRNFESLLIKQLCKVYSIQKSRTTSYRPQGNGQCERFNRTLHDLLR-TLPVEEKRHWPRHLPQLTFAYNTTPHQTTGQTPHFLMFGYHPRLPVDFLLGTGEVTASTEPLEE 1102          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000037150.1 (pep primary_assembly:Astyanax_mexicanus-2.0:10:22265216:22268869:1 gene:ENSAMXG00000033912.1 transcript:ENSAMXT00000037150.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 403.675 bits (1036), Expect = 6.609e-117
Identity = 255/833 (30.61%), Postives = 419/833 (50.30%), Query Frame = 3
Query: 1269 KDEERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQ-PLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLG--KIKGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSG-----------PSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEK---GYCITRKELLAVYYFCIH-FKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCV------QCQMAHEEAKKGKIKT-RLLDSIREEGRSNIQHGIVEEV----------RKKTMIPENELQETI--------KEIHRLLCHAGV----------------------------------------EKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFG 3518
            K    L+  +E L   + + +  +     ++F   + D+G T L+ H+I    E    +PYR+ P +    ++  IQ L ++ +IR   SP+++P++ V KK+   +RLC+D+RQLNA T R A+P+P I+E LD L G+ +FS++DL + Y QV + E+ + KTAF T  G + FNRMPFG+  AP TFQ +M ++ G  + +  ++YLDD++++SSS E+H   L EV   +++ GL++   KC   + ++ +LGH++S+EGV TDP+KI A++ + +P+ + +LRSFLG  +YYRRF+  ++K A  L QL              P      W E     F+ LK  LT+AP+L + D  K FI++ DAS   +GAVLSQ +++GK R IA+ S  +   E+    Y   + ELLAV +     ++ YL G   T+ TD+  ++ + T K   T   Q W + L+S + +++YR G ++QNAD +SR   D           +M   E++    KT +  + +   GRS +   +++E           RK+   P  E +E +        ++  RLL   GV                                        E+    ++ R     +   ++   + C  C  +KA+    +       +    EI+ +D       + GR+  +L + D ++KY        Q   T+ + +++ W  RFG P  I  D G+ FES ++++   M  I+   ++ Y  Q NGQ ER  RT+ DL+  TL   +K  W   L  + F  N T  +TTGK+P  ++FG
Sbjct:  268 KSPHGLLSDIEGLSEEQMAAVSLLFDQYQDIFAQTEGDLGCTTLLTHEIPLLDEVPVRQPYRRIPPSQYEAVKLHIQQLLDSKVIRNSASPYSSPIVLVTKKDG-SLRLCVDYRQLNAKTRRDAYPLPRIEESLDALAGAKWFSTLDLASGYNQVPVSEKDRYKTAFCTPFGLFEFNRMPFGLCNAPATFQRLMERMFGDCRYQSVLLYLDDVIVFSSSVEQHLERLAEVFSRLQKQGLKVKLSKCHFFQPQVNYLGHVVSREGVATDPAKIDAVRGWRRPSHLAELRSFLGFASYYRRFVEGFSKLAAPLHQLVGKLGGARRKGKTLPVPLAASWDERCEKAFQSLKERLTSAPVLAYADFSKPFIVEVDASHGGLGAVLSQ-EQEGKVRPIAFASRGLRPAERNMDNYSSMKLELLAVKWAVTEKYREYLLGNEVTILTDNNPLSHLQTAKLGATE--QRWASQLASFNFKIKYRPGKSNQNADALSRQYVDRFAIGTKVPPLRMELLESEPMVHKTGQCTEMVALAGRSALDLHLLQEADPVIGPVCKFRKEGRYPRAEEREALSSPTKALFRQWDRLLEKDGVLYRAVQPSGGGPETCQLLLPKHLQEEVLSSVHDDHGHQGVERTLKQLQSRCFWPGMAKHVERWCQQCRRCVLSKAVQPKIRAFQGTLQATRPHEILAIDFTLLEPASDGREN-VLVLTDVFTKYTQAIPTRDQRASTVAQVLVQHWFHRFGLPSRIHSDQGRNFESMLIQQLCNMYGIQKSRATAYRPQGNGQCERFNRTLHDLLR-TLPQSEKRRWPHYLSPMVFAYNTTPHQTTGKTPYYLMFG 1094          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000042253.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02000503.1:15658:18693:-1 gene:ENSAMXG00000039114.1 transcript:ENSAMXT00000042253.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 397.127 bits (1019), Expect = 2.113e-116
Identity = 240/822 (29.20%), Postives = 403/822 (49.03%), Query Frame = 3
Query: 1275 EERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVT---RGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGA--MVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSR-----TKCDTCVQ----------CQMAHEEAKKGKIKTRL-LDSIREEGRSN---------------------------------------IQHGIVEEVRK---------KTMIPENELQETIKEIHRL--LCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKI 3527
            EE L R+   L   +  +L+++I      F   + +  RT L  H I T   +         R         E+ ++++   G+I   +SPW+ P++   KK+  + R C+D+R+LNAVT+  ++P+P ID+ LD L GS +FSS+DL + Y+QV L E  +EKTAFS  +  + F  +PFG+  +P TF+ +M +VL  I  +  +VYLDD++ + +  +   + L  VL AI+ A L++NP KC ++++ + FLGH++S  GV+TDP K +A++++  P   K +RSF+G+ +YYRRFI  +         L          WS+     F  LK  L  AP+L + ++++  +    AS   +GAVLSQ  +DG ERVIAY S  +   E+ YC+T +ELLAV     HF+ Y+YG  F LRTDH ++ ++M  ++P   Q   WI+ +      V +R G +H NAD +SR       C  C +          C     +   G +  ++ ++ IR+  R++                                       +  G++  V +         + +IP       ++ +H      H GV K    ++ R+      + ++  V  C++C   K      + P+         E + VD+ GP   T    +Y+L  +D ++K+    A+  Q   T    ++ ++  RFG P+E+  D G+ FES +M E  ++L I    ++P H Q++G +ER F        A +  + + DW + LP V     +  Q+TTG +PA ++FG+++
Sbjct:   85 EEMLERSCVGLSEPQQLRLRDLIERFRASFAVSERECTRTSLAFHSINTGDAQTSGWQTAAARLAFAKRVAAEQLVRDMASAGVIEPSSSPWSAPVVLA-KKKDGNWRFCVDYRRLNAVTKLDSYPLPRIDDTLDQLSGSAWFSSLDLRSGYWQVPLAEGDREKTAFSLGSALWHFTVLPFGLCNSPATFERLMERVLSGIPRSCCVVYLDDVMAHGTDFDSALSHLEVVLGAIQAANLKLNPAKCNLLRQRVNFLGHVVSGAGVETDPKKTEAVRDWPVPRNAKMVRSFVGLASYYRRFIRGFRDVGGNASHLTR--PGVTFRWSDEAERAFGELKSRLCNAPVLAYRNVREASLWIRIASDRGLGAVLSQV-QDGSERVIAYYSRRLDKAERNYCVTCRELLAVVEGLKHFRPYVYGVPFLLRTDHASLQWLMRFREP-EGQLARWISRIQEFSFEVVHRPGRSHGNADALSRRPCVAVDCKHCARAEEKSAETAHCAAVSADVTGGVVVAQVSVEQIRDAQRADRDLMWAVHALEANVTPSWEEVVPLGPVAKALRSNWASFSLSDGVLCHVWEDPANGQRVFRVVIPRALRDSVLRGVHGSPGAGHFGVTKTLKRLRQRFYWPGCRTDVELFVHCCDVCAAKKGPARAPRAPLHPYQCGGPMERVAVDVLGPFPVTDSGNRYVLVAMDYFTKWPEAYAVPDQGAVTTADVLVREFFCRFGVPEELHSDQGRNFESEVMAEVCRILGIHKTRTTPLHPQSDGLVER-FNCTLAAQLAMVSSKGQRDWDKHLPVVLLACRSAVQETTGFTPALLMFGREL 900          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Match: ENSAMXT00000048272.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02000106.1:219951:223307:1 gene:ENSAMXG00000032559.1 transcript:ENSAMXT00000048272.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 380.563 bits (976), Expect = 6.606e-110
Identity = 186/487 (38.19%), Postives = 293/487 (60.16%), Query Frame = 3
Query: 1305 LKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKI-VTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKG--AMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTR 2756
            L  N+  +L ++++  G+VF     D+GRT LV+H I +  G P+  +P R     + + +  IQ   E+G+    NS W +P++ V KK++   RLC+D+R LNA T + A+P+P I + L+ L  + +FS++DL + Y+QV L   +++  AF ++ G + +N MPFG+  AP TFQ +M +VL  ++    +VYLDDI++      +    L +V     +A L++ P KC + + E+ +LGHI+S +G+ TDP K++ +Q + +P CV ++R F+G+  YYRRF+ D+A  A+ L +L     + + +W+    + FE LK +LT+ P+L +P      ILDTDAS   IGAVLSQ  +DG ERV+AYGS  ++S E+ YC TR+ELLAV  F  HF+ YL GR F +R+DH ++ +++  K+P   Q   W+  L+  D +V +R G +HQNAD++SR  C T   C M        K+  R
Sbjct:   35 LSANDRLELAQLLSTYGDVFSTGPTDLGRTSLVQHDIQLLPGPPVKQQPRRMAFEKQIESDAQIQQSLESGLASPSNSSWASPIVLVRKKDQT-YRLCVDYRALNARTVKDAYPLPRIQDTLETLSTAKWFSTLDLASGYWQVALTPRARKAVAFCSRKGLFTWNVMPFGLCNAPATFQRLMDRVLAGLQWETCLVYLDDIILLGKDVPEILQRLAQVFDRFRQANLKLKPTKCCLFRREVSYLGHIVSAQGIATDPEKVRKVQQWPQPTCVSEVRQFVGLAAYYRRFVQDFATIAKPLHELTK--KHVRFQWTPECQTAFEELKSSLTSTPVLGYPRDHGNLILDTDASNFGIGAVLSQV-QDGAERVLAYGSRRLSSTEQNYCTTRRELLAVVEFTRHFRQYLLGRPFIVRSDHSSLRWLVNMKEP-EGQLARWLEKLAEYDFQVVHRPGHHHQNADVMSRRPCRTTCPCNMTDPADVACKVTHR 516          

HSP 2 Score: 106.686 bits (265), Expect = 1.800e-22
Identity = 61/241 (25.31%), Postives = 119/241 (49.38%), Query Frame = 3
Query: 2814 EVRKKTMIPENELQETIKEIH--RLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKIS 3530
            E   + ++P    Q+  +++H   +  H GVE+    ++ RY    +   I    R+C  C        T + P+         E I +D+ GP+ ET    +YIL   D ++K+    A+      T+ + +  +W+ R+G P+ +  D G  FES + ++  ++L ++   ++P+  Q++GQ+ER   T++ ++ AT  +R   DW  + P       ATK  +TG +P  ++FG++++
Sbjct:  641 EFHPQMILPRVFRQDVQQQMHDGPVGGHFGVERTLARVQTRYYWYQMREDITLWCRTCTSCAAKARPPKTPQAPMGTVRVGAPMERIAIDLMGPMNETDRHNRYILVAQDYFTKWVEAYALPNDQAVTVAEVLTSEWVCRYGAPQTLHSDQGSNFESEVFQKMCELLGVEKTRTTPFRPQSDGQVERFNATLQKIL-ATTAERCHWDWDLMTPFAVMAYRATKHSSTGLTPNMMLFGRELT 880          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000009777.1 (pep scaffold:Pmarinus_7.0:GL476990:135790:139231:-1 gene:ENSPMAG00000008837.1 transcript:ENSPMAT00000009777.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 68.9366 bits (167), Expect = 2.175e-12
Identity = 54/237 (22.78%), Postives = 110/237 (46.41%), Query Frame = 3
Query: 2820 RKKTMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHH-QTNGQIERQFRTIRDLIN-ATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKK 3524
            RK+ ++   E + +I +      H G  +    +++ Y  K + + I++ + +CEIC K K+   ++   VK   +   +E++ +D+ GP   T    +++L I+D ++K+  LT +T Q    +   +      RFG PK++  +  + + + +  E  +     +C      H   NG      + +RD ++ AT + R   DW   L +  F  + +K   T  SP  ++ G++
Sbjct:   59 RKRLVVMNEEDKRSILQRVHGADHCGQTRTRKLLEEHYYWKGMVNDIRDYINACEIC-KQKSYKRSSISHVKLLKASYPWEVLGMDLLGPFPATSRAHRFVLLIVDYFTKWAELTPMTDQSAAHVVAALTTA-FHRFGFPKKLFCNVSEEYVAQINEEMFR--HFPMCSGLAISHLWANGAHRGTSQALRDCVSKATGRHR---DWELQLEQRLFEYHTSKHSATRYSPFYLMLGRE 288          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000004121.1 (pep scaffold:Pmarinus_7.0:GL477387:93825:107330:1 gene:ENSPMAG00000003764.1 transcript:ENSPMAT00000004121.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 65.855 bits (159), Expect = 2.336e-11
Identity = 50/243 (20.58%), Postives = 110/243 (45.27%), Query Frame = 3
Query: 2814 EVRKKTMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQF-------RTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK 3521
            E R+  ++ E+E +  +  +H    H G +K    ++  Y    + S ++ ++ SC +C+   +  +     +K   +   +E++ +D+ GPL  T    +Y+L +ID +SK+     + ++ +E +  + L     R+G P+++    GK F + + +  T     + C  SP   Q +  +  +        + ++  +N  +  +  +DW   + +  F     K  TT  SP  ++FG+
Sbjct:   66 ERRRMVVMDEDEKRNILMSVHGA-GHFGQKKTILKLEADYYWLGMISDVKNLIASCGVCRNKGSRRVAMPS-MKLLKASGPWEVLGLDVLGPLPVTSRANRYLLLLIDYFSKWAEAVPLIEKSQEHV-ASALTVVFCRYGFPRKVFSSLGKEFVTQVNKSSTLSRAYQRC--SPSQTQVHAHVSVRVALNKATNQALKGCVN-LVASQHPSDWESRVEQSLFEYRVGKHSTTQYSPFYLMFGR 302          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000004123.1 (pep scaffold:Pmarinus_7.0:GL477387:93825:107321:1 gene:ENSPMAG00000003764.1 transcript:ENSPMAT00000004123.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 64.3142 bits (155), Expect = 7.235e-11
Identity = 52/248 (20.97%), Postives = 110/248 (44.35%), Query Frame = 3
Query: 2814 EVRKKTMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFS------------SPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK 3521
            E R+  ++ E+E +  +  +H    H G +K    ++  Y    + S ++ ++ SC +C+   +  +     +K   +   +E++ +D+ GPL  T    +Y+L +ID +SK+     + ++ +E +  + L     R+G P+++    GK F    + +  K+   K C S            + +H  TN       + ++  +N  +  +  +DW   + +  F     K  TT  SP  ++FG+
Sbjct:   66 ERRRMVVMDEDEKRNILMSVHGA-GHFGQKKTILKLEADYYWLGMISDVKNLIASCGVCRNKGSRRVAMPS-MKLLKASGPWEVLGLDVLGPLPVTSRANRYLLLLIDYFSKWAEAVPLIEKSQEHV-ASALTVVFCRYGFPRKVFSSLGKEF----VTQVNKIQHKKWCISHTLQNLCETRGPAGWHKATN-------QALKGCVN-LVASQHPSDWESRVEQSLFEYRVGKHSTTQYSPFYLMFGR 298          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000010393.1 (pep scaffold:Pmarinus_7.0:GL485791:8073:10868:-1 gene:ENSPMAG00000009411.1 transcript:ENSPMAT00000010393.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 49.2914 bits (116), Expect = 3.370e-7
Identity = 28/98 (28.57%), Postives = 47/98 (47.96%), Query Frame = 1
Query:  205 KSYARVVYQEPQGRVQYKKKEQREESIRREMPECWLCHKIGHTKIDCP-----IKGKIECWTCHRSGHISRNCPDKKAP----RCFGCGKEGHIRRLC 471
            KS+    Y+  +G    ++    ++S+      C+ C K GH   +CP       G   C+TC + GH++R C          +C+GCG+ GH++R C
Sbjct:    4 KSFTDGCYRCGEGGHIARECPLPQDSVSSNTAACYNCGKGGHIARECPEGRQDRGGGPSCYTCGKQGHLARECSSGGGGPGDNKCYGCGQRGHMQRDC 101          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Yeast
Match: GIS2 (Translational activator for mRNAs with internal ribosome entry sites; associates with polysomes and binds to a specific subset of mRNAs; localizes to RNA processing bodies (P bodies) and to stress granules; may have a role in translation regulation under stress conditions; ortholog of human ZNF9/CNBP, a gene involved in myotonic dystrophy type 2 [Source:SGD;Acc:S000005199])

HSP 1 Score: 52.7582 bits (125), Expect = 3.845e-8
Identity = 28/77 (36.36%), Postives = 43/77 (55.84%), Query Frame = 1
Query:  304 CWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCPDKKA---PRCFGCGKEGHIRRLCQEIRCERCSRNGHRSEEC 525
            C++C KIGH   DC    +  C+ C++ GH+  +C   +     +C+ CG+ GH+R  C   RC  C++ GH S EC
Sbjct:    6 CYVCGKIGHLAEDC--DSERLCYNCNKPGHVQTDCTMPRTVEFKQCYNCGETGHVRSECTVQRCFNCNQTGHISREC 80          

HSP 2 Score: 49.2914 bits (116), Expect = 6.159e-7
Identity = 32/109 (29.36%), Postives = 51/109 (46.79%), Query Frame = 1
Query:  292 EMPECWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCPDKK--------------AP----------------RCFGCGKEGHIRRLCQEIR-CERCSRNGHRSEEC 525
            E  +C+ C + GH + +C ++    C+ C+++GHISR CP+ K               P                +C+ CG+ GH+ R CQ  R C  C+  GH S++C
Sbjct:   45 EFKQCYNCGETGHVRSECTVQ---RCFNCNQTGHISRECPEPKKTSRFSKVSCYKCGGPNHMAKDCMKEDGISGLKCYTCGQAGHMSRDCQNDRLCYNCNETGHISKDC 150          

HSP 3 Score: 46.595 bits (109), Expect = 4.791e-6
Identity = 27/86 (31.40%), Postives = 41/86 (47.67%), Query Frame = 1
Query:  304 CWLCHKIGHTKIDCPIKGKIE---CWTCHRSGHISRNCPDKKAPRCFGCGKEGHIRRLCQE---------IRCERCSRNGHRSEEC 525
            C+ C+K GH + DC +   +E   C+ C  +GH+   C      RCF C + GHI R C E         + C +C    H +++C
Sbjct:   25 CYNCNKPGHVQTDCTMPRTVEFKQCYNCGETGHVRSEC---TVQRCFNCNQTGHISRECPEPKKTSRFSKVSCYKCGGPNHMAKDC 107          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Yeast
Match: AIR2 (RNA-binding subunit of the TRAMP nuclear RNA surveillance complex; involved in nuclear RNA processing and degradation; involved in TRAMP complex assembly as a bridge between Mtr4p and Trf4p; stimulates the poly(A) polymerase activity of Pap2p in vitro; has 5 zinc knuckle motifs; AIR2 has a paralog, AIR1, that arose from the whole genome duplication; Air2p and Air1p have nonredundant roles in regulation of substrate specificity of the exosome [Source:SGD;Acc:S000002334])

HSP 1 Score: 48.521 bits (114), Expect = 7.880e-6
Identity = 42/129 (32.56%), Postives = 62/129 (48.06%), Query Frame = 1
Query:  157 TLMDK-VEKQREQINSMKSYARVVYQEPQGRVQYKKKEQREESIRREMPECWLCHKIGHTKIDCPIKGKIECWTCHRS-GHISRNCPDKKAPRCFGCGKEGHIRRLC----QEIRCERCSRNGHRSEEC 525
            T  DK V    E++NS  +  R +    QGR  +   +  +++I+   P+C  C + GH K DCP    I C  C  +  H SR+CP  KA +C  C + GH R  C    ++++C  C    H  E C
Sbjct:   16 TPPDKLVAPSIEEVNSNPNELRAL--RGQGRY-FGVSDDDKDAIKEAAPKCNNCSQRGHLKKDCP---HIICSYCGATDDHYSRHCP--KAIQCSKCDEVGHYRSQCPHKWKKVQCTLCKSKKHSKERC 136          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Match: EDO33875 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7SR01])

HSP 1 Score: 57.7658 bits (138), Expect = 4.243e-10
Identity = 27/86 (31.40%), Postives = 52/86 (60.47%), Query Frame = 3
Query: 1890 DDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNY 2147
            DD++ + SS E+H   +  +L+A+  +G + + +K Q     ++FLGH+I + GV+  P K+  I+ +  P   ++LR F+G+C +
Sbjct:    1 DDVICFHSSFEEHLRGIERMLQAVRASGFK-SIKKSQFATRSVKFLGHVIDQNGVRPQPEKLD-IRQWETPTNEEELRKFIGVCTF 84          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Match: EDO34570 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7SP17])

HSP 1 Score: 50.447 bits (119), Expect = 1.558e-7
Identity = 27/75 (36.00%), Postives = 36/75 (48.00%), Query Frame = 1
Query:  304 CWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCPDKKAPRCFGCGKEGHIRRLCQ-EIRCERCSRNGHRSEEC 525
            C  CH++GH    CP++G+  C+ C  +GH+   CP    P C  C + GH    C    RC RC   GH    C
Sbjct:   19 CGYCHQVGHPISTCPVRGR--CFRCGAAGHVVARCPAPAVP-CGYCHQVGHPISTCPVRGRCFRCGAAGHVVARC 90          

HSP 2 Score: 46.2098 bits (108), Expect = 5.941e-6
Identity = 27/81 (33.33%), Postives = 35/81 (43.21%), Query Frame = 1
Query:  304 CWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCPDKKAPRCFGCGKEGHIRRLC--QEIRCERCSRNGHRSEECYTKMR 540
            C+ C   GH    CP    + C  CH+ GH    CP +   RCF CG  GH+   C    + C  C + GH    C  + R
Sbjct:    1 CFRCGAAGHVVARCP---ALACGYCHQVGHPISTCPVRG--RCFRCGAAGHVVARCPAPAVPCGYCHQVGHPISTCPVRGR 76          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Match: EDO42408 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7S1Q3])

HSP 1 Score: 51.6026 bits (122), Expect = 9.225e-7
Identity = 24/72 (33.33%), Postives = 45/72 (62.50%), Query Frame = 3
Query:  939 LDTGARINVMSVNCFNKLRGQQLTKSDDKLRCANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGG 1154
            LDTGA  + +S+N +NKL  +Q  +S  +LR  N+STI+ +G+ K+  T   ++++V F + ++    ++ G
Sbjct:  115 LDTGATCSTLSLNDYNKLTKKQPEQSQTELRTYNQSTIKPMGQVKLHCTANGITRKVHFQIIKEAPTSLLSG 186          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Match: EDO25785 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A8DVH4])

HSP 1 Score: 45.0542 bits (105), Expect = 8.999e-6
Identity = 29/75 (38.67%), Postives = 41/75 (54.67%), Query Frame = 1
Query:  250 QYKKKEQREESIRREMPECWLCHKIGHTKIDCPIKGK-IECWTCHRSGHISRNCPDKKAPRCFGCGKEGHIRRLC 471
            +Y K+E+R   IR     C  C++ GH  +DCP   K I+C  C   GH  R+CP++    CF C + GH  R+C
Sbjct:    2 RYYKEEKRSMYIR-----CHNCNERGHMAVDCPDPKKVIKCCLCGGQGHYKRSCPNE---LCFNCDQPGHQSRVC 68          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000032171.1 (pep primary_assembly:ASM223467v1:18:2114932:2117958:1 gene:ENSORLG00000025397.1 transcript:ENSORLT00000032171.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 416.001 bits (1068), Expect = 5.676e-123
Identity = 248/830 (29.88%), Postives = 426/830 (51.33%), Query Frame = 3
Query: 1296 LEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKI-VTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLG--KIKGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCS----------GPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEK---GYCITRKELLAVYYFCIH-FKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTC--------------------VQCQMAH---------------EEAKKGKIKTRLLDSI-------------REEGRSNIQHGIVEE----------VRKKTMIPENE------------LQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKK 3524
            LE L   +  K + ++    NVF A + D+G T L+ H+I +    P+  +  R P +    +++ I  L  +G+IR+ NSP+ +P++   KK+   +R+C+D+RQLN+ T R AFP+P IDE LD L G+ +F+++DL + Y QV + E  + KTAF T  G + +NRMPFG+  APGTFQ +M ++ G  + +  ++YLDDI+++SS+ ++H   L  VL  +++  LR+   KC   ++E+R+LGH+IS +GV TDP KI+A+  +  P+ V +LR+FLG  +YYRRF+  +A+ A  L +L              S+ +  W+      F+ LK  LT+AP+L + D    F L+ DAS + +GAV  Q ++ G  R IAY S  + + E+    Y   + E LA+ +     F+ YL G    + TD+  ++++ T K  +    Q W+  L++ D  ++YR G  ++NAD +SR    +                     VQ  +AH                E +  ++   +L S+               EG S +   ++ +          V ++ ++P++             + E ++++H+   H G+E+    ++ R     +   +    ++CE CQ  K  + +   P+    +    E++ +D    L  ++   + +L + D +SKY V      Q   T+ K ++ +W  +FG P  +  D G++FES ++++   +  I+   ++PYH   NGQ ER  RT+ +L+ A LQ  +K DW   LP+V F  N T  +TTG+SP   +FG++
Sbjct:  175 LETLPAEDRGKAQALLQKYANVFAAHEGDLGCTTLMTHEIPLLDDAPVRQRHRRIPPSEYEAVKDHINQLLASGVIRESNSPYASPIVLARKKDGS-LRMCVDYRQLNSKTRRDAFPLPRIDESLDALSGARWFTTLDLASGYNQVPVTEGDRAKTAFCTPFGLFEWNRMPFGLCNAPGTFQRLMQRIFGDQQCQSVLLYLDDIVVFSSTIDEHLERLELVLGRLQQEKLRVKLPKCAFFQQEVRYLGHVISDQGVSTDPHKIEAVAGWQPPSTVSELRTFLGFASYYRRFVEGFARLAAPLHRLVGELDGTKSRRRKASSLQGHWTTECQQNFDALKQKLTSAPVLAYADFTLPFFLEVDASHNGLGAVFPQ-EQGGSVRPIAYASRGLKATERNMQNYSSMKLEFLALKWAMTEKFREYLLGHHCVVFTDNNPLSYLSTAK--LGEMEQRWVAQLAAFDYEIKYRSGRVNRNADALSRHPNHSSAEVGNMAPGSSLPRDLQQVQVQPVLAHCEMEQLMPVFPQRTTAEVQDLQVSDPVLASVLPFWRDQRYPNYSEREGLSKVALTLLRQWKNLAEVDGLVYRRVLMPDSGQEVFQLLLPEILIPEVLEQVHQHHGHQGIERTLALLRARCYWPGMSKDVAHWCQACERCQLAKDNSRSHSAPLGHLIASRPNELVAMDFTI-LEPSRTGVENVLVLTDVFSKYTVAIPTRDQRAATVAKVLVAEWFSKFGVPARLHSDQGRSFESQLIQQLCGLYGIEKSRTTPYHPAGNGQCERFNRTLHNLLRA-LQVTRKRDWHSCLPQVTFCYNTTPHQTTGESPFFFMFGQQ 998          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000029819.1 (pep primary_assembly:ASM223467v1:24:6358775:6362914:-1 gene:ENSORLG00000022361.1 transcript:ENSORLT00000029819.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 424.476 bits (1090), Expect = 7.835e-123
Identity = 262/884 (29.64%), Postives = 449/884 (50.79%), Query Frame = 3
Query: 1296 LEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKI-VTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGA--MVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKL----------EWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKG---YCITRKELLAVYYFCIH-FKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKG----------------------------------------------------INHQNADMVSRTKCDTCV---------QCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIV---------EEVRKKTMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK--KISREKWYGSKEIPIKEELEE-----QTRRKF---NVGEEVLVKVETRHKGQD 3656
            L  L   E  K++ ++     VF   + D+G T L+ H+I +    P+  +  R P +     +E I NL ++ +IR+ +SP+ +P++ + KK+   +RLC+D+RQLN+ T + AFP+P I+E LD L G+ +FS++DL + Y+QV + E  + KTAF T  G + +NRMPFG+  AP TFQ +M ++ G+ +G   ++YLDDI+++SS+ E+H   L  VL+ ++  GL++   KC   + ++ +LGH+IS +GV TDP K++A+ N+  P  V +LRSF+G  +YYRRF+  +AK A  L +L +     K+           W+E     FE LK  LTT P+L + D  + FIL+ DAS   +GAVLSQ +++GK R IAY S  +   E+    Y   R E LA+ +     F+ YL G++  + TD+  ++++ T K  + +  Q W   L++ D+ + YR G                                                     +H  +DM S    D  +         Q + + EE ++  + +  L  +R+  R   Q G++          E   + ++P +  +E +  +H+   H GV++  D ++ R     + +++ E    CE CQ  K      + P+    + +  EI+ +D    L  T    + +L I D +SKY +  A   Q   T+ + ++ +W  +FG P  I  D G++FES ++++   +  ++   ++PYH + NGQ ER  RT+ DL+  TL   +K DW   LP++ ++ N T   +TG+SP  ++FG+  ++  +   G  + P++  ++E     + R K       E++L   + R +G D
Sbjct:  301 LSALSAGEQEKVRALLGKYLTVFSLHEGDLGCTSLITHEIPLVDDAPVRQRYRRIPPSDYVAAKEHINNLLQSQVIRESSSPFASPIVLIRKKDG-GLRLCVDYRQLNSRTRKDAFPLPRIEESLDALSGARWFSTLDLASGYHQVAVAEADRPKTAFCTPFGLFEWNRMPFGLCNAPSTFQRLMQRMFGEQQGQSLLLYLDDIIVFSSTIEQHLERLELVLERLQLEGLKVKLAKCAFFQHQVHYLGHVISDQGVSTDPGKVEAVANWEPPTTVFQLRSFIGFASYYRRFVEGFAKLAAPLHRLVAELEGNKVRKKSARGLTNHWTEECQRSFEALKAKLTTTPVLAYADFSRPFILEVDASNGGLGAVLSQ-EQEGKVRPIAYASRGLRPTERNPVNYSSMRLEFLALKWAVAEKFREYLLGQKCIVYTDNNPLSYLSTAK--LGAMEQRWAAQLAAFDLEIRYRSGRSNRNADALSRQHFPDMQAWRDVLPGSCLPMSLQQVQQTETVGTTQATMVALPHHSPSDMASLQGADPVLKEFLPFWERQTRPSPEERRQ--LSSPTLALLRQWNRLVEQGGVLYRRVFRDDGGEAVLQILLPGSIREEVLTAVHQQHGHQGVDRTLDLLRQRCYWPGMSAEVAEWCSQCERCQVAKVTRPAARAPMGHLLASKPNEILAMDFSV-LEPTTSGIENVLVITDIFSKYTMAVATRDQRAATVAQVLVTEWFSKFGVPARIHSDQGRSFESALIQQLCDLYAVEKSRTTPYHPEGNGQCERFNRTLHDLLR-TLPVSRKRDWNVCLPQLLYSYNTTPHHSTGESPFFLMFGQEPRLPVDFLLGRVQEPVEGTVQEWVQEHKARLKLAFEGTREKLLAAADRRKRGHD 1176          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000039099.1 (pep primary_assembly:ASM223467v1:16:31928608:31932747:-1 gene:ENSORLG00000023909.1 transcript:ENSORLT00000039099.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 424.476 bits (1090), Expect = 7.909e-123
Identity = 262/884 (29.64%), Postives = 449/884 (50.79%), Query Frame = 3
Query: 1296 LEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKI-VTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGA--MVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKL----------EWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKG---YCITRKELLAVYYFCIH-FKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKG----------------------------------------------------INHQNADMVSRTKCDTCV---------QCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIV---------EEVRKKTMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK--KISREKWYGSKEIPIKEELEE-----QTRRKF---NVGEEVLVKVETRHKGQD 3656
            L  L   E  K++ ++     VF   + D+G T L+ H+I +    P+  +  R P +     +E I NL ++ +IR+ +SP+ +P++ + KK+   +RLC+D+RQLN+ T + AFP+P I+E LD L G+ +FS++DL + Y+QV + E  + KTAF T  G + +NRMPFG+  AP TFQ +M ++ G+ +G   ++YLDDI+++SS+ E+H   L  VL+ ++  GL++   KC   + ++ +LGH+IS +GV TDP K++A+ N+  P  V +LRSF+G  +YYRRF+  +AK A  L +L +     K+           W+E     FE LK  LTT P+L + D  + FIL+ DAS   +GAVLSQ +++GK R IAY S  +   E+    Y   R E LA+ +     F+ YL G++  + TD+  ++++ T K  + +  Q W   L++ D+ + YR G                                                     +H  +DM S    D  +         Q + + EE ++  + +  L  +R+  R   Q G++          E   + ++P +  +E +  +H+   H GV++  D ++ R     + +++ E    CE CQ  K      + P+    + +  EI+ +D    L  T    + +L I D +SKY +  A   Q   T+ + ++ +W  +FG P  I  D G++FES ++++   +  ++   ++PYH + NGQ ER  RT+ DL+  TL   +K DW   LP++ ++ N T   +TG+SP  ++FG+  ++  +   G  + P++  ++E     + R K       E++L   + R +G D
Sbjct:  301 LSALSAGEQEKVRTLLGKYLTVFSLHEGDLGCTSLITHEIPLVDDAPVRQRYRRIPPSDYVAAKEHINNLLQSQVIRESSSPFASPIVLIRKKDG-GLRLCVDYRQLNSRTRKDAFPLPRIEESLDALSGARWFSTLDLASGYHQVAVAEADRPKTAFCTPFGLFEWNRMPFGLCNAPSTFQRLMQRMFGEQQGQSLLLYLDDIIVFSSTIEQHLERLELVLERLQLEGLKVKLAKCAFFQHQVHYLGHVISDQGVSTDPGKVEAVANWEPPTTVFQLRSFIGFASYYRRFVEGFAKLAAPLHRLVAELEGNKVRKKSARGLTNHWTEECQRSFEALKAKLTTTPVLAYADFSRPFILEVDASNGGLGAVLSQ-EQEGKVRPIAYASRGLRPTERNPVNYSSMRLEFLALKWAVAEKFREYLLGQKCIVYTDNNPLSYLSTAK--LGAMEQRWAAQLAAFDLEIRYRSGRSNRNADALSRQHFPDMQAWRDVLPGSCLPMSLQQVQQTETVGTTQATMVALPHHSPSDMASLQGADPVLKEFLPFWERQTRPSPEERRQ--LSSPTLALLRQWNRLVEQGGVLYRRVFRDDGGEAVLQILLPGSIREEVLTAVHQQHGHQGVDRTLDLLRQRCYWPGMSAEVAEWCSQCERCQVAKVTRPAARAPMGHLLASKPNEILAMDFSV-LEPTTSGIENVLVITDIFSKYTMAVATRDQRAATVAQVLVTEWFSKFGVPARIHSDQGRSFESALIQQLCDLYAVEKSRTTPYHPEGNGQCERFNRTLHDLLR-TLPVSRKRDWNVCLPQLLYSYNTTPHHSTGESPFFLMFGQEPRLPVDFLLGRVQEPVEGTVQEWVQEHKARLKLAFEGTREKLLAAADRRKRGHD 1176          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000041674.1 (pep primary_assembly:ASM223467v1:5:30088905:30092924:1 gene:ENSORLG00000022054.1 transcript:ENSORLT00000041674.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 422.935 bits (1086), Expect = 1.331e-122
Identity = 266/907 (29.33%), Postives = 460/907 (50.72%), Query Frame = 3
Query: 1296 LEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKI-VTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLG--KIKGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCS----------GPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEK---GYCITRKELLAVYYFCIH-FKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTC--------------------VQCQMAH---------------EEAKKGKIKTRLLDSI-------------REEGRSNIQHGIVEE----------VRKKTMIPENE------------LQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK--KISREKWYGSKEIPIK-------EELEEQTRRKFNVGEEVLVKVETRHK-GQDRY--EGPYKVIEKVHDRRYILRN 3719
            LE L   +  K + ++    NVF A + D+G T L+ H+I +    P+  +  R P +    +++ I  L  +G+IR+ NSP+ +P++   KK+   +R+C+D+RQLN+ T R AFP+P IDE LD L G+ +F+++DL + Y QV + E  + KTAF    G + +NRMPFG+  APGTFQ +M ++ G  + +  ++YLDDI+++SS+ ++H   L  VL  +++  LR+   KC   ++E+R+LGH+IS +GV TDP KI+A+  +  P+ V +LR+FLG  +YYRRF+  +A+ A  L +L              S+ +  W+      F+ LK  LT+AP+L + D    FIL+ DAS + +GAVLSQ ++ G  R IAY S  + + E+    Y   + E LA+ +     F+ YL G    + TD+  ++++ T K  + +  Q W+  L++ D  ++YR G  ++NAD +SR    +                     VQ  +AH                E +  ++   +L S+               EG S +   ++ +          V ++ ++P++             + E ++++H+   H G+E+    ++ R     +   +    ++CE CQ  K  + +   P+    +    E++ +D    L  ++   + +L + D +SKY V      Q   T+ K ++ +W  +FG P  +  D G++FES ++++   +  I+   ++PYH   NGQ ER  RT+ +L+ A    RK+ DW   LP+V F  N T  +TTG+SP  ++FG+  ++  +   G  + P+        EE + + R  F+V ++ L +   R K   D++  E P    + V  R Y +R 
Sbjct:  267 LETLPAEDRGKAQALLQKYANVFAAHEGDLGCTTLMTHEIPLLDDAPVRQRHRRIPPSEYEAVKDHINQLLASGVIRESNSPYASPIVLARKKDGS-LRMCVDYRQLNSKTRRDAFPLPRIDESLDALSGARWFTTLDLASGYNQVPVTEGDRAKTAFCIPFGLFEWNRMPFGLCNAPGTFQRLMQRIFGDQQCQSVLLYLDDIVVFSSTIDEHLERLELVLGRLQQEKLRVKLPKCAFFQQEVRYLGHVISDQGVSTDPHKIEAVAGWQPPSTVSELRTFLGFASYYRRFVEGFARLAAPLHRLVGELDGTKSRRRKASSLQGHWTTECQQNFDALKQKLTSAPVLAYADFTLPFILEVDASHNGLGAVLSQ-EQGGSVRPIAYASRGLKATERNMQNYSSMKLEFLALKWAMTEKFREYLLGHHCVVFTDNNPLSYLSTAK--LGAMEQRWVAQLAAFDYEIKYRSGRVNRNADALSRHPNHSSAEVGNMAPGSSLPRDLQQVQVQPVLAHCEMEQLMPVFPQRTTAEVQDLQVSDPVLASVLPFWRDQRYPNYSEREGLSKVALTLLRQWKNLAEVDGLVYRRVLMPDSGQEVFQLLLPEILIPEVLEQVHQHHGHQGIERTLALLRARCYWPGMSKDVAHWCQACERCQLAKDNSRSHSAPLGHLIASRPNELVAMDFTI-LEPSRTGVENVLVLTDVFSKYTVAIPTRDQRAATVAKVLVAEWFSKFGVPARLHSDQGRSFESQLIQQLCGLYGIEKSRTTPYHPAGNGQCERFNRTLHNLLRALPVTRKR-DWHSCLPQVTFCYNTTPHQTTGESPFFLMFGQQPRLPIDFMLGQVKEPVNGTIHEWVEEHQARLRVAFDVAKDRLAEAAARRKRNHDKHVQEAPLDEGQLVLLRDYSVRG 1167          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Match: ENSORLT00000032005.1 (pep primary_assembly:ASM223467v1:7:31190450:31194463:1 gene:ENSORLG00000029990.1 transcript:ENSORLT00000032005.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 422.55 bits (1085), Expect = 2.078e-122
Identity = 267/907 (29.44%), Postives = 461/907 (50.83%), Query Frame = 3
Query: 1296 LEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKI-VTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLG--KIKGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCS----------GPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEK---GYCITRKELLAVYYFCIH-FKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTC--------------------VQCQMAH---------------EEAKKGKIKTRLLDSI-------------REEGRSNIQHGIVEE----------VRKKTMIPENE------------LQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGK--KISREKWYGSKEIPIK-------EELEEQTRRKFNVGEEVLVKVETRHK-GQDRY--EGPYKVIEKVHDRRYILRN 3719
            LE L   +  K + ++    NVF A + D+G T L+ H+I +    P+  +  R P +    +++ I  L  +G+IR+ NSP+ +P++   KK+   +R+C+D+RQLN+ T R AFP+P IDE LD L G+ +F+++DL + Y QV + E  + KTAF T  G + +NRMPFG+  APGTFQ +M ++ G  + +  ++YLDDI+++SS+ ++H   L  VL  +++  LR+   KC   ++E+R+LGH+IS +GV TDP KI+A+  +  P+ V +LR+FLG  +YYRRF+  +A+ A  L +L              S+ +  W+      F+ LK  LT+AP+L + D    FIL+ DAS + +GAVLSQ ++ G  R IAY S  + + E+    Y   + E LA+ +     F+ YL G    + TD+  ++++ T K  + +  Q W+  L++ D  ++YR G  ++NAD +SR    +                     VQ  +AH                E +  ++   +L S+               EG S +   ++ +          V ++ ++P++             + E ++++H+   H G+E+    ++ R     +   +    ++CE CQ  K  + +   P+    +    E++ +D    L  ++   + +L + D +SKY V      Q   T+ K ++ +W  +FG P  +  D G++FES ++++   +  I+   ++PYH   NGQ ER  RT+ +L+ A    RK+ DW   LP+V F  N T  +TTG+SP  ++FG+  ++  +   G  + P+        EE + + R  F+V ++ L +   R K   D++  E P    + V  R Y +R 
Sbjct:  267 LETLPAEDRGKARALLQKYANVFAAHEGDLGCTTLMTHEIPLLDDAPVRQRHRRIPPSEYEAVKDHINQLLASGVIRESNSPYASPIVLARKKDGS-LRMCVDYRQLNSKTRRDAFPLPPIDESLDALSGARWFTTLDLASGYNQVPVTEGDRAKTAFCTPFGLFEWNRMPFGLCNAPGTFQRLMQRIFGDQQCQSVLLYLDDIVVFSSTIDEHLERL--VLGRLQQEKLRVKLPKCAFFQQEVRYLGHVISDQGVSTDPHKIEAVAGWQPPSTVSELRTFLGFASYYRRFVEGFARLAAPLHRLVGELDGTKSRRRKASSLQGHWTTECQQNFDALKQKLTSAPVLAYADFTLPFILEVDASHNGLGAVLSQ-EQGGSVRPIAYASRGLKATERNMQNYSSMKLEFLALKWAMTEKFREYLLGHHCVVFTDNNPLSYLSTAK--LGAMEQRWVAQLAAFDYEIKYRSGRVNRNADALSRHPNHSSAEVGNMAPGSSLPRDLQQVQVQPVLAHCEMEQLMPVFPQRTTAEVQDLQVSDPVLASVLPFWRDKRYPNYSEREGLSKVALTLLRQWKNLAEVDGLVYRRVLMPDSGQEVFQLLLPEILIPEVLEQVHQHHGHQGIERTLALLRARCYWPGMSKDVAHWCQACERCQLAKDNSRSHSAPLGHLIASRPNELVAMDFTI-LEPSRTGVENVLVLTDVFSKYTVAIPTRDQRAATVAKVLVAEWFSKFGVPARLHSDQGRSFESQLIQQLRGLYGIEKSRTTPYHPAGNGQCERFNRTLHNLLRALPVTRKR-DWHSCLPQVTFCYNTTPHQTTGESPFFLMFGQQPRLPIDFMLGQVKEPVNGTIHEWVEEHQARLRVAFDVAKDRLAEAAARRKRNHDKHVQEAPLDEGQLVLLRDYSVRG 1165          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000029259.1 (SMESG000029259.1)

HSP 1 Score: 1250.34 bits (3234), Expect = 0.000e+0
Identity = 617/1048 (58.87%), Postives = 788/1048 (75.19%), Query Frame = 3
Query:  771 RVSEKVYRFLKR-------EMFEEEVER-IGTIETEQVVEILYNRMQGRPSITVRMNGENFDCLLDTGARINVMSVNCFNKLRGQQLTKSDDKLRCANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFGFRLLKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGI--VEEVRK---------------------KTMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSKEIPIKEELEE------------QTRRKFNVGEEVLVKVETRHKGQDRYEGPYKVIEKVHDRRYILRNEDGKRIERNVEKLKNFLRRG 3779
            R  EK+Y+FLK+       E+ E+  E+ I ++E ++   I YN MQGRPS T+ + G   DCLLDTGARINVM+ +  ++L   ++ ++ + LRCAN S +ET+GK  + V +G++ + V FI+ + + P++IGG+ELQ  FG  L  I  +E  EK  ++IC IEA+FGR I DEERL  A++ LK+  + +L EI   + NVFMADKWD+G T+L+KHKI+T+GEPI IKP RQP+NLE KIEEAI+NL  NGIIRKCNSPWNTPL+CVWKKEKKDIRLCLDFRQLN +TERQAFPMPN+DE+LD+L GS +FSSIDLGNAYYQV+L++ESQEKTAFSTK GQ+CFNRMPFGIAAAPGTFQE+MTKVL  +   G MVYLDDILI++ ++E HY I G+VL  I  AGLR+NPEKCQI ++E++FLGHII+K+G+QTD +KI+AIQ+F KP CVK LRSFLGICNYYRRFI DYAKKAR LE +C G +N+K+ W+E     F  +K AL TAP+L FPD +KEFILDTDASFDTIGAVLSQ+D+ G E VIAYGSH+M+SHEKGYCITRKELLA+YYFC HF HYLYG+RF LRTDHKAITFM+TTKKPIT+QFQTWINYLSSLDI++EYRKG +H NADM+SR  C TCVQC M HE+AK GKIKTR+L    E G +  Q+    V+E++                      K  IP +  Q  IKE+H LLCHAG +K+  Y+++    ++L +++++++ +CE CQK K +T  TKE  +   S E FE I++DICGPL ET  +KKYI GIID YSKY  LTAI KQDE TI +T+L KWIL+FG PKE+ VDCGK FE+  ++E  K   I+L FSSPYHH TNG IERQFRTIR+ INA+L +  + +WA+I+PE+++TLNAT QKTTG SPAEI+FG+KI R KWY +KEI  +E++E+            +T R F + + VL+K E R+K   R+EGPYKVI+K+H+R Y+L++++GK + RNVEK+K+F + G
Sbjct: 2501 RTIEKIYKFLKKNGDVCINEIVEDTREQNILSVEEQKSPAITYNIMQGRPSTTLDIQGRKVDCLLDTGARINVMAKSVIDRLENIEILETRESLRCANNSRLETMGKLNINVKMGSMERNVTFIIVKNLIPEIIGGVELQRLFGIELKCI--LEEHEKRSDFICEIEARFGRIITDEERLRHAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMIKPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLCLDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKESQEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDDILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKDGIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESIC-GKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTIGAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYLYGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHTNADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEVQEIKNKLENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVHVLLCHAGAQKVTKYIQNNCDMENLATEVKKVIENCERCQKMKTITTKTKEETQTIKSTEPFEKIYMDICGPLKETFNKKKYICGIIDHYSKYISLTAINKQDERTISETLLNKWILKFGAPKELHVDCGKNFEARSIKELAKTAGIELIFSSPYHHNTNGIIERQFRTIREYINASLNEGGRKNWADIVPEIKYTLNATVQKTTGVSPAEIIFGRKIDRMKWYSNKEIN-REDMEKRIEDKTLKPKISKTVRNFEMEDVVLIKQEIRNKDDARWEGPYKVIKKIHERSYLLKDQNGKMVVRNVEKIKHFKKGG 3544          

HSP 2 Score: 97.8265 bits (242), Expect = 8.866e-20
Identity = 63/170 (37.06%), Postives = 97/170 (57.06%), Query Frame = 1
Query:   91 KEVEKEVILRKIKSTEEIKETLTLMDKVEKQREQINSMKSYARVVYQEPQGRVQYKKKEQREESIRREMPECWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCPDKKAPRCFGCGKEGHIRRLCQEIRCERCSRNGHRSEECYTKM---RYGQATERQRFTGNRYSR 591
            +EV KE+ ++ +K+ E+IKET+  ++KV K  EQ+N+++S      +   G   Y+   Q   + +RE+ E W        K +  +K  IECWTC + GH SR C  K+  +C+ CG EGHIRR C  I+C RC+  GH+  ECYT M     G+  ++++ +G R  R
Sbjct: 2245 QEVRKEIEMKDLKTAEQIKETIKKIEKVNKVIEQVNTVRSI-----RPTTGGRTYRDVVQVGAT-KREINE-W--------KPETRVK-MIECWTCQKPGHSSRECNIKRRFQCYACGVEGHIRRECPTIKCHRCNARGHKERECYTNMERRNQGRDRDQRKMSGGRIQR 2398          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000081257.1 (SMESG000081257.1)

HSP 1 Score: 600.897 bits (1548), Expect = 0.000e+0
Identity = 347/884 (39.25%), Postives = 495/884 (56.00%), Query Frame = 3
Query:  864 EILYNRMQGRPSITVRMNGENFDCLLDTGARINVMSVNCFNKLRGQQLTKSDDKLR--CANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFGFRLLKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQ--------------------HGIVEEVRKKTM-----------------------------IPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNG 3356
            ++++ +   R    + +NG + + L ++GA I+VMS  C+  +    L  S   L    +NE   + +G  K+        +E+  I+  K+ P+ IGG++  + FG  L ++ +IE+S  +K +             D +RL   L  LK++++S+L  +I+   N+FMA ++D+G T ++ H++ T G PI   P  QP++LE+K+EE +QNL E G++RKC SPWNTPL+ V K +   +R+CLDFR LN+VTE+ +FPMP++  +LD LG S  F SIDLG AYY VEL E SQ KTAFSTK GQ+CFNR+PFG++ AP TFQ++M ++L  +  KG +VYLDDILIY  ++E H  +L EV   I ++GL++NPEKC   K  + F+GH +S++G+QT+  KI  I+N  +P    +LRSFLG+  YYRRFI +Y+  A  L    +G  +K + W+E     F  LK  L  APIL +P   + FI+DTD SF+ IG                     +T HE GYC+TRKELL ++ F +HF+ YLYGRRF  RTDHKA+TFM TTKKPI+ QFQTW+   S  D  ++YRKG  H NA   SR     C   QM H++AKK K +TR ++S+  +G SNI                     +G    +  KT+                             +P++ ++  +   H  LCH G+ K   Y+KD +    +   I E +  C+IC   K     TKE +  +  +   E I VDI     ETK  KKY++ IID +SK   L     QDE TI   IL  WI RFG P+ I  D  + FE  M R++ +   IK  FSSPY HQ+NG
Sbjct:  274 DVMFKKDDDRKWSIMEINGFHVEMLWNSGASISVMSEKCWRLIGSPILMDSRILLSEVFSNEDK-KPLGSVKIVAKWNKKFRELNVIIVRKIHPNFIGGVDTMKIFGMELKEVNNIESSLVNKRFT------------DSDRLKNTLLTLKLDKNSELGTLISQFSNIFMASRFDLGHTKVITHELKTSGPPILQNPRGQPMHLEAKVEELVQNLLEAGVVRKCQSPWNTPLVIVGKPDG-SVRICLDFRLLNSVTEKFSFPMPDMQLLLDCLGKSKIFYSIDLGQAYYLVELNENSQIKTAFSTKEGQFCFNRLPFGLSTAPATFQKLMHQILEGLVFKGVVVYLDDILIYGENQETHDKLLFEVFTRIRDSGLKVNPEKCAFNKSVLNFIGHTVSEKGIQTNKRKISEIENATEPKSTTELRSFLGLTTYYRRFIKNYSMIAAPLYAATTG-CDKMIVWTEECRKRFINLKKLLCEAPILEYPRADRLFIIDTDDSFEAIG--------------------HLTKHEIGYCVTRKELLVLHEFIVHFRQYLYGRRFVARTDHKALTFMNTTKKPISPQFQTWMANFSEYDFALQYRKGNEHGNAGGWSRLNNTICSHYQMEHKDAKKAKCRTRCINSL--QGSSNIMKIIKQKQNEDKVTSQIISHLNGNEAHISYKTISSSIFKYLKILQIQDNVLMINTDGKLAVVVPDSYVKSLVNYFHIELCHLGINKTLFYLKDFFFLPSMNQIITECINKCKICASRKIDQGRTKEILLPRTGERFLEQIVVDI--DYMETKESKKYMIVIIDCFSKLVSLP----QDEATILNVILNNWIYRFGRPESILTDRERIFEGSMFRDWMEKFGIKQEFSSPYQHQSNG 1114          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000016546.1 (SMESG000016546.1)

HSP 1 Score: 522.316 bits (1344), Expect = 1.786e-166
Identity = 259/461 (56.18%), Postives = 325/461 (70.50%), Query Frame = 3
Query: 1233 ICNIEAKFGRKIKDEERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYL 2615
            +CNI+AKFG+ I   ER  RA ++LK+N  + L EII N+ NVFMA+KW++G T L+KHKIVTRG  INIKP RQP++LE KIEEAIQNL++NGII+                             LN VT+RQA+PM NI E+LD   G   F+SIDLGNAYYQVELEE+S+EKTAFST  GQYCFNRMPFGIA  PGT QE+M KVLG I G +VYL+DILI++++KE++Y +L +V++ I                                    + +AI++F KP C+K L+SF GICNYYRRFI DYAKK R LE+LC G  N KL W+E     FE +K ALT +P+L +PD  ++FI+DTD+SFDTIG VLSQ+D +G E+VIAYGSH+M++HEKGYCI RKELLA+YYFC HF H+LYG+RFTLRTD KAITFM++TKKPIT+QF+TWIN+L
Sbjct:  113 MCNIDAKFGKLITGTERFDRARKELKVNY-TVLAEIIKNNQNVFMANKWEIGCTALLKHKIVTRGSLINIKPRRQPIHLEPKIEEAIQNLFKNGIIK----------------------------NLNLVTDRQAYPMQNIAEILDRFEGEKHFNSIDLGNAYYQVELEEKSKEKTAFSTTTGQYCFNRMPFGIATGPGTSQELMRKVLGNINGTVVYLNDILIFTATKEQYYAVLNDVIERI-----------------------------------GRPEAIKSFQKPECIKNLKSFPGICNYYRRFIKDYAKKTRTLEELC-GKYNVKLIWAENCEKAFEDMKKALTESPVLGYPDFTRDFIIDTDSSFDTIGDVLSQKDNNGYEKVIAYGSHAMSTHEKGYCIKRKELLAIYYFCQHFNHHLYGKRFTLRTDLKAITFMLSTKKPITAQFKTWINHL 508          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000038955.1 (SMESG000038955.1)

HSP 1 Score: 445.662 bits (1145), Expect = 6.767e-134
Identity = 259/636 (40.72%), Postives = 372/636 (58.49%), Query Frame = 3
Query:  906 VRMNGENFDCLLDTGARINVMSVNCFNKLRGQQLTKSDDKLRCANESTIE---TIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFGFRLLKIKDIEASEKDKNYICNIEAKF-GRKIKDEERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNI 2795
            + +NG + D L D+GA I+VMS + + +L G+ +   D ++R +   + E    +G  K+   +   +K+V F    +   +++ G                  A E     + NIE+    +K  D +RL   L  LK++++SKL  +I+   N+FMA ++D+G T ++ H+I            ++P++LE K+EE +QNL E G++RK  SPWNTPL+ V K +   IR+CLDFR LN+VTE+ +F  P++  +LD LG S  FSSIDLG AYYQVEL E SQ KTAFSTK GQ+CFNR+PFG++ AP TFQ++M ++L  +  KG +VYLD+ILIY  ++E H  +L EV   I ++GL++NPEKC   K  + F+GH                                           ++Y+  A  +    +G ++K + W+E   + F  LK  L  APIL +P   + F++DTDASF  IGAVLSQ  +D  E VIAYGS  +T HE G+C+T KELLA++ F +HF+ YLYGRRF  RTDHKA+TFM TTKKPI+ QFQTW+  LS  D  ++YRKG  H NAD  SR     C  C M +++AK+ K +TR ++S+  +G SNI
Sbjct:    1 MEINGFHVDMLWDSGASISVMS-DKYWRLIGRPILM-DSRIRLSGVFSKEDEKPLGSVKI---VAKWNKKVSFHRRSRHHENILHG------------------AKE-----VNNIESLLVNKKFTDSDRLKITLSTLKLDKNSKLGTLISKFSNIFMASRFDLGHTKVITHEI-----------KKKPMHLEGKVEELVQNLLEAGVVRKSISPWNTPLVIVGKLDG-SIRMCLDFRLLNSVTEKFSFYSPDMKLLLDCLGNSKIFSSIDLGQAYYQVELNENSQIKTAFSTKEGQFCFNRLPFGLSTAPATFQKLMHQILEGLVFKGVVVYLDEILIYGENQEIHDKLLFEVFTRIRDSGLKVNPEKCAFNKSVLNFIGH-------------------------------------------TNYSIIAAPMYVATTG-NDKMIVWTEECRNSFINLKKLLCEAPILEYPRADRLFVIDTDASFGAIGAVLSQIKEDCTEVVIAYGSRHLTKHEMGHCVTIKELLALHEFIVHFRQYLYGRRFVARTDHKALTFMNTTKKPISPQFQTWMANLSEHDFALQYRKGEEHGNADGKSRLNNTKCSHCLMENKDAKEAKCRTRYINSL--QGSSNI 550          

HSP 2 Score: 95.1301 bits (235), Expect = 3.706e-19
Identity = 56/142 (39.44%), Postives = 83/142 (58.45%), Query Frame = 3
Query: 3237 PKEIRVDCGKAFESGMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRK-KTDWAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSKEI-PIKEELEEQTRRKFNVGEEVLVKVETRHKGQD 3656
            P+ I  D G  FE  M R++ +   IK  FSSPY HQ+N   ER  RT+RD++  +L + K K +W  +LP +EF+ NAT Q +T  SP EIV+G+KI+   + G + I   +EE+E++T+         LVK  T  + +D
Sbjct:  616 PESILTDRGIIFEGSMFRDWMEKFGIKQEFSSPYQHQSNILAERIIRTVRDMLATSLAEIKTKNNWCRLLPRIEFSFNATIQNSTKFSPFEIVYGRKINL--YSGVEHIQKFREEIEDETKTN-------LVKAATTMQNRD 748          
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Match: SMESG000065302.1 (SMESG000065302.1)

HSP 1 Score: 391.734 bits (1005), Expect = 4.763e-117
Identity = 274/853 (32.12%), Postives = 409/853 (47.95%), Query Frame = 3
Query:  870 LYNRMQGRPSITVRMNGENFDCLLDTGARINVMSVNCFNKLRGQQLTKSDDKLRCANESTIE---TIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFGFRLLKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKI--KGAMVYLDDILIYSSSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFTLRTDHKAI---TFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMVSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSN------------------IQHGIVEEV-------------------------------RKKTMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSCEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGIIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVD 3257
            ++ +   R    + +NG + + L D+GA I+VMS  C+ +L G  +   D ++R +   + E     G  K+        +E   I+  K+ PD IGG++  + F  +L ++  IE+S  +K +             D +RL   L  LK++++SKL  +I+   N+FMA K+D+G                                  I+NL   G++RKC S WNTPL+ V K +    R+CLDFR LN+VTE+ +FPMP++  +LD LG S   SSIDL   YYQVEL E SQ KTAFSTK GQ+CFNR+PFG++ AP TFQ++M ++L  +  K  +VYLDDIL Y+ ++E H  +L EV   I ++GL+++P KC   K  + F+ H +S++G+QT+  K   I N  +     +LRSFLG+ NYYRRFI +Y+  A  L    +G ++K + W+E                                        AVLSQ  +DG E VIAY S  +T+HE GYC+ RKELL +Y   +HF+ YL   +   +   + I    +   T KP  S                   KG  H N+D +SR     C  CQ+  +++K+GK +TR ++S+   G SN                  I H I  E                                +   ++P++ ++  +   +    + G++K    +K  +    +   I E +  C+IC   K      KE +  +  +   E I VDI     ETK  KKY++ IID++SK   L+A   QDE TI   IL  WI RFG P+ I  D
Sbjct:   36 MFKKDDDRKWSMMEINGFHLEMLWDSGASISVMSEKCW-RLIGSPILM-DSRIRLSGVFSKEYEKPFGSVKIVAKRNKKVREFNVIIVRKIHPDFIGGVDTMKIFCMKLKEVNIIESSLVNKRF------------TDSDRLKNTLLTLKLDKNSKLGTLISQFSNIFMASKFDLGH---------------------------------IKNLLVAGVVRKCQSSWNTPLVIVGKPDGSS-RMCLDFRCLNSVTEKVSFPMPDMQLLLDCLGKSKILSSIDLRQVYYQVELNENSQIKTAFSTKEGQFCFNRLPFGLSTAPATFQKLMHQILEGLVFKRVVVYLDDILKYAENQETHDKLLFEVFTRIIDSGLKVSPAKCAFNKSLLNFINHTVSEKGIQTNKGKFSEIVNPIETKSTTELRSFLGLTNYYRRFIKNYSMIAAPLYAATTG-NDKMIVWTEEC--------------------------------------AVLSQIKEDGTEGVIAYVSRHLTNHEMGYCVIRKELLVLYELIVHFRQYL--DKICSQNGSQGIEIYEYNKETNKPTISDI-----------------KGEEHGNSDGISRLNNTICSHCQIEQKDSKEGKCRTRYINSL--HGSSNIMKIIKQKQNEDRVTSEIISHFIGNEAHISYETISSSIFKYLKILQIQDDVLMINSDGKLAVVVPDSYVKSLVNYFYIEQGYLGIKKTLFCLKGFFFWPSMNQIITECINKCKICASRKIDQGRRKEILFPRTGERFLEQIIVDI--AYMETKESKKYMIVIIDRFSKLISLSAAITQDEATILNVILNNWIYRFGRPESILTD 778          
The following BLAST results are available for this feature:
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ZBED51.101e-8939.42zinc finger BED-type containing 5 [Source:HGNC Sym... [more]
ZBED51.101e-8939.42zinc finger BED-type containing 5 [Source:HGNC Sym... [more]
ZBED93.916e-8538.11zinc finger BED-type containing 9 [Source:HGNC Sym... [more]
ZBED94.110e-8538.11zinc finger BED-type containing 9 [Source:HGNC Sym... [more]
ZBED87.525e-7736.88zinc finger BED-type containing 8 [Source:HGNC Sym... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 2
Match NameE-valueIdentityDescription
CNBP6.340e-1039.19gene:FBgn0034802 transcript:FBtr0303464[more]
CNBP6.340e-1039.19gene:FBgn0034802 transcript:FBtr0071994[more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 5
Match NameE-valueIdentityDescription
BX546500.11.289e-9629.51pep chromosome:GRCz11:23:12926092:12931693:-1 gene... [more]
BX511082.11.782e-9627.62pep chromosome:GRCz11:9:14291932:14297132:1 gene:E... [more]
BX511224.16.351e-9429.51pep chromosome:GRCz11:2:18017000:18022765:1 gene:E... [more]
CR855320.12.017e-9329.72pep chromosome:GRCz11:1:7956030:7961696:1 gene:ENS... [more]
CR925755.22.039e-9329.23pep chromosome:GRCz11:17:42486740:42492668:-1 gene... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 5
Match NameE-valueIdentityDescription
anxa64.475e-9535.99annexin A6 [Source:Xenbase;Acc:XB-GENE-989741][more]
ENSXETT00000035398.12.261e-8630.35pep primary_assembly:Xenopus_tropicalis_v9.1:3:986... [more]
castor12.589e-8426.81cytosolic arginine sensor for mTORC1 subunit 1 [So... [more]
lin542.600e-8427.23lin-54 DREAM MuvB core complex component [Source:X... [more]
ENSXETT00000034712.12.602e-8427.10pep primary_assembly:Xenopus_tropicalis_v9.1:9:335... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 5
Match NameE-valueIdentityDescription
Zbed54.948e-8735.02zinc finger, BED type containing 5 [Source:MGI Sym... [more]
Zmym61.356e-5932.52zinc finger, MYM-type 6 [Source:MGI Symbol;Acc:MGI... [more]
Zmym61.710e-5932.29zinc finger, MYM-type 6 [Source:MGI Symbol;Acc:MGI... [more]
Gtf2ird29.918e-2728.06GTF2I repeat domain containing 2 [Source:MGI Symbo... [more]
Rtl16.197e-1323.57retrotransposon Gaglike 1 [Source:MGI Symbol;Acc:M... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 5
Match NameE-valueIdentityDescription
sp|Q99315|YG31B_YEAST5.334e-9529.62Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomy... [more]
sp|Q7LHG5|YI31B_YEAST5.594e-9529.61Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomy... [more]
sp|P04323|POL3_DROME7.799e-9232.71Retrovirus-related Pol polyprotein from transposon... [more]
sp|A4Z943|ZBED5_BOVIN1.628e-9039.87Zinc finger BED domain-containing protein 5 OS=Bos... [more]
sp|A4Z944|ZBED5_CANLF9.086e-9039.64Zinc finger BED domain-containing protein 5 OS=Can... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A5J4NQX13.482e-15532.35Uncharacterized protein OS=Paragonimus westermani ... [more]
A0A0V1KKR91.008e-15431.12Transposon Ty3-G Gag-Pol polyprotein OS=Trichinell... [more]
A0A0V1KLB22.233e-15428.92Transposon Ty3-G Gag-Pol polyprotein OS=Trichinell... [more]
A0A5S6QHB18.806e-15432.49Uncharacterized protein OS=Trichuris muris OX=7041... [more]
A0A5S6Q2I31.081e-15332.49Uncharacterized protein OS=Trichuris muris OX=7041... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSAMXT00000041345.13.628e-12130.05pep primary_assembly:Astyanax_mexicanus-2.0:25:323... [more]
ENSAMXT00000041754.15.467e-12030.22pep primary_assembly:Astyanax_mexicanus-2.0:APWO02... [more]
ENSAMXT00000037150.16.609e-11730.61pep primary_assembly:Astyanax_mexicanus-2.0:10:222... [more]
ENSAMXT00000042253.12.113e-11629.20pep primary_assembly:Astyanax_mexicanus-2.0:APWO02... [more]
ENSAMXT00000048272.16.606e-11038.19pep primary_assembly:Astyanax_mexicanus-2.0:APWO02... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 4
Match NameE-valueIdentityDescription
ENSPMAT00000009777.12.175e-1222.78pep scaffold:Pmarinus_7.0:GL476990:135790:139231:-... [more]
ENSPMAT00000004121.12.336e-1120.58pep scaffold:Pmarinus_7.0:GL477387:93825:107330:1 ... [more]
ENSPMAT00000004123.17.235e-1120.97pep scaffold:Pmarinus_7.0:GL477387:93825:107321:1 ... [more]
ENSPMAT00000010393.13.370e-728.57pep scaffold:Pmarinus_7.0:GL485791:8073:10868:-1 g... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 2
Match NameE-valueIdentityDescription
GIS23.845e-836.36Translational activator for mRNAs with internal ri... [more]
AIR27.880e-632.56RNA-binding subunit of the TRAMP nuclear RNA surve... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 4
Match NameE-valueIdentityDescription
EDO338754.243e-1031.40Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7... [more]
EDO345701.558e-736.00Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7... [more]
EDO424089.225e-733.33Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7... [more]
EDO257858.999e-638.67Predicted protein [Source:UniProtKB/TrEMBL;Acc:A8... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSORLT00000032171.15.676e-12329.88pep primary_assembly:ASM223467v1:18:2114932:211795... [more]
ENSORLT00000029819.17.835e-12329.64pep primary_assembly:ASM223467v1:24:6358775:636291... [more]
ENSORLT00000039099.17.909e-12329.64pep primary_assembly:ASM223467v1:16:31928608:31932... [more]
ENSORLT00000041674.11.331e-12229.33pep primary_assembly:ASM223467v1:5:30088905:300929... [more]
ENSORLT00000032005.12.078e-12229.44pep primary_assembly:ASM223467v1:7:31190450:311944... [more]
back to top
BLAST of Transposon Ty3-I Gag-Pol polyprotein vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match NameE-valueIdentityDescription
SMESG000029259.18.866e-2058.87SMESG000029259.1[more]
SMESG000081257.10.000e+039.25SMESG000081257.1[more]
SMESG000016546.11.786e-16656.18SMESG000016546.1[more]
SMESG000038955.16.767e-13440.72SMESG000038955.1[more]
SMESG000065302.14.763e-11732.12SMESG000065302.1[more]
back to top
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30023331 ID=SMED30023331|Name=Transposon Ty3-I Gag-Pol polyprotein|organism=Schmidtea mediterranea sexual|type=transcript|length=6213bp
GTGGAGGACCTTGTTGATAGGTTGTTCGAAAATAAAGAATTTACCAAAGA
GAAATTAACAGCAAGGATGTTAATAGTATGTGCGGAGAATAAAGAAGTTG
AGAAAGAAGTGATACTACGGAAGATAAAAAGCACAGAAGAAATAAAGGAA
ACACTAACTCTGATGGACAAAGTAGAAAAGCAAAGAGAACAGATAAACAG
TATGAAAAGTTACGCCAGAGTGGTCTATCAGGAACCACAGGGAAGAGTAC
AATACAAGAAGAAAGAACAGAGAGAAGAAAGTATCCGAAGAGAAATGCCA
GAATGCTGGTTATGCCACAAAATAGGACACACGAAAATTGATTGTCCAAT
AAAAGGTAAAATAGAATGCTGGACATGTCATAGAAGCGGACATATAAGTA
GAAACTGTCCAGATAAAAAAGCTCCAAGATGTTTTGGGTGTGGGAAAGAA
GGTCATATAAGAAGATTGTGTCAGGAAATACGCTGTGAAAGATGTAGCAG
AAATGGTCATAGATCCGAGGAGTGTTATACAAAGATGAGGTACGGGCAAG
CGACGGAGAGACAAAGATTCACGGGAAACAGATACAGTCGAATAAATGGT
ATCGAGGAAGAAGAAGAAAGGGAATCGTGTGTAGAGACAAATGAAGATAA
TTTTAGGAAGACGTACCCAAACAACAGGGCTCCACCAGTAGAGGAGTTGG
TTGGAGCCATATTCTAACAGAAAATTTGATGGGATATCCTCAAGGGGCAG
AAAAGTGGGAGAATGTTGGAAGGGTAAGCGAGAAAGTATACAGGTTCCTA
AAGAGAGAAATGTTTGAAGAGGAAGTGGAGAGAATTGGAACCATCGAAAC
GGAGCAGGTTGTTGAAATTTTGTATAACAGAATGCAAGGAAGGCCTAGTA
TAACAGTAAGGATGAATGGGGAAAATTTCGACTGTTTATTAGACACTGGG
GCAAGAATCAATGTAATGAGTGTGAACTGTTTCAACAAGCTACGAGGACA
ACAATTGACGAAAAGTGACGATAAGTTACGATGCGCAAACGAGAGTACGA
TCGAGACAATAGGAAAAACGAAGGTGCAAGTAACCATTGGAAATGTTTCG
AAGGAAGTTATATTCATTGTGGCGGAGAAAGTGACACCAGACGTGATTGG
AGGAATCGAATTACAAGAAACATTTGGATTTAGACTGTTAAAAATTAAAG
ATATCGAGGCTAGTGAAAAAGACAAGAACTATATTTGTAACATTGAAGCA
AAATTTGGCAGAAAAATAAAGGATGAAGAACGATTGATACGAGCGTTGGA
GCAGTTAAAGATCAATGAAGATAGCAAATTAAAGGAAATTATAACAAACA
GTGGAAACGTGTTTATGGCAGATAAATGGGATGTGGGTCGCACACATTTG
GTAAAACACAAGATAGTTACAAGAGGTGAGCCGATAAACATAAAACCATA
TCGCCAACCACTAAATTTGGAATCAAAGATTGAAGAAGCAATACAAAATT
TGTACGAAAATGGAATTATACGGAAATGTAACTCTCCATGGAATACACCG
CTAATATGTGTATGGAAAAAAGAGAAAAAGGATATTAGACTATGCTTGGA
CTTTAGACAGCTGAACGCGGTAACAGAGAGGCAGGCATTTCCAATGCCAA
ACATAGACGAGATGTTGGATTTGTTAGGAGGATCCGTCTTTTTCAGCTCA
ATCGATCTAGGTAATGCATATTACCAAGTGGAACTAGAAGAAGAATCACA
GGAGAAAACAGCTTTCTCAACAAAGAACGGACAATATTGTTTCAACAGAA
TGCCGTTTGGAATTGCAGCAGCACCAGGTACGTTCCAAGAAATGATGACA
AAAGTATTAGGCAAAATAAAGGGAGCAATGGTGTATTTGGACGACATCTT
GATTTATTCAAGCAGTAAAGAGAAACACTATACAATACTCGGAGAAGTGT
TAAAGGCGATTGAGGAAGCAGGCCTAAGGATTAATCCAGAAAAATGCCAA
ATAATAAAGGAAGAAATCAGATTCTTAGGACACATAATCAGCAAGGAAGG
AGTACAGACAGATCCATCTAAAATTCAGGCCATACAGAACTTTGGGAAGC
CTAACTGCGTAAAGAAACTCCGAAGCTTTTTAGGTATTTGCAATTATTAT
AGAAGATTCATCAGTGATTATGCAAAGAAGGCAAGGATGTTAGAACAATT
ATGTAGCGGACCAAGTAACAAAAAATTAGAATGGAGTGAAGGAACGAATA
GCGTGTTTGAGGGATTGAAACTAGCACTCACGACGGCGCCAATCTTATGT
TTTCCGGATCTGAAAAAAGAGTTCATTTTAGATACCGACGCTAGTTTCGA
TACCATTGGAGCAGTACTTTCACAAAGAGATAAAGATGGAAAGGAAAGAG
TAATTGCGTATGGATCACACTCGATGACGAGCCACGAGAAAGGATATTGT
ATTACAAGAAAGGAACTATTGGCGGTATATTATTTTTGCATTCATTTTAA
GCACTATCTCTACGGAAGAAGATTCACGCTCAGGACAGACCACAAGGCAA
TCACGTTTATGATGACAACCAAGAAGCCGATTACTTCACAATTCCAAACA
TGGATCAATTATTTGAGCAGTTTAGATATAAGAGTAGAATACAGAAAGGG
TATTAATCACCAAAATGCAGACATGGTGTCGAGAACTAAATGTGACACGT
GTGTTCAATGTCAAATGGCCCACGAAGAAGCTAAGAAAGGAAAGATAAAA
ACAAGATTACTAGACTCGATCAGAGAAGAGGGAAGAAGCAACATCCAACA
TGGAATTGTAGAAGAAGTACGAAAGAAAACGATGATACCTGAAAATGAGT
TACAAGAAACAATAAAGGAAATACATAGACTATTGTGTCATGCTGGAGTC
GAGAAGATAGCCGATTATATGAAAGATAGATATGTTGGAAAACACCTGTG
GAGTAAGATTCAGGAGATTGTTCGAAGTTGTGAGATTTGTCAAAAAACCA
AGGCTTTAACAATAACAACGAAAGAACCAGTAAAAAGACAAGATTCGAAG
GAAATGTTCGAAATCATATTCGTAGATATATGTGGACCGTTGGCAGAAAC
AAAAGGACGTAAGAAGTATATATTGGGCATAATAGATCAGTATAGTAAGT
ATCAAGTCTTGACAGCGATAACAAAACAAGACGAAGAAACAATAAAGAAA
ACGATTTTAGAAAAGTGGATTTTAAGATTTGGATGCCCAAAAGAGATAAG
AGTTGATTGCGGAAAGGCGTTTGAATCAGGAATGATGAGAGAATTCACAA
AGATGTTAGAGATTAAATTATGTTTTTCGAGTCCATACCATCACCAAACA
AACGGTCAGATAGAAAGACAATTCAGGACAATACGGGATTTGATAAATGC
TACTTTACAGGATAGAAAGAAGACGGATTGGGCAGAGATATTACCAGAAG
TCGAATTCACTTTAAATGCAACCAAACAAAAAACGACAGGGAAAAGCCCG
GCAGAGATAGTTTTTGGAAAAAAGATTAGCAGGGAAAAATGGTACGGGTC
CAAAGAAATACCGATAAAGGAAGAACTAGAAGAACAAACAAGGAGAAAAT
TCAATGTAGGAGAAGAAGTATTAGTAAAAGTAGAAACACGGCACAAAGGC
CAGGACAGGTATGAAGGTCCGTACAAAGTGATAGAGAAAGTGCATGACAG
ACGATACATCTTAAGAAATGAAGATGGAAAAAGGATCGAAAGAAATGTGG
AAAAACTTAAAAATTTCTTAAGAAGGGGGATATGAGGAAGTTTTATGATG
AAGAAATTTTTAAATTGAAAATTTTAAAAATTAAAAAGTTTAGATTAATT
TTTAAAAAAAACAAAACAAGAAGATATTTTAATTAAAAATATCTTCGTAG
AAAATTTGTTTAAATAAGAAAGATTAAAGCAGAGAATTTTTGAGGTTGAT
ATTGATTGAAATAAAATTTATGGAAACAAAAACTGATTTTAATGTTTTAC
AGGATGAACCTCCTAATAATAATGAAGACTAATAGGCAGTATTCTGACGA
ATATATAAGTTTCGGGTTTGCATGGACCGATGAAAAGGAGTGTCCAATTC
CTAAATGTGTTGTATGTGGTGTAGAGCTGTCCAATAGTGCTATGTTTCCG
GCTAAATTGAACCGACACTTTACTAATTCGCATGCTAATCTTGTTTCAAA
AAATAATGATTATTTTAAAAGGTTATTGGGAATGCAAGCAAAACAATTTA
AAGGTGCTATGACAATTTCTGATAAGGCGCAAATTGCTTCATACAAGGAA
CTCTAATTAATTGCACTGAAATTAAAACCACATACTATAGCAGAAAGTCT
TATTTTACCGTCATGTTGTGAAATAGTTAAAAATTATGTTTGGTGATGCA
AAAAATGAGATTATGAAGATTCCCCTATCTAATGATACAATAAAAAAAAA
GAATAAAAGATATGTCACATGATATTGAGGAAATCGTTAATTATAAACTT
TCCAAGAAATATTTTGCTTTGCAAATTGACGAATCTGTTGATATTAGCAG
CAAAGCTCAATTACTTGCATTAGTTCGGTTCATTGATGAAAATGAAATAG
TAAATCAATTTCTTTGCTGCAGAGAGCTTACAGAACATACAACAGGAAAA
GATATTTTTAATTGTATTACCACATATTTAGAAAAATCACAAATATCATG
GGATTTCTGCGTAGGAATTTGTACAGATGGATGTCCATCAATGGCAGGAT
GTATTAAAGGATGTGTTACACTTGTGAAGGAAAAGAATCCAAATATCATA
TCTACACACTGTTTTTTACATCTGGAAGTTTTAGTTTCAAAAACATTGCC
AAACACATTAAAATCTGCATTGGATAAAGTGGTACAAATAGTAAATTATA
TCAAATCAAGACCTCTGCAGGCACGCATTTTTAAACAACTTCGTATATCT
ATGGATGCTAAGTATGAAAGTTTGCTATTGCACACTGAAATAAGATCGTT
GTCTCGCGGTAAAGTCCTATGTCGTTTATTGGAGCTCAAAGATGAACTGC
TGTGTTATTTCCAAAATGTCGCTATGAATAAATTTGTTAAAAATTTTGAA
AATGATATTTGGTGTGCTAAATTGGCATATTTAGCTGATATTTTTAAATA
CTTAAATTCAGTGAATACAAGTATTCAAGGTAAAAATGAGAATATTTTAA
CATCCACGGATAAAATACTGGCATTCAATAAAAAAATATTGTATTGGAAA
AATCGAATAACAAAAAACAATACGCTTGATATGTTTCCTTCAATTCAAAC
AAATAATGTAACAGATATTATTCCTGCCATAATAGAACATCTAACAATAT
TGGATGAAATAATCGCGTGTTACTTTTCATCTCTTAAATTAGAATCCTAT
GACTGGATACGAAATCCATTTGGGACATTTGAATTTTCAAATATAGAACT
ATCTTTACAAGAAGAAGAAATTATTTCCTTGTCAACAAATCGCTCTCTGA
AAATGGAGTTCACAAAAATGTCAAATGAACATTTTTGGATTTTTGTCCAA
GAAGAACACCCATCTCTATCTAAGAAGGCAATCACGATTTAATTACAATT
TTCTACCTCATATCTTTGTGAATTAGGATTTTCAACATTAACCAATATAA
AAACTAAAAAACGAGAAAGACTTACCGATCTAGAAGAGGAAATGAGGGTA
GCAATATCTTATATTAGACCTAATATTGGCGAAATATGTAAAACTCGCCA
GGCTCAAATATCCCATTGAATTTTTTATTATATTTTATCGTTGTTATTTT
GAATTTGTAACAATTTAGTTTTGTTGATTTTATTTTATATAATGTATAAT
ATAATTATTATTTGACTTGAATAAATAAAAAAATTTTTATATATTTCCGT
GTATAATCCAAATTACTGGAATCAATGTTATTTGGTCTGATGCTCCGGGA
AGGTTTCATGTAATTAAATGTGCTCCGTGATCCCGAAAAGGTTAAGAACC
ACTGAATTAGATAGAAAATAAAATTGAAATAACATTTGTAAATATACTTA
CCAGCTGTGATATTCAGATTCATGTTATGATTATGAAAATTTTGTTCTAA
AAATAGTATCTATTAAAAGAAATCTATATGATAATATTTGAAAAGATAAA
TAGCGTCACGAATGATTATAGATCGTAACGCTTCACGGAGAGCATCCGTC
ACGCATGTGTGAG
back to top

protein sequence of SMED30023331-orf-1

>SMED30023331-orf-1 ID=SMED30023331-orf-1|Name=SMED30023331-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=239bp
VEDLVDRLFENKEFTKEKLTARMLIVCAENKEVEKEVILRKIKSTEEIKE
TLTLMDKVEKQREQINSMKSYARVVYQEPQGRVQYKKKEQREESIRREMP
ECWLCHKIGHTKIDCPIKGKIECWTCHRSGHISRNCPDKKAPRCFGCGKE
GHIRRLCQEIRCERCSRNGHRSEECYTKMRYGQATERQRFTGNRYSRING
IEEEEERESCVETNEDNFRKTYPNNRAPPVEELVGAIF*
back to top

protein sequence of SMED30023331-orf-2

>SMED30023331-orf-2 ID=SMED30023331-orf-2|Name=SMED30023331-orf-2|organism=Schmidtea mediterranea sexual|type=polypeptide|length=1019bp
MGYPQGAEKWENVGRVSEKVYRFLKREMFEEEVERIGTIETEQVVEILYN
RMQGRPSITVRMNGENFDCLLDTGARINVMSVNCFNKLRGQQLTKSDDKL
RCANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDVIGGIELQETFG
FRLLKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRALEQLKINEDSK
LKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPYRQPLNLESK
IEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDFRQLNAVTE
RQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEKTAFSTKN
GQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGAMVYLDDILIYSSSKEKH
YTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTDPSKIQ
AIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPSNKKL
EWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVLSQR
DKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRRFT
LRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADMV
SRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEVRKK
TMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRS
CEICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILG
IIDQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFES
GMMREFTKMLEIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTD
WAEILPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSKEIPIKEEL
EEQTRRKFNVGEEVLVKVETRHKGQDRYEGPYKVIEKVHDRRYILRNEDG
KRIERNVEKLKNFLRRGI*
back to top

protein sequence of SMED30023331-orf-3

>SMED30023331-orf-3 ID=SMED30023331-orf-3|Name=SMED30023331-orf-3|organism=Schmidtea mediterranea sexual|type=polypeptide|length=105bp
MFYRMNLLIIMKTNRQYSDEYISFGFAWTDEKECPIPKCVVCGVELSNSA
MFPAKLNRHFTNSHANLVSKNNDYFKRLLGMQAKQFKGAMTISDKAQIAS
YKEL*
back to top

protein sequence of SMED30023331-orf-4

>SMED30023331-orf-4 ID=SMED30023331-orf-4|Name=SMED30023331-orf-4|organism=Schmidtea mediterranea sexual|type=polypeptide|length=377bp
MSHDIEEIVNYKLSKKYFALQIDESVDISSKAQLLALVRFIDENEIVNQF
LCCRELTEHTTGKDIFNCITTYLEKSQISWDFCVGICTDGCPSMAGCIKG
CVTLVKEKNPNIISTHCFLHLEVLVSKTLPNTLKSALDKVVQIVNYIKSR
PLQARIFKQLRISMDAKYESLLLHTEIRSLSRGKVLCRLLELKDELLCYF
QNVAMNKFVKNFENDIWCAKLAYLADIFKYLNSVNTSIQGKNENILTSTD
KILAFNKKILYWKNRITKNNTLDMFPSIQTNNVTDIIPAIIEHLTILDEI
IACYFSSLKLESYDWIRNPFGTFEFSNIELSLQEEEIISLSTNRSLKMEF
TKMSNEHFWIFVQEEHPSLSKKAITI*
back to top
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: molecular function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
GO:0003676nucleic acid binding
GO:0008270zinc ion binding
Vocabulary: biological process
TermDefinition
GO:0006508proteolysis
GO:0015074DNA integration
Vocabulary: Planarian Anatomy
TermDefinition
PLANA:0000020protonephridia
PLANA:0000074oocyte
PLANA:0000099neuron
PLANA:0000101muscle cell
PLANA:0000231vitelline gland
PLANA:0000429neoblast
PLANA:0002032epidermal cell
PLANA:0002075pigment cell
PLANA:0003116parenchymal cell
Vocabulary: INTERPRO
TermDefinition
IPR041577RT_RNaseH_2
IPR036397RNaseH_sf
IPR043128Rev_trsase/Diguanyl_cyclase
IPR041588Integrase_H2C2
IPR025398DUF4371
IPR012337RNaseH-like_sf
IPR000477RT_dom
IPR001584Integrase_cat-core
IPR021109Peptidase_aspartic_dom_sf
IPR001878Znf_CCHC
InterPro
Analysis Name: Schmidtea mediteranean smed_20140614 Interproscan
Date Performed: 2020-05-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.10.10.10coord: 17..152
e-value: 4.3E-75
score: 253.7
NoneNo IPR availableGENE3DG3DSA:1.10.340.70coord: 465..548
e-value: 6.3E-8
score: 34.7
NoneNo IPR availableGENE3DG3DSA:3.10.20.370coord: 312..427
e-value: 2.8E-59
score: 202.1
NoneNo IPR availablePANTHERPTHR24559FAMILY NOT NAMEDcoord: 584..718
NoneNo IPR availablePANTHERPTHR24559:SF269coord: 28..444
NoneNo IPR availablePANTHERPTHR24559:SF269coord: 584..718
NoneNo IPR availablePANTHERPTHR24559FAMILY NOT NAMEDcoord: 28..444
NoneNo IPR availableCDDcd09274RNase_HI_RT_Ty3coord: 321..440
e-value: 1.13474E-45
score: 157.269
NoneNo IPR availableCDDcd01647RT_LTRcoord: 50..225
e-value: 4.77922E-72
score: 231.33
NoneNo IPR availableSUPERFAMILYSSF56672DNA/RNA polymerasescoord: 7..427
IPR041577Reverse transcriptase/retrotransposon-derived protein, RNase H-like domainPFAMPF17919RT_RNaseH_2coord: 290..389
e-value: 4.5E-32
score: 110.1
IPR036397Ribonuclease H superfamilyGENE3DG3DSA:3.30.420.10coord: 555..753
e-value: 2.9E-49
score: 169.0
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3DG3DSA:3.30.70.270coord: 239..440
e-value: 2.8E-59
score: 202.1
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3DG3DSA:3.30.70.270coord: 92..225
e-value: 4.3E-75
score: 253.7
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 564..675
e-value: 3.9E-19
score: 69.0
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 551..720
score: 25.707
IPR041588Integrase zinc-binding domainPFAMPF17921Integrase_H2C2coord: 491..547
e-value: 1.0E-11
score: 44.8
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 68..225
e-value: 1.5E-28
score: 99.9
IPR000477Reverse transcriptase domainPROSITEPS50878RT_POLcoord: 48..225
score: 18.162
IPR012337Ribonuclease H-like superfamilySUPERFAMILYSSF53098Ribonuclease H-likecoord: 562..714