Dimer_Tnp_hAT domain-containing protein

Overview
NameDimer_Tnp_hAT domain-containing protein
Smed IDSMED30014315
Length (bp)3112
Neoblast Clusters

Zeng et. al., 2018




▻ Overview

▻ Neoblast Population

▻ Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 



 

Overview

 

Single cell RNA-seq of pluripotent neoblasts and its early progenies


We isolated X1 neoblasts cells enriched in high piwi-1 expression (Neoblast Population), and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them based on high (Nb1) to low (Nb12) piwi-1 expression levels. We further defined groups of genes that best classified the cells parsed into 12 distinct cell clusters to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signatures was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1 and largely confirmed the cell clusters revealed by scRNA-seq.

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, followed by isolation and single-cell RNA-seq of 1,200 individual cells derived from X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population)




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of Dimer_Tnp_hAT domain-containing protein (SMED30014315) t-SNE clustered cells

Violin plots show distribution of expression levels for Dimer_Tnp_hAT domain-containing protein (SMED30014315) in cells (dots) of each of the 12 neoblast clusters.

 

back to top


 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Dimer_Tnp_hAT domain-containing protein (SMED30014315) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Dimer_Tnp_hAT domain-containing protein (SMED30014315) in cells (dots) of each of the 10 clusters of sub-leathally irradiated X1 and X2 cells.

 

back to top


 

Embryonic Expression

Davies et. al., 2017




Hover the mouse over a column in the graph to view average RPKM values per sample.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods

 

back to top
Homology
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Human
Match: GTF2IRD2 (GTF2I repeat domain containing 2 [Source:HGNC Symbol;Acc:HGNC:30775])

HSP 1 Score: 88.1965 bits (217), Expect = 4.595e-17
Identity = 54/168 (32.14%), Postives = 90/168 (53.57%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKI-----NYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXW--SKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            K+VS+A+ G  +M   N G  + L+S++       E+ ++ CIIH ++LCAQ    + V  + +  VN       W  S+   H +F   L E+++QY   L + +++W  +  VLKRF   L EI++F+       P+L +  W++   F+VD+T  LN L+  LQG
Sbjct:  612 KLVSVASTGTPAMVDANNGLVTKLKSRVATFCKGAELKSICCIIHPESLCAQKLKMDHVMDVVVKSVN-------WICSRGLNHSEFTTLLYELDSQYGSLLYYTEIKWLSRGLVLKRFFESLEEIDSFMSSRGKPLPQLSSIDWIRDLAFLVDMTMHLNALNISLQG 772          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Human
Match: GTF2IRD2 (GTF2I repeat domain containing 2 [Source:HGNC Symbol;Acc:HGNC:30775])

HSP 1 Score: 88.1965 bits (217), Expect = 5.912e-17
Identity = 54/168 (32.14%), Postives = 90/168 (53.57%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKI-----NYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXW--SKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            K+VS+A+ G  +M   N G  + L+S++       E+ ++ CIIH ++LCAQ    + V  + +  VN       W  S+   H +F   L E+++QY   L + +++W  +  VLKRF   L EI++F+       P+L +  W++   F+VD+T  LN L+  LQG
Sbjct:  774 KLVSVASTGTPAMVDANNGLVTKLKSRVATFCKGAELKSICCIIHPESLCAQKLKMDHVMDVVVKSVN-------WICSRGLNHSEFTTLLYELDSQYGSLLYYTEIKWLSRGLVLKRFFESLEEIDSFMSSRGKPLPQLSSIDWIRDLAFLVDMTMHLNALNISLQG 934          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Human
Match: GTF2IRD2B (GTF2I repeat domain containing 2B [Source:HGNC Symbol;Acc:HGNC:33125])

HSP 1 Score: 87.4261 bits (215), Expect = 9.652e-17
Identity = 53/168 (31.55%), Postives = 90/168 (53.57%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKI-----NYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXW--SKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            ++VS+A+ G  +M   N G  + L+S++       E+ ++ CIIH ++LCAQ    + V  + +  VN       W  S+   H +F   L E+++QY   L + +++W  +  VLKRF   L EI++F+       P+L +  W++   F+VD+T  LN L+  LQG
Sbjct:  612 RLVSVASTGTPAMVDANNGLVTKLKSRVATFCKGAELKSICCIIHPESLCAQKLKMDHVMDVVVKSVN-------WICSRGLNHSEFTTLLYELDSQYGSLLYYTEIKWLSRGLVLKRFFESLEEIDSFMSSRGKPLPQLSSIDWIRDLAFLVDMTMHLNALNISLQG 772          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Human
Match: GTF2IRD2B (GTF2I repeat domain containing 2B [Source:HGNC Symbol;Acc:HGNC:33125])

HSP 1 Score: 87.0409 bits (214), Expect = 1.211e-16
Identity = 53/168 (31.55%), Postives = 90/168 (53.57%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKI-----NYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXW--SKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            ++VS+A+ G  +M   N G  + L+S++       E+ ++ CIIH ++LCAQ    + V  + +  VN       W  S+   H +F   L E+++QY   L + +++W  +  VLKRF   L EI++F+       P+L +  W++   F+VD+T  LN L+  LQG
Sbjct:  779 RLVSVASTGTPAMVDANNGLVTKLKSRVATFCKGAELKSICCIIHPESLCAQKLKMDHVMDVVVKSVN-------WICSRGLNHSEFTTLLYELDSQYGSLLYYTEIKWLSRGLVLKRFFESLEEIDSFMSSRGKPLPQLSSIDWIRDLAFLVDMTMHLNALNISLQG 939          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Human
Match: ZBED8 (zinc finger BED-type containing 8 [Source:HGNC Symbol;Acc:HGNC:30804])

HSP 1 Score: 70.8626 bits (172), Expect = 9.081e-12
Identity = 57/199 (28.64%), Postives = 98/199 (49.25%), Query Frame = 2
Query: 1871 SIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPE-LENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEEL-VF*RKINSFCRRYSER*ITSFSISKEI 2461
            S+ TDGA SM G+N    + ++ +I + ++T HC+++  AL  +T P +L   +  V+  +  +     +A  HR F+ F  E+  +YS  L H ++RW  + ++L        EIN FL+ +  N  +  EN  +     ++ D+   LN L   +Q  G       E+L  F RK   + +R  +R  T+F   +EI
Sbjct:  239 SVCTDGASSMLGENSEFVAYVKKEIPHIVVT-HCLLNPHALVIKTLPTKLRDALFTVVRVINFIK---GRAPNHRLFQAFFEEIGIEYSVLLFHTEMRWLSRGQILTHIFEMYEEINQFLHHKSSNLVDGFENKEFKIHLAYLADLFKHLNELSASMQRTGMNTVSAREKLSAFVRKFPFWQKRIEKRNFTNFPFLEEI 433          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Zebrafish
Match: CR392001.3 (pep chromosome:GRCz11:8:38963323:38965260:-1 gene:ENSDARG00000117159.1 transcript:ENSDART00000181495.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:CR392001.3)

HSP 1 Score: 163.696 bits (413), Expect = 1.596e-41
Identity = 87/184 (47.28%), Postives = 124/184 (67.39%), Query Frame = 2
Query: 1865 IVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEE-LVF*RKINSFCR 2413
            ++S+ATDGA SM G   G  ++LQ  ++  +L  HCI+H++ALCAQTFP+E + VMNLVI  +  +    +KA  HRQF+  L+E++++YSD LLHNKVRW  K EVL+RF  CL  + TFL  +D+ +P+LE+  WL+   FMVD+T+ LN L+  LQ +GN    ++E  L F RK+  F R
Sbjct:  244 LISVATDGAPSMRGSKRGFVTLLQKALDRNLLAFHCILHQEALCAQTFPSECMVVMNLVIEMVNKII---AKALNHRQFRALLDEVDSEYSDLLLHNKVRWLSKDEVLRRFVACLEHVKTFLKSKDLIYPQLEDTEWLEKLHFMVDMTSHLNKLNESLQVRGNTALQMLEAVLSFERKLTVFAR 424          

HSP 2 Score: 120.168 bits (300), Expect = 2.099e-27
Identity = 70/176 (39.77%), Postives = 99/176 (56.25%), Query Frame = 1
Query: 2539 FANRSEQFQYNKTTLAFIVSPLNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKAN*Q-------MKFKTTSYEPNLSKLSKTLQSQHSH 3045
            F +R  +F+  K TL+F V+PL  + + +   P GI    LEM++  +  K LW  KF  L  ++E++  QK       KW+ ++ +     LI DTWN+LPDCY  +K  AF VL+IFGSTY  +Q FS MN IKSK +           +K K TSY P++ KLS  ++ Q SH
Sbjct:  465 FESRFCEFRKEKMTLSFPVTPLEIDPSLLSTFP-GIIQADLEMEMADISDKDLWVSKFKRLTAELEDVTRQKAQLAQSHKWSEMEGLPVPEKLIYDTWNALPDCYKNMKTYAFGVLSIFGSTYLCEQIFSNMNYIKSKYRTRLTHESLQSCVKIKVTSYMPDVEKLSSDVRKQKSH 639          

HSP 3 Score: 92.0485 bits (227), Expect = 1.418e-18
Identity = 65/131 (49.62%), Postives = 91/131 (69.47%), Query Frame = 1
Query:  508 KGKPFTEGEYVKDWFICVSEKLFRXXXXXXXXXXXXXXXXLSAKTVQDRITKISSNVT---FADIQLSSALSLVIDESCDIKDTTQVAFFVRYMSYHGPXXXXXXXXPFSGQTRGENIANTVQKCL*DNKI 891
            +GKPFT+G+Y+K+ FI +SE LF DFKNK + ++KIKD+PLSAKTV++R  K++ N+T     DI  + A S+  DESCD       A   RY++  GP+EE+++L+P  GQTRGE+I   V KCL +N I
Sbjct:  115 RGKPFTDGDYMKESFINISEHLFSDFKNKTEIIQKIKDMPLSAKTVKERAIKMAGNITEQQIKDINSAPAYSIACDESCD------TALLCRYVNSDGPQEEIIKLIPLKGQTRGEDICEAVLKCLNENGI 239          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Zebrafish
Match: AL928808.1 (pep chromosome:GRCz11:20:17257101:17259573:-1 gene:ENSDARG00000101333.2 transcript:ENSDART00000166397.2 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:AL928808.1)

HSP 1 Score: 85.1149 bits (209), Expect = 1.851e-16
Identity = 65/199 (32.66%), Postives = 98/199 (49.25%), Query Frame = 2
Query: 1856 IVKIVSIATDGARSMTGKNEGPTSILQSKINYEIL-TLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQY-SDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEEL-VF*RKINSFCRRYSER*ITSF 2443
            + K+ +I TDGA +M G   G   + ++   +    T HCIIH++   ++    + +    L IVN V      + A  HRQFK  ++E++    SD L H  VRW  +  VL RF   LN +  FL E+   +PEL +  W+    F+VD+   LN L+  LQGK   +  LV+ +  F  K+  F     +R  T F
Sbjct:  248 LSKLTAIVTDGAPAMLGSERGLVGLCKADDRFPAFWTFHCIIHQEHWVSKKLNLDHIMKPVLEIVNFVR-----THALNHRQFKNLIDELDEDLPSDLLFHCAVRWLSRGHVLSRFFELLNPVKLFLAEKHKEYPELHDPQWISDLAFLVDVLHYLNGLNVDLQGKLKMLPDLVQSVFAFVNKLKLFKTHLQKRDYTHF 441          

HSP 2 Score: 56.225 bits (134), Expect = 1.894e-7
Identity = 42/121 (34.71%), Postives = 67/121 (55.37%), Query Frame = 1
Query:  514 KPFTEGEYVKDWFICVSEKLFRXXXXXXXXXXXXXXXXLSAKTVQDRITKISSNVT---FADIQLSSALSLVIDESCDIKDTTQVAFFVRYMSYHGPXXXXXX-XXPFSGQTRGENIANTV 864
            KP+ EGE+VK    C+S+ +           + +KDL LS  TV+ RI+ I ++V     +D+Q     S+ +DESCD++D  Q A FVR++S      E L  ++P   +TRG ++  T+
Sbjct:  120 KPYLEGEFVKK---CLSDAVAILCPENENLKRSVKDLQLSRHTVEQRISDIDNSVETHLLSDLQKCQYFSIALDESCDVQDKPQFAIFVRFVSEDCTIREELLDIVPLKDRTRGIDLKETL 237          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Xenopus
Match: spag1 (sperm associated antigen 1 [Source:Xenbase;Acc:XB-GENE-853609])

HSP 1 Score: 85.5001 bits (210), Expect = 2.260e-16
Identity = 55/168 (32.74%), Postives = 85/168 (50.60%), Query Frame = 2
Query: 1874 IATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELE---NDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVL 2368
            I TDGAR+M G+N+G    L+ K   +    HCI+H++ LC  +     +  +  V+  +  +    +++  HR+FK FL E++  Y D LLH  VRW    + L RF     EI+ +L +   +   LE   +  +L    F+ DIT  LN L+  LQG+   V  L
Sbjct:  256 IVTDGARAMVGRNQGLAGRLR-KEGIDCHMFHCIVHQEVLCGTSLK---MADIMDVVTKVTNLIRGGNRSLTHRRFKNFLEELDAAYGDLLLHTNVRWLSAGKCLVRFFALRKEIHLYLSDIKCDSYLLEHLTDVSFLTALAFLTDITQFLNSLNLNLQGRDQNVSQL 419          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Xenopus
Match: ENSXETT00000016563.1 (pep primary_assembly:Xenopus_tropicalis_v9.1:4:37379994:37381838:-1 gene:ENSXETG00000009412.1 transcript:ENSXETT00000016563.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 85.1149 bits (209), Expect = 2.862e-16
Identity = 55/168 (32.74%), Postives = 85/168 (50.60%), Query Frame = 2
Query: 1874 IATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELE---NDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVL 2368
            I TDGAR+M G+N+G    L+ K   +    HCI+H++ LC  +     +  +  V+  +  +    +++  HR+FK FL E++  Y D LLH  VRW    + L RF     EI+ +L +   +   LE   +  +L    F+ DIT  LN L+  LQG+   V  L
Sbjct:  256 IVTDGARAMVGRNQGLAGRLR-KEGIDCHMFHCIVHQEVLCGTSLK---MADIMDVVTKVTNLIRGGNRSLTHRRFKNFLEELDAAYGDLLLHTNVRWLSAGKCLVRFFALRKEIHLYLSDIKCDSYLLEHLTDVSFLTALAFLTDITQFLNSLNLNLQGRDQNVSQL 419          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Xenopus
Match: ENSXETT00000028445.1 (general transcription factor II-I repeat domain-containing protein 2-like [Source:NCBI gene;Acc:101734469])

HSP 1 Score: 64.3142 bits (155), Expect = 7.056e-10
Identity = 48/171 (28.07%), Postives = 85/171 (49.71%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHP-----ELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPV 2359
            K  S+ T+G++SMT K+ G +++L+ K   + + LHCI+H++ L         V  + + I N ++    +  A   ++FK FL+E+   Y DF  H    W    + L RF     EI  +L+ +D N+       L N+ +L    F+ D+T  L+ L + ++ K   +
Sbjct:  241 KCTSVMTNGSKSMTAKSLGLSALLR-KEGADCVVLHCIMHEEMLIGTLLKMSDVMEVVVKISNFILEKRGFITAVTKKKFKTFLDELSAAYGDFDSHKNAYWSSAGQCLFRFFSLRKEI--YLFLKDTNYDPILTESLCNEDFLSSLAFLTDLTHYLHSLKKNIEAKDQLI 408          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Mouse
Match: Gtf2ird2 (GTF2I repeat domain containing 2 [Source:MGI Symbol;Acc:MGI:2149780])

HSP 1 Score: 86.2705 bits (212), Expect = 1.278e-16
Identity = 52/168 (30.95%), Postives = 87/168 (51.79%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKI-----NYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXW--SKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            K+VS+A+ G  +M   N G  + L+++        ++ ++ CIIH + LCAQ             ++++V+ +  W  S+   H  F   L E+++QY   L H  ++W  +  VL+RF   L EI++F+       P+L +  W+    F+VD+TT LN LD  LQG
Sbjct:  606 KLVSVASTGTPAMMDANSGLVTKLRARAASCCKGADLKSVRCIIHPEWLCAQKL-------RMGHVMDVVVDSVNWICSRGLNHGDFTTLLYELDSQYGSLLYHTALKWLGRGLVLRRFFESLEEIDSFMSSRGKPVPQLSSRDWILDLAFLVDMTTHLNTLDASLQG 766          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Mouse
Match: Zbed5 (zinc finger, BED type containing 5 [Source:MGI Symbol;Acc:MGI:1919220])

HSP 1 Score: 55.8398 bits (133), Expect = 2.654e-7
Identity = 50/183 (27.32%), Postives = 89/183 (48.63%), Query Frame = 2
Query: 1865 IVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPE--LENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELVF*RKINSF 2407
            I  + TDGA +  G   G   ++ ++ + + +  HC++H + L  +T P +  +VM  V+ ++  V    + +   R F +  ++++      LLH + RW  + +VLKR     +E+  F  ++ I   E    ++  LQ   ++VDI T LN L+  LQG  +    L E      KI+SF
Sbjct:  399 ICGVCTDGAPATLGCQSGFQRLVLNE-SPKAIGAHCMLHLQTLAMKTLPQDFQEVMKSVLSSVNFVK---ASSLNSRLFLQLCSDLDEPSKTLLLHTEGRWLSRGKVLKRIFELRDELKMFFNQKAIRQFEALFSDNSALQKVAYLVDIFTILNELNLSLQGPNSTCLDLSE------KIHSF 571          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. UniProt/SwissProt
Match: sp|A4IFA3|GT2D2_BOVIN (General transcription factor II-I repeat domain-containing protein 2 OS=Bos taurus OX=9913 GN=GTF2IRD2 PE=2 SV=1)

HSP 1 Score: 88.5817 bits (218), Expect = 1.680e-16
Identity = 53/168 (31.55%), Postives = 90/168 (53.57%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKI-----NYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXW--SKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            K+VS+A+ G  +M   N+G  + L+SK+       ++ ++ CIIH ++LCAQ    + +  + +  VN       W  S+   H +F   L E++ QY   L + +++W  +  VLKRF   L EI++F+       P+L +  W++   F+VD+T  LN L+  LQG
Sbjct:  613 KLVSVASTGTPAMVDANDGLVTKLKSKVAMVCKGSDLKSVCCIIHPESLCAQKLKMDHIMSVVVNAVN-------WICSRGLNHSEFTTLLYELDCQYGSLLYYTEIKWLSRGLVLKRFFESLEEIDSFMSSRGKPLPQLSSQDWIKDLAFLVDMTMHLNTLNISLQG 773          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. UniProt/SwissProt
Match: sp|Q86UP8|GTD2A_HUMAN (General transcription factor II-I repeat domain-containing protein 2A OS=Homo sapiens OX=9606 GN=GTF2IRD2 PE=1 SV=3)

HSP 1 Score: 88.1965 bits (217), Expect = 2.207e-16
Identity = 54/168 (32.14%), Postives = 90/168 (53.57%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKI-----NYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXW--SKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            K+VS+A+ G  +M   N G  + L+S++       E+ ++ CIIH ++LCAQ    + V  + +  VN       W  S+   H +F   L E+++QY   L + +++W  +  VLKRF   L EI++F+       P+L +  W++   F+VD+T  LN L+  LQG
Sbjct:  612 KLVSVASTGTPAMVDANNGLVTKLKSRVATFCKGAELKSICCIIHPESLCAQKLKMDHVMDVVVKSVN-------WICSRGLNHSEFTTLLYELDSQYGSLLYYTEIKWLSRGLVLKRFFESLEEIDSFMSSRGKPLPQLSSIDWIRDLAFLVDMTMHLNALNISLQG 772          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. UniProt/SwissProt
Match: sp|Q6EKJ0|GTD2B_HUMAN (General transcription factor II-I repeat domain-containing protein 2B OS=Homo sapiens OX=9606 GN=GTF2IRD2B PE=1 SV=1)

HSP 1 Score: 87.4261 bits (215), Expect = 4.635e-16
Identity = 53/168 (31.55%), Postives = 90/168 (53.57%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKI-----NYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXW--SKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            ++VS+A+ G  +M   N G  + L+S++       E+ ++ CIIH ++LCAQ    + V  + +  VN       W  S+   H +F   L E+++QY   L + +++W  +  VLKRF   L EI++F+       P+L +  W++   F+VD+T  LN L+  LQG
Sbjct:  612 RLVSVASTGTPAMVDANNGLVTKLKSRVATFCKGAELKSICCIIHPESLCAQKLKMDHVMDVVVKSVN-------WICSRGLNHSEFTTLLYELDSQYGSLLYYTEIKWLSRGLVLKRFFESLEEIDSFMSSRGKPLPQLSSIDWIRDLAFLVDMTMHLNALNISLQG 772          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. UniProt/SwissProt
Match: sp|Q99NI3|GT2D2_MOUSE (General transcription factor II-I repeat domain-containing protein 2 OS=Mus musculus OX=10090 GN=Gtf2ird2 PE=2 SV=1)

HSP 1 Score: 86.2705 bits (212), Expect = 8.939e-16
Identity = 52/168 (30.95%), Postives = 87/168 (51.79%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKI-----NYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXW--SKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            K+VS+A+ G  +M   N G  + L+++        ++ ++ CIIH + LCAQ             ++++V+ +  W  S+   H  F   L E+++QY   L H  ++W  +  VL+RF   L EI++F+       P+L +  W+    F+VD+TT LN LD  LQG
Sbjct:  606 KLVSVASTGTPAMMDANSGLVTKLRARAASCCKGADLKSVRCIIHPEWLCAQKL-------RMGHVMDVVVDSVNWICSRGLNHGDFTTLLYELDSQYGSLLYHTALKWLGRGLVLRRFFESLEEIDSFMSSRGKPVPQLSSRDWILDLAFLVDMTTHLNTLDASLQG 766          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. UniProt/SwissProt
Match: sp|Q8IZ13|ZBED8_HUMAN (Protein ZBED8 OS=Homo sapiens OX=9606 GN=ZBED8 PE=1 SV=1)

HSP 1 Score: 70.8626 bits (172), Expect = 4.361e-11
Identity = 57/199 (28.64%), Postives = 98/199 (49.25%), Query Frame = 2
Query: 1871 SIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPE-LENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEEL-VF*RKINSFCRRYSER*ITSFSISKEI 2461
            S+ TDGA SM G+N    + ++ +I + ++T HC+++  AL  +T P +L   +  V+  +  +     +A  HR F+ F  E+  +YS  L H ++RW  + ++L        EIN FL+ +  N  +  EN  +     ++ D+   LN L   +Q  G       E+L  F RK   + +R  +R  T+F   +EI
Sbjct:  239 SVCTDGASSMLGENSEFVAYVKKEIPHIVVT-HCLLNPHALVIKTLPTKLRDALFTVVRVINFIK---GRAPNHRLFQAFFEEIGIEYSVLLFHTEMRWLSRGQILTHIFEMYEEINQFLHHKSSNLVDGFENKEFKIHLAYLADLFKHLNELSASMQRTGMNTVSAREKLSAFVRKFPFWQKRIEKRNFTNFPFLEEI 433          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. TrEMBL
Match: A0A4Y2AEL1 (General transcription factor II-I repeat domain-containing protein 2A OS=Araneus ventricosus OX=182803 GN=GTF2IRD2_357 PE=4 SV=1)

HSP 1 Score: 234.572 bits (597), Expect = 3.069e-122
Identity = 118/182 (64.84%), Postives = 139/182 (76.37%), Query Frame = 1
Query: 2521 KKFKDEFANRSEQFQYNKTTLAFIVSPLNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKAN*Q-------MKFKTTSYEPNLSKLSKTLQSQHSH 3045
            K  KD F  R EQF+ NK+TLAFIV+PLNTN+NEI+ EPFGID  SLEMQL+ LK+K  W  KFTELK K+EELEVQKCM++ Q KWTALKE+ RV ALI   WNSLP+CYSEVKK+A+ VLTIFGSTY  +QAFSCMNIIKS V++          +K KTT Y+P+L KLSK +Q Q SH
Sbjct:  253 KNMKDRFVVRFEQFKTNKSTLAFIVNPLNTNTNEINIEPFGIDAGSLEMQLLDLKTKDFWSGKFTELKSKLEELEVQKCMHIAQHKWTALKEIPRVEALIFSAWNSLPECYSEVKKVAYIVLTIFGSTYPCEQAFSCMNIIKSTVRSQLTNKSLGACLKLKTTIYKPDLIKLSKGMQRQCSH 434          

HSP 2 Score: 195.667 bits (496), Expect = 3.069e-122
Identity = 108/181 (59.67%), Postives = 137/181 (75.69%), Query Frame = 2
Query: 1856 IVKIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELVF*RKI 2398
            I KIVSIAT  +RSMT  + G TSILQ KIN+EILT HCIIH++ALC QTFPA++++ MNLVI ++  +    +KA YHRQFK+FL E+E+Q SD LLHNKVRW  +  VL+RFALCL+EI TFL E+ I++P+LE+D WLQ F F+VD T KLN    KLQGKGNP Y  +E + F +K+
Sbjct:   40 INKIVSIATGRSRSMTEIHRGVTSILQKKINHEILTFHCIIHQEALCVQTFPAKIIEGMNLVIKSINSIL---AKAIYHRQFKDFLEEIESQCSDLLLHNKVRWLSRGNVLQRFALCLSEIKTFLNEKSIDYPKLEDDKWLQKFNFIVDTTMKLN---LKLQGKGNPAYAKLEVVCFEKKL 214          

HSP 3 Score: 63.1586 bits (152), Expect = 3.069e-122
Identity = 27/41 (65.85%), Postives = 37/41 (90.24%), Query Frame = 3
Query: 2382 CFKEKLILFAEDIQSGKLLHFQFLKKYRNKTIATVDMNYFS 2504
            CF++KL+LF ED++SGKLLHF+ LK+YRN+T AT++ NYFS
Sbjct:  209 CFEKKLLLFVEDMESGKLLHFKNLKQYRNETNATIEANYFS 249          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. TrEMBL
Match: A0A4Y2II58 (Papilin OS=Araneus ventricosus OX=182803 GN=Ppn_22 PE=4 SV=1)

HSP 1 Score: 204.527 bits (519), Expect = 5.964e-111
Identity = 102/144 (70.83%), Postives = 116/144 (80.56%), Query Frame = 1
Query: 2521 KKFKDEFANRSEQFQYNKTTLAFIVSPLNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSK 2952
            K  KD FA R EQF+ NK+T AFIV+PLNTN+NEI+ EPFGID  SL+MQL+ LK+K LW  KFTELK    ELEVQKCM++ Q KWTALKE+SRV ALI   WNSLP+CYSEVKKLA+ VLTIF STYS +QAFSCMNIIK K
Sbjct:  205 KNMKDGFAERFEQFKTNKSTFAFIVNPLNTNTNEINIEPFGIDAGSLQMQLLDLKTKDLWSGKFTELK---SELEVQKCMHIAQHKWTALKEISRVEALIFGAWNSLPECYSEVKKLAYGVLTIFVSTYSCEQAFSCMNIIKMK 345          

HSP 2 Score: 187.963 bits (476), Expect = 5.964e-111
Identity = 101/162 (62.35%), Postives = 125/162 (77.16%), Query Frame = 2
Query: 1898 MTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELV 2383
            MTG + G TSILQ KIN+EILT HCIIH++ LCAQTFPAE+V+VMNLVI  +  +    +KA YHRQFK+FL E++ Q+SD L+HNKVRW  +  VL+RFALCL+EI TFL E+ I+HP+LE D   Q F FMVD T K+N L+ KLQGKGNP Y L+EE+V
Sbjct:    1 MTGIHRGVTSILQKKINHEILTFHCIIHQETLCAQTFPAEIVEVMNLVIKIINSI---LAKALYHRQFKDFLEEIDIQFSDLLMHNKVRWLSRGNVLQRFALCLSEIKTFLNEKSIDHPQLEEDIGFQKFNFMVDTTMKVNELNLKLQGKGNPAYALLEEIV 159          

HSP 3 Score: 63.1586 bits (152), Expect = 5.964e-111
Identity = 30/42 (71.43%), Postives = 37/42 (88.10%), Query Frame = 3
Query: 2382 CF-KEKLILFAEDIQSGKLLHFQFLKKYRNKTIATVDMNYFS 2504
            CF K+KL+LF EDI+SGKLLHFQ LK+YR++T AT+D NYFS
Sbjct:  160 CFEKKKLLLFVEDIESGKLLHFQNLKQYRDETNATIDTNYFS 201          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. TrEMBL
Match: A0A4Y2T489 (General transcription factor II-I repeat domain-containing protein 2A OS=Araneus ventricosus OX=182803 GN=GTF2IRD2_438 PE=4 SV=1)

HSP 1 Score: 228.409 bits (581), Expect = 3.753e-110
Identity = 117/182 (64.29%), Postives = 140/182 (76.92%), Query Frame = 1
Query: 2521 KKFKDEFANRSEQFQYNKTTLAFIVSPLNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKAN*Q-------MKFKTTSYEPNLSKLSKTLQSQHSH 3045
            K  KD FA R EQF+ NK++LAF V+PLNTN+NEI+ +PFGID  SL+MQL+ LK+K  W  KFTELK K+EELEVQKCM++ Q KWTALKE+ RV ALI   WNSLP+CYSEVKKLA+ +LTIFGSTYS +QAFSCMNIIKS V++          +K KTT Y+P L KLSK +QSQ SH
Sbjct:  183 KNMKDGFAVRFEQFKTNKSSLAFKVNPLNTNTNEINTKPFGIDAGSLQMQLLDLKTKDFWSGKFTELKSKLEELEVQKCMHIAQHKWTALKEIPRVEALIFGAWNSLPECYSEVKKLAYGMLTIFGSTYSCEQAFSCMNIIKSTVRSQLTNTNLETCLKLKTTIYKPYLIKLSKGMQSQSSH 364          

HSP 2 Score: 161.384 bits (407), Expect = 3.753e-110
Identity = 87/142 (61.27%), Postives = 109/142 (76.76%), Query Frame = 2
Query: 1958 LTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELV 2383
            LT HCII+++ALCAQTFPA++V+ MNLVI  +  +    +KA YHRQFK+FL E++ Q+SD LLHNKVRW  +  VL+RFALCL++I TFL E+ I++PELE D WLQ F FMVD T KLN    KLQGKGNP Y L+EE+V
Sbjct:    3 LTFHCIIYQEALCAQTFPAKIVEGMNLVIKIINSIL---AKAIYHRQFKDFLEEIDNQFSDLLLHNKVRWLSRGNVLQRFALCLSDIKTFLNEKSIDYPELEEDKWLQKFNFMVDTTMKLN---LKLQGKGNPAYALLEEVV 138          

HSP 3 Score: 62.7734 bits (151), Expect = 3.753e-110
Identity = 27/41 (65.85%), Postives = 37/41 (90.24%), Query Frame = 3
Query: 2382 CFKEKLILFAEDIQSGKLLHFQFLKKYRNKTIATVDMNYFS 2504
            CF++KL+LF ED++SGKLLHF+ LK+YRN+T AT++ NYFS
Sbjct:  139 CFEKKLLLFVEDMESGKLLHFKNLKQYRNETRATIEANYFS 179          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. TrEMBL
Match: A0A4Y2ES36 (General transcription factor II-I repeat domain-containing protein 2A OS=Araneus ventricosus OX=182803 GN=GTF2IRD2_403 PE=4 SV=1)

HSP 1 Score: 208.379 bits (529), Expect = 1.004e-108
Identity = 114/181 (62.98%), Postives = 138/181 (76.24%), Query Frame = 2
Query: 1856 IVKIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELVF*RKI 2398
            I KIVSIATDGAR MTG   G TSILQ KIN+EIL  H IIH++ALCAQTFPAE+++VMNLVI  +  +    +K  YHRQFK+FL E+ +Q+SD LLHNKVRW  +  VL+RFALCL+EI TFL E+ I+HPELE + WLQ F FMVD T KLN L+ KLQGKGNP Y L+E + F +K+
Sbjct:   81 INKIVSIATDGARIMTGIYRGVTSILQKKINHEILPFHFIIHQEALCAQTFPAEIIEVMNLVIKIINSIL---AKTLYHRQFKDFLEEIYSQFSDLLLHNKVRWLSRGNVLQRFALCLSEIKTFLNEKSIDHPELEENKWLQKFNFMVDTTMKLNELNLKLQGKGNPAYALLEVVCFEKKL 258          

HSP 2 Score: 172.94 bits (437), Expect = 1.004e-108
Identity = 84/126 (66.67%), Postives = 99/126 (78.57%), Query Frame = 1
Query: 2521 KKFKDEFANRSEQFQYNKTTLAFIVSPLNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFG 2898
            K  KD FA R EQF  NK+TLAFI +PLNTN+NEI+ EPF I+ +SL+MQL+ LK+K LW  KFTELK  +EELEVQKCM++ Q KWT LKE+ RV ALI   WN LP+CYSEVKKLA+ VLTIFG
Sbjct:  297 KNMKDGFAERLEQFITNKSTLAFIENPLNTNTNEINIEPFVINARSLQMQLLDLKTKDLWSGKFTELKSNLEELEVQKCMHIAQHKWTTLKEILRVQALIFGGWNRLPECYSEVKKLAYGVLTIFG 422          

HSP 3 Score: 62.7734 bits (151), Expect = 1.004e-108
Identity = 27/41 (65.85%), Postives = 36/41 (87.80%), Query Frame = 3
Query: 2382 CFKEKLILFAEDIQSGKLLHFQFLKKYRNKTIATVDMNYFS 2504
            CF++KL+LF ED++ GKLLHF+ LK+YRN+T AT+D NYFS
Sbjct:  253 CFEKKLLLFVEDMERGKLLHFKNLKQYRNETNATIDTNYFS 293          

HSP 4 Score: 25.0238 bits (53), Expect = 1.004e-108
Identity = 10/11 (90.91%), Postives = 10/11 (90.91%), Query Frame = 2
Query: 2900 RHIRGSKRSLA 2932
            RHIR SKRSLA
Sbjct:  423 RHIRASKRSLA 433          

HSP 5 Score: 102.834 bits (255), Expect = 4.031e-19
Identity = 58/80 (72.50%), Postives = 64/80 (80.00%), Query Frame = 1
Query:  664 ISSNVTFA---DIQLSSALSLVIDESCDIKDTTQVAFFVRYMSYHGPXXXXXXXXPFSGQTRGENIANTVQKCL*DNKIE 894
            +SSNVT     DIQL+SALSL IDESCDIKDT QV  FVRYMS  GP+EELL LLP SGQTRG++IAN VQKCL DN I+
Sbjct:    1 MSSNVTHKQTEDIQLASALSLAIDESCDIKDTAQVTLFVRYMSSQGPKEELLGLLPLSGQTRGKDIANAVQKCLEDNGID 80          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. TrEMBL
Match: A0A4Y2UPV0 (General transcription factor II-I repeat domain-containing protein 2A OS=Araneus ventricosus OX=182803 GN=GTF2IRD2_319 PE=4 SV=1)

HSP 1 Score: 206.068 bits (523), Expect = 2.407e-108
Identity = 113/181 (62.43%), Postives = 137/181 (75.69%), Query Frame = 2
Query: 1856 IVKIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELVF*RKI 2398
            I KIVSIATDGAR MTG   G  SILQ KIN+EIL  H IIH++ALCAQTFPAE+++VMNLVI  +  +    +K  YHRQFK+FL E+ +Q+SD LLHNKVRW  +  VL+RFALCL+EI TFL E+ I+HPELE + WLQ F FMVD T KLN L+ KLQGKGNP Y L+E + F +K+
Sbjct:   99 INKIVSIATDGARIMTGIYRGVISILQKKINHEILPFHFIIHQEALCAQTFPAEIIEVMNLVIKIINSIL---AKTLYHRQFKDFLEEIYSQFSDLLLHNKVRWLSRGNVLQRFALCLSEIKTFLNEKSIDHPELEENKWLQKFNFMVDTTMKLNELNLKLQGKGNPAYALLEVVCFEKKL 276          

HSP 2 Score: 176.022 bits (445), Expect = 2.407e-108
Identity = 86/126 (68.25%), Postives = 100/126 (79.37%), Query Frame = 1
Query: 2521 KKFKDEFANRSEQFQYNKTTLAFIVSPLNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFG 2898
            K  KD FA R EQF  NK+TLAFIV+PLNTN+NEI+ EPF I+ +SL+MQL+ LK+K LW  KFTELK  +EELEVQKCM++ Q KWT LKE+ RV ALI   WNSLP+CYSEVKKLA  VLTIFG
Sbjct:  315 KNMKDGFAERFEQFITNKSTLAFIVNPLNTNTNEINIEPFVINARSLQMQLLDLKTKDLWSGKFTELKSNLEELEVQKCMHIAQHKWTTLKEILRVEALIFGAWNSLPECYSEVKKLACGVLTIFG 440          

HSP 3 Score: 60.8474 bits (146), Expect = 2.407e-108
Identity = 26/41 (63.41%), Postives = 36/41 (87.80%), Query Frame = 3
Query: 2382 CFKEKLILFAEDIQSGKLLHFQFLKKYRNKTIATVDMNYFS 2504
            CF++KL+LF ED++ GKLLHF+ LK+YR++T AT+D NYFS
Sbjct:  271 CFEKKLLLFVEDMERGKLLHFKNLKQYRDETNATIDTNYFS 311          

HSP 4 Score: 25.0238 bits (53), Expect = 2.407e-108
Identity = 10/11 (90.91%), Postives = 10/11 (90.91%), Query Frame = 2
Query: 2900 RHIRGSKRSLA 2932
            RHIR SKRSLA
Sbjct:  441 RHIRASKRSLA 451          

HSP 5 Score: 109.768 bits (273), Expect = 1.985e-21
Identity = 61/86 (70.93%), Postives = 68/86 (79.07%), Query Frame = 1
Query:  646 QDRITKISSNVT---FADIQLSSALSLVIDESCDIKDTTQVAFFVRYMSYHGPXXXXXXXXPFSGQTRGENIANTVQKCL*DNKIE 894
            QDR  K+SSNVT     DIQL+SALSL IDESCDIKDT QV  F+RYMS  GP+EELL LLP SGQTRG++IAN VQKCL DN I+
Sbjct:   13 QDRTAKMSSNVTHKQMEDIQLASALSLAIDESCDIKDTAQVTLFLRYMSSQGPKEELLGLLPLSGQTRGKDIANAVQKCLEDNGID 98          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Cavefish
Match: ENSAMXT00000031692.1 (pep primary_assembly:Astyanax_mexicanus-2.0:7:21501471:21504188:-1 gene:ENSAMXG00000043766.1 transcript:ENSAMXT00000031692.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 121.324 bits (303), Expect = 7.033e-29
Identity = 63/171 (36.84%), Postives = 101/171 (59.06%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVE 2374
            K+ ++ TDGA SM GK +G   +++ K+ + ++ LHCIIH++ LCA+   ++   VM  V   +  +      A  HRQF+  L EM+++Y+D  LH  VRW    +VL+RF  C++ I  FL E+   +P+LE++  +    F+ DIT +LN L+ +LQG G  V  + E
Sbjct:   81 KVFAVTTDGAPSMVGKQKGAVKLIEEKVGHPVMKLHCIIHQENLCAKMSNSDFNDVMATVAKVINFLVK--RSALTHRQFRSLLEEMDSEYADLPLHLAVRWLSCGKVLERFVSCIDAIKVFLAEKGQQYPQLEDENCIVKHFFLADITGQLNELNLRLQGAGQTVLDMFE 249          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Cavefish
Match: ENSAMXT00000050151.1 (pep primary_assembly:Astyanax_mexicanus-2.0:17:23214084:23215952:-1 gene:ENSAMXG00000029603.1 transcript:ENSAMXT00000050151.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 94.7449 bits (234), Expect = 1.362e-28
Identity = 61/188 (32.45%), Postives = 95/188 (50.53%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPE-----LENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEEL-VF*RKINSF 2407
            K++SI TDGA +M G+ +G  + L+   N ++++ HCIIH+  LC+            +  +  +I     S +  HR  +EFL E +    D LLHN VRW  K  VL+RF     E+  FL + +          L ++  + I  F+ DI + LN L+ +LQGK N +  L+  +  F RK+  F
Sbjct:  251 KVISITTDGAPAMIGRGKGAVARLKED-NADLISYHCIIHQAVLCS---ALSDEYAEVMKTMMKIINFLRASSSCQHRMLREFLKETDANSDDLLLHNNVRWLSKGRVLERFWSIRREVTAFLEKLESQKAANFSVFLNDEKNMGIIAFLADIMSHLNGLNLQLQGKNNSICDLMTAVRSFQRKLQVF 434          

HSP 2 Score: 53.1434 bits (126), Expect = 1.362e-28
Identity = 51/188 (27.13%), Postives = 85/188 (45.21%), Query Frame = 1
Query: 2524 KFKDEFANRSEQFQYNKTTLAFIVSPLN-------TNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKA-------N*QMKFKTTSYEPNLSKLSKTLQSQHSH 3045
            K  + F+ R + F   +    FI +P         TN    HF+    +T  L+MQ+I L++    + +F   +     L+                       ++S+T  + PD    +KK+A  +LT+FGSTYS + AFS MNIIK+K ++       +  M+   TS++P    L+   ++Q SH
Sbjct:  466 KLIENFSKRFDSFSIGEELKLFIQNPFLITDVRAFTNDVTHHFK--WANTGPLQMQMIDLQADVALKEQFARTESTTFWLQ-----------------------MVSET--AFPD----LKKVALFILTMFGSTYSCEAAFSTMNIIKTKYRSKLTNEHLHMCMRMALTSFKPRFKMLAGQAKAQFSH 622          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Cavefish
Match: ENSAMXT00000049756.1 (pep primary_assembly:Astyanax_mexicanus-2.0:21:10276151:10280466:-1 gene:ENSAMXG00000035254.1 transcript:ENSAMXT00000049756.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 115.161 bits (287), Expect = 2.920e-26
Identity = 57/152 (37.50%), Postives = 91/152 (59.87%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKL 2317
            K+ ++ TDGA SM GK +G   +++ K+ + I+ LHCIIH++ LCA+   ++   VM  V   +  +      A  HRQF+  L EM+++Y+D  LH+ VRW    +VL+RF  C++ I  FL E+   +P+LE++ W+    F+ DIT  L
Sbjct:  241 KVFAVTTDGAPSMVGKQKGAVKLIEEKVGHPIMKLHCIIHQENLCAKMSNSDFNDVMATVAKVINFLVK--RSALTHRQFRSLLEEMDSEYADLPLHSAVRWLSCGKVLERFVSCIDAIKVFLAEKGQQYPQLEDEKWIVKLFFLADITGHL 390          

HSP 2 Score: 66.6254 bits (161), Expect = 7.699e-11
Identity = 52/133 (39.10%), Postives = 75/133 (56.39%), Query Frame = 1
Query:  511 GKPFTEGEYVKDWFICVSEKLFRXXXXXXXXXXXXXXXXLSAKTVQDRITKISSNVTFAD---IQLSSALSLVIDESCDIKDTTQVAFFVRYMSYHGPXXXXXXXXPFSGQTRGENIANTVQKCL*DNKIEKR 900
            GKPFT+GEYVK+ F+  +E LF D  NK     +IKD+P SA+TVQ RI +++ +V       +  +S  SL +DES D+ D  ++A   RY       EEL  L P    TRG+NIA  + +   +  ++ R
Sbjct:  108 GKPFTDGEYVKEAFLSCAETLFDDLPNKDTIKTRIKDMPTSARTVQRRIDEMAVDVRAQQTKGLTDASVFSLALDESVDVNDIPRLAVMARYCDTTTVREELCCLQPMPDTTRGDNIATVIMQHFVERGVDMR 240          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Cavefish
Match: ENSAMXT00000037411.1 (pep primary_assembly:Astyanax_mexicanus-2.0:13:18144228:18145385:-1 gene:ENSAMXG00000033115.1 transcript:ENSAMXT00000037411.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 105.916 bits (263), Expect = 7.706e-24
Identity = 61/167 (36.53%), Postives = 96/167 (57.49%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILT----LHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKG 2350
            K+ SI TDGA  M G + G T  ++ ++    LT    +HC+IH++ALC +    + V  + +  +N +      +K   HR+F++FL+E+E+ Y D L + +VRW  +  VL+RF   L EIN FL+ +D   PEL +  W     F+ D+T  LN L+ +LQG+G
Sbjct:   37 KLASITTDGAPCMVGASRGLTGRVKREMEERGLTAPLQVHCLIHQQALCCKVLKWDSVMKVVVSCINFIR-----AKGLKHREFQQFLSELESAYGDVLYYTEVRWLSRGRVLRRFYELLPEINAFLHSKDKTVPELLDPEWKWHLAFLTDVTEMLNSLNLQLQGQG 198          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Cavefish
Match: ENSAMXT00000037725.1 (pep primary_assembly:Astyanax_mexicanus-2.0:14:29162452:29166863:1 gene:ENSAMXG00000037303.1 transcript:ENSAMXT00000037725.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 93.9745 bits (232), Expect = 1.658e-23
Identity = 64/204 (31.37%), Postives = 103/204 (50.49%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKIN----YEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEEL-VF*RKINSFCRRYSER*ITSFSISKE 2458
            K+ ++ TDGA S+ G+  G    L+  +      E+   HCIIH++ALC +    + +    +  VN +      +++  HR+F  FL ++   Y D + H +VRW  +  VLKRF    NEI  FL ++  +   +EN  W+    F+ D+T  LN L+ KLQGK   +  L E +  F  K+N F  +  +  +T F   +E
Sbjct:  429 KLSAVTTDGALSVVGETNGFIGHLKRHLGPDQAAELKHYHCIIHQEALCGKHLHFKEIMDFVVSAVNFIR-----ARSLKHREFPSFLEDISAAYGDVIYHTEVRWLSRGNVLKRFFALRNEIKLFLEQKGKDTLVMENTDWIADLAFLTDLTGLLNELNLKLQGKDRLICHLFEAVCAFEMKLNLFATQLKKGNLTHFPTCQE 627          

HSP 2 Score: 36.5798 bits (83), Expect = 1.658e-23
Identity = 43/182 (23.63%), Postives = 75/182 (41.21%), Query Frame = 1
Query: 2521 KKFKDEFANRSEQFQYNKTTLAFIVSPLNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKA-----N*Q--MKFKTTSYEPNLSKLSKTLQSQHSH 3045
            +  K  F+ R  QF+  + TL  +  P + ++  +  E        L+ ++I LK     R K  E    M  LE  + +   Q                       P+ ++  +K     +++FGSTY  +Q FS M + KS ++      N Q  ++  TTS EP++++L    +   SH
Sbjct:  644 EDLKLAFSARFGQFRNEQATLQLLADPFSVDTETVPGE--------LQPEIIELKCSTAMRTKHRE----MPLLEFYQSLDREQ----------------------FPNLFANAQKW----ISMFGSTYICEQMFSLMKLNKSPLRTRLTDENLQAVLRLATTSLEPDINQLVSERRCNISH 787          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Medaka
Match: ENSORLT00000033718.1 (general transcription factor II-I repeat domain-containing protein 2 [Source:NCBI gene;Acc:110015985])

HSP 1 Score: 157.918 bits (398), Expect = 1.821e-62
Identity = 87/202 (43.07%), Postives = 124/202 (61.39%), Query Frame = 2
Query: 1865 IVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELV-F*RKINSFCRRYSER*ITSFSISKEISQ 2467
            +V +A+DGA SMTG  +G  ++LQ  ++ ++LT HCI+H++ALCAQTFP E  +VM+LVI  +  +    +    HRQF+  L+E+++ YSD LLHNKVRW  +  VLKRFA CL E+  FL  + +  PELE   W +   FMVD+T  LN L+  LQGKG     ++EE++ F RK+  F        +  F   +E  Q
Sbjct:  247 LVPVASDGAPSMTGAQKGFVALLQKSLDRKLLTFHCILHQEALCAQTFPPECTQVMDLVIQIVNKIM---ANGLNHRQFRSLLDELDSAYSDLLLHNKVRWLSRGVVLKRFAACLEEVKVFLSNKGLTFPELEQPEWQEKLHFMVDMTAHLNTLNTSLQGKGGTALHMLEEVLGFERKLTVFATDLKRGTLYHFPALREFQQ 445          

HSP 2 Score: 103.219 bits (256), Expect = 1.821e-62
Identity = 61/178 (34.27%), Postives = 95/178 (53.37%), Query Frame = 1
Query: 2539 FANRSEQFQYNKTTLAFIVSPLNTNSNEIHFEPFG--IDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKAN*Q-------MKFKTTSYEPNLSKLSKTLQSQHSH 3045
            F  R  +F+  K TL+F V+PL  + ++++   F   +    LEM+L       +W  KF  L   +E +  QK +     KW+ +  + +   L+ +TWN++PD Y  +K+ AF VL IFGSTY  +Q FS +N IK+K ++          +K K TSY P++ KL   +Q Q SH
Sbjct:  466 FGKRFCEFRQEKHTLSFPVTPLTIDPSQLNMTAFAGVVSQPDLEMEL-----ADIWVFKFKSLTAYLENVSRQKAVLAQNHKWSDIANLPKQDKLVFETWNAIPDSYINMKRYAFGVLEIFGSTYICEQVFSNVNFIKNKHRSRLTHISLRSCLKMKVTSYSPDVKKLCSEVQEQKSH 638          

HSP 3 Score: 88.9669 bits (219), Expect = 1.093e-17
Identity = 67/125 (53.60%), Postives = 89/125 (71.20%), Query Frame = 1
Query:  511 GKPFTEGEYVKDWFICVSEKLFRXXXXXXXXXXXXXXXXLSAKTVQDRITKISSNVTFADIQ-LSSALSLVI--DESCDIKDTTQVAFFVRYMSYHGPXXXXXXXXPFSGQTRGENIANTVQKCL 876
            GKPFT+GEY+K+ FI +S+ LF DFKNK + L+KIKD+PLSAK VQDR   ++ NVT   I+ ++SAL+  I  + S D  D  Q+A F RY+S  GP+EE++EL+P  GQTRGE+I   V  CL
Sbjct:  113 GKPFTDGEYLKETFIKISKHLFSDFKNKDEILQKIKDMPLSAKIVQDRSVNMAENVTRQQIEDINSALAYAIACNMSKDKNDIEQIALFCRYVSAAGPQEEMIELIPLKGQTRGEDICEAVLHCL 237          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Medaka
Match: ENSORLT00000035623.1 (pep primary_assembly:ASM223467v1:9:5067000:5069708:1 gene:ENSORLG00000026230.1 transcript:ENSORLT00000035623.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 134.42 bits (337), Expect = 1.970e-59
Identity = 76/175 (43.43%), Postives = 111/175 (63.43%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELVF 2386
            K+VS+ TDGA  M GKN G  ++L+      IL+ HCI+H++ALCAQ    +L +VM+LV+  +  +    ++A   RQFK  L E+   Y   LLH+ VRW  + +VL RFA CL+EI TFL  +++ HPEL +  WL  F ++VD+T  LN L+ K+QG GN ++ L ++ VF
Sbjct:  270 KLVSVCTDGAPCMVGKNRGFVALLREHEKRRILSFHCILHQEALCAQMCGEQLGEVMSLVVRVVNFIV---ARALNDRQFKALLEEVGNSYPGLLLHSNVRWLSRGKVLSRFAACLSEIRTFLERKNVEHPELADTEWLLKFYYLVDMTGHLNHLNVKMQGVGNTIFSL-QQAVF 440          

HSP 2 Score: 102.834 bits (255), Expect = 1.970e-59
Identity = 61/178 (34.27%), Postives = 95/178 (53.37%), Query Frame = 1
Query: 2539 FANRSEQFQYNKTTLAFIVSP--LNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKA-------N*QMKFKTTSYEPNLSKLSKTLQSQHSH 3045
            F  R  +F+       FI  P     +  ++ + P G+  +  E+Q   LK+  LW  KF  L   +E LE QK    +Q KW  +K++     LI  TWN+LP  +  +++++  VLT+FGSTY+ +Q+FS +  IK+ V++       N  MK   TSY+P+   +SKT+Q Q SH
Sbjct:  496 FKARFGEFRERTPLFKFITHPHECAVDITDLSYIP-GVSVRDFELQAADLKASDLWVNKFKSLNEDLERLERQKAELASQHKWEEIKKLQPADQLILKTWNALPVTFFTLQRVSVAVLTMFGSTYACEQSFSHLKNIKTNVRSRLTDGSLNACMKLNLTSYQPDYKSISKTMQHQKSH 672          

HSP 3 Score: 35.039 bits (79), Expect = 1.970e-59
Identity = 14/35 (40.00%), Postives = 26/35 (74.29%), Query Frame = 3
Query: 2379 WCFKEKLILFAEDIQSGKLLHFQFLKKYRNKTIAT 2483
            + F+ +L LF  DI++G+LLHF+ L ++++  IA+
Sbjct:  440 FAFENRLELFIADIETGRLLHFEKLAEFKDACIAS 474          

HSP 4 Score: 62.3882 bits (150), Expect = 1.992e-9
Identity = 46/117 (39.32%), Postives = 74/117 (63.25%), Query Frame = 1
Query:  511 GKPFTEGEYVKDWFICVSEKLFRXXXXXXXXXXXXXXXXLSAKTVQDRITKISSNV---TFADIQLSSALSLVIDESCDIKDTTQVAFFVRYMSYHGPXXXXXXXXPFSGQTRGENI 852
            GKPFT+GE+ K + + V+ +L+ +F +K K +K+IKD+PLSA+TV DR   +S+ +      D+  +   SL +DES D+ + +Q +   RY++     EE L +LP  G TRGE++
Sbjct:  137 GKPFTDGEFAKSFMLDVANELYANFSDKDKIIKQIKDMPLSARTVHDRTIMMSNQIEATQVKDLNAAQFFSLALDESTDVSNLSQFSVIARYVAGDTIREESLAVLPLKGTTRGEDL 253          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Medaka
Match: ENSORLT00000028910.1 (pep primary_assembly:ASM223467v1:18:2548265:2550717:1 gene:ENSORLG00000029755.1 transcript:ENSORLT00000028910.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 132.494 bits (332), Expect = 7.011e-59
Identity = 75/175 (42.86%), Postives = 111/175 (63.43%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELVF 2386
            K+VS+ TDGA  M GKN G  ++L+      IL+ HCI+H++ALCAQ    +L +VM+LV+  +  +    ++A   RQFK  L E+   Y   LLH+ VRW  + +VL RFA CL+EI TFL  +++ HPEL +  WL  F ++VD+T  LN L+ ++QG GN ++ L ++ VF
Sbjct:  243 KLVSVCTDGAPCMVGKNRGFVALLREHEKRRILSFHCILHQEALCAQMCGEQLGEVMSLVVWVVNFIV---ARALNDRQFKALLEEVGNSYPGLLLHSNVRWLSRGKVLSRFAACLSEIRTFLERKNVEHPELADTEWLLKFYYLVDMTGHLNHLNVRMQGVGNTIFSL-QQAVF 413          

HSP 2 Score: 102.834 bits (255), Expect = 7.011e-59
Identity = 61/178 (34.27%), Postives = 95/178 (53.37%), Query Frame = 1
Query: 2539 FANRSEQFQYNKTTLAFIVSP--LNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKA-------N*QMKFKTTSYEPNLSKLSKTLQSQHSH 3045
            F  R  +F+       FI  P     +  ++ + P G+  +  E+Q   LK+  LW  KF  L   +E LE QK    +Q KW  +K++     LI  TWN+LP  +  +++++  VLT+FGSTY+ +Q+FS +  IK+ V++       N  MK   TSY+P+   +SKT+Q Q SH
Sbjct:  469 FKARFGEFRERTPLFKFITHPHECAVDITDLSYIP-GVSVRDFELQAADLKASDLWVNKFKSLNEDLERLERQKAELASQHKWEEIKKLQPADQLILKTWNALPVTFFTLQRVSVAVLTMFGSTYACEQSFSHLKNIKTNVRSRLTDGSLNACMKLNLTSYQPDYKSISKTMQHQKSH 645          

HSP 3 Score: 34.6538 bits (78), Expect = 7.011e-59
Identity = 14/35 (40.00%), Postives = 26/35 (74.29%), Query Frame = 3
Query: 2379 WCFKEKLILFAEDIQSGKLLHFQFLKKYRNKTIAT 2483
            + F+ +L LF  DI++G+LLHF+ L ++++  IA+
Sbjct:  413 FAFENRLELFIADIETGRLLHFEKLAEFKDACIAS 447          

HSP 4 Score: 62.3882 bits (150), Expect = 1.738e-9
Identity = 46/117 (39.32%), Postives = 74/117 (63.25%), Query Frame = 1
Query:  511 GKPFTEGEYVKDWFICVSEKLFRXXXXXXXXXXXXXXXXLSAKTVQDRITKISSNV---TFADIQLSSALSLVIDESCDIKDTTQVAFFVRYMSYHGPXXXXXXXXPFSGQTRGENI 852
            GKPFT+GE+ K + + V+ +L+ +F +K K +K+IKD+PLSA+TV DR   +S+ +      D+  +   SL +DES D+ + +Q +   RY++     EE L +LP  G TRGE++
Sbjct:  110 GKPFTDGEFAKSFMLDVANELYANFSDKDKTIKQIKDMPLSARTVHDRTIMMSNQIEATQVKDLNAAQFFSLALDESTDVSNLSQFSVIARYVAGDTIREESLAVLPLKGTTRGEDL 226          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Medaka
Match: ENSORLT00000038486.1 (pep primary_assembly:ASM223467v1:11:14048492:14066451:1 gene:ENSORLG00000022236.1 transcript:ENSORLT00000038486.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 132.109 bits (331), Expect = 9.602e-57
Identity = 73/169 (43.20%), Postives = 105/169 (62.13%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVL 2368
            K+VS+ TDGA  M GKN G  ++L+      IL+ HCI+H++ALCAQ    +  +VM+LV+  +  +    ++A   RQFK  L+E+   Y   LLH+ VRW  +  VL RFA CL+EI TFL  +++ HPEL +  WL  F ++VD+T  LN L+ K+QG GN +  L
Sbjct:  150 KLVSVCTDGALCMVGKNRGFVALLREHEKRRILSFHCILHQEALCAQMCGEQFGEVMSLVVRVVNFIV---ARALNDRQFKALLDEVGYSYPGLLLHSNVRWLSRGTVLSRFAACLSEIRTFLERKNVEHPELADTEWLLKFYYLVDMTGHLNHLNVKMQGVGNTILSL 315          

HSP 2 Score: 97.8265 bits (242), Expect = 9.602e-57
Identity = 66/194 (34.02%), Postives = 101/194 (52.06%), Query Frame = 1
Query: 2524 KFKDEFAN--------RSEQFQYNKTTLAFIVSP--LNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTA---LKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKA-------N*QMKFKTTSYEPNLSKLSKTLQSQHSH 3045
            KFKD+ ++        R  +F+       FI  P     NS ++ + P G+  +  E+Q   LK+  LW  KF  +   +E LE QK    +Q KW     LK+      LI  TWN+LP  +  +++++  VLT+FGSTY+ +Q+FS +  IK+ V++       N  MK   TSY+P+   +SKT+Q Q SH
Sbjct:  346 KFKDQASHLISCSHSKRFGEFRERTRLFKFITHPHECAVNSTDLSYIP-GVSVRDFELQAADLKASDLWVNKFKSMNEDLERLERQKAELASQHKWCYEEWLKDQ-----LILKTWNALPVTFFTLQRVSVAVLTMFGSTYACEQSFSHLKNIKTNVRSLLTDGSLNACMKLNLTSYQPDYKSISKTMQHQKSH 533          

HSP 3 Score: 33.113 bits (74), Expect = 9.602e-57
Identity = 13/32 (40.62%), Postives = 25/32 (78.12%), Query Frame = 3
Query: 2379 WCFKEKLILFAEDIQSGKLLHFQFLKKYRNKT 2474
            + F+++L LF  DI++G+LLHF+ L K++++ 
Sbjct:  320 FAFEKRLELFIADIETGRLLHFEKLSKFKDQA 351          

HSP 4 Score: 58.9214 bits (141), Expect = 2.011e-8
Identity = 43/115 (37.39%), Postives = 73/115 (63.48%), Query Frame = 1
Query:  511 GKPFTEGEYVKDWFICVSEKLFRXXXXXXXXXXXXXXXXLSAKTVQDRITKISSNVTFADIQ-LSSALSLVIDESCDIKDTTQVAFFVRYMSYHGPXXXXXXXXPFSGQTRGENI 852
            GK FT+GE+ K + + V+ +L+ +F +K K  K+IK +PLSA+TV DR   +S+ +    ++ +++  SL +DES D+ + +Q +   RY++     EE L +LP  G TRGE++
Sbjct:   19 GKSFTDGEFAKSFMLDVANELYDNFSDKDKITKQIKYMPLSARTVHDRTIMMSNQIEATQVKDINAFFSLALDESTDVSNLSQFSVIARYVAGDTIREESLAVLPLKGTTRGEDL 133          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Medaka
Match: ENSORLT00000041272.1 (pep primary_assembly:ASM223467v1:18:30803263:30808674:-1 gene:ENSORLG00000024130.1 transcript:ENSORLT00000041272.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 99.3673 bits (246), Expect = 9.221e-36
Identity = 56/130 (43.08%), Postives = 82/130 (63.08%), Query Frame = 2
Query: 1955 ILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQG 2344
            IL+ +CI+H++ALCAQ    +L +VM+L +  +  +    ++A   RQFK  L E+   Y D LLH+ VRW    +VL RFA CL+EI TFL  +++ HPEL    WL  F ++VD+T  L+ L+ K+QG
Sbjct:  215 ILSFNCILHQEALCAQMCGEQLGEVMSLAVWLINFIV---ARALNDRQFKSLLGEVGNSYPDLLLHSNVRWLSSGKVLSRFAACLSEIRTFLEMKNVGHPELSATKWLLKFYYLVDMTAHLDHLNVKMQG 341          

HSP 2 Score: 61.2326 bits (147), Expect = 9.221e-36
Identity = 46/170 (27.06%), Postives = 81/170 (47.65%), Query Frame = 1
Query: 2521 KKFKDEFANRSEQFQYNKTTLAFIVSP--LNTNSNEIHFEPFGIDTKSLEMQLIYLKSKALWRRKFTELKGKMEELEVQKCMYVTQKKWTALKEMSRV*ALISDTWNSLPDCYSEVKKLAFEVLTIFGSTYSWKQAFSCMNIIKSKVKA-------N*QMKFKTTSYEPN 3003
            + FK  F     +F+ +     FI  P     +S ++++ P G+  +   +Q   LK+  LW  KF  L   ++ LE Q+     Q              LI  TWN+L   +  +++ +  VLT+FGST + +Q FS +  I++ +++       N +MK K T+Y+P+
Sbjct:  390 QSFKASFG----EFREHTRLFKFITHPHECAVDSTDLNYIP-GVSVRDSVLQAADLKASDLWVNKFKSLNEDLDRLERQQAALANQ--------------LIVKTWNALTVTFFTLQRGSVAVLTMFGSTSACEQTFSHLKNIQTYLRSRLTDGSLNARMKLKLTTYQPD 540          

HSP 3 Score: 31.187 bits (69), Expect = 9.221e-36
Identity = 12/31 (38.71%), Postives = 23/31 (74.19%), Query Frame = 3
Query: 2391 EKLILFAEDIQSGKLLHFQFLKKYRNKTIAT 2483
            + L LF  DI++G+L+HF+ L ++++  IA+
Sbjct:  340 QGLELFIADIETGRLMHFEKLAEFKDACIAS 370          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Planmine SMEST
Match: SMESG000011571.1 (SMESG000011571.1)

HSP 1 Score: 158.688 bits (400), Expect = 2.523e-44
Identity = 86/167 (51.50%), Postives = 118/167 (70.66%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVY 2362
            KIVSI+TD A+SM+G  +   +IL+ +IN+EI   HCIIH++ALC QTFP E+ KVM L+I  +  +     K   HRQFKEFL EME++Y++ LLHNKVRW  +  VLK F+  L EI  FL E+ +++PEL N+ W+Q   F+VD+T+ LN L+ KLQGKGN ++
Sbjct:   10 KIVSISTDVAKSMSGFRKWFVAILKERINHEIFAYHCIIHQEALCEQTFPEEISKVMRLMITIINSIVV---KGLNHRQFKEFLVEMESEYANLLLHNKVRWLSRRNVLKLFSSLLPEIEVFLLEKGVHYPELTNNQWIQYCHFVVDVTSHLNQLNLKLQGKGNTIF 173          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Planmine SMEST
Match: SMESG000076570.1 (SMESG000076570.1)

HSP 1 Score: 137.887 bits (346), Expect = 1.596e-42
Identity = 69/91 (75.82%), Postives = 75/91 (82.42%), Query Frame = 1
Query:  946 MLQFIRPLETFEVSDDVEKFIEETDKLFELTQTSEDH*GLFIKAFLSMEATRKFD*TDVELHYNIRIQNAFMKTSNLASDLNAALS*YDCG 1218
            MLQFIRP ETFEV DDV+K IE+TDK FEL+QTSE+H G FIKAF+SMEATRKFD TD ELHY +RIQNAFMKTSNL SDLN  LS    G
Sbjct:    1 MLQFIRPNETFEVDDDVKKIIEDTDKFFELSQTSENHRGFFIKAFISMEATRKFDQTDAELHYKMRIQNAFMKTSNLESDLNTDLSYRRGG 91          

HSP 2 Score: 55.8398 bits (133), Expect = 1.596e-42
Identity = 30/43 (69.77%), Postives = 33/43 (76.74%), Query Frame = 2
Query: 1244 ARNVVRNNLTEEDLKVF*I*NTIHDNETKKYLKLQGFKTFVEM 1372
             RNVVR+NLTE+DLKVF I N I DNETKK L L+  KTF EM
Sbjct:   91 GRNVVRHNLTEDDLKVFLIQNAIEDNETKKDLNLRDLKTFEEM 133          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Planmine SMEST
Match: SMESG000011945.1 (SMESG000011945.1)

HSP 1 Score: 147.517 bits (371), Expect = 1.478e-38
Identity = 85/187 (45.45%), Postives = 124/187 (66.31%), Query Frame = 2
Query: 1862 KIVSIATDGARSMTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAEXXXXXXXXXXXXXXXXXXWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRFALCLNEINTFLYEEDINHPELENDGWLQIF*FMVDITTKLN*LDRKLQGKGNPVYVLVEELVF*RKINSFCRRYS 2422
            +IVSI+TDGA+SM G ++G  +IL+ KIN+EI   HCI +++ LCAQTFP E+ KVM LVI  +  +    +KA  HRQ KE L EME++Y+D LLHNK++W  +   LKR A  L EI  FL E+ +++PEL ++  +Q F F+VD+   +N L+RK Q  GN ++ ++E      K++SF  + S
Sbjct:  133 EIVSISTDGAKSMIGVSKGFVAILKEKINHEIFVYHCIFNQETLCAQTFPEEICKVMRLVITIINSIV---AKALNHRQLKECLVEMESEYADPLLHNKIQWLSRGNALKRLASLLQEIEVFLLEKGVHYPELTDNQRIQNFHFVVDVMYHINQLNRKPQRNGNTIFSMLE------KVDSFKNKLS 310          

HSP 2 Score: 82.0333 bits (201), Expect = 2.463e-16
Identity = 59/126 (46.83%), Postives = 79/126 (62.70%), Query Frame = 1
Query:  508 KGKPFTEGEYVKDWFICVSEKLFRXXXXXXXXXXXXXXXXLSAKTVQDRITKISSNVTF---ADIQLSSALSLVIDESCDIKDTTQVAFFVRYMSYHGPXXXXXXXXPFSGQTRGENIANTVQKCL 876
            + KP+T+ EY+K  FI  SE+LFRDFKNK   LKKIK+L LSAKT++DR  K+ SN+T     D++L S LS  +D+SCDIKDT QV+ F                      TRGE+IA++V +C+
Sbjct:   21 RKKPYTDKEYIKSCFINASEELFRDFKNKADNLKKIKELSLSAKTMKDRTIKMCSNITIQQIEDLKLVSGLSTAVDKSCDIKDTMQVSLF----------------------TRGEDIASSVVECM 124          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Planmine SMEST
Match: SMESG000006529.1 (SMESG000006529.1)

HSP 1 Score: 112.849 bits (281), Expect = 3.322e-29
Identity = 55/86 (63.95%), Postives = 70/86 (81.40%), Query Frame = 1
Query:  946 MLQFIRPLETFEVSDDVEKFIEETDKLFELTQTSEDH*GLFIKAFLSMEATRKFD*TDVELHYNIRIQNAFMKTSNLASDLNAALS 1203
            MLQFI+P E F V DD EKF+EE +K FELTQT+E+H G+FIKAFLS+EAT K++ T+ E  Y IR+Q AF+K SNLA+DLN+AL+
Sbjct:    6 MLQFIKPPEMFNVGDDTEKFLEEAEKFFELTQTAEEHHGIFIKAFLSIEATGKYEATNNEEIYKIRMQTAFVKPSNLANDLNSALA 91          
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Planmine SMEST
Match: SMESG000009623.1 (SMESG000009623.1)

HSP 1 Score: 109.768 bits (273), Expect = 9.724e-28
Identity = 54/85 (63.53%), Postives = 69/85 (81.18%), Query Frame = 1
Query:  946 MLQFIRPLETFEVSDDVEKFIEETDKLFELTQTSEDH*GLFIKAFLSMEATRKFD*TDVELHYNIRIQNAFMKTSNLASDLNAAL 1200
            MLQFI+P E F VSDD EK +EE +K FELTQT+E+  G+FIK FLS+E TRK++ T+ E +Y IRIQ+AF+K +NLA+DLNAAL
Sbjct:    6 MLQFIKPPEMFSVSDDTEKVLEEAEKFFELTQTAEELRGIFIKVFLSIETTRKYEATNNEENYKIRIQSAFVKPANLANDLNAAL 90          
The following BLAST results are available for this feature:
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 5
Match NameE-valueIdentityDescription
GTF2IRD24.595e-1732.14GTF2I repeat domain containing 2 [Source:HGNC Symb... [more]
GTF2IRD25.912e-1732.14GTF2I repeat domain containing 2 [Source:HGNC Symb... [more]
GTF2IRD2B9.652e-1731.55GTF2I repeat domain containing 2B [Source:HGNC Sym... [more]
GTF2IRD2B1.211e-1631.55GTF2I repeat domain containing 2B [Source:HGNC Sym... [more]
ZBED89.081e-1228.64zinc finger BED-type containing 8 [Source:HGNC Sym... [more]
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 2
Match NameE-valueIdentityDescription
CR392001.31.596e-4147.28pep chromosome:GRCz11:8:38963323:38965260:-1 gene:... [more]
AL928808.11.851e-1632.66pep chromosome:GRCz11:20:17257101:17259573:-1 gene... [more]
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 3
Match NameE-valueIdentityDescription
spag12.260e-1632.74sperm associated antigen 1 [Source:Xenbase;Acc:XB-... [more]
ENSXETT00000016563.12.862e-1632.74pep primary_assembly:Xenopus_tropicalis_v9.1:4:373... [more]
ENSXETT00000028445.17.056e-1028.07general transcription factor II-I repeat domain-co... [more]
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 2
Match NameE-valueIdentityDescription
Gtf2ird21.278e-1630.95GTF2I repeat domain containing 2 [Source:MGI Symbo... [more]
Zbed52.654e-727.32zinc finger, BED type containing 5 [Source:MGI Sym... [more]
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 5
Match NameE-valueIdentityDescription
sp|A4IFA3|GT2D2_BOVIN1.680e-1631.55General transcription factor II-I repeat domain-co... [more]
sp|Q86UP8|GTD2A_HUMAN2.207e-1632.14General transcription factor II-I repeat domain-co... [more]
sp|Q6EKJ0|GTD2B_HUMAN4.635e-1631.55General transcription factor II-I repeat domain-co... [more]
sp|Q99NI3|GT2D2_MOUSE8.939e-1630.95General transcription factor II-I repeat domain-co... [more]
sp|Q8IZ13|ZBED8_HUMAN4.361e-1128.64Protein ZBED8 OS=Homo sapiens OX=9606 GN=ZBED8 PE=... [more]
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A4Y2AEL13.069e-12264.84General transcription factor II-I repeat domain-co... [more]
A0A4Y2II585.964e-11170.83Papilin OS=Araneus ventricosus OX=182803 GN=Ppn_22... [more]
A0A4Y2T4893.753e-11064.29General transcription factor II-I repeat domain-co... [more]
A0A4Y2ES361.004e-10862.98General transcription factor II-I repeat domain-co... [more]
A0A4Y2UPV02.407e-10862.43General transcription factor II-I repeat domain-co... [more]
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSAMXT00000031692.17.033e-2936.84pep primary_assembly:Astyanax_mexicanus-2.0:7:2150... [more]
ENSAMXT00000050151.11.362e-2832.45pep primary_assembly:Astyanax_mexicanus-2.0:17:232... [more]
ENSAMXT00000049756.12.920e-2637.50pep primary_assembly:Astyanax_mexicanus-2.0:21:102... [more]
ENSAMXT00000037411.17.706e-2436.53pep primary_assembly:Astyanax_mexicanus-2.0:13:181... [more]
ENSAMXT00000037725.11.658e-2331.37pep primary_assembly:Astyanax_mexicanus-2.0:14:291... [more]
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 5
Match NameE-valueIdentityDescription
ENSORLT00000033718.11.821e-6243.07general transcription factor II-I repeat domain-co... [more]
ENSORLT00000035623.11.970e-5943.43pep primary_assembly:ASM223467v1:9:5067000:5069708... [more]
ENSORLT00000028910.17.011e-5942.86pep primary_assembly:ASM223467v1:18:2548265:255071... [more]
ENSORLT00000038486.19.602e-5743.20pep primary_assembly:ASM223467v1:11:14048492:14066... [more]
ENSORLT00000041272.19.221e-3643.08pep primary_assembly:ASM223467v1:18:30803263:30808... [more]
back to top
BLAST of Dimer_Tnp_hAT domain-containing protein vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match NameE-valueIdentityDescription
SMESG000011571.12.523e-4451.50SMESG000011571.1[more]
SMESG000076570.11.596e-4275.82SMESG000076570.1[more]
SMESG000011945.11.478e-3845.45SMESG000011945.1[more]
SMESG000006529.13.322e-2963.95SMESG000006529.1[more]
SMESG000009623.19.724e-2863.53SMESG000009623.1[more]
back to top
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30014315 ID=SMED30014315|Name=Dimer_Tnp_hAT domain-containing protein|organism=Schmidtea mediterranea sexual|type=transcript|length=3112bp
CGCGTCCTCACTGTTGTCAGTCCATCGGAAGCTGTCGAATGACGCCGTCG
CCTGATGTTTCTATGAGGTTTAATTCGCTTTCGACGCAAGACAATTTGGC
GTCTTCACCAGTGCTGTCTTTGTTGTTTTGAGTAAAAAATTAGAAGGGAA
TTGTCTTCACAAAATTATGGAGGGACATATATATATAATAAAAAAACCGA
AGTAAAGGTAGGTAACATAATAAGTATTATATTTTTGAAATGCATTCTAT
ACATATTCTTTTCCAAACTAAACCTAAATACTTTTTTGAAGCCTATAAGA
ATTGTCAATAAGAAATCAAATTTAGAGACACCTCACTAAAAACATACTCA
GTTCGCTGGTAAATACTCAGCCGATAAAGAACGAAAAAAAAGCTGTCGAC
GAACTTCAAAAACAAAACCAGCAATCAAAATTCATATTAAGTAACTGGAC
GCAATATACAAGCAATATTAATCTGGCAAGCTTCGCAGTATCAATTGAAA
ATGCAATAAAGGCAAACCATTCACAGAGGGTGAGTATGTCAAAGATTGGT
TCATTTGTGTATCTGAAAAACTGTTTCGTGATTTCAAAAACAAACCAAAA
TTCTTGAAAAAAATCAAAGATTTGCCACTTTCCGCTAAAACAGTGCAAGA
TAGAATAACTAAAATATCTTCAAATGTAACATTTGCAGATATTCAACTTT
CTTCTGCCTTATCACTTGTTATAGATGAGTCTTGTGACATAAAAGACACG
ACACAAGTTGCCTTTTTTGTCAGGTATATGTCTTACCATGGTCCAGAAGA
AGAACTTCTAGAATTGCTACCGTTCTCGGGGCAAACAAGAGGGGAAAATA
TAGCGAATACCGTGCAAAAATGCCTTTAAGACAATAAGATCGAAAAACGT
GGAAAAAACGTGATTTATGAATAAACTTAACTCTACTTCTTATATATGCT
GCAATTTATTCGACCACTTGAAACATTTGAAGTTAGTGATGATGTCGAAA
AATTTATAGAAGAAACGGATAAGCTTTTTGAACTTACTCAAACATCGGAA
GATCACTGAGGATTATTTATCAAAGCATTTTTATCGATGGAAGCAACTCG
AAAATTTGACTAAACAGATGTGGAATTGCACTACAACATTAGAATCCAAA
ATGCTTTTATGAAGACATCAAATTTGGCAAGTGATCTCAATGCTGCTTTA
TCATAGTACGATTGTGGAATTTTTAAAGAAAATTGAAAAGTTAGCCAGAA
ATGTTGTTCGAAACAACCTGACAGAAGAAGATCTGAAAGTATTTTAAATT
TAAAACACCATTCATGATAATGAAACGAAAAAATATTTGAAGTTACAAGG
CTTTAAAACTTTCGTAGAAATGAAACACAATCATAAAATTTGATGAAATA
AAAAAAGAGATAGAAAATGAAAGTATTGCTGCTATTCAGCACGTACAGAA
AAAGTATGCAAAAGCTGCAATGTTTAATGAAGGTCAAACATGGCAGCAAC
AGATGCCAAAACCAAGCCAGATGAATGGAAGAAATGACCAAAGAAATCTC
CAAATCGACAACAAACAAATGAACAGTTTAGAGATAAGCGACTATGGATT
TCAAGAAATTCCAATCAACAGAATAGAATAAATTCCTATTCAAATGCTGA
TACTCTGCCAAGTAGATGCTGGGCTTACAGATAAGTGGGCCACCTACGAT
CAGAATGTCCAAATGTTCAATGCAATCACTGCCACAAGAAGGGGCATTTT
AGACACCAGTGCTATGAAACTCATCAACAGAAATTTCAGAGTTCCGGATA
TATAGCTGTTGTTGGAGAGGAACGGCAATCGCCAGTTCCAGTCACGACAA
ATTTGATTGTAAAAATCGTTTCAATAGCAACTGATGGAGCCAGAAGTATG
ACAGGAAAAAATGAAGGGCCAACATCTATTCTTCAGAGCAAAATAAACTA
CGAGATTCTTACACTTCATTGCATAATACATAAAAAAGCGCTTTGTGCCC
AAACATTTCCGGCTGAACTAGTTAAGGTCATGAATTTGGTAATTGTTAAT
TTAGTGATTGTTAACAGTATCTGGTCAAAAGCAGCCTACCATCGTCAGTT
TAAAGAATTTTTAAACGAAATGGAGACTCAGTATTCCGACTTCCTTCTTC
ACAATAAAGTGCGATGGTTTTTCAAAAGTGAGGTGTTGAAACGTTTTGCT
TTGTGCTTGAATGAAATAAATACATTCCTATATGAAGAAGACATTAATCA
CCCTGAATTAGAGAACGACGGATGGTTGCAAATATTTTAGTTTATGGTGG
ATATCACAACAAAATTAAATTAGCTTGATCGAAAACTGCAAGGAAAGGGA
AATCCAGTGTATGTCTTGGTAGAAGAATTGGTGTTTTAAAGAAAAATTAA
TTCTTTTTGCAGAAGATATTCAGAGCGGTAAATTACTTCATTTTCAATTT
CTAAAGAAATATCGCAATAAAACGATTGCAACTGTTGACATGAATTACTT
CAGTTATAATTACAGTTATAAAAAAATTTAAAGATGAATTTGCTAACAGA
TCTGAGCAATTCCAATACAACAAAACCACTTTAGCATTCATAGTAAGTCC
TCTCAACACAAATAGTAATGAAATCCACTTTGAGCCATTTGGAATTGATA
CTAAATCTCTCGAGATGCAATTGATCTATTTAAAAAGTAAAGCTTTGTGG
AGAAGAAAATTTACAGAGTTGAAAGGCAAGATGGAAGAGTTGGAGGTCCA
GAAATGTATGTACGTAACACAAAAAAAGTGGACAGCTTTAAAAGAAATGT
CGCGAGTTTAGGCACTTATATCCGACACATGGAATAGTCTTCCAGATTGC
TACAGTGAGGTGAAGAAGTTGGCATTTGAAGTGCTGACTATCTTCGGATC
GACATATTCGTGGAAGCAAGCGTTCTCTTGCATGAATATAATTAAAAGTA
AAGTAAAAGCCAACTAACAGATGAAATTTAAAACAACAAGTTATGAGCCA
AATTTATCTAAACTTTCTAAAACCTTGCAAAGCCAACACTCCCATTGAAT
TTATTTGCTCAATTGTTTGTTATGGAAGTACAAGTTAGTGTTTAATTGTT
TTCAATTTTTAC
back to top

protein sequence of SMED30014315-orf-1

>SMED30014315-orf-1 ID=SMED30014315-orf-1|Name=SMED30014315-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=131bp
MTGKNEGPTSILQSKINYEILTLHCIIHKKALCAQTFPAELVKVMNLVIV
NLVIVNSIWSKAAYHRQFKEFLNEMETQYSDFLLHNKVRWFFKSEVLKRF
ALCLNEINTFLYEEDINHPELENDGWLQIF*
back to top
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: molecular function
TermDefinition
GO:0046983protein dimerization activity
GO:0003676nucleic acid binding
Vocabulary: Planarian Anatomy
TermDefinition
PLANA:0000074oocyte
PLANA:0000231vitelline gland
PLANA:0002109X1 cell
PLANA:0002111X2 cell
Vocabulary: INTERPRO
TermDefinition
IPR026630EPM2A-int_1
InterPro
Analysis Name: Schmidtea mediteranean smed_20140614 Interproscan
Date Performed: 2020-05-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR026630EPM2A-interacting protein 1PANTHERPTHR45913FAMILY NOT NAMEDcoord: 1..128