SMED30019038

Overview
NameSMED30019038
Smed IDSMED30019038
Length (bp)5638
Neoblast Clusters

Zeng et. al., 2018




▻ Overview

▻ Neoblast Population

▻ Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 



 

Overview

 

Single cell RNA-seq of pluripotent neoblasts and its early progenies


We isolated X1 neoblasts cells enriched in high piwi-1 expression (Neoblast Population), and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them based on high (Nb1) to low (Nb12) piwi-1 expression levels. We further defined groups of genes that best classified the cells parsed into 12 distinct cell clusters to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signatures was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1 and largely confirmed the cell clusters revealed by scRNA-seq.

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, followed by isolation and single-cell RNA-seq of 1,200 individual cells derived from X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population)




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of SMED30019038 (SMED30019038) t-SNE clustered cells

Violin plots show distribution of expression levels for SMED30019038 (SMED30019038) in cells (dots) of each of the 12 neoblast clusters.

 

back to top


 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of SMED30019038 (SMED30019038) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for SMED30019038 (SMED30019038) in cells (dots) of each of the 10 clusters of sub-leathally irradiated X1 and X2 cells.

 

back to top


 

Embryonic Expression

Davies et. al., 2017




Hover the mouse over a column in the graph to view average RPKM values per sample.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods

 

back to top
Anatomical Expression

PAGE et. al., 2020




SMED30019038

has been reported as being expressed in these anatomical structures and/or regions. Read more about PAGE



PAGE Curations: 7

  
Expressed InReference TranscriptGene ModelsPublished TranscriptTranscriptomePublicationSpecimenLifecycleEvidence
protonephridiaSMED30019038 dd_Smed_v4_12777_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
nervous systemSMED30019038 dd_Smed_v4_12777_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
epidermisSMED30019038 dd_Smed_v4_12777_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
cephalic gangliaSMED30019038 dd_Smed_v4_12777_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
neoblastSMED30019038 dd_Smed_v4_12777_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
parenchymal cellSMED30019038 dd_Smed_v4_12777_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
muscle cellSMED30019038 dd_Smed_v4_12777_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
Note: Hover over icons to view figure legend
Homology
BLAST of SMED30019038 vs. Ensembl Zebrafish
Match: cnbpa (CCHC-type zinc finger, nucleic acid binding protein a [Source:ZFIN;Acc:ZDB-GENE-030131-5045])

HSP 1 Score: 48.9062 bits (115), Expect = 3.733e-6
Identity = 22/77 (28.57%), Postives = 37/77 (48.05%), Query Frame = 1
Query: 1396 SKQCRTCGKLGHWENEC-------------YQNVPCSRCGRKGHNINRCEVR--TCFTCGKQGHVSRECRKGTNQNQ 1581
            + +C  CG+ GHW   C              +++ C RCG +GH    CE     C+ C + GH+SR+C++   + +
Sbjct:    5 TSECFGCGRSGHWIKNCPNAGRGRGRGRGRGKDLFCYRCGEQGHIARDCEQTEDACYNCHRSGHISRDCKEPKKERE 81          
BLAST of SMED30019038 vs. Ensembl Xenopus
Match: chrne (cholinergic receptor, nicotinic epsilon [Source:Xenbase;Acc:XB-GENE-979686])

HSP 1 Score: 52.7582 bits (125), Expect = 6.053e-6
Identity = 25/55 (45.45%), Postives = 28/55 (50.91%), Query Frame = 1
Query: 1393 QSKQCRTCGKLGHWENECYQNVPCSRCGRKGHNINRCEVRTCFTCGKQGHVSREC 1557
            Q +QC  CG   H  N+C   V C+ CG KGH    C V  C  CG  GH  REC
Sbjct:  402 QPRQCFKCGSNRHLANDCETKV-CALCGAKGHTSKECNVVRCNLCGTLGHTHREC 455          
BLAST of SMED30019038 vs. TrEMBL
Match: A0A4Y2GHW6 (Uncharacterized protein OS=Araneus ventricosus OX=182803 GN=AVEN_266380_1 PE=4 SV=1)

HSP 1 Score: 70.0922 bits (170), Expect = 2.985e-8
Identity = 85/339 (25.07%), Postives = 136/339 (40.12%), Query Frame = 1
Query:  637 NEPNKLPTANETIREYREKFRGEPFNGNANKLNTFLRNFGVYVEVCGWTNDMVKTRMPLYLCDSALDIFLEAKERGKP---LNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTA-TDGKFD--EVHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIH-LPSQITLMQDESMIATIAVKPEENGQNDAKMNQFIPGAHQIFKPPVQRTXXXXXXXXXXXAQRTGYNGGNNPTQSKQCRTCGKLGHWENECYQ--------NVPCSRCGRKGHNINRCEV---------RTCFTCGKQGHVSRECRKGTNQNQ 1581
            + P   P AN  +   R   +   F+G  +    F   F V     GW N +  +++   L  SA ++      +G P   L +   IQ  L+  FG + LT     EL  R+Q+ GES    A++++RL   A  +   D  E   +  FVD +R  + +    +   T L  A+A + + E   + S++++        TI ++     + D K    + GA + F   +             +A R       NP  +  C  C K GH + EC          NV C  C +KGH    C            TC+ C K+GH+  EC++ T+ N 
Sbjct:  100 DRPINFP-ANPDLTYSRPTVKSLTFDGQTS-WTVFKTQFDVVSSANGWNNRVKASQLVASLRGSAAEVL-----QGIPSDKLTDLMTIQNALEARFGDSHLTQFYRTELKTRRQKPGESLQALAADVKRLMSLAYAECPMDVQESLAVQFFVDAIRDEDTQLRTRLMDFTDLKSALAYSMKCETSKIASKVSMHA-----RTIRIEDNAGKRKDEKFESLL-GALENFLEIL--------AAGKKSAPR------RNPNVT--CWRCYKRGHVQLECPSDSAPRRNPNVTCWICYKKGHVQRECPSDNASRRNPNATCWKCNKKGHLQNECQQITSINH 409          
BLAST of SMED30019038 vs. TrEMBL
Match: A0A4Y2FX16 (Uncharacterized protein OS=Araneus ventricosus OX=182803 GN=AVEN_108846_1 PE=4 SV=1)

HSP 1 Score: 70.0922 bits (170), Expect = 3.434e-8
Identity = 79/334 (23.65%), Postives = 131/334 (39.22%), Query Frame = 1
Query:  709 FNGNANKLNTFLRNFGVYVEVCGWTNDMVKTRMPLYLCDSALDIFLEAKERGKP---LNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTA-TDGKFD--EVHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIH-LPSQITLMQDESMIATIAVKPEENGQNDAKMNQFIPGAHQIFKPPVQRTXXXXXXXXXXXAQRTGYNGGNNPTQSKQCRTCGKLGHWENECYQ--------NVPCSRCGRKGHNINRCEV---------RTCFTCGKQGHVSRECRKGTNQNQGPPRNNNYAQSAPRQPNIE 1638
            F+G  +    F   F V     GW N +  +++   L  SA ++      +G P   L N   I+  L+  FG + LT     EL  R+Q+ GES    A++++RL   A  +   D  E   +  FVD +R  + +    +   T L  A+A + + E   + S++++        TI ++     + D K    + GA + F   +                  G         +  C  C K GH + EC          NV C  C +KGH    C            TC+ C K+GH+  EC++ T+ N      +   ++  R+  +E
Sbjct:  188 FDGQTS-WTVFKTQFDVVSSANGWNNRVKASQLVASLRGSAAEVL-----QGIPSDKLTNLTIIENVLEARFGDSHLTQFYRTELKTRRQKPGESLQALAADVKRLMSLAYAECPMDVQESLAVQFFVDAIRDEDTQLRTRLMDFTDLKSALAYSMKCETSKIASKVSMHA-----RTIRIEDNAGKRKDEKFESLL-GALEKFLEILA----------------VGKKSAPRRNPNVTCWRCYKRGHVQLECPSDSAPRRNPNVTCWICYKKGHVQRECPSDNASRRNPNATCWKCNKKGHLQNECQQITSINHHKETRHGSDKALSRRSFVE 493          
BLAST of SMED30019038 vs. TrEMBL
Match: A0A1Y1IQX0 (Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_010140020 PE=4 SV=1)

HSP 1 Score: 66.6254 bits (161), Expect = 6.558e-7
Identity = 31/73 (42.47%), Postives = 42/73 (57.53%), Query Frame = 1
Query: 1402 QCRTCGKLGHWENECYQNVPCSRCGRKGHNINRC-EVRTCFTCGKQGHVSRECRKGTNQNQGPPRNNNYAQSA 1617
            +CR CG+LGH+  +C Q V C+ C + GH    C +  TC  CGK GHV+++CR G   + G P     A SA
Sbjct:  224 RCRKCGELGHFARDCTQEVRCNNCLQHGHMAKECRKPPTCLKCGKSGHVAKDCRSGG--SSGGPNKRGVAFSA 294          
BLAST of SMED30019038 vs. TrEMBL
Match: A0A443SKN5 (Uncharacterized protein (Fragment) OS=Leptotrombidium deliense OX=299467 GN=B4U80_07539 PE=4 SV=1)

HSP 1 Score: 65.0846 bits (157), Expect = 1.762e-6
Identity = 86/316 (27.22%), Postives = 129/316 (40.82%), Query Frame = 1
Query:  697 RGEPFNGNANK-LNTFLRNFGVYVEVCGWTNDMVKTRMPLYLCDSALDIFLEAKERGKPLNNWKEIQE-----FLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTATDGKFDEVHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIHLPSQITLMQDESMIATIAVKPEENGQNDAKMNQFIPGAHQI--FKPPVQRTXXXXXXXXXXXAQRTGYNGGNNPTQSKQCRTCGKLGHWENECYQNVPCSRCGRKGHNINRCEVRTCFTCGKQGHVSRECRKGTNQNQGPPRNNNYAQSAP 1620
            R   F+ NA++ +  FL  F    E  GW+      ++   L  SA D F +   + K L  WK++++     FL   + V  +     Q+L +RKQ   E  T F   +  L K A     +E+  I + VDGL  PE+R  +     TTL E    A   E  L S  T        A     P EN +  A  N+      ++  ++  +   ++ +        ++T    G  P    +CR CGK+GH    C+     +R  R+  NI +   + C+ CG  GH SR C K        PRNN      P
Sbjct:   30 RNISFSNNASENIVEFLEKFETAAETNGWSAKAKLKKLEGALIGSAKDWF-DVNIKDKNLQ-WKDVKQHMLDHFLPFDYEVFLM-----QKLKDRKQSAFEPVTSFIDSMLNLIKKAEHPS-EELKKIDLIVDGLL-PEIREYIITVKPTTLIELEEHAKLKEKALKSVST---SRYCQAVQRFGPVENPEMLALTNKMHDMEIKLKNYEKAIATLHDEKIPRKNEQTEQTSRTVGGKP----RCRDCGKVGHIAQRCF-----ARFSRQPLNITQ---KQCYNCGDIGHFSRNCPK--------PRNNPQFTGVP 313          
BLAST of SMED30019038 vs. TrEMBL
Match: K4HZB4 (Putative GIS2 DNA-binding protein (Fragment) OS=Polyporales sp. KUC9061 OX=1239933 PE=2 SV=1)

HSP 1 Score: 58.5362 bits (140), Expect = 1.927e-6
Identity = 27/74 (36.49%), Postives = 37/74 (50.00%), Query Frame = 1
Query: 1354 AQRTGYNGGNNP---TQSKQCRTCGKLGHWENECYQNVPCSRCGRKGHNINRC---EVRTCFTCGKQGHVSREC 1557
            A  +GY G  +     Q + C TCG +GH   +C Q   C  C   GH    C   + R C+ CG +GH+SR+C
Sbjct:   10 AGNSGYQGSWSAFGGGQQRTCYTCGGVGHLSRDCVQGSKCYNCSGFGHISKDCPQPQRRACYNCGSEGHISRDC 83          
BLAST of SMED30019038 vs. Ensembl Sea Lamprey
Match: ENSPMAT00000003739.1 (pep scaffold:Pmarinus_7.0:GL499243:411:1384:-1 gene:ENSPMAG00000003418.1 transcript:ENSPMAT00000003739.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 54.299 bits (129), Expect = 1.225e-8
Identity = 29/86 (33.72%), Postives = 36/86 (41.86%), Query Frame = 1
Query: 1396 SKQCRTCGKLGHWENEC-------------------YQNVPCSRCGRKGHNINRCEV--RTCFTCGKQGHVSRECRKGTNQNQGPP 1590
            S +C  CG  GHW  EC                    +   C RCG  GH    C +   TC+ CGK GH++REC +G     G P
Sbjct:    3 SNECFRCGGSGHWARECPNGAGGGRGPGGPVGRGGRGRGDGCYRCGEGGHIARECPLPQDTCYNCGKGGHIARECPEGRQDRGGGP 88          

HSP 2 Score: 51.2174 bits (121), Expect = 1.253e-7
Identity = 31/77 (40.26%), Postives = 37/77 (48.05%), Query Frame = 1
Query: 1405 CRTCGKLGHWENEC-YQNVPCSRCGRKGHNINRC-EVR-------TCFTCGKQGHVSRECRKGTNQNQGPPRNNNYA 1608
            C  CG+ GH   EC      C  CG+ GH    C E R       +C+TCGKQGH++REC  G     GP  N  Y 
Sbjct:   44 CYRCGEGGHIARECPLPQDTCYNCGKGGHIARECPEGRQDRGGGPSCYTCGKQGHLARECSSGGG---GPGDNKCYG 117          
BLAST of SMED30019038 vs. Ensembl Sea Lamprey
Match: ENSPMAT00000010393.1 (pep scaffold:Pmarinus_7.0:GL485791:8073:10868:-1 gene:ENSPMAG00000009411.1 transcript:ENSPMAT00000010393.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 50.447 bits (119), Expect = 1.246e-7
Identity = 31/84 (36.90%), Postives = 37/84 (44.05%), Query Frame = 1
Query: 1405 CRTCGKLGHWENEC--------YQNVPCSRCGRKGHNINRC-EVR-------TCFTCGKQGHVSRECRKGTNQNQGPPRNNNYA 1608
            C  CG+ GH   EC             C  CG+ GH    C E R       +C+TCGKQGH++REC  G     GP  N  Y 
Sbjct:   10 CYRCGEGGHIARECPLPQDSVSSNTAACYNCGKGGHIARECPEGRQDRGGGPSCYTCGKQGHLARECSSGGG---GPGDNKCYG 90          

HSP 2 Score: 45.0542 bits (105), Expect = 9.623e-6
Identity = 22/74 (29.73%), Postives = 33/74 (44.59%), Query Frame = 1
Query: 1390 TQSKQCRTCGKLGHWENECYQN-------VPCSRCGRKGHNINRCEV-------RTCFTCGKQGHVSRECRKGT 1569
            + +  C  CGK GH   EC +          C  CG++GH    C           C+ CG++GH+ R+C K +
Sbjct:   32 SNTAACYNCGKGGHIARECPEGRQDRGGGPSCYTCGKQGHLARECSSGGGGPGDNKCYGCGQRGHMQRDCTKAS 105          
BLAST of SMED30019038 vs. Ensembl Medaka
Match: cnbpa (CCHC-type zinc finger nucleic acid binding protein [Source:NCBI gene;Acc:101173222])

HSP 1 Score: 49.6766 bits (117), Expect = 2.787e-6
Identity = 22/56 (39.29%), Postives = 31/56 (55.36%), Query Frame = 1
Query: 1399 KQCRTCGKLGHWENECYQNVPCSRCGRKGHNINRCEVRT---CFTCGKQGHVSREC 1557
            ++C +CG  GH +  C   V C RCG  GH    C   +   C+ CGK GH+++EC
Sbjct:   84 QKCYSCGGFGHIQKLC-DKVKCYRCGEIGHVAVHCSKASETNCYNCGKAGHLAKEC 138          

HSP 2 Score: 48.521 bits (114), Expect = 5.777e-6
Identity = 23/60 (38.33%), Postives = 29/60 (48.33%), Query Frame = 1
Query: 1405 CRTCGKLGHWENEC--YQNVPCSRCGRKGHNINRCEVRTCFTCGKQGHVSRECRKGTNQN 1578
            C TCGK GH   +C       C  CG  GH    C+   C+ CG+ GHV+  C K +  N
Sbjct:   65 CYTCGKAGHMARDCDHANEQKCYSCGGFGHIQKLCDKVKCYRCGEIGHVAVHCSKASETN 124          
BLAST of SMED30019038 vs. Planmine SMEST
Match: SMESG000038549.1 (SMESG000038549.1)

HSP 1 Score: 1555.42 bits (4026), Expect = 0.000e+0
Identity = 851/1040 (81.83%), Postives = 886/1040 (85.19%), Query Frame = 1
Query:  790 MVKTRMPLYLCDSALDIFLEAKERGKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTATDGKFDEVHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIHLPSQITLMQDESMIATIAVKPEENGQNDAKMNQFIPGAHQIFKPPVQRTXXXXXXXXXXXAQRTGYNGGNNPTQSKQCRTCGKLGHWENECYQNVPCSRCGRKGHNINRCEVRTCFTCGKQGHVSRECRKGTNQNQGPPRNNNYAQSAPRQPNIEPTKQVNVMQEISHLKDMMAKMMRTNQSQQQQSNIHMMQRVENIRNRETEMTXXXXXXXXXXXXMEEERVRQERFKQRQDNPPRINMMRMIKDVKEQYCYNIYKKLPKDKLAKPELEKLGENQTEPDETFRPTKADKRRTQKILLGKIKERITRENERSNENKITTVTKQTNGNSELRRDXXXXXXXXXXXXXXTPPRGLIKVKTKRNLQINMLRNIRDSSPEDGEISETSSLMGEALQVSENEEQNKQIIRPMNRWFSDRPLSPTEGELKSXXXXXXXXXXXXXXXXFINNEVFRRYMRYILQLEGSKGDVIPEQEKPRVKAYLEKQKLEDITKFQILMQTPDFIERIWIKQPNTDEIFDWIKERMGSSKKNDNLYHNGWLIKPDLHLVDALIWPGKPTYVTLEKEKTVPDFGLWAEYAKNEFEINIKKNLQGATKRIFCKNREKLITQIIDQIHKNWEILMYLKPVEMKICIENLRDTVILRHANYKNTTLIGRYGTRRIGXXXXXXXXXXXXXPNEEQVTLKINNKTRVDELIKIINEMIMEQTKNQSEDGMIEDVAVKYLHHYLHEEDETYLYELGMWDDDQEIRIVPMEEARWGTDTTWKEMDEVEKEGHQKSKRIIEIYRQWRKDHRLKEKLSELDTSECKPTPWRKQWNEQEKRWELKPGAMEEEPSXXXXXXXXXXXXPRNRTKEAENKTMPENECRKINVMKRQRSQPVDIDEAQIKLENEHCEKLIEILQQGARNQRRAASTHAKYMA 3909
            MVKTRMPLYLCDSALDIFLEAKERGKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTATDGK DEVHLIGIFVDGLRRPELRSAVGMQMLTTLD+AVARANQAEIHLPSQ +LMQDESMIAT+A KPEENGQNDAK                                                  CGK GH  N C                                     R   NQNQGPPRNNNY Q APRQPN+E TKQVNVMQEISHLKDMMAKMMRTNQSQQQQSNIHMMQRVEN+RNRETE TQQQHAQDWQQLQ+EEERVRQERFKQ+QDNPPRINMMRMIKDVKEQYCYNIYKKLPKDKL KPELEKLGENQTEPDETFRPTKADKRRTQKILLGKIKERITREN+R NEN+ T VTKQTNGNSEL+RDIIEKSI+NI+KE ETPPRGL+KVKTKRNL+INMLRNIRDSSPEDGEISETSSLMGEALQVSENEEQNKQIIRPMNRW SDRPLSPTEGE+K I+ELIETIPETLINEEFINNE FRRYMR ILQLEG+KGDVIPEQEKPRVKAYL+KQKLEDITKFQILMQTPDFIERIWIKQPNTDEIFDWIKERMGSSKKNDNLYHNGWLIKPDLHLV ALIWPGKPTYVTLEKEKTVPDFGLWAEYAKNEFEINIKKNLQGATKRIFCKNREKLITQIIDQIHKNWEILMYLKPVEMK+CIENLRDT+                                   PN EQ+TLK++NKTRVDELIKIINEMIMEQTK+QSEDGMIEDVAVKYLHHY+HEEDETYLYELG+WDDDQEIRIVPMEEARWGTDTTWK+MDEVEKEGHQKSKRIIEIYRQWRKDHRLKEKL+ELDT+ECKPTPWRKQWNEQEKRWELKPG +EEEP EE+KVN  EEE P+ R KEAENKTMPENECRKINVMKRQRSQPVDIDEAQIKLEN+HCEKLIEILQQGAR+QRRAA+THAKYMA
Sbjct:    1 MVKTRMPLYLCDSALDIFLEAKERGKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTATDGKLDEVHLIGIFVDGLRRPELRSAVGMQMLTTLDKAVARANQAEIHLPSQNSLMQDESMIATMAAKPEENGQNDAK--------------------------------------------------CGKQGHVANNC-------------------------------------RGSNNQNQGPPRNNNYVQLAPRQPNVEQTKQVNVMQEISHLKDMMAKMMRTNQSQQQQSNIHMMQRVENVRNRETETTQQQHAQDWQQLQIEEERVRQERFKQQQDNPPRINMMRMIKDVKEQYCYNIYKKLPKDKLVKPELEKLGENQTEPDETFRPTKADKRRTQKILLGKIKERITRENKRPNENE-TIVTKQTNGNSELKRDIIEKSIDNIKKEFETPPRGLMKVKTKRNLRINMLRNIRDSSPEDGEISETSSLMGEALQVSENEEQNKQIIRPMNRWVSDRPLSPTEGEIKLIDELIETIPETLINEEFINNEAFRRYMRNILQLEGAKGDVIPEQEKPRVKAYLDKQKLEDITKFQILMQTPDFIERIWIKQPNTDEIFDWIKERMGSSKKNDNLYHNGWLIKPDLHLVYALIWPGKPTYVTLEKEKTVPDFGLWAEYAKNEFEINIKKNLQGATKRIFCKNREKLITQIIDQIHKNWEILMYLKPVEMKVCIENLRDTL-----------------------------------PNVEQITLKMSNKTRVDELIKIINEMIMEQTKDQSEDGMIEDVAVKYLHHYMHEEDETYLYELGIWDDDQEIRIVPMEEARWGTDTTWKKMDEVEKEGHQKSKRIIEIYRQWRKDHRLKEKLNELDTNECKPTPWRKQWNEQEKRWELKPGTIEEEPFEEEKVNYREEETPKYRRKEAENKTMPENECRKINVMKRQRSQPVDIDEAQIKLENDHCEKLIEILQQGARSQRRAATTHAKYMA 917          

HSP 2 Score: 584.719 bits (1506), Expect = 0.000e+0
Identity = 281/285 (98.60%), Postives = 283/285 (99.30%), Query Frame = 3
Query: 4782 KIRDQSKQQQWMTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRNRASEDAKQLWYTVSSIIPKLIKEKTKTMKLEWEEMRQIVAQLLTNTDDDEYRLVALSYLVYNLMSLSTSKIPNKQFMNNVELTAMIVQQEIEPGILQNGGWKQFINKPRKNEQVKPMHVRLKALLNDFEHELPIKDITMRNASIHSLMMTMMVWLMCTGATNGFLIFDCDKAVLGDKYSLKDIEECRMAIPNNLTTIEKAITYHIYQESDFIRTKAKECAITRK 5636
            KI++QSKQQQWMTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRNRASEDAKQLWYTVSSIIPKLIKEKTKTMKLEWEEMRQIVAQLLTNTDDDEYRLVALSYLVYNLMSLSTSKIPNKQFMNNVELTAMIVQQEIEPGILQNGGWKQFINKPRKNEQVKPMHVRLKALLNDFEHELPIKDITMRN SIHSLMMTMMVWLMCTGATNGFLIFDCDKAVLGDKY LKDIEECRMAIPNNLTTIEKAITYHIYQESDFIRTKAKECAITRK
Sbjct: 1020 KIKNQSKQQQWMTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRNRASEDAKQLWYTVSSIIPKLIKEKTKTMKLEWEEMRQIVAQLLTNTDDDEYRLVALSYLVYNLMSLSTSKIPNKQFMNNVELTAMIVQQEIEPGILQNGGWKQFINKPRKNEQVKPMHVRLKALLNDFEHELPIKDITMRNTSIHSLMMTMMVWLMCTGATNGFLIFDCDKAVLGDKYWLKDIEECRMAIPNNLTTIEKAITYHIYQESDFIRTKAKECAITRK 1304          

HSP 3 Score: 106.686 bits (265), Expect = 0.000e+0
Identity = 70/76 (92.11%), Postives = 71/76 (93.42%), Query Frame = 1
Query: 4558 KDFTPMKKCITQTHEAKDDIYLTQDWDEWLRKQNEESQKVXXXXXXXXXXXXXXXXSMEKRPRGRPKLTTENGEPK 4785
            KDFT MKKCITQTHEAKDDIY TQD DEWLRKQNEESQK+TLTNTTENETTKETGTSMEKRPRGRPKLT EN EPK
Sbjct:  945 KDFTLMKKCITQTHEAKDDIYSTQDCDEWLRKQNEESQKITLTNTTENETTKETGTSMEKRPRGRPKLTAENEEPK 1020          
BLAST of SMED30019038 vs. Planmine SMEST
Match: SMESG000009033.1 (SMESG000009033.1)

HSP 1 Score: 1374.76 bits (3557), Expect = 0.000e+0
Identity = 816/1106 (73.78%), Postives = 871/1106 (78.75%), Query Frame = 1
Query: 1477 RKGHNINRCEVRTCFTCG--KQGHVSRECRKGTNQNQGPPRNNNYAQSAPRQPNIEPTKQVNVMQEISHLKDMMAKMMRTNQSQQQQSNIHMMQRVENIRNRETEMTXXXXXXXXXXXXMEEERVRQERFKQRQDNPPRINMMRMIKDVKEQYCYNIYKKLPKDKLAKPELEKLGENQTEPDETFRPTKADKRRTQKILLGKIKERITRENERSNENKITTVTKQTNGNSELRRDXXXXXXXXXXXXXXTPPRGLIKVKTKRNLQINMLRNIRDSSPEDGEISETSSLMGEALQVSENEEQNKQIIRPMNRWFSDRPLSPTEGELKSXXXXXXXXXXXXXXXXFINNEVFRRYMRYILQLEGSKGDVIPEQEKPRVKAYLEKQKLEDITKFQILMQTPDFIERIWIKQPNTDEIFDWIKERMGSSKKNDNLYHNGWLIKPDLHLVDALIWPGKPTYVTLEKEKTVPDFGLWAEYAKNEFEINIKKNLQGATKRIFCKNREKLITQIIDQIHKNWEILMYLKPVEMKICIENLRDTVILRHANYKNTTLIGRYGTRRIGXXXXXXXXXXXXXPNEEQVTLKINNKTRVDELIKIINEMIMEQTKNQSEDGMIEDVAVKYLHHYLHEEDETYLYELGMWDDDQEIRIVPMEEARWGTDTTWKEMDEVEKEGHQKSKRIIEIYRQWRKDHRLKEKLSELDTSECKPTPWRKQWNEQEKRWELKPGAMEEEPSXXXXXXXXXXXXPRNRTKEAENKTMPENECRKINVMKRQRSQPVDIDEAQIKLENEHCEKLIEILQQGARNQRRAASTHAKYMAXXXXXXXXXXXXVKEIEEYRRKNEWCPEVFDQVIRKWKHREWSQETQGKYKIKVFYPDQTSEELEVHPTAKIKEVKRKLAYETPFHLQFHGRSLDNMLEVGVTPMVMGEINPLKMAETLGIPLKRKEIKPKPIKASMSTPSMGGDKNETPKMFEGDEVMILNDNTDEEEELSPLVIKTPIKLPEIREEQPSTSYYQINEQQGXXXXXXXXXXXXXXKDFTPMKKCITQTHEAKDDIYLTQDWDEWLRKQNEESQKVXXXXXXXXXXXXXXXXSMEKRPRGRPKLTTENGEPKS 4788
            RK +N+ R E          K  HV     K  NQNQGPPRNNNY Q APRQPN+E TKQVNVMQEISHLKDMM+KMMRT+Q QQQQSNIHMMQRVE+ RNRETEMTQQ+HAQDW QLQMEEERVRQER+K                                  LAKPE E+ GENQTEPDET RPTKADKRRTQKILLGKIKERITRENE++NEN+ T  TKQTNGN +L+RDIIEKSI+N+EKE ETPPRGLIKV+TKRNL+INMLRNIRDS PEDGE SETSSLMGEAL+VSENEEQNKQIIRPMNRW  DRP SPTEGE+K I+ELIETI ETLINEEFINNE                       EKPRVKAYL+KQKLEDITKFQILMQTPDFIERIWIKQPNTDEIFDWIKERMG SKKN+NLYHNGWLIKPD+HLVDA+IWPGKPTYVTLEKE+T+PDFGL AEYAKNEFEINIKKNLQGATKRI CKNREKLITQIIDQIHKNWEILMYLKPVEMK C+ENLR TVILRH NYKN TLIG+YGT RI KE+AKEL+I VKLPNEEQ+TL ++NKTR                                    LHEEDETYLYELG+WDDDQEIRIVPMEEARWGTDTTWKEMDE                     DHRLKEKLSELD +ECKPTPWRKQWN+QEK+WELKPG MEEE  EE+K+NN + E P+ R K  E+KTM ENECRKINVMKRQR QPVDIDEAQIKLEN+HCEKLIEILQQGAR+QRRAA+ H KYMAENG          + +E  R                                 VFYPDQTSEELEVHPTAKIKEVKRKLAYETPFHLQFHGRSLDNMLEVGVTPMVMGEINPLKMAETLGIPLKRKEIKPKPIKASMSTPSMGGDKNETPKMF GDE MILND+T+EEE+LSPLVIKTPIKLPEIREEQPSTSY+QI+EQQGDE EEMIIEMDIEDKDFTPMKKCITQ HEAKDDIYLTQD DEWLRKQNEESQK+TLTNTTENETTKETGTSMEKRPRGRPKLTTENGE K+
Sbjct: 1370 RKANNVERAEKLGTGKANVIKMYHVVDAGEKD-NQNQGPPRNNNYVQLAPRQPNVEQTKQVNVMQEISHLKDMMSKMMRTSQPQQQQSNIHMMQRVESARNRETEMTQQRHAQDWLQLQMEEERVRQERYK----------------------------------LAKPEPEEPGENQTEPDETIRPTKADKRRTQKILLGKIKERITRENEQTNENETTIETKQTNGNPKLKRDIIEKSISNMEKEFETPPRGLIKVETKRNLRINMLRNIRDSLPEDGETSETSSLMGEALRVSENEEQNKQIIRPMNRWIWDRPPSPTEGEIKLIDELIETILETLINEEFINNE-----------------------EKPRVKAYLDKQKLEDITKFQILMQTPDFIERIWIKQPNTDEIFDWIKERMGISKKNNNLYHNGWLIKPDMHLVDAIIWPGKPTYVTLEKERTIPDFGLLAEYAKNEFEINIKKNLQGATKRILCKNREKLITQIIDQIHKNWEILMYLKPVEMKACMENLRYTVILRHGNYKNATLIGKYGTHRIEKERAKELRINVKLPNEEQMTLIMSNKTR------------------------------------LHEEDETYLYELGIWDDDQEIRIVPMEEARWGTDTTWKEMDE---------------------DHRLKEKLSELDINECKPTPWRKQWNDQEKKWELKPGTMEEESLEEEKINNPKGETPKYRMKGVEHKTMSENECRKINVMKRQRFQPVDIDEAQIKLENDHCEKLIEILQQGARSQRRAATAHTKYMAENG----------ENLENLRE-------------------------------LVFYPDQTSEELEVHPTAKIKEVKRKLAYETPFHLQFHGRSLDNMLEVGVTPMVMGEINPLKMAETLGIPLKRKEIKPKPIKASMSTPSMGGDKNETPKMFGGDEEMILNDSTNEEEDLSPLVIKTPIKLPEIREEQPSTSYHQIDEQQGDEKEEMIIEMDIEDKDFTPMKKCITQIHEAKDDIYLTQDCDEWLRKQNEESQKITLTNTTENETTKETGTSMEKRPRGRPKLTTENGESKA 2319          

HSP 2 Score: 504.982 bits (1299), Expect = 4.928e-154
Identity = 280/332 (84.34%), Postives = 288/332 (86.75%), Query Frame = 1
Query:  232 RPRRNKSTKVKETTQTEDIEERTTLRSRNSYLLSGKSAKKSRSTSIARKLHRANSLEDIDSNSQESGDKENQEVTRTTSXXXXXXXXXXXXXXXXXXXXXXXXVPNKSGNQLNNPETTNKNNMAENNGNPNKKDGNEPNKLPTANETIREYREKFRGEPFNGNANKLNTFLRNFGVYVEVCGWTNDMVKTRMPLYLCDSALDIFLEAKERGKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTATDGKFDEVHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIHLPSQITLMQDESMIATIA 1227
            RPRRNKS KVKETTQTE IEERTTLRS NSYLLSGKSAKKSRSTSIARKLHR  SLEDIDSNSQ SGDKENQ +TRTTSQ+NIAN NENNN QGTN+EIQQQT PNKSGNQLNN E+TNKNNMAENNGN NK DG EP KLPT  E +REYREKFRGEPFNGNANKLNTF+RNFGV                       ALDIFLEAKERGKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRK+RHGESNTMFASEIRRLAKTATDGK DE HLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIHLPSQI LMQDESMIAT+A
Sbjct: 1025 RPRRNKSIKVKETTQTEAIEERTTLRSGNSYLLSGKSAKKSRSTSIARKLHRTKSLEDIDSNSQGSGDKENQGLTRTTSQSNIANGNENNNHQGTNVEIQQQTAPNKSGNQLNNLESTNKNNMAENNGNTNKNDGKEPTKLPTTGEIVREYREKFRGEPFNGNANKLNTFIRNFGV-----------------------ALDIFLEAKERGKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKKRHGESNTMFASEIRRLAKTATDGKLDEAHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIHLPSQIALMQDESMIATMA 1333          

HSP 3 Score: 62.003 bits (149), Expect = 4.928e-154
Identity = 39/65 (60.00%), Postives = 48/65 (73.85%), Query Frame = 2
Query: 1292 KYLNHQCKEQITIVRTITITMLKEQATTAEITQRKVNNVERAENLDIGKTSVTKMCHVVDVGEKD 1486
            KY NHQ KEQIT+ +TI I +++++ TT +  QRK NNVERAE L  GK +V KM HVVD GEKD
Sbjct: 1337 KYSNHQFKEQITVDQTIIIIIIRDRITTVDTIQRKANNVERAEKLGTGKANVIKMYHVVDAGEKD 1401          

HSP 4 Score: 184.496 bits (467), Expect = 4.024e-46
Identity = 88/92 (95.65%), Postives = 91/92 (98.91%), Query Frame = 3
Query: 4779 TKIRDQSKQQQWMTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRNRASEDAKQLWYTVSSIIPKLIKEKTKTMKL 5054
            +K ++QSKQQQWMTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRNRASEDAKQLWYTVSSIIPKLIKEKTKTMKL
Sbjct: 2317 SKAKEQSKQQQWMTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRNRASEDAKQLWYTVSSIIPKLIKEKTKTMKL 2408          
BLAST of SMED30019038 vs. Planmine SMEST
Match: SMESG000019863.1 (SMESG000019863.1)

HSP 1 Score: 279.256 bits (713), Expect = 6.964e-75
Identity = 174/449 (38.75%), Postives = 255/449 (56.79%), Query Frame = 1
Query:  607 NNGNPNKKDGNEPNKLPTAN-ETIREYREKFRGEPFNGNANKLNTFLRNFGVYVEVCGWTNDMVKTRMPLYLCDSALDIFLEAKERGKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTAT-DGKFDEVHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIH--LPSQITLMQDESMIATIAVKPEENGQNDAKMNQFIPGAHQIFKPPVQRTXXXXXXXXXXXAQRTGYNGG--NNPTQSKQCRTCGKLGHWENECYQNVPCSRCGRKGHNINRC---EVRTCFTCGKQGHVSRECRKGTNQNQGPPRNNNYAQSAPRQPNIEPTKQVNVMQEISHLKDMMAK-------MMRTNQSQQQQSNIHMMQRVENIRNRETEMTXXXXXXXXXXXXMEEERVRQERFKQRQDNPPRINMMRM 1905
            N+ NPN     +PN +  +N  T+ E+REKFRGEPF+G+ NKL+TF++ F VY  +C WT+  VK R+PLYL  SA D+F+E       L  W EI+EFL   FG+ K  N+ I +  +RKQR  ESN ++A E+++L K A  +    E  ++ +F+ G++R ++R+ +G     TLD+AVA AN+ E H  L  Q+ ++  E   +T+    + N  N A MNQFIP A Q+FKPP  R   ++ N       +  YN    N P  ++QC TC K+GH    C++N  C +CGR+GH    C   + + C+ CG+QGH++R C      NQ  PR NN     P         Q+NVMQEI  L++ + +       MM+   +  QQ  I++M+  E I+   T   Q+Q   +WQ+LQ EE +  Q+R++ R+  P RINMMRM
Sbjct:    3 NSINPNTTSTPDPNLMINSNNTTVGEFREKFRGEPFDGSLNKLDTFIKEFEVYKNICHWTDQKVKERLPLYLKGSAQDVFVEEARDVTKLTTWTEIKEFLVKIFGIEKKGNRKILDFLHRKQRRDESNAVYACELKKLCKEAFGESDLPEDKMVDVFIRGIKREDIRANLGCLAPETLDKAVAIANRCEAHLGLGYQVAVLATEPTTSTVNANNQNNNINPA-MNQFIPRAQQMFKPPTTRNPEDKGN------NQVSYNNNFRNKPNANEQCSTCQKMGHRSENCFRNYNCQKCGRRGHTERICRQLDNKACYNCGQQGHIARRCNIRGEANQ--PRPNNNPMRNP---------QINVMQEIDMLRETLQRMPMELKEMMQQIPTNIQQQRINVMRSNEEIQELRTREYQEQQQAEWQRLQTEEAQEIQQRWENRKQTPKRINMMRM 433          

HSP 2 Score: 200.29 bits (508), Expect = 7.942e-51
Identity = 101/295 (34.24%), Postives = 171/295 (57.97%), Query Frame = 3
Query: 4755 KIDNRKRGTKIRDQSKQQQWMTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRNRASEDAKQLWYTVSSIIPKLIKEKTKTMKLEWEEMRQIVAQLLTNTDDDEYRLVALSYLVYNLMSLSTSKIPNKQFMNNVELTAM-IVQQEIEPGILQNGGWKQFINKPRKNEQVKPMHVRLKALLNDFEHELPIKDITMRNASIHSLMMTMMVWLMCTGATNGFLIFDCDKAVLGDKYSLKDIEECRMAIPNNLTTIEKAITYHIYQESDFIRTKAKECAITRK 5636
            KID      K ++Q +  Q  T+L+KL ++  +++     +E G +WI  L+ DML + +  ++R  + N    +A+QLW  +  II K++  K +T   + EE+  I+  L   ++D    LV ++YL+YN + +   + P +   N  E     I+++ I+P IL  GGW+ FIN   K++Q + M  +LK LL +FE  +   D       +++++ T+++W+MC      F+++DCD   +GDKYSLK+ EEC+ A P  L T   A++Y++YQE DFI+T+ KEC++TRK
Sbjct: 1483 KIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKMKGQEQGAKWINRLITDMLGLANPEIKRNLKLNNIGAEAQQLWAAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSEDPNRYLVGMAYLMYNTIVIYRKRTP-RYNPNGYEFAITDIIRKIIDPFILTQGGWQAFINLVSKDDQTEIMQTKLKELLTEFETTINASDPKTARKGMNAIITTILIWMMCVKGVEPFVVYDCDNIKIGDKYSLKETEECKAANPGKLQTTATAVSYNVYQEVDFIKTEIKECSVTRK 1776          

HSP 3 Score: 168.318 bits (425), Expect = 3.543e-41
Identity = 104/220 (47.27%), Postives = 143/220 (65.00%), Query Frame = 1
Query: 3784 PVDID----EAQIKL-ENEHCEKLIEILQQGARNQRRAASTHAKYMAXXXXXXXXXXXXVKEIEEYRRKNEWCPEVFDQVIRKWKHREWSQETQGKYKIKVFYPDQTSEELEVHPTAKIKEVKRKLAYETPFHLQFHGRSLDNMLEVGVTPMVMGEINPLKMAETLGIPLKRKEIKPK-PI--KASMSTPSMGGDKNETPKMFEGDEVMI--LNDNTDEE 4413
            P DID    E   +L E + C +L  IL+ GA+NQRRA +  AK++ ++   +ENLR +VKEIEEYRR   W PE F ++IRKWKH EW+ ET  KY +K+  PD    E E + TAK++EVK+++ YE PFHLQ+ GRSLD+ LE+GVT M +G IN L +A+  G  L ++EIK K PI  + S++TP +    +E   +  GDE ++  LND  DEE
Sbjct: 1164 PTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRAVTDLAKWIIDSCSRIENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMMPDFEPIEREFYITAKVREVKKRIPYEPPFHLQYEGRSLDDDLEIGVTQMQLGLINILTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEE 1383          

HSP 4 Score: 87.8113 bits (216), Expect = 8.820e-17
Identity = 48/127 (37.80%), Postives = 78/127 (61.42%), Query Frame = 1
Query: 3253 NEMIMEQTKNQSEDGMIEDVAVKYLHHYLHEEDETYLYELGMWDDDQEIRIVPMEEA-RWGTDTTWKEMDEVEKEGHQKSKRIIEIYRQWRKDHRLKEKLSELDTSECKPTPWRKQWNEQEKRWELK 3630
            N  I++     S+DGM+EDVA+ Y    +   + T +YELG+W+D+Q I I+ M+E  R  T    +++ E ++E  +K  +I+ IY+ WR+D +L  KL  +D S C+ TPW+K ++E+ K W  K
Sbjct:  800 NREIIQPIIESSDDGMVEDVAIMYHKRQISPRERTPIYELGIWEDEQTINIIAMDEPERKNTTKREQQIQEFQEEMKRKMDQILTIYQLWRQDAKLSTKLQPIDDSRCRSTPWKKVYDEKTKEWNWK 926          
BLAST of SMED30019038 vs. Planmine SMEST
Match: SMESG000019863.1 (SMESG000019863.1)

HSP 1 Score: 278.87 bits (712), Expect = 8.266e-75
Identity = 174/449 (38.75%), Postives = 255/449 (56.79%), Query Frame = 1
Query:  607 NNGNPNKKDGNEPNKLPTAN-ETIREYREKFRGEPFNGNANKLNTFLRNFGVYVEVCGWTNDMVKTRMPLYLCDSALDIFLEAKERGKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTAT-DGKFDEVHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIH--LPSQITLMQDESMIATIAVKPEENGQNDAKMNQFIPGAHQIFKPPVQRTXXXXXXXXXXXAQRTGYNGG--NNPTQSKQCRTCGKLGHWENECYQNVPCSRCGRKGHNINRC---EVRTCFTCGKQGHVSRECRKGTNQNQGPPRNNNYAQSAPRQPNIEPTKQVNVMQEISHLKDMMAK-------MMRTNQSQQQQSNIHMMQRVENIRNRETEMTXXXXXXXXXXXXMEEERVRQERFKQRQDNPPRINMMRM 1905
            N+ NPN     +PN +  +N  T+ E+REKFRGEPF+G+ NKL+TF++ F VY  +C WT+  VK R+PLYL  SA D+F+E       L  W EI+EFL   FG+ K  N+ I +  +RKQR  ESN ++A E+++L K A  +    E  ++ +F+ G++R ++R+ +G     TLD+AVA AN+ E H  L  Q+ ++  E   +T+    + N  N A MNQFIP A Q+FKPP  R   ++ N       +  YN    N P  ++QC TC K+GH    C++N  C +CGR+GH    C   + + C+ CG+QGH++R C      NQ  PR NN     P         Q+NVMQEI  L++ + +       MM+   +  QQ  I++M+  E I+   T   Q+Q   +WQ+LQ EE +  Q+R++ R+  P RINMMRM
Sbjct:    3 NSINPNTTSTPDPNLMINSNNTTVGEFREKFRGEPFDGSLNKLDTFIKEFEVYKNICHWTDQKVKERLPLYLKGSAQDVFVEEARDVTKLTTWTEIKEFLVKIFGIEKKGNRKILDFLHRKQRRDESNAVYACELKKLCKEAFGESDLPEDKMVDVFIRGIKREDIRANLGCLAPETLDKAVAIANRCEAHLGLGYQVAVLATEPTTSTVNANNQNNNINPA-MNQFIPRAQQMFKPPTTRNPEDKGN------NQVSYNNNFRNKPNANEQCSTCQKMGHRSENCFRNYNCQKCGRRGHTERICRQLDNKACYNCGQQGHIARRCNIRGEANQ--PRPNNNPMRNP---------QINVMQEIDMLRETLQRMPMELKEMMQQIPTNIQQQRINVMRSNEEIQELRTREYQEQQQAEWQRLQTEEAQEIQQRWENRKQTPKRINMMRM 433          

HSP 2 Score: 200.29 bits (508), Expect = 7.843e-51
Identity = 101/295 (34.24%), Postives = 171/295 (57.97%), Query Frame = 3
Query: 4755 KIDNRKRGTKIRDQSKQQQWMTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRNRASEDAKQLWYTVSSIIPKLIKEKTKTMKLEWEEMRQIVAQLLTNTDDDEYRLVALSYLVYNLMSLSTSKIPNKQFMNNVELTAM-IVQQEIEPGILQNGGWKQFINKPRKNEQVKPMHVRLKALLNDFEHELPIKDITMRNASIHSLMMTMMVWLMCTGATNGFLIFDCDKAVLGDKYSLKDIEECRMAIPNNLTTIEKAITYHIYQESDFIRTKAKECAITRK 5636
            KID      K ++Q +  Q  T+L+KL ++  +++     +E G +WI  L+ DML + +  ++R  + N    +A+QLW  +  II K++  K +T   + EE+  I+  L   ++D    LV ++YL+YN + +   + P +   N  E     I+++ I+P IL  GGW+ FIN   K++Q + M  +LK LL +FE  +   D       +++++ T+++W+MC      F+++DCD   +GDKYSLK+ EEC+ A P  L T   A++Y++YQE DFI+T+ KEC++TRK
Sbjct: 1483 KIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKMKGQEQGAKWINRLITDMLGLANPEIKRNLKLNNIGAEAQQLWAAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSEDPNRYLVGMAYLMYNTIVIYRKRTP-RYNPNGYEFAITDIIRKIIDPFILTQGGWQAFINLVSKDDQTEIMQTKLKELLTEFETTINASDPKTARKGMNAIITTILIWMMCVKGVEPFVVYDCDNIKIGDKYSLKETEECKAANPGKLQTTATAVSYNVYQEVDFIKTEIKECSVTRK 1776          

HSP 3 Score: 168.318 bits (425), Expect = 3.683e-41
Identity = 104/220 (47.27%), Postives = 143/220 (65.00%), Query Frame = 1
Query: 3784 PVDID----EAQIKL-ENEHCEKLIEILQQGARNQRRAASTHAKYMAXXXXXXXXXXXXVKEIEEYRRKNEWCPEVFDQVIRKWKHREWSQETQGKYKIKVFYPDQTSEELEVHPTAKIKEVKRKLAYETPFHLQFHGRSLDNMLEVGVTPMVMGEINPLKMAETLGIPLKRKEIKPK-PI--KASMSTPSMGGDKNETPKMFEGDEVMI--LNDNTDEE 4413
            P DID    E   +L E + C +L  IL+ GA+NQRRA +  AK++ ++   +ENLR +VKEIEEYRR   W PE F ++IRKWKH EW+ ET  KY +K+  PD    E E + TAK++EVK+++ YE PFHLQ+ GRSLD+ LE+GVT M +G IN L +A+  G  L ++EIK K PI  + S++TP +    +E   +  GDE ++  LND  DEE
Sbjct: 1164 PTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRAVTDLAKWIIDSCSRIENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMMPDFEPIEREFYITAKVREVKKRIPYEPPFHLQYEGRSLDDDLEIGVTQMQLGLINILTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEE 1383          

HSP 4 Score: 87.8113 bits (216), Expect = 9.659e-17
Identity = 48/127 (37.80%), Postives = 78/127 (61.42%), Query Frame = 1
Query: 3253 NEMIMEQTKNQSEDGMIEDVAVKYLHHYLHEEDETYLYELGMWDDDQEIRIVPMEEA-RWGTDTTWKEMDEVEKEGHQKSKRIIEIYRQWRKDHRLKEKLSELDTSECKPTPWRKQWNEQEKRWELK 3630
            N  I++     S+DGM+EDVA+ Y    +   + T +YELG+W+D+Q I I+ M+E  R  T    +++ E ++E  +K  +I+ IY+ WR+D +L  KL  +D S C+ TPW+K ++E+ K W  K
Sbjct:  800 NREIIQPIIESSDDGMVEDVAIMYHKRQISPRERTPIYELGIWEDEQTINIIAMDEPERKNTTKREQQIQEFQEEMKRKMDQILTIYQLWRQDAKLSTKLQPIDDSRCRSTPWKKVYDEKTKEWNWK 926          
BLAST of SMED30019038 vs. Planmine SMEST
Match: SMESG000019863.1 (SMESG000019863.1)

HSP 1 Score: 278.87 bits (712), Expect = 8.567e-75
Identity = 174/449 (38.75%), Postives = 255/449 (56.79%), Query Frame = 1
Query:  607 NNGNPNKKDGNEPNKLPTAN-ETIREYREKFRGEPFNGNANKLNTFLRNFGVYVEVCGWTNDMVKTRMPLYLCDSALDIFLEAKERGKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTAT-DGKFDEVHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIH--LPSQITLMQDESMIATIAVKPEENGQNDAKMNQFIPGAHQIFKPPVQRTXXXXXXXXXXXAQRTGYNGG--NNPTQSKQCRTCGKLGHWENECYQNVPCSRCGRKGHNINRC---EVRTCFTCGKQGHVSRECRKGTNQNQGPPRNNNYAQSAPRQPNIEPTKQVNVMQEISHLKDMMAK-------MMRTNQSQQQQSNIHMMQRVENIRNRETEMTXXXXXXXXXXXXMEEERVRQERFKQRQDNPPRINMMRM 1905
            N+ NPN     +PN +  +N  T+ E+REKFRGEPF+G+ NKL+TF++ F VY  +C WT+  VK R+PLYL  SA D+F+E       L  W EI+EFL   FG+ K  N+ I +  +RKQR  ESN ++A E+++L K A  +    E  ++ +F+ G++R ++R+ +G     TLD+AVA AN+ E H  L  Q+ ++  E   +T+    + N  N A MNQFIP A Q+FKPP  R   ++ N       +  YN    N P  ++QC TC K+GH    C++N  C +CGR+GH    C   + + C+ CG+QGH++R C      NQ  PR NN     P         Q+NVMQEI  L++ + +       MM+   +  QQ  I++M+  E I+   T   Q+Q   +WQ+LQ EE +  Q+R++ R+  P RINMMRM
Sbjct:    3 NSINPNTTSTPDPNLMINSNNTTVGEFREKFRGEPFDGSLNKLDTFIKEFEVYKNICHWTDQKVKERLPLYLKGSAQDVFVEEARDVTKLTTWTEIKEFLVKIFGIEKKGNRKILDFLHRKQRRDESNAVYACELKKLCKEAFGESDLPEDKMVDVFIRGIKREDIRANLGCLAPETLDKAVAIANRCEAHLGLGYQVAVLATEPTTSTVNANNQNNNINPA-MNQFIPRAQQMFKPPTTRNPEDKGN------NQVSYNNNFRNKPNANEQCSTCQKMGHRSENCFRNYNCQKCGRRGHTERICRQLDNKACYNCGQQGHIARRCNIRGEANQ--PRPNNNPMRNP---------QINVMQEIDMLRETLQRMPMELKEMMQQIPTNIQQQRINVMRSNEEIQELRTREYQEQQQAEWQRLQTEEAQEIQQRWENRKQTPKRINMMRM 433          

HSP 2 Score: 199.904 bits (507), Expect = 8.447e-51
Identity = 101/295 (34.24%), Postives = 171/295 (57.97%), Query Frame = 3
Query: 4755 KIDNRKRGTKIRDQSKQQQWMTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRNRASEDAKQLWYTVSSIIPKLIKEKTKTMKLEWEEMRQIVAQLLTNTDDDEYRLVALSYLVYNLMSLSTSKIPNKQFMNNVELTAM-IVQQEIEPGILQNGGWKQFINKPRKNEQVKPMHVRLKALLNDFEHELPIKDITMRNASIHSLMMTMMVWLMCTGATNGFLIFDCDKAVLGDKYSLKDIEECRMAIPNNLTTIEKAITYHIYQESDFIRTKAKECAITRK 5636
            KID      K ++Q +  Q  T+L+KL ++  +++     +E G +WI  L+ DML + +  ++R  + N    +A+QLW  +  II K++  K +T   + EE+  I+  L   ++D    LV ++YL+YN + +   + P +   N  E     I+++ I+P IL  GGW+ FIN   K++Q + M  +LK LL +FE  +   D       +++++ T+++W+MC      F+++DCD   +GDKYSLK+ EEC+ A P  L T   A++Y++YQE DFI+T+ KEC++TRK
Sbjct: 1483 KIDKPAPERKTKEQMQMDQLTTKLNKLSQVTKQLQGKMKGQEQGAKWINRLITDMLGLANPEIKRNLKLNNIGAEAQQLWAAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSEDPNRYLVGMAYLMYNTIVIYRKRTP-RYNPNGYEFAITDIIRKIIDPFILTQGGWQAFINLVSKDDQTEIMQTKLKELLTEFETTINASDPKTARKGMNAIITTILIWMMCVKGVEPFVVYDCDNIKIGDKYSLKETEECKAANPGKLQTTATAVSYNVYQEVDFIKTEIKECSVTRK 1776          

HSP 3 Score: 168.318 bits (425), Expect = 3.830e-41
Identity = 104/220 (47.27%), Postives = 143/220 (65.00%), Query Frame = 1
Query: 3784 PVDID----EAQIKL-ENEHCEKLIEILQQGARNQRRAASTHAKYMAXXXXXXXXXXXXVKEIEEYRRKNEWCPEVFDQVIRKWKHREWSQETQGKYKIKVFYPDQTSEELEVHPTAKIKEVKRKLAYETPFHLQFHGRSLDNMLEVGVTPMVMGEINPLKMAETLGIPLKRKEIKPK-PI--KASMSTPSMGGDKNETPKMFEGDEVMI--LNDNTDEE 4413
            P DID    E   +L E + C +L  IL+ GA+NQRRA +  AK++ ++   +ENLR +VKEIEEYRR   W PE F ++IRKWKH EW+ ET  KY +K+  PD    E E + TAK++EVK+++ YE PFHLQ+ GRSLD+ LE+GVT M +G IN L +A+  G  L ++EIK K PI  + S++TP +    +E   +  GDE ++  LND  DEE
Sbjct: 1164 PTDIDDPVEEGLTRLTETQKCRELAAILRTGAKNQRRAVTDLAKWIIDSCSRIENLRNIVKEIEEYRRMYAWYPEQFREIIRKWKHTEWAAETIEKYSLKIMMPDFEPIEREFYITAKVREVKKRIPYEPPFHLQYEGRSLDDDLEIGVTQMQLGLINILTVAKIAGRELNKREIKTKTPIGRQESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEE 1383          

HSP 4 Score: 87.8113 bits (216), Expect = 8.829e-17
Identity = 48/127 (37.80%), Postives = 78/127 (61.42%), Query Frame = 1
Query: 3253 NEMIMEQTKNQSEDGMIEDVAVKYLHHYLHEEDETYLYELGMWDDDQEIRIVPMEEA-RWGTDTTWKEMDEVEKEGHQKSKRIIEIYRQWRKDHRLKEKLSELDTSECKPTPWRKQWNEQEKRWELK 3630
            N  I++     S+DGM+EDVA+ Y    +   + T +YELG+W+D+Q I I+ M+E  R  T    +++ E ++E  +K  +I+ IY+ WR+D +L  KL  +D S C+ TPW+K ++E+ K W  K
Sbjct:  800 NREIIQPIIESSDDGMVEDVAIMYHKRQISPRERTPIYELGIWEDEQTINIIAMDEPERKNTTKREQQIQEFQEEMKRKMDQILTIYQLWRQDAKLSTKLQPIDDSRCRSTPWKKVYDEKTKEWNWK 926          
The following BLAST results are available for this feature:
BLAST of SMED30019038 vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of SMED30019038 vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of SMED30019038 vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of SMED30019038 vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 1
Match NameE-valueIdentityDescription
cnbpa3.733e-628.57CCHC-type zinc finger, nucleic acid binding protei... [more]
back to top
BLAST of SMED30019038 vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 1
Match NameE-valueIdentityDescription
chrne6.053e-645.45cholinergic receptor, nicotinic epsilon [Source:Xe... [more]
back to top
BLAST of SMED30019038 vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of SMED30019038 vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of SMED30019038 vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A4Y2GHW62.985e-825.07Uncharacterized protein OS=Araneus ventricosus OX=... [more]
A0A4Y2FX163.434e-823.65Uncharacterized protein OS=Araneus ventricosus OX=... [more]
A0A1Y1IQX06.558e-742.47Uncharacterized protein OS=Klebsormidium nitens OX... [more]
A0A443SKN51.762e-627.22Uncharacterized protein (Fragment) OS=Leptotrombid... [more]
K4HZB41.927e-636.49Putative GIS2 DNA-binding protein (Fragment) OS=Po... [more]
back to top
BLAST of SMED30019038 vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of SMED30019038 vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 2
Match NameE-valueIdentityDescription
ENSPMAT00000003739.11.225e-833.72pep scaffold:Pmarinus_7.0:GL499243:411:1384:-1 gen... [more]
ENSPMAT00000010393.11.246e-736.90pep scaffold:Pmarinus_7.0:GL485791:8073:10868:-1 g... [more]
back to top
BLAST of SMED30019038 vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of SMED30019038 vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of SMED30019038 vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 1
Match NameE-valueIdentityDescription
cnbpa2.787e-639.29CCHC-type zinc finger nucleic acid binding protein... [more]
back to top
BLAST of SMED30019038 vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match NameE-valueIdentityDescription
SMESG000038549.10.000e+081.83SMESG000038549.1[more]
SMESG000009033.14.928e-15473.78SMESG000009033.1[more]
SMESG000019863.16.964e-7538.75SMESG000019863.1[more]
SMESG000019863.18.266e-7538.75SMESG000019863.1[more]
SMESG000019863.18.567e-7538.75SMESG000019863.1[more]
back to top
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30019038 ID=SMED30019038|Name=SMED30019038|organism=Schmidtea mediterranea sexual|type=transcript|length=5638bp
AAGCAATCAAAGCAATCAAAGCATTCAACGTAGACAAATTCACAACCAAT
ACAAACAAATCAACAAAATACCCACAAATACAAATAAAACAAGATATTCA
ATCAAGTATAGGTATCGAAGCGACAAATACTCGACGATAACAAGGATTCG
ACAACAAATCAAAACTCAGTGAAATATTCGCAACCAAAGGAACCGACTAC
GGACAAAAATACAGACGAGACGACAGACAACAGGCCCCGACGCAACAAAT
CCACCAAAGTAAAGGAAACCACCCAAACCGAAGATATCGAGGAAAGGACA
ACGTTACGATCGCGTAATAGCTACCTACTCTCAGGAAAAAGCGCCAAGAA
ATCAAGATCAACCTCAATCGCGAGAAAACTACACAGAGCCAATTCACTCG
AGGATATCGATTCCAACAGCCAAGAATCAGGCGACAAAGAAAACCAAGAA
GTAACAAGGACCACGAGTCAAACAAATATCGCGAACGAGAACGAGAATAA
CAATCAGCAAGGAACAAACATAGAAATCCAGCAACAAACCGTACCGAATA
AGTCGGGAAATCAATTAAATAATCCGGAAACAACAAACAAAAACAATATG
GCCGAAAATAACGGAAACCCAAATAAAAAAGACGGAAACGAACCAAACAA
ACTGCCAACAGCGAACGAAACAATTAGAGAATATCGGGAAAAATTCCGAG
GAGAACCGTTCAACGGGAACGCCAATAAACTGAATACGTTCCTAAGGAAT
TTTGGGGTATATGTAGAAGTTTGTGGATGGACAAACGATATGGTAAAAAC
TAGAATGCCACTATACCTATGTGACAGCGCACTGGACATCTTTCTGGAAG
CAAAAGAAAGAGGAAAACCGCTGAATAATTGGAAAGAGATCCAAGAATTC
TTAAAACTAACATTTGGAGTAGCAAAACTGACAAACCAAGGAATACAAGA
ATTATTTAACAGAAAACAAAGACATGGAGAGTCGAACACGATGTTCGCGA
GCGAAATCAGAAGATTGGCAAAAACGGCAACCGATGGAAAATTTGATGAA
GTCCACTTGATTGGCATATTTGTAGATGGATTAAGACGACCAGAACTGAG
ATCTGCAGTAGGAATGCAGATGTTAACGACATTAGACGAAGCGGTAGCCA
GGGCAAACCAGGCAGAAATACACCTGCCATCACAGATCACTCTTATGCAA
GACGAATCAATGATAGCAACGATTGCGGTAAAACCAGAGGAAAACGGACA
AAATGATGCAAAAATGAACCAATTTATACCAGGAGCTCACCAAATATTTA
AACCACCAGTGCAAAGAACAAATAACAATCGTCCGAACAATAACTATAAC
AATGCTCAAAGAACAGGCTACAACGGCGGAAATAACCCAACGCAAAGTAA
ACAATGTAGAACGTGCGGAAAACTTGGACATTGGGAAAACGAGTGTTACC
AAAATGTGCCATGTAGTAGATGTGGGAGAAAAGGACATAATATCAACCGA
TGTGAAGTCAGAACTTGCTTCACTTGCGGAAAACAGGGTCACGTATCAAG
AGAGTGTAGAAAAGGAACTAATCAAAATCAAGGACCGCCAAGAAACAATA
ACTATGCGCAATCAGCGCCAAGACAACCAAATATCGAGCCAACAAAACAA
GTAAACGTGATGCAGGAAATATCACACCTGAAAGATATGATGGCAAAGAT
GATGCGAACCAACCAATCACAACAACAACAATCAAATATCCATATGATGC
AAAGAGTCGAAAATATAAGAAATAGAGAAACAGAAATGACACAACAACAA
CACGCCCAGGATTGGCAGCAGTTACAAATGGAAGAGGAACGAGTGAGACA
AGAAAGATTCAAACAACGACAAGACAATCCGCCGAGAATAAATATGATGA
GAATGATCAAAGATGTCAAAGAACAATACTGCTACAATATATATAAAAAG
TTACCAAAAGATAAACTCGCGAAACCAGAACTAGAAAAACTAGGAGAAAA
TCAAACCGAACCAGATGAAACATTCAGACCAACTAAAGCAGATAAACGGA
GAACACAAAAAATACTACTCGGAAAAATCAAAGAAAGAATAACTCGCGAG
AACGAACGGTCAAACGAAAATAAGATAACGACAGTAACCAAACAGACAAA
TGGAAATTCCGAATTAAGACGAGATATAATAGAAAAATCGATCAACAATA
TAGAAAAAGAATCCGAAACACCACCAAGAGGATTAATAAAAGTAAAAACA
AAGAGAAATTTACAAATAAATATGCTGCGGAACATCAGAGATTCATCACC
AGAAGATGGAGAAATAAGTGAAACCAGCAGCCTGATGGGAGAAGCACTAC
AAGTATCCGAGAACGAAGAGCAGAATAAACAGATTATACGACCAATGAAT
CGATGGTTCTCAGATCGACCATTATCACCAACAGAAGGAGAACTAAAATC
AATAGAAGAACTAATTGAAACCATACCCGAAACATTAATAAACGAGGAAT
TCATCAACAACGAAGTATTCAGGAGATACATGAGGTACATCTTACAACTC
GAGGGATCGAAAGGAGACGTGATACCCGAACAGGAGAAACCACGGGTAAA
GGCATATCTGGAGAAACAGAAACTAGAAGATATAACGAAGTTTCAAATAC
TCATGCAAACGCCGGACTTCATAGAACGTATCTGGATTAAACAGCCAAAC
ACCGACGAAATATTCGATTGGATAAAAGAAAGAATGGGAAGCTCCAAGAA
AAATGATAACCTATACCATAATGGATGGTTAATAAAACCAGACCTACACC
TAGTAGATGCACTAATATGGCCGGGTAAACCAACATACGTAACTCTAGAA
AAAGAAAAAACAGTACCGGACTTCGGATTATGGGCAGAATACGCGAAAAA
TGAATTCGAAATCAATATCAAAAAGAACCTACAAGGAGCAACCAAGAGAA
TATTTTGTAAAAACAGAGAAAAATTAATCACACAAATAATAGATCAAATC
CACAAGAATTGGGAAATACTAATGTATCTCAAACCCGTAGAAATGAAAAT
ATGTATAGAAAACTTACGCGATACAGTAATTCTCAGACATGCTAATTATA
AAAATACGACATTAATTGGGAGATACGGAACTCGCCGAATAGGAAAAGAA
AAAGCAAAAGAATTGAAGATAAAAGTAAAGTTACCAAACGAGGAACAAGT
AACGTTGAAAATAAATAACAAAACGCGAGTAGACGAATTGATAAAAATCA
TCAACGAAATGATCATGGAACAAACAAAAAATCAAAGTGAGGACGGAATG
ATAGAAGATGTAGCCGTAAAATACCTTCATCATTATCTGCACGAAGAAGA
TGAAACATATCTATATGAGTTGGGAATGTGGGACGACGACCAGGAAATCA
GAATAGTACCGATGGAAGAAGCTCGATGGGGAACAGATACCACGTGGAAA
GAGATGGACGAAGTGGAAAAAGAAGGCCACCAAAAGTCCAAGAGAATTAT
AGAAATCTATCGACAATGGCGGAAGGATCACCGTCTGAAAGAGAAGCTCA
GCGAACTAGATACAAGTGAATGCAAACCAACCCCATGGAGGAAACAATGG
AATGAACAAGAAAAAAGATGGGAACTAAAACCCGGAGCAATGGAAGAAGA
GCCATCGGAGGAGAAAAAGGTTAACAATGGTGAAGAAGAAAACCCGAGAA
ACAGAACGAAAGAAGCCGAAAATAAAACGATGCCCGAAAACGAATGTAGG
AAAATAAACGTAATGAAGAGACAAAGATCCCAACCAGTAGATATCGACGA
AGCACAAATAAAATTAGAAAACGAGCACTGCGAAAAATTAATAGAAATAT
TACAACAAGGTGCAAGGAACCAGAGAAGAGCAGCATCAACGCACGCGAAA
TACATGGCGGAAAATGGTGAAAATCTCGAGAATTTGAGAGAGTTGGTAAA
GGAGATCGAGGAATATAGAAGAAAAAATGAATGGTGTCCAGAAGTATTCG
ACCAAGTAATACGCAAATGGAAACACCGAGAGTGGTCACAAGAAACCCAA
GGAAAATATAAAATAAAAGTATTTTACCCCGACCAAACATCCGAAGAATT
AGAAGTGCATCCGACGGCAAAAATAAAAGAGGTAAAAAGAAAACTAGCAT
ATGAAACTCCATTCCATCTACAATTCCACGGACGATCACTAGATAATATG
TTGGAAGTGGGAGTAACACCAATGGTGATGGGAGAAATAAACCCCTTGAA
AATGGCGGAAACATTGGGAATACCGCTGAAAAGAAAAGAGATAAAACCAA
AACCTATCAAAGCAAGTATGAGCACGCCCAGTATGGGAGGAGATAAAAAT
GAAACCCCAAAAATGTTCGAAGGAGATGAAGTAATGATTTTGAATGACAA
TACAGACGAGGAAGAAGAGCTGTCACCACTAGTAATAAAAACACCCATAA
AACTACCAGAAATCCGAGAGGAGCAACCGTCAACCAGCTATTATCAAATC
AATGAGCAACAAGGAGATGAAATAGAAGAGATGATAATTGAAATGGATAT
AGAAGACAAAGATTTCACGCCAATGAAAAAATGTATAACACAAACTCACG
AAGCAAAGGACGATATATACCTAACACAAGACTGGGACGAATGGCTCAGA
AAACAAAACGAGGAAAGCCAAAAGGTAACTCTAACGAACACCACGGAGAA
CGAAACAACAAAAGAAACAGGTACGTCCATGGAGAAACGTCCAAGAGGTA
GACCAAAATTGACAACCGAAAACGGGGAACCAAAATCAGAGACCAATCCA
AACAACAACAATGGATGACAAGGTTGGATAAACTAGAGAAAATAGCATAC
GAGATGAGGAAAACTCACCCCAGAGAGGAAAGCGGAACGGAATGGATAAG
AGAATTACTAGCGGATATGCTAATAATCACCAGCCAACCACTAAGGAGAC
TATGGCAAAGAAATCGAGCAAGCGAGGACGCCAAACAATTATGGTATACC
GTATCCAGCATAATCCCAAAACTGATCAAAGAGAAAACTAAAACGATGAA
ATTAGAATGGGAAGAGATGAGACAAATAGTCGCCCAACTGCTCACAAATA
CGGATGATGACGAATACCGACTAGTGGCACTCTCCTATCTCGTATATAAT
CTAATGAGTCTATCGACAAGCAAAATTCCGAACAAACAATTCATGAATAA
CGTAGAGCTAACCGCAATGATCGTACAGCAAGAAATCGAACCAGGAATCT
TACAGAATGGTGGATGGAAGCAATTTATAAACAAACCGAGGAAAAACGAA
CAGGTCAAACCAATGCACGTAAGACTAAAGGCATTATTAAACGACTTCGA
ACACGAACTGCCAATAAAAGATATAACAATGAGAAATGCAAGCATACACT
CCCTAATGATGACAATGATGGTGTGGCTAATGTGCACAGGTGCAACAAAC
GGATTCTTAATATTTGACTGCGATAAAGCAGTACTGGGAGACAAGTATTC
GCTGAAAGATATCGAGGAATGTAGGATGGCAATACCAAATAACCTGACCA
CTATAGAAAAAGCGATAACGTATCACATCTACCAAGAATCTGATTTCATC
AGAACAAAAGCCAAGGAATGCGCAATCACACGGAAAAC
back to top

protein sequence of SMED30019038-orf-1

>SMED30019038-orf-1 ID=SMED30019038-orf-1|Name=SMED30019038-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=274bp
MTRLDKLEKIAYEMRKTHPREESGTEWIRELLADMLIITSQPLRRLWQRN
RASEDAKQLWYTVSSIIPKLIKEKTKTMKLEWEEMRQIVAQLLTNTDDDE
YRLVALSYLVYNLMSLSTSKIPNKQFMNNVELTAMIVQQEIEPGILQNGG
WKQFINKPRKNEQVKPMHVRLKALLNDFEHELPIKDITMRNASIHSLMMT
MMVWLMCTGATNGFLIFDCDKAVLGDKYSLKDIEECRMAIPNNLTTIEKA
ITYHIYQESDFIRTKAKECAITRK
back to top

protein sequence of SMED30019038-orf-2

>SMED30019038-orf-2 ID=SMED30019038-orf-2|Name=SMED30019038-orf-2|organism=Schmidtea mediterranea sexual|type=polypeptide|length=1407bp
MAENNGNPNKKDGNEPNKLPTANETIREYREKFRGEPFNGNANKLNTFLR
NFGVYVEVCGWTNDMVKTRMPLYLCDSALDIFLEAKERGKPLNNWKEIQE
FLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRRLAKTATDGKFD
EVHLIGIFVDGLRRPELRSAVGMQMLTTLDEAVARANQAEIHLPSQITLM
QDESMIATIAVKPEENGQNDAKMNQFIPGAHQIFKPPVQRTNNNRPNNNY
NNAQRTGYNGGNNPTQSKQCRTCGKLGHWENECYQNVPCSRCGRKGHNIN
RCEVRTCFTCGKQGHVSRECRKGTNQNQGPPRNNNYAQSAPRQPNIEPTK
QVNVMQEISHLKDMMAKMMRTNQSQQQQSNIHMMQRVENIRNRETEMTQQ
QHAQDWQQLQMEEERVRQERFKQRQDNPPRINMMRMIKDVKEQYCYNIYK
KLPKDKLAKPELEKLGENQTEPDETFRPTKADKRRTQKILLGKIKERITR
ENERSNENKITTVTKQTNGNSELRRDIIEKSINNIEKESETPPRGLIKVK
TKRNLQINMLRNIRDSSPEDGEISETSSLMGEALQVSENEEQNKQIIRPM
NRWFSDRPLSPTEGELKSIEELIETIPETLINEEFINNEVFRRYMRYILQ
LEGSKGDVIPEQEKPRVKAYLEKQKLEDITKFQILMQTPDFIERIWIKQP
NTDEIFDWIKERMGSSKKNDNLYHNGWLIKPDLHLVDALIWPGKPTYVTL
EKEKTVPDFGLWAEYAKNEFEINIKKNLQGATKRIFCKNREKLITQIIDQ
IHKNWEILMYLKPVEMKICIENLRDTVILRHANYKNTTLIGRYGTRRIGK
EKAKELKIKVKLPNEEQVTLKINNKTRVDELIKIINEMIMEQTKNQSEDG
MIEDVAVKYLHHYLHEEDETYLYELGMWDDDQEIRIVPMEEARWGTDTTW
KEMDEVEKEGHQKSKRIIEIYRQWRKDHRLKEKLSELDTSECKPTPWRKQ
WNEQEKRWELKPGAMEEEPSEEKKVNNGEEENPRNRTKEAENKTMPENEC
RKINVMKRQRSQPVDIDEAQIKLENEHCEKLIEILQQGARNQRRAASTHA
KYMAENGENLENLRELVKEIEEYRRKNEWCPEVFDQVIRKWKHREWSQET
QGKYKIKVFYPDQTSEELEVHPTAKIKEVKRKLAYETPFHLQFHGRSLDN
MLEVGVTPMVMGEINPLKMAETLGIPLKRKEIKPKPIKASMSTPSMGGDK
NETPKMFEGDEVMILNDNTDEEEELSPLVIKTPIKLPEIREEQPSTSYYQ
INEQQGDEIEEMIIEMDIEDKDFTPMKKCITQTHEAKDDIYLTQDWDEWL
RKQNEESQKVTLTNTTENETTKETGTSMEKRPRGRPKLTTENGEPKSETN
PNNNNG*
back to top
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
TermDefinition
PLANA:0000020protonephridia
PLANA:0000099neuron
PLANA:0000101muscle cell
PLANA:0002032epidermal cell
PLANA:0003116parenchymal cell
Vocabulary: INTERPRO
TermDefinition
IPR001878Znf_CCHC
IPR005162Retrotrans_gag_dom
Vocabulary: molecular function
TermDefinition
GO:0003676nucleic acid binding
GO:0008270zinc ion binding