CGG triplet repeat-binding protein 1

Overview
Name: CGG triplet repeat-binding protein 1
Smed ID: SMED30003596
Length (bp): 5950
Neoblast Clusters

Zeng et al., 2018





Overview

 

Single-cell RNA-seq of pluripotent neoblasts and their early progeny


We isolated X1 neoblast cells enriched for high piwi-1 expression (Neoblast Population) and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them from high (Nb1) to low (Nb12) piwi-1 expression. We further defined the groups of genes that best distinguished the 12 cell clusters and used them to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster's gene signature was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1, which largely confirmed the cell clusters revealed by scRNA-seq.
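The naming convention described above (Nb1 for the cluster with the highest piwi-1 expression, down to Nb12 for the lowest) can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the function name `rank_clusters_by_marker`, the raw cluster labels, and the toy expression values are hypothetical, and the per-cluster mean is an assumed summary statistic, since the text does not say which one was used.

```python
from collections import defaultdict
from statistics import mean


def rank_clusters_by_marker(cluster_of_cell, marker_expr, prefix="Nb"):
    """Rename raw clusters prefix1..prefixN by descending mean marker expression.

    cluster_of_cell: dict cell_id -> raw cluster label (hypothetical input)
    marker_expr:     dict cell_id -> expression of the marker gene (e.g. piwi-1)
    """
    values = defaultdict(list)
    for cell, cluster in cluster_of_cell.items():
        values[cluster].append(marker_expr[cell])
    # Highest mean first; ties are broken by the raw label for determinism.
    ordered = sorted(values, key=lambda c: (-mean(values[c]), str(c)))
    return {raw: f"{prefix}{rank}" for rank, raw in enumerate(ordered, start=1)}


# Toy data: three raw clusters with two cells each (all values are made up).
clusters = {"c1": "A", "c2": "A", "c3": "B", "c4": "B", "c5": "C", "c6": "C"}
piwi1 = {"c1": 2.0, "c2": 3.0, "c3": 8.0, "c4": 9.0, "c5": 0.5, "c6": 0.1}
names = rank_clusters_by_marker(clusters, piwi1)
print(names)
```

In the toy data, raw cluster B has the highest mean and is therefore renamed to the first rank.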

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells. We then isolated 1,200 individual cells derived from the X1 (piwi-1 high) and X2 (piwi-1 low) cell populations and profiled them by single-cell RNA-seq (Sub-lethal Irradiated Surviving X1 and X2 Cell Population).




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

The t-SNE plot shows a two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filtering). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Nb, neoblast groups.


Expression of CGG triplet repeat-binding protein 1 (SMED30003596) in the t-SNE clustered cells.

Violin plots show distribution of expression levels for CGG triplet repeat-binding protein 1 (SMED30003596) in cells (dots) of each of the 12 neoblast clusters.
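A violin plot summarises, for each cluster, the distribution of per-cell expression values. The numbers behind such a plot can be sketched as a per-cluster summary (cell count, quartiles, median). The function name `summarize_by_cluster` and the toy values below are hypothetical; the real expression values are only available through the interactive plots.

```python
from collections import defaultdict
from statistics import median, quantiles


def summarize_by_cluster(expr_by_cell, cluster_of_cell):
    """Return {cluster: {"n", "q1", "median", "q3"}} for one gene."""
    grouped = defaultdict(list)
    for cell, value in expr_by_cell.items():
        grouped[cluster_of_cell[cell]].append(value)
    summary = {}
    for cluster, values in grouped.items():
        if len(values) >= 2:
            # quantiles() requires at least two data points.
            q1, _, q3 = quantiles(values, n=4, method="inclusive")
        else:
            q1 = q3 = values[0]
        summary[cluster] = {
            "n": len(values),
            "q1": q1,
            "median": median(values),
            "q3": q3,
        }
    return summary


# Toy data (made up): five cells in Nb1, one cell in Nb2.
expr = {"a": 0.0, "b": 1.0, "c": 2.0, "d": 3.0, "e": 4.0, "f": 5.0}
cluster = {"a": "Nb1", "b": "Nb1", "c": "Nb1", "d": "Nb1", "e": "Nb1", "f": "Nb2"}
stats = summarize_by_cluster(expr, cluster)
print(stats)
```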

 



 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of CGG triplet repeat-binding protein 1 (SMED30003596) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for CGG triplet repeat-binding protein 1 (SMED30003596) in cells (dots) of each of the 10 clusters of sub-lethally irradiated X1 and X2 cells.

 



 

Embryonic Expression

Davies et al., 2017




The graph shows average RPKM values per sample.

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single-animal RNA-Seq experiment, please refer to the Materials and Methods.
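For readers unfamiliar with the unit, RPKM (reads per kilobase of transcript per million mapped reads) is conventionally computed as reads × 10^9 / (transcript length in bp × total mapped reads). The sketch below uses that standard definition; the page does not state how its values were calculated, so this is an assumption. The 5950 bp length comes from this record, while the read counts are hypothetical.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Standard RPKM: reads per kilobase of transcript per million mapped reads."""
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


def average_rpkm(replicate_values):
    """Mean RPKM across the replicates of one sample."""
    if not replicate_values:
        raise ValueError("no replicate values given")
    return sum(replicate_values) / len(replicate_values)


# Transcript length from this record; the read counts below are made up.
value = rpkm(read_count=595, transcript_length_bp=5950, total_mapped_reads=20_000_000)
print(value)
print(average_rpkm([value, 7.0]))
```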

 

Anatomical Expression

PAGE et al., 2020




SMED30003596 has been reported as being expressed in these anatomical structures and/or regions.



PAGE Curations: 4

  
Expressed In | Reference Transcript | Gene Models | Published Transcript | Transcriptome | Publication | Specimen | Lifecycle | Evidence
X1 cell | SMED30003596 |  | SmedASXL_078415 | SmedAsxl_ww_GCZZ01 | PMID:26114597 (Zhu et al., 2015) | FACS sorted cell population | asexual adult | RNA-sequencing evidence
X2 cell | SMED30003596 |  | SmedASXL_078415 | SmedAsxl_ww_GCZZ01 | PMID:26114597 (Zhu et al., 2015) | FACS sorted cell population | asexual adult | RNA-sequencing evidence
X2 cell | SMED30003596 |  | SmedASXL_078414 | SmedAsxl_ww_GCZZ01 | PMID:26114597 (Zhu et al., 2015) | FACS sorted cell population | asexual adult | RNA-sequencing evidence
X2 cell | SMED30003596 |  | SmedASXL_076841 | SmedAsxl_ww_GCZZ01 | PMID:26114597 (Zhu et al., 2015) | FACS sorted cell population | asexual adult | RNA-sequencing evidence
Homology
BLAST of CGG triplet repeat-binding protein 1 vs. TrEMBL
Match: A0A0J7NEC4 (DUF659 domain-containing protein OS=Lasius niger OX=67767 GN=RF55_9308 PE=4 SV=1)

HSP 1 Score: 163.696 bits (413), Expect = 4.591e-39
Identity = 84/118 (71.19%), Positives = 96/118 (81.36%), Query Frame = 1
Query: 1276 LYPNVDRLIANVKKVFLKAPSRLQVFKDLEPGFTLPTEQITTRWGTSLISVNYYANNFEKIVRIFDALDNKEAGSIKISQDLLCDSTIKADLIFIASNYGFLGASI----TTGNILTI 1617
            LY NVDRLI+NVKKVFLKAP R+ +FKDLEP   LP + I TRWGT L +VNYYA NFEKIVRIFDALD++EA SIKIS+ LL D+TIK+DLIFIASNYGFL ASI    T+G  LT+
Sbjct:  290 LYTNVDRLISNVKKVFLKAPLRVGIFKDLEPDLALPPQPIITRWGTWLNAVNYYATNFEKIVRIFDALDDEEAASIKISKSLLRDNTIKSDLIFIASNYGFLEASIKKLETSGLPLTV 407          
BLAST of CGG triplet repeat-binding protein 1 vs. TrEMBL
Match: A0A5E4MX87 (DUF659 domain-containing protein (Fragment) OS=Cinara cedri OX=506608 GN=CINCED_3A024417 PE=4 SV=1)

HSP 1 Score: 96.2857 bits (238), Expect = 1.779e-29
Identity = 51/122 (41.80%), Positives = 72/122 (59.02%), Query Frame = 1
Query: 1279 YPNVDRLIANVKKVFLKAPSRLQVFKDLEPGFTLPTEQITTRWGTSLISVNYYANNFEKIVRIFDALDNKEAGSIKISQDLLCDSTIKADLIFIASNYGFLGASITTGNILTITFCIDLNRN 1644
            +P VD+L+ANVK+VF KAP R+Q F    P  +LP E I TRWGT + +V YY+ NF+ +  I ++ D  +A SIK +Q     + +K +L FI SN+  L  +IT      I     LN+N
Sbjct:   90 FPKVDKLVANVKRVFKKAPYRVQKFHTDAPNISLPPEPILTRWGTWISAVLYYSENFQTVKNIIESFDENDALSIKNAQKYFKITQMKGNLTFIHSNFACLPIAITRLQKQGIPLSEVLNKN 211          

HSP 2 Score: 66.2402 bits (160), Expect = 1.779e-29
Identity = 37/74 (50.00%), Positives = 49/74 (66.22%), Query Frame = 3
Query: 1719 KNCGFTLMRNISAILAGV-----SVTLTEKYTCSEILAFKFA*ITSFDVEQSFSMIKSVLRPNRQSFLFENLSE 1925
            KN GF ++ NIS IL G       + + E  T S++  FKFA ITS DVE+SFS+ K++L PNR+SF FENL +
Sbjct:  210 KNKGFQIVCNISKILTGEEENVGDLDIPEDLTSSDMAYFKFAPITSADVERSFSLYKNILAPNRRSFKFENLKK 283          
BLAST of CGG triplet repeat-binding protein 1 vs. TrEMBL
Match: J9L0Q9 (DUF659 domain-containing protein OS=Acyrthosiphon pisum OX=7029 PE=4 SV=2)

HSP 1 Score: 96.2857 bits (238), Expect = 5.815e-29
Identity = 53/149 (35.57%), Positives = 83/149 (55.70%), Query Frame = 1
Query: 1207 RILFCKVCNKSVLEEIIFTIKQHL---YPNVDRLIANVKKVFLKAPSRLQVFKDLEPGFTLPTEQITTRWGTSLISVNYYANNFEKIVRIFDALDNKEAGSIKISQDLLCDSTIKADLIFIASNYGFLGASITTGNILTITFCIDLNRN 1644
            ++L+ K+ + +     +  + + +   +P VD+L+ANVK+VF KAP R+Q F    P  +LP E I TRWGT + +V YY+ NF+ +  I ++ D  +A SIK +Q       +K +L FI SN+  L  +IT      I     LN+N
Sbjct:   50 KVLYPKMVHVTCTSHGLHRVAEQIRIQFPKVDKLVANVKRVFKKAPYRVQKFHTDAPNISLPPEPILTRWGTWISAVLYYSENFQTVKNIIESFDENDAISIKNAQKYFKIPQMKGNLTFIHSNFACLPIAITRLQKQGIPLSEVLNKN 198          

HSP 2 Score: 64.3142 bits (155), Expect = 5.815e-29
Identity = 36/74 (48.65%), Positives = 48/74 (64.86%), Query Frame = 3
Query: 1719 KNCGFTLMRNISAILAGV-----SVTLTEKYTCSEILAFKFA*ITSFDVEQSFSMIKSVLRPNRQSFLFENLSE 1925
            KN GF ++ NIS IL G       + + E  T S++  FKFA ITS DVE+S S+ K++L PNR+SF FENL +
Sbjct:  197 KNKGFQIVCNISKILTGEEENVGDLDIPEDLTSSDMAYFKFAPITSADVERSISLYKNILTPNRRSFKFENLKK 270          
BLAST of CGG triplet repeat-binding protein 1 vs. TrEMBL
Match: A0A2S2N9G9 (DUF659 domain-containing protein OS=Schizaphis graminum OX=13262 GN=g.53683 PE=4 SV=1)

HSP 1 Score: 97.0561 bits (240), Expect = 3.563e-28
Identity = 46/105 (43.81%), Positives = 69/105 (65.71%), Query Frame = 1
Query: 1279 YPNVDRLIANVKKVFLKAPSRLQVFKDLEPGFTLPTEQITTRWGTSLISVNYYANNFEKIVRIFDALDNKEAGSIKISQDLLCDSTIKADLIFIASNYGFLGASI 1593
            Y  VD+LIA +KK+FLKAPSR+  FK++ P   LP E I TRWGT L +V YY ++F+KI  +    D + A +I+ +  L+ D  +K +L +I++N+ FL  +I
Sbjct:   60 YSEVDQLIATIKKIFLKAPSRVSKFKEMYPDLNLPPEPIITRWGTWLEAVQYYCDHFDKIKNVISNFDPESAAAIEKANSLMQDINLKNNLTYISANFCFLIQTI 164          

HSP 2 Score: 60.8474 bits (146), Expect = 3.563e-28
Identity = 37/87 (42.53%), Positives = 55/87 (63.22%), Query Frame = 3
Query: 1677 NTADIIKNTCYCN--RKNCGFTLMRNISAILAGVSV--TLTEKYTCSEILAFKFA*ITSFDVEQSFSMIKSVLRPNRQSFLFENLSE 1925
            +  +I+KN  + N   KN GF  ++ I  IL G +   +L  ++T S+I+   +A ITS DVE+SFS  KS+LRPNR++F F NL +
Sbjct:  194 HVGEIVKNK-FANIIEKNSGFQTIKIIRDILIGKNQQGSLDIEFTPSDIVNMNYAPITSVDVERSFSQYKSILRPNRRNFSFSNLQQ 279          
BLAST of CGG triplet repeat-binding protein 1 vs. TrEMBL
Match: J9L9S4 (DUF659 domain-containing protein OS=Acyrthosiphon pisum OX=7029 PE=4 SV=2)

HSP 1 Score: 98.2117 bits (243), Expect = 4.324e-28
Identity = 45/106 (42.45%), Positives = 69/106 (65.09%), Query Frame = 1
Query: 1279 YPNVDRLIANVKKVFLKAPSRLQVFKDLEPGFTLPTEQITTRWGTSLISVNYYANNFEKIVRIFDALDNKEAGSIKISQDLLCDSTIKADLIFIASNYGFLGASIT 1596
            + NVD L+ANVK+VF K P R+Q F+D  PG  LP   + TRWGT L + +YY  +FE I R+ ++ D ++A ++K S+ +     ++A+LI+I SN+  L  +IT
Sbjct:  115 FGNVDNLVANVKQVFRKCPYRIQTFRDEAPGLPLPPSPVITRWGTWLTAASYYCTHFETIKRVVESFDERDAVAVKKSKQVFKLQQLQANLIYIKSNFDCLPIAIT 220          

HSP 2 Score: 59.6918 bits (143), Expect = 4.324e-28
Identity = 34/85 (40.00%), Positives = 53/85 (62.35%), Query Frame = 3
Query: 1686 DIIKNTCYCNRKNCGFTLMRNISAILAG-----VSVTLTEKYTCSEILAFKFA*ITSFDVEQSFSMIKSVLRPNRQSFLFENLSE 1925
            +I K   Y   KN G+ ++  IS +L+G      ++ L E  T +++L FK+A +TS DVE+SFSM K++L  NR+SF  EN+ +
Sbjct:  253 EIKKKLKYVLDKNIGYNIICKISKVLSGKEDNITNLDLPEDMTANDLLYFKYATLTSADVERSFSMFKNLLVDNRRSFKLENIKK 337          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Cavefish
Match: ENSAMXT00000053716.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02001785.1:10187:12838:-1 gene:ENSAMXG00000030794.1 transcript:ENSAMXT00000053716.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 64.3142 bits (155), Expect = 1.599e-10
Identity = 38/112 (33.93%), Positives = 57/112 (50.89%), Query Frame = -2
Query:  307 FSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIKTKNNRCKLLSKVEFSLNAT 642
            FSK       S Q  + +   +   W Y +G  + I +D+GR FEG   R   E +GI++  ++PY  + NG  ER  RT+ D+L T   E K K    +LL ++ F+ N T
Sbjct:   12 FSKFTQAYPVSDQKASTVARVLTEKWFYVYGVPQRIHSDQGRNFEGELLRRLCELYGIEKSRTTPYHPEGNGQCERFNRTLHDLLRTLPPEKKRKWP--QLLPQLLFAYNTT 121          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Cavefish
Match: ENSAMXT00000045469.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02000976.1:426503:430633:1 gene:ENSAMXG00000030479.1 transcript:ENSAMXT00000045469.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 65.4698 bits (158), Expect = 6.134e-10
Identity = 37/112 (33.04%), Positives = 59/112 (52.68%), Query Frame = -2
Query:  307 FSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIKTKNNRCKLLSKVEFSLNAT 642
            FSK     A + Q  + ++  ++  W Y +G  + I +D+GR FEG   +   E +GI++  ++PY  + NG  ER  RT+ D+L T   E K K    +LL  + F+ N T
Sbjct: 1009 FSKFTQAYAVADQKASTVVRVLVEKWFYVYGVPKRIHSDQGRSFEGELLKHLCETYGIEKSRTTPYHPEGNGQCERFNRTLHDLLRTLPPEKKRK--WPQLLPHLLFAYNTT 1118          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Cavefish
Match: ENSAMXT00000043043.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02002310.1:53248:54373:-1 gene:ENSAMXG00000041464.1 transcript:ENSAMXT00000043043.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 62.003 bits (149), Expect = 1.500e-9
Identity = 36/112 (32.14%), Positives = 55/112 (49.11%), Query Frame = -2
Query:  307 FSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIKTKNNRCKLLSKVEFSLNAT 642
            FSK         Q  + ++  ++  W Y +G  + I +D+GR FEG   +   + +GI++  SSPY  + NG  ER  RT+ D+L T       K    + LS V F+ N T
Sbjct:   65 FSKFAQAYPTRDQKASTVVQTLVEKWFYTYGVPKRIHSDQGRSFEGELLKRLCQMYGIEKSRSSPYHPEGNGQCERFNRTLHDLLRT--LPPGEKRRWPQHLSTVVFAYNTT 174          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Cavefish
Match: ENSAMXT00000033536.1 (pep primary_assembly:Astyanax_mexicanus-2.0:3:66835186:66837354:-1 gene:ENSAMXG00000033197.1 transcript:ENSAMXT00000033536.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 58.9214 bits (141), Expect = 5.975e-8
Identity = 35/112 (31.25%), Positives = 57/112 (50.89%), Query Frame = -2
Query:  307 FSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIKTKNNRCKLLSKVEFSLNAT 642
            FSK         Q  + + + +++ W Y FG    + +D+GR FE +        +G+++  ++PY+ Q NG  ER  RT+ D+L T  TE K    R   L++V F+ N T
Sbjct:  524 FSKYTQAVPTRDQRASTVASVLVHEWFYRFGVPARLHSDQGRNFESAVIAQLCLLYGVQKTHTTPYRPQGNGQCERFNRTLHDLLRTLPTEQKRVWTR--HLAQVVFAYNTT 633          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Cavefish
Match: ENSAMXT00000030217.1 (pep primary_assembly:Astyanax_mexicanus-2.0:16:12351169:12363564:1 gene:ENSAMXG00000035335.1 transcript:ENSAMXT00000030217.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 56.9954 bits (136), Expect = 2.417e-7
Identity = 34/112 (30.36%), Positives = 55/112 (49.11%), Query Frame = -2
Query:  307 FSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIKTKNNRCKLLSKVEFSLNAT 642
            FSK         Q  + ++  +   W + +G  + I +D+GR FEG   R   + +G+ +  SSPY  + NG  ER  RT+ D+L T   E K + +    L ++ F+ N T
Sbjct:  982 FSKFTQAFPTYDQKASTVVRILTEKWFHVYGVPQRIHSDQGRCFEGEMLRALCKLYGVVKSRSSPYHPEGNGQCERFNRTMHDLLRTLPAERKRRWSH--HLPQLLFAYNTT 1091          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Medaka
Match: ENSORLT00000040891.1 (pep primary_assembly:ASM223467v1:7:6445512:6449001:-1 gene:ENSORLG00000027734.1 transcript:ENSORLT00000040891.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 59.6918 bits (143), Expect = 1.278e-8
Identity = 35/93 (37.63%), Positives = 48/93 (51.61%), Query Frame = -2
Query:  364 FSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIK 642
            FSK         Q  A     + NNWI  FG  E ILTDRG  FE S F++    +G K+  ++ Y  Q NG  ER+ +T+  +L  SL+E +
Sbjct:   64 FSKYAQAVPTKDQSAATTARTVYNNWIQKFGCPERILTDRGAAFESSIFKELCRIYGCKKSRTTAYWPQGNGGCERVNQTLLGLL-NSLSETE 155          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Medaka
Match: ENSORLT00000033794.1 (pep primary_assembly:ASM223467v1:24:18649635:18652646:1 gene:ENSORLG00000027956.1 transcript:ENSORLT00000033794.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 59.3066 bits (142), Expect = 1.765e-8
Identity = 35/93 (37.63%), Positives = 48/93 (51.61%), Query Frame = -2
Query:  364 FSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIK 642
            FSK         Q  A     + NNWI  FG  E ILTDRG  FE S F++    +G K+  ++ Y  Q NG  ER+ +T+  +L  SL+E +
Sbjct:   64 FSKYAQAVPTKDQSAATTARTVYNNWIQKFGCPERILTDRGAAFESSIFKELCRIYGCKKSRTTAYWPQGNGGCERVNQTLLGLL-NSLSETE 155          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Medaka
Match: ENSORLT00000041487.1 (pep primary_assembly:ASM223467v1:1:12098754:12103049:1 gene:ENSORLG00000023677.1 transcript:ENSORLT00000041487.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 59.6918 bits (143), Expect = 1.794e-8
Identity = 35/93 (37.63%), Positives = 48/93 (51.61%), Query Frame = -2
Query:  364 FSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIK 642
            FSK         Q  A     + NNWI  FG  E ILTDRG  FE S F++    +G K+  ++ Y  Q NG  ER+ +T+  +L  SL+E +
Sbjct:   64 FSKYAQAVPTKDQSAATTARTVYNNWIQKFGCPERILTDRGAAFESSIFKELCRIYGCKKSRTTAYWPQGNGGCERVNQTLLGLL-NSLSETE 155          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Medaka
Match: ENSORLT00000041701.1 (pep primary_assembly:ASM223467v1:10:29315372:29316364:-1 gene:ENSORLG00000024576.1 transcript:ENSORLT00000041701.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 55.0694 bits (131), Expect = 4.567e-7
Identity = 27/87 (31.03%), Positives = 44/87 (50.57%), Query Frame = -2
Query:  382 FSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLAT 642
            FSK         Q    ++  ++ +WI  +G    I +D+GR FE    +   + +GI++  ++PY  Q NG  ER  RT+ D+L T
Sbjct:   65 FSKFTVAVPTRDQRAVTVVKALVKHWIQPYGVPSRIHSDQGRCFEADVVQGLCKVYGIRKSRTTPYHAQGNGQCERFNRTLHDLLRT 151          
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Medaka
Match: ENSORLT00000045127.1 (pep primary_assembly:ASM223467v1:10:587855:593081:1 gene:ENSORLG00000029202.1 transcript:ENSORLT00000045127.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 54.6842 bits (130), Expect = 4.657e-7
Identity = 35/100 (35.00%), Positives = 51/100 (51.00%), Query Frame = -2
Query:  385 NQYIL--SNF*NYRFSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLA 678
            N+YIL  S++    FSK         Q+   I   +   W+  FG   SI +D+GR FE + F++      I +  +SPY  QS+GL ER  RT+  +LA
Sbjct:   56 NKYILVISDY----FSKWTEAFPLPNQEAQSIAKVLTEEWVCRFGAPRSIHSDQGRNFESTLFKELCSLLSIHKSRTSPYHPQSDGLVERFNRTLLSLLA 151          
BLAST of CGG triplet repeat-binding protein 1 vs. Planmine SMEST
Match: SMESG000020037.1 (SMESG000020037.1)

HSP 1 Score: 203.756 bits (517), Expect = 2.670e-60
Identity = 100/138 (72.46%), Positives = 114/138 (82.61%), Query Frame = -1
Query: 2552 MNLVKMLSKELAKSSKENTELHRQLLQMSREIQQVKATRINPTKVKVLYQKLTAAQRLWTEEKQLRQTQKVQIRSLKVALSACQEGNAVTYPLVFAPTQLAYRETTDASKLAKKPAQRRPGKAERAKRRAVQARDFSK 2965
            M   K+ S+EL KS +EN ELH+QL+QM++EI Q+KAT + P KVK LYQK+TAAQR W EEKQL QTQKVQIRSL+VALSACQEGNAVTYPLVFAP QLAYRE T+ +K A KP  RRPGKAERAKRRA QA+D SK
Sbjct:    1 MGPYKISSRELEKSRRENAELHQQLIQMTKEIHQIKATWVEPGKVKSLYQKMTAAQRGWAEEKQLHQTQKVQIRSLEVALSACQEGNAVTYPLVFAPAQLAYREATETTKPATKPTPRRPGKAERAKRRAAQAKDCSK 138          
BLAST of CGG triplet repeat-binding protein 1 vs. Planmine SMEST
Match: SMESG000058621.1 (SMESG000058621.1)

HSP 1 Score: 202.601 bits (514), Expect = 2.491e-58
Identity = 113/187 (60.43%), Positives = 117/187 (62.57%), Query Frame = -2
Query:  214 RFSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIKTKNNRCKLLSKVEFSLNAT-------------------------------------------KGCYNYANRFKQPEYSSFHDGRESISTLGTA 645
            RFSKL S SAASTQDEA ILN ILNN+ Y FGR  SI  DRG IFEGS F DWMEKFGIKQEFSSPYQHQSNGL ERI RTVRDMLATSL EIKTKNN C+LL K+EFSLNAT                                           K   N  NRF+QP YSSFH GRESIST GT 
Sbjct:   63 RFSKLVSLSAASTQDEATILNFILNNYTYRFGRPGSIFADRGTIFEGSIFPDWMEKFGIKQEFSSPYQHQSNGLEERIKRTVRDMLATSLAEIKTKNNWCRLLPKIEFSLNATIQNSTKFSPFEIVYCRKINLYSGVGHIQKCREEIEDKTKTNLAKAATNMQNRFRQPGYSSFHGGRESISTCGTT 249          
BLAST of CGG triplet repeat-binding protein 1 vs. Planmine SMEST
Match: SMESG000033890.1 (SMESG000033890.1)

HSP 1 Score: 195.667 bits (496), Expect = 2.889e-57
Identity = 95/146 (65.07%), Positives = 122/146 (83.56%), Query Frame = -1
Query: 2540 FIVDMNLVKMLSKELAKSSKENTELHRQLLQMSREIQQVKATRINPTKVKVLYQKLTAAQRLWTEEKQLRQTQKVQIRSLKVALSACQEGNAVTYPLVFAPTQLAYRETTDASKLAKKPAQRRPGKAERAKRRAVQARDFSKP*SL 2977
            +I+DMN +  L+KEL KS ++N +LH QLLQ+++EIQQ++A+ I P + K +YQ+LTAAQR W +EKQL Q+QK+QI SL+VALSACQEGNA+TY LVFAP QLAY ETT+A+K AKKPA+RRPGKAERAKRRA Q +++SK  SL
Sbjct:    4 YILDMNRIGTLTKELTKSRQQNEKLHSQLLQLNKEIQQIRASWIEPARPKRIYQRLTAAQRGWEDEKQLGQSQKIQILSLEVALSACQEGNAITYQLVFAPAQLAYMETTEATKPAKKPAERRPGKAERAKRRAFQIKEYSKIYSL 149          
BLAST of CGG triplet repeat-binding protein 1 vs. Planmine SMEST
Match: SMESG000019458.1 (SMESG000019458.1)

HSP 1 Score: 191.815 bits (486), Expect = 1.539e-54
Identity = 95/113 (84.07%), Positives = 99/113 (87.61%), Query Frame = -2
Query:  307 RFSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIKTKNNRCKLLSKVEFSLNAT 645
            RFSKL S S  S+QDEA ILN ILNNWIY FGR ESILTDRGRIFEGS FRDWMEKFGIKQEFSSPYQHQ NGL+ERIIRTV DML TSLTEIKTKNN C+LL K+EFSLNAT
Sbjct:   63 RFSKLVSLSFVSSQDEATILNVILNNWIYRFGRPESILTDRGRIFEGSMFRDWMEKFGIKQEFSSPYQHQLNGLAERIIRTVWDMLTTSLTEIKTKNNWCRLLPKIEFSLNAT 175          
BLAST of CGG triplet repeat-binding protein 1 vs. Planmine SMEST
Match: SMESG000021658.1 (SMESG000021658.1)

HSP 1 Score: 186.037 bits (471), Expect = 2.855e-53
Identity = 93/113 (82.30%), Positives = 97/113 (85.84%), Query Frame = -2
Query:  307 RFSKLGS*SAASTQDEAIILNPILNNWIYHFGRLESILTDRGRIFEGSKFRDWMEKFGIKQEFSSPYQHQSNGLSERIIRTVRDMLATSLTEIKTKNNRCKLLSKVEFSLNAT 645
            RFSK  S S ASTQ+EA ILN ILNNWIY FG  ESILTDRGRIFEGS FRD MEKFGIKQEFSSPYQHQSNGL+E IIRTVR MLATSLTEIKT NN C+LL K+EFSLNAT
Sbjct:   69 RFSKFVSLSTASTQNEATILNVILNNWIYRFGIPESILTDRGRIFEGSMFRDLMEKFGIKQEFSSPYQHQSNGLAEIIIRTVRVMLATSLTEIKTMNNWCRLLPKIEFSLNAT 181          
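Each HSP summary line in the alignments above reports identities and positives as a count over the alignment length, followed by the same ratio as a percentage. The sketch below parses one such line and recomputes the percentages as a consistency check. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern deliberately tolerates spelling variants of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 84/118 (71.19%), Positives = 96/118 (81.36%), Query Frame = 1"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*ives = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Query Frame = (-?\d+)"
)


def parse_hsp_stats(line):
    """Extract counts, reported percentages and the query frame from an HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct, frame = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Recomputed values, useful for spotting transcription errors.
        "identity_pct": 100.0 * int(ident) / int(ident_len),
        "positives_pct": 100.0 * int(pos) / int(pos_len),
        "query_frame": int(frame),
    }


# First TrEMBL hit from the alignments above.
hsp = parse_hsp_stats(
    "Identity = 84/118 (71.19%), Positives = 96/118 (81.36%), Query Frame = 1"
)
print(round(hsp["identity_pct"], 2), round(hsp["positives_pct"], 2), hsp["query_frame"])
```

A negative query frame, as in the fish and Planmine hits, indicates that the match lies on the reverse-complement strand of the transcript.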
The following BLAST results are available for this feature:
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match Name | E-value | Identity | Description
A0A0J7NEC4 | 4.591e-39 | 71.19 | DUF659 domain-containing protein OS=Lasius niger O...
A0A5E4MX87 | 1.779e-29 | 41.80 | DUF659 domain-containing protein (Fragment) OS=Cin...
J9L0Q9 | 5.815e-29 | 35.57 | DUF659 domain-containing protein OS=Acyrthosiphon ...
A0A2S2N9G9 | 3.563e-28 | 43.81 | DUF659 domain-containing protein OS=Schizaphis gra...
J9L9S4 | 4.324e-28 | 42.45 | DUF659 domain-containing protein OS=Acyrthosiphon ...
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 5
Match Name | E-value | Identity | Description
ENSAMXT00000053716.1 | 1.599e-10 | 33.93 | pep primary_assembly:Astyanax_mexicanus-2.0:APWO02...
ENSAMXT00000045469.1 | 6.134e-10 | 33.04 | pep primary_assembly:Astyanax_mexicanus-2.0:APWO02...
ENSAMXT00000043043.1 | 1.500e-9 | 32.14 | pep primary_assembly:Astyanax_mexicanus-2.0:APWO02...
ENSAMXT00000033536.1 | 5.975e-8 | 31.25 | pep primary_assembly:Astyanax_mexicanus-2.0:3:6683...
ENSAMXT00000030217.1 | 2.417e-7 | 30.36 | pep primary_assembly:Astyanax_mexicanus-2.0:16:123...
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 0
BLAST of CGG triplet repeat-binding protein 1 vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 5
Match Name | E-value | Identity | Description
ENSORLT00000040891.1 | 1.278e-8 | 37.63 | pep primary_assembly:ASM223467v1:7:6445512:6449001...
ENSORLT00000033794.1 | 1.765e-8 | 37.63 | pep primary_assembly:ASM223467v1:24:18649635:18652...
ENSORLT00000041487.1 | 1.794e-8 | 37.63 | pep primary_assembly:ASM223467v1:1:12098754:121030...
ENSORLT00000041701.1 | 4.567e-7 | 31.03 | pep primary_assembly:ASM223467v1:10:29315372:29316...
ENSORLT00000045127.1 | 4.657e-7 | 35.00 | pep primary_assembly:ASM223467v1:10:587855:593081:...
BLAST of CGG triplet repeat-binding protein 1 vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match Name | E-value | Identity | Description
SMESG000020037.1 | 2.670e-60 | 72.46 | SMESG000020037.1
SMESG000058621.1 | 2.491e-58 | 60.43 | SMESG000058621.1
SMESG000033890.1 | 2.889e-57 | 65.07 | SMESG000033890.1
SMESG000019458.1 | 1.539e-54 | 84.07 | SMESG000019458.1
SMESG000021658.1 | 2.855e-53 | 82.30 | SMESG000021658.1
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30003596 ID=SMED30003596|Name=CGG triplet repeat-binding protein 1|organism=Schmidtea mediterranea sexual|type=transcript|length=5950bp
CAAAAACGTATACTTAATTCTGAATATTAAAACAATGTTTTTAATATTCA
TTATTGGACATCACTCTCCTCTTCATTTTCTTTGTACTTCTTCAGCCATT
CGATTCTTCTTGATTTAATTGAGTGTGGAAATTGTAATTCAACTTGTCTT
GGCGATAAGAATTTTAAAATTTTGTGAGGTCCTTCATATTAAATTCCCTC
CTTCTTACGTATATGCGGTTCCAAGCGTACTAATACTTTCTCTCCCATCA
TGAAAACTCGAGTACTCCGGTTGTTTAAATCTATTTGCATAGTTGTAGCA
ACCTTTTGTAGCATTCAAACTAAATTCAACCTTTGATAACAATTTACACC
TATTATTCTTTGTTTTAATCTCTGTCAATGATGTTGCCAACATGTCCCGA
ACTGTTCTTATGATTCGCTCTGATAAACCATTCGATTGATGTTGATACGG
TGAACTAAATTCTTGTTTTATACCAAATTTCTCCATCCAATCACGAAACT
TTGATCCTTCGAAAATTCTTCCTCTATCTGTAAGAATGCTTTCAAGTCTA
CCAAAATGATATATCCAATTATTCAAAATGGGATTTAAAATTATTGCTTC
ATCCTGTGTACTTGCGGCTGATCAACTGCCCAATTTACTAAACCGATAGT
TTTAAAAATTACTCAAAATATATTGATTAATGATAACTAGTGCAGCCAAA
AATATGAATTATCGTGTTTTTTCTGAATCATTTTGAGAAAACGATATCAA
TCCATTTTAGGAAAAACTATGCCCTTTAACCCTTATTTTTTTGGGCGTAT
TTTTGCATAAAATCATTTTTAGGGCATATTTGAGTAAAAATCTTTGTTTT
TAATAATATTTTTTCGAAATAATAAACACTAATTATTGCAAATTTTTTAT
ACAGTATATTAAATAATGTGTTTTTACAGCGTACATTTTTGCTGGGGTTA
AGAAAAATGATTACATTTTAGCTGGGTTTTATAAAAAAGGTACTGACTTT
AGTTTGAAGGGGATCCCTGGTTATTAATAAATTTTTTTTAAAAATAAAAC
TTAGTTATTGCAATATTTACTTAATATACAACATTAAATTATAAGTTGCT
ATTTAAATAAATTTAATTTTTGTAAGATAAACCGATGGTAAATCAATCCG
TATCAAACAGATTGAAGCATTATGTATGCAAATTTCCAAATTTAAAAACG
GATGGAAGGATTCTTTTTTGTAAAGTGTGCAATAAATCGGTGTTAGAAGA
AATTATATTTACGATAAAACAGCACCTGTATCCTAACGTTGATAGGTTAA
TTGCAAATGTAAAAAAAGTATTTTTAAAAGCTCCATCTAGATTGCAGGTT
TTTAAGGATTTAGAACCTGGGTTTACACTCCCTACAGAACAAATAACCAC
ACGATGGGGCACTTCACTCATATCCGTAAATTACTATGCAAACAACTTTG
AGAAGATAGTGAGAATTTTTGATGCATTAGACAATAAGGAAGCAGGCTCA
ATAAAAATTTCACAAGATCTTCTATGTGACAGCACAATAAAAGCTGATTT
AATTTTTATTGCATCAAATTATGGATTTCTTGGAGCTTCAATAACAACTG
GAAACATCTTGACTATAACTTTCTGCATAGATTTAAATCGTAACACGCTA
TGGAATTAATTAACGTCATATACCGTAATACAGCCGACATTATTAAAAAT
ACTTGCTACTGTAATCGAAAAAACTGTGGGTTTACTTTAATGAGAAATAT
TTCAGCAATTTTGGCTGGTGTATCCGTCACATTAACAGAAAAGTATACCT
GTAGTGAAATTTTAGCTTTTAAATTCGCATGAATTACCTCATTTGATGTC
GAACAAAGTTTCTCGATGATTAAGAGTGTTTTAAGACCAAACAGACAAAG
CTTCCTGTTTGAAAATTTAAGCGAAATGTAATAATAATTTAAACTAATTT
TCTTAATTATTTATTTTTCAACCTTGTCACCTAGAACTTTTATGACATTT
TAGTTCTCTTCTTTTAAATTTAATTTTTGAATATAATACAAATAAATAGG
TATAGTTAATGAATATTCATTTAAAAATCTATTTGGTGTTTTATTTTTAT
TTGTTAGGTGTTCAGATTTTTGAAGCAAAATAAAATTTTGTAAAACATTT
TCAGATTTATAGAGCATATTTTTGGTTGCCCTAGAGTGGGGTTTATGGTA
TTTAACCATGAGCCCATGTATAGAACTTTTCTTGCCCCCAGTGGTTACCC
GTTGCTAAACAACGTTGTTACTGTAACCGCTAGTTATGAGATCGCCGAAG
GTGAGCGCTAGGAAAGAACTGCTACGGCCTGATCTCACGATGGGTTCAAC
GCACTACCGATAGAAGGTTTATGTCCGATTTCCTATTTCGGTCCCTTGTC
GTGCTTGACAATAATAGCCCTTAAACTATTGTCCGAACCACTCGTGACTT
CAATTTAACGGATTTTAATTTTATCATTAATAATTCGATGGATAAACAAT
TTGAATTCAGAAGGTCAACTGTGAATATTTAATAAGTCAAAGCGATCATG
GTTTAGAAAAATCTCTAGCTTGTACTGCACGACGCTTGGCGCGTTCTGCC
TTTCCTGGGCGTCGCTGAGCAGGTTTCTTTGCCAGCTTAGATGCATCGGT
TGTCTCTCGATAGGCCAACTGAGTCGGTGCGAACACAAGCGGGTAGGTTA
CGGCATTGCCTTCTTGGCAAGCCGATAATGCTACTTTCAAACTGCGGATT
TGTACTTTCTGGGTTTGGCGGAGCTGTTTTTCCTCGGTCCACAACCGCTG
TGCAGCAGTCAATTTTTGGTATAGCACTTTCACTTTAGTGGGGTTTATCC
GTGTCGCCTTAACCTGTTGAATTTCTCTCGACATTTGTAGTAATTGCCGA
TGTAATTCGGTGTTTTCTTTGCTTGACTTCGCCAGTTCTTTGCTCAACAT
TTTCACTAGGTTCATGTCTACAATAAAACGAAAATTTGGTCTTATTTTAT
TTTGGAAAAATTTTCTTATCTCTGACAACGTCAAGTTATGTTATTGATTT
AAGCTTATCACACGTCTGTATTACGTTGATTCCTTTACCGCTCTATCATT
ATTATTATCTCAGCTTCAAAAATTTAAAATAATTCATAAGTAAAGGGAGA
AAAATAAAATCCAAATTTTTATTTTGTAACACACTATCGCCGCATGGGAC
TTCGGTAACGTGGAATTTGGTTATGTTTCGGTCGCATTCTAGCATACGGA
TGGTCTTGGTCTTCCCTCGGACTTTGTTCACTAGACCGCATCTCTTTCGG
TACCACCACTTTCATGGATTTGGATAATTCTCCTAGCATTGGGCTATCGA
GTTCGTAAGAAATGCCGTACGATGGGGCATACACATCCGTACTCGGGATC
CTTCCCGCGGCAATATCAGAGTTTACTCTGACCAGATGACGCTTCGCCAT
TTCCACTTGCCAGTTGGTCCGTTGCTGGCTCTCCTCAATGCGTGCCAGAA
GATACTCGGCGCGTTGCCTACTTTCACAAGCGTGGTTTCGATCCAGCTCT
CTTTTGATAGCCACATATGGTAAGTCGGCATCTCCGAAGTCCTCTTGACG
TCCTGCCCTCTATTATTTTACGGGCGAGATTGAGAAGCCCGCGTCCCACA
TCGCCTGCCTGCATCTGGCAGCGGATCGTGTAGGGAATCGAATGGTAACT
CGAGACGCGTCGATCCACGATTCATCTGAAGGATAAATATTTACTTATCG
AATCTTTTATTTGTTTCTTGAGTTGTCCGAATTTAAAATGTGTGTGGTAC
GTGGAATCAGTCATTATTGAGTCTAGGTGTGGGTCATACTACCTGAGTAT
GTGGGTTGAAAAATGGTGAGTCGAACTTTGCGGCGTCGAGAAAATACCGT
TTAGGAGCCTTCACGCCCTCCCTGGTTAATGGCTACTGGCCTATATGTGT
CTAAAATTGACGGATTCCATTTCTAGTACGGAATTGATGTGAGGTTTGAG
CGAGAATATTTAATTAAGGGTCAACAGATTCGTGTCGTAGATTTAGTATT
GGAGTTTCATGAGATGACGCTATAAGTGGTTCATGTAGCGAGGTTTATTT
GCTTTGGTCCGATTATCTAACGTTGTGTGCATTTACTTCTGATCTTTATT
ATTAGGATCTCAGTCTGTTTGACTGTATTTAATTTCATCGGTGAAATTTT
CTCAAGAAGTAATTGACAACGAACAGCATTACGCTAGATTTTTTATTGCT
AATTTATAACAATTGGTGTACAGTTCTATCTTCATGCCTTTACTGATTCA
AAAAAGCGTCGTCGCAATCGGGGAAACAGGACATTTCCGAATCCAATTTG
TTGTCTACATTCGACCCTCATGATAACTAACTGAGAATTATTAACATCAA
TATATAATTTTACCTGCTTGTTCGTGCATCTTTTCCTCGATAGTTACTTC
TTCCACTTGGCTATCTCGCTGTTGGCCGTTAGAATGTCTTCCCATTTGTT
GTCGGTGATTGTACATTTTTCAGTTGTCTCGAAGCTAGAATAGTATTGCT
GATTTTTCACAGAAGGTGCGCCTATTTATATCGGGTTGTGTAGTGACACG
TTACCATTGTGTGCATGTTCCTAAGTATGTCAGTGTAACTCATTTTGATT
GTACCGTTCTAACGAAATCACATTGTTGTATGCTAATCATGTGTTGATTC
TTGTCTCACCGTGGCAGTGTGCTATTAATTTTCGATATTTCTCTGTCGTT
TTTCACTTCAATGTGGAATTTTGTTTGGATTTAGTTTTATTAAACTGCCC
CTCGTCATGCCGTCGAACCACTTAGGTCTGGTATCTAATTAGTTGTAAAT
GGCCCGCTCACCACCACCAATCAGCCCCTACACGCTCGGCCCGACACCCA
CCCGCCACACTTCCCAACATGCACTCACCAACCACGCCCTCCTGCCGCCA
CTCCATAACAAACGAACCTGCGGTCGAGAAGGCCAAATGTGCTCCACAGG
CGATCCCTCACGGGTTGGCAATCCGACGGACGATGACTCGATACCAATAT
TGTCTGACCATCAGATTTTTACCGATCATTAGGACTCTCAGCTCCCCATC
TCGTCCTCACCCATTACGTTCCGATCCTTCTCAGTGGATTTGGCGCACCA
AGTTGGTTCGGAGTTCATTCTATTATTATGTATTTTTTTGTAATTTAATT
GGGTCCTCCTCGACCTCCCGTACTATGCACCCAATCACGACCTCTAATTC
GGCATTATTGTTAGAAAGAAGCCTAAATTAATCGATTACCTTTGCATAGA
TATTAGGTTTAGTAATGTGCTTAGAATTGTTTACTTGTTATATAAATATT
GTTTTATTACATTGATTATTTTTCCGCCTCTATACAACTAAAGGGATTAA
AGTTATTTTTACATTAATATTTAAATTATTTGACTACTTTCTATACGAAC
ATAATTAGTAAAGTGTTTACTTTATTTGCGTAAATATCATTATTGAAGAG
TTCAAATTATTTTACGTGGCTATTATTAGTTAAGGGCCTGAATTATTTGA
CGTAACTATTATTAGTAAAATACTTAAATTGTTTGACGTAACAACGCTTC
TCGATTCTTTCCCTCACTCCCTGCCCTCGTTTGACCGTAGGGTTATTTGT
GTCGATATTAGAATATTACGAGTGCCACTGGACTACTCGGTGTTTATGAT
TTTAAGCCGTTGTGCAATGAAGATTTTAGGTCGATTTTATGGAAAATGAG
TGATGAAGGAGCAGTCAAAAGGCTAACCCTCAACCTTCCCCTATAATGGA
TAATATTGACGTTTGGGTTCCTGCGTTACTCACGCCGGAACCTTCACGGA
CAAAAGCTTTTGGCGGATGAACTGCACTTACACAGCTTGAATTGATGTGC
ATATTTTGCTTGCTTGGGGTGTTGTATCGGCTGTGGGCCGGAAAAATTTC
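The FASTA header above packs its metadata into `key=value` pairs separated by `|`. A minimal sketch of reading those fields and checking the declared length is shown below. The function names are hypothetical, and the check assumes every sequence line holds 50 nt, as the layout above suggests.

```python
def parse_fasta_header(header):
    """Split a header like '>ID key=value|key=value' into (seq_id, fields)."""
    if not header.startswith(">"):
        raise ValueError("FASTA headers start with '>'")
    seq_id, _, description = header[1:].partition(" ")
    fields = {}
    for item in description.split("|"):
        key, sep, value = item.partition("=")
        if sep:  # skip anything that is not a key=value pair
            fields[key] = value
    return seq_id, fields


def declared_length(fields, unit="bp"):
    """Return the numeric part of the 'length' field, e.g. '5950bp' -> 5950."""
    raw = fields["length"]
    if raw.endswith(unit):
        raw = raw[: -len(unit)]
    return int(raw)


header = (
    ">SMED30003596 ID=SMED30003596|Name=CGG triplet repeat-binding protein 1|"
    "organism=Schmidtea mediterranea sexual|type=transcript|length=5950bp"
)
seq_id, fields = parse_fasta_header(header)
length = declared_length(fields)

# The transcript above spans 119 lines; at 50 nt per line that gives 5950 nt.
lines_shown, nt_per_line = 119, 50
print(seq_id, fields["type"], length, lines_shown * nt_per_line == length)
```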

protein sequence of SMED30003596-orf-1

>SMED30003596-orf-1 ID=SMED30003596-orf-1|Name=SMED30003596-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=140bp
MNLVKMLSKELAKSSKENTELHRQLLQMSREIQQVKATRINPTKVKVLYQ
KLTAAQRLWTEEKQLRQTQKVQIRSLKVALSACQEGNAVTYPLVFAPTQL
AYRETTDASKLAKKPAQRRPGKAERAKRRAVQARDFSKP*

protein sequence of SMED30003596-orf-2

>SMED30003596-orf-2 ID=SMED30003596-orf-2|Name=SMED30003596-orf-2|organism=Schmidtea mediterranea sexual|type=polypeptide|length=175bp
MVNQSVSNRLKHYVCKFPNLKTDGRILFCKVCNKSVLEEIIFTIKQHLYP
NVDRLIANVKKVFLKAPSRLQVFKDLEPGFTLPTEQITTRWGTSLISVNY
YANNFEKIVRIFDALDNKEAGSIKISQDLLCDSTIKADLIFIASNYGFLG
ASITTGNILTITFCIDLNRNTLWN*
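The two ORFs above can be related back to the transcript by conceptual translation: orf-2 lies on the forward strand, while orf-1 lies on the reverse-complement strand (the BLAST alignments report it with a negative query frame). The sketch below assumes the standard genetic code and uses two short snippets copied from the transcript listing, at positions 1207-1239 and 2951-2965 as counted from the 50-nt line layout. The frames passed to `translate` are relative to each snippet, not to the full transcript, and all function names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse-complement strand of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna, frame=1):
    """Translate in frame 1..3 (forward) or -1..-3 (reverse-complement strand)."""
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of 1, 2, 3, -1, -2, -3")
    seq = dna.upper()
    if frame < 0:
        seq = reverse_complement(seq)
    start = abs(frame) - 1
    codons = (seq[i:i + 3] for i in range(start, len(seq) - 2, 3))
    # Codons containing unexpected characters are rendered as 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Forward-strand snippet: part of orf-2.
forward_snippet = "AGGATTCTTTTTTGTAAAGTGTGCAATAAATCG"
# Snippet whose reverse complement encodes the start of orf-1.
reverse_snippet = "TTTCACTAGGTTCAT"

print(translate(forward_snippet, frame=1))
print(translate(reverse_snippet, frame=-1))
```

The first call yields a peptide found inside orf-2 (RILFCKVCNKS) and the second yields the first five residues of orf-1 (MNLVK).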
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: molecular function
Term | Definition
GO:0046983 | protein dimerization activity
GO:0003676 | nucleic acid binding
GO:0001227 | transcriptional repressor activity, RNA polymerase II transcription regulatory region sequence-specific binding
GO:0003690 | double-stranded DNA binding
Vocabulary: Planarian Anatomy
Term | Definition
PLANA:0002109 | X1 cell
PLANA:0002111 | X2 cell
Vocabulary: INTERPRO
Term | Definition
IPR012337 | RNaseH-like_sf
IPR001584 | Integrase_cat-core
Vocabulary: biological process
Term | Definition
GO:0015074 | DNA integration
GO:0000122 | negative regulation of transcription from RNA polymerase II promoter
GO:0006357 | regulation of transcription from RNA polymerase II promoter
Vocabulary: cellular component
Term | Definition
GO:0005634 | nucleus