Protein CBG22824

Overview
Name: Protein CBG22824
Smed ID: SMED30009376
Length (bp): 3674
Neoblast Clusters

Zeng et al., 2018





 



 

Overview

 

Single-cell RNA-seq of pluripotent neoblasts and their early progeny


We isolated X1 neoblast cells enriched for high piwi-1 expression (Neoblast Population) and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them from high (Nb1) to low (Nb12) piwi-1 expression. We further defined the groups of genes that best classified the cells in each of the 12 clusters and used them to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signature was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1, which largely confirmed the cell clusters revealed by scRNA-seq.
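The cluster naming rule described above (Nb1 for the highest piwi-1 expression, Nb12 for the lowest) can be illustrated with a minimal sketch. It assumes that clusters are ranked by mean marker expression; the function name, the cluster labels and the toy values are all hypothetical and are not taken from the study.

```python
from statistics import mean

def order_clusters_by_marker(expr_by_cluster):
    """Map each original cluster label to 'NbK', K=1 for the highest mean expression.

    Assumption: ranking uses the mean marker expression per cluster.
    """
    ranked = sorted(expr_by_cluster,
                    key=lambda c: mean(expr_by_cluster[c]),
                    reverse=True)
    return {label: f"Nb{i}" for i, label in enumerate(ranked, start=1)}

# Hypothetical toy values for three clusters, not data from the study.
toy = {"c0": [1.0, 2.0], "c1": [5.0, 6.0], "c2": [3.0, 3.5]}
names = order_clusters_by_marker(toy)
print(names)
```

With real data the dictionary would hold one list of per-cell piwi-1 values for each of the 12 clusters.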

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, then isolated 1,200 individual cells derived from the X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations and profiled them by single-cell RNA-seq (Sub-lethal Irradiated Surviving X1 and X2 Cell Population).




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of Protein CBG22824 (SMED30009376) in the t-SNE clustered cells

Violin plots show distribution of expression levels for Protein CBG22824 (SMED30009376) in cells (dots) of each of the 12 neoblast clusters.

 



 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of Protein CBG22824 (SMED30009376) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for Protein CBG22824 (SMED30009376) in cells (dots) of each of the 10 clusters of sub-lethally irradiated X1 and X2 cells.

 



 

Embryonic Expression

Davies et al., 2017




Graph: average RPKM values per sample.

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single-animal RNA-Seq experiment, please refer to the Materials and Methods.
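As a reminder of what an RPKM value represents, the following minimal sketch applies the standard definition (reads per kilobase of transcript per million mapped reads). The transcript length of 3674 bp comes from this record; the read count and library size are hypothetical numbers chosen only to make the arithmetic easy to follow.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads Per Kilobase of transcript per Million mapped reads."""
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)

TRANSCRIPT_LENGTH_BP = 3674  # length of SMED30009376 as listed in this record

# Hypothetical counts: 3,674 reads on the transcript in a library of 20 million mapped reads.
value = rpkm(read_count=3674,
             transcript_length_bp=TRANSCRIPT_LENGTH_BP,
             total_mapped_reads=20_000_000)
print(value)
```

An average RPKM per sample, as plotted for the embryonic stages, would then be the mean of such values over the replicates of each stage.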

 

Homology
BLAST of Protein CBG22824 vs. UniProt/SwissProt
Match: sp|A6UTF6|TBP_META3 (TATA-box-binding protein OS=Methanococcus aeolicus (strain ATCC BAA-1280 / DSM 17508 / OCM 812 / Nankai-3) OX=419665 GN=tbp PE=3 SV=1)

HSP 1 Score: 56.225 bits (134), Expect = 1.895e-7
Identity = 41/130 (31.54%), Positives = 63/130 (48.46%), Query Frame = -2
Query:  770 KYPIIFFKSGKCRIMGCKKPLD----VNKLQYRIKD----------LKIQSITVTMDIGQSINLYNMSTKCLCMFEPELFPALRLLKYNPMCV-NVFASGKIVILGLRNLE--------YKNIVTDIHNE 1090
            K  ++ F+SGK    G K   D    +NK+   +++          +K+Q++  T ++G   NL ++ST     +EPE FP L     +P  V  VF SGK+VI GL+N E         KN V ++  E
Sbjct:   50 KVALLIFRSGKLNCTGAKSKEDAVIAINKVMEYLREAGLDLIDTPEVKVQNMVATAELGMEPNLDDLSTLERTEYEPEQFPGLVYRMESPKVVLLVFGSGKVVITGLKNKEDAYIALEKIKNTVKELEEE 179          
BLAST of Protein CBG22824 vs. UniProt/SwissProt
Match: sp|A6URP5|TBP_METVS (TATA-box-binding protein OS=Methanococcus vannielii (strain ATCC 35089 / DSM 1224 / JCM 13029 / OCM 148 / SB) OX=406327 GN=tbp PE=3 SV=1)

HSP 1 Score: 52.373 bits (124), Expect = 3.439e-6
Identity = 38/118 (32.20%), Positives = 58/118 (49.15%), Query Frame = -2
Query:  782 KYPIIFFKSGKCRIMGCKKPLDVN-KLQYRIKDLK-------------IQSITVTMDIGQSINLYNMSTKCLCMFEPELFPALRLLKYNP-MCVNVFASGKIVILGLRNLEYKNIVTD 1090
            K  ++ F+SGK    G K   D    ++  IK+LK             +Q++  T ++G   NL ++ST     +EPE FP L     +P + V +F SGK+VI GL+ +E   I  D
Sbjct:   50 KVALLIFRSGKLNCTGAKSKEDAEIAIKKIIKELKEAGMEIIDNPVVSVQNMVATTELGMEPNLDDISTLECTEYEPEQFPGLVYRLSDPKVVVLIFGSGKVVITGLKVIEDAYIAFD 167          
BLAST of Protein CBG22824 vs. UniProt/SwissProt
Match: sp|Q6M0L3|TBP_METMP (TATA-box-binding protein OS=Methanococcus maripaludis (strain S2 / LL) OX=267377 GN=tbp PE=3 SV=1)

HSP 1 Score: 52.373 bits (124), Expect = 3.472e-6
Identity = 38/118 (32.20%), Positives = 61/118 (51.69%), Query Frame = -2
Query:  782 KYPIIFFKSGKCRIMG--CKKP--LDVNKLQYRIKD----------LKIQSITVTMDIGQSINLYNMSTKCLCMFEPELFPALRLLKYNP-MCVNVFASGKIVILGLRNLEYKNIVTD 1090
            K  ++ F+SGK    G  CK+   + +NK+   +K+          +K+Q++  T ++G   NL ++ST     +EPE FP L      P + V +F SGK+VI GL+ +E   I  D
Sbjct:   50 KVALLIFRSGKLNCTGARCKEDAVIAINKIVKELKEAGMDLIDNPEVKVQNMVATTELGMEPNLDDISTLECTEYEPEQFPGLVYRLSEPKVVVLIFGSGKVVITGLKVIEDAYIAFD 167          
BLAST of Protein CBG22824 vs. UniProt/SwissProt
Match: sp|Q9P9I9|TBP_METTL (TATA-box-binding protein OS=Methanothermococcus thermolithotrophicus OX=2186 GN=tbp PE=3 SV=1)

HSP 1 Score: 51.6026 bits (122), Expect = 7.396e-6
Identity = 36/108 (33.33%), Positives = 54/108 (50.00%), Query Frame = -2
Query:  812 KYPIIFFKSGKCRIMGCKKPLDVN-KLQYRIKDLK-------------IQSITVTMDIGQSINLYNMSTKCLCMFEPELFPALRLLKYNP-MCVNVFASGKIVILGLR 1090
            K  ++ F+SGK    G K   D    ++  IK+LK             +Q++  T D+G   NL ++ST     +EPE FP L     +P + V +F SGK+VI GL+
Sbjct:   50 KVALLIFRSGKLNCTGAKSKEDAEIAIKKIIKELKDAGMDIIDNPEVNVQNMVATADLGIEPNLDDISTLEGTEYEPEQFPGLVYRLSDPKVVVLIFGSGKVVITGLK 157          
BLAST of Protein CBG22824 vs. TrEMBL
Match: A0A2L2YU97 (Uncharacterized protein (Fragment) OS=Parasteatoda tepidariorum OX=114398 PE=2 SV=1)

HSP 1 Score: 151.754 bits (382), Expect = 5.691e-38
Identity = 81/196 (41.33%), Positives = 116/196 (59.18%), Query Frame = 2
Query: 1316 IVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAMVYK 1903
            I GP+GSGK+ FV KL+++N+F      I +       +G  Q  F K+KN+K  +G   N SS      VIIIDDL  E + +   +NLF+K S H  +T+IF+ QN+FH+G   R  NLN  YLV++KNPRD + I+ L RQ +P      I  F DAT  P+ Y+F+DF  + +D LR++T IF ++    Y+
Sbjct:    7 ISGPTGSGKSFFVSKLIENNMFYPMPKNIIY------CYGCYQPLFDKMKNVKFEEGLPSNLSSI--SDAVIIIDDLMSELSSDITLSNLFSKYSHHRKLTIIFLVQNIFHKGRVMRDINLNSHYLVLYKNPRDKSQINHLGRQMFPGKLKSFIEIFHDATAEPYSYLFIDFRPETDDRLRLRTGIFPQDKHFSYQ 194          
BLAST of Protein CBG22824 vs. TrEMBL
Match: A0A4Y2QW74 (Uncharacterized protein OS=Araneus ventricosus OX=182803 GN=AVEN_149572_1 PE=4 SV=1)

HSP 1 Score: 147.517 bits (371), Expect = 1.801e-36
Identity = 84/203 (41.38%), Positives = 116/203 (57.14%), Query Frame = 2
Query: 1292 LRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAMVY 1900
            +++ +   I GPS SGKT FV +LL S L Q    KI W  GA       Q  F  + N++  +G   +  +      +IIIDDL  E + +K  +NLFTK S H  +++IFI QN+FH+G   R  +LN  YLV FKN RD   I  LARQ YP+   F + +F DAT+ P GY+FLD   +  + LRV+T IF ++  +VY
Sbjct:    6 IKHPSSMIISGPSNSGKTFFVKRLLDSKLIQPFPPKILWCYGA------YQKLFDNMPNVEFHEGLPSDIDTV--SDALIIIDDLMSEVSSDKRLSNLFTKGSHHRNLSIIFIVQNMFHKGKEMRNISLNASYLVCFKNVRDKQQISCLARQMYPSQSKFFLESFIDATQKPFGYLFLDLKPEREECLRVRTGIFPEDKNIVY 200          
BLAST of Protein CBG22824 vs. TrEMBL
Match: A0A3C1P802 (Uncharacterized protein OS=Planktothrix sp. UBA8402 OX=2055759 GN=DCQ63_02380 PE=4 SV=1)

HSP 1 Score: 150.984 bits (380), Expect = 8.501e-36
Identity = 71/129 (55.04%), Positives = 95/129 (73.64%), Query Frame = 3
Query: 2490 KIWTNPEDEAGFGGVAKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMF-KNGHPDNLQTDLG 2873
            K++ +P + AG+ G ++L+KR PK+  + +KWL+ Q AY+L+KPM+++F TRAYKT G +DLWQMDLMEMIPY+KIN G +YILTCIDVFSRFAR    KTK A  +  AI  M  +   P ++QTD G
Sbjct:   17 KVYYDPSEPAGYAGASRLQKRFPKT--DVKKWLATQPAYTLHKPMKRKFATRAYKTSGADDLWQMDLMEMIPYAKINGGNRYILTCIDVFSRFARAEAVKTKDAITVCAAIRKMLAQKSSPRHVQTDAG 143          
BLAST of Protein CBG22824 vs. TrEMBL
Match: A0A1Y1KS04 (Uncharacterized protein OS=Photinus pyralis OX=7054 PE=4 SV=1)

HSP 1 Score: 139.428 bits (350), Expect = 2.499e-33
Identity = 77/196 (39.29%), Positives = 108/196 (55.10%), Query Frame = 2
Query: 1316 IVGPSGSGKTMFVCKLLKS--NLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAMV 1897
            + GPSGSGK+ FV   LK+  N+    F +I WH      +   Q+ +  LK ++  +G     +    K  ++IIDDL +E+N      ++FTK   H  ++V F+TQN+FHQG   R  +LN  Y+++FKNPRD   I  L RQ  P N  FL  A+ DAT  PHGY+  D  Q   D  R +T IF ++GA V
Sbjct:   13 VSGPSGSGKSYFVVNFLKNVGNICNVVFERIVWH------YAEWQELYDDLKGIEFHQGLPDMSNFDGLKPTLVIIDDLMRESNGS--MVDIFTKGCHHRNLSVFFLTQNIFHQGKGQRDISLNAHYIILFKNPRDRAQIKHLTRQILPENSKFLEEAYNDATSKPHGYLLFDLKQSTPDVFRYRTSIFEEDGACV 200          
BLAST of Protein CBG22824 vs. TrEMBL
Match: A0A0J7MMC4 (Uncharacterized protein (Fragment) OS=Lasius niger OX=67767 GN=RF55_25379 PE=4 SV=1)

HSP 1 Score: 135.576 bits (340), Expect = 2.415e-32
Identity = 73/196 (37.24%), Positives = 118/196 (60.20%), Query Frame = 2
Query: 1316 IVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAMVYK 1903
            + GP+GSGK++FV K+L+ ++F    +K+ +       +G+ Q  F  +K++   +G   ++ ++++   +II+DDL  E + +   +NLFTK S H  +++IFI QNLFHQG   RT +LN  Y+ +FKNPRD + +  LARQ +P         FQDAT   +GY+FLD   +  D LR+++ IF  +   VY+
Sbjct:    5 VSGPTGSGKSVFVKKMLEHHMFHPWPDKVIYC------YGVYQPLFNSMKDVIFEEGL-PSYLNQIQNA-LIIVDDLMTELSGDSRLSNLFTKGSHHRYISIIFIVQNLFHQGKEMRTIHLNCHYMTLFKNPRDKSQVMHLARQMFPGKSKAFQEIFQDATHPAYGYLFLDLRPETEDRLRMRSGIFPGDKHYVYE 192          
BLAST of Protein CBG22824 vs. Ensembl Cavefish
Match: ENSAMXT00000055500.1 (pep primary_assembly:Astyanax_mexicanus-2.0:APWO02001032.1:12601:13692:-1 gene:ENSAMXG00000041341.1 transcript:ENSAMXT00000055500.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 90.5077 bits (223), Expect = 7.599e-19
Identity = 50/136 (36.76%), Positives = 80/136 (58.82%), Query Frame = 3
Query: 2487 EKIWTNPEDEAGFGGVAKLKKRVPKSK------KETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGH-PDNLQTDLG 2873
            +KI+ +P +   +GGV  L++ + + K      K+ + WLS Q AY+L+KP+  +F T       +++ WQ DL++M   SK NK  K++LTCID+ S++A     + K+  E++ A NS+ K G  P  LQTD G
Sbjct:    7 KKIYYDPSNPGSYGGVESLQRAIFEKKGSRANIKDVKNWLSAQDAYTLHKPVLGKFKTNRVFVKNMDEQWQADLVDMSNLSKDNKDMKFMLTCIDILSKYAWVRVLRNKTGVEVTDAFNSILKEGRVPKKLQTDQG 142          
BLAST of Protein CBG22824 vs. Planmine SMEST
Match: SMESG000051837.1 (SMESG000051837.1)

HSP 1 Score: 422.165 bits (1084), Expect = 4.012e-131
Identity = 201/203 (99.01%), Positives = 202/203 (99.51%), Query Frame = 2
Query: 1286 MELRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM 1894
            M+LRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCN DLRVKTDIFNKEGAM
Sbjct:    1 MDLRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNGDLRVKTDIFNKEGAM 203          

HSP 2 Score: 260.766 bits (665), Expect = 8.137e-73
Identity = 123/128 (96.09%), Positives = 125/128 (97.66%), Query Frame = 3
Query: 2490 KIWTNPEDEAGFGGVAKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 2873
            KIWTNPEDEAGFGGVAKLKKR+P SKKETQKWLSDQLAYSLNKPM KRFPTRAY+TFGINDLWQMDLMEMIPYSKINKGYKYIL CIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG
Sbjct:  204 KIWTNPEDEAGFGGVAKLKKRLPNSKKETQKWLSDQLAYSLNKPMGKRFPTRAYETFGINDLWQMDLMEMIPYSKINKGYKYILMCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 331          
BLAST of Protein CBG22824 vs. Planmine SMEST
Match: SMESG000065288.1 (SMESG000065288.1)

HSP 1 Score: 427.172 bits (1097), Expect = 4.173e-125
Identity = 203/203 (100.00%), Positives = 203/203 (100.00%), Query Frame = 2
Query: 1286 MELRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM 1894
            MELRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM
Sbjct:    1 MELRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM 203          

HSP 2 Score: 270.781 bits (691), Expect = 1.393e-73
Identity = 128/128 (100.00%), Positives = 128/128 (100.00%), Query Frame = 3
Query: 2490 KIWTNPEDEAGFGGVAKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 2873
            KIWTNPEDEAGFGGVAKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG
Sbjct:  204 KIWTNPEDEAGFGGVAKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 331          
BLAST of Protein CBG22824 vs. Planmine SMEST
Match: SMESG000029226.1 (SMESG000029226.1)

HSP 1 Score: 424.861 bits (1091), Expect = 3.693e-124
Identity = 202/203 (99.51%), Positives = 203/203 (100.00%), Query Frame = 2
Query: 1286 MELRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM 1894
            M+LRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM
Sbjct:    1 MDLRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM 203          

HSP 2 Score: 270.011 bits (689), Expect = 2.817e-73
Identity = 127/128 (99.22%), Positives = 128/128 (100.00%), Query Frame = 3
Query: 2490 KIWTNPEDEAGFGGVAKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 2873
            KIWTNPEDEAGFGGVAKLKKRVPKSKK+TQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG
Sbjct:  204 KIWTNPEDEAGFGGVAKLKKRVPKSKKKTQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 331          
BLAST of Protein CBG22824 vs. Planmine SMEST
Match: SMESG000062279.1 (SMESG000062279.1)

HSP 1 Score: 419.468 bits (1077), Expect = 1.252e-122
Identity = 201/203 (99.01%), Positives = 202/203 (99.51%), Query Frame = 2
Query: 1286 MELRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM 1894
            M+LRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKN SSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM
Sbjct:    1 MDLRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNSSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM 203          

HSP 2 Score: 268.085 bits (684), Expect = 8.626e-73
Identity = 126/128 (98.44%), Positives = 127/128 (99.22%), Query Frame = 3
Query: 2490 KIWTNPEDEAGFGGVAKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 2873
            KIWTNPEDEAGFGGVAKLKKRVP SKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARG+PTKTKSAEEISKAINSMFKNGHPDNLQTDLG
Sbjct:  204 KIWTNPEDEAGFGGVAKLKKRVPNSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGLPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 331          
BLAST of Protein CBG22824 vs. Planmine SMEST
Match: SMESG000076042.1 (SMESG000076042.1)

HSP 1 Score: 419.853 bits (1078), Expect = 1.581e-122
Identity = 199/203 (98.03%), Positives = 201/203 (99.01%), Query Frame = 2
Query: 1286 MELRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM 1894
            MELRND VYQIVGPSGSGKTMFVCKLLKSNLFQT FNKIYWH+GADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQ+ANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM
Sbjct:    1 MELRNDEVYQIVGPSGSGKTMFVCKLLKSNLFQTNFNKIYWHKGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQKANKEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKEGAM 203          

HSP 2 Score: 268.47 bits (685), Expect = 6.421e-73
Identity = 126/128 (98.44%), Positives = 127/128 (99.22%), Query Frame = 3
Query: 2490 KIWTNPEDEAGFGGVAKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 2873
            KIWTNPEDEAGFGGV KLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARG+PTKTKSAEEISKAINSMFKNGHPDNLQTDLG
Sbjct:  204 KIWTNPEDEAGFGGVVKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKRFPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGLPTKTKSAEEISKAINSMFKNGHPDNLQTDLG 331          
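The per-HSP statistics listed above follow a fixed textual layout, so they can be read programmatically. The sketch below is one possible way to do that with regular expressions; the function name and the returned dictionary keys are hypothetical, and the patterns are written only against the line layout shown on this page, not against any official BLAST output specification. The percentages are recomputed from the counts as a consistency check.

```python
import re

# Patterns written against the HSP lines as they appear on this page.
HSP_RE = re.compile(r"HSP (\d+) Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\), "  # tolerates a misspelled label
    r"Query Frame = (-?\d+)"
)

def parse_hsp(score_line, stats_line):
    """Return the numeric fields of one HSP as a dictionary."""
    m1 = HSP_RE.search(score_line)
    m2 = STATS_RE.search(stats_line)
    if m1 is None or m2 is None:
        raise ValueError("unrecognised HSP lines")
    identical, aligned = int(m2.group(1)), int(m2.group(2))
    positives = int(m2.group(4))
    return {
        "hsp": int(m1.group(1)),
        "bit_score": float(m1.group(2)),
        "raw_score": int(m1.group(3)),
        "evalue": float(m1.group(4)),
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned, 2),
        "frame": int(m2.group(7)),
    }

# First SwissProt HSP from the listing above.
hit = parse_hsp(
    "HSP 1 Score: 56.225 bits (134), Expect = 1.895e-7",
    "Identity = 41/130 (31.54%), Positives = 63/130 (48.46%), Query Frame = -2",
)
print(hit)
```

For this HSP the recomputed values agree with the percentages printed in the report.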
The following BLAST results are available for this feature:
BLAST of Protein CBG22824 vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 0
BLAST of Protein CBG22824 vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
BLAST of Protein CBG22824 vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
BLAST of Protein CBG22824 vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 0
BLAST of Protein CBG22824 vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 0
BLAST of Protein CBG22824 vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 0
BLAST of Protein CBG22824 vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 4
Match Name           E-value   Identity (%)  Description
sp|A6UTF6|TBP_META3  1.895e-7  31.54         TATA-box-binding protein OS=Methanococcus aeolicus...
sp|A6URP5|TBP_METVS  3.439e-6  32.20         TATA-box-binding protein OS=Methanococcus vannieli...
sp|Q6M0L3|TBP_METMP  3.472e-6  32.20         TATA-box-binding protein OS=Methanococcus maripalu...
sp|Q9P9I9|TBP_METTL  7.396e-6  33.33         TATA-box-binding protein OS=Methanothermococcus th...
BLAST of Protein CBG22824 vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match Name  E-value    Identity (%)  Description
A0A2L2YU97  5.691e-38  41.33         Uncharacterized protein (Fragment) OS=Parasteatoda...
A0A4Y2QW74  1.801e-36  41.38         Uncharacterized protein OS=Araneus ventricosus OX=...
A0A3C1P802  8.501e-36  55.04         Uncharacterized protein OS=Planktothrix sp. UBA840...
A0A1Y1KS04  2.499e-33  39.29         Uncharacterized protein OS=Photinus pyralis OX=705...
A0A0J7MMC4  2.415e-32  37.24         Uncharacterized protein (Fragment) OS=Lasius niger...
BLAST of Protein CBG22824 vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 1
Match Name            E-value    Identity (%)  Description
ENSAMXT00000055500.1  7.599e-19  36.76         pep primary_assembly:Astyanax_mexicanus-2.0:APWO02...
BLAST of Protein CBG22824 vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 0
BLAST of Protein CBG22824 vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
BLAST of Protein CBG22824 vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 0
BLAST of Protein CBG22824 vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 0
BLAST of Protein CBG22824 vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 5
Match Name        E-value     Identity (%)  Description
SMESG000051837.1  4.012e-131  99.01         SMESG000051837.1
SMESG000065288.1  4.173e-125  100.00        SMESG000065288.1
SMESG000029226.1  3.693e-124  99.51         SMESG000029226.1
SMESG000062279.1  1.252e-122  99.01         SMESG000062279.1
SMESG000076042.1  1.581e-122  98.03         SMESG000076042.1
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30009376 ID=SMED30009376|Name=Protein CBG22824|organism=Schmidtea mediterranea sexual|type=transcript|length=3674bp
GATAATAATAGTAATAATAATAATAGTAATAATAAATTACCATAATGTTT
AATTTCATTCATAACTTGAACCGAGCTCGGTAGATCCATTTTAATGATAA
AAATTGAAATAACAATTACTTTTATTCTAATTTAAACCAATGAGAAGTTA
AACACACACATTTAGCTATAAACAATAAATATTAATTTATATAGATTTGA
AAATGATTAAAGAAATTTTAATAATATTATTAGTTTTTAATATGGCCAAC
TCAATTGTTGTAACCGTTCAAGGTAAAAATTCAACAAGATGTGTATGTCC
TGATACAGCAGTATTGATGGATATTTTGTATATGATAGATAGGCCTAGAT
ATTTGAAAACTCCAGTCAAAAGATTATCTGAAAGAATGAGTAATTTAGAA
TTCCGATTTGAACATACATATGAGACTTTAAAAAGAGCTAAAAACGAATT
ATATGATGATATTATATATCTAAATAAGACCATTGGTATTATCAAAAATC
AATTAAGAAAATTAATATTAGAATTGAATATGAGCTGTGGATACATTGTA
GGCTCTTGATTCTTCAAAATATCATTCATTTGAGGATATTAATAAATAAT
TCGGTCAATTACCATAGGAGCATATGTGTCTAAATAAAATATAAAAAATA
ATAAATTTACCATAACTTTCATTCATTGAATTTATCATCGGCATAGTATT
ATCAAAAATGAAATAACCAAGTTGTTTTAATCATTCATTAACCAATCAGA
TATCGACCAATCCCATGATTTCATTATGTATGTCTGTAACAATATTTTTG
TACTCTAGATTTCGCAACCCTAATATGACTATTTTCCCACTGGCAAACAC
GTTAACACACATTGGGTTGTATTTTAAAAGCCTCAGTGCAGGAAATAATT
CTGGCTCGAACATACAAAGACATTTTGTTGACATGTTGTATAGATTGATA
GATTGTCCTATATCCATGGTAACTGTAATTGACTGTATTTTAAGATCCTT
TATTCTATATTGTAATTTATTGACATCTAAAGGTTTCTTACATCCCATGA
TTCTGCATTTGCCGGATTTGAAAAATATTATAGGGTATTTTCCACTCCTA
TCTACAATTTGCTGGGGTTTACCATTTGGAAACGTCATTTTTGATACATC
GAAAGTTCCTTTATAATTTATATTTGATAACATAATTAAACAATAAAGAA
ATATTAAGCTATATATATGAGTTTTTAGGAAGAGTTCATTAAATCATATA
ATATAAACAAAAACCTATAATTATACAAATTAATAATGGAATTAAGAAAT
GATGCGGTATATCAAATCGTCGGCCCAAGTGGTAGTGGTAAAACTATGTT
TGTGTGTAAATTACTGAAATCTAATTTATTTCAAACAAAATTCAACAAGA
TTTATTGGCATAGGGGTGCAGATGAAGAACATGGATTAACTCAAGATAAT
TTCTGTAAATTGAAAAACATGAAAATAGTAAAGGGTTTCGATAAAAACTG
GTCAAGTCGTTTAAGGAAAGGTGATGTCATCATTATTGATGATTTGTATC
AGGAAGCTAATAAAGAAAAGGATTTTAATAATTTATTCACAAAAATCAGT
AGACATGTTGGTGTTACTGTGATCTTTATTACTCAAAATCTATTTCATCA
GGGTGGCGCACATCGAACAAGGAATTTAAATGTTCAATATTTAGTTATTT
TCAAGAATCCTAGAGATGCAACAGTTATTGATTTTCTTGCTCGACAAGCA
TACCCTAATAATCGTAATTTTCTCATCAGTGCATTTCAAGATGCTACAAA
ATCACCACATGGATATATATTTCTTGATTTCACACAACAATGCAATGATG
ATTTACGGGTAAAAACAGATATTTTCAATAAAGAAGGTGCCATGGTATAT
AAACAAAGTTGAAAACTTTAACATGTAAAAATGTTTATGAAATCCAGGCT
TAAAAATATCCGAAAACTTAGACAAAGTATTAAAAAGAAAATAAAGACTG
AAAAAGTTCCACCAAAAACTAGTTATACACCATCTAAAAAACTAATAACC
GAGAATTACCCTGCAATGAATCTAATTTCGAAAAACATTATTAAAAAGAG
TAAGAAAACAGATAAAATTGTAAAAAATCATAGGAGTTTCCCATATTTTA
ATGCATTGTTAAAGGCTTCGAGTATGAAAAGAATGTCTATTTTACAATCA
TTTCCAACTTTTGTTGTCGATGATTTACTCAAAATTCTATTAAAAGTAGT
CAGAGGTAAAATTAAAATCAGTAATTCTAAAAAACTAGTATTGAATAAAC
ATCGTAAGCCTTTGTTATCACTAGTAAATAATAAAAATCGTAAGCAAATG
AGAAAAATCATATATAAACAACAAGGTGGTTTTATCGGAGCAATGTTACC
ATTAGCATTATCATTATTATCACAATAAACGATGGGGCAATCATTACCTA
ATGTTAAATTAAAATTTAAATCATCTTGTTGTAATGGAGAAGATTTGGAC
GAACCCAGAAGACGAAGCAGGTTTCGGTGGAGTGGCAAAGTTAAAGAAAA
GAGTCCCAAAGTCGAAAAAGGAAACGCAAAAGTGGTTGTCAGATCAGCTT
GCGTACAGTTTAAACAAACCAATGCGGAAAAGATTTCCAACGAGAGCTTA
TAAGACATTCGGTATTAATGATTTATGGCAAATGGATTTGATGGAGATGA
TACCGTATTCGAAGATTAACAAGGGTTATAAATATATTTTAACGTGTATT
GATGTTTTCAGTCGTTTTGCTCGCGGTGTACCAACAAAGACAAAATCTGC
AGAGGAAATATCTAAAGCAATTAACTCAATGTTTAAAAATGGGCATCCAG
ATAATTTGCAAACCGATTTAGGTATTATTATTATTATTATTATTACAAAT
AATTCATAATCTATATGGTAAAATGTCTAACAATTGCTTTCTAATATTCC
TCGCGTGTATTTAACATTTCAGGCATTCTTATTTTGTTAGAATTATTGTT
TAAATAAACAAAAATATGTAAAGTTTTCATTAAAACGATTATTTGAAGAA
TTTTCAAGGCGCAGTTGTGTTTATTTAATATATTACCTGCATTCTTCTGT
TGCCTTTCAAATCTTATCTATCGACGTCTTTAATGTTTTATTTTAAATGC
GATTTTTGACGGAATAAACATTTTTCTTTTAGTTTTATGTCACTAATGTT
CAAAACGTTAATATCTATATATTGCCAATTTGGCTTGAACAATTTCCTAA
TCATTTCAATTTCAATACATTAGAATAATATAAAATATCAATGGGAAAGT
GAAAGCTCTCTACAGTTTTAGCCACATGTAATGCCACTAATTTTTTTAAT
TAAAGGATGATATTTTCACTTAATTATTCCTACAATCGTTTGCTTTTATT
ATTAATAGAATAAGATTCGGATGCCATCTATATATTTGTAATGTATTTTG
GTAAGGTATTTTAACAGCAATTTCAAATGCAGAAGTTTGTTCACATTTCC
ACTACCACCCTCTCTCTAAACCTTGTGACATGAACACCATAATCATTGGT
AAATATTTTGTTTTAGGTTTGGATTAGAAAAAATACATCAAAAATTTTGT
ATTTATGAATTTTATATTTGTTTTCTTAATTTGCGAATTGTTGGAAATTA
TTTTGGGCTTGACGAATTATATTT
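The FASTA header of this record packs its metadata into pipe-separated key=value pairs. A minimal sketch of how such a header could be split into fields follows; the function name is hypothetical, and the layout is inferred only from the headers shown on this page.

```python
def parse_fasta_header(header):
    """Split a '>name key=value|key=value|...' header into (name, fields)."""
    name, _, rest = header.lstrip(">").partition(" ")
    fields = dict(item.split("=", 1) for item in rest.split("|") if "=" in item)
    return name, fields

header = (">SMED30009376 ID=SMED30009376|Name=Protein CBG22824|"
          "organism=Schmidtea mediterranea sexual|type=transcript|length=3674bp")
name, fields = parse_fasta_header(header)

# Strip the unit suffix to obtain the length as an integer.
length_text = fields["length"]
length_bp = int(length_text[:-2]) if length_text.endswith("bp") else int(length_text)
print(name, fields["type"], length_bp)
```

The parsed length can be compared with the number of bases in the sequence lines as a quick integrity check.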

protein sequence of SMED30009376-orf-1

>SMED30009376-orf-1 ID=SMED30009376-orf-1|Name=SMED30009376-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=119bp
MIKEILIILLVFNMANSIVVTVQGKNSTRCVCPDTAVLMDILYMIDRPRY
LKTPVKRLSERMSNLEFRFEHTYETLKRAKNELYDDIIYLNKTIGIIKNQ
LRKLILELNMSCGYIVGS*

protein sequence of SMED30009376-orf-2

>SMED30009376-orf-2 ID=SMED30009376-orf-2|Name=SMED30009376-orf-2|organism=Schmidtea mediterranea sexual|type=polypeptide|length=209bp
MELRNDAVYQIVGPSGSGKTMFVCKLLKSNLFQTKFNKIYWHRGADEEHG
LTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLYQEANKEKDFNNL
FTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIFKNPRDATVIDF
LARQAYPNNRNFLISAFQDATKSPHGYIFLDFTQQCNDDLRVKTDIFNKE
GAMVYKQS*

protein sequence of SMED30009376-orf-3

>SMED30009376-orf-3 ID=SMED30009376-orf-3|Name=SMED30009376-orf-3|organism=Schmidtea mediterranea sexual|type=polypeptide|length=142bp
MEKIWTNPEDEAGFGGVAKLKKRVPKSKKETQKWLSDQLAYSLNKPMRKR
FPTRAYKTFGINDLWQMDLMEMIPYSKINKGYKYILTCIDVFSRFARGVP
TKTKSAEEISKAINSMFKNGHPDNLQTDLGIIIIIIITNNS*

protein sequence of SMED30009376-orf-4

>SMED30009376-orf-4 ID=SMED30009376-orf-4|Name=SMED30009376-orf-4|organism=Schmidtea mediterranea sexual|type=polypeptide|length=166bp
MFMKSRLKNIRKLRQSIKKKIKTEKVPPKTSYTPSKKLITENYPAMNLIS
KNIIKKSKKTDKIVKNHRSFPYFNALLKASSMKRMSILQSFPTFVVDDLL
KILLKVVRGKIKISNSKKLVLNKHRKPLLSLVNNKNRKQMRKIIYKQQGG
FIGAMLPLALSLLSQ*

protein sequence of SMED30009376-orf-5

>SMED30009376-orf-5 ID=SMED30009376-orf-5|Name=SMED30009376-orf-5|organism=Schmidtea mediterranea sexual|type=polypeptide|length=146bp
MLSNINYKGTFDVSKMTFPNGKPQQIVDRSGKYPIIFFKSGKCRIMGCKK
PLDVNKLQYRIKDLKIQSITVTMDIGQSINLYNMSTKCLCMFEPELFPAL
RLLKYNPMCVNVFASGKIVILGLRNLEYKNIVTDIHNEIMGLVDI*
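The ORFs above and the "Query Frame" values in the BLASTX results both depend on how the transcript is read in triplets. The sketch below translates a short stretch of the transcript with the standard genetic code and derives the frame number from 1-based query coordinates. The function names are hypothetical, and the frame arithmetic assumes the usual convention in which frame 1 starts at the first base and frame -1 starts at the last base of the reverse strand.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def translate(dna):
    """Translate complete codons from the first base; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))

def forward_frame(start):
    """Frame (1, 2 or 3) of a forward-strand hit starting at 1-based position start."""
    return (start - 1) % 3 + 1

def reverse_frame(end, seq_len):
    """Frame (-1, -2 or -3) of a reverse-strand hit whose highest coordinate is end."""
    return -((seq_len - end) % 3 + 1)

# First 36 bases of the transcript region starting at position 1286.
fragment = "ATGGAATTAAGAAATGATGCGGTATATCAAATCGTC"
peptide = translate(fragment)
print(peptide, forward_frame(1286), forward_frame(2490), reverse_frame(1090, 3674))
```

The translated fragment corresponds to the start of SMED30009376-orf-2, and the computed frames match the "Query Frame" values reported for the hits at query positions 1286, 2490 and 770 to 1090.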
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
Term           Definition
PLANA:0000070  intestinal phagocyte
PLANA:0000074  oocyte
PLANA:0000099  neuron
PLANA:0000136  whole organism
PLANA:0000231  vitelline gland
PLANA:0002089  reproductive organ
PLANA:0002142  parenchyma
Vocabulary: INTERPRO
Term       Definition
IPR027417  P-loop_NTPase
IPR000605  Helicase_SF3_ssDNA/RNA_vir
IPR012337  RNaseH-like_sf
IPR001584  Integrase_cat-core
IPR000814  TBP
IPR012295  TBP_dom_sf
Vocabulary: molecular function
Term        Definition
GO:0003724  RNA helicase activity
GO:0003723  RNA binding
GO:0003676  nucleic acid binding
GO:0003677  DNA binding
Vocabulary: biological process
Term        Definition
GO:0015074  DNA integration
GO:0006352  DNA-templated transcription, initiation