C2H2-type domain-containing protein

Overview
NameC2H2-type domain-containing protein
Smed IDSMED30020076
Length (bp)3252
Neoblast Clusters

Zeng et. al., 2018




▻ Overview

▻ Neoblast Population

▻ Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 



 

Overview

 

Single cell RNA-seq of pluripotent neoblasts and its early progenies


We isolated X1 neoblasts cells enriched in high piwi-1 expression (Neoblast Population), and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes from 7,088 high-quality cells. We designated these classes Nb1 to Nb12 and ordered them based on high (Nb1) to low (Nb12) piwi-1 expression levels. We further defined groups of genes that best classified the cells parsed into 12 distinct cell clusters to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signatures was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1 and largely confirmed the cell clusters revealed by scRNA-seq.

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, followed by isolation and single-cell RNA-seq of 1,200 individual cells derived from X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population)




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

t-SNE plot shows two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filter). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Neoblast groups, Nb.


Expression of C2H2-type domain-containing protein (SMED30020076) t-SNE clustered cells

Violin plots show distribution of expression levels for C2H2-type domain-containing protein (SMED30020076) in cells (dots) of each of the 12 neoblast clusters.

 

back to top


 

Sub-lethal Irradiated Surviving X1 and X2 Cell Population

 

t-SNE plot of surviving X1 and X2 cells (n = 1,039 after QC filter) after sub-lethal irradiation. Colors indicate unbiased cell classification via graph-based clustering. SL, sub-lethal irradiated cell groups.

Expression of C2H2-type domain-containing protein (SMED30020076) in the t-SNE clustered sub-lethally irradiated X1 and X2 cells.

Violin plots show distribution of expression levels for C2H2-type domain-containing protein (SMED30020076) in cells (dots) of each of the 10 clusters of sub-leathally irradiated X1 and X2 cells.

 

back to top


 

Embryonic Expression

Davies et. al., 2017




Hover the mouse over a column in the graph to view average RPKM values per sample.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods

 

back to top
Anatomical Expression

PAGE et. al., 2020




SMED30020076

has been reported as being expressed in these anatomical structures and/or regions. Read more about PAGE



PAGE Curations: 1

  
Expressed InReference TranscriptGene ModelsPublished TranscriptTranscriptomePublicationSpecimenLifecycleEvidence
protonephridiaSMED30020076SMESG000055233.1 dd_Smed_v4_14364_0_1dd_Smed_v4PMID:29674431
Fincher et al., 2018
FACS sorted cell population asexual adult single-cell RNA-sequencing evidence
Note: Hover over icons to view figure legend
Homology
BLAST of C2H2-type domain-containing protein vs. Ensembl Fly
Match: CG31510 (gene:FBgn0051510 transcript:FBtr0084783)

HSP 1 Score: 53.1434 bits (126), Expect = 1.440e-6
Identity = 46/215 (21.40%), Postives = 91/215 (42.33%), Query Frame = 3
Query: 1236 LLNNEDISKASGTFTCKLCNVTCSSQKDFK--SHLNGEKHSSKNTNEEPIALLISDEKKIVKDR----VISKPIKKPVARIQILLDQFTSP---------------------LIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYEL----VEHDPSSKA--VRMKRLEVYSDQVELFEGRKKMNIMIEKL 1781
            ++ N   +K S     KL N T   +KD      L  E      T ++PI +  S+ K ++  +    V+++P++  + ++  +   + +P                     L+G++YV +  K  + +++ Y C LC    D +S+ +H+  Y HR  Y  +++     L    V H P S+   + M   +  +  +E   GRK  ++  E +
Sbjct:  349 IVKNPLATKISCVPLAKLINSTPEREKDLVIIDDLPKEATKPPETPKKPIPVQTSNPKPVIAAKKPINVVAQPVRPTINKV--VKPSYVAPKDYTTARPGGTFESRKGHVMGLVGVEYVLKIVKTLADNNARYQCCLCEITADEQSMHNHLLGYNHRLKYFDKHFPTAMRLYRQYVSHVPESEVCKIMMPIFDKLAHAIETHHGRKNAHLCYEHV 561          
BLAST of C2H2-type domain-containing protein vs. Ensembl Xenopus
Match: ssrp1 (structure specific recognition protein 1 [Source:Xenbase;Acc:XB-GENE-994294])

HSP 1 Score: 68.1662 bits (165), Expect = 1.005e-11
Identity = 58/178 (32.58%), Postives = 80/178 (44.94%), Query Frame = 3
Query: 1275 FTCKLCNVTCSSQKDFKSHLNGEKH-----SSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQF-----TSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRM----KRLEVYSDQVELFEGRKKMNI 1766
            F C +C +TC+S  + +SH  G KH     S KN   E   L      K  K + +  P  +P    Q L D       T P +GL+YV EY + ES S   Y C LCN +     +  H+   KHR  YL +++  L       PS+  V+     K+L+     VE   GRK +NI
Sbjct:   14 FWCNICRITCASALNLQSHFMGFKHRQVEESLKNNGAEKPVL------KRKKRQFLDDP--EPAVAGQTLDDLLRACKETEPALGLEYVYEYHQDES-SYRAYECRLCNLQTGLAHMFMHVVGAKHRIAYLSKHHSTL-----GIPSTFQVKSSTKNKKLKDACFTVEKTFGRKSINI 177          
BLAST of C2H2-type domain-containing protein vs. Ensembl Xenopus
Match: ssrp1 (structure specific recognition protein 1 [Source:Xenbase;Acc:XB-GENE-994294])

HSP 1 Score: 67.3958 bits (163), Expect = 1.041e-10
Identity = 58/178 (32.58%), Postives = 80/178 (44.94%), Query Frame = 3
Query: 1275 FTCKLCNVTCSSQKDFKSHLNGEKH-----SSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQF-----TSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRM----KRLEVYSDQVELFEGRKKMNI 1766
            F C +C +TC+S  + +SH  G KH     S KN   E   L      K  K + +  P  +P    Q L D       T P +GL+YV EY + ES S   Y C LCN +     +  H+   KHR  YL +++  L       PS+  V+     K+L+     VE   GRK +NI
Sbjct:   14 FWCNICRITCASALNLQSHFMGFKHRQVEESLKNNGAEKPVL------KRKKRQFLDDP--EPAVAGQTLDDLLRACKETEPALGLEYVYEYHQDES-SYRAYECRLCNLQTGLAHMFMHVVGAKHRIAYLSKHHSTL-----GIPSTFQVKSSTKNKKLKDACFTVEKTFGRKSINI 177          
BLAST of C2H2-type domain-containing protein vs. TrEMBL
Match: A0A5K4F158 (Uncharacterized protein OS=Schistosoma mansoni OX=6183 PE=4 SV=1)

HSP 1 Score: 201.06 bits (510), Expect = 8.781e-49
Identity = 123/370 (33.24%), Postives = 192/370 (51.89%), Query Frame = 3
Query:  693 DPQAMIQNYFKSYTETGNWSWTGWINCVQKIQLTYPQWFEMFNEFSRNMGINWNEMF---VSDLQAIDMRNVRIDMDTLDLTNKF----FDKSAASEMPKEGTSVIFTDDMQSRNTTETAHSELFCDACNQGFRSTRYFHIHLRGIKHIQNEIKYVSKQKPEIFLEEINRENASIMSDTNKEDKHMKWL----LNN-EDISKASGTFTCKLCNVTCSSQKDFKSHLNGEKHSSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRMKRLEVYSDQVELFEGRKKMNI 1766
            DP A+++ YF  Y + G+W W  W N +  I  T PQW+E+F   SR +GI+W++M+   VS +      N    M        F    +    ++++P      +  DD       ++  S+ +C  C Q F + R F +H+RG+ H Q  ++             IN  +++++ + N E KH +WL    LN    I   SG+F C+LC V   S K   SHLNG +H      E       + ++ ++K +  ++   K  A+IQ  LD  T PLIGL+Y+ EYQ +E L +  Y C LCN    +KS+I+H+ S  HR TYL  YY  L+ +++ D S K+++  RLEVY+ ++E FEGRK++ I
Sbjct:  190 DPNAVMKEYFDRYVKAGDWDWASWNNYLSWIARTNPQWYEIFVNLSRGLGIDWDDMYKKWVSSVSGDSSCNKNPPMSYYSCDQNFSFANYLGLGSTDIPTVCKQNVENDDYD-----DSEGSQHYCVTCKQDFGTERAFMLHIRGVTHTQRTLQSSG----------INSFDSTVLEE-NWEVKHHEWLKSQYLNGLNSIVGLSGSFYCELCEVEFPSHKALGSHLNGRRH-----RENVFIFESTGDRSLLKGKRQAQ--VKVTAKIQPFLDVCTQPLIGLNYMIEYQLQE-LDECLYICSLCNQWLPKKSVINHLCSIVHRKTYLNTYYLPLFRIIDRDYSDKSLQACRLEVYARKIEDFEGRKRLII 535          
BLAST of C2H2-type domain-containing protein vs. TrEMBL
Match: A0A3R7G9E6 (C2H2-type domain-containing protein OS=Clonorchis sinensis OX=79923 GN=CSKR_1485s PE=4 SV=1)

HSP 1 Score: 191.43 bits (485), Expect = 1.455e-45
Identity = 113/368 (30.71%), Postives = 194/368 (52.72%), Query Frame = 3
Query:  693 DPQAMIQNYFKSYTETGNWSWTGWINCVQKIQLTYPQWFEMFNEFSRNMGINWNEMFVSDLQAIDMRNVRIDMDTLDLTNKFFDKSAASE-----MPKEGTSVIFTDDMQSRNTTETAHSELFCDACNQGFRSTRYFHIHLRGIKHIQNEIKYVSKQKPEIFLEEINRENASIMSDTNKEDKHMKWLLN-----NEDISKASGTFTCKLCNVTCSSQKDFKSHLNGEKHSSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRMKRLEVYSDQVELFEGRKKMNI 1766
            +PQA+++ YF+ + E G W W  W N +  I  T PQW+ +F E+S+++GI+W++M+ S  +  D            LT + FD+ A S       P   T  ++   + +    +     LFC  C   F + R F IHLRG+ HIQ  ++ + ++ PE  +++   + +    D     +H  WL +     N+      G+  C+LC V  SS    ++HL+G +H  +N ++    L  S    +++++  S+   K  AR+Q LLD  T PLIGL+Y+ E Q  +   +  YYCELC+ +  R+  I+H+    HR  Y+K +Y D+  +V  D S   +R+KR+++++ ++E FEGRK++ +
Sbjct:  321 NPQAVMEEYFERFGEIGKWDWASWNNYLSWIARTNPQWYGIFLEYSQSLGIDWDQMY-SQWKQSD-----------PLTRQSFDEFADSVDGFSIQPPNLTQSVYHGLLFTETADDDDDDGLFCRTCQMRFNTDRAFSIHLRGVTHIQRALENLQRRNPE-NIDKYTEQYSQFNPDAGGA-RHYAWLQSQYMEKNKSAFIRPGSLYCELCQVGSSSFASLQAHLSGRRHR-ENVSQ----LQSSGGPNLLREKRPSQ--AKSSARLQTLLDVCTQPLIGLNYIVERQFGDD-EECMYYCELCDDRITRQDAINHVCGVAHRTHYMKAHYPDMCGIVASDQSDLDMRLKRIDLFARKIEDFEGRKRVKV 666          
BLAST of C2H2-type domain-containing protein vs. TrEMBL
Match: A0A4S2LP79 (C2H2-type domain-containing protein OS=Opisthorchis felineus OX=147828 GN=CRM22_005834 PE=4 SV=1)

HSP 1 Score: 191.045 bits (484), Expect = 1.609e-45
Identity = 110/365 (30.14%), Postives = 192/365 (52.60%), Query Frame = 3
Query:  693 DPQAMIQNYFKSYTETGNWSWTGWINCVQKIQLTYPQWFEMFNEFSRNMGINWNEMFVSDLQAIDMRNVRID--MDTLDLTNKFFDKSAASEMPKEGTSVIFTDDMQSRNTTETAHSELFCDACNQGFRSTRYFHIHLRGIKHIQNEIKYVSKQKPEIFLEEINRENASIMSDTNKEDKHMKWLLN-----NEDISKASGTFTCKLCNVTCSSQKDFKSHLNGEKHSSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRMKRLEVYSDQVELFEGRKKMNI 1766
            +PQA+++ YF+ + E G W W  W N +  I  T PQW+ +F E+S+++GI+W++M+    Q+  +     D   D++D           S  P   T  ++   + +    +     LFC  C   F + R F IHLRG+ HIQ  ++ + ++ PE  +++   + +    D     +H  WL +     N+      G+  C+LC V  SS    ++HL+G +H  +N ++    L  S    +++++  S+   K  AR+Q LLD  T PLIGL+Y+ E Q  +   +  YYCELC+ +  R+  I+H+    HR  Y+K +Y D+  +V  D S   +R+KR+++++ ++E FEGRK++ +
Sbjct:  185 NPQAVMEEYFERFGEIGKWDWASWNNYLSWIARTNPQWYGIFLEYSQSLGIDWDQMYSQWKQSDPLTRQSFDEFADSVD---------GISIQPPNLTQSVYHGLLFTETADDDDDDGLFCRTCQMRFNTDRAFSIHLRGVTHIQRALENLQRRNPE-NIDKYTEQYSQFNPDAGGA-RHYAWLQSQYMEKNKSAFIRPGSLYCELCQVGSSSFASLQAHLSGRRHR-ENVSQ----LQSSSGPNLLREKRPSQ--AKSSARLQTLLDVCTQPLIGLNYIVERQFGDD-EECMYYCELCDDRITRQDAINHVCGVAHRTHYMKAHYPDMCGIVASDQSDLDMRLKRIDLFARKIEDFEGRKRVKV 530          
BLAST of C2H2-type domain-containing protein vs. TrEMBL
Match: A0A5J4NLK3 (C2H2-type domain-containing protein OS=Paragonimus westermani OX=34504 GN=DEA37_0000148 PE=4 SV=1)

HSP 1 Score: 191.045 bits (484), Expect = 1.648e-45
Identity = 121/385 (31.43%), Postives = 194/385 (50.39%), Query Frame = 3
Query:  693 DPQAMIQNYFKSYTETGNWSWTGWINCVQKIQLTYPQWFEMFNEFSRNMGINWNEMF----VSDLQAIDMRNVRIDMDTLDLTNKFFDK------SAASEMPKEGT-----SVIFTDDMQSRNTTETA---HSELFCDACNQGFRSTRYFHIHLRGIKHIQNEIKYVSKQKPEIFLEEINRENASIMSDTNKEDKHMKWLLN-----NEDISKASGTFTCKLCNVTCSSQKDFKSHLNGEKH----SSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRMKRLEVYSDQVELFEGRKKMNI 1766
            DP+A+++ YF+ + +TG W W  W N +  I  T PQW+ +F E+S+++GI+W++M+    +SD                 LT + FD          S  P   T      ++F D  Q +   +      +  FC  C   F + R F +HLRG+ HIQ  ++ + ++ PE     +++  A    D N   +H  WL       N+      G+  C+LC V  SS    ++HLNG +H    S  N++ +P +L         KD+   +   K  AR+Q LLD  T PLIGL+Y+ E Q  +   +  Y+CELC+ +  R+  I H+ S  HR TY+K +Y D+  +V  D SS +VR KR++ ++ ++E FEGRK++ +
Sbjct:  190 DPRAVMEQYFERFGDTGKWDWASWNNYLSWIARTNPQWYTIFLEYSQSLGIDWDQMYDRWKMSD----------------PLTRQSFDDEFVGPVGVVSLQPPHLTQPVYHGLLFGDAFQLQQQDDDPIEDQNGFFCRTCMLRFDTDRAFTLHLRGVSHIQRALEALQRRNPE----HLDKYPAQQHED-NGGARHYAWLQTQYMQKNKSEFVVPGSLYCELCQVHSSSYASLQAHLNGRRHRENVSLFNSSGDPASL---------KDKQSGQ--SKVTARLQTLLDVCTQPLIGLNYIVERQFGDD-EECVYHCELCDERITRQDAIKHVCSVAHRTTYMKTHYPDMCGIVVQDCSSLSVRTKRIDFFARKIEDFEGRKRVKV 541          
BLAST of C2H2-type domain-containing protein vs. TrEMBL
Match: A0A4Z2CT95 (UBP1-associated proteins 1C OS=Schistosoma japonicum OX=6182 GN=EWB00_007742 PE=4 SV=1)

HSP 1 Score: 187.963 bits (476), Expect = 1.333e-44
Identity = 125/386 (32.38%), Postives = 189/386 (48.96%), Query Frame = 3
Query:  693 DPQAMIQNYFKSYTETGNWSWTGWINCVQKIQLTYPQWFEMFNEFSRNMGINWNEMFVSDLQAIDMRNVRIDMDTLDLTNKFFDKSAASE--MPKEGTSVIFTD-DMQSRNTTETA--------------------HSELFCDACNQGFRSTRYFHIHLRGIKHIQNEIKYVSKQKPEIFLEEINRENASIMSDTNKEDKHMKWL----LNNED-ISKASGTFTCKLCNVTCSSQKDFKSHLNGEKHSSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRMKRLEVYSDQVELFEGRKKMNI 1766
            DP A+++ YF  Y + G+W W  W N +  I  T PQW+E+F   SR +GI+W++M+     ++                   D S + +  MP   T   FT  +     TT+T                     +S  +C  C   F + R F +H+RG+ H Q  +     Q   I L +      S + + N E KH +WL    LN  D +   SG+F C+LC V   S K   SHLNG +H      E       + ++ +++ +  + P+K   A+IQ  LD  T PLIGL+Y+ EYQ +E L +  Y C LCN    RKS+I+H+ S  HR TYL   Y  L+ +++ D S ++++  RLEVY+ ++E FEGRK++ I
Sbjct:  167 DPNAVMKEYFDRYVKAGDWDWASWNNYLSWIARTNPQWYEIFVNLSRGLGIDWDDMYKRWASSVSG-----------------DPSCSEDVLMPHCSTGQSFTTANYFGIGTTQTTIASEQDVGNNDDGDDDDDNEYSWNYCTTCKHDFVTERAFMLHIRGVTHTQKAL-----QSNGITLFD------STVLEENWETKHHEWLKSQYLNELDTVVGLSGSFYCELCEVEFPSHKALGSHLNGRRH-----RENVFIFESTGDRSLLRGKRKT-PVKV-TAKIQPFLDVCTQPLIGLNYMIEYQLKE-LDECLYVCSLCNQWLPRKSVINHLCSIVHRKTYLNTCYLPLFRIIDRDFSDRSLQTCRLEVYARKIEDFEGRKRLVI 516          
BLAST of C2H2-type domain-containing protein vs. Ensembl Sea Lamprey
Match: ENSPMAT00000008581.1 (pep scaffold:Pmarinus_7.0:GL476642:829812:835988:-1 gene:ENSPMAG00000007770.1 transcript:ENSPMAT00000008581.1 gene_biotype:protein_coding transcript_biotype:protein_coding)

HSP 1 Score: 47.7506 bits (112), Expect = 3.413e-6
Identity = 34/125 (27.20%), Postives = 66/125 (52.80%), Query Frame = 3
Query: 1275 FTCKLCNVTCSSQKDFKSHLNGEKHSS-----KNTNEEPIALLISDEKKIVKDRVISKPIKKPVAR-IQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCELC--NCKCDRKSIISHITSYKHR 1625
             TC LCNVTC++ + FK+H+NG  H+      +  +   +A+L++ +   +    +S  + KP +R   +  + FT  ++       ++K ++L+    YC +C  N +  RK  + H++S +HR
Sbjct:   36 ITCHLCNVTCANNRHFKNHMNGAMHARCMQEVQQKSSVQVAMLLAQDGSNMLSSHLS--LGKPASRWCSMCKEYFTCNVVEHRRSQMHKKAKNLTRP--YCTICRWNTRSVRK-FVEHMSSEEHR 155          
BLAST of C2H2-type domain-containing protein vs. Planmine SMEST
Match: SMESG000055233.1 (SMESG000055233.1)

HSP 1 Score: 1935.61 bits (5013), Expect = 0.000e+0
Identity = 1004/1016 (98.82%), Postives = 1012/1016 (99.61%), Query Frame = 3
Query:  105 MLSKVIHHFFEISKNIPKEPKSKYKTKEFQNYKFKMQEYLTSYCCHLFDICNKMNNVTEQFKEAYNNLYSDFKSYNKFLKKFKCISQFNFEHNQDDGSMSPSNPCYTNFNNDNELEDIKSNNYENNDQNINLCDENFLNNFHNKYKVIKRFIXXXXXXXXXXXXXXXXXXXXXXXXXXXEETVANNVEPVHYNNDEDPQAMIQNYFKSYTETGNWSWTGWINCVQKIQLTYPQWFEMFNEFSRNMGINWNEMFVSDLQAIDMRNVRIDMDTLDLTNKFFDKSAASEMPKEGTSVIFTDDMQSRNTTETAHSELFCDACNQGFRSTRYFHIHLRGIKHIQNEIKYVSKQKPEIFLEEINRENASIMSDTNKEDKHMKWLLNNEDISKASGTFTCKLCNVTCSSQKDFKSHLNGEKHSSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRMKRLEVYSDQVELFEGRKKMNIMIEKLSINSPEIISGNSKSDQTDSKGDKISFLLXXXXXXXXXIKLSVTNEAQINLEDVDREDGEIESSDDEFCCDRKSESISNQEITKSVLKQEIFTELEAGXXXXXXXXXXXXXQCEQIEADINETTFIDKSEFNNYEPLYINNRIPFCAKSIIRDNGTLEFMNHRENKKVKLKEVSEDILNHFKQEAKWALDRLKEINITKNTYHKRNFHLPPSKNPVGPVYPGVSELSNKLQHEQKKXXXXXXTENRYMSPVTEFVDTKLSSVNNSLLHDVQNMLVNKVTNDPLNEIIRQEFLFSSPSKDVDQTISSSSNENNRKSILCSPIDLKLLNETLKTVQNFTEFSSNRAEINNSSDSVMFKKTTSNLEIGKKLSKHNEFLGMDQPPPPIFPKTNPDIYYKIPQYNPMTNIMPPFLTPGMIEHQTIYPAFDRNFVYSYPSIFYSNNNSNHAPSDIFNGYDRTNL 3152
            MLSKVIHHFFEISKNIPKEPKSKYKTKEFQNYKFKMQEYLTSYCCHLFDICNKMNNVTEQFKEAYNNLYSDFKSYNKFLKKFKCISQFNFEHNQDDGSMSPSNPCYTNF+NDNELEDIKSNNYENNDQNIN CDENFLNNFHNKYKVIKRFIKKEKHQVKIKKLSKSQKKKLRKKRKLREETVANN+E VHYNNDEDPQAMIQNYFKSYTETGNWSWTGWINCVQKIQLTYPQWFEMFNEFSRNMGINWNEMFVSDLQAIDMRNVRIDMDTLDLTNKFFDKSAASEMPKEGTSVIFTDD+QSRNTTET+HSELFCDACNQGFRSTRYFHIHLRGIKHIQNEIKYVSKQKPEIFLEEINRENASIMSDTNKEDKHMKWLLNNEDISKASGTFTCKLCNVTCSSQKDFKSHLNGEKHSSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRMKRLEVYSDQVELFEGRKKMNIMIEKLSINSPEIISGNSKSDQTDSKGDKISFLLNDEENNNDDIKLSVTNE+QINLEDVD+EDGEIESSDDEFCCDRKSESISNQEITKSVLKQEIFTELEAGEIENEIIDDDSDEQCEQIEADINETTFIDKSEFNNYEPLYINNRIPFCAK+IIRDNGTLEFMNHRENKKVKLKEVSEDILNHFKQEAKWALDRLKEINITKNTYHKRNFHLPPSKNPVGPVYPGVSELSNKLQHE+ KSSSSSSTENRYMSPVTEFVDTKLSSVNNSLLHDVQNMLVNKVTNDPLNEIIRQEFLFSSP KDVDQTISSSSNENNRKSILCSPIDLKLLNETLKTVQNFTEFSSNRAEINNSSDSVMFKKTTSNLEIGKKLSKHNEFLGMDQPPPPIFPKTNPDIYYKIPQYNPMTNIMPPFLTPGMIEHQTIYPAFDRNFVYSYPSIFYSNNNSNHAPSDIFNGYDRTNL
Sbjct:    1 MLSKVIHHFFEISKNIPKEPKSKYKTKEFQNYKFKMQEYLTSYCCHLFDICNKMNNVTEQFKEAYNNLYSDFKSYNKFLKKFKCISQFNFEHNQDDGSMSPSNPCYTNFSNDNELEDIKSNNYENNDQNINSCDENFLNNFHNKYKVIKRFIKKEKHQVKIKKLSKSQKKKLRKKRKLREETVANNIELVHYNNDEDPQAMIQNYFKSYTETGNWSWTGWINCVQKIQLTYPQWFEMFNEFSRNMGINWNEMFVSDLQAIDMRNVRIDMDTLDLTNKFFDKSAASEMPKEGTSVIFTDDVQSRNTTETSHSELFCDACNQGFRSTRYFHIHLRGIKHIQNEIKYVSKQKPEIFLEEINRENASIMSDTNKEDKHMKWLLNNEDISKASGTFTCKLCNVTCSSQKDFKSHLNGEKHSSKNTNEEPIALLISDEKKIVKDRVISKPIKKPVARIQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCELCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRMKRLEVYSDQVELFEGRKKMNIMIEKLSINSPEIISGNSKSDQTDSKGDKISFLLNDEENNNDDIKLSVTNESQINLEDVDKEDGEIESSDDEFCCDRKSESISNQEITKSVLKQEIFTELEAGEIENEIIDDDSDEQCEQIEADINETTFIDKSEFNNYEPLYINNRIPFCAKNIIRDNGTLEFMNHRENKKVKLKEVSEDILNHFKQEAKWALDRLKEINITKNTYHKRNFHLPPSKNPVGPVYPGVSELSNKLQHEKNKSSSSSSTENRYMSPVTEFVDTKLSSVNNSLLHDVQNMLVNKVTNDPLNEIIRQEFLFSSPCKDVDQTISSSSNENNRKSILCSPIDLKLLNETLKTVQNFTEFSSNRAEINNSSDSVMFKKTTSNLEIGKKLSKHNEFLGMDQPPPPIFPKTNPDIYYKIPQYNPMTNIMPPFLTPGMIEHQTIYPAFDRNFVYSYPSIFYSNNNSNHAPSDIFNGYDRTNL 1016          
The following BLAST results are available for this feature:
BLAST of C2H2-type domain-containing protein vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 1
Match NameE-valueIdentityDescription
CG315101.440e-621.40gene:FBgn0051510 transcript:FBtr0084783[more]
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 2
Match NameE-valueIdentityDescription
ssrp11.005e-1132.58structure specific recognition protein 1 [Source:X... [more]
ssrp11.041e-1032.58structure specific recognition protein 1 [Source:X... [more]
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of C2H2-type domain-containing protein vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of C2H2-type domain-containing protein vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A5K4F1588.781e-4933.24Uncharacterized protein OS=Schistosoma mansoni OX=... [more]
A0A3R7G9E61.455e-4530.71C2H2-type domain-containing protein OS=Clonorchis ... [more]
A0A4S2LP791.609e-4530.14C2H2-type domain-containing protein OS=Opisthorchi... [more]
A0A5J4NLK31.648e-4531.43C2H2-type domain-containing protein OS=Paragonimus... [more]
A0A4Z2CT951.333e-4432.38UBP1-associated proteins 1C OS=Schistosoma japonic... [more]
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 1
Match NameE-valueIdentityDescription
ENSPMAT00000008581.13.413e-627.20pep scaffold:Pmarinus_7.0:GL476642:829812:835988:-... [more]
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of C2H2-type domain-containing protein vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 0
Match NameE-valueIdentityDescription
back to top
BLAST of C2H2-type domain-containing protein vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 1
Match NameE-valueIdentityDescription
SMESG000055233.10.000e+098.82SMESG000055233.1[more]
back to top
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30020076 ID=SMED30020076|Name=C2H2-type domain-containing protein|organism=Schmidtea mediterranea sexual|type=transcript|length=3252bp
AATCGAACTGCTAAAATTCTTTTTATTTGTATATATATATCAAGTTTTGT
AGATCTAATAAAATGATAAGTAATTTTCAAAATGATGAATATTATACTAG
TGAAATGTTATCTAAAGTCATCCATCATTTTTTTGAAATTTCTAAAAATA
TACCGAAGGAACCAAAGTCCAAATACAAAACTAAGGAGTTTCAGAATTAT
AAATTCAAAATGCAAGAATATCTTACATCATATTGCTGCCATCTATTTGA
TATCTGTAATAAAATGAATAACGTCACAGAACAATTTAAAGAAGCTTATA
ATAATCTTTACAGTGATTTTAAATCTTACAACAAATTCTTAAAAAAATTT
AAATGTATAAGCCAATTCAATTTTGAACATAATCAAGATGATGGTTCTAT
GTCACCATCTAATCCATGCTACACAAATTTCAATAATGATAATGAACTGG
AAGATATTAAATCTAATAATTATGAGAATAATGATCAAAACATTAATTTA
TGCGATGAAAATTTTTTAAATAATTTTCACAATAAATATAAAGTAATAAA
ACGTTTCATAAAAAAAGAGAAACATCAAGTAAAAATCAAGAAATTATCAA
AGAGTCAGAAAAAGAAATTAAGAAAAAAACGGAAATTACGAGAAGAAACA
GTCGCAAATAATGTTGAACCAGTCCATTACAATAATGATGAAGATCCACA
AGCTATGATTCAAAATTACTTTAAATCATATACTGAAACGGGAAATTGGA
GTTGGACTGGCTGGATTAATTGCGTTCAAAAAATACAACTAACTTATCCA
CAATGGTTTGAAATGTTCAATGAATTCAGCAGAAATATGGGAATTAACTG
GAATGAAATGTTCGTGAGTGACCTTCAGGCCATCGATATGAGAAATGTTA
GAATCGATATGGATACATTGGACCTAACCAACAAATTCTTCGATAAATCA
GCAGCTAGTGAAATGCCGAAAGAGGGTACATCGGTGATATTCACTGATGA
TATGCAATCGCGCAACACCACAGAAACGGCACACTCTGAACTGTTCTGTG
ACGCCTGTAATCAAGGGTTTCGATCGACCAGATATTTTCACATTCATCTA
AGAGGCATAAAACATATTCAAAATGAAATCAAATATGTTTCTAAGCAAAA
ACCAGAAATATTCCTTGAGGAAATCAATAGAGAGAATGCATCAATAATGA
GTGATACAAATAAAGAAGATAAGCATATGAAATGGTTATTGAATAATGAA
GATATTTCCAAAGCATCTGGAACTTTTACTTGCAAGCTATGTAATGTTAC
TTGTTCATCACAAAAAGATTTTAAAAGCCATTTGAATGGTGAAAAACATT
CTAGCAAAAATACAAACGAAGAACCAATTGCATTGTTGATTTCTGATGAG
AAAAAAATCGTTAAAGATAGAGTTATTTCAAAACCCATTAAAAAACCGGT
TGCAAGAATACAGATACTTCTCGATCAATTCACTAGTCCTCTTATCGGAC
TTGATTACGTAAATGAATATCAGAAAAGAGAGTCGTTATCCGACAGCTGG
TATTATTGCGAGTTATGTAATTGTAAATGCGACAGGAAGTCTATAATTTC
TCACATTACTTCATACAAGCACAGAGCAACATATTTGAAACAATACTATT
ATGATCTCTATGAGTTGGTTGAGCACGACCCCAGTTCCAAAGCTGTTAGA
ATGAAACGATTAGAAGTTTATTCTGATCAAGTCGAATTGTTTGAAGGAAG
AAAAAAGATGAATATTATGATTGAAAAGCTATCAATTAATTCGCCTGAAA
TCATATCTGGTAATTCCAAGTCAGATCAAACGGATTCGAAGGGTGATAAA
ATCAGTTTTTTATTAAATGATGAGGAAAATAATAACGACGATATAAAACT
TTCTGTAACAAATGAAGCGCAAATAAATTTAGAAGATGTAGATAGAGAAG
ATGGTGAAATTGAATCGTCTGATGATGAATTTTGTTGTGACAGAAAATCA
GAAAGTATTTCCAATCAAGAAATTACTAAAAGTGTTTTAAAACAAGAAAT
TTTCACTGAATTAGAAGCAGGAGAAATAGAAAATGAAATTATAGATGACG
ATAGTGATGAACAGTGTGAGCAAATTGAAGCAGATATCAATGAAACCACC
TTTATCGATAAATCTGAATTCAATAATTATGAGCCCTTGTATATTAACAA
TAGAATACCATTTTGTGCCAAAAGTATAATCCGAGACAATGGAACACTTG
AGTTTATGAATCATAGAGAAAACAAAAAAGTCAAATTAAAAGAAGTAAGT
GAAGACATTCTTAATCATTTCAAGCAAGAGGCTAAATGGGCATTAGATCG
ATTGAAAGAGATTAATATTACTAAAAATACTTATCATAAAAGAAATTTTC
ACCTGCCTCCATCAAAAAATCCAGTAGGTCCGGTCTACCCGGGAGTTTCT
GAATTATCAAATAAATTACAGCACGAACAAAAGAAATCTTCATCTTCTTC
ATCCACTGAAAATAGATATATGTCACCAGTAACTGAATTTGTCGATACAA
AGTTGTCTTCAGTAAATAACAGTCTTTTACACGACGTGCAGAATATGTTA
GTCAATAAAGTAACCAATGACCCATTAAATGAAATCATAAGACAAGAATT
CCTTTTCTCATCTCCTAGTAAAGATGTTGATCAAACGATTTCTTCATCTT
CAAACGAAAACAACCGGAAATCTATTTTATGCTCACCAATTGATTTGAAG
TTGTTAAATGAAACATTGAAAACGGTTCAAAACTTTACTGAATTTTCTTC
TAATCGTGCCGAAATAAACAATTCATCAGATTCCGTTATGTTCAAGAAAA
CAACGAGTAATTTAGAAATCGGAAAGAAATTATCGAAACACAACGAATTT
CTAGGTATGGATCAGCCGCCGCCGCCAATTTTTCCAAAAACAAATCCAGA
TATTTATTACAAAATTCCGCAATATAATCCAATGACAAATATTATGCCAC
CCTTTCTTACGCCGGGAATGATTGAACATCAGACTATCTATCCTGCTTTT
GATAGAAATTTTGTGTATTCATATCCATCCATTTTCTATAGTAATAACAA
TTCTAATCATGCGCCGTCAGATATTTTCAATGGTTATGATAGGACTAACT
TATAATTTTAAATATTTGAAAAATACTTGAATCTGCATATATTTTTACGT
GTATAGTGTTTAATAAAATATTTTACTGTTTTTATGAAATACAATAATAG
AC
back to top

protein sequence of SMED30020076-orf-1

>SMED30020076-orf-1 ID=SMED30020076-orf-1|Name=SMED30020076-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=1031bp
MISNFQNDEYYTSEMLSKVIHHFFEISKNIPKEPKSKYKTKEFQNYKFKM
QEYLTSYCCHLFDICNKMNNVTEQFKEAYNNLYSDFKSYNKFLKKFKCIS
QFNFEHNQDDGSMSPSNPCYTNFNNDNELEDIKSNNYENNDQNINLCDEN
FLNNFHNKYKVIKRFIKKEKHQVKIKKLSKSQKKKLRKKRKLREETVANN
VEPVHYNNDEDPQAMIQNYFKSYTETGNWSWTGWINCVQKIQLTYPQWFE
MFNEFSRNMGINWNEMFVSDLQAIDMRNVRIDMDTLDLTNKFFDKSAASE
MPKEGTSVIFTDDMQSRNTTETAHSELFCDACNQGFRSTRYFHIHLRGIK
HIQNEIKYVSKQKPEIFLEEINRENASIMSDTNKEDKHMKWLLNNEDISK
ASGTFTCKLCNVTCSSQKDFKSHLNGEKHSSKNTNEEPIALLISDEKKIV
KDRVISKPIKKPVARIQILLDQFTSPLIGLDYVNEYQKRESLSDSWYYCE
LCNCKCDRKSIISHITSYKHRATYLKQYYYDLYELVEHDPSSKAVRMKRL
EVYSDQVELFEGRKKMNIMIEKLSINSPEIISGNSKSDQTDSKGDKISFL
LNDEENNNDDIKLSVTNEAQINLEDVDREDGEIESSDDEFCCDRKSESIS
NQEITKSVLKQEIFTELEAGEIENEIIDDDSDEQCEQIEADINETTFIDK
SEFNNYEPLYINNRIPFCAKSIIRDNGTLEFMNHRENKKVKLKEVSEDIL
NHFKQEAKWALDRLKEINITKNTYHKRNFHLPPSKNPVGPVYPGVSELSN
KLQHEQKKSSSSSSTENRYMSPVTEFVDTKLSSVNNSLLHDVQNMLVNKV
TNDPLNEIIRQEFLFSSPSKDVDQTISSSSNENNRKSILCSPIDLKLLNE
TLKTVQNFTEFSSNRAEINNSSDSVMFKKTTSNLEIGKKLSKHNEFLGMD
QPPPPIFPKTNPDIYYKIPQYNPMTNIMPPFLTPGMIEHQTIYPAFDRNF
VYSYPSIFYSNNNSNHAPSDIFNGYDRTNL*
back to top
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
TermDefinition
PLANA:0000020protonephridia
Vocabulary: INTERPRO
TermDefinition
IPR013087Znf_C2H2_type
IPR036236Znf_C2H2_sf
IPR003604Matrin/U1-like-C_Znf_C2H2
IPR015880Znf_C2H2-like
IPR007087Znf_C2H2
Vocabulary: molecular function
TermDefinition
GO:0003676nucleic acid binding
GO:0008270zinc ion binding
GO:0046872metal ion binding
InterPro
Analysis Name: Schmidtea mediteranean smed_20140614 Interproscan
Date Performed: 2020-05-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 175..195
NoneNo IPR availableGENE3DG3DSA:3.30.160.60coord: 390..437
e-value: 3.7E-8
score: 34.7
NoneNo IPR availablePFAMPF12874zf-metcoord: 405..429
e-value: 2.2E-8
score: 34.2
NoneNo IPR availablePANTHERPTHR45762FAMILY NOT NAMEDcoord: 197..656
IPR003604Matrin/U1-C-like, C2H2-type zinc fingerSMARTSM00451ZnF_U1_5coord: 402..436
e-value: 0.0036
score: 26.5
coord: 324..358
e-value: 2.3
score: 9.3
coord: 494..527
e-value: 0.18
score: 18.2
IPR013087Zinc finger C2H2-typeSMARTSM00355c2h2final6coord: 405..429
e-value: 0.064
score: 22.4
coord: 497..520
e-value: 73.0
score: 5.6
coord: 327..351
e-value: 6.1
score: 15.2
IPR013087Zinc finger C2H2-typePROSITEPS00028ZINC_FINGER_C2H2_1coord: 329..351
IPR013087Zinc finger C2H2-typePROSITEPS00028ZINC_FINGER_C2H2_1coord: 407..429
IPR036236Zinc finger C2H2 superfamilySUPERFAMILYSSF57667beta-beta-alpha zinc fingerscoord: 314..355
IPR036236Zinc finger C2H2 superfamilySUPERFAMILYSSF57667beta-beta-alpha zinc fingerscoord: 397..432