Chromosome 4 open reading frame 47

Overview
Name: Chromosome 4 open reading frame 47
Smed ID: SMED30029737
Length (bp): 1073
Neoblast Clusters

Zeng et al., 2018




Overview

 

Single-cell RNA-seq of pluripotent neoblasts and their early progeny


We isolated X1 neoblast cells enriched for high piwi-1 expression (Neoblast Population) and profiled ∼7,614 individual cells via scRNA-seq. Unsupervised analyses uncovered 12 distinct classes among 7,088 high-quality cells. We designated these classes Nb1 to Nb12, ordered from high (Nb1) to low (Nb12) piwi-1 expression. We further defined the groups of genes that best classified the cells into the 12 clusters and used them to generate a scaled expression heat map of discriminative gene sets for each cluster. Expression of each cluster’s gene signature was validated using multiplex fluorescence in situ hybridization (FISH) co-stained with piwi-1, which largely confirmed the cell clusters revealed by scRNA-seq.
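The naming scheme described above (Nb1 for the cluster with the highest piwi-1 expression, down to Nb12 for the lowest) can be illustrated with a minimal sketch. The function name, the toy expression values, and the use of the per-cluster mean as the ranking statistic are all assumptions made for illustration; the page does not state which summary statistic was used.

```python
from statistics import mean


def order_clusters_by_marker(expression, labels):
    """Return cluster labels sorted from highest to lowest mean marker expression.

    `expression` holds one marker value per cell and `labels` holds the
    cluster label of the same cell (hypothetical inputs for illustration).
    """
    by_cluster = {}
    for value, label in zip(expression, labels):
        by_cluster.setdefault(label, []).append(value)
    means = {label: mean(values) for label, values in by_cluster.items()}
    return sorted(means, key=means.get, reverse=True)


# Toy piwi-1 values for six cells in three unnamed clusters (not real data).
piwi1 = [5.0, 4.8, 1.2, 0.9, 3.1, 2.9]
clusters = ["a", "a", "b", "b", "c", "c"]

ranked = order_clusters_by_marker(piwi1, clusters)
# Assign Nb1, Nb2, ... in order of decreasing mean marker expression.
names = {label: f"Nb{i}" for i, label in enumerate(ranked, start=1)}
print(names)
```

With twelve clusters the same ranking would produce the labels Nb1 through Nb12.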

We also tested sub-lethal irradiation exposure. To profile rare pluripotent stem cells (PSCs) and avoid interference from immediate progenitor cells, we determined a time point after sub-lethal irradiation (7 DPI) with minimal piwi-1+ cells, followed by isolation and single-cell RNA-seq of 1,200 individual cells derived from X1 (Piwi-1 high) and X2 (Piwi-1 low) cell populations (Sub-lethal Irradiated Surviving X1 and X2 Cell Population).




Explore this single cell expression dataset with our NB Cluster Shiny App




 

Neoblast Population

 

The t-SNE plot shows a two-dimensional representation of global gene expression relationships among all neoblasts (n = 7,088 after filtering). Cluster identity was assigned based on the top 10 marker genes of each cluster (Table S2), followed by inspection of RNA in situ hybridization patterns. Nb, neoblast group.


Expression of Chromosome 4 open reading frame 47 (SMED30029737) in t-SNE clustered cells

Violin plots show distribution of expression levels for Chromosome 4 open reading frame 47 (SMED30029737) in cells (dots) of each of the 12 neoblast clusters.

 



 

Embryonic Expression

Davies et al., 2017




The graph shows average RPKM values per sample.

Embryonic Stages: Y: yolk. S2-S8: Stages 2-8. C4: asexual adult. SX: virgin, sexually mature adult.
For further information about sample preparation and analysis for the single animal RNA-Seq experiment, please refer to the Materials and Methods.
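The embryonic expression values are reported as average RPKM (reads per kilobase of transcript per million mapped reads). As a minimal sketch of how such a value is derived, the example below applies the standard RPKM formula and averages across replicates. The read counts and library sizes are hypothetical, and using the 1073 bp length from the Overview as the normalizing length is an assumption; the actual pipeline is described in the Materials and Methods.

```python
from statistics import mean


def rpkm(reads, length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return reads * 1e9 / (length_bp * total_mapped_reads)


TRANSCRIPT_LENGTH_BP = 1073  # length listed in the Overview

# Hypothetical replicates for one stage: (reads on transcript, total mapped reads).
replicates = [(480, 19_500_000), (520, 21_000_000), (505, 20_200_000)]

per_replicate = [rpkm(r, TRANSCRIPT_LENGTH_BP, total) for r, total in replicates]
average_rpkm = mean(per_replicate)
print(f"average RPKM: {average_rpkm:.2f}")
```

Dividing by both transcript length and library size is what makes values comparable across genes of different lengths and samples of different sequencing depth.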

 

Anatomical Expression

PAGE et al., 2020




SMED30029737 has been reported as being expressed in these anatomical structures and/or regions.



PAGE Curations: 4

  
Expressed In | Reference Transcript | Gene Models | Published Transcript | Transcriptome | Publication | Specimen | Lifecycle | Evidence
testis | SMED30029737 | | Contig4124 | GPL15192 | PMID:28434803 (Rouhana et al., 2017) | whole organism | adult hermaphrodite | colorimetric in situ hybridization evidence
testis | SMED30029737 | | PL06006A1D11 | ncbi_smed_ests | PMID:28434803 (Rouhana et al., 2017) | whole organism | adult hermaphrodite | colorimetric in situ hybridization evidence
reproductive organ | SMED30029737 | | Contig41651 | newmark_ests | PMID:28434803 (Rouhana et al., 2017) | whole organism | adult hermaphrodite | RNA-sequencing evidence
reproductive organ | SMED30029737 | | Contig41651 | uc_Smed_v2 | PMID:28434803 (Rouhana et al., 2017) | whole organism | adult hermaphrodite | RNA-sequencing evidence
Homology
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Human
Match: C4orf47 (chromosome 4 open reading frame 47 [Source:HGNC Symbol;Acc:HGNC:34346])

HSP 1 Score: 127.872 bits (320), Expect = 1.378e-33
Identity = 107/310 (34.52%), Positives = 154/310 (49.68%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISIND--------PFKTVLINNFSGKNFVVPGSKVPCGNDDGYFN-GFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPF--EGGSNAIKPVKPQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            D++RIGLF E+ YI++ D        PF      N   K  +  GSK       GYF+  F R+F+ E G  +   +RR+  +E   +NLGK ++PS+   K CGLGS  G+    +P F  ++K +  Y    KN+ TNPG KG GYGY+N+   K   + + + Y+  +  YKK +E H   + G P F       DYF  NP+  E     IK  + +  +   F  +S  KK GGMKAGT +P+P+H   PY+     +A     D      P+G  P   P +S++  NV    N+ N  K SSV
Sbjct:    9 DMERIGLFSEMEYITVGDKYVSQFNRPFNEAASKN---KQMLPGGSKEMSDLQAGYFDPHFVRIFEGE-GYINLNQVRRRDMVEAAKKNLGKAFLPSNGEKKPCGLGSYYGTIGGPVPFFSAQSKPREKYKAPGKNLYTNPGKKGTGYGYANITIGKQFSH-SADFYDAAKLKYKKANEEHHRLLKGAP-FKLNLHPRDYFDANPYFSEESLPPIKKEEKKKTISNTFKPSSPGKKPGGMKAGTFDPYPSHSADPYVAK---LANISGKDDKIFHPPSG--PKSRPVESIMTLNVRRALNSKNY-KTSSV 306          
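The statistics line of each HSP can be read programmatically. The sketch below parses the Identity, Positives, and Query Frame fields, recomputes the percentages, and counts the codons spanned by the query coordinates. It assumes the query coordinates are nucleotide positions from a translated search, which is how a non-zero Query Frame is normally interpreted; the function names are hypothetical and not part of any BLAST toolkit.

```python
import re

# Accept either spelling of "Positives", since some report renderers drop the "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\), "
    r"Query Frame = (-?\d+)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identities, length, positives, pos_length, frame = map(int, match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / pos_length, 2),
        "frame": frame,
    }


def query_codons(start, end):
    """Codons spanned by 1-based, inclusive nucleotide coordinates."""
    return (abs(end - start) + 1) // 3


stats = parse_hsp_stats(
    "Identity = 107/310 (34.52%), Positives = 154/310 (49.68%), Query Frame = 3"
)
print(stats)
print(query_codons(54, 947))
```

The alignment length (310 columns) is larger than the number of query codons because alignment columns also include gap positions.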
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Human
Match: C4orf47 (chromosome 4 open reading frame 47 [Source:HGNC Symbol;Acc:HGNC:34346])

HSP 1 Score: 90.8929 bits (224), Expect = 2.069e-21
Identity = 58/154 (37.66%), Positives = 86/154 (55.84%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISINDPFKTVL---INNFSGKN--FVVPGSKVPCGNDDGYFN-GFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDK 494
            D++RIGLF E+ YI++ D + +      N  + KN   +  GSK       GYF+  F R+F+ E G  +   +RR+  +E   +NLGK ++PS+   K CGLGS  G+    +P F  ++K +  Y    KN+ TNPG KG GYGY+N+   K
Sbjct:    9 DMERIGLFSEMEYITVGDKYVSQFNRPFNEAASKNKQMLPGGSKEMSDLQAGYFDPHFVRIFEGE-GYINLNQVRRRDMVEAAKKNLGKAFLPSNGEKKPCGLGSYYGTIGGPVPFFSAQSKPREKYKAPGKNLYTNPGKKGTGYGYANITIGK 161          
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Human
Match: C4orf47 (chromosome 4 open reading frame 47 [Source:HGNC Symbol;Acc:HGNC:34346])

HSP 1 Score: 91.2781 bits (225), Expect = 2.274e-21
Identity = 58/154 (37.66%), Positives = 86/154 (55.84%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISINDPFKTVL---INNFSGKN--FVVPGSKVPCGNDDGYFN-GFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDK 494
            D++RIGLF E+ YI++ D + +      N  + KN   +  GSK       GYF+  F R+F+ E G  +   +RR+  +E   +NLGK ++PS+   K CGLGS  G+    +P F  ++K +  Y    KN+ TNPG KG GYGY+N+   K
Sbjct:    9 DMERIGLFSEMEYITVGDKYVSQFNRPFNEAASKNKQMLPGGSKEMSDLQAGYFDPHFVRIFEGE-GYINLNQVRRRDMVEAAKKNLGKAFLPSNGEKKPCGLGSYYGTIGGPVPFFSAQSKPREKYKAPGKNLYTNPGKKGTGYGYANITIGK 161          
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Zebrafish
Match: zgc:153146 (zgc:153146 [Source:NCBI gene;Acc:751702])

HSP 1 Score: 126.716 bits (317), Expect = 2.323e-33
Identity = 101/320 (31.56%), Positives = 158/320 (49.38%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISINDPFKTVLINNFS-----GKNFVVPGSKVPCGNDDGYFN-GFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYPYIPQK------KNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAG---RPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLEK------PFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            +D++RIG+FKE+GYISI D + + +   F+      K  ++ G+K  CG   GYF+  F+R+F+ E   T  + I R++R+    +N+GK ++PS+     CG+G+  G+F   I A          IP+K      KN  TNP  KG GYGY ++   K   Y + + Y+  + + K+    H S + G   R    P     +YF  NP++      KP+ P   +E+      PF  +S  KK GGMKAG  + +P +   PY            N++ K F P+   P   P +S++  NV    N+TN  ++ SV
Sbjct:    7 SDMERIGVFKEMGYISIGDKYTSFIYRPFNDSAYKNKQMLLGGTKSKCGLQTGYFDTQFKRIFERE-AFTDPVRIDRQYRILQAKKNIGKAFLPSNGEKTTCGMGTYYGTFGGPIQAMSALQ-----IPRKQNKSAGKNFYTNPPKKGSGYGYPDITLSKMVSY-SSDPYDRAKEMLKREITAHKSMLKGGAFRLNLHP----NEYFDGNPYKFD----KPLPPPKKIEEKKHFAVPFKPSSPSKKAGGMKAGAFDSYPTYSAEPYGTKK--TKSVVANNEVKIFHPSPG-PKSTPIKSIISLNVNKAVNSTNYNRIPSV 308          
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Zebrafish
Match: C14H4orf47 (zgc:153146 [Source:NCBI gene;Acc:751702])

HSP 1 Score: 124.02 bits (310), Expect = 2.633e-32
Identity = 101/321 (31.46%), Positives = 159/321 (49.53%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISINDPFKTVLINNFS-----GKNFVVPGSKVPCGNDDGYFN-GFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYPYIPQK------KNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAG---RPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLEK------PFIVASYPKKDGGMKAGTLNPFPAHQPCPY-IDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            +D++RIG+FKE+GY SI D + + +   F+      K  ++ G+K  CG   GYF+  F+R+F+ E   T  + I R++R+    +N+GK ++PS+     CG+G+  G+F   I A          IP+K      KN  TNP  KG GYGY ++   K   Y + + Y+  + + K+    H S + G   R    P     +YF  NP++      KP+ P   +E+      PF  +S  KK GGMKAG  + +P +   PY    N  +     N++ K F P+   P   P +S++  NV    N+TN  ++ SV
Sbjct:    7 SDMERIGVFKEMGYTSIGDKYTSFIYRPFNDSAYKNKQMLLGGTKSKCGLQTGYFDTQFKRIFERE-AFTDPVRIDRQYRILQAKKNIGKAFLPSNGEKTTCGMGTYYGTFGGPIQAMSALQ-----IPRKQNKSAGKNFYTNPPKKGSGYGYPDITLSKMVSY-SSDPYDRAKEMLKREITAHKSMLKGGAFRLNLHP----NEYFDGNPYKFD----KPLPPPKKIEEKKHFAVPFKPSSPSKKAGGMKAGAFDSYPTYSAEPYGTKKNKSVV---ANNEVKIFHPSPG-PKSTPIKSIISLNVNKAVNSTNYNRIPSV 308          
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Xenopus
Match: C4orf47 (chromosome 4 open reading frame 47 [Source:NCBI gene;Acc:100379819])

HSP 1 Score: 127.102 bits (318), Expect = 2.095e-33
Identity = 110/309 (35.60%), Positives = 152/309 (49.19%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISIND---PFKTVLINNFSGKNF-VVPGSKVPCGND-DGYFNG-FQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNAIKPVKP------QIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKV 938
             D++RIGLF E+GY SI D   P  +   N  + KN  ++PG      N   GYF+  F+RVF+ E   +  +  RR++R++   +NLGK ++PSS   K  GLGS  G+    + AF  E K +  Y    KN  TNP  +G GYGY++V   K P   +   Y+    L KK  E H SK+ G P F       D+F  NP+       KP+ P      +   EKPF  +S  K+ GGMKAGT + +P H   PY   +   AK  + ++ K F P G  P  +P  S+L  NV       N   V
Sbjct:    6 TDMERIGLFSEMGYTSIGDKYAPPGSKPFNESASKNRQILPGGSKSMANILGGYFDAQFKRVFEGE-SYSDVLKQRRQYRMQQSKKNLGKPFLPSSGEKKPSGLGSFYGTVGGPVVAFSAELKPRKAYTAPGKNFYTNPPKQGSGYGYTSVTIGK-PYLYSSENYDIATELIKKEIENHKSKLKGGP-FKLNLHPKDHFNPNPY----YTDKPLPPLKVHSQKKETEKPFKPSSPGKQAGGMKAGTFDTYPTHSNDPYTIKS---AKTPIKER-KIFHPPGG-PKTYPVHSILTSNVIKSVTAVNYKTV 302          
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Mouse
Match: 1700029J07Rik (RIKEN cDNA 1700029J07 gene [Source:MGI Symbol;Acc:MGI:1916729])

HSP 1 Score: 147.132 bits (370), Expect = 6.606e-41
Identity = 110/310 (35.48%), Positives = 155/310 (50.00%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISIND----PFKTVLINNFSGKNFVVPG-SKVPCGNDDGYFNG-FQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRP--TFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLE---KPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            D++RIGLF E+ YI++ D    PF        S    ++PG +K       GYF+  F R+F+ E G  +   +RR++ L +  +NLGK +IPSS   K  GLGS  G+    +P F  + K K  Y P  KN+ TNPG KG GYGY+NV   K   + + + Y+     YKK SE H   + G P    + P    DYF  NP+    + + P++ +   E   KPF  +S  KK GGMKAG  +P+PAH   PY+       +  +  KG+R     N P   P +S++  NV    N  N    SS 
Sbjct:    9 DMERIGLFSEMEYITVGDKYVSPFNRPFNEAASKNRQILPGGTKEMSSLQAGYFDSQFARIFEGE-GYVNLNQVRRRYMLAESKKNLGKAFIPSSGEKKPSGLGSYYGTIGGPVPFFSAQIKPKDKYQPPGKNLYTNPGKKGTGYGYANVTIGKQLSH-SSDLYDAARQSYKKESEEHHRLIKGSPFKLHLHPK---DYFDTNPY-FLEHHLPPLRREEKKEVSFKPFKPSSPGKKAGGMKAGAFDPYPAHSADPYV----VKVEKAIPSKGERVFHPPNGPKSRPVESIMALNVKRALNVKNYKNASST 308          
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Mouse
Match: 1700029J07Rik (RIKEN cDNA 1700029J07 gene [Source:MGI Symbol;Acc:MGI:1916729])

HSP 1 Score: 57.3806 bits (137), Expect = 6.160e-10
Identity = 36/100 (36.00%), Positives = 54/100 (54.00%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISIND----PFKTVLINNFSGKNFVVPG-SKVPCGNDDGYFNG-FQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVK 332
             D++RIGLF E+ YI++ D    PF        S    ++PG +K       GYF+  F R+F+ E G  +   +RR++ L +  +NLGK +IPSS   K
Sbjct:    8 TDMERIGLFSEMEYITVGDKYVSPFNRPFNEAASKNRQILPGGTKEMSSLQAGYFDSQFARIFEGE-GYVNLNQVRRRYMLAESKKNLGKAFIPSSGEKK 106          
BLAST of Chromosome 4 open reading frame 47 vs. UniProt/SwissProt
Match: sp|Q3U1D9|CD047_MOUSE (UPF0602 protein C4orf47 homolog OS=Mus musculus OX=10090 PE=2 SV=1)

HSP 1 Score: 147.132 bits (370), Expect = 4.621e-40
Identity = 110/310 (35.48%), Positives = 155/310 (50.00%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISIND----PFKTVLINNFSGKNFVVPG-SKVPCGNDDGYFNG-FQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRP--TFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLE---KPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            D++RIGLF E+ YI++ D    PF        S    ++PG +K       GYF+  F R+F+ E G  +   +RR++ L +  +NLGK +IPSS   K  GLGS  G+    +P F  + K K  Y P  KN+ TNPG KG GYGY+NV   K   + + + Y+     YKK SE H   + G P    + P    DYF  NP+    + + P++ +   E   KPF  +S  KK GGMKAG  +P+PAH   PY+       +  +  KG+R     N P   P +S++  NV    N  N    SS 
Sbjct:    9 DMERIGLFSEMEYITVGDKYVSPFNRPFNEAASKNRQILPGGTKEMSSLQAGYFDSQFARIFEGE-GYVNLNQVRRRYMLAESKKNLGKAFIPSSGEKKPSGLGSYYGTIGGPVPFFSAQIKPKDKYQPPGKNLYTNPGKKGTGYGYANVTIGKQLSH-SSDLYDAARQSYKKESEEHHRLIKGSPFKLHLHPK---DYFDTNPY-FLEHHLPPLRREEKKEVSFKPFKPSSPGKKAGGMKAGAFDPYPAHSADPYV----VKVEKAIPSKGERVFHPPNGPKSRPVESIMALNVKRALNVKNYKNASST 308          
BLAST of Chromosome 4 open reading frame 47 vs. UniProt/SwissProt
Match: sp|Q2T9M0|CD047_BOVIN (UPF0602 protein C4orf47 homolog OS=Bos taurus OX=9913 PE=2 SV=1)

HSP 1 Score: 132.109 bits (331), Expect = 1.732e-34
Identity = 107/310 (34.52%), Positives = 160/310 (51.61%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISINDPFKTVL---INNFSGKNF-VVPGSKVPCGN-DDGYFN-GFQRVFDNERGITSFIN---IRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPF--EGGSNAIKPVKPQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            D++RIGLF E+ YI++ D + +      N  + KN  ++PG      N   GYF+  F R+F+ E    S++N   +RR++ +E+  +NL K ++PS+   K CGLGS  G+    +P F  ++K K  Y P  KN+ TNPG KG GYGY+N+   K   + + + Y+  +   KK +E H   + G   F       +YF  NP+  E     IK V+ +  +  PF  +S  KK GGMKAGT +P+P+H   PY+       K   +   K F P G  P   P +S++  NV    N  N  K +SV
Sbjct:    9 DMERIGLFSEMEYITVGDKYVSQFNRPFNESASKNRQILPGGSKEMSNLQAGYFDPHFVRIFEGE----SYVNPNQVRRRYMMEEAKKNLSKAFLPSNGEKKPCGLGSYYGTIGGPVPFFSAQSKPKEKYEPPGKNLYTNPGKKGTGYGYANITIGKQFSH-SSDLYDAAKLNNKKENEEHRRLLKGT-AFKLNLYTREYFDTNPYMSEKPLPPIKKVEKKETVGNPFKPSSPGKKAGGMKAGTFDPYPSHSADPYV----VKLKSPSSKSAKVFHPPGG-PKSRPIESIMALNVKRALNMKNY-KTASV 306          
BLAST of Chromosome 4 open reading frame 47 vs. UniProt/SwissProt
Match: sp|A7E2U8|CD047_HUMAN (UPF0602 protein C4orf47 OS=Homo sapiens OX=9606 GN=C4orf47 PE=2 SV=1)

HSP 1 Score: 127.872 bits (320), Expect = 6.619e-33
Identity = 107/310 (34.52%), Positives = 154/310 (49.68%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISIND--------PFKTVLINNFSGKNFVVPGSKVPCGNDDGYFN-GFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPF--EGGSNAIKPVKPQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            D++RIGLF E+ YI++ D        PF      N   K  +  GSK       GYF+  F R+F+ E G  +   +RR+  +E   +NLGK ++PS+   K CGLGS  G+    +P F  ++K +  Y    KN+ TNPG KG GYGY+N+   K   + + + Y+  +  YKK +E H   + G P F       DYF  NP+  E     IK  + +  +   F  +S  KK GGMKAGT +P+P+H   PY+     +A     D      P+G  P   P +S++  NV    N+ N  K SSV
Sbjct:    9 DMERIGLFSEMEYITVGDKYVSQFNRPFNEAASKN---KQMLPGGSKEMSDLQAGYFDPHFVRIFEGE-GYINLNQVRRRDMVEAAKKNLGKAFLPSNGEKKPCGLGSYYGTIGGPVPFFSAQSKPREKYKAPGKNLYTNPGKKGTGYGYANITIGKQFSH-SADFYDAAKLKYKKANEEHHRLLKGAP-FKLNLHPRDYFDANPYFSEESLPPIKKEEKKKTISNTFKPSSPGKKPGGMKAGTFDPYPSHSADPYVAK---LANISGKDDKIFHPPSG--PKSRPVESIMTLNVRRALNSKNY-KTSSV 306          
BLAST of Chromosome 4 open reading frame 47 vs. UniProt/SwissProt
Match: sp|Q5XHC1|CD047_XENLA (UPF0602 protein C4orf47 homolog OS=Xenopus laevis OX=8355 PE=2 SV=1)

HSP 1 Score: 127.487 bits (319), Expect = 7.802e-33
Identity = 111/306 (36.27%), Positives = 149/306 (48.69%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISINDPFK---TVLINNFSGKN--FVVPGSKVPCGNDDGYFNG-FQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNAIKPVK---PQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKV 938
             D++RIGLF E+GY SI D +    +   N  + KN   +  GSK       GYF+G F+RVF+ E     F   RR+ R++   +NLGK ++PSS   K  GLGS  G+    + AF  E K +  Y    KN  TNP   G GYGY +V   K   Y + N Y+    L KK  E+H SK+ G   F       DYF  NP+      + P+K    +   EKPF  +S  K+ GGMKAGT +P+P H   PY       +K  + ++ K F P G  P  +P  S+L  NV       N   V
Sbjct:    6 TDMERIGLFSEMGYTSIGDKYAAPGSKPFNESASKNRQMLPGGSKSMANMLGGYFDGQFKRVFEGESYSDPFKQ-RRQHRMQQSKKNLGKPFLPSSGEKKRSGLGSFYGTLGGPVVAFSAELKSRKAYTAPGKNFYTNPPKDGSGYGYPSVTIGKPYPYSSEN-YDISRELIKKEIEHHKSKLKGG-AFKLNLHPKDYFEPNPYYT-DKTLPPLKVHSQKKETEKPFKPSSPAKEAGGMKAGTFDPYPTHSNDPYTAKP---SKTPVKER-KVFHPPGG-PKTYPVHSILTSNVIKSVTALNYKTV 302          
BLAST of Chromosome 4 open reading frame 47 vs. UniProt/SwissProt
Match: sp|Q0P4C5|CD047_DANRE (UPF0602 protein C4orf47 homolog OS=Danio rerio OX=7955 GN=zgc:153146 PE=2 SV=1)

HSP 1 Score: 121.324 bits (303), Expect = 1.712e-30
Identity = 99/320 (30.94%), Positives = 157/320 (49.06%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISINDPFKTVLINNFS-----GKNFVVPGSKVPCGNDDGYFNG-FQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYPYIPQK------KNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAG---RPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLEK------PFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            +D++RIG+FKE+GYISI D + + +   F+      K  ++ G+K  CG   GYF+  F+R+F+ E   T  + I R++R+    +N+GK ++PS+     CG+G+  G+F   I A          IP+K      KN  TNP  +G GYG  ++   K   Y + + Y+  + + K+    H S + G   R    P     +YF  NP++      KP+ P   +E+      PF  +S  KK GGMKAG  + +P +   PY            N++ K F P+   P   P +S++  NV    N+TN  ++ SV
Sbjct:    7 SDMERIGVFKEMGYISIGDKYTSFIYRPFNDSAYKNKQMLLGGTKSKCGLQTGYFDTQFKRIFERE-AFTDPVRIDRQYRILQAKKNIGKAFLPSNGEKTTCGMGTYYGTFGGPIQAMSALQ-----IPRKQNKSAGKNFYTNPPKEGSGYGCPDITLSKMVSY-SSDPYDRAKEMLKREITAHKSMLKGGAFRLNLHP----NEYFDGNPYKFD----KPLPPPKKIEEKKHFAVPFKPSSPSKKAGGMKAGAFDSYPTYSAEPYGTKK--TKSVVANNEVKIFHPSPG-PKSTPIKSIISLNVNKAVNSTNYNRIPSV 308          
BLAST of Chromosome 4 open reading frame 47 vs. TrEMBL
Match: A0A267FK75 (Uncharacterized protein OS=Macrostomum lignano OX=282301 GN=BOX15_Mlig016035g1 PE=4 SV=1)

HSP 1 Score: 180.644 bits (457), Expect = 1.634e-50
Identity = 111/305 (36.39%), Positives = 162/305 (53.11%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISINDPFK-----TVLINNFSGKNFVVPGSKVPCGNDDGYFNGFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGN-ETKKYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            +D+DRIG+FKE+ Y ++ DP+K         + + G+  +  G+K   G   GY + F RVF+ E G +  I  RR+ R +   RNL + W P+S      G GS  G+F   +  F    + K    P+ KNI+TNP  KG GYGY+ V  +KYP+Y + + Y  ++ + K+ +E H  KVAG+  F   +   D+F  NP++GGS   K      P+ +PF  ++  KKDGGMKAGT + FP +    Y   +       +N  GK F+PN    T   T SVL +NVT   N++N   V SV
Sbjct:    5 SDMDRIGIFKEMSYHTVGDPYKNPGNFQFNTSAYKGRQMLPGGTKKKSGLSQGYMSEFGRVFEGE-GYSDPIKTRRQERQKASSRNLSRSWNPTSFPKVGSGTGSHYGTFSGPVQYFNAFSSSKGKAAPEGKNILTNPPKKGTGYGYAFVTLNKYPDYKS-DTYEKYKDIQKRENEQHKQKVAGKGAFKLSTHPADFFDANPYKGGSALTKSEAKAEPIARPFKPSNPGKKDGGMKAGTFDSFPTYSSEKYQKKSLTRPVQVVNSSGKTFVPNSGPKTTLQT-SVLHQNVTRSVNSSNYKSVQSV 306          
BLAST of Chromosome 4 open reading frame 47 vs. TrEMBL
Match: A0A183B436 (Uncharacterized protein OS=Echinostoma caproni OX=27848 GN=ECPE_LOCUS13971 PE=4 SV=1)

HSP 1 Score: 158.688 bits (400), Expect = 5.770e-42
Identity = 103/314 (32.80%), Positives = 153/314 (48.73%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISINDP---FKTVLINNFSGKNFVVPG------SKVPCGNDDGYFNGFQRVFDNERGITSFINIRRKWRLEDRLRNLG-KDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNK-YNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLEKPFIVASYPKK---------------DGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCN 917
            DL R+G+FKEL Y +I DP   F T +  N S    + P       SK    N DGYF+ F+  F  + G+T + +++R+ R  ++ + +G +DWIP+S      G+GS  G+FQ    A    TK    +P  KN  TNPG KG GYGY +VC + +P +  G+        ++++    H +K+ GR  F+      D F  NP+  G     P+ P  P    F +A++PK                DGGMKAGT+N FP +   PY+D ++   +D     G  ++PN       P  S++ KN  L  N
Sbjct:    6 DLQRLGIFKELSYHTIGDPYVPFATTIKTNTSSNKGLAPMYLAGGYSKTKAANSDGYFHPFESAFIGDGGMT-YADLQRQERKANKAKMIGGRDWIPASGHKNRSGVGSTIGNFQEVHTAMDPTTKVPQKVPVLKNFYTNPGKKGTGYGYPDVCMNPFPAWQAGDTGLGAARRIFEQARAEHATKLKGRAEFVSTCRSLDAFENNPWASGD----PLAPGGPSNLKFGIAAFPKSMIIGPTFIPSSPAKHDGGMKAGTINRFPEYTNEPYVDPHHIGRRDKTKYVGGEWIPNRCTAVVVPQPSIVNKNTMLRIN 314          
BLAST of Chromosome 4 open reading frame 47 vs. TrEMBL
Match: A0A158R8P4 (Uncharacterized protein OS=Taenia asiatica OX=60517 GN=TASK_LOCUS5829 PE=4 SV=1)

HSP 1 Score: 166.777 bits (421), Expect = 1.046e-41
Identity = 113/325 (34.77%), Positives = 154/325 (47.38%), Query Frame = 3
Query:   42 YIMADLDRIGLFKELGYISINDPFKTVLINNFSGKN-------FVVPG-SKVPCGNDDGYFNGFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNK-YNDFETL-YKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLEKPFIVASYPKK---------------DGGMKAGTLNPFPAHQPCPYIDSNYFMAKDF--------LNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCN 917
            ++   L  +G+F +L Y S  DP+       + G+        +   G SK    N DGYF  F+R+F+ E        + R WR  +  R LGK+W+PSS++ K CGLGS  G+F A IPA G E  +    P+  N  TNPG KG GYGY NV  +  PE+  G+  ++D   +  K+L+E H  K  GR  F+        F  NP+EGG     P+ P  P  + F VA +PK                DGG KAGTLN +P H   PY +    MAK          L   G+ ++P        P+ SV+ KNV L  N
Sbjct:    6 FLKPALRTLGVFSQLPYSS--DPYVDPDPYGYKGRTNRGLRPMYCAGGHSKSRAANSDGYFESFKRIFEGE-AFNDETALHRSWRRLNEARRLGKEWVPSSSSRKRCGLGSPYGNFTAVIPAMGTEHTEIEKAPELHNFYTNPGKKGTGYGYPNVAINPLPEWKPGDGIHSDLNAVSVKELNERHQEKCLGRGVFVNQQPSGRAFGPNPYEGGD----PLAPGGPPLQKFGVACFPKDLIIGPIFYPQNPGKLDGGCKAGTLNKWPEHVSEPYKE----MAKVLQEQSIFKSLEKGGRVWIPTSGTAIEKPSMSVVNKNVDLAIN 319          
BLAST of Chromosome 4 open reading frame 47 vs. TrEMBL
Match: A0A158QUH3 (Uncharacterized protein OS=Mesocestoides corti OX=53468 GN=MCOS_LOCUS6246 PE=4 SV=1)

HSP 1 Score: 165.236 bits (417), Expect = 3.001e-41
Identity = 103/272 (37.87%), Positives = 143/272 (52.57%), Query Frame = 3
Query:  165 SKVPCGNDDGYFNGFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNK-YNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLEKPFIVASYP---------------KKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDF----LNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNN 920
            SK   GN DGYF  F+R F+ E  I++ + IRR WR ++ +R +GK+W+PSS T +  GLGS  G+F   I A   E KKYP  P+  N  TNPG KG GYGY +VC + YPE+  G   Y +     K+L+E H  +  GRP FI       +F  NPFEG      P+ P  P  + F +++YP               KKDGG K G L+ +PAH   P+ +    + + F     N + K ++P        PT+S++ KNV L  N+
Sbjct:    8 SKTRAGNSDGYFEPFKRTFEGE-AISNIVAIRRGWRKDNNMRKVGKEWLPSSPTKRRAGLGSTLGNFTTVIEAMSPEQKKYPKPPKLPNFYTNPGKKGTGYGYVDVCINPYPEWKPGASIYANPGPSMKELNEKHQLQCLGRPIFINQQASGGFFGPNPFEG----TDPLAPGGPPIQKFGISNYPKELMIGPIFCPPNPAKKDGGCKDGALSKWPAHSAEPFKEMTKILLEKFNSSPENQQLKLWVPQPCSAIEKPTKSIINKNVDLAINS 274          
BLAST of Chromosome 4 open reading frame 47 vs. TrEMBL
Match: H2KV45 (UPF0602 protein C4orf47 homolog OS=Clonorchis sinensis OX=79923 GN=CLF_111235 PE=4 SV=1)

HSP 1 Score: 155.992 bits (393), Expect = 8.435e-41
Identity = 104/315 (33.02%), Positives = 152/315 (48.25%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISINDPFK------TVLINNFSG--KNFVVPG-SKVPCGNDDGYFNGFQRVFDNERGITSFINIRRKWRLEDRLRNLGK-DWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDF-ETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLEKPFIVASYP---------------KKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNN 920
            DLDR+G+FKE+ YI++ +P+        ++  N  G    ++  G SK    N DGYF  F+ +   + G   + +I R+ R + + R +G+ +WIPSS      G GS  G+FQ    AF    K  P +P+ KN  TNPG KG GYGY +VC + YP    G+   +    +Y+   + H +K+ GR  FI      D F  NP+  G     P+ P  P    F  A++P               K+DGG K G LNPFP +   PY+       +D     G +++PN       PT S++ KN  LH N+
Sbjct:    7 DLDRLGIFKEMSYITVGEPYVPPQSHLAMIRTNPRGVPPMYLAGGYSKSKAANADGYFAPFESIHIGD-GNVRYADILRESRKQSKARRIGRGEWIPSSGPKLRSGTGSSYGNFQERYEAFDPARKAVPRVPELKNFYTNPGKKGTGYGYVDVCLNPYPSAAVGDSPGEIARRMYEAQVKDHIAKLKGRKPFISTCRSLDAFDGNPWAEGD----PLAPGGPSTVKFGTAAFPKSMIIGPTFVPSSPAKRDGGKKDGALNPFPEYSSEPYVGLQGIHKEDKSKFVGSQWIPNPGTALVIPTPSIVAKNTMLHFNS 316          
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Cavefish
Match: zgc:153146 (chromosome 24 C4orf47 homolog [Source:NCBI gene;Acc:103030756])

HSP 1 Score: 139.043 bits (349), Expect = 4.203e-38
Identity = 112/316 (35.44%), Positives = 158/316 (50.00%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISINDPFKTVLINNFS-----GKNFVVPGSKVPCGNDDGYFNG-FQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGN-ETKKYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIPLEK----------PFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
            +D++R+GLFKE+GYISI D +   L   F+      K  +V GSK   G   GYF+  F+R+F+ E  +T  I I+R+++LE   RNLGK ++PSS   K  G+GS  G+    I A    +  K PY    KN+ T+P  KG GYGY  V   K   Y + + Y+  + + K+    H S + G P F       + F  NP+       KP KP  P +K          PF  +S  KK GGMKAGT + +P H   PYI      A   LN + K F P+   P   P +S++  NV    N+ N  ++ S+
Sbjct:    7 SDMERVGLFKEMGYISIGDKYTPFLYRAFNESAHKNKQMLVGGSKKKSGLQTGYFDAQFKRIFEKE-ALTDLIKIQRQYKLEQTKRNLGKAFLPSSGEKKRSGIGSYYGTLGGSIQAMSPLKIPKQPYKSPGKNMYTSPPKKGSGYGYPGVTLGKLDLY-SSDPYDRAKDIVKRELIAHKSMLKGGP-FRLNLHPKECFDANPY-------KPDKPLPPSKKTEDKKTHFGVPFKPSSPSKKIGGMKAGTFDLYPTHSVDPYIPHKPKTAG--LNKELKIFHPSPG-PKSTPVKSIISLNVNKFVNSKNCNQIPSI 309          
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Sea Lamprey
Match: zgc:153146 (pep scaffold:Pmarinus_7.0:GL477013:54605:76454:-1 gene:ENSPMAG00000001809.1 transcript:ENSPMAT00000001995.1 gene_biotype:protein_coding transcript_biotype:protein_coding gene_symbol:zgc:153146)

HSP 1 Score: 134.806 bits (338), Expect = 3.660e-37
Identity = 96/254 (37.80%), Positives = 128/254 (50.39%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISINDPFKTVLINNFS-----GKNFVVPGSKVPCGNDDGYFN-GFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFG-NETKKYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGG--SNAIKPVKPQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYI 788
            D++R+GLF ELGYISI D +K+     F+     G+  +  G K       GYF+ GF RV   E   +  I IRR+ RL++  +NLGK  +PSS   K  GLGS  G+    IPA    +  K PY    +N +TNP  KG GYGY  V     P Y+  + Y+  + + +K  E H   V G P F       D F VNP+     + A  P +     EK F  +S  KK GGMKAGT + +P+H    YI
Sbjct:    8 DMERVGLFSELGYISIGDKYKSSFNKAFNAAANRGRQMLTEGPKSRSALPAGYFSQGFTRVMQGE-AYSDPIKIRRQRRLQEAKKNLGKALLPSSCPKKPSGLGSYYGTLGGPIPALSPAQLPKKPYSAPGRNFVTNPPKKGTGYGYPQVMIGSLPPYMP-DPYDRAKEIRRKEMETHKKMVKGAP-FRLNLHPLDIFDVNPYRSDQPARATHPPRATKSSEKAFKPSSPAKKMGGMKAGTFDSYPSHSEDAYI 258          
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Nematostella
Match: EDO34007 (Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7SQL2])

HSP 1 Score: 130.954 bits (328), Expect = 1.947e-35
Identity = 108/312 (34.62%), Positives = 156/312 (50.00%), Query Frame = 3
Query:   36 INYIMADLDRIGLFKELGYISINDPFKTVLINNFS-----GKNFVVPGSKVPCGNDDGYFNG-FQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETK-KYPYIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAG---RPTFIPPSTFCDYFTVNPFEGGSNAIKPVK---PQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKR--FLPNGNFPTYFPTQSVLQKNVTLHCNNTN 926
            +N +  DL+R+GLF ELGYISI DP+K +   NF+     GK  +  GSK+     DGYF+  F RV + E   +  +  RR  RL+    N+GK ++PS+      G+G+  G+F   +PAF   TK K  ++   KN +TNP  KG GYGY  V     P+Y++ + Y+  + L KK  +    ++ G   +    P S    YF  NP++     + PVK      P  KPF  +  PK+ GGMKAG    +P+H   P      F  K   N  G+R  F P+   P   P +S++ +NV    N  N
Sbjct:    5 MNDVKNDLNRVGLFSELGYISIGDPYK-IQNGNFNLAAHKGKQMLPGGSKIRSSKQDGYFDQKFNRVMEGE-AFSDPVKRRRLDRLKATKLNIGKAFVPSNGDKLPSGIGNHYGTFAGAVPAFSPVTKSKSAHVSPGKNFLTNPPKKGTGYGYLQVTIGASPKYMS-DAYDRGKELRKKEMDTSGKQMKGGAFKLNLHPKS----YFDGNPYK-SDRPLPPVKDGRKAKPDFKPFKPSHPPKEIGGMKAGCFTSYPSHSEDP------FQPKKKKNGDGERKIFRPS-QGPKSTPMKSIINQNVDRRINRLN 301          
BLAST of Chromosome 4 open reading frame 47 vs. Planmine SMEST
Match: SMESG000042363.1 (SMESG000042363.1)

HSP 1 Score: 172.94 bits (437), Expect = 3.894e-51
Identity = 117/312 (37.50%), Positives = 168/312 (53.85%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISINDPFKTVLINN----FSGKNFVVPGSKVPCGNDDGYFNG-FQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYP-YIPQKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLY--KKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNAIKPVKPQIP---LEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDS-NYFMAKDFLNDKGKRFLPN-GNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
             D++R+GLFKE+GY +INDP+K    ++      GK F+   ++     DDGYF   F RVF+ E   T+ + +RR+WR +++ +N+GK +I S+   K  G GSR G     IP F   T+     +P KKN ITNPG KG GYGY++VC + YPE+ + + Y+    +Y  K+++E H  +   R  FIP + +  YF  NP+EGG    +    +      ++PFI  SYPK  GG K G L P+P   P  Y D     +  D  N+  K +    G+    F  QSVLQKNV    NN N   + +V
Sbjct:    5 TDMERVGLFKEMGYHTINDPYKPFFYSSRQDIIKGKGFLGDMTRSKAATDDGYFTKPFSRVFEGE-SETNLMAMRRRWRNQNKEKNVGKTFIMSNPGKKQSGTGSRMGCMDYMIPYFSGITRTVRGKLPNKKNFITNPGKKGSGYGYADVCLNPYPEWKS-DVYDGARNMYFRKEMAE-HQKRTLNRKEFIPTNAYGSYFYSNPYEGGMGPYEEKHGKTVNYLQKRPFIPGSYPKILGGNKDGCLTPYPPAFPAEYKDKFTRNLVHDVRNNTNKLYNHTPGSKSCIF--QSVLQKNVNFRVNNLNSHSIRNV 311          
BLAST of Chromosome 4 open reading frame 47 vs. Planmine SMEST
Match: SMESG000078789.1 (SMESG000078789.1)

HSP 1 Score: 158.688 bits (400), Expect = 1.302e-45
Identity = 108/307 (35.18%), Positives = 154/307 (50.16%), Query Frame = 3
Query:   51 ADLDRIGLFKELGYISINDPFKTVLINNFS-----GKNFVVPGSKVPCGNDDGYFNGFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYPYIP-QKKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEGGSNA--IKPVKPQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSNYFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV 947
             D++R+GLFKELGY +I DP+K+  +  F+     GK  +  G K    ++ GYF  F R+FD E   +  I + RK R E   + LGK+W+P +   + CG G+  G+F   IP F   +K+      Q KN  TNP  KG GYG+  V  +KYPEY +  KY+  + + K+ +E H SK+  R  F       DYF +NP+ G S    +  V  +    K F  +S  K DGGMKAG    FP +    Y         + +N  G+ F+P    P   P  SVLQ+N+       N  ++ SV
Sbjct:    5 TDMERVGLFKELGYHTIGDPYKSSALQQFNQSASKGKQVLPGGEKSKSASNVGYFQEFTRIFDGE-SYSDPIKLMRKRRNESSKKKLGKNWVPVNGIKESCGSGTYYGTFSGAIPHFEAISKEQRAKGIQGKNFYTNPAKKGVGYGFVGVTLNKYPEYQS-EKYDRSKDITKQENEEHKSKLHSRGVFRLGMHLRDYFDMNPYSGPSEKYRVTSVNSKSQNFKTFKPSSPSKLDGGMKAGCFGSFPEYSSETYKAKIPKRPVNVVNSSGRTFIPPIG-PKSRPVNSVLQQNIIRTITPANYRQIQSV 308          
BLAST of Chromosome 4 open reading frame 47 vs. Planmine SMEST
Match: SMESG000033217.1 (SMESG000033217.1)

HSP 1 Score: 88.5817 bits (218), Expect = 1.095e-19
Identity = 94/314 (29.94%), Positives = 138/314 (43.95%), Query Frame = 3
Query:   54 DLDRIGLFKELGYISINDPFKT-----VLINNFSGKNFVVPGSKVPCGNDDGYFNGFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLGSRDGSFQAHIPAFGNETKKYPYIPQ--KKNIITNPGPKGCGYGYSNVCFDKYPEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEG---GSNAIKPVKPQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPY---IDSNYFMAKDFLNDKGKRF--LPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSVY 950
            DL++ GLF+E  Y S  D +KT        ++  GK  V+  +K     + GYF+ F R F+ E   +  I IRR+    ++ +  GK +   S      GLGS  G+F   IPA   ET ++    +  K+N  TN   KG GYGY  + F+KYPEY +          Y  L      +   +P     S   D+F  NP++G        KPVK  +   K F V      +    + + N FP ++   Y   +  N    K+ +ND  + F  +P    P    T S+L KN+    N  N G V SV+
Sbjct:    8 DLEKFGLFQEFPYQS--DLYKTPGNFEFNASSAKGKQIVIESNKTKSALNHGYFSNFLRTFEGE-AYSDPIKIRRELNRPNKEKLAGKVFHLPSIPKNPEGLGSYYGTFSGPIPAMLGETIQFSKSDEIKKRNFYTNCSKKGTGYGYVGLTFNKYPEYKS--------EPYGILKSSEEKENQNKPFHTVKS--LDFFDSNPYKGSIFSEGKEKPVKENL---KSFHV------NTSHLSNSFNAFPKYENDVYDEKLKRNLPYIKNVMNDSKRTFGAIPG---PKTMRTFSILNKNIERSINAQNYGTVGSVF 296          
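The identity and positives percentages in the HSP headers above are simply the match counts divided by the alignment length. A minimal Python sketch, assuming two-decimal rounding as displayed on this page (the helper name `percent` is illustrative and not part of any BLAST tool), reproduces the values for the three Planmine SMEST hits:

```python
def percent(count: int, length: int) -> str:
    """Format count/length as a percentage with two decimals."""
    return f"{100 * count / length:.2f}%"

# (identities, positives, alignment length) copied from the HSP headers above.
HSPS = {
    "SMESG000042363.1": (117, 168, 312),
    "SMESG000078789.1": (108, 154, 307),
    "SMESG000033217.1": (94, 138, 314),
}

for name, (identities, positives, length) in HSPS.items():
    print(name, "identity", percent(identities, length),
          "positives", percent(positives, length))
```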
The following BLAST results are available for this feature:
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Human
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Human e!99)
Total hits: 3
Match Name  E-value    Identity  Description
C4orf47     1.378e-33  34.52     chromosome 4 open reading frame 47 [Source:HGNC Sy...
C4orf47     2.069e-21  37.66     chromosome 4 open reading frame 47 [Source:HGNC Sy...
C4orf47     2.274e-21  37.66     chromosome 4 open reading frame 47 [Source:HGNC Sy...
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Celegans
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Celegan e!99)
Total hits: 0
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Fly
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Drosophila e!99)
Total hits: 0
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Zebrafish
Analysis Date: 2016-08-08 (Schmidtea mediterranea smed_20140614 BLASTX Zebrafish e!99)
Total hits: 3
Match Name  E-value    Identity  Description
zgc:153146  2.323e-33  31.56     zgc:153146 [Source:NCBI gene;Acc:751702]
C14H4orf47  2.633e-32  31.46     zgc:153146 [Source:NCBI gene;Acc:751702]
C14H4orf47  2.633e-32  31.46     zgc:153146 [Source:NCBI gene;Acc:751702]
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Xenopus
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Xenopus e!99)
Total hits: 1
Match Name  E-value    Identity  Description
C4orf47     2.095e-33  35.60     chromosome 4 open reading frame 47 [Source:NCBI ge...
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Mouse
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX Mouse e!99)
Total hits: 3
Match Name     E-value    Identity  Description
1700029J07Rik  6.606e-41  35.48     RIKEN cDNA 1700029J07 gene [Source:MGI Symbol;Acc:...
1700029J07Rik  6.606e-41  35.48     RIKEN cDNA 1700029J07 gene [Source:MGI Symbol;Acc:...
1700029J07Rik  6.160e-10  36.00     RIKEN cDNA 1700029J07 gene [Source:MGI Symbol;Acc:...
BLAST of Chromosome 4 open reading frame 47 vs. UniProt/SwissProt
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI UniProt)
Total hits: 5
Match Name             E-value    Identity  Description
sp|Q3U1D9|CD047_MOUSE  4.621e-40  35.48     UPF0602 protein C4orf47 homolog OS=Mus musculus OX...
sp|Q2T9M0|CD047_BOVIN  1.732e-34  34.52     UPF0602 protein C4orf47 homolog OS=Bos taurus OX=9...
sp|A7E2U8|CD047_HUMAN  6.619e-33  34.52     UPF0602 protein C4orf47 OS=Homo sapiens OX=9606 GN...
sp|Q5XHC1|CD047_XENLA  7.802e-33  36.27     UPF0602 protein C4orf47 homolog OS=Xenopus laevis ...
sp|Q0P4C5|CD047_DANRE  1.712e-30  30.94     UPF0602 protein C4orf47 homolog OS=Danio rerio OX=...
BLAST of Chromosome 4 open reading frame 47 vs. TrEMBL
Analysis Date: 2020-05-01 (Schmidtea mediterranea smed_20140614 BLASTX EMBL-EBI TrEMBL)
Total hits: 5
Match Name  E-value    Identity  Description
A0A267FK75  1.634e-50  36.39     Uncharacterized protein OS=Macrostomum lignano OX=...
A0A183B436  5.770e-42  32.80     Uncharacterized protein OS=Echinostoma caproni OX=...
A0A158R8P4  1.046e-41  34.77     Uncharacterized protein OS=Taenia asiatica OX=6051...
A0A158QUH3  3.001e-41  37.87     Uncharacterized protein OS=Mesocestoides corti OX=...
H2KV45      8.435e-41  33.02     UPF0602 protein C4orf47 homolog OS=Clonorchis sine...
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Cavefish
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Cavefish e!99)
Total hits: 1
Match Name  E-value    Identity  Description
zgc:153146  4.203e-38  35.44     chromosome 24 C4orf47 homolog [Source:NCBI gene;Ac...
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Sea Lamprey
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Sea Lamprey e!99)
Total hits: 1
Match Name  E-value    Identity  Description
zgc:153146  3.660e-37  37.80     pep scaffold:Pmarinus_7.0:GL477013:54605:76454:-1 ...
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Yeast
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Yeast e!Fungi46)
Total hits: 0
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Nematostella
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Nematostella e!Metazoa46)
Total hits: 1
Match Name  E-value    Identity  Description
EDO34007    1.947e-35  34.62     Predicted protein [Source:UniProtKB/TrEMBL;Acc:A7...
BLAST of Chromosome 4 open reading frame 47 vs. Ensembl Medaka
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Medaka e!99)
Total hits: 0
BLAST of Chromosome 4 open reading frame 47 vs. Planmine SMEST
Analysis Date: 2020-05-08 (Schmidtea mediterranea smed_20140614 BLASTX Planmine SMEST)
Total hits: 3
Match Name        E-value    Identity  Description
SMESG000042363.1  3.894e-51  37.50     SMESG000042363.1
SMESG000078789.1  1.302e-45  35.18     SMESG000078789.1
SMESG000033217.1  1.095e-19  29.94     SMESG000033217.1
Sequences
The following sequences are available for this feature:

transcript sequence

>SMED30029737 ID=SMED30029737|Name=Chromosome 4 open reading frame 47|organism=Schmidtea mediterranea sexual|type=transcript|length=1073bp
AATAATCATTTTTTTTTCTAAAAAATCAATAAAATATTAATTATATAATG
GCAGACTTGGACAGGATTGGGTTGTTTAAAGAGCTGGGATATATTTCCAT
CAATGACCCTTTCAAAACAGTTTTAATAAATAATTTTTCCGGAAAAAATT
TTGTCGTTCCAGGATCTAAAGTTCCATGTGGAAATGATGACGGCTACTTC
AATGGATTTCAAAGGGTTTTCGATAATGAAAGGGGAATTACTAGTTTTAT
TAATATCCGTAGAAAATGGCGATTAGAGGATAGATTGAGAAACCTTGGGA
AAGATTGGATTCCAAGTAGCAACACTGTAAAATTGTGTGGATTAGGCAGT
CGGGACGGCTCGTTTCAAGCCCACATTCCGGCATTTGGTAATGAAACGAA
AAAATACCCATATATCCCCCAGAAAAAGAACATAATCACAAATCCAGGCC
CAAAAGGCTGCGGGTACGGTTACAGCAATGTTTGCTTTGATAAATACCCG
GAATACATCACAGGAAATAAGTATAACGATTTTGAAACGCTATATAAAAA
ATTGAGTGAATATCATTTTTCAAAAGTAGCTGGAAGGCCAACTTTCATAC
CACCCAGCACGTTTTGTGACTATTTCACTGTTAATCCATTCGAAGGAGGA
AGCAATGCTATCAAACCAGTAAAACCTCAAATACCTTTGGAAAAACCATT
TATAGTAGCGTCTTATCCGAAGAAAGATGGCGGAATGAAAGCGGGAACAT
TAAATCCATTTCCAGCTCATCAACCTTGTCCTTATATAGATTCAAATTAT
TTTATGGCAAAAGACTTCTTAAATGATAAAGGAAAAAGATTTTTACCAAA
TGGAAACTTTCCAACATACTTTCCTACTCAATCAGTGTTACAGAAAAATG
TTACATTACATTGTAATAATACTAACATAGGAAAAGTATCAAGTGTTTAC
TGACATCTGCAATGTTACTGGATGAAATTAGATGAAAAATACGTGAATGT
ATGTAAAACAAATTTTGTATGCAAATACACACATTGAGTTAAAATTAATT
GTGTGTTTGAGTTGTGATTGGAG

protein sequence of SMED30029737-orf-1

>SMED30029737-orf-1 ID=SMED30029737-orf-1|Name=SMED30029737-orf-1|organism=Schmidtea mediterranea sexual|type=polypeptide|length=302bp
MADLDRIGLFKELGYISINDPFKTVLINNFSGKNFVVPGSKVPCGNDDGY
FNGFQRVFDNERGITSFINIRRKWRLEDRLRNLGKDWIPSSNTVKLCGLG
SRDGSFQAHIPAFGNETKKYPYIPQKKNIITNPGPKGCGYGYSNVCFDKY
PEYITGNKYNDFETLYKKLSEYHFSKVAGRPTFIPPSTFCDYFTVNPFEG
GSNAIKPVKPQIPLEKPFIVASYPKKDGGMKAGTLNPFPAHQPCPYIDSN
YFMAKDFLNDKGKRFLPNGNFPTYFPTQSVLQKNVTLHCNNTNIGKVSSV
Y*
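
The BLASTX results on this page report "Query Frame = 3", and the ORF above begins at the first ATG of the transcript (nucleotides 48 to 50). The Python sketch below translates the first 100 nucleotides of the transcript in frame 3 and maps BLASTX query coordinates onto ORF residue numbers. It assumes the standard genetic code (NCBI translation table 1); the names `translate`, `residue_span` and `ORF_START` are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# First two 50-nt lines of the SMED30029737 transcript shown above.
TRANSCRIPT_PREFIX = (
    "AATAATCATTTTTTTTTCTAAAAAATCAATAAAATATTAATTATATAATG"
    "GCAGACTTGGACAGGATTGGGTTGTTTAAAGAGCTGGGATATATTTCCAT"
)

# 1-based position of the A in the start codon (read off the sequence).
ORF_START = 48


def translate(seq: str, frame: int) -> str:
    """Translate seq in forward frame 1, 2 or 3; '*' marks a stop codon."""
    offset = frame - 1
    return "".join(
        CODON_TABLE[seq[i:i + 3]] for i in range(offset, len(seq) - 2, 3)
    )


def residue_span(query_start: int, query_end: int) -> tuple[int, int]:
    """Convert 1-based nucleotide query coordinates to 1-based ORF residues."""
    first = (query_start - ORF_START) // 3 + 1
    last = (query_end - ORF_START + 1) // 3
    return first, last


# The frame-3 translation contains stops upstream of the ORF and
# ends with the first residues of SMED30029737-orf-1.
print(translate(TRANSCRIPT_PREFIX, 3))
# Query 51..947 (the SMESG000042363.1 HSP) as ORF residue numbers.
print(residue_span(51, 947))
```

Under these assumptions, the SMESG000042363.1 alignment (query 51 to 947) covers ORF residues 2 to 300, and the SMESG000033217.1 alignment (query 54 to 950) covers residues 3 to 301.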
Annotated Terms
The following terms have been associated with this transcript:
Vocabulary: Planarian Anatomy
Term           Definition
PLANA:0000208  testis
PLANA:0002089  reproductive organ
Vocabulary: INTERPRO
Term       Definition
IPR029358  DUF4586
Vocabulary: molecular function
Term        Definition
GO:0003674  molecular_function
Vocabulary: cellular component
Term        Definition
GO:0005737  cytoplasm
GO:0005813  centrosome
GO:0005815  microtubule organizing center
GO:0005856  cytoskeleton
Vocabulary: biological process
Term        Definition
GO:0008150  biological_process
InterPro
Analysis Name: Schmidtea mediterranea smed_20140614 Interproscan
Date Performed: 2020-05-01
IPR Term   IPR Description                      Source   Source Term  Source Description  Alignment
IPR029358  Protein of unknown function DUF4586  PFAM     PF15239      DUF4586             coord: 3..285; e-value: 4.4E-59; score: 201.1
IPR029358  Protein of unknown function DUF4586  PANTHER  PTHR31144    FAMILY NOT NAMED    coord: 2..292