C3orf38

From Wikipedia, the free encyclopedia
C3orf38
Identifiers
AliasesC3orf38, chromosome 3 open reading frame 38
External IDsMGI: 1914859 HomoloGene: 27867 GeneCards: C3orf38
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_173824

NM_026273

RefSeq (protein)

NP_776185

NP_080549

Location (UCSC)Chr 3: 88.15 – 88.17 MbChr 16: 64.57 – 64.59 Mb
PubMed search[3][4]
Wikidata
View/Edit HumanView/Edit Mouse

Chromosome 3 open reading frame 38 (C3orf38) is a protein which in humans is encoded by the C3orf38 gene.

Gene[edit]

Figure depicting human chromosome 3 and the 3p11.1 location at which the C3orf38 gene is found. Image derived from GeneCards.[5]

The C3orf38 gene is located on chromosome 3 (3p11.1) on the forward strand.[5] It spans 18,771 bases from chr3:88,149,959-88,168,729.[5] It contains 3 exons.[6] Common aliases for this gene are MGC26717, LOC285237, and FLJ54270.[7] Some of the genes neighboring C3orf38 include ZNF654, CGGBP1, and LOC105377202.[8]

Transcripts[edit]

C3orf38 Transcripts
Protein Name Gene ID Transcript Accession Length (nt) Length (aa)
uncharacterized protein C3orf38 285237 NM_173824.4 2414 329
uncharacterized protein C3orf38 isoform X1 285237 XM_005264745.5 2356 328

Protein[edit]

Multiple sequence alignment of C3orf38 protein in humans and various orthologs showing DUF conservation. MSA created using BoxShade tools.

The C3orf38 protein is 329 amino acids in length.[9] A large domain of unknown function, DUF4518, encompasses majority of the C3orf38 protein.[9] This domain is a part of the protein family pfam15008, which is thought to be involved in apoptosis regulation.[10] This pfam15008 is the only member of the cl20886 superfamily.[10] While the C3orf38 protein does not have any abnormal amino acid abundance as a whole, the DUF4518 has a high abundance of histidines and a low abundance of serines, according to compositional analysis.[11] The predicted molecular weight of the entire C3orf38 protein is 37.0 kD and the isoelectric point is 6.01.[12] The DUF4518 contained inside the C3orf38 protein has a predicted molecular weight of 31 kD and an isoelectric point of 6.49.[12]

Regulation[edit]

Gene Level Regulation[edit]

There have been a number of potential promoters identified for the C3orf38 gene, which are described in the table below.[13]

Potential Promoters for the C3orf38 Gene[13]
Promoter Start End Length (bp) Transcripts
GXP_203118 88148634 88150046 1413 GXT_23216585, GXT_22791246, GXT_2803824, GXT_26239186
GXP_9795962 88148768 88149807 1040 no transcript assigned; promoter based on comparative genomics
GXP_9795963 88148794 88150027 1234 no transcript assigned; promoter based on comparative genomics
GXP_3194836 88149604 88150643 1040 GXT_24485561

The C3orf38 gene exhibits ubiquitous expression in human tissues.[14]

Normal human tissue expression profiling (HG-U95E) for the C3orf38 gene. Data pictured is captured from the NCBI GEO database.[14]

Protein Level Regulation[edit]

The C3orf38 protein is expected to be found with the highest confidence in the cytoplasm.[15] This finding is supported by examination of an array of C3orf38 orthologs.[15]

There are several well conserved post translation modification sites found amongst the human C3orf38 protein and its orthologs, which are depicted in the table below.[16] Majority of these PTMs are PKC phosphorylation sites.[16] Additionally, two confirmed active sites are located in the C3orf38 protein. The first is an aldehyde dehydrogenases glutamic acid active site located from amino acids 1-8.[16] The second site is a eukaryotic thiol (cysteine) proteases histidine active site located from amino acids 227-237.[16]

Predicted cellular localization of C3orf38 in humans and several orthologs. Localization predictions gathered from PSORT II Prediction tool.[15]
Conserved Post Translational Modification Sites
PTM Protein Location (aa)
Myristyl site 235-240
PKC phosphorylation site 34-36
PKC phosphorylation site 86-88
PKC phosphorylation site 199-201
PKC phosphorylation site 265-267

Homology/evolution[edit]

Orthologs for the C3orf38 protein can be found in mammals, reptiles, birds, amphibians, fish, and invertebrates using BLAST searches.[17] A selection of these orthologs can be found in the ortholog table below. There are no paralogs.[17] Additionally, by comparing sequences of C3orf38 protein with cytochrome C and fibrinogen alpha proteins, a moderate rate of evolution was determined for the C3orf38 protein.

C3orf38 Ortholog Table[17][18][19]
Genus, species Common Name Taxonomic Group Divergence Date (MYA) Accession Number Sequence Length (aa) Sequence Identity (%) Sequence Similarity (%)
Mammals Homo sapiens Human Primates 0 NP_776185.2 329 100 100
Pan paniscus Bonobo Primates 6.7 XP_003831564.1 329 99.4 99.7
Puma concolor Puma Carnivora 96 XP_025769652.1 348 79.8 86.6
Reptiles Mauremys reevesii Reeve's Turtle Testudines 312 XP_039379932.1 315 55.7 70.5
Chelonoidis abingdonii Abingdon Island Giant Tortoise Testudines 312 XP_032650981.1 304 55.4 69.9
Birds Strigops habroptila Kakapo Psittaciformes 312 XP_030327387.1 309 52.1 66.3
Taeniopygia guttata Zebra Finch Passeriformes 312 XP_002190058.5 306 51 63.9
Gallus gallus Chicken Galliformes 312 XP_004938363.2 312 44.2 59.9
Amphibians Rhinatrema bivittatum Two-Lined Caecilian Gymnophiona 351.8 XP_029434832.1 289 49.7 64.5
Bufo bufo Common Toad Anura 351.8 XP_040279187.1 289 43.9 62.1
Xenopus tropicalis Tropical Clawed Frog Anura 351.8 XP_017946806.1 261 38.6 54.8
Fish Chelmon rostratus Copperband Butterflyfish Perciformes 435 XP_041807133.1 302 42.7 58.2
Coregonus clupeaformis Lake Whitefish Salmoniformes 435 XP_041700482.1 308 42.4 60.6
Carcharodon carcharias Great White Shark Lamniformes 473 XP_041066710.1 308 45 59.8
Amblyraja radiata Thorny Skate Rajiformes 473 XP_032888490.1 382 32.5 46.5
Invertebrates Lytechinus variegatus Sea Urchin Temnopleuroida 684 XP_041465399.1 312 36.4 48.3
Patiria miniata Bat Star Valvatida 684 XP_038067113.1 294 34.1 46.2
Cryptotermes secundus Termite Blattodea 797 XP_023724689.1 296 30.1 48
Crassostrea virginica Eastern Oyster Ostreidae 797 XP_022335568.1 340 29.6 46.5
Diabrotica virgifera Western Corn Rootworm Coleoptera 797 XP_028133096.1 284 26.9 43.6
Acropora millepora Branching Stony Coral Scleractinia 824 XP_029194133.1 288 32.6 50.9
Figure showing an evolution rate graph comparing the C3orf38, cytochrome C, and fibrinogen alpha proteins. Noting the cytochrome C to be a relatively slow-evolving protein and the fibrinogen alpha to be a relatively fast-evolving protein, it is clear that C3orf38 protein evolves at a comparatively moderate rate.

Function[edit]

Although investigation into the function of the C3orf38 gene is ongoing, a couple studies have granted valuable insights into its role. One study has identified C3orf38 as a candidate proapoptotic gene.[20] Another study identified C3orf38 as a top candidate tumor suppressor gene (TSG).[21]

Interacting proteins[edit]

Of the various proteins C3orf38 protein interacts with, two are particularly interesting seeing as C3orf38 is a candidate proapoptotic and tumor suppressor gene. First, BAG family molecular chaperone regulator 4 (BAG4) is an anti-apoptotic protein that is known to interact with a number of apoptosis and growth-related proteins.[22] Second, DnaJ Heat Shock Protein Family Member B4 (DNAJB4) is a member of the heat shock protein-40 family (Hsp40), a molecular chaperone, and a tumor suppressor (specifically for colorectal carcinoma).[23]

References[edit]

  1. ^ a b c GRCh38: Ensembl release 89: ENSG00000179021Ensembl, May 2017
  2. ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000059920Ensembl, May 2017
  3. ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. ^ a b c "C3orf38". www.genecards.org. Archived from the original on 2011-11-29. Retrieved 2021-09-30.
  6. ^ "Homo sapiens chromosome 3 open reading frame 38 (C3orf38), mRNA". 2021-04-16. {{cite journal}}: Cite journal requires |journal= (help)
  7. ^ "AceView: Gene:C3orf38, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2021-09-30.
  8. ^ "C3orf38 chromosome 3 open reading frame 38 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-17.
  9. ^ a b "uncharacterized protein C3orf38 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-09-30.
  10. ^ a b "CDD Conserved Protein Domain Family: DUF4518". www.ncbi.nlm.nih.gov. Retrieved 2021-12-17.
  11. ^ "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2021-12-17.
  12. ^ a b "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2021-12-17.
  13. ^ a b "Genomatix Software Suite". Archived from the original on 2012-01-14.
  14. ^ a b "2928464 - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-18.
  15. ^ a b c "PSORT II Prediction". psort.hgc.jp. Retrieved 2021-12-18.
  16. ^ a b c d "Motif Scan". myhits.sib.swiss. Retrieved 2021-12-18.
  17. ^ a b c "Protein BLAST: search protein databases using a protein query". blast.ncbi.nlm.nih.gov. Retrieved 2021-12-17.
  18. ^ "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2021-12-17.
  19. ^ "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2021-12-17.
  20. ^ Park, Kyung Mi; Kang, Eunju; Jeon, Yeo-Jin; Kim, Nayoung; Kim, Nam-Soon; Yoo, Hyang-Sook; Yeom, Young Il; Kim, Soo Jung (2007-04-30). "Identification of novel regulators of apoptosis using a high-throughput cell-based screen". Molecules and Cells. 23 (2): 170–174. doi:10.1016/S1016-8478(23)07370-3. ISSN 1016-8478. PMID 17464193.
  21. ^ Cody, Neal A. L.; Shen, Zhen; Ripeau, Jean-Sebastien; Provencher, Diane M.; Mes-Masson, Anne-Marie; Chevrette, Mario; Tonin, Patricia N. (2009). "Characterization of the 3p12.3-pcen region associated with tumor suppression in a novel ovarian cancer cell line model genetically modified by chromosome 3 fragment transfer". Molecular Carcinogenesis. 48 (12): 1077–1092. doi:10.1002/mc.20535. ISSN 1098-2744. PMID 19347865. S2CID 10259832.
  22. ^ "BAG4". www.genecards.org. Archived from the original on 2011-11-27. Retrieved 2021-12-18.
  23. ^ "DNAJB4". www.genecards.org. Archived from the original on 2021-12-18. Retrieved 2021-12-18.