CCDC121

From Wikipedia, the free encyclopedia
CCDC121
Identifiers
AliasesCCDC121, coiled-coil domain containing 121
External IDsMGI: 2685601 HomoloGene: 136217 GeneCards: CCDC121
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001142682
NM_001142683
NM_024584

NM_207280

RefSeq (protein)

NP_001136155
NP_078860

n/a

Location (UCSC)Chr 2: 27.63 – 27.63 MbChr 1: 181.34 – 181.34 Mb
PubMed search[3][4]
Wikidata
View/Edit HumanView/Edit Mouse

Coiled-coil domain containing 121 (CCDC121) is a protein encoded by the CCDC121 gene in humans. CCDC121 is located on the minus strand of chromosome 2 and encodes three protein isoforms.[5] All isoforms of CCDC121 contain a domain of unknown function referred to as DUF4515 or pfam14988.

Gene[edit]

Aliases, locus and size[edit]

CCDC121 has known aliases of FLJ43364, FLJ13646, hCG_1988995, LOC79635, and coiled-coil domain containing 121.[6]

CCDC121 is located on the minus strand of chromosome 2 at 2q23.3. It is 3,394 base pairs in length.[5]

Isoforms and alternative splicing[edit]

CCDC121 produces four different mRNAs: three alternatively spliced variants and one unspliced form.[7] The three alternatively spliced mRNAs give rise to three known protein isoforms. Transcripts for isoforms 1-3 are 2,880, 2,762 and 2,361 base pairs in length respectively.[8][9][10] Each of the mRNA variants contains two exons separated by a gt-ag intron.[5]

Protein[edit]

Primary sequence, molecular weight and pI[edit]

Protein Accession Number Length (amino acids) Molecular Weight (kDa) Predicted pI
Isoform 1[11] XP_005264617 442 50.8 9.80
Isoform 2[12] NP_001136155 440 50.9 9.81
Isoform 3[13] NP_078860 278 33.1 9.84

Molecular weight and pI were calculated using ExPasy.[14]

Compositional analysis[edit]

Compositional analysis of all isoforms shows that they have below-average levels of aspartate (D) and valine (V) and above-average levels of glutamine (Q). In addition, they have above-average levels of lysine (K) and arginine (R) groupings. Isoform 3 also exhibits above-average lysine levels and below-average proline and glycine levels. Chimpanzee, dog, and ferret orthologs also exhibited above-average glutamine levels and lysine and arginine groupings.[15]

Secondary and tertiary structure[edit]

The secondary structure prediction for CCDC121 was obtained using Ali2D.[16] CCDC121 adopts a predominant alpha helical secondary structure (shown in red) due to the presence of the Coiled-coil motif.[17]

The tertiary structure of CCDC121 is composed mostly of alpha helices and contains some random coil.[18]

Domains, motifs and post-translational modifications[edit]

CCDC121 contains one domain of unknown function, DUF4515 or pfam14988. It also contains three predicted coiled-coil motifs from residues A165 to E192, L264 to E305 and N363 to E397.[19]

CCDC121 is predicted to have post-translational modification sites for: acetylation,[20][21] Protein Kinase C and Casein Kinase II phosphorylation,[22] glycation,[23] GalNAc O-glycosylation,[24] SUMOylation,[25][26] and O-β-GlcNAc attachment.[27]

Subcellular localization[edit]

Current evidence suggests that CCDC121 is partially localized in the nucleus. CCDC121 has a predicted nuclear localization signal from amino acids R327 to L337. This sequence has a score of 7, which is consistent with being a partial nuclear protein.[28] In addition, PSORT II found that there is 56.5% chance that CCDC121 is found in the nucleus.[29]

There is also evidence to suggest that CCDC121 is partially located in the cytosol. Cytochemistry studies of the Anti-CCDC121 antibody from The Human Protein Atlas indicate that CCDC121 is expressed in the cytosol and actin filaments. These tentative results are promising but further research of other anti-CCDC121 antibodies is needed.[30] Additionally, TargetP did not find a mitochondrial transfer peptide, which suggests that CCDC121 is likely not a mitochondrial protein.[31]

Expression and function[edit]

CCDC121 is expressed at the highest levels in the testes, ovaries, prostate, and thyroid.[32] It is expressed 40% less than the average gene so it is considered to have low levels of expression.[7]

The function of CCDC121 protein is not yet well understood by the scientific community. There is no known phenotype associated with the CCDC121 gene.[7]

Homology[edit]

Rate of molecular evolution[edit]

CCDC121 Rate of Molecular Evolution

Cytochrome c is a highly conserved protein and fibrinogen is a rapidly evolving protein. CCDC121 has a faster rate of molecular of evolution relative to both these proteins, suggesting that CCDC121 evolves very rapidly on an evolutionary timescale.

Orthologs[edit]

There are 126 confirmed orthologs of CCDC121.[33] CCDC121 orthologs are most abundant in mammals. 122 of the 126 orthologs are within the Eutheria, Marsupialia, and Monotremata clades. 119 of the 122 mammalian orthologs are found within eutherian mammals. The four remaining orthologs are the two-lined caecilian, the West Indian Ocean coelacanth, the electric eel, and the northern pike. These orthologs represent the Amphibia, Sarcopterygii, and Actinopterygii clades respectively. The CCDC121 gene likely appeared 433 million years ago in a common ancestor of Actinopterygii and Sarcopterygii.

Paralogs[edit]

CCDC166 is the only known paralog of CCDC121. They share a 23% sequence identity. Both CCDC121 and CCDC166 include the Domain of Unknown Function 4515 (DUF4515), or pfam14988, as a highly conserved sequence.[34]

Clinical significance[edit]

Mutations in the CCDC121 gene have been found in patients with certain cancers such as endometrial, lung, bladder, gastric/stomach, head/neck, and prostate cancers but no causal relationship has been determined.[35][36] CCDC121 may also serve as a marker gene for inner ear development.[37]

References[edit]

  1. ^ a b c GRCh38: Ensembl release 89: ENSG00000176714Ensembl, May 2017
  2. ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000050625Ensembl, May 2017
  3. ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. ^ a b c GeneCards entry on CCDC121
  6. ^ HGNC (HUGO Gene Nomenclature Committee) entry on CCDC121
  7. ^ a b c NCBI-Aceview entry on CCDC121
  8. ^ PREDICTED: Homo sapiens coiled-coil domain containing 121 (CCDC121), transcript variant X1, mRNA. NCBI Nucleotide. [1]
  9. ^ Homo sapiens coiled-coil domain containing 121 (CCDC121), transcript variant 2, mRNA. NCBI Nucleotide. [2]
  10. ^ Homo sapiens coiled-coil domain containing 121 (CCDC121), transcript variant 3, mRNA. NCBI Nucleotide. [3]
  11. ^ coiled-coil domain-containing protein 121 isoform X1 [Homo sapiens]. NCBI Protein. [4]
  12. ^ coiled-coil domain-containing protein 121 isoform 2 [Homo sapiens]. NCBI Protein. [5]
  13. ^ coiled-coil domain-containing protein 121 isoform 3 [Homo sapiens]. NCBI Protein. [6]
  14. ^ ExPASy Compute pI/MW tool
  15. ^ Statistical Analysis of Protein Sequences Tool
  16. ^ Secondary Structure prediction for CCDC121. Ali2D
  17. ^ Secondary Structure Prediction for CCDC121. Chou Fassman Secondary Structure Prediction Server (CFSSP). [7]
  18. ^ The Phyre2 web portal for protein modeling, prediction and analysis. Kelley LA et al.. Nature Protocols 10, 845-858 (2015) [8]
  19. ^ Coiled-Coils Prediction. Prediction for CCDC121 isoform 1
  20. ^ NETAcet-1.0 Server
  21. ^ Terminus--N-Terminal PTM prediction. Swiss Institute of Bioinformatics
  22. ^ NetPhos-3.1 Server. Prediction for CCDC121
  23. ^ NetGlycate-1.0 Server. Predition for CCDC121
  24. ^ NetOGlyc-4.0 Server. Prediction for CCDC121
  25. ^ SUMOplotTM Analysis Program. Prediction for CCDC121
  26. ^ GPS-SUMO: Prediction of SUMOylation Sites & SUMO-binding Motifs. Prediction for CCDC121. [9] Archived 2019-02-17 at the Wayback Machine
  27. ^ YinOYang 1.2 Server. Prediction for CCDC121
  28. ^ "NLS Mapper. Predition for CCDC121". Archived from the original on 2021-11-22. Retrieved 2020-05-03.
  29. ^ PSORT II Prediction Tool
  30. ^ Human Protein Atlas entry on CCDC121
  31. ^ TargetP-2.0 Server
  32. ^ NCBI GeoProfile entry on CCDC121. GDS3113 Various normal tissues
  33. ^ NCBI Gene Database entry on CCDC121
  34. ^ Clustal Omega: Multiple Sequence Alignment Tool. Alignment of CCDC166 and CCDC121 isoforms 1 and 2. [10]
  35. ^ Zhang J, Huang JY, Chen YN, Yuan F, Zhang H, Yan FH, et al. (October 2015). "Erratum: Whole genome and transcriptome sequencing of matched primary and peritoneal metastatic gastric carcinoma". Scientific Reports. 5: 15309. Bibcode:2015NatSR...515309Z. doi:10.1038/srep15309. PMC 4613365. PMID 26485306.
  36. ^ PhosphoSitePlus® entry on CCDC121 protein
  37. ^ Liu, Q., Chen, J., Gao, X., Ding, J., Tang, Z., Zhang, C., … Wang, J. (2015). Identification of stage-specific markers during the differentiation of hair cells from mouse inner ear stem cells or progenitor cells in vitro. International Journal of Biochemistry and Cell Biology, 60, 99–111. https://doi.org/10.1016/j.biocel.2014.12.024