C16orf46

From Wikipedia, the free encyclopedia
C16orf46 protein
Identifiers
Aliases
External IDsGeneCards: [1]
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

n/a

n/a

RefSeq (protein)

n/a

n/a

Location (UCSC)n/an/a
PubMed searchn/an/a
Wikidata
View/Edit Human
3D Rendering of C16orf46.[1]

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.[2]

Gene[edit]

An alternative name for this gene is FLJ32702, however it is most commonly referred to as C16orf46.[3]

Location[edit]

The C16orf26 gene is found on chromosome 16q23.2 negative strand.[4] The promoter region is 1152 base pairs long.[5] It has three exons, one from 1-380 bp, the second from 381 to 1254 bp, and the third from 1255 to 1982 bp.[2]

Expression[edit]

C16orf46 is broadly expressed in the testis and thyroid as well as 18 other tissues.[4] These tissue expression patterns are found to be low to moderate (25-50%).[6] When looking at tissue profiles, the highest expression is in the adult mammalian kidney, liver, prefrontal cortex, cerebellum, heart, and brain.[7]

Protein[edit]

Immunofluorescence for C16orf46 in a rabbit.[8]

Protein Analysis[edit]

The full C16orf46 protein is 417 amino acids long.[9] It has no isoforms, and its most distant ortholog, Rhincodon typus (whale shark), also has no known isoforms.[10] The molecular weight was found to be 45.8 kdal.[11] The isoelectric point is 7.4, average for all proteins, and C16orf46 is electrically neutral.[12]

C16orf46 is predicted to be found in the nucleus by all orthologs.[13]

The secondary structure of C16orf46 has alternating alpha helices and beta sheets.[14]

Protein Level Regulation[edit]

In C16orf46, there is N-linked glycosylation, O-linked glycosylation, and SUMOylation.[15][16]

There are phosphorylation sites found with the kinases CKII, CKI, PKC, and cdc2.[17]

A coronavirus cleavage site is predicted at the 235 amino acid position.[18] There are also tyrosine motif locations between amino acids 42-45 and 251–252.[19]

Transcript Level Regulation[edit]

mRNA folding on the 5' UTR predicts a stem loop twice in the area between base pairs 47–90.[20]

Homologs[edit]

Orthologs[edit]

C16orf46 has over 50 orthologs ranging from primate to chordate.[21] The table below shows a representation of the diversity of C16orf46 by listing a selection of orthologs found using NCBI. When C16orf46 Homo sapiens was run through a multiple alignment sequence program, Clustal Omega, against 20 true orthologs and 16 distant orthologs, Trp74 and Pro212 were found to be conserved in all.[22]

Species Common Name Divergence (MYA) Accession Number Identity
Homo sapiens Humans --- XP_016878405.1 100.0%
Ochotona princeps American Pika 90 XP_004584265.1 52.7%
Octodon degus Common Degu 90 XP_003434773.2 47.8%
Ursus maritimus Polar Bear 96 XP_008687958.1 67.5%
Leptonychotes weddellii Weddell Seal 96 XP_006748170.1 67.2%
Canis lupus Gray Wolf 96 XP_003434773.2 65.8%
Pteropus vampyrus Large Flying Fox 96 XP_011354946.1 63.5%
Sus scrofa Wild Boar 96 XP_020952705.1 61.5%
Bos indicus Zebu 96 XP_019835282.1 60.2%
Erinaceus europaeus European Hedgehog 96 XP_007516703.1 56.7%
Loxodonta africana African Bush Elephant 105 XP_010596137.1 60.9%
Sarcophilus harrisii Tasmanian Devil 159 XP_003757901.1 43.1%
Apteryx australis Southern Brown Kiwi 312 XP_013796688.1 18.5%
Aptenodytes forsteri Emperor Penguin 312 XP_019327074.1 17.4%
Chelonia mydas Green Sea Turtle 312 XP_007059324.1 29.7%
Gekko japonicus Gekko Japonicus 312 XP_015261305.1 25.3%
Nanorana parkeri High Himalaya Frog 352 XP_018410908.1 22.4%
Pygocentrus nattereri Red Bellied Piranha 435 XP_017578196.1 21.2%
Lepisosteus oculatus Spotted Gar 435 XP_015223705.1 20.6%
Callorhinchus milii Australian Ghost Shark 473 XP_007887408.1 22.7%

Paralogs[edit]

C16orf46 has no known paralogs.[21]

Mutations[edit]

C16orf46 has been compared against Fibrinogen, a protein which mutates rapidly, and Cytochrome C, a protein which mutates slowly.

As can be seen below, when multiple species of the three proteins were plotted, C16orf46 more closely resembled that of Fibrinogen than Cytochrome C, suggesting a possible rapid mutation.[21]

The trend of C16orf46, as compared to Fibrinogen and Cytochrome C, suggests faster mutation rates as it diverges from Homo sapiens.

Interacting Proteins[edit]

C16orf46 interacts with FAT3 which has been linked to neurite interactions during development.[23] C16orf46 is thought to have coexpression with the PLAC8L1 and CFAP43 gene, both of unknown function.[24]

Clinical Significance[edit]

There are higher levels of C16orf46 expression in pancreatic adenocarcinoma tumor epithelia tissue compared to the control.[25] There is also higher gene expression in patients with small-cell carcinoma compared to the control.[26]

References[edit]

  1. ^ "I-TASSER results". zhanglab.ccmb.med.umich.edu. Retrieved 2018-05-07.[permanent dead link]
  2. ^ a b "Gene: C16orf46 (OTTHUMG00000137629) - Summary - Homo sapiens - Vega Genome Browser 68". vega.archive.ensembl.org. Retrieved 2018-05-07.
  3. ^ Database, GeneCards Human Gene. "C16orf46 Gene - GeneCards | CP046 Protein | CP046 Antibody". www.genecards.org. Retrieved 2018-05-07.
  4. ^ a b "C16orf46 Symbol Report | HUGO Gene Nomenclature Committee". www.genenames.org. Retrieved 2018-05-01.
  5. ^ "Genomatix - NGS Data Analysis & Personalized Medicine". www.genomatix.de. Archived from the original on 2001-02-24. Retrieved 2018-05-07.
  6. ^ geo. "Home - GEO - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  7. ^ "Gene: C16orf46 - ENSG00000166455". bgee.org. Retrieved 2018-05-07.
  8. ^ "Anti-C16orf46 antibody produced in rabbit HPA041136". Immunohistochemistry, Immunofluorescence. Retrieved 2018-05-07.
  9. ^ "uncharacterized protein C16orf46 isoform X1 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  10. ^ "uncharacterized protein C16orf46 homolog [Rhincodon typus] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  11. ^ Kozlowski, Lukasz P. "CALCULATION OF PROTEIN ISOELECTRIC POINT". isoelectric.org. Retrieved 2018-05-07.
  12. ^ EMBL-EBI. "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2018-05-06.
  13. ^ "PSORT WWW Server". psort.hgc.jp. Retrieved 2018-05-07.
  14. ^ "Bioinformatics Toolkit". toolkit.tuebingen.mpg.de. Retrieved 2018-05-07.
  15. ^ "NetNGlyc 1.0 Server". www.cbs.dtu.dk. Retrieved 2018-05-07.
  16. ^ "NetOGlyc 4.0 Server". www.cbs.dtu.dk. Retrieved 2018-05-07.
  17. ^ "NetPhos 3.1 Server". www.cbs.dtu.dk. Retrieved 2018-05-07.
  18. ^ "NetCorona 1.0 Server". www.cbs.dtu.dk. Retrieved 2018-05-07.
  19. ^ "Human Protein Reference Database". www.hprd.org. Archived from the original on 2006-04-24. Retrieved 2018-05-07.
  20. ^ "The Mfold Web Server | mfold.rit.albany.edu". unafold.rna.albany.edu. Retrieved 2018-05-07.
  21. ^ a b c "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  22. ^ EMBL-EBI. "Clustal Omega < Multiple Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2018-05-07.
  23. ^ Lab, Mike Tyers. "BioGRID | Database of Protein, Chemical, and Genetic Interactions". thebiogrid.org. Retrieved 2018-05-07.
  24. ^ "C16orf46 protein (human) - STRING interaction network". string-db.org. Retrieved 2018-05-07.
  25. ^ "GDS4103 / 230281_at". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  26. ^ "GDS4794 / 230281_at". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.