Trisha Shetty (Editor)

C4orf29

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Symbol
  
C4orf29

HUGO
  
26111

UniProt
  
Q0P651

Entrez
  
80167

RefSeq
  
NP_001034806.1

Locus
  
Chr. 4 q28.2

Chromosome 4 open reading frame 29 (C4orf29) is a protein that in Homo sapiens is encoded by the C4orf29 gene.

Contents

Gene

C4orf29 is found on the positive strand of the human genome at 4q28.2. It is 74.4 kbp. The gene contains 17 exons. The longest mRNA transcript is composed of 13 exons and is 2200 base pairs.

Orthologs

Many orthologs to human C4orf29 have been discovered, with the most distant ortholog with high (over 90%) coverage is found in rice Oryza sativa. The protein is not found in fungi. Bacteria of the order Myxobacteria and genus Chitinimonas contain orthologous regions to the C4orf29 protein. The few bacterial homologs indicate a horizontal gene transfer event. The domain of unknown function, DUF2048, is conserved throughout orthologs.

Protein

C4orf29 codes a 414 amino acid sequence of 46.9 kDa in humans. The predicted isoelectric point is 9.37. The domain of unknown function, DUF2048, is found from amino acid residues 25 to 414 in the precursor C4orf29 protein. This domain is part of the alpha/beta hydrolase superfamily, which comprises enzymes that catalyze fat metabolism. Predicted post-translational modifications include glycosylation at residues Ser287 and Ser319 and sumoylation at the motifs Phe240 to Gly243, Ala377 to Asp340, and Phe408 to Gly411.

Expression

The protein product of C4orf29 in humans is predicted to be a secreted product. It is ubiquitously expressed at low to moderate levels. In humans, the protein is found at high levels the digestive tract and parathyroid gland. The homologous mouse protein 3110057O12Rik is expressed at high levels in the granule layer of the cerebellum.

Clinical significance

C4orf29 contains highly variable numbers of Alu repeats. A low number of Alu repeats in the human C4orf29 protein is associated with increase prevalence of hepatocellular carcinoma (HCC) in Asian populations. This information is used as a genetic marker to determine genetic risk of HCC. Swine muscle transcriptome analysis indicates high expression of C4orf29 in swine with extreme low levels of fatty acid composition.

References

C4orf29 Wikipedia