User:Wanederner
Figure 1. NCBI VAST depiction of TMEM14C with transmembrane regions.
TMEM14C
TMEM14C or transmembrane protein 14C, is a transmembrane protein gene found in Homo sapiens, as well as many other animal species spanning from vertebrates to invertebrates, along with various plant species1. Gene TMEM14C is a 1,187 base pair gene that encodes a 112 amino acid transmembrane protein found in the genome of Homo sapiens, along with many other orthologs. The gene is located on the petite arm of chromosome 6, in region 2, band 4, and sub-band 2 (6p24.2), between PAK1IP1 and TMEM14B (a paralog of TMEM14C). It has 6 exons and is highly expressed in fat, adrenal, and 25 other tissue types in Homo sapiens, with the highest expression occurring in fat and lowest measured expression in the pancreas1. It has 5 aliases, including C6orf53, BA421M1.6, HSPC194, MSTP073, and NET26 One pseudogene has been discovered, TMEM14EP, which is located at 3q25.2 and could be involved in overexpression of MTBP cells, which is associated with poor survival in lung adenocarcinoma. TMEM14C is conserved across many Eukaryotic orthologs, spanning from Metazoa (Alligator mississippiensis) to Bacteria (Chlamydia trachomatis)1 Google Patents revealed no active patents for TMEM14C, but when searching for C6orf53 (an alias), four patents are active with possible linkage to colorectal cancer, based on mass value of associated markers6.
Transcripts
TMEM14C has one known transcript variant, designated TMEM14C transcript variant 2. Both gene variants encode the same protein and have 6 exons, but variant 2 has 171 less base pairs in the 5’ Untranslated Region of Exon 1 than variant 1.1
Proteins
Molecular weight data analysis suggests TMEM14C to weight about 23 kdal . However, when a SAPS protein composition analysis was done, a molecular weight of 11.6 kdal was found. Interestingly, the two differing values are about a magnitude of 2 in difference, but due to the antibody being the only one found for TMEM14C in research, it is more probable that the correct molecular weight was determined by SAPS. No isoelectric point data was found during research on protein analysis.
There appears to be significantly high relative G, M and AGP content, and significantly low relative E, D, and T content.
I-Tasser depicts a very long and filamentous structure containing loose and flexible alpha helices. This prediction is supported by the amino acid composition analysis data found in SAPS, as TMEM14C is extremely high in Glycine content. This is important as Glycine is the simplest amino acid, and a high G content would suggest loose and flexible alpha helix composition in structure. Figure depicts a bulbous tertiary structure for the protein, which appears very bulky due to the high transmembrane region content of the protein, which prevents a very loose or lengthy structure. Colors separate chemical makeup of amino acids present in structure, with no particular indication or significance of color makeup.
Gene Level Regulation
Promoter
Homo sapiens TMEM14C Promoter
ZF07
CTTACTCCCCATCCCCCTTTAAGATGCTTCTCAAGCAGCCTCACTCCCTA 50 GAATGAGGGGTAGGGGGAAATTCTACGAAGAGTTCGTCGGAGTGAGGGAT
KLFS FKHD NF1F
AAACAAACGGATATTTGGCTTTGAGCCACATTATTCTGCAGCCCCCGTAT 100 TTTGTTTGCCTATAAACCGAAACTCGGTGTAATAAGACGTCGGGGGCATA
NF1F STAT
TTTCTTCCAGGCAGGCCCTTCCTGGTGTAGACAAGATCGGGCTTGAGTGA 150 AAAGAAGGTCCGTCCGGGAAGGACCACATCTGTTCTAGCCCGAACTCACT
CDXF
CACTCCTCCCTGATGGCCTGCATCTGGTTTATACCCTGTAACTTGTTCCT 200 GTGAGGAGGGACTACCGGACGTAGACCAAATATGGGACATTGAACAAGGA
VTBP HBOX AP4R
CTATTAATGGGGTCCTTCAAAATCCAGCCTCAGATTCCCTGGTCCCGCAG 250 GATAATTACCCCAGGAAGTTTTAGGTCGGAGTCTAAGGGACCAGGGCGTC
HBOX
AHRR CEBP
CGGTGGCCTCACCTCTGGCGTGGCCGAGCTCACGTGGTCCGGCTTGTGCA 300 GCCACCGGAGTGGAGACCGCACCGGCTCGAGTGCACCAGGCCGAACACGT
HESF
GLIF
AGTCCCAGGTCCAACTCCGGGTCTCCTGCTTTTGGCCACTCAGGATTGGA 350 TCAGGGTCCAGGTTGAGGCCCAGAGGACGAAAACCGGTGAGTCCTAACCT
CLOX
GATA Start of Txn ETSF
CCTGGGACTGATACTGGTCGGCCCTGCAGGCGCTGCGGACAGGGGAAGCA 400 GGACCCTGACTATGACCAGCCGGGACGTCCGCGACGCCTGTCCCCTTCGT
INRE
NACA ZF02 CAGAGATTCCCCGCCGCGTTCCCTGGACTCAGGAGCTCGCCGCGATGCCC 450 GTCTCTAAGGGGCGGCGCAAGGGACCTGAGTCCTCGAGCGGCGCTACGGG
MZF1
CGCCCCACTCTCCACCCGCTGA 472 GCGGGGTGAGAGGTGGGCGACT
SP1F
Figure 2. TMEM14C promoter with annotated restriction enzymes. Data gathered from Genomatix. Expression Pattern
Figure 3. RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes.1
Figure 4. Transcription profiling by high throughput sequencing of individual and mixture of 16 human tissues RNA.1
Figure 5. 35 human fetal samples from 6 tissues (3 - 7 replicates per tissue) collected between 10 and 20 weeks gestational time were sequenced using Illumina TruSeq Stranded Total RNA.1
It appears that TMEM14C has high expression among many tissues, and does not appear to be developmentally specific, as expression is heavily present in both fetal and adult tissue samples
Figure 6. Microarray Data of TMEM14C Tissue Expression.
This expression chart of three different samples for each major human tissue shows a very high percentile rank of each tissue, ranging from about 80% to roughly 95%, indicating TMEM14C’s significance as a gene within all tissue types. Although some tissues have low counts compared to others, the abundance relative to other genes found in the same tissue types is very high.
Transcript Level Regulation
mRNA localization
k = 9/23
33.3 %: endoplasmic reticulum
22.2 %: Golgi
22.2 %: cytoplasmic
11.1 %: nuclear
11.1 %: mitochondrial
Figure 12. PSORTII mRNA localization analysis sorted by location.
It appears that the mRNA is most likely localized to the Endoplasmic Reticulum, which in the case of TMEM14C, indicates the protein anchoring in the plasma membrane of the ER, with its cytosolic portions in the lumen of the ER, and its non-cytosolic portions outside of the ER.
miRNA Targeting
No observable miRNA were found in a TargetScan program run of TMEM14C.
Stem Loop Structure
Figure 7. Stem Loop structures of TMEM14C. Protein Level Regulation
Human TMEM14C annotated conceptual translation (Variant 1, NM_001165258.1)
gccctgcaggcgctgcggacaggggaagcacagagattccccgccgcgttccctggactc 60
aggagctcgccgcgatgccccgccccactctccacccgctgaatgcagggcgcatgctgc 120
tacttggcggctcaagccccgcccgcaccgtccccattctctgaccgcccctctcccggt 180
acactgcgcaggcacaacagagccgctcccctctcctcgccccgccaccgggacggagag 240
cgcccgccgctgcatttccggcgacacctcgcagtcattcctgcggcttgcgcgcccttg 300 conserved
tagacagccggggccttcgtgagaccgcttgttttctgcaggtgcaggcctggggtagtc 360 exon 1/2upstream stop
tcctgtctggacagagaagagaaaaatgcaggacactggctcagtagtgcctttgcattg 420 exon 2/3
M Q D T G S V V P L H W 12 cytoplasmic yinoyang
start
gtttggctttggctacgcagcactggttgcttctggtgggatcattggctatgtaaaagc 480 signal peptide cleavage
F G F G Y A A L V A S G G I I G Y V K A 32 trans.region noncytoplamic
sumoylation
aggcagcgtgccgtccctggctgcagggctgctctttggcagtctagccggcctgggtgc 540 exon 3/4
G S V P S L A A G L L F G S L A G L G A 52 trans.region nuclear export
ttaccagctgtctcaggatccaaggaacgtttgggttttcctagctacatctggtacctt 600 exon 4/5
Y Q L S Q D P R N V W V F L A T S G T L 72 cytoplasmic yinoyang
phosphorylation
ggctggcattatgggaatgaggttctaccactctggaaaattcatgcctgcaggtttaat 660
A G I M G M R F Y H S G K F M P A G L I 92trans.region noncytoplasmic
phosphorylation
tgcaggtgccagtttgctgatggtcgccaaagttggagttagtatgttcaacagacccca 720 exon 5/6
A G A S L L M V A K V G V S M F N R P H 112 8transregion sumoylation
tagcagaagtcatgttccagcttagactgatgaagaattaaaaatctgcatcttccact 780 stop
*
attttcaatatattaagagaaataagtgcagcatttttgcatctgacattttacctaaaa 840
aaaaagacaccaaacttggcagagaggtggaaaatcagtcatgattacaaacctacagag 900
gtggcgagtatgtaacacaagagcttaataagaccctcatagagcttgattcttgtatat 960
tgatgttgtcttttctttctgtatctgtaggtaaatctcaagggtaaaatgttaggtgtc 1020
agctttcagggctctgaaaccccattccctgctctgaggaacagtgtgaaaaaaagtctt 1080
ttaggagatttacaatatctgttcttttgctcatcttagaccacagactgactttgaaat 1140
tatgttaagtgaaatatcaatgaaaataaagtttactataaataataaaaaaaaaaaaaa 1200 polyA
a 1201
Figure 8. Conceptual translation of TMEM14C protein regulatory features.
Post-Translational Modifications DAS-TMfilter http://mendel.imp.ac.at/sat/DAS/DAS.html 4 Transmembrane Regions found. GPS http://gps.biocuckoo.cn/online.php 487 predicted kinase-specific phosphorylation sites HelicalWheel https://grigoryanlab.org/drawcoil/drawcoil.pl Helical wheel representation HMMTOP http://www.enzim.hu/hmmtop/ N-terminus is outside of cell. NetGlycate http://www.cbs.dtu.dk/services/NetGlycate/ Predicted glycation of epsilon amino groups of lysines 85 and 102 NetNES http://www.cbs.dtu.dk/services/NetNES/ Predicted leucine-rich nuclear export signal at 47 L NetPhos http://www.cbs.dtu.dk/services/NetPhos-3.1/ Conserved phosphorylation sites at 71 T and 83 S Phobius http://phobius.sbc.su.se/ 4 Transmembrane regions and locations Predotar https://urgi.versailles.inra.fr/predotar/ ER predicted putative N-terminal targeting sequence PrePS http://mendel.imp.ac.at/sat/PrePS/index.html 3 predicted protein prenylation sites ProP http://www.cbs.dtu.dk/services/ProP/ Predicted signal peptide cleavage site between 18 A and 19 A PSORTII https://psort.hgc.jp/cgi-bin/runpsort.pl Subcellular localization appears to be primarily in ER. SecretomeP http://www.cbs.dtu.dk/services/SecretomeP/ One signal peptide predicted SOSUI http://harrier.nagahama-i-bio.ac.jp/sosui/ 4 Transmembrane helices found. SUMOplot https://www.abgent.com/sumoplot Predicted sumoylation sites at 31 K and 102 K TargetP http://www.cbs.dtu.dk/services/TargetP/ Subcellular location predicted in secretory pathway TatP http://www.cbs.dtu.dk/services/TatP/ 3 predicted Twin-arginine signal peptides TMHMM http://www.cbs.dtu.dk/services/TMHMM-2.0/ Transmembrane helices locations predicted YinOYang http://www.cbs.dtu.dk/services/YinOYang/ 4 T and 56 S are YINYANG sites Table 1. Findings from table are depicted in Figure 8.
Homology/Evolution
Genus Species
Common Name
Taxonomic group
Diverge-nce date (Estimat-ed)
Accession
Sequence Length (aa)
Identity
Similarity
Oryctolagus cuniculus
European Rabbit
Mammals
90 MYA
XP_002714217.1
114
92%
97%
Felis catus
Domestic Cat
Mammals
96 MYA
XP_003985861.1
114
89%
96%
Gallus gallus
Chicken
Bird
312 MYA
XP_015131459.1
107
74%
82%
Balearica regulorum gibbericeps
Grey Crowned Crane
Bird
312 MYA
XP_010300009.1
107
75%
83%
Ophiophagus hannah
King Cobra
Reptile
312 MYA
ETE69889.1
107
78%
89%
Anolis carolinensis
Green Anole
Reptile
312 MYA
XP_003223611.1
107
76%
86%
Xenopus tropicalis
Western Clawed Frog
Amphibian
352 MYA
NP_001072424.1
107
76%
87%
Erpetoichthys calabaricus
Reedfish
Bony Fish
435 MYA
XP_028646691.1
107
76%
87%
Denticeps clupeoides
Denticle Herring
Bony Fish
435 MYA
XP_028814336.1
107
76%
84%
Callorhinchus milii
Australian Ghost Shark
Cartilaginous Fish
473 MYA
NP_001279149.1
107
68%
85%
Nephila clavipes
Golden Silk Spider
Arthropods
797 MYA
PRD35861.1
124
51%
64%
Orussus abietinus
Parasitic Wood Wasp
Arthropods
797 MYA
XP_012283827.1
109
58%
70%
Pseudomyrmex gracilis
Elongate Twig Ant
Arthropods
797 MYA
XP_020285229.1
110
60%
74%
Exaiptasia pallida
Sea Anemone
Cnidaria
824 MYA
XP_020914767.1
114
54%
67%
Rhizopus azygosporus
N/A
Fungi
1105 MYA
RCI00732.1
100
57%
71%
Choanephora cucurbitarum
N/A
Fungi
1105 MYA
OBZ90553.1
104
52%
68%
Acanthamoeba castellanii str. Neff
N/A
Protist
1480 MYA
XP_004336177.1
149
51%
62%
Theobroma cacao
Cacao Tree
Plant
1496 MYA
EOY30104.1
234
47%
57%
Chlamydia trachomatis
Chlamydia
Bacteria
4290 MYA
CQB89653.1
110
48%
61%
Table 2. TMEM14C orthologs orgainized by relative divergence and clade type.
Phylogenetic Tree Figure 9. Unrooted phylogenetic tree of 20 orthologs. Species names can be found in Table 2.
Divergence
Figure 10. TMEM14C Ortholog Molecular Clock plot. Corrected % Divergence is plotted against Date of Divergence in Million Years Ago. (m/100 = -ln(1-n(100)). It appears that TMEM14C has diverged over time at about the same rate as Cytochrome c has. (0.0324/0.0314) It is a slow evolving gene.1
MSA (Distant)
Figure 11. Multiple sequence alignment of TMEM14C distant orthologs. Color represents conserved amino acid chemistry, and full species information can be found in Table 2. Conserved amino acids appear to be present mostly in transmembrane regions of TMEM14C.1, 14
Function
One PubMed article mentions the TMEM14C gene by name in the title, published as “TMEM14C is required for erythroid mitochondrial heme metabolism”. In the article, researchers claimed to have determined that the inner mitochondrial membrane protein is essential for erythropoiesis (red blood cell synthesis) and heme synthesis in vivo through gene expression profiling. Further support for heme biosynthesis is provided by an article that identified TMEM14C as a mitochondrial protein whose transcript coexpress the core mechanisms of heme biosynthesis consistently.
Another article depicts the NET26 gene alias as being a type 2 membrane protein having possible functions in both nuclear envelope and Endoplasmic Reticulum, but failing to appear in microsomal membranes (potentially due to microsomal membranes being devoid of nuclear envelope due to intact nuclei sediment readily via centrifugation.
Additionally, TMEM14C has been shown to interact with an LNX1 PDZ domain in mouse embryos, particularly in beta-galactosidase activity. LNX1 is a RING finger domain containing four PDZ domains, which are protein interaction domains found in signal transduction associated molecules. This activity suggests a role in glycosidic bond breakage in hydrolysis producing monosaccharides.
Interacting Proteins
Figure 12. STRING visualization of closely associated functional protein partners.
Table 3. Associated protein partners found in Figure 12.18
Clinical Significance Mutations (SNPs)
Table 6. SNPs of TMEM14C.
Content Disclaimer
Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.
- The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
- There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
- It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
- Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
- Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.