Representative illustration of bHLH motif binding to the consensus sequence on DNA
The upstream stimulatory factor (USF) gene encodes atranscription factor USF that belongs to theproto-oncogeneMYC family and features abasic helix-loop-helixleucine zipper (bHLH-LZ) motif in the protein structure.[7] USF was originally identified as regulating the major late promoters ofadenovirus, and recent research has further revealed its role in tissue protection.[7] The bHLH-LZ motif enables thetransactivation capacity of the USF protein through interacting with theinitiator element (Inr) andE-box motif on the bound DNA.[7][8] In the context of insulin and glucose-induced USF activities, those E-box motifs can act as a glucose-responsive element (GRE) and a part of thecarbohydrate response element (ChoRE) to interact with transcription factors.[8]
USF comprises two majorisoforms: USF1 andUSF2. The USF1 gene is located on the chromosome region 1q22-q23 in both humans and mice; the USF2 gene is located on the chromosome 19q13 in humans and chromosome 19q7 in mice, respectively.[9] Both USF1 and USF2 transcripts comprise 10exons and can undergo exon 4-excision duringalternative splicing.[7][9] From an auto-regulation perspective, these exon 4-excision products act as dominant negative regulators and are found to suppress USF-dependent gene expression.[7][9]
Comparison between USF1 and USF2 in transcripts and protein function
Although USF1 and USF2 share 70% of the amino acid sequence in their bHLH-LZ region, only 40% similarity is found in their full-length proteins. In addition, USF1 and USF2 exhibit different protein abundances in a cell type-specific manner.[7] It has been found that USF1 and USF2 expression increases during the differentiation oferythroid cells.[10] Despite the ubiquitous expression of both isoforms, USF1 and USF2 mediate different biological processes and functions in cells. While USF1 modulates metabolism,immune response, and tissue protection, USF2 primarily controlsembryonic development, brain function,iron metabolism, andfertility.[7] Structurally, the highly conserved bHLH-LZ structure on the C-terminus of USF yields high binding specificity and promotes the formation of USF1homodimers or USF1-USF2 heterodimers for DNA binding.[7][9][11] The USF-specific region (USR) on the N-terminal region, on the other hand, facilitates the nuclear translocation and activation of USF1.
The symmetrical E-box motif is the main target of bHLH-LZ transcription factors, and USF1 has a high binding affinity for the core sequence CACGTG in the motif.[9] USF1-DNA binding activity can be modulated by cell type-specific DNAmethylation andacetylation on the E-box motif or bypost-transcriptional modifications of the USF1 protein. For example,CpG methylation on the central E-box motif inhibits the complex formation of USF1 with its co-transcription factors and therefore decreases the corresponding gene expression in mouselymphosarcoma cells.[9] In contrast, phosphorylation of USF1 byp38 mitogen-activated protein kinases,protein kinase A, orprotein kinase C increases its binding to the E-box motif and activates gene transcription.[9]
Mitogen-activated protein kinase (MAPK) phosphorylates serine and threonine residues of substrate proteins and converts extracellular signals induced by growth factors, mitogens, or cytokines into intracellular phosphorylation cascades, which regulate cell proliferation, differentiation, stress responses, andapoptosis (programmed cell death).[7]
ERK and JNK signaling cascades
Phosphorylation by MAPKs induce a conformational change of the USF protein and exposes its DNA-binding domain for interaction. This increased structural exposure enhances DNA binding and therefore the transcriptional activity of USF.[13]
ERK1 (also known asMAPK3) and ERK2 (also known asMAPK1) phosphorylate USF1 in response to TFG-β signaling in vascular smooth muscle cells.[13] SMAD2 and SMAD3 signaling, following the TFG-β receptor activation, can also cooperate with EGFR / ERK pathways to activate USF1, which in turn regulates the gene expression ofplasminogen activator inhibitor-1 (PAI-1), a significant biomarker and predictor of cardiovascular disease-related death[13] and a marker of poor prognosis in breast cancer.[7]
Casein kinase 2 or CK-II (CK2) is a tetrameric enzyme composed of two catalytic and two regulatory subunits. In pancreatic cells, CK2 phosphorylates USF1,PDX1, andMST1 to suppress insulin expression.[14]
Transforming growth factor β 1 (TGF beta 1) is encoded by the TFGB1 gene, which contains an E-box within the promoter region and has been implicated in excessive extracellular matrix accumulation under a high-glucose condition.[8] Overexpression of either USF1 or USF2 is found to elevate the TFGB1 promoter activity in human embryonic kidney cells. However, only USF1 overexpression leads to increased TGF-β1 secretion.
Thrombospondin 1 (TSP1) is involved in the development ofdiabetic nephropathy. USF1/2 binds to the E-box motif (CAGATG) on the human THBS1 promoter and regulates high-glucose-induced TSP1 expression inmesangial cells.[8] USF2 overexpression has been found to augment THBS1 promoter activity and TSP1 expression. The resulting increase in TSP1 expression further promotes the formation of active TGF-β.[8]
factor (ATF) that bind to theAP-1 binding site on DNA.[16] cJun-cJun / cJun-cFos dimers preferentially bind to the phorbol12-O-Tetradecanoylphorbol-13-acetate (TPA)-responsive element (TRE region, TGACTCA), whereas cJun-ATF dimers and ATF homodimers preferentially bind to the cAMP-responsive element (CRE, TGACGTCA).[16] The AP-1 complex becomes activated in response to high glucose, oxidative stress,low-density lipoprotein(LDL) and oxidised LDL. It has been reported that a high glucose level upregulates USF and AP-1 binding activities, as well as the protein level of cFos.[8]
Interaction between USF1 and other transcription factors, includingSP1, PEA3 (also known asETV4) andMTF1, also leads to cooperative transcriptional regulation. For instance, theleucine zipper motif of USF1 recruits PEA3 to form a ternary complex and co-regulates the transcription ofBAX, an apoptosis regulator.[9] Another USF1-regulated target istopoisomerase III (hTOP3⍺), which catalyzes the topological changes of DNA, modifies DNA supercoil structures, and increases the chromatin accessibility for gene expression.[9] Similar interactions exist between USF1 andJMJD1C orH3K9demethylase, in which the molecular interactions change chromatin accessibility and elevate the transcription of a series of lipogenic genes, including those forfatty acid synthase (FASN),acetyl-CoA carboxylase (ACC),ATP citrate synthase (ACLY), andsterol regulatory element-binding protein 1 (SREBP1).[15]
General model for duplication of heterochromatin during cell division
Chromosomes are generally classified aseuchromatin orheterochromatin with distinct histone modifications, compaction levels, and the resulting gene expression patterns. Heterochromatin is a tightly condensed and transcriptionally repressed chromatin domain that is characterized by distinct combinations of histone post-translational modifications.[17] Heterochromatin is required for genome stability and gene expression regulation. However, it can spread into neighboring DNA regions and inactivate gene expression.[17][18] Chromosome boundary elements are thus necessary to block such stochastic spreads of heterochromatin and maintain stable gene expression.[19] USF1 and USF2 have been found to recruit various histone-modifying complexes, including the histone H3 methyltransferaseSet1 complex and the H4 arginine 3 methyltransferasePRMT1, with the latter known to establish active chromatin domains.[19] USF1/USF2 binding deposits a high level of activating histone modifications on adjacent nucleosomes and thus prevents the propagation of chromatin silencing modifications from the heterochromatin, such as H3K9 and K27 methylation.[19]
DNA histone modifications
Other USF1/USF2-related chromatin modifications include the recruitment of the E3 ubiquitin ligase,RNF20, to monoubiquitinatehistone H2B.[19] The loss of RNF20 is found to cause an extension of the silencing modifications from the 16kb heterochromatic domain into theβ-globin locus.[19] Moreover, USF1 and USF2 can bind to the 5' DNase I hypersensitive site HS4 and recruit an H3 acetyltransferase,PCAF, which blocks the heterochromatin spread into the β-globin locus.[18]
USF is known to bind the L-typepyruvate kinase promoter on DNA at high glucose and insulin levels. Excessive insulin activates kinases and phosphatases that post-translationally modify USF, sterol regulatory element-binding protein 1C (SREBP1C),carbohydrate-responsive element-binding protein (ChREBP), andLiver X receptor (LXRs).[11] With insulin stimulation, USF1 and USF2 bind to the E-boxes at -332 and -65 in the promoter region of FASN that encodesfatty acid synthase (FAS) for lipogenesis.[11]
Various post-translational modifications of USF1 determine its activity and signaling pathways, and can affect the lipogenesis process. An abnormal increase in the USF-mediatedde novo fatty acid synthesis is found to cause intracellular fatty acid accumulation and to deregulate gene expression and cellular processes such as tumor cell survival.[20]
In response to insulin elevation,DNA-protein kinase (DNA-PK) involved in DNA damage repair becomes dephosphorylated and activated.[11] The active form of DNA-PK indirectly phosphorylates USF1 at S262 throughAMP-activated protein kinase (AMPK). The S262 phosphorylation increases USF1 interaction with SREBP1C near the sterol regulatory element (SRE) and facilitates the synergistic activation of SREBP1C and transcription of the downstream lipogenic genes.[11]
USF1 S262 phosphorylation also recruitsPCAF to acetylate USF1 at the site K237. Both S262 phosphorylation and K237 acetylation enhance USF1 activities and the subsequent transcriptional activation of thefatty acid synthase gene (FASN).[11] Fatty acid synthase (FAS), together withacetyl-CoA carboxylase (ACC), producesmalonyl-CoA, converts it to long-chain fatty acids, and promotes thede novo fatty-acid synthesis for energy provision andprotein lipidation.[21][20]
USF1 modified with S262 phosphorylation an K237 acetylation also recruits BGR1 (also known asSMARCA4)-associated factor 60c (BAF60c).[15] BAF60c is then phosphorylated by atypicalprotein kinase C (aPKC) at S257, allowing it to form a LipoBAF complex at promoters of lipogenic genes to regulate chromatin structure and gene transcription.[15]
In contrast,HDAC9 deacetylates USF1 during cell fasting, prevents the recruitment of USF1-interacting factors, and suppresses the transcriptional activation of lipogenic genes.[11]
USF1 transcription undergoes active dynamics during cell meiosis, in which the USF1 mRNA first increases significantly when cells 2-8 are formed and then decreases to an undetectable level at the blastocyst stage, indicating its role in the embryo genome activation.[22] USF1 siRNA knockout has been shown to compromise the blastocyst rate and deregulate the transcripts oftwist-related protein 2 (increased) andgrowth differentiation factor-9 andfollistatin (decreased) by affecting their promoter-binding element E-box region duringoocyte maturation.[22]
Diabetic glomerulosclerosis shown by hematoxylin and eosin (HE) staining
Diabetic kidney disease (DKD) (ordiabetic nephropathy) is a progressivemicroalbuminuria disease with a slight loss of albumin in the urine (30–300 mg per day); DKD has been viewed as a diabetic complication-relatedmicrovascular disorder in a renal manifestation.[23] In kidney biopsy, DKD is characterized by glomerular and tubular basement thickening,mesangial expansion,glomerulosclerosis,podocyteeffacement and nephron loss.[24] DKD occurs in 30%-50% of the diabetic patient population and leads to kidney failures in up to 20% of thetype 1 diabetic patients.[24] However, a substantial portion of DKD patients do not manifestalbuminuria.[24] DKD pathogenesis is attributed to the dysregulated glucose transport at a higher glucose level and the excessive influx of intracellular glucose into endothelial cells.[23] The elevated glucose level is sustained, along with multiple metabolic phenotypes such as excessfatty acids andoxidative stress, as well asshear stresses induced byhypertension andhyperfusion, and can lead to microvascularrarefaction, hypoxia, andmaladaptation in glomerularneoangiogenesis.[23]
USF1, as an insulin-sensitive transcription factor that becomes active in response to a high glucose level, promotes the transactivation of genes involved in lipid metabolism, includinghepatic lipase (LIPC),hepatocyte nuclear factor 4 alpha (HNF4A),Apolipoprotein AI (APOA1),Apolipoprotein L1 (APOL1) andHaptoglobin-related protein (HPR).[25] Especially, APOL1 is known to complex with APOA-I and HDL to facilitate cellautophagy in response to injuries and to prevent glomerular diseases; however, an APOL1 risk variant specific to podocyte inhibits cell autophagy and can trigger kidney disease.[25]
Cancer cells exhibit a set of phenotypes, including a highlighted increase inaerobic glycolysis, lactic acid production (known asthe Warburg effect), elevated protein and DNA synthesis, and increasedde novo or endogenousfatty acid synthesis byfatty acid synthase (FAS).[20] FAS primarily synthesizespalmitate from malonyl-CoA, which is further esterified totriglycerides for energy storage. Normally, the fatty acid synthase gene (FASN) is active during embryogenesis and in fetal lungs for lubricant production; however, it is physiologically low-expressed in non-cancerous adult cells. In contrast, abnormal FASN overexpression is detected in multiple cancer types:breast,colorectal,prostate,pancreatic, andovarian.[26] FASN-mediatedde novo lipid synthesis accounts for more than 93% oftriglycerides in tumor cells.[20] Specifically, tumor cells prefer glycolysis over oxidation for energy consumption and redirect the glycolytic products towardsde novo fatty acid synthesis to supply lipids for membrane production andprotein lipidation for fast cell proliferation.[20] For example, thePI3K-AKT pathway is found to increase inLNCaP prostate cancer cells to stimulate FASN overexpression. Concurrently, fatty acid synthase overexpression is also post-translationally sustained byUSP2a-mediated ubiquitination reduction, stabilizing FAS for constitutive signal transduction.[20] In addition tode novo lipogenesis, FAS promotes the localization of VEGFR-2 to the lipid raft of the endothelial cell membrane and thus enhances angiogenesis in tumor development.[26] Meanwhile, mutual activation between FAS and ERBB2 (HER2) signaling also potentiates tumorigenesis, in which ERBB2 amplification is associated with elevated survival and proliferation of cancer cells and poor prognosis in breast and gastric cancers; an ERBB2 increase, especially, contributes to 18-25% of breast cancers.[27] In prostate cancer cells and promyelocytic leukemia cells, USF1 activation also attains a high-levelof PAI-1 expression and inhibits spontaneous orcamptothecin-induced apoptosis.[13]
MDM2 mediates p53 poly-ubiquitination and proteasomal degradation
Decreased USF1-p53 interaction and increased p53 instability
The poor prognosis of gastric cancers is associated with low expression of USF1 and p53.[28] Among gastric cancer patients, 88% of the patients are diagnosed withH. pylori infection, and half of the patients show lower USF1 expression in tumor tissues. Mechanistically,H. pylori induces DNA hypermethylation in the promoter regions of USF1 and USF2 and inhibits expression. Decreased expression reduces the interaction between USF1 and p53 when DNA damage occurs, rendering p53 to associate more frequently with the E3-ubiquitin ligase HDM2 (also known asMDM2) and increasing p53 instability in cancer cells.[28]
Familial combinedhyperlipidemia (FCHL) was first used to describe lipid abnormalities in 47 Seattle pedigree-containing members with hypercholesterolemia and hypertriglyceridemia.[29] The core FCHL lipid profiles feature high serum cholesterol/triglyceride,apolipoprotein B (APOB), and LDL levels. Genetic evidence has suggested a FCHL-related locus on the human chromosome 1q21-q23, which is linked to metabolic syndromes.[30] Fine-mapping of those linked regions identifies USF1 as the first positionally cloned gene for FCHL and a target for FCHL treatment. In addition,hepatocyte nuclear factor 4 alpha (HNF4A) is also implicated in high lipid levels and metabolic syndromes. Cooperative effects of USF1 and HNF4A have been shown to regulate the expression ofapolipoprotein A-II (APOA2) andapolipoprotein C-III (APOC3).[30] Mutations in USF1, HNF4A, and apolipoproteins also increase patients' susceptibility to FCHL.[30] Additional genes subjected to USF1 regulation and involved in glucose/lipid metabolism include apolipoprotein A5 (APOA5),apolipoprotein E (APOE),hormone-sensitive lipase (LIPE),hepatic lipase (LIPC),glucokinase (GCK), islet-specific glucose-6-phosphatase catalytic-subunit-related protein (IGRP), insulin,glucagon receptor (GCGR) and ATP-binding cassette transporter A1 (ABCA1).[30]
^"Human PubMed Reference:".National Center for Biotechnology Information, U.S. National Library of Medicine.
^"Mouse PubMed Reference:".National Center for Biotechnology Information, U.S. National Library of Medicine.
^Shieh BH, Sparkes RS,Gaynor RB, Lusis AJ (April 1993). "Localization of the gene-encoding upstream stimulatory factor (USF) to human chromosome 1q22-q23".Genomics.16 (1):266–268.doi:10.1006/geno.1993.1174.PMID8486371.
^abcdefMenendez JA, Lupu R (October 2007). "Fatty acid synthase and the lipogenic phenotype in cancer pathogenesis".Nature Reviews. Cancer.7 (10):763–777.doi:10.1038/nrc2222.PMID17882277.S2CID205468233.
^Roy AL, Meisterernst M, Pognonec P, Roeder RG (November 1991). "Cooperative interaction of an initiator-binding transcription initiation factor and the helix-loop-helix activator USF".Nature.354 (6350):245–248.Bibcode:1991Natur.354..245R.doi:10.1038/354245a0.PMID1961251.S2CID4260885.
Roy AL, Meisterernst M, Pognonec P, Roeder RG (November 1991). "Cooperative interaction of an initiator-binding transcription initiation factor and the helix-loop-helix activator USF".Nature.354 (6350):245–248.Bibcode:1991Natur.354..245R.doi:10.1038/354245a0.PMID1961251.S2CID4260885.
Henrion AA, Martinez A, Mattei MG, Kahn A, Raymondjean M (January 1995). "Structure, sequence, and chromosomal location of the gene for USF2 transcription factors in mouse".Genomics.25 (1):36–43.doi:10.1016/0888-7543(95)80107-W.PMID7774954.
Ghosh AK, Datta PK, Jacob ST (February 1997). "The dual role of helix-loop--helix-zipper protein USF in ribosomal RNA gene transcription in vivo".Oncogene.14 (5):589–594.doi:10.1038/sj.onc.1200866.PMID9053857.S2CID23764497.
Villavicencio EH, Yoon JW, Frank DJ, Füchtbauer EM, Walterhouse DO, Iannaccone PM (April 2002). "Cooperative E-box regulation of human GLI1 by TWIST and USF".Genesis.32 (4):247–258.doi:10.1002/gene.10078.PMID11948912.S2CID12132097.
Pickwell GV, Shih H, Quattrochi LC (April 2003). "Interaction of upstream stimulatory factor proteins with an E-box located within the human CYP1A2 5'-flanking gene contributes to basal transcriptional gene activation".Biochemical Pharmacology.65 (7):1087–1096.doi:10.1016/S0006-2952(03)00037-6.PMID12663044.