Alternative splicing (AS
In 1941, Beadle and Tatum provided data that led to the “one gene, one protein” paradigm (
While the removal of introns is common to all eukaryotic life forms, the percentage of genes that undergo splicing varies by species. Generally, the abundance of AS events is proportional to organismal complexity, with AS being most prevalent in more complex organisms such as vertebrates (
There are several types of AS that can take place within a given transcript, including exon skipping (ES), alternative donor sites (AD), alternative acceptor sites (AA), intron retention (IR), and the rare complex event known as mutually exclusive (ME) exon splicing. Other rare complex events, such as the skipping of multiple exons in a row, also occur (
Splicing occurs by two consecutive transesterification reactions (
Most introns contain conventional consensus sequences that delineate the primary splice sites (
In addition to the primary determinants discussed above, there are other sequence signals outside of the conventional splice sites that regulate splicing. For instance, splicing enhancers and suppressors either promote or reduce the incidence of splicing at a particular site (
With the currently available understanding about splice site consensus sequences and other signals that affect splicing, one might expect there to be robust computational approaches that would accurately predict exon/intron boundaries. Indeed this is the goal of several online splice site locator and gene structure prediction tools (
Alternative splicing provides an important mechanism for higher eukaryotes to both increase genetic diversity and regulate protein activity. Splicing variation affects the structure, function, and expression levels of protein products. Exploring alterations in physiology caused by AS is important to better understand the ramifications of misregulated AS and, ultimately, the role of AS in disease processes (
MPs play key roles in numerous cellular functions including cell signaling, transport, energy transduction, as well as in the organization and stabilization of cellular structure. MPs differ from soluble proteins in terms of native environment, folding topology, and organization of domains and subunits. It is well established that RNA splicing can have a wide range of effects on soluble protein expression, localization, folding, structure, and function (
The number and arrangement of transmembrane (TM) segments in a MP defines the orientation of a protein and is often closely integrated with function and stability. Alternative or aberrant splicing can disrupt TM segments when splice sites are found within a TM-encoding region of the transcript. Likewise, it is possible that removal of an entire TM segment via splicing can significantly alter the topology of a MP or even eliminate membrane anchoring altogether. In this section, we review placement of splice junction locations within MP gene structure and examine examples where splicing significantly alters MP topology. We also briefly discuss the impact of these AS events on MP function.
The aberrant introduction of splice site junctions within TM segments would likely lead to serious disruption of the encoded membrane protein structure. Indeed, computational studies assessing the likelihood of an intronic division within a TM region have found that that the probability of a TM not being divided by an intron was slightly higher than the expected probability for a random 22-mer amino acid sequence (
Rh blood group antigens are important in blood-typing for blood transfusion and organ transplantation. These antigens are typically predicted to have 12 TM segments (
Another example of altered TM topology is found in the V2 vasopressin receptor, a G-protein coupled receptor (GPCR). Its two isoforms, V2a and V2b, were found to have different topological states and different levels of stability (
A third protein whose TM topology is affected by alternative splicing is the glycine receptor (
Signal peptides are important for membrane targeting and determination of protein subcellular localization and are often altered as a result of AS. In a computational analysis of the mouse transcriptome, 40% of transcriptional units (sets of transcripts derived from the same gene) were shown to have signal peptide variation, with the majority of transcript variation being the consequence of AS (
Signal peptide removal by AS has been documented in the parathyroid hormone (PTH)/PTH-related peptide (PTHrP) receptor (
In addition to altered TM topology, alternative splicing can eliminate TM domains from transcripts that would otherwise encode MPs. This results in soluble protein isoforms that may have vastly different roles in (or out) of the cell. A survey (
Another example of production of a soluble isoform from a MP transcript is cadherin-7, a cell-adhesion TM glycoprotein. In developing chickens, it was found that a soluble form of cadherin-7 was expressed (
Ions cannot cross the hydrophobic lipid bilayer via diffusion. Cells therefore express numerous types of channels, both to regulate cytosolic ion content and, in the case of specialized cells such as neurons and muscle cells, for signaling purposes. Ion channel activity can be regulated by AS. For example, transient receptor potential (TRP) potassium channels are heavily regulated by AS (
The TRPM3 channel is alternatively spliced to produce different sequences within the presumed pore domain, generating different cation selectivities. For instance, TRPM3α1 and TRPM3α2 are both outwardly rectifying cation channel isoforms, meaning they allow higher current flux out of the cell than in, whereas other TRPM3 variants are inwardly rectifying. TRPM3α2 is highly permeable to divalent cations while TRPM3α1 favors monovalent conductance. Additionally, monovalent cations block currents through TRPM3α2 but not TRPM3α1. These TRPM3 variants clearly demonstrate a role for AS in regulating channel ion permeability, selectivity, and regulation (
Transporters facilitate the passage or flip-flop of solutes across the membrane, often in an energy-consuming manner against the TM concentration gradient (active transport). Transporter activity is typically tightly regulated, with AS being one mechanism of regulation.
An important family of regulated transporters are the P-type ATPases. This family includes the plasma membrane calcium pump, which is present in all eukaryotic cells and helps to maintain appropriate Ca2+ levels within the cytosol by moving calcium ions from the cytosol to the extracellular matrix. The calcium pump is alternatively spliced such that its calmodulin-binding regulatory domain is altered to reduce its net positive charge, resulting in a lower calmodulin affinity (
Another example is the glutamate transporter family. After release of neurotransmitter into the synaptic cleft, transporters are responsible for transmitter reuptake by the cell and regulation of the volume of these vital molecules at the synapse. Levels of glutamate, the major excitatory neurotransmitter in the brain, are regulated by glutamate transporters, the most prominent of which is GLT1 (EEAT2 in humans). The three alternatively spliced isoforms of this protein (GLT1a, GLT1b, GLT1c) differ in their C-termini (
G protein-coupled receptors (GPCRs) are an essential subset of MP, the target of almost 40% of the drugs on the market today (
In the case of the cannabinoid receptor hCB1, two splice variants have been identified (hCB1a and hCB1b) in addition to the canonical receptor. These variants have altered amino termini; hCB1a has an altered N-terminal sequence and hCB1b is missing the first 33 N-terminal amino acid residues. In both cases AS occurs through use of different splice site donor/acceptor sites. hCB1a and hCB1b display altered pharmacological properties in relation to the canonical hCB1 receptor (
TM enzymes are also affected by AS. Carbonic anhydrase XII (CA XII) is an enzyme whose alternative splicing is associated with cancer (
The proteolipid protein (PLP) is the most abundant protein of central nervous system (CNS) myelin and is thought to maintain structural integrity of myelin and support myelin compaction, although it may have additional roles in signaling (
Differences in alternatively spliced isoforms can impact the ability of the protein product to interact with partners. As discussed above, the GLT1 transporter isoforms differ in their possession of a PDZ-binding motif, a motif likely used to bind protein partners containing PDZ-binding domains. Another example is the N-methyl-D-asparate (NMDA) receptor. Yotiao, an A-kinase anchoring protein, scaffolds to the NR1A splice variant of the NR1 subunit of the NMDA receptor but not to the NR1C splice variant (
Through tissue-specific isoform expression, AS can alter the function of the protein product of a gene to better suit the requirements of its environment. For example, TMEM16A is thought to function as a calcium-dependent chloride channel (CaCC) with eight proposed TM segments. Recent studies of the patterns of alternative splicing that produce distinct isoforms of this protein show that differential exclusion/inclusion of exons 6b, 13, and 15 in the mRNA transcript occurs in different organs (
AS can also involve non-coding regions of the mRNA. For instance, peripheral myelin protein 22 (PMP22), a protein involved in the inherited peripheral neuropathy Charcot-Marie-Tooth Disease, is spliced in a tissue-specific manner. The first exon in the 5' untranslated region (UTR) is alternatively spliced to include exon 1a or exon 1b (
Trafficking sequence motifs play key roles in directing the trafficking and subcellular distribution of proteins within the cell. AS can result in the removal or retention of these signals to directly affect protein trafficking and organelle sub-localization. For instance, there are two sets of
MP folding is still poorly understood in comparison to soluble protein folding and the evidence for the impact of alternative splicing on MP stability, folding pathways, and interactions with chaperones is currently sparse. Some AS events do result in the retention of specific variants (but not others) in the ER, which may indicate that AS sometimes results in changes in protein structure that are perceived by the protein folding quality control as signatures of misfolding. For instance, the V2b variant of the vasopressin receptor (
Alternative splicing of soluble proteins has long been documented as a contributing factor to disease (
Myotonic Dystrophy Type 1 (DM1) is an inherited multisystem progressive disorder affecting 1 in 8000 people. DM1 is characterized by myotonia, heart defects, cataracts, gastrointestinal defects, insulin resistance, muscle wasting, and neuropsychiatric disorders (
Aberrant AS of the insulin receptor (InsR) results in the insulin resistance phenotype observed in DM1 patients. The insulin receptor functions as a heterotetramer with two α- and two β-subunits. In DM1, there is a switch from the normal production in skeletal muscle of the IR-B splice variant of the α subunit, which includes a 12 residue C-terminus, to production of the IR-A variant lacking this terminus (
One of the most common symptoms of DM-1 is myotonia, a delay in the relaxation of muscle after contraction. Hyperexcitability is likely caused by loss of function of chloride channels. It is thought that this is due to aberrant alternative splicing of the muscle-specific chloride channel (ClC-1), which results in decreased levels of the channel (
Perhaps the most debilitating symptom of DM-1 is muscle wasting and weakness. It has been shown in other disorders that this symptom can result from changes to calcium homeostasis in muscle cells that increase intracellular Ca2+ levels (
RyR1 has 2 variants seen in DM1: the neonatal form of RyR1 lacks exon 70 (residues 3481–3485), which corresponds to five residues found in the receptor modulatory region, (
Two SERCA genes, SERCA1 and SERCA2 are also implicated in DM-1. SERCA1 has two splice variants: SERCA1a is the adult version of the protein and contains exon 22, which encodes 7 amino acids at the C-terminus. The neonatal form, SERCA1b lacks exon 22, and its C-terminus has an eight-residue, highly-charged C-terminal tail. In DM-1, a switch occurs, where the major isoform produced is the neonatal form (SERCA1b) as opposed to the adult form. It was also found that SERCA2 has a novel variant (SERCA2d) with lowered expression in DM-1. SERCA2d contains intron 19, which results in the addition of 27 amino acids, a frameshift, and a premature stop codon in exon 20. The aberrantly expressed SERCA isoforms both have an altered C-terminus, but how this impacts overall pump function is not known (
The voltage-dependent Cav1.1 calcium channel plays a central and important role in excitation/contraction coupling in skeletal muscle (
Alzheimer’s Disease (AlzD) is a progressive neurodegenerative disorder affecting over 5 million people in the United States alone (
Three major splice isoforms of APP have been documented: APP695 (lacking exons 7 and 8; the predominant neuronal isoform), APP751 (lacking exon 8 and expressed abundantly in non-neuronal CNS tissue and peripheral tissue), and APP770 (containing both exons 7 and 8 and expressed predominantly in peripheral tissues and at only low levels in the CNS) (
BACE1 is a single span MP with its catalytic subunit located in the ectoplasm. BACE1 is a prominent target in the as-of-yet unsuccessful search for a drug to lower amyloid-β production. A recent series of publications (
PS is the aspartyl protease in the heterotetrameric γ-secretase complex responsible for intramembrane cleavage of a large number of different single span MP substrates (
Inherited mutations of presenilin-1 or presenilin-2 (PS-1 and PS-2) are one of the known causes of familial (early onset) AlzD. Some of these mutations appear to impact pre-mRNA splicing. Additionally, aberrant splicing of the presenilin transcript has been shown to occur in the more common sporadic AlzD.
It has been shown that presenilin-1 (PS1) is affected by multiple splicing events, and that some heritable PS-1 mutations modulate splicing. Here, we will discuss three instances where splicing may play a role in disease progression.
First, in human tissues, two isoforms of PS1 are expressed that differ only by the inclusion or exclusion of four amino acids, VRSQ, in the N-terminal domain (
Aberrant splicing of PS1 can also be induced by mutation. For instance, a mutation within intron 4 was identified in several patients with early-onset AlzD (
Another interesting aberrant splicing event occurs when a mutation destroys a splice acceptor site at exon 9 (
For presenilin-2 (PS2), an aberrant splicing event in sporadic AlzD, resulting in exon 5 skipping (PS2V) and protein truncation, has been extensively characterized (
Recently, a novel splice variant of PS2, termed PS2β, was described to be a γ-secretase inhibitor (
Another γ-secretase subunit, anterior pharynx defective-1 (APH-1) is a seven-TM spanning protein encoded by two homologous human genes, either APH-1a or APH-1b (
Nicastrin is a highly glycosylated type I MP that participates, along with APH-1, in the assembly of the γ-secretase complex. Two independent genome surveys (
In 2005, a novel skipped splice variant of nicastrin was reported to be expressed in rat tissues and a human neuroblastoma cell line (
Soon thereafter, a second novel nicastrin splice variant was identified, this time lacking the entirety of exon 16 (
The serotonin (5-hydroxytryptamine) receptor 5-HT4, a GPCR, has been implicated in the regulation of APP processing and receptor agonists have been shown to increase the production of the soluble APP ectodomain released by α-secretase cleavage of full length APP to initiate the non-amyloidogenic pathway (
Alternative splicing of pre-mRNA can impact membrane protein trafficking, structure, function, and ultimately, contribute to disease. The examples highlighted in this review represent only a sample from what is already a very large pool of literature on this subject. Given that studies of RNA splicing remain in a rapid state of development, these examples represent only modest beginnings. There are undoubtedly new discoveries waiting to be made. It is already clear that splicing can alter almost every aspect of a MP’s structure, topology, function, trafficking, and interactions. When studying MPs from eukaryotes, the possibility of different isoforms with varying properties should not be overlooked.
This work was supported by US NIH grants U54 GM94608, PO1 GM080512, and RO1 NS0508815 (to CRS) and by 1DP2OD004483 to MDO. KFM was supported by an NSF Graduate Research Fellowship (Grant DGE0909667).
Abbreviations: AA, alternative acceptor sites; Aβ, amyloid beta; AD, alternative donor sites; AICD, amyloid precursor protein intracellular domain; AlzD, Alzheimer’s disease; APH-1, anterior phalanx defective-1 protein; APP, amyloid precursor protein; AS, alternative splicing; BACE1, β-site APP cleavage enzyme 1; CA, carbonic anhydrase; ClC1, muscle-specific chloride channel; CNS, central nervous system; CTF, presenilin C-terminal fragment; D2, dopamine receptor 2; DM1, myotonic dystrophy type 1; DMPK, myotonic dystrophy protein kinase; DNA, deoxyribonucleic acid; ER, endoplasmic reticulum; ES, exon skipping; GLT1, glutamate transporter 1; GlyR, glycine receptor; GPCR, G protein-coupled receptor; 5-HT4, serotonin (5-hydroxytryptamine) receptor; IR, intron retention; InsR, insulin receptor; KPI, Kunitz protease inhibitor domain; mRNA, messenger ribonucleic acid; ME, mutually exclusive exon splicing; MP, integral membrane protein; mRNA, messenger ribonucleic acid; MNBL, muscleblind protein; NMD, nonsense-mediated decay; NMDA, N-methyl-D-aspartate; NTC, nineteen complex; NTF, presenilin N-terminal fragment; PEN-2, presenilin ehancer 2 protein; PKC, protein kinase C; PLP, proteolipid protein; PMP22, peripheral myelin protein 22; PS, presenilin; Rh, Rh blood group antigen protein; RNA, ribonucleic acid; RyR1, ryanodine receptor; SDC-4, syndecan receptor 4; SERCA, sarcoplasmic endoplasmic reticulum Ca2+ ATPase; snRNA, small nuclear ribonucleic acid; TM, transmembrane; TRP, transient receptor potential; V2R, vasopressin V2 receptor.
We thank Nicholas Reiter for his helpful comments and suggestions and Wade Van Horn for his help with figures.
From top, clockwise: U1 snRNP associates with the 5' splice site to make complex E. U2 snRNP is recruited and brings the branch site near the 5' splice site, forming complex A. The U4/U6.U5 tri-snRNP is recruited to form a pre-catalytic complex B. The arrival of the nineteen complex (NTC) and the release of U1 and U4 snRNAs mark the conversion to complex B*, an active spliceosome. Complex C forms after the first transesterification reaction and carries out the second transesterification reaction. The mature message is released and the post-splicing complex is recycled.
Mutation of the DMPK gene to include CUG repeats sequesters the MNBL1 protein and stimulates hyperphosphorylation of the CUG-BP through a PKC-mediated mechanism. These changes result in dysregulated splicing of downstream targets of MNBL1 and CUG-BP.