Puneet Varma (Editor)

Serpin

Serpin
Symbol: Serpin, SERPIN (root symbol of family)
Pfam: PF00079
InterPro: IPR000215
PROSITE: PDOC00256
SCOP: 1hle
SUPERFAMILY: 1hle

Serpins are a superfamily of proteins with similar structures that were first identified for their protease inhibition activity and are found in all kingdoms of life. The acronym serpin was originally coined because the first serpins to be identified act on chymotrypsin-like serine proteases (serine protease inhibitors). They are notable for their unusual mechanism of action, in which they irreversibly inhibit their target protease by undergoing a large conformational change to disrupt its active site. This contrasts with the more common competitive mechanism for protease inhibitors that bind to and block access to the protease active site.

Protease inhibition by serpins controls an array of biological processes, including coagulation and inflammation, and consequently these proteins are the target of medical research. Their unique conformational change also makes them of interest to the structural biology and protein folding research communities. The conformational-change mechanism confers certain advantages, but it also has drawbacks: serpins are vulnerable to mutations that cause protein misfolding and the formation of inactive long-chain polymers, resulting in diseases termed serpinopathies. Serpin polymerisation not only reduces the amount of active inhibitor, but also leads to accumulation of the polymers, causing cell death and organ failure.

Although most serpins control proteolytic cascades, some proteins with a serpin structure are not enzyme inhibitors, but instead perform diverse functions such as storage (ovalbumin in egg white), transport (the hormone carriage proteins thyroxine-binding globulin and cortisol-binding globulin) and molecular chaperoning (HSP47). The term serpin is used to describe these members as well, despite their non-inhibitory function, since they are evolutionarily related.

History

Protease inhibitory activity in blood plasma was first reported in the late 1800s, but it was not until the 1950s that the serpins antithrombin and alpha 1-antitrypsin were isolated. Initial research focused on their role in human disease: alpha 1-antitrypsin deficiency is one of the most common genetic disorders, causing emphysema, and antithrombin deficiency results in thrombosis.

In the 1980s, it became clear that these inhibitors were part of a superfamily of related proteins that included both protease inhibitors (e.g. alpha 1-antitrypsin) and non-inhibitory members (e.g. ovalbumin). The name "serpin" was coined based on the most common activity of the superfamily (serine protease inhibitors). Around the same time, the first structures were solved for serpin proteins (first in the relaxed, and later in the stressed conformation). The structures indicated that the inhibitory mechanism involved an unusual conformational change and prompted the subsequent structural focus of serpin studies.

Over 1000 serpins have now been identified, including 36 human proteins, as well as molecules in all kingdoms of life (animals, plants, fungi, bacteria, and archaea) and some viruses. Serpins are therefore the largest and most diverse superfamily of protease inhibitors. In the 2000s, a systematic nomenclature was introduced to categorise members of the serpin superfamily based on their evolutionary relationships.

Activity

Most serpins are protease inhibitors, targeting extracellular, chymotrypsin-like serine proteases. These proteases possess a nucleophilic serine residue in a catalytic triad in their active site. Examples include thrombin, trypsin, and human neutrophil elastase. Serpins act as irreversible, suicide inhibitors by trapping an intermediate of the protease's catalytic mechanism.

Some serpins inhibit other protease classes, typically cysteine proteases, and are termed "cross-class inhibitors". These enzymes differ from serine proteases in that they use a nucleophilic cysteine residue, rather than a serine, in their active site. Nonetheless, the enzymatic chemistry is similar, and the mechanism of inhibition by serpins is the same for both classes of protease. Examples of cross-class inhibitory serpins include serpin B3, also known as squamous cell carcinoma antigen 1 (SCCA-1), and the avian serpin myeloid and erythroid nuclear termination stage-specific protein (MENT), which both inhibit papain-like cysteine proteases.

Protease inhibition

Approximately two-thirds of human serpins perform extracellular roles, inhibiting proteases in the bloodstream in order to modulate their activities. For example, extracellular serpins regulate the proteolytic cascades central to blood clotting (antithrombin), the inflammatory and immune responses (antitrypsin, antichymotrypsin, and C1-inhibitor) and tissue remodelling (PAI-1). By inhibiting signalling cascade proteases, they can also affect development. The table of human serpins (below) provides examples of the range of functions performed by human serpins, as well as some of the diseases that result from serpin deficiency.

The protease targets of intracellular inhibitory serpins have been difficult to identify, since many of these molecules appear to perform overlapping roles. Further, many human serpins lack precise functional equivalents in model organisms such as the mouse. Nevertheless, an important function of intracellular serpins may be to protect against the inappropriate activity of proteases inside the cell. For example, one of the best-characterised human intracellular serpins is Serpin B9, which inhibits the cytotoxic granule protease granzyme B. In doing so, Serpin B9 may protect against inadvertent release of granzyme B and premature or unwanted activation of cell death pathways.

Some viruses use serpins to disrupt protease functions in their host. The cowpox viral serpin CrmA (cytokine response modifier A) is used to avoid the inflammatory and apoptotic responses of infected host cells. CrmA increases infectivity by suppressing its host's inflammatory response through inhibition of IL-1 and IL-18 processing by the cysteine protease caspase-1. In eukaryotes, a plant serpin inhibits both metacaspases and a papain-like cysteine protease.

Non-inhibitory roles

Non-inhibitory extracellular serpins also perform a wide array of important roles. Thyroxine-binding globulin and transcortin transport the hormones thyroxine and cortisol, respectively. The non-inhibitory serpin ovalbumin is the most abundant protein in egg white. Its exact function is unknown, but it is thought to be a storage protein for the developing foetus. Heat shock protein 47 (HSP47), itself a serpin, is a chaperone essential for proper folding of collagen. It acts by stabilising collagen's triple helix whilst it is being processed in the endoplasmic reticulum.

Some serpins both inhibit proteases and perform additional roles. For example, the nuclear cysteine protease inhibitor MENT also acts as a chromatin remodelling molecule in the red blood cells of birds.

Structure

All serpins share a common structure (or fold), despite their varied functions. All typically have three β-sheets (named A, B and C) and eight or nine α-helices (named hA–hI). The most significant regions for serpin function are the A-sheet and the reactive centre loop (RCL). The A-sheet includes two β-strands that are in a parallel orientation, with a region between them called the 'shutter' and an upper region called the 'breach'. The RCL forms the initial interaction with the target protease in inhibitory molecules. Structures have been solved showing the RCL either fully exposed or partially inserted into the A-sheet, and serpins are thought to be in dynamic equilibrium between these two states. The RCL also makes only temporary interactions with the rest of the structure, and is therefore highly flexible and exposed to the solvent.

The serpin structures that have been determined cover several different conformations, which has been necessary for the understanding of their multiple-step mechanism of action. Structural biology has therefore played a central role in the understanding of serpin function and biology.

Conformational change and inhibitory mechanism

Inhibitory serpins do not inhibit their target proteases by the typical competitive (lock-and-key) mechanism used by most small protease inhibitors (e.g. Kunitz-type inhibitors). Instead, serpins use an unusual conformational change, which disrupts the structure of the protease and prevents it from completing catalysis. The conformational change involves the RCL moving to the opposite end of the protein and inserting into β-sheet A, forming an extra antiparallel β-strand. This converts the serpin from a stressed state to a lower-energy relaxed state (S to R transition).

Serine and cysteine proteases catalyse peptide bond cleavage by a two-step process. Initially, the catalytic residue of the active site triad performs a nucleophilic attack on the peptide bond of the substrate. This releases the new N-terminus and forms a covalent ester bond between the enzyme and the substrate. This covalent complex between enzyme and substrate is called an acyl-enzyme intermediate. For standard substrates, the ester bond is hydrolysed and the new C-terminus is released to complete catalysis. However, when a serpin is cleaved by a protease, it rapidly undergoes the S to R transition before the acyl-enzyme intermediate is hydrolysed. The efficiency of inhibition depends on the fact that the conformational change is several orders of magnitude faster than hydrolysis by the protease.

Since the RCL is still covalently attached to the protease via the ester bond, the S to R transition pulls the protease from the top to the bottom of the serpin and distorts the catalytic triad. The distorted protease can only hydrolyse the acyl-enzyme intermediate extremely slowly, and so the protease remains covalently attached for days to weeks. Serpins are classed as irreversible inhibitors and as suicide inhibitors since each serpin protein permanently inactivates a single protease, and can only function once.
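The kinetic competition described above can be illustrated with a minimal sketch. It assumes the simplest possible model, in which the acyl-enzyme intermediate has only two fates, RCL insertion (the protease is trapped) or ester hydrolysis (the protease escapes and the serpin is wasted), each treated as a first-order process. The function name and the rate constants are hypothetical and are not measured values for any serpin. The ratio of serpin consumed to protease trapped that this model yields is commonly reported as the stoichiometry of inhibition.

```python
from typing import Tuple


def partition(k_insert: float, k_hydrolysis: float) -> Tuple[float, float]:
    """Split acyl-enzyme intermediates between two competing fates.

    k_insert     -- assumed first-order rate of RCL insertion (S to R transition)
    k_hydrolysis -- assumed first-order rate of ester-bond hydrolysis
    Both are hypothetical inputs in the same units (for example per second).

    Returns (fraction of intermediates that end as a trapped complex,
             serpin molecules consumed per protease trapped).
    """
    if k_insert <= 0 or k_hydrolysis < 0:
        raise ValueError("k_insert must be > 0 and k_hydrolysis must be >= 0")
    total = k_insert + k_hydrolysis
    fraction_inhibited = k_insert / total
    serpins_per_protease = total / k_insert
    return fraction_inhibited, serpins_per_protease


if __name__ == "__main__":
    # Illustrative ratios only: insertion 1x, 10x and 1000x faster than hydrolysis.
    for ratio in (1.0, 10.0, 1000.0):
        frac, per_protease = partition(k_insert=ratio, k_hydrolysis=1.0)
        print(f"insertion/hydrolysis = {ratio:g}: "
              f"trapped fraction = {frac:.4f}, "
              f"serpins per trapped protease = {per_protease:.3f}")
```

Under these assumptions, equal rates waste half of the serpin molecules, whereas a three orders of magnitude advantage for insertion means almost every cleavage event ends in a trapped protease.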

Allosteric activation

The conformational mobility of serpins provides a key advantage over static lock-and-key protease inhibitors. In particular, the function of inhibitory serpins can be regulated by allosteric interactions with specific cofactors. The X-ray crystal structures of antithrombin, heparin cofactor II, MENT and murine antichymotrypsin reveal that these serpins adopt a conformation wherein the first two amino acids of the RCL are inserted into the top of the A β-sheet. The partially inserted conformation is important because cofactors are able to conformationally switch certain partially inserted serpins into a fully expelled form. This conformational rearrangement makes the serpin a more effective inhibitor.

The archetypal example of this situation is antithrombin, which circulates in plasma in a partially inserted, relatively inactive state. The primary specificity-determining residue (the P1 arginine) points toward the body of the serpin and is unavailable to the protease. Upon binding a high-affinity pentasaccharide sequence within long-chain heparin, antithrombin undergoes a conformational change, RCL expulsion, and exposure of the P1 arginine. The heparin pentasaccharide-bound form of antithrombin is thus a more effective inhibitor of thrombin and factor Xa. Furthermore, both of these coagulation proteases also contain binding sites (called exosites) for heparin. Heparin therefore also acts as a template for binding of both protease and serpin, further dramatically accelerating the interaction between the two parties. After the initial interaction, the final serpin complex is formed and the heparin moiety is released. This interaction is physiologically important. For example, after injury to the blood vessel wall, heparin is exposed, and antithrombin is activated to control the clotting response. Understanding of the molecular basis of this interaction enabled the development of fondaparinux, a synthetic form of the heparin pentasaccharide used as an anti-clotting drug.

Latent conformation

Certain serpins spontaneously undergo the S to R transition without having been cleaved by a protease, to form a conformation termed the latent state. Latent serpins are unable to interact with proteases and so are no longer protease inhibitors. The conformational change to latency is not exactly the same as the S to R transition of a cleaved serpin. Since the RCL is still intact, the first strand of the C-sheet has to peel off to allow full RCL insertion.

Regulation of the latency transition can act as a control mechanism in some serpins, such as PAI-1. Although PAI-1 is produced in the inhibitory S conformation, it "auto-inactivates" by changing to the latent state unless it is bound to the cofactor vitronectin. Similarly, antithrombin can also spontaneously convert to the latent state, as an additional modulation mechanism to its allosteric activation by heparin. Finally, the N-terminus of tengpin, a serpin from Thermoanaerobacter tengcongensis, is required to lock the molecule in the native inhibitory state. Disruption of interactions made by the N-terminal region results in spontaneous conformational change of this serpin to the latent conformation.

Conformational change in non-inhibitory functions

Certain non-inhibitory serpins also use the serpin conformational change as part of their function. For example, the native (S) form of thyroxine-binding globulin has high affinity for thyroxine, whereas the cleaved (R) form has low affinity. Similarly, transcortin has higher affinity for cortisol in its native (S) state than in its cleaved (R) state. Thus, in these serpins, RCL cleavage and the S to R transition have been commandeered to allow for ligand release, rather than protease inhibition.

In some serpins, the S to R transition can activate cell signalling events. In these cases, a serpin that has formed a complex with its target protease is then recognised by a receptor. The binding event then leads to downstream signalling by the receptor. The S to R transition is therefore used to alert cells to the presence of protease activity. This differs from the usual mechanism whereby serpins affect signalling simply by inhibiting proteases involved in a signalling cascade.

Degradation

When a serpin inhibits a target protease, it forms a permanent complex, which needs to be disposed of. For extracellular serpins, the final serpin-enzyme complexes are rapidly cleared from circulation. One mechanism by which this occurs in mammals is via the low-density lipoprotein receptor-related protein (LRP), which binds to inhibitory complexes made by antithrombin, PAI-1, and neuroserpin, causing cellular uptake. Similarly, the Drosophila serpin, necrotic, is degraded in the lysosome after being trafficked into the cell by the Lipophorin Receptor-1 (homologous to the mammalian LDL receptor family).

Disease and serpinopathies

Serpins are involved in a wide array of physiological functions, and so mutations in genes encoding them can cause a range of diseases. Mutations that change the activity, specificity or aggregation properties of serpins all affect how they function. The majority of serpin-related diseases are the result of serpin polymerisation into aggregates, though several other types of disease-linked mutations also occur. The disorder α1-antitrypsin deficiency is one of the most common hereditary diseases.

Inactivity or absence

Since the stressed serpin fold is high-energy, mutations can cause serpins to change incorrectly into their lower-energy conformations (e.g. relaxed or latent) before they have performed their inhibitory role.

Mutations that affect the rate or the extent of RCL insertion into the A-sheet can cause the serpin to undergo its S to R conformational change before having engaged a protease. Since a serpin can only make this conformational change once, the resulting misfired serpin is inactive and unable to properly control its target protease. Similarly, mutations that promote inappropriate transition to the monomeric latent state cause disease by reducing the amount of active inhibitory serpin. For example, the disease-linked antithrombin variants wibble and wobble both promote formation of the latent state.

The structure of the disease-linked mutant of antichymotrypsin (L55P) revealed another, inactive "δ-conformation". In the δ-conformation, four residues of the RCL are inserted into the top of β-sheet A. The bottom half of the sheet is filled as a result of one of the α-helices (the F-helix) partially switching to a β-strand conformation, completing the β-sheet hydrogen bonding. It is unclear whether other serpins can adopt this conformer, and whether this conformation has a functional role, but it is speculated that the δ-conformation may be adopted by thyroxine-binding globulin during thyroxine release. The non-inhibitory proteins related to serpins can also cause diseases when mutated. For example, mutations in SERPINF1 cause osteogenesis imperfecta type VI in humans.

In the absence of a required serpin, the protease that it normally would regulate is over-active, leading to pathologies. Consequently, simple deficiency of a serpin (e.g. a null mutation) can result in disease. Gene knockouts, particularly in mice, are used experimentally to determine the normal functions of serpins by the effect of their absence.

Specificity change

In some rare cases, a single amino acid change in a serpin's RCL alters its specificity to target the wrong protease. For example, the Antitrypsin-Pittsburgh mutation (M358R) causes the α1-antitrypsin serpin to inhibit thrombin, causing a bleeding disorder.

Polymerisation and aggregation

The majority of serpin diseases are due to protein aggregation and are termed "serpinopathies". Serpins are vulnerable to disease-causing mutations that promote formation of misfolded polymers due to their inherently unstable structures. Well-characterised serpinopathies include α1-antitrypsin deficiency (alpha-1), which may cause familial emphysema and sometimes liver cirrhosis, certain familial forms of thrombosis related to antithrombin deficiency, types 1 and 2 hereditary angioedema (HAE) related to deficiency of C1-inhibitor, and familial encephalopathy with neuroserpin inclusion bodies (FENIB; a rare type of dementia caused by neuroserpin polymerisation).

Each monomer of the serpin aggregate exists in the inactive, relaxed conformation (with the RCL inserted into the A-sheet). The polymers are therefore hyperstable to temperature and unable to inhibit proteases. Serpinopathies therefore cause pathologies similarly to other proteopathies (e.g. prion diseases) via two main mechanisms. First, the lack of active serpin results in uncontrolled protease activity and tissue destruction. Second, the hyperstable polymers themselves clog up the endoplasmic reticulum of cells that synthesize serpins, eventually resulting in cell death and tissue damage. In the case of antitrypsin deficiency, antitrypsin polymers cause the death of liver cells, sometimes resulting in liver damage and cirrhosis. Within the cell, serpin polymers are slowly removed via degradation in the endoplasmic reticulum. However, the details of how serpin polymers cause cell death remain to be fully understood.

Physiological serpin polymers are thought to form via domain swapping events, where a segment of one serpin protein inserts into another. Domain swaps occur when mutations or environmental factors interfere with the final stages of serpin folding to the native state, causing high-energy intermediates to misfold. Both dimer and trimer domain-swap structures have been solved. In the dimer (of antithrombin), the RCL and part of the A-sheet incorporate into the A-sheet of another serpin molecule. The domain-swapped trimer (of antitrypsin) forms via the exchange of an entirely different region of the structure, the B-sheet (with each molecule's RCL inserted into its own A-sheet). It has also been proposed that serpins may form domain swaps by inserting the RCL of one protein into the A-sheet of another (A-sheet polymerisation). These domain-swapped dimer and trimer structures are thought to be the building blocks of the disease-causing polymer aggregates, but the exact mechanism is still unclear.

Therapeutic strategies

Several therapeutic approaches are in use or under investigation to treat the most common serpinopathy: antitrypsin deficiency. Antitrypsin augmentation therapy is approved for severe antitrypsin deficiency-related pulmonary emphysema. In this therapy, antitrypsin is purified from the plasma of blood donors and administered intravenously (first marketed as Prolastin). To treat severe antitrypsin deficiency-related disease, lung and liver transplantation have proven effective. In animal models, gene targeting in induced pluripotent stem cells has been successfully used to correct an antitrypsin polymerisation defect and to restore the ability of the mammalian liver to secrete active antitrypsin. Small molecules have also been developed that block antitrypsin polymerisation in vitro.

Evolution

Serpins are the most widely distributed and largest superfamily of protease inhibitors. They were initially believed to be restricted to eukaryote organisms, but have since been found in bacteria, archaea and some viruses. It remains unclear whether prokaryote genes are the descendants of an ancestral prokaryotic serpin or the product of horizontal gene transfer from eukaryotes. Most intracellular serpins belong to a single phylogenetic clade, whether they come from plants or animals, indicating that the intracellular and extracellular serpins may have diverged before the plants and animals. Exceptions include the intracellular heat shock serpin HSP47, which is a chaperone essential for proper folding of collagen, and cycles between the cis-Golgi and the endoplasmic reticulum.

Protease-inhibition is thought to be the ancestral function, with non-inhibitory members the results of evolutionary neofunctionalisation of the structure. The S to R conformational change has also been adapted by some binding serpins to regulate affinity for their targets.

Human

The human genome encodes 16 serpin clades, termed serpinA through serpinP, including 29 inhibitory and 7 non-inhibitory serpin proteins. The human serpin naming system is based upon a phylogenetic analysis of approximately 500 serpins from 2001, with proteins named serpinXY, where X is the clade of the protein and Y the number of the protein within that clade. The functions of human serpins have been determined by a combination of biochemical studies, human genetic disorders, and knockout mouse models.
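The serpinXY naming pattern described above can be sketched as a small parser. This is an illustration only: the function names are hypothetical, the pattern (the word "serpin", one clade letter, then a number) is inferred from the description in the text, and the range check tests only whether the clade letter falls within A to P, not whether a gene of that name actually exists.

```python
import re
from typing import Optional, Tuple

# Pattern assumed from the text: "serpin" + one clade letter + a number.
_SERPIN_NAME = re.compile(r"^serpin([A-Z])(\d+)$", re.IGNORECASE)

# The text states that human serpins fall in clades serpinA through serpinP.
HUMAN_CLADES = "ABCDEFGHIJKLMNOP"


def parse_serpin_name(name: str) -> Optional[Tuple[str, int]]:
    """Return (clade letter, number within clade), or None if the name does not fit."""
    match = _SERPIN_NAME.match(name.strip())
    if match is None:
        return None
    return match.group(1).upper(), int(match.group(2))


def in_human_clade_range(name: str) -> bool:
    """True if the name parses and its clade letter lies between A and P."""
    parsed = parse_serpin_name(name)
    return parsed is not None and parsed[0] in HUMAN_CLADES


if __name__ == "__main__":
    # SERPINF1 and SERPINA14 are gene symbols mentioned elsewhere in this article.
    for example in ("SERPINF1", "SERPINA14", "serpinQ3", "Serpin-27A"):
        print(example, parse_serpin_name(example), in_human_clade_range(example))
```

Names that follow a different convention, such as the chromosome-position names used for Drosophila serpins, are deliberately rejected rather than guessed at.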

Specialised mammalian serpins

Many mammalian serpins have been identified that share no obvious orthology with a human serpin counterpart. Examples include numerous rodent serpins (particularly some of the murine intracellular serpins) as well as the uterine serpins. The term uterine serpin refers to members of the serpin A clade that are encoded by the SERPINA14 gene. Uterine serpins are produced by the endometrium of a restricted group of mammals in the Laurasiatheria clade under the influence of progesterone or estrogen. They are probably not functional proteinase inhibitors and may function during pregnancy to inhibit maternal immune responses against the conceptus or to participate in transplacental transport.

Insect

The Drosophila melanogaster genome contains 29 serpin-encoding genes. Amino acid sequence analysis has placed 14 of these serpins in serpin clade Q and three in serpin clade K, with the remaining twelve classified as orphan serpins not belonging to any clade. The clade classification system is difficult to use for Drosophila serpins, and instead a nomenclature system has been adopted that is based on the position of serpin genes on the Drosophila chromosomes. Thirteen of the Drosophila serpins occur as isolated genes in the genome (including Serpin-27A, see below), with the remaining 16 organised into five gene clusters that occur at chromosome positions 28D (2 serpins), 42D (5 serpins), 43A (4 serpins), 77B (3 serpins) and 88E (2 serpins).
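The two breakdowns of the Drosophila serpin genes given above (by clade and by chromosomal arrangement) can be tallied to confirm that each accounts for all 29 genes. The figures below are copied from the text; the variable and function names are hypothetical.

```python
# Counts copied from the text above.
TOTAL_GENES = 29
BY_CLADE = {"clade Q": 14, "clade K": 3, "orphan": 12}
ISOLATED_GENES = 13
CLUSTERS = {"28D": 2, "42D": 5, "43A": 4, "77B": 3, "88E": 2}


def clustered_gene_count() -> int:
    """Number of serpin genes that sit in one of the five clusters."""
    return sum(CLUSTERS.values())


def breakdowns_are_consistent() -> bool:
    """Check that both breakdowns add up to the stated genome total."""
    by_clade_total = sum(BY_CLADE.values())
    by_position_total = ISOLATED_GENES + clustered_gene_count()
    return by_clade_total == TOTAL_GENES and by_position_total == TOTAL_GENES


if __name__ == "__main__":
    print("genes in clusters:", clustered_gene_count())
    print("breakdowns consistent:", breakdowns_are_consistent())
```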

Studies on Drosophila serpins reveal that Serpin-27A inhibits the Easter protease (the final protease in the Nudel, Gastrulation Defective, Snake and Easter proteolytic cascade) and thus controls dorsoventral patterning. Easter functions to cleave Spätzle (a chemokine-type ligand), which results in Toll-mediated signaling. As well as its central role in embryonic patterning, Toll signaling is also important for the innate immune response in insects. Accordingly, Serpin-27A also functions to control the insect immune response. In Tenebrio molitor (a large beetle), a protein (SPN93) comprising two discrete tandem serpin domains functions to regulate the Toll proteolytic cascade.

Nematode

The genome of the nematode worm C. elegans contains 9 serpins, all of which lack signal sequences and so are likely intracellular. However, only 5 of these serpins appear to function as protease inhibitors. One, SRP-6, performs a protective function and guards against stress-induced calpain-associated lysosomal disruption. Further, SRP-6 inhibits lysosomal cysteine proteases released after lysosomal rupture. Accordingly, worms lacking SRP-6 are sensitive to stress. Most notably, SRP-6 knockout worms die when placed in water (the hypo-osmotic stress lethal phenotype or Osl). It has therefore been suggested that lysosomes play a general and controllable role in determining cell fate.

Plant

Plant serpins were amongst the first members of the superfamily to be identified. The serpin barley protein Z is highly abundant in barley grain, and is one of the major protein components in beer. The genome of the model plant Arabidopsis thaliana contains 18 serpin-like genes, although only 8 of these are full-length serpin sequences.

Plant serpins are potent inhibitors of mammalian chymotrypsin-like serine proteases in vitro, the best-studied example being barley serpin Zx (BSZx), which is able to inhibit trypsin and chymotrypsin as well as several blood coagulation factors. However, close relatives of chymotrypsin-like serine proteases are absent in plants. The RCLs of several serpins from wheat grain and rye contain poly-Q repeat sequences similar to those present in the prolamin storage proteins of the endosperm. It has therefore been suggested that plant serpins may function to inhibit proteases from insects or microbes that would otherwise digest grain storage proteins. In support of this hypothesis, specific plant serpins have been identified in the phloem sap of pumpkin (CmPS-1) and cucumber plants. Although an inverse correlation between up-regulation of CmPS-1 expression and aphid survival was observed, in vitro feeding experiments revealed that recombinant CmPS-1 did not appear to affect insect survival.

Alternative roles and protease targets for plant serpins have been proposed. The Arabidopsis serpin, AtSerpin1 (At1g47710; PDB 3LE2), mediates set-point control over programmed cell death by targeting the 'Responsive to Desiccation-21' (RD21) papain-like cysteine protease. AtSerpin1 also inhibits metacaspase-like proteases in vitro. Two other Arabidopsis serpins, AtSRP2 (At2g14540) and AtSRP3 (At1g64030), appear to be involved in responses to DNA damage.

Fungal

A single fungal serpin has been characterized to date: celpin from Piromyces spp. strain E2. Piromyces is a genus of anaerobic fungi found in the gut of ruminants and is important for digesting plant material. Celpin is predicted to be inhibitory and contains two N-terminal dockerin domains in addition to its serpin domain. Dockerins are commonly found in proteins that localise to the fungal cellulosome, a large extracellular multiprotein complex that breaks down cellulose. It is therefore suggested that celpin may protect the cellulosome against plant proteases. Certain bacterial serpins similarly localize to the cellulosome.

Prokaryotic

Predicted serpin genes are sporadically distributed in prokaryotes. In vitro studies on some of these molecules have revealed that they are able to inhibit proteases, and it is suggested that they function as inhibitors in vivo. Several prokaryote serpins are found in extremophiles. Accordingly, and in contrast to mammalian serpins, these molecules possess elevated resistance to heat denaturation. The precise role of most bacterial serpins remains obscure, although Clostridium thermocellum serpin localises to the cellulosome. It is suggested that the role of cellulosome-associated serpins may be to prevent unwanted protease activity against the cellulosome.

Viral

Serpins are also expressed by viruses as a way to evade the host's immune defense. In particular, serpins expressed by pox viruses, including cow pox (vaccinia) and rabbit pox (myxoma), are of interest because of their potential use as novel therapeutics for immune and inflammatory disorders as well as transplant therapy. Serp1 suppresses the TLR-mediated innate immune response and allows indefinite cardiac allograft survival in rats. CrmA and Serp2 are both cross-class inhibitors and target both serine proteases (granzyme B, albeit weakly) and cysteine proteases (caspase 1 and caspase 8). In comparison to their mammalian counterparts, viral serpins contain significant deletions of elements of secondary structure. Specifically, CrmA lacks the D-helix as well as significant portions of the A- and E-helices.
