Biotechnology Industry
 

Process Biotechnology

Electronic Journal of Biotechnology ISSN: 0717-3458 Vol. 10 No. 2, Issue of April 15, 2007
© 2007 by Pontificia Universidad Católica de Valparaíso -- Chile Received June 6, 2006 / Accepted November 28, 2006
DOI: 10.2225/vol10-issue2-fulltext-13 How to reference this article
REVIEW ARTICLE

Peptide synthesis: chemical or enzymatic

Fanny Guzmán
Instituto de Biología
Pontificia Universidad Católica de Valparaíso
Avenida Brasil 2950
Valparaíso, Chile
Fax: 56 32 212746
E-mail: fanny.guzman@ucv.cl

Sonia Barberis
Facultad de Química, Bioquímica y Farmacia
Universidad Nacional de San Luis
Ejército de los Andes 950 (5700)
San Luis, Argentina
E-mail: sbarberi@unsl.edu.ar

Andrés Illanes*
Escuela de Ingeniería Bioquímica
Pontificia Universidad Católica de Valparaíso
Avenida Brasil 2147
Fax: 56 32 2273803
E-mail: aillanes@ucv.cl

*Corresponding author

Financial support: This work was done within the framework of Project CYTED IV.22 Industrial Application of Proteolytic Enzymes from Higher Plants.

Keywords: enzymatic synthesis, peptides, proteases, solid-phase synthesis.

Abbreviations:

CD: circular dichroism
CLEC: cross linked enzyme crystals
DDC: double dimer constructs
ESI: electrospray ionization
HOBT: hydroxybenzotriazole
HPLC: high performance liquid hromatography
KCS: kinetically controlled synthesis
MALDI: matrix-assisted laser desorption ionization
MAP: multiple antigen peptide system
MS: mass spectrometry
NMR: nuclear magnetic resonance
SPS: solution phase synthesis
SPPS: solid-phase peptide synthesis
t-Boc: tert-butoxycarbonyl
TCS: thermodynamically controlled synthesis
TFA: trifluoroacetic acid

Abstract   Reprint (PDF)
Abstract
Article
References

Peptides are molecules of paramount importance in the fields of health care and nutrition. Several technologies for their production are now available, among which chemical and enzymatic synthesis are especially relevant. The present review pretends to establish a non-biased appreciation of the advantages, potentials, drawbacks and limitations of both technologies. Chemical synthesis is thoroughly reviewed and their potentials and limitations assessed, focusing on the different strategies and challenges for large-scale synthesis. Then, the enzymatic synthesis of peptides with proteolytic enzymes is reviewed considering medium, biocatalyst and substrate engineering, and recent advances and challenges in the field are analyzed. Even though chemical synthesis is the most mature technology for peptide synthesis, lack of specificity and environmental burden are severe drawbacks that can in principle be successfully overcame by enzyme biocatalysis. However, productivity of enzymatic synthesis is lower, costs of biocatalysts are usually high and no protocols exist for its validation and scale-up, representing challenges that are being actively confronted by intense research and development in this area. The combination of chemical and enzymatic synthesis is probably the way to go, since the good properties of each technology can be synergistically used in the context of one process objective.

Article
Article
Chemical synthesis of peptides
  • Synthesis of peptides in solution
  • Solid-phase synthesis of peptides
  • Methodologies of synthesis
  • Modification of peptides
  • Large scale chemical synthesis of peptides
    Enzymatic synthesis of peptides
  • Proteolytic enzymes: classification and catalytic mechanisms
  • Proteases in peptide synthesis
  • Strategies of enzymatic synthesis of peptides
  • Advances and challenges in enzymatic synthesis of peptides
    Chemical versus Enzymatic Synthesis: Pros and Cons

  • Concluding Remarks
    Acknowledgments
    Figure 1
    Figure 2
    Figure 3
    Figure 4
    Figure 5
    Figure 6
    Figure 7
    Figure 8
    Table 1
    Table 2
    References

    Peptides are heteropolymers composed by amino acid residues linked by peptidic bonds between the carboxyl group of one amino acid residue and the α-amino group of the next one. The definition is rather vague in terms of chain length, peptides ranging from two to a few dozens residues. Its lower limit of molecular mass has been set rather arbitrarily in 6000 Da; molecules larger than that are considered proteins. Peptides are molecules of paramount importance in several fields, especially in health care and nutrition. The case of the hormone insulin (51 residues, 5773 Da) and the non-caloric sweetener aspartame (a dipeptide of aspartic acid and esterified phenylalanine) are relevant examples of those fields of application and the range of molecular size. Medium to small size peptides are, however, the most relevant for such applications.

    Different technologies are now available for the production of peptides and proteins: the extraction from natural sources (Hipkiss and Brownson, 2000), the production by recombinant DNA technology (Gill et al. 1996), the production in cell-free expression systems (Katzen et al. 2005), the production in transgenic animals (Wright et al. 1991) and plants (Cunningham and Porter, 1997), the production by chemical synthesis (Du Vigneaud et al. 1953; Merrifield, 1963) and by enzyme technology using proteolytic enzymes under conditions of displacement of the equilibrium of the reaction towards the formation of peptide bond (Feliú et al. 1995).

    The size of the molecule determines the technology most suitable for its production. Recombinant DNA technology is particularly suitable for the synthesis of large peptides and proteins, as illustrated by the case of insulin and other hormones (Walsh, 2005). Chemical synthesis is a viable technology for the production of small and medium size peptides ranging from about 5 to 80 residues (Kimmerlin and Seebach, 2005). Enzymatic synthesis is more restricted and has been hardly applied for the synthesis of peptides exceeding 10 residues. Its potential relies on the synthesis of very small peptides and, in fact, most of the cases reported correspond to dipeptides and tripeptides (Kumar and Bhalla, 2005). In this sense, the technologies for peptide production are not competitive with each other in most of the cases. The present review focuses on the chemical and enzymatic synthesis of peptides, aiming to establish a non-biased appreciation of their advantages, potentials, drawbacks and limitations. Chemical and enzymatic approaches do not exclude each other and a recent trend is the combination of both (Hou et al. 2005).

    Chemical synthesis of peptides

    The chemical route is often a better technological option than the biotechnological methods of recombinant DNA and biocatalysis for the synthesis of medium size peptides that comprise most of the pharmaceutically relevant molecules. It is also a fundamental tool for understanding the structure-function relationship in proteins and peptides, the discovery of novel therapeutic and diagnosis agents and the production of synthetic vaccines (Noya et al. 2003). Recently it has been used for the design of synthetic biocatalysts, which is a very promising area of research (Kaplan and DeGrado, 2004; Carrea et al. 2005). The synthesis of peptides was originally performed in solution. However, since the introduction of solid-phase synthesis by Merrifield (1986), this technology has gained more relevance (Stewart and Young, 1984) and significant advances have been made in the development of polymeric carriers and linkers, reversible protective groups (Goodman, 2002) and methods for the activation of covalent bond formation (Albericio, 2004), contributing in this way to the advancement of organic chemistry as a powerful tool for protein and peptide research.

    Synthesis of peptides in solution

    This technique has been used for the synthesis of small peptides composed by only a few amino acid residues. Its main advantage is that the intermediate products can be isolated and purified after each step of synthesis, deprotected and recombined to obtain larger peptides of the desired sequence. This technique is highly flexible with respect to the chemistry of coupling and the combination of the peptidic blocks. New strategies for synthesis in solution have been developed, going from the design of functional groups for the side chains and condensation of fragments for the synthesis of large molecules (Nishiuchi et al. 1998) to the use of new coupling reagents (Hiebl et al. 1999).

    Solid-phase synthesis of peptides

    Solid-phase peptide synthesis (SPPS) consists in the elongation of a peptidic chain anchored to a solid matrix by successive additions of amino acids which are linked by amide (peptide) bond formation between the carboxyl group of the incoming amino acid and the amino group of the amino acid previously bound to the matrix, until the peptide of the desired sequence and length has been synthesized (Nilsson et al. 2005). When Merrifield introduced the method of solid-phase synthesis in 1963, the scientific community reacted with skepticism: the synthesis in solution was at that time well established and in the new proposed system the purification of the peptide could only be done after cleavage, with the concomitant cleavage of most of the byproducts accumulated during the synthesis (Andersson et al. 2000). Despite these drawbacks, solid-phase synthesis has many advantages over the classical system in solution: the reaction can be automated and the problem of solubilization of the peptide no longer exists since it remains attached to the solid matrix.

    The strategy of synthesis (Fmoc or t-Boc), the nature of the solid carrier, the coupling reagents and the procedure of cleavage of the peptide from the solid matrix are the most relevant variables in SPPS. A general scheme of the stepwise SPPS is presented in Figure 1. The first step is the coupling of the C-terminal amino acid to the solid matrix. The Nα (A) group is then removed by treatment with trifluoroacetic acid (TFA) in the t-Boc strategy and with piperidine in the Fmoc strategy. The next (Nα protected) amino acid is coupled to the already synthesized peptide chain bound to the polymeric matrix and, once coupled, its Nα amino group is deprotected. This coupling-deprotection cycle is repeated until the desired amino acid sequence has been synthesized. Finally, the peptide-matrix complex is cleaved and side chain protecting groups are removed to yield the peptide with either a free acid or amide depending on the chemical nature of the functional group in the solid matrix. The cleavage reagent must remove the protecting groups of the side chains of the amino acids, which are stable at the conditions of Nα deprotection.

    Protection strategies. In the last few years, more than 250 protecting groups have been proposed as suitable for peptide synthesis (Goodman et al. 2001); however, a relatively small number of those is actually used because of the stringent requirements that a protecting group should meet, particularly with respect to the requirement of the preservation of other functionalities. Research and development in SPPS has conducted to two main schemes of protection, which are known as t-Boc/Bzl and Fmoc/tBu strategies (Chan and White, 2000). In t-Boc/Bzl, the t-Boc (tert-butoxycarbonyl) group is used for the protection of the Nα amino group and a benzyl or cyclohexyl for the side chains of several amino acids. In Fmoc/tBu, the Fmoc (9-fluorenyl methoxycarbonyl) group is used for the protection of the Nα amino group and the tert-butyl group for the side chains of several amino acids (Albericio, 2000). Protecting groups for the side chains commonly used in the t-Boc/Bzl and Fmoc/tBu strategies are listed in Table 1.

    Solid supports. Solid supports should meet several requirements: particles should be of conventional and uniform size, mechanically robust, easily filterable, chemically inert and chemically stable under the conditions of synthesis and highly accessible to the solvents allowing the penetration of the reagents and the enlargement of the peptide chain within its microstructure. They must not interact physically with the peptide chain being synthesized and should be capable of being functionalized by a starting group. Several polymeric supports are now available which can be derivatized with functional groups to produce a highly stable linkage to the peptide being synthesized (Barlos et al. 1991) and peptides with different functionalities in the terminal carboxyl group (i.e.: amide, acid, thioester) (Canne et al. 1999). Some examples are the p-methoxybenzhydrylamine (MBHA), 4-hydroxymethyl-phenylacetamidomethyl (PAM) and hydroxymethyl functionalized resins used for t-Boc/Bzl, and the 4-(2',4'-dimethoxyphenyl-aminomethyl)-phenoxymethyl-polystyrene (Rink), 2-chlorotrityl chloride, and diphenyldiazomethane functionalized resins used for Fmoc/tBu.

    Coupling reagents. Several reagents that activate the carboxyl groups of the amino acids are used for the coupling reaction. The most used now are uronium and phosphonium salts and hydroxybenzotriazole (HOBT) because of the high reactivity, high coupling yield and higher specificity than obtained in conventional systems, such as DCC/HOBT (dicyclohexyl-carbodiimide/1-hydroxybenzotriazol), HOBT derivatives and symmetric anhydrides (Miranda and Alewood, 2000). Chaotropic salts (CuLi, NaClO4, KSCN) and mixtures of solvents, such as N,N dimethylformamide, trifluoroethanol, dimethylacetamide and N-methylpirrolidone, have been used to improve the efficiency of coupling and the elongation of the peptide chain in tough sequences.

    Cleavage from the solid support. Once the peptide synthesis of the desired sequence is finished, the protecting groups of the side chains are removed and the peptide freed from the support. In the t-Boc/Bzl strategy, the most popular method is the one developed by Tam et al. (1983). Deprotection is carried out with strong acids that may lead to unwanted secondary reactions of alkylation or acylation in certain amino acids that are promoted by the leaving protecting groups. To avoid such reactions, combinations of solvents acting as nucleophiles and acids that allow the process of deprotection have been pursued for decades. Different is the case of the Fmoc/tBu strategy, in which simpler solvents as TFA in combination with triisopropylsilane can be used.

    Methodologies of synthesis

    The main SPPS strategies are sequential synthesis, convergent synthesis and chemical ligation. Sequential synthesis involves the stepwise addition of amino acids until the desired sequence is synthesized. Convergent synthesis involves the independent synthesis of peptide sequences that are then linked by condensation in liquid phase. In chemical ligation those fragments are linked by chemoselective reactions involving thioether (Lu and Tam, 2005), oxime (Nardin et al. 1998), hydrazone and thiazolidine (Tam et al. 1995) linkages.

    Sequential. Sequential synthesis was already depicted in Figure 1. This strategy is used for the synthesis of small to medium size peptides having up to 50 residues. However, larger size polypeptides can be constructed using sequential synthesis by the technique of cysteine polymerization, the construction of dendrimers using lysine matrices, or the construction of Template-Assembled Synthetic Protein (TASP) (Tuchscherer and Mutter, 1996; Banfi et al. 2004).

    Cysteine polymerization is performed by locating cysteine residues at the amino and carboxy terminals of the peptide. Cysteine has a sulfhydryl group prone to oxidation to form disulfide bridges so that the peptide can be polymerized both at the amino and carboxy terminals to yield polypeptides of high molecular weight. This technique produces a wide range of oxidized peptidic species ranging from the cyclic monomer and dimer to high molecular weight polypeptides, as shown in Figure 2. Cysteine polymerization was used for the construction of the first chemically synthesized anti-malaria vaccine, Spf-66 (Patarroyo et al. 1988; Amador et al. 1992). Cysteine chain polymer peptides are often the best presentation of a peptidic vaccine in terms of immunogenicity (Patarroyo et al. 2002). The problem is that it is not possible to tightly control the degree of polymerization to obtain a single peptide species, so that different species are produced whose distribution is not necessarily reproducible from one batch to another. This drawback can be important for the construction of human vaccines and for the validation of the process. In fact, proper control of SPPS by cysteine polymerization still represents a challenge.

    High molecular weight peptides can be conveniently synthesized by the t-Boc/Bzl strategy using in-situ neutralization (Miranda and Alewood, 1999; Taylor et al. 2005), which cannot be easily done by Fmoc/tBu because of the steric hindrances in the deprotection stage of the Fmoc group as the peptide chain grows (Tickler et al. 2001).

    Another option for obtaining high molecular weight peptides is the construction of dendrimers, which are highly ordered and branched peptides that contain in their core a matrix of lysine residues to whose amino groups several copies of the peptide sequence of interest can be linked. In this way a peptidic macromolecule with several copies of the desired sequence can be synthesized. A type of dendrimer, known as multiple antigen peptide system (MAP), composed of a lysine matrix and an external zone of antigenic peptides is illustrated in Figure 3 (Tam, 1988). One of the problems of this technique is that truncated or incomplete sequences of the peptide may be synthesized in some of the branches of the structure.

    Dendrimers known as double dimer constructs (DDC) have been reported by Calvo et al. (1999). The construction of a DDC occurs in two steps: initially a peptide is constructed with two arms containing a cysteine at each carboxy terminal; then, this dimer is purified and subsequently oxidized forming a tetramer as shown in Figure 4. DDC type tetramers have been used as immunogens in the study of possible candidates for anti-malaria vaccines, being this presentation more immunogenic than the one based on cysteine polymers in the experimental model with Aotus monkeys (Rivera et al. 2002). Compared with MAPs, DDCs produce higher yields of synthesis of branched peptides and its purification is simpler (Chávez et al. 2001; Rivera et al. 2002). Its main limitation is that this system is not applicable to dendrimers with copies of antigens whose sequences have conformational restrictions.

    A dendrimeric architecture containing eight variable positions connected by three successive branching diamino acid units was recently used to develop a library of potentially catalytic peptides (Clouet et al. 2004). Combinatorial libraries have been also developed for peptide enzyme mimics (Kofoed et al. 2006).

    Convergent. In convergent synthesis, peptides (up to 50 residues) are separately produced by sequential synthesis and then linked in solution or in solid phase to obtain the desired high molecular weight peptide or protein. A schematic representation of convergent synthesis is presented in Figure 5. The advantage of convergent synthesis is that each peptide fragment is purified and characterized before being linked. In this way, side reactions are minimized during synthesis. Depending on the protective groups and the functional group linked to the support, it is possible to use some orthogonal protection scheme in which two or more types of groups are involved that are removed by different mechanisms (Zhang and Tam, 1999). An example of orthogonality in convergent synthesis is the combination of t-Boc/Bzl and Fmoc/tBu using supports whose linkage to the peptide can be either labile to acid, base, palladium or photolabile. These combinations allow the selective protection or deprotection of the reactive groups to avoid side reactions and direct the synthesis. However, convergent synthesis has some drawbacks: the solubility of the protected fragments in the aqueous solvents used in the purification by HPLC and in the organic solvents used in the coupling reactions is usually low, reaction rates for the coupling of fragments are substantially lower than for the activated amino acid species in the conventional stepwise synthesis and, finally, the C terminal of each peptide fragment can be racemized during coupling. Some of these problems have been circumvented by using mixtures of solvents to increase the solubility, by using prolonged reaction times to increase the efficiency of coupling and by using glycine or proline in the C terminal to avoid the problem of racemization. In this way, despite the technical problems yet to be solved, convergent synthesis represents the best option for the chemical synthesis of large peptides and proteins. A variety of large peptides have been successfully produced by convergent synthesis, such as P41icf (Chiva et al. 2003) and T-20 (Bray, 2003). New generation globular polyproline dendrimers have bee synthesized recently by convergent SPPS which are relevant structures for the delivery of peptide drugs (Fillon et al. 2005; Sanclimens et al. 2005).

    Chemical ligation. Chemical ligation is a particularly appealing strategy for the chemical synthesis of large peptides and proteins (Baca et al. 1995; Yan and Dawson, 2001). It is based on the chemical linkage of short unprotected peptides which are easy to handle because of its high solubility in the solvents used for synthesis. These peptides are functionalized with groups that react chemoselectively with only one group of the acceptor peptide preserving the integrity of the unprotected side chains. Many proteins and peptides of biological interest have been synthesized by chemical ligation using a variety of ligands with the formation of thioester, oxime, disulfide, thiazolidine and peptide bonds. Initially, the main problem was the incorporation in the peptide sequence of non-natural (non-peptide) bonds; however, there are several examples of these non-natural structures that maintain the activity of the natural peptide. Based on the original principle of chemical ligation, Dawson et al. (1994) developed a method that allows the linkage of unprotected peptide segments by thioester bond that suffers a spontaneous rearrangement to produce a peptide bond. This strategy is termed native chemical ligation (Figure 6) and several peptides and proteins have been synthesized by this method without the introduction of non-peptidic bonds (Dawson and Kent, 2000). The method involves a thioester mediated reaction in aqueous solution at neutral pH of two unprotected peptide segments resulting in the ligation of the terminal cysteine of one peptide segment to the other peptide segment via a native amide bond. Some typical examples of this methodology are the synthesis of phospholipase A (124 residues with 14 cysteine residues and 7 disulfide bridges), the synthesis of the microprotein S (Hackeng et al. 2000) and the total chemical synthesis of the H-Ras proteins of the c-Raf-1 effector having 166 residues (Becker et al. 2003). A review on the synthesis of peptides by chemical ligation has been recently published (Johnson and Kent, 2006).

    Experimental procedures. SPPS can be performed in different ways. There are manual and automated systems available for small and large scale synthesis of only one peptide or several peptides at the time (multiple peptide synthesis). All operations in SPPS, namely coupling, deprotection and final removal, are conducted in the same recipient so that several washing steps have to be considered; reagents are used in large excess to speed-up the reaction and drive it to completion.

    Manual synthesis of individual peptides can be performed in syringes of different sizes provided with a bottom sintered-glass or plastic filter. Multiple peptide synthesis at the micromolar level can be conducted in functionalized cellulose, polypropylene or polyvynilidene difluoride membranes, according to the spot-synthesis methodology developed by Frank (2002). Fmoc/tBu and t-Boc/Bzl multiple peptide synthesis at the millimolar level can be performed using the tea-bag methodology developed by Houghten (1985), in which up to 400 peptides of 20 residues or less can be synthesized at the same time. The tea-bag system of SPPS has been successfully employed to construct peptide libraries (Houghten et al. 2000).

    Several systems for the automated t-Boc/Bzl and Fmoc/tBu SPPS, going from 1.5 mg to 5 kg scale, are now available from Applied Biosystems, Shimadzu, Advanced ChemTech, Protein Tecnologies, Abimed and Zinsser Analytic, among others (Chan and White, 2000). These systems allow a tight control of the conditions of synthesis, which is important for the reproducibility of results and the validation of the process. However, automated SPPS is expensive and a thorough economic evaluation should be done in each case before adopting it.

    During the process of peptide synthesis, several analyses must be performed at intermediate stages but mostly to the final product. During synthesis, the determination of free amino groups is essential in the stages of coupling and Nα deprotection in the case of Fmoc strategy. The most common assay for free amino groups is the ninhydrine test because of its sensitivity and reproducibility, but it is a destructive test so that samples have to be removed from the product being synthesized (Sarin et al. 1981). Non destructive tests are also used. The bromophenol blue (Krchñak et al. 1988) and picric acid (Gisin and Merrifield, 1972) tests are prominent among non-destructive tests, but they are not so commonly used because of their lesser sensitivity and reproducibility; in the case of picric acid test a trained eye is required to assess the difference in yellow tones. Reverse-phase high performance liquid chromatography (HPLC), using C-18 or C-8 columns, is the most used procedure for analysis and purification of peptides. Separation is based on hydrophobicity and mobile phase gradients are used with polar solvents such as acetonitrile. When peptides are synthesized as cysteine polymers, the molecular mass of the product can be determined by size-exclusion HPLC (Santovena et al. 2002) or capillary electrophoresis (Rosas et al. 2001). The determination of the molecular mass of the peptide is fundamental for its proper identification and mass spectrometry (MS) is the most powerful analytical technology available, which can also give information on structural modifications of a particular peptide (i.e. alkylations, formation of aspartiimides) that occur during cleavage of the peptide in SPSS. It is extremely rapid and hundreds of samples can be analyzed sequentially over extremely small amounts of peptides (Kalia and Gupta, 2005). MS can be divided into two main categories depending on the method of ionization, which are termed matrix-assisted laser desorption ionization (MALDI) and electrospray ionization (ESI). MALDI is the most powerful technique nowadays for molecular mass determination of peptides, since it produces no fragmentation. Most used matrices for MALDI are 2-5 dihydroxybenzoic acid (DHB), 3-5 dimethoxy-4-hydroxy cinnamic acid (sinapinic acid, SA) and α-cyano-4-hydroxycinnamic acid (CCA) (Moore, 1997). MALDI is an extremely sensitive method: samples ranging from 5-10 pmoles/µL can be assayed and only one µL is required. ESI shares the good properties of MALDI but has some additional restrictions since only rather polar molecules can be analyzed (Kalia and Gupta, 2005). Both MALDI and ESI are less sensitive for high molecular weight peptides (over 30000 Da). The method of choice will depend on the type of peptide, although both systems deliver in most cases the same type of information (Ambulos et al. 2000).

    The precise quantification of the amount of peptide synthesized is determined through amino acid analysis by HPLC after peptide hydrolysis in acid medium considering those acid-resistant amino acids. Although amino acid analysis is usually performed after cleavage from the resin, it can also be conducted to the peptide still attached to the resin (Ambulos et al. 2000). The quantification of cysteinic SH groups in the peptide is performed according to the method of Ellman (1958).

    The determination of peptide structure can be done by circular dichroism (CD) (Brahms and Brahms, 1980) and nuclear magnetic resonance (NMR) (Rance et al. 1983). Secondary structure determination by CD can be done with only 20 µg of peptide on a 0.1 mL cell. The analysis will determine if the peptide has a secondary structure and if the structure is α-helix (one maximum at 193 nm and two minima at 208 and 222 nm), or β-sheet (one maximum at 198 nm and one minimum at 217 nm). If the peptide has no secondary structure, only one maximum will be obtained at 211 nm. Other relevant structures in peptides are β-turns, which can be mistaken as no structure, since they yield a similar spectrum (Guzmán et al. 2002). NMR requires from 7 to 10 mg of peptide per sample, but in addition it provides information about the amino acid residues involved in each structure (Bermúdez et al. 2003).

    Modification of peptides

    The application of native peptides for pharmacological use may have some restrictions because of the degradation by endogenous proteases, hepatic clearance, undesired side effects by interaction with different receptors, and low membrane permeability due to their hydrophilic character (Sewald and Jakubke, 2002).

    Chemically modified peptides with improved bioavailability and metabolic stability may be directly used as drugs and many efforts have been made to develop peptide-based, pharmacologically active compounds. The most straightforward approach for peptide modification is to introduce changes into the side chains of single amino acids. The introduction of such functional groups that do not occur naturally in peptides restricts its conformational flexibility and enhances its metabolic stability. Other option is the modification of the peptide chain (backbone), which can be accomplished in different ways: its exchange by an amide analogue, the alkylation of the NH group of the peptide bond (Ahn et al. 2002), the reduction of the carbonyl group of the peptide bond (Lozano et al. 2004), or the exchange of the NH group of a peptide bond by an oxygen atom (depsipeptide) (Kuisle et al. 1999; Albericio et al. 2005; Cruz et al. 2006) a sulfur atom (thioester) (Vabeno et al. 2004) or a CH2 group (ketomethylene isostere) (Martin et al. 2001). Notable among these modified peptides are depsipeptides that are compounds having sequences of amino and hydroxy carboxylic acid residues regularly alternating; they can also form cyclic structures named cyclodepsipeptides, where the residues are connected in a ring. There is great interest in this class of products due to their wide range of biological properties such as immunosuppressant, antibiotic, antifungi, antiinflammatory and antitumoral activities (Hamel and Covell, 2002; Sarabia et al. 2004).

    Another approach is the construction of constrained structure peptidomimetics in which a region of a defined structure in the natural peptide is synthesized and bound by its amino and carboxy terminals through a non protein ligand (Ehrlich et al. 1996; Lioy et al. 2001). In this way, better recognition by antibodies can be obtained (Calvo et al. 2003). Such peptidomimetics have also been produced to inhibit protein-protein interaction (Yin et al. 2006).

    Peptides can also be glycosylated. In fact, many proteins and peptides of pharmacological relevance are naturally glycosylated and in such cases the carbohydrate portion of the moiety can be essential for recognition and improvement of clearance rate and other pharmacodynamic parameters. In the chemical synthesis of glycopeptides, glycosylation can be obtained either by using glycosylated amino acids or by conjugation of the final carbohydrate unit to the full-length peptide (Von Moos and Ben, 2005). Several other reactions of modification of peptides (i.e. with biotin, fluorescein, rhodamine) at its amino terminal are useful as markers for conducting in vitro kinetic studies (Chan and White, 2000).

    The synthesis of β-peptides (peptides constructed from non-natural β-amino acids) is a new promising area of peptide research. Short chains of β-amino acids usually form more stable secondary structures than the natural α counterparts and have proven to be endowed with interesting biological properties, for instance: inhibition of cholesterol and fat intestinal absorption (Werder et al. 1999). These non-natural peptides are resistant to enzymatic degradation which can be a relevant characteristic for their therapeutic application (Schreiber et al. 2002). Small β-peptides can be synthesized using conventional Fmoc/tBu SPPS strategy (Guichard and Seebach, 1997), but the protocols have to be modified for longer peptides because of the high folding propensity of β-peptides, which increases coupling times and prevents complete deprotection. However, the replacement of piperidine by stronger bases has circumvented the problem of incomplete deprotection in Fmoc/tBu solid phase synthesis of β-peptides (Schreiber and Seebach, 2000) and longer β-peptides and mixed type α-β-peptides have been successfully synthesized by chemical ligation (Kimmerlin et al. 2002). β-amino acids are, however, very expensive building blocks so that the usual practice in SPPS of using excess reagents has to be avoided. This problem has not been fully addressed and remains to be solved from a technological point of view. Methodologies for β-peptide synthesis have been recently reviewed (Kimmerlin and Seebach, 2005).

    Large scale chemical synthesis of peptides

    More than 40 therapeutic peptides are now in the market, which represents a considerable increase, since less than 10 were in the market by 1990. A much higher number of therapeutic peptides are in the different phases of approval. The therapeutic application of peptides has an enormous potential, which has been enhanced by the advances in the fields of formulation and administration of pharmaceuticals.

    Among the technologies available for the large scale synthesis of peptides, chemical synthesis is the most developed and SPPS can be considered now a mature technology. SPPS can be used sometimes in combination with synthesis in solution (hybrid synthesis). Even though SPPS can be troublesome in practice for peptides of more than 10 residues, in some cases peptides of about 50 residues have been synthesized successfully by this system (Patarroyo et al. 1988) and several peptides from 30 to 50 residues synthesized by SPPS are now in the market (Bruckdorfer et al. 2004). Commonly, peptides of less than 30 residues are produced entirely by solid phase sequential synthesis (Lloyd-Williams and Giralt, 2000), while larger peptides (up to 60 residues) must be produced by convergent synthesis in which protected peptide fragments are synthesized by SPPS and then combined by liquid phase synthesis (Riniker et al. 1993; Barlos and Gatos, 1999). Proteins are preferably produced by chemoselective ligation in which all the unprotected linked fragments have been previously synthesized by SPPS.

    The development and commercialization of acid labile resins (Rink, 1987; Bray, 2003) has allowed the synthesis of large peptides by hybrid synthesis. Small fragments with protected side chains are synthesized over the acid-labile resins using Fmoc protected amino acids; these fragments are removed from the resin with 1% TFA solution. The fragments that contain the whole peptide or protein sequence are condensed in solution to yield a product with protected side chains which is then deprotected and purified to complete the synthesis of the desired product.

    Independent of the SPPS system chosen, peptide synthesis is a multi-step process where the number of operations required increases proportionally to the number of amino acid residues of the peptide. This in turn produces a decrease in yield, which is exponential with the number of steps involved in the synthesis of the peptide. In fact, if Ri is the yield of recovery in step i, then the global yield of recovery (R) of a synthesis of N steps is:

    The synthesis of a peptide of 10 to 20 amino acids involves 20 to 50 operations. If an average yield of recovery of 90% is assumed for each amino acid linked, the total yield of recovery would be within the range of 35% (0.910 = 0.35) and 12% (0.920 = 0.12). Even if the average yield of recovery could be increased to 98%, the total yield of recovery would be within 82% (0.9810 = 0.82) and 67% (0.9820 = 0.67). This simple exercise highlights the importance of improving the coupling efficiencies, which is a key aspect for the success of a process for peptide synthesis at industrial scale. If one takes into consideration that in the production of 1 kg of a peptide drug, more than 1000 liters of liquid waste are produced, any improvement in yield will have an impact not only in terms of productivity but also in terms of environmental protection. This is particularly so in the case of the chemical synthesis of peptides, which involve coupling, protection and deprotection reagents that are environmentally aggressive and even toxic (Andersson et al. 2000).

    The synthesis of sophisticated peptides for therapeutic use is first conducted at the milligram scale for performing biological studies and elucidating structure-activity relationships. Even though yields of recovery can be quite low at this stage, they can be good enough to conduct such studies. However, the identification of the peptide as a candidate for clinical studies demands the synthesis of large amounts of product. When the therapeutic dose has been established in Phase II, the scale of production will be higher and a scale-up strategy for the process will be necessary. At this point, the cost-benefit analysis of the process of synthesis becomes relevant. The strategy should satisfy not only the requirements for the licensing of the product but also comply with the regulatory requirements and validation of the process, which implies the establishment of the chemical and biological equivalence with the product synthesized at the small scale. In this context, the amino acid protection systems should be similar, and ideally identical, in both stages of development. The selection of the most effective route for synthesis at large scale represents a challenge and all efforts should be directed towards the design of the procedure of synthesis that renders the higher productivities and yields. When the process is scaled up to production levels (kilograms or tons, depending on the product), criteria that are not necessarily relevant at small scale become relevant. Production costs, purity of the product, security and environmental impact of the process are now issues of major concern. Costs of reagents are usually high; therefore, the use of large excess, which is common practice in the early stages of development, is inadmissible at large scale, where it is highly desirable to use reagents as close to stoichiometry as possible. In fact, large amounts of reagents cannot be wasted just to ensure the completion of the reaction. Aside from costs, environmental burden is also a matter of concern because of the surplus effluent produced. Therefore, the kinetics of the reactions should be studied in greater detail when scaling up to reduce the costs associated with reagents and waste treatment after reaction (Andersson et al. 2000; Bruckdorfer et al. 2004; Patarroyo and Guzmán, 2004). Some relevant examples of peptides produced at large scale and the production system used are shown in Table 2.

    The final goal of a process for the large scale synthesis of a peptide is to establish its technological and economic viability and to satisfy the requirements of the regulatory agencies in terms of product quality standards and process validation. A system of GMP (good manufacturing practices) must be established to ensure that the production process is highly reproducible and the final product is delivered with a consistent quality (WHO, 1999). Process validation should consider reproducibility in terms of yields of intermediate and final products and consistency in the profile of impurities of the product (Andersson et al. 2000). This is a complex task in the case of the synthesis of peptides because of the complexity and the number of operations involved in the production process. The stringent requirements for validation, despite the high cost and long time required, has to be appreciated as the adequate way to ensure the higher standards of quality and safety required by the final consumer, which is the recipient patient.

    Enzymatic synthesis of peptides

    Enzymes are the biological catalysts responsible for cell metabolism. As such, they must perform well at the mild conditions required for cell functioning. To become process biocatalysts they must be robust enough to withstand the harsh conditions of an industrial process, which usually implies the modification of the enzyme to produce a stable biocatalyst (Illanes, 1999). Proteolytic enzymes comprise a group of hydrolases that is the most relevant in technological terms, sharing about one half of the world market of enzymes, with annual sales of about US$ 3 billion (Chellapan et al. 2006). Microbial and plant proteases are the most relevant and have been widely utilized in medicine and in different industrial processes for decades (Wiseman, 1980; Hultin, 1983; Klein and Langer, 1986; Van Brunt, 1987). Microbial proteases are the most important in terms of market-share because of the advantages of their intensive production. However, plant and some animal proteases are still relevant for certain medical and industrial applications. Microbial and plant proteases have been traditionally used in degradation processes and are available at low cost; however, new sources of proteases are required to produce more potent and specific enzymes. New proteases from endogenous plant species have been characterized and proved their efficacy for performing hydrolytic and synthetic reactions (Arribére et al. 1998; Arribére et al. 1999; Priolo et al. 2000; Obregón et al. 2001). New sources of proteases are continuously being reported, especially from exotic organisms that thrive in extreme environments, being their proteases abnormally stable and/or active at such extreme conditions. As an example, the thermophilic and alkalophilic proteases from the salt tolerant marine fungus Engyodontum album has been recently reported and characterized (Chellapan et al. 2006).

    Proteases are active at mild conditions, with pH optima in the range of 6 to 8; they are robust and stable, do not require stoichiometric cofactors and are also highly stereo and regioselective (Bordusa, 2002). These properties are quite relevant to use them as catalysts in organic synthesis. This is possible because proteases can not only catalyze the cleavage of peptide bonds but also their formation (Capellas et al. 1997; Björup et al. 1999; So et al. 2000), as well as other reactions of relevance for organic synthesis, for instance: the regiospecific hydrolysis of esters and the kinetic resolution of racemic mixtures (Khmelnitsky et al. 1997; Carrea and Riva, 2000; Bordusa, 2002). Subtilisin, chymotrypsin, trypsin and papain have been widely used proteases in the chemical synthesis of peptides.

    Proteolytic enzymes: classification and catalytic mechanisms

    Proteases are among the best studied enzymes in terms of structure-function relationship (Barret et al. 1998; Priolo et al. 2001; Drauz and Waldmann, 2002). The proteases that are used for peptide synthesis are selected on the basis of their specificity against amino acid residues on either side of the splitting point (Kumar and Bhalla, 2005) and include the majority of the commercially available endo and exoproteases. The broad specificity of proteases restricts their application in peptide synthesis, since the peptide product that accumulates during the reaction can be attacked by the proteases simultaneously with the reaction of synthesis (Schellenberger et al. 1991).

    There are five families of proteases in which serine, threonine, cysteine, aspartic or metallic groups play a primary catalytic role. Serine, cysteine and threonine proteases are quite different from aspartic and metalloproteases. In the first three groups, the nucleophile in the catalytic center is part of an amino acid residue, while in the second two groups the nucleophile is an activated water molecule. In cysteine proteases the nucleophile is a sulfhydryl group and the catalytic mechanism is similar to the serine proteases in which the proton donor is a histidine residue (Figure 7 and Figure 8).

    Besides the hydrolysis and formation of peptide bonds between amino acids, proteases can catalyze other types of reactions as well, such as esterification and transesterification in the resolution of racemic alcohols and carboxylic acids and the stereoselective acylation of meso and prochiral diols (Broos et al. 1995; Khmelnitsky et al. 1997; Liu and Tam, 2001), the synthesis of glycoconjugates (Wong et al. 1993; Bordusa, 2002) and the kinetic resolution of racemic mixtures, even though in this case lipases and esterases are more useful for the resolution of non amino acidic derivatives (Kirchner et al. 1985; Bornscheuer and Kazlauskas, 1999; Pogorevc and Faber, 2000; Roberts, 2001). On the other hand, non-proteolytic enzymes can also be used in connection with peptides, as illustrated by the kinetic resolution of N-phenylacetyl-DL-tert-leucine in the production of the non-proteinogenic chiral amino acid L-tert-leucine recently reported with penicillin G acylase from Kluyvera citrophila (Liu et al. 2006).

    Proteases in peptide synthesis

    Biocatalysis in non-conventional (non-aqueous) media has expanded the spectrum of application of proteases to those reactions that cannot proceed effectively in aqueous environments, this is, the synthesis of peptide bonds instead of their hydrolysis (Illanes and Barberis, 1994; Quiroga et al. 2000a). These reaction media, that include organic solvents (Clark, 2004; Gupta and Roy, 2004), supercritical fluids (Kamat et al. 1992; Mesiano et al. 1999), eutectic mixtures (Gill and Vulfson, 1994), solid-state (Halling et al. 1995; Erbeldinger et al. 1998) and, more recently, ionic liquids (Park and Kazlauskas, 2003; van Rantwijk et al. 2003; Lou et al. 2004; Machado and Saraiva, 2005) offer other potential advantages as well: the possibility of using poorly water soluble substrates; the modification of the equilibrium of reaction as a consequence of the alteration of the partition coefficients of substrates and products in the case of biphasic systems; the reduction of inhibitory effects by substrates and products; the easiness of biocatalyst and product recovery; the increase in the thermostability of the biocatalyst and, in some cases, the variation in substrate specificity and the increase in the stereo and enantiospecificity in the resolution of racemic mixtures (Kawashiro et al. 1997; Klibanov 2001). As compared to chemical synthesis, a most important advantage of biocatalysis is the specificity of the reaction, which reduces the requirement of side-chain protection.

    Enzymatically synthesized small peptides (usually di or tripetides) are being used successfully for human and animal nutrition and also as pharmaceuticals and agrochemicals. Some relevant examples are the synthesis of the leading non-caloric sweetener aspartame, the lysine sweet peptide, kyotorphin, angiotensin, enkephalin and dynorphin (Kullman, 1979; Takai et al. 1981; Kullman, 1982; Oyama et al. 1987; Aso, 1989; Clapés et al. 1989; Nakanishi et al. 1990), and some nutritional dipeptides and tripeptides (Kimura et al. 1990b, Kimura et al. 1990c; Monter et al. 1991). Some small peptides have been synthesized at commercial scale in continuously operated enzyme reactors (Herrmann et al. 1991; Serralheiro et al. 1994). Kimura et al. (1990b) have proposed various kinds of reactors to synthesize peptides from essential amino acids using papain, α-chymotripsin, and thermolysin. Telios Pharmaceutical Co. has explored the enzymatic synthesis of the tripeptide Arg-Gly-Asp as a new drug for the healing of heavy burns and dermal ulcer (Chen et al. 1998). Several other examples of enzymatically synthesized peptides have been reported in the last decade (Monter et al 1991; Murakami and Hirata, 1997; Murphy and Fagain, 1997; Sergeeva et al. 1997; Zaks and Dodds, 1997, Liu et al. 2002). We have synthesized biologically active small peptides using proteases isolated from the latex of plants in biphasic reaction systems (Barberis et al. 2002).

    Synthesis under thermodynamic control. Peptide synthesis by proteases can proceed by two mechanisms: thermodynamic and kinetic control (Kumar and Bhalla, 2005), which will be described in this and the following section respectively.

    The thermodynamically controlled synthesis of peptides (TCS) with proteases represents the reverse of the hydrolytic breakage of peptide bond catalyzed by those enzymes, as shown in the scheme (Jakubke et al. 1985):

    were Kion is the equilibrium constant of ionization and Kcon is the equilibrium constant of conversion.

    Proteases, as any catalyst, do not alter the equilibrium of the reaction but merely increases the rate to attain it. According to the principle of microscopic reversibility, both the formation and the hydrolysis of the peptide bond proceed by the same mechanism and through the same intermediate. The formation of the acyl intermediate from a carboxylic acid is a very slow reaction and represents the rate limiting step in TCS (Sakina et al. 1988).

    Main issues of TCS are the use of an acyl donor with the free carboxylic group and the possibility of using any type of proteases, independent of their catalytic mechanism. Their main drawbacks are the low reaction rates and product yield (determined by the equilibrium constant of the reaction) attainable, the high amount of enzyme biocatalyst often required and the need of precise reaction conditions to displace the equilibrium towards synthesis (Bordusa, 2002). This in practice can produce severe compromises with enzyme activity and stability. From the above scheme, it is apparent that the equilibrium will be displaced to hydrolysis in an aqueous medium. The displacement of equilibrium towards peptide bond formation can be attained by manipulation of the equilibrium of ionization (i.e. pH change) and the equilibrium of the reaction (i.e. by product precipitation or by modification of medium composition) (Homandberg et al. 1978; Jakubke et al. 1985). The addition of organic cosolvents and the use of aqueous-organic biphasic systems are good strategies to displace the equilibrium towards synthesis. The presence of organic solvents will reduce the activity of water in the reaction medium, which will favour the equilibrium, and will also reduce the dielectric constant of the medium, which will reduce the acidity of the carboxylic group of the acyl donor and so it will increase the equilibrium constant Kion promoting the reaction of synthesis. The use of cosolvents is a rather simple strategy but high concentrations of cosolvents are usually detrimental for enzyme activity (Jakubke et al. 1985). In biphasic systems, the partition of the peptide products from the aqueous phase that contains the enzyme to the organic phase drives the equilibrium towards synthesis, with the additional benefit that the product is no longer subjected to hydrolysis. However, reaction rates in biphasic systems are low because of limitation by substrate diffusion; besides, proteases tend to denature in the water-solvent interphase (Halling, 1994).

    The use of neat hydrophobic solvents with very low water content can in principle be effective for peptide synthesis by reducing the backward hydrolytic reaction. However, proteases may exhibit very low activity and reduced stability, and substrates and products can be poorly soluble in this kind of media. Replacing the essential water by other hydrogen bond forming solvents can increase oligopeptide solubility.

    In summary, each process of TCS requires of a study to determine which of the process alternatives (water-cosolvent medium, biphasic medium or hydrophobic solvents with low amount of water) is the best. The choice will depend on the characteristics of the reaction, the robustness of the biocatalyst and also on the properties of substrates and products.

    Synthesis under kinetic control. The kinetically controlled synthesis of peptides (KCS) with proteases can be represented by the following scheme (Bordusa, 2002):

    where EH is the free enzyme; Ac-X is the acyl donor substrate; [E...Ac-X] is the Michaelis-Menten acyl-enzyme complex; HX is the released group; Ac-E is the acyl-enzyme intermediate, HN is the acceptor substrate (nucleophile), Ac-N is the product of synthesis (peptide) and Ac-OH is the product of hydrolysis of the acyl donor.

    As shown in the above scheme, the acyl donor, that needs  to be activated in the form of an ester, an amide or a nitrile, first binds to the enzyme to form a tetrahedric enzyme-substrate complex [E...Ac-X] that collapses to form the covalent acyl-enzyme intermediate [Ac-E]. This intermediate can be nucleophilically attacked by water and by the nucleophile (HN), which can be an amine, an alcohol or a thiol that will compete with water for the deacylation reaction. The success of the reaction of synthesis will depend on the kinetics of these nucleophilic reactions; this is why this strategy is termed “kinetically controlled”.

    As opposite to TCS, only serine or cysteine proteases can be used to perform KCS, because the enzyme acts in this case as a transferase and catalyzes the transference of an acyl group from the acyl donor to the amino acid nucleophile through the formation of an acyl-enzyme intermediate. Generally KCS proceeds faster and requires lower enzyme to substrate ratios than TCS because the acyl donor is now in the form of an activated carboxylic acid (Bordusa, 2002). Papain, thermolysin, trypsin and α-chymotrypsin are the most used enzymes in KCS (Mitin et al. 1984; Ooshima et al. 1985; Nakanishi and Matsuno, 1986; Reslow et al. 1987; Nakanishi and Matsuno, 1988; Mitin et al. 1997; Björup et al. 1998; Fité et al. 1998; Björup et al. 1999). It is desirable that the peptide product be removed from the reaction medium to avoid unwanted secondary hydrolysis. The yield of peptide product will depend on the apparent ratio of transferase to hydrolase rate constants (KT/KH)app and the rate at which the peptide product is hydrolyzed. Protease used in KCS have values of (KT/KH)app in the range of 102-104 (Kasche, 1996).

    Biphasic systems are not adequate to perform KCS since in this case, the neutral esters commonly used as acyl donors partition poorly into the aqueous phase and therefore the concentration in that phase, where the enzymatic reaction occurs, is low. However, the synthesis of N-α-CBZ-Ala-Phe.OMe was successfully accomplished in a biphasic system composed by Tris-HCl buffer (pH 8.5) and ethylacetate (50%), using a crude protease from the latex of Araujia hortorum Fourn (Quiroga et al. 2000b). In this case, the favourable product partition coefficient to the organic phase allowed its recovery from this phase and increased yield; besides, the biocatalyst exhibited high specificity and selectivity for the synthesis of the dipeptide without significant polymerization. As in TCS, the decrease of water activity, by using an organic cosolvent, favours synthesis in KCS by reducing the hydrolysis of the acyl-enzyme intermediate and the final product, but, again, the reaction medium can be harmful to the enzyme (Cassells and Halling, 1990; Barberis et al. 2002). Mutant enzymes have been used in the KCS of peptides in alkaline medium containing organic cosolvents where the parent enzymes were denatured, obtaining high conversion yields (West et al. 1990; Wong et al. 1990).

    Temperature, pH, substrates concentrations and enzyme to substrate ratio are parameters that will affect KCS (Chen et al. 1994). An increase in nucleophile concentration will increase the rate of nucleophilic attack to the acyl-enzyme intermediate (Jakubke, 1987); an increase in pH will also be beneficial since it will increase the pK of the nucleophile whose only reactive form is the neutral species (Barberis et al. 2002; Bordusa, 2002). As opposed to TCS, the enzyme itself will determine the efficiency of the peptide synthesis in KCS. While the reaction rate is mainly determined by the specificity of the enzyme for the acyl donor, a specific linkage of the nucleophile to the S’ subsite of the protease is essential for a high yield of synthesis (Schechter and Berger, 1967). Since the specificity for the acyl donor and the nucleophile linkage are individual parameters for each enzyme, the efficiency of synthesis and the potential for peptide synthesis will greatly differ from one protease to another.

    Strategies of enzymatic synthesis of peptides

    Despite their good catalytic properties, proteases are not ideal catalysts for the synthesis of peptides. Its specificity and selectivity might limit their potential, particularly in the case of rather large peptides where unwanted hydrolytic reactions will occur over the formed product and the substrates. Besides, the use of non-conventional reaction media and the conditions of temperature and pH required for synthesis can be detrimental both for protease activity and stability (Barberis et al. 2002; Bordusa, 2002; Quiroga et al. 2005; Quiroga et al. 2006). However, there are different strategies to overcome such problems, which comprise the engineering of the reaction medium, the biocatalyst and the substrate (Lombard et al. 2005).

    Medium engineering: homogeneous, macroheterogeneous and microheterogeneous systems. Medium engineering refers to the rational manipulation of the reaction medium to positively influence the properties of the enzyme with respect to the reaction of synthesis (Clapés et al. 1990a; Clapés et al. 1990b; Khmelnitsky et al. 1991; Ryu and Dordick, 1992; Wescott and Klibanov, 1994; Carrea et al. 1995, Chaudhary et al. 1996). This frequently implies the substitution of the usual aqueous medium for a non conventional medium in which water has been replaced partially or almost totally by another solvent (Hari Krishna, 2002).

    There are basically two types of biocatalytic systems in non conventional medium: homogeneous systems which are mixtures of water and a water miscible solvent (Castro and Knubovets, 2003) and heterogeneous systems (Krieger et al. 2004) in which a second phase is produced by the presence of a water-immiscible solvent. Heterogeneous systems can be divided into macroheterogeneous, in which two immiscible liquid phases are apparent, and microheterogeneous in which one of the phases (usually the aqueous phase that surrounds the enzyme) is not visible to the naked eye. Both homogeneous and heterogeneous systems can perform with the enzyme dissolved in the medium or insolubilized in it, be it because the enzyme protein is itself insoluble in that medium or because it is immobilized into a solid carrier (Illanes and Barberis, 1994). In this latter case, the system will be always heterogeneous in nature. The replacement of water for another solvent can be beneficial for the enzymatic synthesis of peptides for one or more of several reasons, which apply differently to each of the systems mentioned above, namely the increased solubility of hydrophobic amino acid or peptide substrates, the attenuation of the reverse or competing hydrolytic reactions, the reduction of product hydrolysis, the simplicity of product and/or biocatalyst recovery and the increased biocatalyst thermal stability.

    Homogeneous systems. Homogeneous systems are composed by a mixture of water and a water-miscible solvent (cosolvent) in which the enzyme is dissolved (Torres and Castro, 2004). Cosolvents are usually detrimental for enzyme activity at moderately high concentrations, because they tend to penetrate the aqueous microenvironment that surrounds the enzyme molecules, altering the pattern of interaction of the enzyme with the solvent and distorting its three-dimensional structure (Klibanov, 1986; Laane et al. 1987a; Laane et al. 1987b; Deetz and Rozzell, 1988; Zaks and Klibanov, 1988a; Zaks and Klibanov, 1988b; Mozhaev et al. 1989; Levitsky et al. 1999). However, polyols and glymes are notable exceptions among cosolvents (Castro, 2000; Illanes and Fajardo, 2001). Another way of circumventing the problem of enzyme deactivation is immobilization, so that it is usual that enzymes are used in immobilized form when performing in homogeneous liquid medium. There are several examples of proteases and other peptide bond forming enzymes which have been successfully employed in peptide synthesis in such media (Guisán et al. 1987; Rodger et al. 1987; Lobell and Schneider, 1998; Ye et al. 1998; Castro, 1999; Illanes et al. 2004). Several studies have reported the effect of cosolvents on the activity and stability of proteases and other enzymes, comparing them with the behaviour in aqueous media (Björup et al. 1996; Lozano et al. 1993; Tyagi and Gupta, 1998; Partridge et al. 1999; Rodakiewicz-Nowak et al. 2000; Ghorbel et al. 2003). In general, both the activity and stability of enzymes is impaired with the exceptions noted above.

    Heterogeneous systems. Macrohetrogeneous systems are composed by two immiscible liquids. They are termed biphasic systems and are usually composed by water and a hydrophobic solvent (Martinek and Semenov, 1981; Martinek et al. 1981; Kuhl and Jakubke, 1990). Substrates can be dissolved in the organic or in the aqueous phase but, independent of substrates partition, biocatalysis will always occur in the aqueous phase where the protease is. The peptide product formed can be partitioned to the organic phase, which is highly desirable to avoid unwanted hydrolysis and drive the equilibrium toward synthesis (Bordusa, 2002). In some cases the organic phase can be one of the substrates itself (Harbron et al. 1986). Biphasic systems have been extensively used for enzymatic peptide synthesis and represents a good strategy because it is highly flexible and can accommodate to the properties of substrates and products (Kimura et al. 1990a; Kuhl and Jakubke, 1990; Bemquerer et al. 1991; Feliú et al. 1995; Wubbolts et al. 1996; Capellas et al. 1997; Sarmento et al. 1998; Murakami et al. 2000; Barberis et al. 2002; Trusek-Holownia, 2003; Barberis et al. 2006). The main drawback of biphasic systems is the presence of an interphase that can impose diffusional restrictions to the substrates and so reduce the reaction rate. This can be alleviated by intense agitation, but this will in turn promote enzyme inactivation at the interphase (Feliú et al. 1995; Barros et al. 1998). We have not observed this phenomenon when conducting peptide synthesis with phytoproteases (Barberis et al. 2002; Quiroga et al. 2005).

    Suspension of nearly anhydrous enzymes in hydrophobic solvents can be considered as a microheterogeneous system in the sense that the liquid phase appears as homogeneous to the naked eye; however, the system is microscopically heterogeneous since the solid enzyme is covered by a water shell tightly bound to the enzyme and a bulk hydrophobic organic solvent surrounding it. The enzyme is usually protected from the aggressive hydrophobic solvent by a layer of denatured enzyme (Kanerva and Klibanov, 1989; Klibanov, 1989; Castro, 1999). This is maybe the most simple and most promising strategy for enzymatic synthesis since it exploits to the highest extent the advantages of working in non-conventional medium (Clark, 2004; Gupta and Roy, 2004), namely high thermal stability (Zaks and Klibanov, 1984; Rodger et al. 1987; Klibanov, 2001), potentially favourable changes in substrate specificity (Zaks and Klibanov, 1985; Wescott and Klibanov, 1994; Ebert et al. 1996; Kawashiro et al. 1997; Carrea and Riva, 2000) and easiness of biocatalyst and product recovery (Dickinson and Fletcher, 1989). There are numerous examples of proteases and other enzymes (ribonuclease, pancreatic lipase, terpene cyclase and cytochrome oxidase) that have been proven much more stable in hydrophobic organic solvents than in water (Ayala et al. 1986; Wheeler and Croteau, 1986; Volkin et al. 1991). The enzyme biocatalyst is simply an acetone precipitated enzyme powder or a lyophilized preparation that is suspended in the organic medium in which is completely insoluble (Russell and Klibanov, 1988; Zaks and Klibanov, 1988a). Therefore, immobilization is not necessarily required since the enzyme is already insolubilized in the reaction medium. However, in some cases immobilization to a solid surface can be convenient by offering an increased surface of contact with the substrate and providing additional stabilization. Best results are obtained with highly hydrophobic solvents (log P > 4; where P is the partition coefficient between n-octanol and water) since the intrusion of solvent in the water shell is hindered and so the enzyme is better protected from the direct contact with the organic solvent molecules (Zaks and Klibanov, 1985). The activity of an enzyme in such media is highly dependent from the pH of the aqueous solution from which it was obtained (by precipitation or lyophilization). This has been called “pH memory” and it stems from the fact that the enzyme retains in the hydrophobic medium the protonation stage of the aqueous solution at which it was exposed previously. This has been elucidated by FTIR spectroscopy (Constantino et al. 1997). Despite its advantages, this strategy has important drawbacks, being the most important the dramatic decrease in activity that enzymes usually express in this kind of media (Klibanov, 1997; Quiroga et al. 2006). However, some improvements in activity have been obtained by the addition of water mimics, such as formamide or ethylene glycol, to the reaction medium or the addition of crown ethers during the preparation of the lyophilized enzyme (van Unen et al. 2001). A 425 fold increase in enzyme activity was observed by the addition of 18-crown-6 to chymotrypsin in the synthesis of a dipeptide in acetonitrile medium (van Unen et al. 1998). In addition, most hydrophobic solvents do not solubilize some of the substrates used in the reactions of synthesis, which also holds for peptide synthesis.

    Another system that can be considered as microheterogeneous is reverse micelles, which are spontaneously formed when small amounts of water are added to a hydrophobic solvent in the presence of a surfactant and under agitation (Martinek et al. 1986; Gómez-Puyou and Gómez-Puyou, 1998). It has been claimed that the microenvironment in the internal cavity of the micelles is more natural to the enzyme than the bulk aqueous medium, which leads in some cases to an enhanced expression of its catalytic potential (Castro and Cabral, 1989). Reverse micelles have, however, several drawbacks: they are mechanically week, there are no rational methods for its optimization and the surfactant impairs the recovery and purification of products (Bordusa, 2002). t-Boc-Arg-Gly-OEt and Ac-Gly-Asp-diOMe have been successfully synthesized under kinetic control in organic medium using proteases immobilized in reverse micelles (Chen et al. 1998; Chen et al. 1999).

    An alternative to the classic biphasic and micellar systems has been proposed by Clapés et al. (2001) as reaction medium for peptide synthesis with chymotrypsin using a water-in-oil (W/O) emulsion with a high water content (95%) which is the opposite as in reverse micelles. Such systems have been occasionally used in organic synthesis (Manabe et al. 2000), but its application in biocatalysis is still in its early development.

    Biocatalyst engineering. Biocatalyst engineering refers to all strategies aimed to obtain biocatalysts well suited to perform under the conditions of synthesis and include approaches that range from chemical modification to genetic and protein engineering (Roig and Kennedy, 1992; Bordusa, 2002; Adamczak and Hari-Krishna, 2004; Hudson et al. 2005).

    The solubilization of enzymes in nearly anhydrous organic solvents can be attained by their chemical modification. Such modifications may range from simple non-specific acetylation of free amino groups of the enzyme (Murphy and Fagain, 1997) to specific modifications with hydrophilic polymers (Inada et al. 1995; De Santis and Jones, 1999; Salleh et al. 2002) that bind to amphipatic groups at the enzyme surface.

    The insolubilization of the biocatalyst by immobilization to a solid carrier or by protein aggregation represents the most relevant biocatalyst engineering strategy for producing robust enzyme catalysts well suited to withstand the harsh conditions prevailing during the reactions of synthesis (Adlercreutz, 1991; Wong and Whitesides, 1994; Yan et al. 2002; García-Junceda et al. 2004). Immobilization has been extensively used to produce protease biocatalysts for peptide synthesis (Guisán et al. 1991; Guisán et al. 1997; Filippova and Lysogorskaia, 2003; Lei et al. 2004). Insolubilization by protein cross-linking is a promising technology to produce biocatalysts for synthesis. Cross linked enzyme crystals (CLEC) are robust biocatalysts produced by crystallization of the enzyme protein followed by cross-linking with the bifunctional reagent glutaraldehyde (St. Clair and Navia, 1992; Lalonde, 1997; Abraham et al. 2004). Its specific activity is very high since there is no inert matrix and the whole mass of biocatalyst is essentially pure enzyme protein; however, it has the serious disadvantage of requiring the enzyme in a pure state to be able to crystallize it, which in practice means that the cost of the biocatalyst is very high. CLECs of subtilisin have been recently used successfully in organic synthesis both in repeated batch and continuous operation (Amorim Fernandes et al. 2005). A most promising type of new generation biocatalysts is cross-linked enzyme aggregates (CLEAs) in which the protein solution containing the enzyme is subjected to non-denaturing precipitation by conventional procedures (i.e. salting-out, solvent or polymer precipitation) and then cross-linked with bifunctional reagents (López-Serrano et al. 2002; Shah et al. 2006). CLEAs are similar to CLECs in their properties, with the additional advantage of its simplicity and low cost since the enzyme does not require to be purified to any extent (Cao et al. 2000; Cao et al. 2003). CLEAs prepared by co-aggregation with polyionic polymers (Wilson et al. 2004a) and CLEAs encapsulated into hydrophilic polyvynil alcohol hydrogel (Wilson et al. 2004b) have provided an adequate microenvironment for enzymatic peptide-bond formation in organic synthesis in non-aqueous environments. Protease CLEAs are promising for peptide synthesis even though autoproteolysis can play a role in this case because of the close proximity and flexibility of the enzyme molecules.

    Other strategies for improving the catalytic performance of enzymes consider the manipulation of the genes encoding the enzyme protein (Arnold, 1990; Jaeger and Reetz, 2000; Reetz and Jaeger, 2000; Arnold, 2001). One relevant strategy is site-directed mutagenesis (sometimes termed as rationale design) which is a powerful tool to analyze structure-activity relationships and also to screen for mutant enzymes with improved properties of activity, stability or specificity. Success of rational design depends on the progress made in structure determination, improved computer modeling protocols and significant new insights into structure-activity relationships (Adamczak and Hari Krishna, 2004). Proteases have been preferred models for applying this technique. In fact, subtilisin has been extensively studied, and considerable progress has been made in engineering this protease and its substrates for peptide bond formation in aqueous solution (Abrahmsen et al. 1991). In P. aeruginosa, a disulfide bond between Cys-30 and Cys-58 played an important role in the organic solvent stability of the PST-01 protease (Ogino et al. 2001). The effect of a novel disulfide bond engineered in subtilisin E from Bacillus sp. based on the structure of a thermophilic subtilisin-type serine protease (aqualysin I) was examined and suggested that an electrostatic interaction between Lys 170 and Gly 195 is important for catalysis and stability in that protease (Takagi et al. 2000). Substitution of native amino acids by fluoroalkyl analogs in the commercially available proteases trypsin and α-chymotrypsin represents a new approach for the design of biologically active peptides with increased stability as well as defined secondary structure, which provides a powerful tool for spectroscopic investigations (Thust and Koksch, 2003). Site-directed mutagenesis has also been employed to improve the properties of trypsin for performing peptide synthesis (Lombard et al. 2005). Other strategy for improving enzyme performance based on genetic manipulation is directed evolution (Arnold, 2001), which is particularly suitable when not enough information on enzyme structure is available. It consists basically in sequential rounds of random mutagenesis using error-prone PCR. At each round, the mutant having the best feature is screened and subjected to the next round. At the end, as a result of this evolutionary progress, the biocatalyst will be improved with respect to the property used for its screening (Gupta and Roy, 2004). Random mutagenesis has been successfully applied to improve the thermal stability of subtilisin (Adamczak and Hari Krishna, 2004).

    Substrate engineering. Even though most proteases are able to recognize more than one amino acid, not all are able to couple any aminoacidic sequence. Due to the specificity of proteases to a particular amino acid, only those acyl donors that have a specific amino acid in the C-terminal position can be coupled without side reactions. For instance, trypsin requires arginine or lysine residues as carboxylic terminal components in the structure of the acyl donor (Bordusa, 2002). In this way, the manipulation of the leaving group is generally useful to increase the specificity of the protease to a previously less specific amino acid, so increasing reaction rate (Miyazawa et al. 2001a; Miyazawa et al. 2001b). The manipulation of the leaving group affects the aminolysis/hydrolysis ratio of an acyl donor and therefore the conversion yield, since the acyl-enzyme intermediate formed is the same regardless of the change produced in the leaving group. Another approach is the use of mimetic substrates. Contrary to the classic manipulation of the leaving group, focused on the increase in enzyme specificity, mimetic substrates are designed to bind to the active site of the enzyme. In this way, serine and cysteine proteases can react with non specific amino acids or peptide sequences without altering the enzyme or the reaction medium (Schellenberger et al. 1991; Thormann et al. 1999). The main advantage of this strategy is that mimetic substrates allow the formation of the acyl-enzyme and the nucleophile reaction to establish a peptide bond that cannot be further hydrolyzed because it does not correspond to the protease specificity (Bordusa et al. 1997; Lombard et al. 2005). However, this approach is limited to reactions with nonspecific amino acid containing peptides, whereas the coupling of specific ones leads to unwanted cleavages due to the native proteolytic activity of the biocatalyst (Grunberg et al. 2000).

    Advances and challenges in enzymatic synthesis of peptides

    The application of proteases to peptide synthesis, although offering several advantages, has been discouraged to some extent by some unfavourable factors. However, several technologies are being actively explored to overcome these difficulties. Low activity of enzymes in non-aqueous solvents, as compared to that in water, has been largely overcome by crown ether treatment of enzymes, crown ether-enhanced enzyme activity being retained even after removal of the crown by washing with a dry organic solvent (Van Unen et al. 2002). Subtilisin Carlsberg in the form of an enzyme-surfactant complex, displayed high catalytic activity in anhydrous hydrophilic organic solvents and did not require excess amount of water, so that the side hydrolytic reaction was effectively suppressed and the yield of the dipeptide formed became considerably high (Okazaki et al. 2000). However, peptide synthesis using organic solvents as reaction medium remains limited to the synthesis of model di-and tripeptides and little effort has been done on the synthesis of bioactive oligopeptides as compared to conventional aqueous systems (Jakubke et al. 1985; Morihara, 1987; Isono and Nakajima, 2002).

    A general approach to the enzymatic peptide synthesis remains to be formulated despite the large number of studies carried out on the synthesis of model and bioactive peptides. Use of protected substrates in oligopeptide synthesis has led to a loss of continuity in the reaction sequence and a reduction in yield and productivity, limiting the usefulness of enzymes for such purpose. An interesting approach was designed for the enzymatic synthesis of the cholescystokinin C-terminal octapeptide (CCK-8), which is a biologically active peptide of therapeutic value as an agent for the control of gastrointestinal function (Capellas et al. 1996a). The synthesis was performed in organic media and designed in a convergent mode in which a tripeptide and a pentapeptide were synthesized separately using papain, α-chymotrypsin and thermolysin, all commercially available inexpensive proteases (Capellas et al. 1996b; Capellas et al. 1997; Romero et al. 1997; Xiang et al. 2004), and finally the two fragments were linked. The process was scaled up to pilot plant and the economics of the process was assessed (Romero et al. 1997; Ruiz et al. 1997; Fité et al. 2002). The global yield of the process was rather low and the requirement of protection at different stages of the process was a major obstacle to go into full scale operation. However, the strategy may prove useful for other high-value biologically active peptides.

    Despite the technological advances in peptide synthesis by biocatalysis, low productivity, low yield and high cost of enzymes are problems to be solved to make it competitive in a broad spectrum of cases. More specific, active and stable enzymes are intensively being pursued and the trend should be to improve existing proteases by the use of the modern technologies of genetic and protein engineering and automated screening of proteases from novel microbial strains, mainly extremophiles.

    Chemical versus Enzymatic Synthesis: Pros and Cons

    Since the pioneering works of du Vigneaud and Merrifield in solid phase synthesis (du Vigneaud et al. 1953; Merrifield, 1963; Merrifield, 1996), the chemical synthesis of peptides has been developed and automated to a high degree in the last decades. Protocols have been well established, which are amenable for scale-up to match the production levels required by the market. A key advantage of the chemical synthesis in solid phase is that the peptide product can be easily separated from impurities and side products. Major drawbacks refer to the racemization during peptide bond formation, the requirement of protection of the side chains of the amino acids that increases the cost of the substrates and reduces the yield of product recovery during deprotection, the difficulty of recycling the coupling reagent and the acyl donor used in excess to achieve rapid and complete acylation of the nucleophile, the time consumed in protection and deprotection reactions that reduce the productivity of the process and the toxic nature of solvents and coupling reagents that may lead to health and environmental concerns (Gill et al. 1996; Nilsson et al. 2005). Despite these restrictions, chemical synthesis of peptides can be considered the most mature technology available, being especially suited for medium size peptides up to one hundred amino acid residues, which comprises most of the peptides of therapeutic relevance.

    The development of new methods suitable for the large-scale production of biologically active peptides with proteases has been actively pursued over the last decade. However, enzymes are in general labile catalysts, so that process engineering of enzymatic reactions should be designed carefully. This implies the optimization of most relevant operational parameters: pH, temperature, organic solvent concentration, and the assessment of the activity and stability of the biocatalyst under operation conditions, the solubility of reactants, the stability of reactants and products and the selection of conversion yield or productivity as an adequate objective function, or an economic objective functions comprising both (Blanco et al. 1991; Illanes and Wilson, 2003). In this sense, enzymatic synthesis of peptides is a less mature technology than chemical synthesis and no gene