Next Article in Journal
Peppermint Oil Decreases the Production of Virulence-Associated Exoproteins by Staphylococcus aureus
Next Article in Special Issue
Design and Screening of M13 Phage Display cDNA Libraries
Previous Article in Journal
Synthesis and Conformation of Substituted Chiral Binaphthyl-Azobenzene Cyclic Dyads with Chiroptical Switching Capabilities
Previous Article in Special Issue
Challenges in Optimizing a Prostate Carcinoma Binding Peptide, Identified through the Phage Display Technology
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

DNA Libraries for the Construction of Phage Libraries: Statistical and Structural Requirements and Synthetic Methods

by
Thomas Lindner
1,
Harald Kolmar
2,
Uwe Haberkorn
1 and
Walter Mier
1,*
1
Department of Nuclear Medicine, University Hospital Heidelberg, Im Neuenheimer Feld 400, D-69120 Heidelberg, Germany
2
Institute for Organic Chemistry and Biochemistry, Darmstadt University of Technology, Petersenstrase 22, 64287 Darmstadt, Germany
*
Author to whom correspondence should be addressed.
Molecules 2011, 16(2), 1625-1641; https://doi.org/10.3390/molecules16021625
Submission received: 22 December 2010 / Revised: 26 January 2011 / Accepted: 11 February 2011 / Published: 15 February 2011
(This article belongs to the Special Issue Phage Display of Combinatorial Libraries)

Abstract

:
Peptide-based molecular probes identified by bacteriophage (phage) display technology expand the peptide repertoire for in vivo diagnosis and therapy of cancer. Numerous peptides that bind cancer-associated antigens have been discovered by panning phage libraries. However, until now only few of the peptides selected by phage display have entered clinical applications. The success of phage derived peptides essentially depends on the quality of the library screened. This review summarizes the methods to achieve highly homogenous libraries that cover a maximal sequence space. Biochemical and chemical strategies for the synthesis of DNA libraries and the techniques for their integration into the viral genome are discussed in detail. A focus is set on the methods that enable the exclusion of disturbing sequences. In addition, the parameters that define the variability, the minimal numbers of copies per library and the use of alternating panning cycles to avoid the loss of selected hits are evaluated.

1. Peptides Presented on Phages

Peptides have been proven as valuable tools in tumor diagnostic and radiotherapy. By specifically binding to receptors or other structures expressed on the surface of tumor cells, peptides are able to shuttle therapeutic drugs or radionuclides into the cells [1]. Being accessible by solid phase synthesis [2,3,4], peptides are efficient tools for diagnosis [5], therapy [6] and prevention [7] of many other diseases besides cancer.
Peptide ligands for the somatostatin or integrin receptor families are the prime examples that have been extensively optimized to enable a high tissue selectivity [8,9,10,11]. In order to extend the area of utilization of peptides for targeting purposes, ligands that bind yet unexploited receptors constitute promising candidates for the development of drugs and diagnostics.
Introduced in 1985 by George P. Smith, the phage display technology allows the screening of a vast amount of different peptides [12]. The phage display technique has been utilized for a multitude of applications [13,14,15,16], in particular as a tool for anti-cancer research. The availability of commercial libraries has further accelerated the discovery of de novo peptide sequences [17,18]. In this process, phages that carry a peptide that is able to interact with an immobilized target molecule are enriched via target binding and removal of non-binders by washing. Those phages that remain target-bound in the panning cycle are used to re-infect E. coli cells. The resulting enriched population can be used for further rounds of panning until a population of phages emerges that present target binding peptides on their surface. Determination of the DNA sequence of individual phage clones allows one to deduce the amino acid sequence of its affiliated peptide. Phage display uses the natural L-amino acids and offers a fast and convenient method for high throughput screening. Alternative peptide screening protocols that rely on genotype-phenotype coupling in vitro (ribosome display [19,20,21]) or in vivo (microbial surface display [22,23,24] have been extensively reviewed elsewhere.
The filamentous phage M13 is the most commonly used host for peptide engineering by phage display, common alternatives are the closely related phage fd [25,26] or the lytic phage λ [27]. The preference of M13 is the result of the commercial availability of its engineered vectors and ready-to-use libraries, for example those offered by New England Biolabs, Inc. Furthermore its straightforward manipulation and the comprehensive understanding of the viral life cycle and the phage structure contribute to its popularity. Following the infection of the host, the single-stranded M13 genome is converted to its double-stranded replicative form to produce viral proteins and single-stranded DNA progeny. The viral coat proteins are anchored in the host cell membrane and form the viral particle while the single-stranded DNA is extruded through the membrane. The virion is a long and flexible rod, about 1 µm in length and 10 nm in diameter. The viral coat consists of approximately 2,700 copies of the helical major coat protein pVIII and the class of minor coat proteins, each approximately 5 copies. The minor coat proteins pIII and pVI occupy one end, while the other minor coat proteins pVII and pIX cover the opposite end. Despite producing ca. 1,000 particles per hour, the infected host survives and proliferates due to the non lytic life cycle of M13 [28].
Even though the coat protein pIII is essential for the reproduction by interacting with the pili of E. coli, it is the most popular target for modifications. This is due to the fact that it can carry up to 50 additional amino acids without reduced infectivity. Randomization of a stretch of nucleotides that are fused to the pIII gene allows for the construction of a phage population, with a different peptide on the surface of each individual clone [29,30]. Moreover, phagemid vectors have been established that contain the coding sequence for a full-length or shortened version of pIII with a fused peptide/protein sequence together with a phage replication origin. Upon phage infection of E. coli, cells carrying that vector phage particles can be produced that display both the infectious full length pIII and the plasmid-encoded modified pIII protein.
The phage T7 offers an additional established platform for phage display. The common attachment point is its minor capsid protein 10B. Libraries or kits for library construction are marketed by Calbiochem. It has a lysogenic and a lytic life cycle. The lytic life cycle can be induced and ends in the release of mature phages. While its reproduction requires a higher number of steps than the reproduction of M13, in-vitro encapsulation of foreign DNA enables a more economic way to introduce synthetic DNA libraries. The various advantages of T7 over M13 display techniques [31] result from the facts that the capsid is not involved in the docking steps of infection and that the assembly of the virion proceeds without migration through the cell membrane of the host.

2. DNA Libraries

Since 20 different natural amino acids exist, the number of different peptide sequences that can be obtained by randomizing N residues is 20N. Table 1 shows the number of possible variants of fully randomized peptide sequences that can be obtained by simultaneously randomizing 7 to 16 residues. The success of a phage display experiment strictly relies on the quality of the initial DNA-library, which is mainly defined by its diversity. There are two factors that limit the maximal number of different phage clones that can be obtained, namely the amount of phage-encoding DNA molecules that can be generated in vitro and the efficiency of their introduction into E. coli cells via transformation. The maximal number of different phage encoded peptides of a hexapeptide library can be estimated as follows: an equimolar mixture of the four nucleotides has an average weight of 325 g/mol, the mean weight of a triplet is approximately 920 g/mol. A randomized DNA of a hexapeptide library, for example the one constructed by Cwirla et al. [32], has a molecular weight of about 5,500 g/mol (15,000 g/mol including primer regions). 1 µg of this insert contains more than 1013 molecules (which represents a very small fraction of the 7.2 kb long single strand DNA of the phage with approximately 2.2 × 106 g/mol). In its digested and modified double stranded form, 1 µg of pure vector contains about 1011 copies. Considering that only 1% of this DNA is transferred into E. coli by electroporation, about 10 µg of an engineered vector are required to obtain a library with 109 individual clones, each present in 1,000 copies.
Table 1. Correlation between the number of random amino acids and the maximal diversity of a peptide library.
Table 1. Correlation between the number of random amino acids and the maximal diversity of a peptide library.
random positionsindividual sequencesrandom positionsindividual sequences
71 × 109124 × 1015
82 × 1010131 × 1017
95 × 1011142 × 1018
101 × 1013153 × 1019
112 × 1014166 × 1020
Table 1 shows that by applying this type of library construction, all possible sequences of a random 7-mer peptide can be covered. However, only 1% of the sequence space of a random 9-mer peptide is approximately covered, while a 12-mer library expresses less than 0.001% of the possible individuals.
A primary task of library synthesis is to obtain an optimal distribution of these possible individuals in the sequence space. In the following sections we provide the essential knowledge to understand how the synthetic procedures used for the construction of phage libraries give access to high quality and reliable peptide pools.

2.1. Chemical Synthesis of Library Inserts

Based on the structural elucidation by Watson and Crick in 1953 [33,34] and the introduction of the phosphoramidite chemistry in 1981 [35], solid phase synthesis [36] is the method of choice for the synthesis of standard and random oligomeric DNA primers. When used for peptide screening, peptide-encoding DNA libraries based on partially randomized oligonucleotides have to meet many requirements as defined chain length, correct sequences vicinal to the random section, appropriate placement of primer and restriction sites and above all, the purity of the DNA pool.
Figure 1 shows the cycle of the automated solid phase DNA synthesis using phosphoramidite building blocks. The first nucleotide is attached via its 3’ hydroxyl function to a flexible linker on the solid support, controlled pore glass. Long spacers and low degrees of loading of the solid support reduce the amount of side products formed and improve the yields of oligonucleotide synthesis. After trichloroacetic acid deprotection of the first dimethoxytrityl moiety, the free 5’-hydroxy function is reacted with an activated nucleotide. Phosphoramidites are in the oxidation state +3, in order to obtain phosphates; the phosphorus in the coupling product has to be oxidized by iodine to obtain oxidation state +5 prior to deprotection. These three steps, coupling, oxidation and deprotection are repeated for every nucleotide to obtain the desired sequence.
Figure 1. Solid phase oligonucleotide synthesis cycle. DMT = 4,4’-dimethoxytrityl;B,B’ = protected nucleobase. Key: (i) activation (e.g., with tetrazole); (ii) iodine-oxidation; (iii) trichloroacetic acid deprotection; (iv) NH3-cleavage.
Figure 1. Solid phase oligonucleotide synthesis cycle. DMT = 4,4’-dimethoxytrityl;B,B’ = protected nucleobase. Key: (i) activation (e.g., with tetrazole); (ii) iodine-oxidation; (iii) trichloroacetic acid deprotection; (iv) NH3-cleavage.
Molecules 16 01625 g001
The final oligonucleotide is cleaved from the solid support by concentrated ammonia to remove the remaining β-cyanoethyl protecting groups from the phosphate backbone and to liberate the nucleobases from their protection groups. The DMT protection of the last base is split off after purification to improve the separation from capped sequences. The DMT protection group acts as a lipophilic anchor in the reversed phase HPLC purification process [37]. For biochemical applications the 5’-hydroxy function can be phosphorylated using a kinase. With modern fully automated oligonucleotide synthesizers, coupling yields up to 99.5% per synthesis step can be achieved making the synthesis of oligonucleotides longer than 100 bases feasible.

2.2. Source of Variability

The most straightforward method to obtain a randomized oligonucleotide sequence is the use of a equimolar mixture of the four activated nucleotides in the coupling step. However, this strategy suffers from the fact that, as shown in Table 2, together with the 61 sense codons, three stop codons are incorporated at the randomized position into the oligonucleotide primers. The stop codons prevent the biosynthesis of the coat protein by the host and uninfective virions are produced.
Table 2. Trinucleotide codons and translation into amino acids.
Table 2. Trinucleotide codons and translation into amino acids.
TTTPheTCTSerTATTyrTGTCys
TTC TCC TAC TGC
TTALeuTCA TAASTOPTGASTOP
TTG TCG TAGSTOPTGGTrp
CTT CCTProCATHisCGTArg
CTG CCC CAC CGC
CTA CCA CAAGlnCGA
CTG CCG CAG CGG
ATTIleACTThrAATAsnAGTSer
ATG ACG AAC AGC
ATA ACA AAALysAGAArg
ATGMetACG AAG AGG
GTTValGCTAlaGATAspGGTGly
GTC GCC GAC GGC
GTA GCA GAAGluGGA
GTG GCG GAG GGG
The formation of homogenously distributing sequences is enhanced by incorporating only a mixture of guanine and thymine or guanine and cytosine in position three of the codons. This strategy leads to the elimination of two of three stop-codons, while the remaining 32 triplets code for all 20 amino acids. The remaining stop codon TAG can be suppressed by a supE E. coli strain used for phage propagation that contains genes for the corresponding tRNA, which eventually results in the incorporation of a glutamine residue at the position of the stop codon during translation, albeit with varying efficiency [38].

Improvement of statistic distribution by exclusion of rare codons

Among the 64 possible trinucleotides are several combinations, which code for one amino acid, but the relative abundance of the amino acids is not proportional to the number of its codons. Moreover, certain codons that possess suboptimal tRNA anticodon binding are avoided by E. coli, leading to marginal concentrations of the corresponding tRNA. Arginine, for example is represented by one major and one minor codon, while four codons are virtually unused in the highly expressed proteins of E. coli [39,40]. Certain rare codons are used to regulate expression, induce structural information by deceleration of protein assembly or are necessary as reading frame shifts and some others have still yet unidentified features [41,42,43,44].
In certain cases it is advantageous to limit the number of possible amino acids in the randomized positions. Various possibilities to construct limited random codon sets exist, e.g. usage of thymidine in position two and randomization of the first and third nucleotide results in the exclusive expression of isoleucine, leucine, methionine, phenylalanine and valine. Mena et al. developed a computational tool to design degenerate codons, providing assistance in library design [45]. Table 3 shows a selection of codon sets to narrow the degree of randomization to four amino acids, which were utilized by Fellouse et al. to construct Fab fragments with impressive antigen binding characteristics [46].
Table 3. Selected codon sets from Feelouse et al. [46], which limit randomization to four possible amino acids.
Table 3. Selected codon sets from Feelouse et al. [46], which limit randomization to four possible amino acids.
G/TA/CTTyr, Ala, Asp, SerA/GG/CAThr, Arg, Gly, Ala
A/TA/CTTyr, Thr, Asn, SerG/CA/GCHis, Arg, Asp, Gly
C/TA/CTTyr, Pro, His, SerA/GG/CTGly, Ala, Thr, Ser
The best method for the synthesis of defined random primers, which homogenously incorporate all amino acids, is the trinucleotide approach. In this strategy, the initial DNA-library is assembled by utilizing the trinucleotide building-blocks shown in Table 4. This eliminates the integration of stop as well as rare codons and their possible accumulation, which may cause translational problems, like frame shifting [47].
Table 4. List of codons recommended for the trinucleotide approach by Kayushin et al. [48].
Table 4. List of codons recommended for the trinucleotide approach by Kayushin et al. [48].
AlaGCTArgCGTAsnAACAspGACCysTGC
GlnCAGGluGAAGlyGGTHisCATIleATC
LeuCTGLysAAAMetATGPheTTCProCCG
SerTCTThrACTTrpTGGTyrTACValGTT
Even though expensive phosphoramidite trimer building blocks are required, this technique is the method of choice for long sequences, or protein evolution applications, based on chemical synthesis. Due to the decreased number of coupling steps the amount of side products is reduced, which facilitates purification and increases the overall yield. By this way the amino acid distribution is also improved, as e.g. shown by Krumpe et al. who generated a T7-phage 12-mer peptide library by the trinucleotide method and found it to possess a higher diversity than its conventionally assembled counterpart [49]. The elimination of redundant or incompatible codons and a precisely tuneable distribution of amino acids constitute further advantages.
The trinucleotide building blocks are efficiently synthesized in solution making use of the enhanced selectivity of the MSNT-activated o-chlorophenyl phosphordiesters, as shown in Scheme 1. Both 3’ and 5’-hydroxy functions do not require protecting groups, but the products require laborious workup following each reaction step. Introduced by Virnekäs et al. in 1994 [50], several techniques have been employed to enhance the outcome of the phosphoramidite synthesis [48,51]. One major problem with the use of trinucleotide building blocks are the large differences in the reactivity of the trinucleotide phosphoramidites. Hence, to obtain equal distribution of all codons at each position of a random oligonucleotide, non-equimolar mixtures of trinucleotides have to be used where more reactive trinucletoides are present at reduced concentration and vice versa [48,51].
Scheme 1. Preparation of trinucleotide building blocks. (i) MSNT-coupling; (ii) DMT-cleavage; (iii) phosphoramidite formation using a phosphordiamidite reagent.
Scheme 1. Preparation of trinucleotide building blocks. (i) MSNT-coupling; (ii) DMT-cleavage; (iii) phosphoramidite formation using a phosphordiamidite reagent.
Molecules 16 01625 g008
Other DNA synthesis strategies were developed that encode random amino acids using a subset of orthogonally protected tri- and dinucleotide phosphoramidites [52]. An alternative is to split the resin prior to the coupling followed by treatment with four different mixtures of dinucleotides and subsequent coupling of a single nucleotide. This method allows the incorporation of all highly expressed codons as well [53].

2.3. Biological Synthesis of Random Library Inserts

To introduce random mutations in large proteins as e.g., antibodies or enzymes, error prone PCR or gene shuffling techniques often are the method of choice [54]. To introduce mutations in peptides, local randomization can be achieved using oligonucleotide mixtures as primers for in vitro DNA synthesis. The best known procedure was introduced 1978 by Michael Smith, who was honoured with the Nobel Prize in 1993 [55]. It is based on the fact that short oligonucleotide primers can bind with high sequence specificity to a DNA template and be extended by DNA-polymerases, i.e., the Klenow-fragment. The polymerase can tolerate mismatches, as long as a stretch of at least 6–10 nucleotides at the 3’ end of the oligonucleotide is fully complementary to the template strand. Under optimized conditions, starting from a random oligonucleotide that is hybridized to M13 phage single stranded DNA, a full length DNA strand can be generated that contains the desired mutation. Several techniques were established to remove the unmodified template strand, the most popular being the use of a template strand that contains deoxyuridine in place of thymidine which results in its degradation in E. coli upon transformation [56].

2.4. Cloning Technique for the Integration of Oligonucleotide Sequences into the Phage Genome

During recent years, alternative protocols were developed that rely on generation of double stranded DNA stretches by annealing two complementary oligonucleotides followed by fill-in reaction (Figure 2). These are designed such that they contain appropriate restriction sites at their ends which also occur at the terminus of the phage pIII gene. To achieve an optimal insertion, it is recommended to use two different restriction enzymes that produce sticky ends. Non-productive inserts, i.e., primer dimers, partially digested DNAs, or incomplete linear ligation products are produced as contaminants in the PCR amplification and the ligation processes. For purification, the double stranded products are subjected to preparative gel electrophoresis (using either acryl amide or agarose gels). This separation technique discriminates side products according to their size. As a result of the randomisation the product bands are broadened. Consequently, the removal is limited to side products that significantly differ in size and structure from the products desired. Temperature or denaturing gradient as well as pulsed field gel electrophoresis offer alternatives, but are time and work demanding.
Figure 2. Schematic presentation of the introduction of foreign DNA; (i) annealing and PCR; (ii) separate double digestion, followed by DNA-ligation with an appropriate vector in digested form.
Figure 2. Schematic presentation of the introduction of foreign DNA; (i) annealing and PCR; (ii) separate double digestion, followed by DNA-ligation with an appropriate vector in digested form.
Molecules 16 01625 g002
The vector and the insert usually are digested separately and purified before the fragments are annealed and fused by a DNA ligase to obtain the double stranded cyclic DNA ready for transformation by electroporation. To avoid background transformants that contain a plasmid lacking the desired DNA fragment it is highly recommended to purify the vector fragment from remaining traces of undigested or partially cleaved vector DNA after restriction enzyme cleavage. A very straightforward procedure for removal of these side products is the isolation of linear vector DNA by sucrose gradient density centrifugation [57]. Though the transformation efficiency of the electrocompetent cells is high and may approach 109–1010 transformants per µg supercoiled plasmid DNA, in practice the yield is often much lower for ligation products and multiple parallel transformations using several hundred µg of DNA are often required to obtain >109 transformants.

2.5. Determining the Variability of the Completed Peptide Library

After electroporation and growth, individual bacterial colonies are propagated and the produced viral material is analysed by DNA sequencing. To assure a reasonable significance, at least 50–100 phage clones should be sequenced. Factors of interest are the frequency of appearance of each amino acid and the distribution of dipeptide fragments. For example, DeGraaf et al. constructed a decapeptide library and 52 individual clones were examined to determine the nucleotide diversity and frequency of the amino acids. The analysis of 520 dipeptides showed that 245 dipeptides were present in this library. Considering the theoretical number of 400 possible permutations this analysis revealed that a high diversity was achieved [58]. Equations to calculate the diversity of a phage library and the RELIC database offer improved statistical analysis for this process [59,60,61]. Moreover, next-generation-sequencing offers the opportunity to check the quality of the library by sequencing tens of thousands individual clones [62]. In addition the library can be tested by isolating phages that bind well-characterized targets like immobilized streptavidin. In the case of streptavidin the bound phages, which are eluted by biotin, are expected to carry the mimotope sequence His-Pro-Gln in the displayed peptide [63].
After the final electroporation and diversity analysis of the gene pool it would be desirable to expand the number of phage copies of the initial phage library via re-infection of E. coli and generation of multiple copies of each library member. However, this procedure is at risk to compromise the initial library diversity due to non linear propagation of the individual clones. Figure 3 shows the relative clone accumulation obtained in two amplification steps, if 5 percent of the individual clones in the initial library propagate with a growth rate differing by the factor 1.5.
Figure 3. Schematic illustration of the results obtained with clones possessing different propagation rates.
Figure 3. Schematic illustration of the results obtained with clones possessing different propagation rates.
Molecules 16 01625 g003
This clearly causes a significant bias of the homogeneity of the library to be used in the following screening experiments.

3. Screening Procedures

Screening protocols differ in many aspects such as presentation of the immobilized target to the phage population or the extraction of target-bound phages [64]. The target can for example be immobilized to plastic surfaces, magnetic beads or presented on the surface of whole cells [65]. It is even possible to use tumor bearing animals for in vivo selection by sacrificing the animal and propagation of the phages enriched in the tumor tissue [66].
In an idealized panning experiment less than 1 percent of the initially used library can be expected to bind to the target prior to extraction and amplification in E. coli. Considering an amplification factor of three for a selectively binding clone, the library would consist of 81 percent binding individuals after four cycles as illustrated in Figure 4.
Figure 4. Enrichment of binding phages by affinity selection.
Figure 4. Enrichment of binding phages by affinity selection.
Molecules 16 01625 g004
Considering that most clones present in a library bind the target by unspecific interactions and possess different evolutionary fitness, phages can be allocated to three categories: non-binding, unspecific binding and specific binding, which are subdivided into fit and unfit individuals as schematically shown in Figure 5.
Figure 5. Schematic representation of the different types of phages with respect to their target binding and evolutionary fitness.
Figure 5. Schematic representation of the different types of phages with respect to their target binding and evolutionary fitness.
Molecules 16 01625 g005
A serious difficulty can emerge for specific binders that are present in a low number of copies and slowly propagating. Unspecific binders with high propagation competence can outgrow the specific binders in the following amplification step. Subsequent selection by stringent competition amongst the clones or negative panning is not sufficient to remove the unspecific clones (Figure 6).
Figure 6. Schematic representation of the relative amplification of unspecific binders with a high propagation rate versus selective binders with moderate propagation rates after one panning round.
Figure 6. Schematic representation of the relative amplification of unspecific binders with a high propagation rate versus selective binders with moderate propagation rates after one panning round.
Molecules 16 01625 g006
This problem can be overcome by starting the panning, as shown in Figure 7, with a negative panning round. In this case the amount of the unspecifically binding phages with a high propagation rate is significantly reduced. The depletion (negative panning) can be achieved by affinity chromatography of the library using a column that contains the agent that is used to immobilize the target structure [30]. The subsequent positive panning is then performed with a population that is depleted from unselective clones.
Figure 7. Schematic representation of the effect obtained by applying negative panning prior to the positive selection step.
Figure 7. Schematic representation of the effect obtained by applying negative panning prior to the positive selection step.
Molecules 16 01625 g007
As phage libraries with a high number of random positions contain only a small fraction of the possible peptide motifs, a mutagenesis approach can be applied for the fine tuning of the initially obtained clones [67]. This secondary library can be constructed for instance by error prone PCR or by using random primers. Despite to the low mutation rate, error prone PCR possesses the advantage that only DNA isolated from panning is required, while the primer method offers high mutation rates and directed leitmotif optimization but requires defined oligonucleotides, which can cover the whole random region.

4. Sequence and Structural Motifs for Displayed Peptides

Peptides that form secondary structures possess multiple advantages, for example higher stability in vivo and enhanced target affinity due to a decreased loss of entropy associated with their ligand-receptor binding. Phage display enables the application of a variety of such mini-protein scaffolds. The scaffolds can mimic antibodies, posses better pharmacokinetics and are accessible by synthetic chemistry [68,69,70]. Disulfide bridges are a widespread component of many natural miniproteins like scorpion-venoms or plant-toxins [71]. They are often used in phage display, because of their enhanced half life and binding characteristics [72]. One or more disulfide bridges are the most common used method to introduce constraints similar to natural compounds like antibodies or de novo cyclic peptides [73,74,75,76,77].
Chemical modification of surface peptides is a new and promising approach. The prime example for this methodology is a library containing bicyclic peptides developed by Heinis et al. In this library three cysteine residues at fixed positions are interconnected by 1,3,5-tris(bromomethylene)benzene. Expression of the peptide, which contains three thiol groups, required the constructing pIII gene that encodes for a cysteine free pIII protein. This chemically modified phage library enabled the selection of a plasma kallikrein inhibitor with a nanomolar inhibition constant [78]. Alpha helices and beta sheets can be constructed by coding lipophilic and hydrophilic amino acids at distinct repeats. A random lipophilic position is achieved by placing in a codon a thymidine central between two fully randomized nucleotides; the hydrophilic counterpart is coded by a C or A in the first, A in the second and a mix of every nucleotide in third position. Moreover, basic or acidic positions can be designated by choosing the codon sets A-A/G-A/G for Lys and Arg or G-A-T/C/A/G for Asp and Glu. If these two randomized codon groups are arranged in an alternating fashion, beta sheets result. A heptad repeat, i.e. a lipophilic amino acid in positions a, d and e, produces a bundle of four alpha helices [79].
A very promising new approach relies on the expansion of the genetic code via phage display of peptides and proteins containing unnatural amino acids. This work was pioneered by Schultz and co-workers and relies on the presence in E. coli of an additional orthogonal tRNA/aminoacyl-tRNA synthetase pair capable of incorporating various unnatural amino acids into proteins in response to unique nonsense codons [80,81]. More examples for phage library screening of peptides containing unnatural residues can be expected to come over the next years expanding the scope of phage display technology.

5. Conclusions

Most of the work on phage display screening is performed with commercially available libraries. However, the potential of these libraries is restricted – specifically designed libraries expand the scope of phage display techniques. The review discusses the biochemical and chemical strategies that allow the synthesis of homogenous libraries with an optimal coverage of their maximal diversity. Moreover, the statistical and technical considerations that allow to deduce optimal screening strategies are discussed. The consideration of several factors such as the selection of the technique to introduce the diversity and the methods of amplification of the library is mandatory to obtain reliable libraries with an optimal diversity. Besides the synthesis of the random primers and their insertion into the phage library, the selection and propagation of the specifically binding phage populations have to be taken into account.

Acknowledgements

This work was supported by the Bundesministerium für Bildung und Forschung, grant number 13N 10269 Biotrace Mobi Tech and by the Deutsche Forschungsgemeinschaft grant number HA 2901/6-1.

References

  1. Zoller, F.; Eisenhut, M.; Haberkorn, U.; Mier, W. Endoradiotherapy in cancer treatment-basic concepts and future trends. Eur. J. Pharmacol. 2009, 625, 55–62. [Google Scholar] [CrossRef]
  2. Merrifield, R.B. Solid-phase peptide synthesis. 3. An improved synthesis of bradykinin. Biochemistry 1964, 3, 1385–1390. [Google Scholar] [CrossRef]
  3. Merrifield, R.B.; Steward, J.M. Automated peptide synthesis. Nature 1965, 207, 522–523. [Google Scholar] [CrossRef]
  4. Chang, C.D.; Meienhofer, J. Solid-phase peptide synthesis using mild base cleavage of N-alpha-fluorenylmethoxycarbonylamino acids. Int. J. Pept. Protein Res. 1978, 11, 246–249. [Google Scholar]
  5. Reubi, J.C. Peptide receptors as molecular targets for cancer diagnosis and therapy. Endocr. Rev. 2003, 24, 389–427. [Google Scholar] [CrossRef]
  6. Pang, W.; Tam, S.C.; Zheng, Y.T. Current peptide HIV type-1 fusion inhibitors. Antivir. Chem. Chemother. 2009, 20, 1–18. [Google Scholar] [CrossRef]
  7. Jackson, D.C.; Lau, Y.F.; Le, T.; Suhrbier, A.; Deliyannis, G.; Cheers, C.; Smith, C.; Zeng, W.; Brown, L.E. A totally synthetic vaccine of generic structure that targets Toll-like receptor 2 on dendritic cells and promotes antibody or cytotoxic T cell responses. Proc. Natl. Acad. Sci. USA 2004, 101, 15440–15445. [Google Scholar]
  8. Graham, K.; Wang, Q.; Boy, R.G.; Eisenhut, M.; Haberkorn, U.; Mier, W. Synthesis and evaluation of intercalating somatostatin receptor binding peptide conjugates for endoradiotherapy. J. Pharm. Pharm. Sci. 2007, 10, 286. [Google Scholar]
  9. Reubi, J.C.; Schär, J.C.; Waser, B.; Wenger, S.; Heppeler, A.; Schmitt, J.S.; Mäcke, H.R. Affinity profiles for human somatostatin receptor subtypes SST1-SST5 of somatostatin radiotracers selected for scintigraphic and radiotherapeutic use. Eur. J. Nucl. Med. 2000, 27, 273–282. [Google Scholar] [CrossRef]
  10. Liu, S. Radiolabeled cyclic RGD peptides as integrin αvβ3-targeted radiotracers: Maximizing binding affinity via bivalency. Bioconjug. Chem. 2009, 20, 2199–2213. [Google Scholar] [CrossRef]
  11. Kubas, H.; Schäfer, M.; Bauder-Wüst, U.; Eder, M.; Oltmanns, D.; Haberkorn, U.; Mier, W.; Eisenhut, M. Multivalent cyclic RGD ligands: influence of linker lengths on receptor binding. Nucl. Med. Biol. 2010, 37, 885–891. [Google Scholar] [CrossRef]
  12. Smith, G.P. Filamentous fusion phage: novel expression vectors that display cloned antigens on the virion surface. Science 1985, 228, 1315–1317. [Google Scholar]
  13. Serizawa, T.; Sawada, T.; Matsuno, H.; Matsubara, T.; sato, T. A peptide motif recognizing a polymer stereoregularity. J. Am. Chem. Soc. 2005, 127, 13780–13781. [Google Scholar] [CrossRef]
  14. Chen, J.; Serizawa, T.; Komiyama, M. Peptides recognize photoresponsive targets. Angew. Chem. Int. Ed. 2009, 48, 2917–2920. [Google Scholar] [CrossRef]
  15. Seker, U.O.S.; Wilson, B.; Sahin, D.; Tamerler, C.; Sarikaya, M. Quantitaive affinity of genetically engineered repeating polypetides to inorganic surfaces. Biomacromolecules 2009, 10, 250–257. [Google Scholar] [CrossRef]
  16. Mao, C.; Liu, A.; Cao, B. Virus-based chemical and biological sensing. Angew. Chem. Int. Ed. Engl. 2009, 48, 6790–6810. [Google Scholar] [CrossRef]
  17. Haberkorn, U.; Eisenhut, M.; Altmann, A.; Mier, W. Endoradiotherapy with peptides-status and future development. Curr. Med. Chem. 2008, 15, 219–234. [Google Scholar] [CrossRef]
  18. Zang, L.; Shi, L.; Guo, J.; Pan, Q.; Wu, W.; Pan, X.; Wang, J. Screening and identification of a peptide specifically targeted to NCI-H1299 from a phage display peptide library. Cancer Lett. 2009, 281, 64–70. [Google Scholar] [CrossRef]
  19. He, M.; Taussig, M.J. Ribosome display: Cell-free protein display technology. Brief. Funct. Genomic Proteomic. 2002, 1, 204–212. [Google Scholar] [CrossRef]
  20. Kopsidas, G.; Carman, R.; Stutt, E.; Raicevic, A.; Roberts, A.; Siomos, M.A.; Dobric, N.; Pontes-Braz, L.; Coia, G. RNA mutagenesis yields highly diverse mRNA libraries for in vitro protein evolution. BMC Biotechnol. 2007, 7, 18. [Google Scholar] [CrossRef]
  21. Zahnd, C.; Amstutz, P.; Plückthun, A. Ribosome display: selecting and evolving proteins in vitro that specifically bind to a target. Nat. Methods 2007, 4, 269–279. [Google Scholar] [CrossRef]
  22. Georgiou, G.; Poetschke, H.L.; Stathopoulos, C.; Francisco, J.A. Practical applications of engineering gram-negative bacterial cell surfaces. Trends Biotechnol. 1993, 11, 6–10. [Google Scholar] [CrossRef]
  23. Little, M.; Fuchs, P.; Breitling, F.; Dubel, S. Bacterial surface presentation of proteins and peptides: an alternative to phage technology? Trends Biotechnol. 1993, 11, 3–5. [Google Scholar] [CrossRef]
  24. Shibasaki, S.; Maeda, H.; Ueda, M. Molecular display technology using yeast-arming technology. Anal. Sci. 2009, 25, 41–49. [Google Scholar] [CrossRef]
  25. Marvin, D.A. Filamentous phage structure, infection and assembly. Curr. Opin. Struct. Biol. 1998, 8, 150–158. [Google Scholar] [CrossRef]
  26. Krichevsky, A.; Rusnati, M.; Bugatti, A.; Waigmann, E.; Shohat, S.; Loyter, A. The fd phage and a peptide derived from its p8 coat protein interact with the HIV-1 Tat-NLS and inhibit its biological functions. Antivir. Res. 2005, 66, 67–78. [Google Scholar]
  27. Garufi, G.; Minenkova, O.; Lo Passo, C.; Pernice, I.; Felici, F. Display libraries on bacteriophage lambda capsid. Biotechnol. Ann. Rev. 2005, 11, 153–190. [Google Scholar] [CrossRef]
  28. Russel, M.; Model, P. Filamentous phage. In The Bacteriophages, 2nd; Calendar, R., Ed.; Oxford University Press Inc.: New York, NY, USA, 2005. [Google Scholar]
  29. Sidhu, S.S. Engineering M13 for phage display. Biomol. Eng. 2001, 18, 57–63. [Google Scholar] [CrossRef]
  30. Pande, J.; Szewczyk, M.M.; Grover, A.K. Phage display: Concept, innovations, applications and future. Biotechnol. Adv. 2010, 28, 849–858. [Google Scholar] [CrossRef]
  31. Krumpe, L.R.; Atkinson, A.J.; Smythers, G.W.; Kandel, A.; Schumacher, K.M.; McMahon, J.B.; Makowski, L.; Mori, T. T7 lytic phage-displayed peptide libraries exhibit less sequence bias than M13 filamentous phage-displayed peptide libraries. Proteomics 2006, 6, 4210–4222. [Google Scholar] [CrossRef]
  32. Cwirla, S.E.; Peters, E.A.; Berrett, R.W.; Dower, W.J. Peptides on phage: A vast library of peptides for identifying ligands. Proc. Natl. Acad. Sci. USA 1990, 87, 6378–6382. [Google Scholar] [CrossRef]
  33. Watson, J.D.; Crick, F.H. Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid. Nature 1953, 171, 737–738. [Google Scholar] [CrossRef]
  34. Watson, J.D.; Crick, F.H. Genetical implications of the structure of deoxyribonucleic acid. Nature 1953, 171, 964–967. [Google Scholar] [CrossRef]
  35. Beaucage, S.L.; Caruthers, M.H. Deoxynucleoside phosphoramidites-A new class of key intermediates for deoxypolynucleotide synthesis. Tetrahedron Lett. 1981, 22, 1859–1862. [Google Scholar] [CrossRef]
  36. McBride, L.J.; Caruthers, M.H. An investigation of several deoxynucleoside phosphoramidites useful for synthesizing deoxyoligonucleotides. Tetrahedron Lett. 1983, 24, 245–248. [Google Scholar] [CrossRef]
  37. Beaucage, S.L.; Iyer, R.P. Advances in the synthesis of oligonucleotides by the phosphoramidite approach. Tetrahedron 1992, 48, 2223–2311. [Google Scholar] [CrossRef]
  38. Bossi, L. Context effects: translation of UAG codon by suppressor tRNA is affected by the sequence following UAG in the message. J. Mol. Biol. 1983, 164, 73–87. [Google Scholar] [CrossRef]
  39. Pedersen, S. Escherichia coli ribosomes translate in vivo with variable rate. EMBO J. 1984, 3, 2895–2898. [Google Scholar]
  40. Grosjean, H.; Fiers, W. Preferential codon usage in prokaryotic genes: the optimal codon-anticodon interaction energy and the selective codon usage in efficiently expressed genes. Gene 1982, 18, 199–209. [Google Scholar] [CrossRef]
  41. Makoff, A.J.; Oxer, M.D.; Romanos, M.A.; Fairweather, N.F.; Ballantine, S. Expression of tetanus toxin fragment C in E.coli: high level expression by removing rare codons. Nucleic Acids Res. 1989, 17, 10191–10202. [Google Scholar]
  42. Thanaraj, T.A.; Argos, P. Protein secondary structural types are differentially coded on messenger RNA. Protein Sci. 1996, 5, 1973–1983. [Google Scholar] [CrossRef]
  43. Phoenix, D.A.; Korotov, E. Evidence of rare codon clusters within Escherichia coli coding regions. FEMS Microbiol. Lett. 1997, 155, 63–66. [Google Scholar] [CrossRef]
  44. Widmann, M.; Clairo, M.; Dippon, J.; Pleiss, J. Analysis of the distribution of functionally relevant rare codons. BMC Genomics 2008, 9, 207. [Google Scholar] [CrossRef]
  45. Mena, M.A.; Daugherty, P.S. Automated design of degenerate codon libraries. Protein Eng. Des. Sel 2005, 18, 559–561. [Google Scholar] [CrossRef]
  46. Fellouse, F.A.; Wiesmann, C.; Sidhu, S.S. Synthetic antibodies from a four-amino-acid code: a dominant role for tyrosine in antigen recognition. Proc. Natl. Acad. Sci. USA 2004, 101, 12467–12472. [Google Scholar] [CrossRef]
  47. Belcourt, M.F.; Farabaugh, P.J. Ribosomal frameshifting in the yeast retrotransposon Ty: tRNAs induce slippage on a 7 nucleotide minimal site. Cell 1990, 62, 339–352. [Google Scholar] [CrossRef]
  48. Kayushin, A.L.; Korosteleva, M.D.; Miroshnikov, A.I.; Kosch, W.; Zubov, D.; Piel, N. A convenient approach to the synthesis of trinucleotide phosphoramidites-synthons for the generation of oligonucletide/peptide libraries. Nucleic Acids Res. 1996, 24, 1748–3755. [Google Scholar]
  49. Krumpe, L.R.H.; Schumacher, K.M.; McMahon, J.B.; Makowski, L.; Mori, T. Trinucleotide cassettes increase diversity of T7 phage-displayed peptide library. BMC Biotechol. 2007, 7, 65. [Google Scholar] [CrossRef]
  50. Virnekäs, B.; Ge, L.; Plückthun, A.; Schneider, K.C.; Wellnhofer, G.; Simon, E. Moroney Trinucleotide phosphoramidites: Ideal reagents for the synthesis of mixed oligonucleotides for random mutagenesis. Nucleic Acids Res. 1994, 22, 5600–5607. [Google Scholar]
  51. Yagodkin, A.; Azhayev, A.; Roivainen, J.; Antopolsky, M.; Kayushin, A.; Korosteleva, M.; Miroshnikov, A.; Randolph, J.; Mackie, A.H. Improved Synthesis of Trinucleotide Phosphoramidites and Generation of Randomized Oligonucleotide Libraries. Nucleos. Nucleot. Nucleic Acids 2007, 26, 473–497. [Google Scholar] [CrossRef]
  52. Gaytán, P.; Contreras-Zambrano, C.; Ortiz-Alvarado, M.; Morales-Pablos, A.; Yánez, J. TrimerDimer: an oligonucleotide-based saturation mutagenesis approach that removes redundant and stop codons. Nucleic Acids Res. 2009, 37, e125. [Google Scholar]
  53. Neuner, P.; Cortese, R.; Monaci, P. Codon-based mutagenesis using dimer-phosphoramidites. Nucleic Acids Res. 1998, 26, 1223–1227. [Google Scholar] [CrossRef]
  54. Stemmer, W.P. Rapid evolution of a protein in vitro by DNA shuffling. Nature 1994, 370, 389–391. [Google Scholar] [CrossRef]
  55. Smith, M. Synthetic DNA and biology (Nobel Lecture). Angew. Chem. Int. Ed. Engl. 1994, 33, 1214–1221. [Google Scholar] [CrossRef]
  56. Kunkel, T.A.; Roberts, J.D.; Zakour, R.A. Rapid and efficient site-specific mutagenesis without phenotypic selection. Meth. Enzymol. 1987, 154, 367–382. [Google Scholar]
  57. Adams, T.M.; Schmoldt, H.U.; Kolmar, H. FACS screening of combinatorial peptide and protein librariers displayed on the surface of Escherichia coli cells. In Evolutionary Methods in Biotechnology: Clever Tricks for Directed Evolution; Brakmann, S., Schwienhorst, A., Eds.; Wiley-VCH Verlag GmbH & Co. KGaA: Weinheim, Germany, 2004; Volume 1, pp. 31–46. [Google Scholar]
  58. DeGraaf, M.E.; Miceli, R.M.; Mott, J.E.; Fischer, H.D. Biochemical diversity in a phage display library of random decapeptides. Gene 1993, 128, 13–17. [Google Scholar] [CrossRef]
  59. Rodi, D.J.; Soares, A.S.; Makowski, L. Quantitative assessment of peptide sequence diversity in M13 combinatorial peptide phage display libraries. J. Mol. Biol. 2002, 322, 1039–1052. [Google Scholar] [CrossRef]
  60. Makowski, L.; Soares, A. Estimating the diversity of peptide populations from limited sequence data. Bioinformatics 2003, 19, 183–489. [Google Scholar] [CrossRef]
  61. Mandava, S.; Makowski, L.; Devarapalli, S.; Uzubell, J.; Rodi, D.J. RELIC-A bioinformatics server for combinatorial peptide analysis and identification of protein-ligand interaction sites. Proteomics 2004, 4, 1439–1460. [Google Scholar] [CrossRef]
  62. Dias-Neto, E.; Nunes, D.N.; Giordano, R.J.; Sun, J.; Botz, G.H.; Yang, K.; Setubal, J.C.; Pasqualini, R.; Arap, W. Next-generation phage display: integrating and comparing available molecular tools to enable cost-effective high-throughput analysis. PLoS ONE 2009, 4, e8338. [Google Scholar]
  63. Noren, K.A.; Noren, C.J. Construction of high-complexity combinatorial phage display peptide libraries. Methods 2001, 23, 169–178. [Google Scholar] [CrossRef]
  64. Bratkovic, T. Progress in phage display: Evolution of the technique and its applications. Cell. Mol. Life Sci. 2010, 67, 749–767. [Google Scholar] [CrossRef]
  65. Sun, X.; Niu, G.; Yan, Y.; Yang, M.; Chen, K.; Ma, Y.; Chan, N.; Shen, B.; Chen, X. Phage display-derived Peptides for osteosarcoma imaging. Clin. Cancer Res. 2010, 16, 4268–4277. [Google Scholar] [CrossRef]
  66. Rajotte, D.; Arap, W.; Hagedorn, M.; Koivunen, E.; Pasqualini, R.; Ruoslathi, E. Molecular heterogeneity of the vascular endothelium revealed by in vivo phage display. J. Clin. Invest. 1998, 102, 430–437. [Google Scholar] [CrossRef]
  67. Fleming, T.J.; Sachdeva, M.; Delic, M.; Beltzer, J.; Wescott, C.R.; Devlin, M.; Lander, R.C.; Nixon, A.E.; Roschke, V.; Hilbert, D.M.; Sexton, D.J. Discovery of high-affinity peptide binders to BLyS by phage display. J. Mol. Recognit. 2005, 18, 94–102. [Google Scholar] [CrossRef]
  68. Smith, G.P.; Petrenko, V.A. Phage display. Chem. Rev. 1997, 97, 391–410. [Google Scholar]
  69. Binz, H.K.; Amstutz, P.; Plückthun, A. Engineering novel binding proteins from nonimmunoglobulin domains. Nat. Biotechnol. 2005, 23, 1257–1268. [Google Scholar] [CrossRef]
  70. Skerra, A. Alternative non-antibody scaffolds for molecular recognition. Curr. Opin. Biotechnol. 2007, 18, 295–304. [Google Scholar] [CrossRef]
  71. Kolmar, H. Biological diversity and therapeutic potential of natural and engineered cystine knot miniproteins. Curr. Opin. Pharmacol. 2009, 9, 608–614. [Google Scholar] [CrossRef]
  72. McConnell, S.J.; Kendall, M.L.; Reilly, T.M.; Hoess, R.H. Constrained peptide libraries as a tool for finding mimotopes. Gene 1994, 151, 115–118. [Google Scholar] [CrossRef]
  73. Pini, A.; Bracci, L. Phage display of antibody fragments. Curr. Protein Pept. Sci. 2000, 1, 155–169. [Google Scholar] [CrossRef]
  74. Filpula, D. Antibody engineering and modification technologies. Biomol. Eng. 2007, 24, 201–215. [Google Scholar] [CrossRef]
  75. Ladner, R.C. Constrained peptides as binding entities. Trends Biotechnol. 1995, 13, 426–430. [Google Scholar] [CrossRef]
  76. McLafferty, M.A.; Kent, R.B.; Ladner, R.C.; Markland, W. M13 bacteriophage displaying disulfide-constrained microproteins. Gene 1993, 128, 29–36. [Google Scholar] [CrossRef]
  77. Uchiyama, F.; Tanaka, Y.; Minari, Y.; Tokui, N. Designing scaffolds of peptides for phage display libraries. J. Biosci. Bioengineer. 2005, 99, 448–456. [Google Scholar] [CrossRef]
  78. Heinis, C.; Rutherford, T.; Freund, S.; Winter, G. Phage-encoded combinatorial chemical libraries based on bicyclic peptides. Nat. Chem. Biol. 2009, 5, 502–507. [Google Scholar] [CrossRef]
  79. Moffet, D.A.; Hecht, M.H. De novo proteins from combinatorial libraries. Chem. Rev. 2001, 101, 3191–3203. [Google Scholar] [CrossRef]
  80. Tian, F.; Tsao, M.L.; Schultz, P.G. A phage display system with unnatural amino acids. J. Am. Chem. Soc. 2004, 126, 15962–15963. [Google Scholar] [CrossRef]
  81. Liu, C.C.; Mack, A.V.; Tsao, M.L.; Mills, J.H.; Lee, H.S.; Choe, H.; Farzan, M.; Schultz, P.G.; Smider, V.V. Protein evolution with an expanded genetic code. Proc. Natl. Acad. Sci. USA 2008, 105, 17688–17693. [Google Scholar]

Share and Cite

MDPI and ACS Style

Lindner, T.; Kolmar, H.; Haberkorn, U.; Mier, W. DNA Libraries for the Construction of Phage Libraries: Statistical and Structural Requirements and Synthetic Methods. Molecules 2011, 16, 1625-1641. https://doi.org/10.3390/molecules16021625

AMA Style

Lindner T, Kolmar H, Haberkorn U, Mier W. DNA Libraries for the Construction of Phage Libraries: Statistical and Structural Requirements and Synthetic Methods. Molecules. 2011; 16(2):1625-1641. https://doi.org/10.3390/molecules16021625

Chicago/Turabian Style

Lindner, Thomas, Harald Kolmar, Uwe Haberkorn, and Walter Mier. 2011. "DNA Libraries for the Construction of Phage Libraries: Statistical and Structural Requirements and Synthetic Methods" Molecules 16, no. 2: 1625-1641. https://doi.org/10.3390/molecules16021625

APA Style

Lindner, T., Kolmar, H., Haberkorn, U., & Mier, W. (2011). DNA Libraries for the Construction of Phage Libraries: Statistical and Structural Requirements and Synthetic Methods. Molecules, 16(2), 1625-1641. https://doi.org/10.3390/molecules16021625

Article Metrics

Back to TopTop