$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
With the use of automated magnetic-activated cell sorting (autoMCS) rather than manual magnetic cell sorting, false negative candidates are significantly reduced during biopanning of bacterial display libraries 9,14. Negative sorting steps can be also be added as needed to reduce non-specific binders to the target of interest, and it is suggested at a minimum to add a negative sort against the magnetic beads themselves up front. This negative selection against the magnetic beads themselves was completed before four rounds of biopanning the chosen bacterial display library for protective antigen (PA) binders, as shown in Figure 2, and before additional negative selection against rivax and four rounds of positive selection for abrax, as shown in Figure 3. The beads used here are conjugated to streptavidin for capture of biotinylated proteins, so unintended isolation of peptides binding to streptavidin itself is a concern and is monitored using FACS with streptavidin-conjugated to Phycoerythrin (Figure 3, SAPE). At a minimum, binding to streptavidin should be tested for any promising individual candidates when assessing binding to the target protein. Both the percent binding and nMFI for SAPE should be low values in all sorting rounds. The level of binding to streptavidin can increase somewhat with each sorting round, as occurred in Figure 3, because the population is repeatedly exposed to the streptavidin-coated magnetic beads after each sorting round. If the level of background streptavidin binding becomes problematic to downstream analysis, further negative sorting against the magnetic beads could be performed following the positive sorting round in which the streptavidin binding population increased in order to reduce downstream screening of false positives. When proteins or materials are available which could specifically interfere with downstream use of the peptide for a specific application, or which are expected to cross-react with affinity reagents for the target due to structural and/or sequence similarity, further negative sorting against that protein or material is recommended. An example is negative sorting against rivax before isolation of abrax binding peptides, due to the structural similarity and sequence homology of these two proteins 1,15. Again, cross-reactive binding to proteins used for negative sorting steps can be monitored during the analysis of sorting rounds using FACS (Figure 3, Rivax-488). In the best case scenario, percent binding and nMFI are low, as in this example.
When available, a fixed positive control peptide, such as the P2X peptide at the C-terminus of the display scaffold produced by the bacterial display library used in the representative results here, can help determine if lack of binding affinity in the FACS assay is due to lack of expression of the display scaffold itself, rather than lack of affinity of the peptide(s) for the target. In Figure 2 and Figure 3, YPet Mona binding to this P2X peptide is monitored and demonstrates that the early rounds of sorting express the scaffold poorly. This is likely predominantly due to the high frequency of stop codons in N-terminal peptides in the random library, which means that the scaffold itself was not produced. Other effects may additionally contribute, such as the presence of peptides with unexpected toxic effects to the E. coli, or decreased growth or peptide display rates for other reasons (such as mutations in the bacterial genome). Addition of EDTA during induction may improve expression during these early rounds of sorting 42, but has not yet been tested. Binding to YPet Mona in each biopanning population is improved significantly by round 3. Comparing Figure 2 to Figure 3, it is also apparent that expression level can be improved by increasing induction time to 90 min (as in Figure 2, YPet Mona) rather than 45 min (as in Figure 3, YPet Mona), although 45 min is typically sufficient. Note that the nMFI for YPet Mona is normalized to the YPet Mona binding level of the negative control cells, so values near 1.0 are typical if expression level is acceptable. Normalizing YPet Mona MFI to PBS alone MFI from the same sample is also a valid way to compare relative expression levels, but gives much higher values (compare Figure 2 and Figure 3 with results from Sarkes et al. 2015 15).
Enrichment of the library for peptides binding to the specific target of interest is typically achieved within three positive sorting rounds, but continuing to a fourth round of sorting can be beneficial, as shown in Figure 2 and Figure 3. Here, percent bound cells and nMFI continue to increase from round 3 to round 4 for both example targets, PA and abrax, which are boxed in red in Figure 2 and Figure 3, respectively. The highest affinity peptide sequences may already be present in round 3 but are likely to further enrich in round 4 as well, which aids in down-selection of potential candidates 14. Analysis of sequences from round 4 tends to reveal repeating sequences, and these repeating sequences are an excellent starting point for affinity and specificity analysis and consensus sequence determination. The eCPX_Sequencing macro provided as a supplement to this manuscript helps to streamline this initial analysis, as shown in Figure 4. Beginning with a folder of .seq or text files, the DNA sequence which lies between chosen 5' and 3' sequences is translated, organized by amino acid sequence, and analyzed for number of individual amino acid residues in each peptide. The sorted list of isolated peptide sequences, as shown on the Summary Table sheet screenshot in Figure 4, can be used to easily determine which sequences repeat and at what frequency. The cells in this spreadsheet containing repeating sequences are outlined in different colors for easier analysis. It is recommended to check these repeating sequences for consensus sequences and other trends. This is aided by sequence alignment software such as Kalign and Clustal Omega, and analysis can be performed on the entire sequence output (including both repeating and non-repeating sequences at the frequency they appeared) and on the down-selected list of repeating sequences (with or without their relative frequency). Representative results for PA and abrax repeating sequences, without taking frequency into account, are shown in Figure 5A and 5B, respectively. Note that in Figure 5A for the PA target, several of the individual repeating sequences themselves contain the consensus sequence WFCFTC (or a similar sequence), underlined in red, which was determined by applying Jalview software analysis to a Kalign sequence alignment of the repeating sequences. This consensus, as WXCFTC, was previously determined to be a PA binding consensus, which provides confidence in the sorting method 9. In Figure 5B, where repeating sequences for the abrax target were analyzed in the same manner, the result was quite different. First, there were only five sequences that repeated in round 4 of biopanning for abrax binders, as opposed to the fifteen repeating sequences isolated from round 4 of biopanning for PA binders (although 44% more colonies were also sequenced for PA). Second, none of the five repeating sequences contained the most promising "consensus" sequence (FWAWF, underlined in purple), although candidate AX-A15 contained the sequence that best matched (FWDTWF). Further analysis, including binding affinity and specificity determination, helped to provide the consensus of FWDTWF determined from the top five candidates for abrax binding in Figure 5C, as explained below.
A representative colony from each repeating sequence can be further analyzed for on-cell affinity and specificity using FACS, as in Figure 6A for the repeating sequences from biopanning for abrax binding peptides. Here it is clear that all five repeating sequences have higher affinity for abrax, the intended target, than rivax, the structurally similar protein used for negative sorting, or streptavidin, which is present during the sort due to the magnetic beads used for biopanning. This is consistent with the already promising results for affinity and specificity of the sorting rounds shown in Figure 3, where it is clear that enrichment for binding to the intended target, abrax, is increasing with each round of biopanning while affinity for the similar protein, rivax, is not. However, a satisfactory consensus sequence was not determined for the abrax sort using the repeating sequences from round 4 alone (Figure 5B and above discussion). Returning to the 100 sequenced colonies from round 4, however, it was noted by eye when searching for sequences similar to the best match for the predicted consensus that one additional candidate, AX-A12, contained the FWDTWF sequence that was noted in isolate AX-A15, and that another candidate, AX-A14, contained a similar sequence, DWNTWF. These and other sequences that either contained similarities to the repeating sequences, demonstrated similarities to other non-repeating sequences, or were chosen randomly, were analyzed by FACS for binding affinity and specificity. Those tested are ranked in Sarkes et al. 2016 1 and the top 5 binders from this analysis are shown in Figure 6B, with the consensus sequence determined to be FWDTWF, shown in Figure 5C.
A similar analysis of binding affinity for repeating peptide sequences in round 4 of biopanning for peptides that recognize the PA target revealed that the best binders, as ranked by the ratio of PA-488 nMFI:SAPE nMFI, contained the WXCFTC consensus or a similar sequence (Table 1). Using this ratio, the sequences self-organized into those that contained the consensus and those that did not. The top candidates, all containing sequences related to the consensus, were analyzed similarly to the methods described above for abrax in Figure 6, as presented in Sarkes et al. 2015 14. These results demonstrate that for some targets, analysis of peptides with repeating sequences may be adequate to obtain binders with significant affinity and specificity for a chosen target, and to determine a consensus sequence that may be, in itself, sufficient for binding. Note, however, that the number of repeats does not necessarily reflect the relative binding affinity for the target of interest (Table 1). When analysis of repeating sequences alone is not sufficient to determine a pattern, it is helpful to alternate between sequence analysis and binding analysis to narrow down the pool of candidates and reexamine the trends among those candidates with higher affinity. Even when a trend is observed from analyzing the repeating sequences alone, additional non-repeating sequences may demonstrate the same trends, or other trends, upon further investigation.

Figure 1: Schematic of biopanning protocol for bacterial display libraries. As illustrated here, biopanning bacterial display libraries is a cyclical process. Each cell in the bacterial display library (see table of materials) contains plasmid DNA which is retained as long as the cells are grown in the presence of the antibiotic for which the plasmid contains a resistance gene (choloramphenicol in this case). Each plasmid also encodes the display scaffold protein which exposes a random peptide to the exterior of the cell. Since each cell should only contain a single plasmid DNA sequence, each cell should only display a single peptide, in multiple locations throughout the cell membrane. Depending on level of library diversity at the start of biopanning and after each sorting round, the DNA sequence may or may not be unique from cell to cell. Biopanning begins with growth and induction of the cell library to display individual peptides on their outer membrane. These peptides can then interact with a target protein (which is biotinylated or otherwise tagged for capture). For magnetic-activated cell sorting (MCS), unbound protein is removed and the cells are incubated with magnetic beads that are coated with a capture protein (streptavidin in this example, which has a strong interaction with biotin). The entire culture is then exposed to a magnet to separate bound cells from unbound cells. Using an automated magnetic sorting device is preferred for cell separation to reduce false positives. Illustrated here is a positive sort, in which the cells that are bound to and co-elute with the magnetic beads are retained. For a negative sort, which we typically perform up front, the process is the same except that the cells that are washed off of the magnetic beads are retained instead. The library fraction of interest, bound (positive sort) or unbound (negative sort), is grown overnight and used in the next round of sorting. Each subsequent round of positive biopanning requires a decrease in target concentration and bead volume to improve stringency. After each round of biopanning, affinity and specificity of the peptides for the protein target are assessed using fluorescence-activated cell sorting (FACS) to help determine when to stop biopanning and start investigating individual candidates. As needed, DNA sequencing and peptide sequence analysis are performed. These analysis steps may be in themselves a cyclical process, particularly for revealing trends in the individual isolates after the final round of biopanning. At this point, peptide sequence trends and/or consensus are correlated with binding affinity and specificity. Please click here to view a larger version of this figure.

Figure 2: Example data for FACS analysis of sorting rounds: PA target. Binding affinity as determined by FACS is shown after 4 rounds of biopanning (positive sort) the chosen bacterial display library for peptides binding to the target, protective antigen (PA). This was performed after first performing a negative sort against the streptavidin-coated magnetic beads. For binding assessment, cells were induced with 0.4% L-arabinose for 90 min before incubation with target and control solutions and analyzing by FACS. Binding affinity analysis for the intended target is boxed in red. Shown here are scatter plots of FITC-A vs FSC-A and values for percent cells bound (as compared to the gated negative control incubated with PBS alone) and normalized median fluorescence intensity (nMFI, as compared to peptide-free negative control incubated with the same fluorophore-labeled protein) of induced cells that were incubated for 45 min with: PBS buffer alone, 150 nM YPet Mona (positive control for peptide expression), or 250 nM PA-488 (labeled target). Note the increasing enrichment in binding affinity for the target of interest after each round of biopanning. In contrast to Figure 3, all autoMCS cell separations were completed using program Posselds. Part of this data can also be visualized in scatter plots of FITC-H vs FSC-H in Sarkes et al. 2015 14 for comparison to the FITC-A vs FSC-A plots shown here. Please click here to view a larger version of this figure.

Figure 3: Example data for FACS analysis of sorting rounds: abrax target. Similarly to Figure 2, shown here is comparative binding affinity for a complete biopanning experiment, as determined by FACS. The bacterial display library was subjected to negative sorting against the streptavidin-coated magnetic beads, as in Figure 2, but was then subjected to an additional round of negative sorting against a homologous protein with similar structure and function, rivax, before performing 4 rounds of positive biopanning for peptides binding to the intended target, abrax. For binding assessment, cells were induced with 0.4% L-arabinose for 45 min before binding to target, positive control, and negative controls and reading on FACS. Binding affinity for the intended target is boxed in red. Shown here are scatter plots of FITC-A vs FSC-A (or PE-A vs FSC-A in the case of SAPE) and values for percent cells bound (as compared to the gated negative control incubated with PBS alone) and nMFI of induced cells that were incubated for 45 min with: PBS buffer alone, 150 nM YPet Mona (positive control for peptide expression), 250 nM Abrax-488 (labeled protein target), 250 nM Rivax-488 (labeled potential cross-reactive protein target), or 250 nM SAPE (negative control for direct binding to streptavidin-coated magnetic beads). Note the increasing enrichment in binding affinity for the target of interest, abrax, after each round of biopanning, and minimal binding affinity for the structurally similar protein, rivax. In contrast to Figure 2, positive biopanning round 1 was performed using the autoMCS Posselds program while subsequent sorting rounds were completed using program Posseld, as suggested for biopanning in this manuscript. This data can also be visualized in scatter plots of FITC-H vs FSC-H in Sarkes et al. 2016 15 for comparison to the FITC-A vs FSC-A plots shown here. Please click here to view a larger version of this figure.

Figure 4: Example sequence analysis output using "eCPX_Sequencing" macro. Shown is a screenshot of 48 sequenced colonies from round 4 of biopanning the chosen bacterial display library for PA binders using the Posselds method. The "Summary Table" sheet is shown here. Note that the colonies were numbered in the alphabetical order of their given file name during the sequencing process and that the boxes surrounding sequences that repeat at least once are outlined in different colors for easier data analysis. The eCPX_Sequencing macro is provided as a supplemental code file. Please click here to view a larger version of this figure.

Figure 5: Sequence alignment and consensus determination for two distinct protein targets. All images shown here were generated using Jalview software with Clustal_X coloring after aligning sequences using Kalign online analysis software 51. A) Alignment of repeating sequences from PA biopanning round 4 (corresponds with Table 1). B) Alignment of repeating sequences from abrax biopanning round 4 (corresponds with Figure 6A). C) Alignment of top 5 candidates from abrax biopanning round 4, as determined by FACS analysis of individual candidates (corresponds with Figure 6B). The red line underlines the consensus sequence in A and C, while the purple line in B underlines the precursor to the consensus sequence determined for abrax binding when examining repeating sequences only, highlighting the potential need to test binding affinity using FACS before a consensus can be determined in some cases. Similar alignments generated with different analysis software can be seen in Sarkes et al. 2015 14 and Sarkes et al. 2016 1. Please click here to view a larger version of this figure.

Figure 6: Example binding affinity and specificity for individual candidates analyzed using FACS. Shown here is the normalized median fluorescence intensity (nMFI) determined using FACS for A) repeating sequences and B) the top 5 candidates, as determined by comparative binding affinity for the target, abrax, from round 4 of biopanning. The data represents the average (bars) and standard deviation (error bars) of three independent replicate experiments. Note that all candidates shown are quite specific for abrax (abrax-488, green bars) over a structurally similar protein, rivax (Rivax-488, red bars), and the streptavidin negative control (SAPE, black bars). Although two of the repeating sequences in A (AX-07 and AX-15) were also among the top 5 candidates in B, comparison of results in A and B demonstrates that analysis of repeating sequences alone may not be enough to isolate the highest affinity peptides. Data shown here was adapted from Sarkes et al. 2016 1. Please click here to view a larger version of this figure.

Table 1: Ratio of specific binding to non-specific binding for individual repeat candidates. In this example, the number of repeat sequences and the ratio of PA (target) nMFI to SAPE (negative control) nMFI is shown for the repeating candidate sequences from PA biopanning round 4. This data was sorted by relative PA nMFI:SAPE nMFI ratio and analyzed for the presence of the known WXCFTC consensus, or a similar sequence, which is shown in bold font and underlined. Notice that the sequences self-organize such that those candidates with the consensus have the highest PA nMFI:SAPE nMFI ratio, and therefore interact more specifically with the target. The results shown are from a single FACS experiment. Peptides with lowest affinity for the target were excluded from the additional replicates and analysis that were published in Sarkes et al. 2015 14.
Suppplemental File 1: Please click here to download this file.
Suppplemental File 2: Please click here to download this file.