We provide a protocol that can be generally applied to select aptamers that bind to infectious viruses only and not to viruses that have been rendered non-infectious by a disinfection method or to any other similar viruses. This opens the possibility of determining infectivity status in portable and rapid tests.
Virus infections have a major impact on society; most methods of detection have difficulties in determining whether a detected virus is infectious, causing delays in treatment and further spread of the virus. Developing new sensors that can inform on the infectability of clinical or environmental samples will meet this unmet challenge. However, very few methods can obtain sensing molecules that can recognize an intact infectious virus and differentiate it from the same virus that has been rendered non-infectious by disinfection methods. Here, we describe a protocol to select aptamers that can distinguish infectious viruses vs non-infectious viruses using systematic evolution of ligands by exponential enrichment (SELEX). We take advantage of two features of SELEX. First, SELEX can be tailor-made to remove competing targets, such as non-infectious viruses or other similar viruses, using counter selection. Additionally, the whole virus can be used as the target for SELEX, instead of, for example, a viral surface protein. Whole virus SELEX allows for the selection of aptamers that bind specifically to the native state of the virus, without the need to disrupt of the virus. This method thus allows recognition agents to be obtained based on functional differences in the surface of pathogens, which do not need to be known in advance.
Virus infections have enormous economic and societal impacts around the world, as became increasingly apparent from the recent COVID-19 pandemic. Timely and accurate diagnosis is paramount in treating viral infections while preventing the spread of viruses to healthy people. While many virus detection methods have been developed, such as PCR tests1,2 and inmunoassays3, most of the currently used methods are not capable of determining whether the detected virus is actually infectious or not. This is because the presence of components of the virus alone, such as viral nucleic acid or proteins, does not indicate that the intact, infectious virus is present, and levels of these biomarkers have shown poor correlation with infectivity4,5,6. For example, viral RNA, commonly used for the current PCR-based COVID-19 tests, has very low levels in the early stages of infection when the patient is contagious, while the RNA level is often still very high when patients have recovered from the infection and are no longer contagious7,8. The viral protein or antigen biomarkers follow a similar trend, but typically appear even later than the viral RNA and thus are even less predictive of infectability6,9. To address this limitation, some methods that can inform on the infectivity status of the virus have been developed, but are based on cell culture microbiology techniques that require a long time (days or weeks) to obtain results4,10. Thus, developing new sensors that can inform on the infectability of clinical or environmental samples can avoid delays in treatment and further spread of the virus. However, very few methods can obtain sensing molecules that can recognize an intact infectious virion and differentiate it from the same virus that has been rendered non-infectious.
In this context, aptamers are particularly well-suited as a unique biomolecular tool11,12,13,14. Aptamers are short, single-stranded DNA or RNA molecules with a specific nucleotide sequence that allows them to form a specific 3D conformation to recognize a target with high affinity and selectivity15,16. They are obtained by a combinatorial selection process called systematic evolution of ligands by exponential enrichment (SELEX), also known as in vitro selection, that is carried out in test tubes with a large random DNA sampling library of 1014-1015 sequences17,18,19. In each round of this iterative process, the DNA pool is first subjected to a selection pressure through incubation with the target under the desired conditions. Any sequences that are not bound to the target are then removed, leaving behind only those few sequences that are able to bind under the given conditions. Finally, the sequences that have been selected in the previous step are amplified by PCR, enriching the population of the pool with the desired functional sequences for the next round of selection, and the process is repeated. When the activity of the selection pool reaches a plateau (typically after 8-15 rounds), the library is analyzed by DNA sequencing to identify the winning sequences exhibiting the highest affinity.
SELEX has unique advantages that can be exploited to gain increased selectivity against other similar targets20,21, such as for infectivity status of the virus22. First, a wide variety of different types of targets can be used for the selection, from small molecules and proteins to whole pathogens and cells16. Thus, to obtain an aptamer that binds to an infectious virus, an intact virus can be used as the target, instead of a viral surface protein19. Whole virus SELEX allows for the selection of aptamers that bind specifically to the native state of the virus, without the need for disruption of the virus. Second, SELEX can be tailor-made to remove competing targets21,23, such as other similar viruses or non-infectious inactivated viruses, using counter selection steps in each round of selection22. During the counter selection steps, the DNA pool is exposed to targets for which binding is not desired, and any sequences that bind are discarded.
In this work, we provide a protocol that can be generally applied for selecting aptamers that bind to an infectious virus but not to the same virus that has been rendered non-infectious by a particular disinfection method or to another related viruses. This method allows recognition agents to be obtained based on functional differences of the virus surface, which do not need to be known in advance, and so offers an additional advantage for the detection of newly emerged pathogens or for understudied diseases.
1. Preparation of reagents and buffers
2. Design and synthesis of DNA library and primers
3. Infectious and non-infectious virus samples
CAUTION: The infectious and non-infectious virus samples are biosafety level 2 (BSL2) samples that require extra care to handle safely and appropriately. All steps of the procedure that include these samples must be performed in a biosafety cabinet, or the virus solution must be in a sealed container (e.g., capped plastic tubes).
4. In vitro selection or SELEX process: initial round
NOTE: For all the steps using the infectious virus, work in a BSL2 cabinet.
5. Subsequent selection rounds
6. Monitoring the SELEX process
NOTE: To monitor the enrichment of the pool, qPCR is used in two ways. First, by absolute quantification, it is possible to test the enrichment of the pools (elution yield). Second, by monitoring the melting curve, the diversity of the pools (convergence of the aptamer species) can be evaluated30.
7. High-throughput sequencing
8. Sequencing analysis
9. Aptamer binding validation and assays
Since DNA aptamers can be obtained using SELEX in a test tube15, this SELEX strategy was carefully designed to include both positive selection steps toward the intact, whole infectious virus (i.e., retain the DNA molecules that bind to the infectious virus), as well as counter selection steps for the same virus that has been rendered non-infectious by a particular disinfection method, specifically UV-treatment, by discarding the DNA sequences that can bind to the non-infectious virus. A schematic representation of the selection process is shown in Figure 1.
As representative results of this protocol, we chose the selection of an aptamer for infectious SARS-CoV-2. Due to the requirement of biosafety level 3 (BSL3) working conditions to handle the intact, infectious SARS-CoV-2, we have chosen instead to work with a pseudotyped virus. Pseudotyped viruses are generated from a relatively harmless backbone virus, in this case a type of lentivirus (HIV), that has been modified to display the surface protein of a different virus of interest within its viral envelope, allowing it to closely mimic the surface and entry mechanism of the desired virus without the risks associated from working with dangerous human pathogens. Importantly, these pseudoviruses are modified to be defective in continuous viral replication25, 42. In this work, three different pseudotyped viruses were used: p-SARS-CoV-2, where SARS-CoV-2 spike (S) proteins are incorporated in the envelope; p-SARS-CoV-1, containing SARS-CoV-1 spike (S) protein; and p-H5N1, containing hemagglutinin and neuraminidase isolated from the highly pathogenic avian influenza virus A/Goose/Qinghai/59/05 (H5N1) strain.
In the first two rounds of selection, no counter selection step was included to minimize the nonspecific removal of DNA binders that are typically only present as single copies in beginning rounds. After the first two rounds, positive and counter selection steps were included in each round to reach a high selectivity. For counter selection, p-SARS-CoV-2 that had been rendered non-infectious by UV-treatment, as well as p-SARS-CoV-1 and p-H5N1 were used to gain selectivity against other viruses.
To monitor the selection progress, quantitative polymerase chain reaction (qPCR) was used. This technique represents a simple method that does not require labelling of the pool and can be generally applied to virtually any target30. It takes advantage of two features of the qPCR technique. First, the possibility of absolute quantification using a standard curve to calculate the elution yield, defined by the ssDNA bound to the infectious virus over the total added ssDNA. Our results showed that the elution yield initially increased with each early round of SELEX and leveled out at rounds 8 and 9 (R8 and R9), suggesting an enrichment of the pools in sequences that bind to the infectious virus (Figure 2A). Second, melting curves of every round of the DNA pool provide further evidence of the sequence diversity of the pools30. By comparing the melting curves, a shift from the peak at high melting temperature (Tm) from 77 °C to 79 °C can be observed, suggesting that the DNA pool has converged from random sequences with low Tm to more conserved sequences with higher Tm (Figure 2B). Moreover, in the last rounds (R8 and R9), a peak at low Tm appears (~70 °C). This could be related to the production of primer-dimers during the PCR. One possible solution to avoid this is to reduce the number of cycles during PCR (for instance, from 18 cycles to 12 or 15 cycles).
After confirming an enrichment of the pool by qPCR, we used high-throughput sequencing (HTS) for rounds 3, 5, 7, 8, and 9 of the SELEX to find out which sequences are responsible for the binding of the virus target. Compared with traditional cloning-sequencing procedures, HTS allows the evolution of individual sequences over multiple selections rounds to be monitored, to finally identify the aptamer sequences that are enriched over subsequent rounds. Figure 3A shows the relative abundance (in reads per millions) for the sequence SARS2-AR10, obtained over consecutive selection rounds. The secondary structure of SARS2-AR10 is predicted by Mfold software, and the results (Figure 3B) show a structured secondary structure that contains a stem loop region which may be involved in recognizing the virus. Further characterization of the affinity binding of this sequence has been reported previously22. Microscale thermophoresis (MST) and an enzyme-linked oligonucleotide assay (ELONA) demonstrated that the SARS2-AR10 aptamer binds to the infectious p-SARS-CoV-2 with a Kd = 79 ± 28 nM, and does not bind to the non-infectious p-SARS-CoV-2 or to other coronaviruses such as p-SARS-CoV-1 and 229E.
Figure 1: Schematic representation of the SELEX process of aptamers that distinguishes infectious from non-infectious viruses. Positive and counter selection steps were added in each round after the second round to reach high specificity toward the infectious virus. Please click here to view a larger version of this figure.
Figure 2: Monitoring the progress of SELEX of infectious SARS-CoV-2-specific aptamers. (A) Quantification of the elution yield (i.e., the bound ssDNA over the added ssDNA) for each round of SELEX using qPCR. (B) Melting curve for the different pools during SARS-CoV-2 aptamer selection. Please click here to view a larger version of this figure.
Figure 3: Enrichment and secondary structure of SARS2-AR10 aptamer. Reads per million (RPM) obtained by analysis of the HTS data for SARS2-AR10 sequence as a function of the selection rounds, using FASTAptamer-Count. Inset: The predicted most stable secondary structure of the SARS2-AR10 sequence based on the UNAFold software. Calculations were made at 25 °C, 100 mM NaCl, and 2 mM MgCl2. Please click here to view a larger version of this figure.
Name | DNA sequence (5′ to 3′) | ||
T20 | TTTTTTTTTTTTTTTTTTTT | ||
DNA library | ACCGTCAGTTACAATGCT- N45 -GGCTGGACTATCTGTGTA | ||
Forward primer (FwdP) | ACCGTCAGTTACAATGCT | ||
Reverse primer (RevP) | Biot-TACACAGATAGTCCAGCC | ||
SARS-AR10 | CCCGACCAGCCACCATCAGCAACTCTTCCGCGTCCATCCCTGCTG | ||
N: represent a random nucleotide; Biot: Biotin modification in the 5´end. |
Table 1: List of DNA sequences.
Supplementary Coding File 1: Cutadapt_script.txt identifies any sequences in a given input file that have the specified forward and reverse primers, discards any that do not have the primers, and removes the primers from the sequences that did have them. See reference31 for more information on Cutadapt. Please click here to download this File.
Supplementary Coding File 2: FASTAptamer_cluster_script.txt will cluster all sequences from a single count file into families/clusters of closely related/similar sequences. This is useful if a candidate sequence is identified, so that other similar sequences in the same cluster can also be identified as potential candidates as well. See reference34 for more information on FASTAptamer. Please click here to download this File.
Supplementary Coding File 3: FASTAptamer_count_script.txt will count the number of occurrences of each unique sequence from a given input FASTA/FASTQ file and produce an output file of each unique sequence in descending order (highest abundance to lowest abundance). The Count output files are required as inputs for all other FASTAptamer programs. See reference34 for more information on FASTAptamer. Please click here to download this File.
Supplementary Coding File 4: FASTAptamer_enrich_script.txt compares all sequences from any two or three input files, gives the abundance of all unique sequences between all of the files, and gives all combinations of pairwise enrichment values (in RPM) of all sequences found in multiple files. See reference34 for more information on FASTAptamer. Please click here to download this File.
Supplementary Coding File 5: fastqc_script.txt is a quality control tool for FastQ files that provides easy to read quality information for high-throughput sequencing data, such as duplication levels, read lengths, and quality scores. See reference30 for more information on FastQC. Please click here to download this File.
Supplementary Coding File 6: FASTX_fwd_rev_merge_script.txt uses the FASTX-Toolkit to output the reverse-complement of all sequences from a given reverse file. It then uses the cat command (standard in most UNIX operating systems) to merge this reverse reverse-complement file with a given forward file of sequences, to give a single file with all sequences. See reference33 for more information on FASTX-Toolkit. Please click here to download this File.
Supplementary Coding File 7: FASTX_quality_filter_script.txt uses the FASTX-Toolkit to check the quality of sequences in a given FastQ file and can discard any sequences that are of low quality. This method is generally better than trimming methods of quality filtering for analysis of in vitro selection sequences. Trimming involves removing bases (typically near the ends) of sequences based on low quality, which is acceptable for genomic sequencing, but because the entire sequence is needed for functional DNA, trimming the ends is not useful. Instead, if too many bases of a sequence are low quality, the entire sequence is discarded. See reference33 for more information on FASTX-Toolkit. Please click here to download this File.
Supplementary Coding File 8: grep_searcher_script.txt is used to search for a specific input sequence in a specific input file, and output both the ID and the sequence in an output file (if it was found in the input). This script is useful for pools that are very heterogenous (i.e., have a lot of unique sequences), which make FASTAptamer Clust take too long to run properly and result in FASTAptamer Enrich files that are too large to open in typical spreadsheet programs for analysis. Please click here to download this File.
Supplementary Coding File 9: gzip_compress_decompress_script.txt uses Gzip to compress or uncompress files. A compressed file will typically have a .gz extension appended to the file name. It is recommended to compress large files to save on file space. Please click here to download this File.
Supplementary Coding File 10: PEAR_script.txt is used only for paired-end sequencing files and will merge the Read 1 file with the corresponding Read 2 file. See reference32 for more information on PEAR. Please click here to download this File.
Supplementary Coding File 11: tar_extraction_creation_script.txt will extract or create Tar archives, which are used to collate multiple files and/or directories into a single file. They are also often compressed to reduce file size, such as with bz2 compression, as included in this script. Please click here to download this File.
SELEX allows not only the identification of aptamers with high affinity, in the pM-nM range22,43,44,45, but also with high and tunable selectivity. By taking advantage of counter selection, aptamers with challenging selectivity can be obtained. For instance, the Li group has demonstrated the ability to obtain sequences that can differentiate pathogenic bacterial strains from non-pathogenic strains21. Also, Le et al. identified an aptamer able to differentiate serotypes of Streptococcus pyogenes bacteria46. This opens the possibility of integrating aptamer molecules with unique selectivity in different functional DNA sensors12,47,48,49,50 with new nanotechnologies22,51,52 to construct sensors for different applications, such as for portable and rapid diagnostic tests or for environmental detection.
The protocol presented here allows aptamers to be obtained with high selectivity for an infectious virus over the same virus that has been inactivated and is thus non-infectious. To obtain such selectivity, the SELEX approach in this work is based on the design of the counter selection step. First, for a successful counter selection step, a correct choice and characterization of non-target samples needs to be performed. Thus, this method depends on starting with well-known infectious and non-infectious intact virus stocks. Second, a counter selection step is incorporated in each round of the selection after the first round, to apply stringent conditions from the beginning of the selection and to maximize the probability of identifying aptamers that can differentiate between the slight differences in the surface of the virus particle from the infectious and non-infectious state of the same virus.
In this protocol, the focus is on the selectivity of the aptamer. If stronger aptamer affinities are required, it is possible to increase the stringency of the process after certain SELEX rounds. For instance, by decreasing the infectious virus concentration and incubation time during the positive selection, it is possible to obtain aptamers with a stronger binding affinity44.
Another important consideration when designing the SELEX protocol is the final sample in which the aptamer will be applied. Aptamers specific to viruses are usually incorporated in complex samples (e.g., serum, saliva, and real-water samples), and this need to be considered during the aptamer selection to increase the chance of obtaining functional aptamers in the final samples. Therefore, it is important to perform SELEX in a buffer that closely mimics these samples, particularly with regard to cation concentration and pH. For instance, for the SARS-CoV-2 aptamer, we chose PBS buffer (pH 7.4) containing 2 mM MgCl2 and 0.5 mM CaCl2 to closely mimic biological samples. In addition, if other species in the samples where the aptamer will be applied can potentially compete with the target to bind the aptamer, it is possible to minimize interference from these species by including them in the counter selection step, to eliminate sequences that could bind these species.
The successful results of this SELEX process22, which has been applied to different viruses (adenovirus and SARS-CoV-2) and different inactivation methods (UV-treatment and free chlorine) indicate that this method can be widely applied for other viruses and other inactivation methods, opening a broad range of applications in the future. Aptamers that distinguish infectious viruses have potential not only in diagnostic applications to identify patients that are still contagious, but also in environmental applications where the sample is defined as contaminated if the pathogens in these samples are still active. Thus, the incorporation of aptamers with the ability to identify infectious viruses in rapid sensors offers opportunities to monitor disinfection treatments.
Furthermore, it was possible to differentiate the infectivity status of the virus without any information on the structural differences between the two infectivity states. The only information that is required is to know that the differences between both states are related to differences in the structure of molecules on the surface of intact viral particles. Thus, we are confident that the same approach can be used to differentiate variants and serotypes of the same virus, where the differences are due to small mutations in the residues of proteins on the surface of viral particles, similar to those that differentiate the infectivity status of the virus. For instance, we envision it will be possible to obtain aptamers specific to variants of concern of SARS-CoV-2 or other viruses, such as influenza, opening the possibility of rapid monitoring of variants that are the biggest threat for society.
Finally, because the SELEX strategy to obtain aptamers that can differentiate infectious from non-infectious viruses does not require any foreknowledge about what specific structural differences are present between them, it may be possible to use aptamers with this selectivity to identify the specific surface changes responsible for the loss of infectivity from various disinfection methods by a detailed characterization and identification of the binding targets of our aptamers.
The authors have nothing to disclose.
We wish to thank Ms. Laura M. Cooper and Dr. Lijun Rong from the University of Illinois at Chicago for providing the pseudovirus samples used in this protocol (SARS-CoV-2, SARS-CoV-1, H5N1), as well as Dr. Alvaro Hernandez and Dr. Chris Wright of the DNA Services facility of the Roy J. Carver Biotechnology Center at the University of Illinois at Urbana-Champaign for their assistance with high-throughput sequencing, and many members of the Lu group who have helped us with in vitro selection and aptamer characterization techniques. This work was supported by a RAPID grant from the National Science Foundation (CBET 20-29215) and a seed grant from the Institute for Sustainability, Energy, and Environment at the University of Illinois at Urbana-Champaign and Illinois-JITRI Institute (JITRI 23965). A.S.P. thanks the PEW Latin American Fellowship for financial support. We also thank the Robert A. Welch Foundation (Grant F-0020) for support of the Lu group research program at the University of Texas at Austin.
10% Ammonium persulfate (APS) | BioRad | 1610700 | |
100% Ethanol | Sigma-Aldrich | E7023 | |
1x PBS without calcium & magnesium | Corning | 21-040-CM | |
40% acrylamide/bisacrylamide (29:1) solution | BioRad | 1610146 | |
Agencourt AMPure XP Beads | Beckman Coulter | A63880 | DNA clean-up beads – Section 7.2.2 |
Amicon Ultra-0.5 Centrifugal Filter Unit | Merck | UFC501024 | cut-off 10 kDa |
Amicon Ultra-0.5 Centrifugal Filter Unit | Merck | UFC510024 | cut-off 100 kDa |
Boric Acid | Sigma-Aldrich | 100165 | |
C1000 Touch Thermal Cycler with Dual 48/48 Fast Reaction Module | BioRad | 1851148 | |
Calcium Chloride | Sigma-Aldrich | C4901 | |
CFX Connect Real-Time PCR Detection System | BioRad | 1855201 | |
Digital Dry Baths/Block Heaters | Thermo Scientific | 88870001 | |
Dynabeads MyOne Streptavidin C1 | Thermo Fisher | 65001 | streptavidin-modified magnetic beads – Section 4.9 |
EDTA disodium salt | Sigma-Aldrich | 324503 | |
Eppendorf Safe-Lock microcentrifuge tubes | Sigma-Aldrich | T9661 | 1.5 mL |
Lenti-X p24 Rapid Titer Kit | Takara Bio USA, Inc. | 632200 | Lentivirus quantification kit – Section 3.3.2.1 |
MagJET Separation Rack, 12 x 1.5 mL tube | Thermo Scientific | MR02 | |
Magnesium chloride | Sigma-Aldrich | M8266 | |
Microseal 'B' PCR Plate Sealing Film, adhesive, optical | BioRad | MSB1001 | non-UV absorbing |
Mini-PROTEAN Tetra Cell for Ready Gel Precast Gels | BioRad | 1658004EDU | |
Mini-PROTEAN Short Plates | BioRad | 1653308 | |
Mini-PROTEAN Spacer Plates with 0.75 mm Integrated Spacers | BioRad | 1653310 | |
Molecular Biology Grade Water | Lonza | 51200 | |
Multiplate 96-Well PCR Plates, high profile, unskirted, clear | BioRad | MLP9611 | |
Nanodrop One | Thermo Scientific | ND-ONE-W | |
OneTaq DNA Polymerase | New England BioLab | M0480S | |
Ovation Ultralow v2 + UDI | Tecan | 0344NB-A01 | High-troughput sequencing library preparation kit – Section 7.2. |
PIPETMAN G (100-1000 µL, 20-200 µL, 2-20 µL and 0.2-2 µL) | Gilson | F144059M, F144058M, F144056M, F144054M | |
Purifier Logic+ Class II, Type A2 Biosafety Cabinets | Labconco | 4261 | |
Qubit dsDNA BR Assay Kit | Invitrogen | Q32850 | fluorescence-based dsDNA quantification kit – Section 7.2.3 |
SHARP Classic Low Retention Pipet Tips (10 uL, 200 uL, 1000 uL) | Thomas Scientific | 1158U43, 1159M44, 1158U40 | |
Sodium acetate | Sigma-Aldrich | S2889 | |
Sodium chloride | Sigma-Aldrich | S7653 | |
Sorvall Legend Micro 17R Microcentrifuge | Thermo Scientific | 75002440 | |
SsoFast EvaGreen Supermix | BioRad | 1725201 | qPCR mastermix – Section 6.2. |
Tris(hydroxymethyl)aminomethane | Sigma-Aldrich | T1503 | |
Tubes and Ultra Clear Caps, strips of 8 | USA scientific | AB1183 | PCR tubes |
Urea | Sigma-Aldrich | U5128 |