Malaria transmitting mosquitoes have a number of epidemiologically important characteristics that can only be detected using molecular techniques. Utilizing a MALDI-TOF based SNP genotyping platform, we developed an assay for simultaneously detecting multiple key traits (species, insecticide resistance, parasite infection and host choice) of malaria vectors.
The Anopheles gambiae species complex includes the major malaria transmitting mosquitoes in Africa. Because these species are of such medical importance, several traits are typically characterized using molecular assays to aid in epidemiological studies. These traits include species identification, insecticide resistance, parasite infection status, and host preference. Since populations of the Anopheles gambiae complex are morphologically indistinguishable, a polymerase chain reaction (PCR) is traditionally used to identify species. Once the species is known, several downstream assays are routinely performed to elucidate further characteristics. For instance, mutations known as KDR in a para gene confer resistance against DDT and pyrethroid insecticides. Additionally, enzyme-linked immunosorbent assays (ELISAs) or Plasmodium parasite DNA detection PCR assays are used to detect parasites present in mosquito tissues. Lastly, a combination of PCR and restriction enzyme digests can be used to elucidate host preference (e.g., human vs. animal blood) by screening the mosquito bloodmeal for host-specific DNA. We have developed a multi-detection assay (MDA) that combines all of the aforementioned assays into a single multiplex reaction genotyping 33SNPs for 96 or 384 samples at a time. Because the MDA includes multiple markers for species, Plasmodium detection, and host blood identification, the likelihood of generating false positives or negatives is greatly reduced from previous assays that include only one marker per trait. This robust and simple assay can detect these key mosquito traits cost-effectively and in a fraction of the time of existing assays.
Anopheles arabiensis, Anopheles coluzzii and Anopheles gambiae are the major vectors responsible for malaria transmission in Africa1. These three species are morphologically indistinguishable2 and can only be distinguished by molecular assays3-9. In addition, there are many downstream assays routinely conducted to aid epidemiological and population genetics studies. These include (1) a genotyping assay for speciation islands10-12, (2) a genotyping assay recognizing non-synonymous SNPs in the 1014th amino acid codon position of para gene (the knock-down resistance, or kdr, SNP)13-18, (3) parasite detection PCR19-23, and (4) screening for host-specific DNA in mosquito midguts24,25.
We developed a multi-detection assay (MDA) that combines all these assays into a single multiplex reaction with the goal of analyzing epidemiologically important characteristics of malaria vectors in Africa. The MDA assay includes multiple markers for detecting (1) species (A. arabiensis, A. gambiae, A. coluzzii, or other (none of the three)), (2) insecticide resistance represented in kdr SNPs (both L1014Fand L1014S), (3) presence of two major malaria parasites, Plasmodium falciparum and P. vivax, and (4) blood source from one avian and six mammalian hosts.
Traditionally, these assays were conducted in separate polymerase chain reactions. When all these assays are done using a conventional PCR platform, it requires conducting 8-10 PCR reaction assays and accompanying gel electrophoresis steps. Each individual PCR reaction from preparation to documenting results takes 4-5 hr, while the MDA method presented here takes about 5 hr in totality. This is equivalent to a 90% savings in labor costs alone. The MDA presented here costs $5 per sample to genotype all 33 SNPs. This is considerably less expensive than the single agarose gel-based assays, which costs about $1.50 per sample. Assays detecting all the characteristics covered by the MDA would require a minimum of 8-10 separate agarose gel-based assays at a cost of $12-15 per sample. Moreover, the MDA greatly reduces the chance of generating a false positive or negative by utilizing at least three markers for each parasite or host source detection.
The platform we utilized is not limited to malaria vectors but can be used in a wide variety of applications such as medicine, veterinary medicine, and basic biology26-28. In-depth association studies or population genetics studies involving a large (in order of 100s) number of samples require cost-effective assays screening for multiple markers simultaneously. Most studies that utilize two or more separate PCR assays could implement the MDA for quicker results at a lower cost.
1. PCR Amplification
2. SAP Treatment
3. SNP Extension
4. Conditioning the SNP Extension Product to Optimize Mass Spectrometry Analysis
5. MALDI-TOF Mass Spectrometry
6. Data Analysis
Species identification:
The following 5 SNPs together identify three species (A. arabiensis, A. coluzzii and A. gambiae) (Table 4). If a sample is not one of the three species, the three SNPs (01073-213, 04679-157 and 10313-052) fail to amplify.
kdr genotype for inferring insecticide resistance:
The 1014th codon of the para voltage-gated sodium channel corresponds to kdr. Two SNPs on the 2nd and 3rd base of the 1014th codon determine the three possible amino acids. The possible combinations of the genotypes are listed in Table 5. This SNP works for three species of African malaria vectors: A. arabiensis, A. coluzzii and A. gambiae. These SNPs fail to amplify in A. darlingi, a Brazilian malaria vector. This is not surprising given that mitochondrial genome sequence similarity between A. darlingi and A. gambiae is only 88.9% while mitochondrial genome sequence similarity among three African malaria vectors are over 98%. Other species have not been tested. We were successful in amplification from samples collected in Mali, Cameroon, Tanzania and Zambia.
Plasmodium detection:
If intact Plasmodium DNA is present in the mosquito tissue, we expect to detect P. falciparum or P. vivax specific DNA among the DNA sample extracted from the mosquito. The species specific SNPs were identified from sequence alignment of the cytochrome b sequence of 123 P. falciparum, 97 P. vivax, 11 P. malariae and 18 P. ovale isolate sequences from Africa available on Genbank. We designed 6 SNPs which produce unique SNP genotypes to distinguish P. falciparum or P. vivax from other Plasmodium species (Table 6). We have tested this assay on mosquito samples collected in Mali, Cameroon, Tanzania, Zambia and Brazil and confirmed that this particular set of markers works regardless of mosquito species.
Bloodmeal detection PCR:
Cytochrome b sequences from 7 host species (human, chicken, cow, dog, goat, pig and sheep) were collected from Genbank and consensus sequences were generated. SNPs informative for each host were then selected for genotyping. We tested this with blood sources from 7 different animals (Table 7).
Because PCR fragments are short (80-120 bp), this works on somewhat degraded DNA or on partially digested bloodmeals from mosquito tissue. The probability of false positive calls is greatly reduced by having multiple markers per host type.
Figure 1. Thermocycler program for SNP extension step. The amplification cycle is total of 200 (5×40).
Figure 2. Mass-spectrogram for the MDA. The X axis is mass (Da) of SNP extension products and the Y axis is signal intensity. The Red lines indicate the expected mass of Unincorporated Extension primer, SNP allele 1 and allele 2 for the selected marker. Gray lines indicate expected mass of other markers in the MDA. This plot is generated from data analysis software (Typer Analyzer or Typer Viewer) from the output data file after Step 5. Please click here to view a larger version of this figure.
Figure 3. Individual SNP genotype cluster view The X axis is the arctangent value (labeled as “Angle”) of the signal intensities of SNP allele 1 and SNP allele 2. The Y axis is the magnitude of genotype call signal. This plot is provided in the data analysis software (Typer Analyzer or Typer Viewer). Any particular data can be selected with a click of a mouse button and selected sample data is indicated by circle. Clustering analysis provided in the software clusters genotypes with a user-defined variance threshold and automatically colors three genotypes of a biallelic SNP. Example shown here indicates homozygous T (TT) individuals as blue triangles, heterozygote (TA) as green squares and homozygous A (AA) individuals as orange inverted triangles. Red circles represent No call (no amplification). Please click here to view a larger version of this figure.
Reagent | Concentration in 5 μl | Volume | Volume |
(1 rxn) | (96 rxns) | ||
Water (HPLC grade) | NA | 0.8 μl | 92.2 μl |
10x PCR Buffer with 20 mM MgCl2 | 1x (2 mM MgCl2) | 0.5 μl | 57.6 μl |
MgCl2 (25 mM) ** | 2 mM | 0.4 μl | 46.1 μl |
dNTP mix (25 mM each) *** | 500 μM | 0.1 μl | 11.5 μl |
Primer mix (500 nM each) | 100 nM | 1.0 μl | 115.2 μl |
PCR Enzyme (5 U/μl) | 1.0 U/rxn | 0.2 μl | 23.0 μl |
Total Volume: | 3.0 μl | 345.6 μl |
Table 1. Multiplex PCR cocktail recipe. The volume of each reagent for the PCR extension step for one reaction and 96 reactions with a 20% overhang.
Reagent | Volume (1 rxn) | Volume (96 rxn) |
Water (HPLC grade) | 1.53 μl | 176.3 μl |
SAP Buffer (10x) | 0.17 μl | 19.6 μl |
SAP enzyme (1.7 U/μl) | 0.30 μl | 34.6 μl |
Total Volume | 2.00 μl | 230.4 μl |
Table 2. SAP enzyme solution. The volume of each reagent for the Shrimp Alkaline Phosphatase treatment for one reaction and 96 reactions with a 20% overhang.
Reagent | Conc. In 9 μl | Volume (1rxn) | Volume (96 rxns) |
Water (HPLC grade) | NA | 0.6190 μl | 71.3 μl |
iPLEX Buffer Plus (10x) | 0.222x | 0.2000 μl | 23.0 μl |
iPLEX Termination mix | 1x | 0.2000 μl | 23.0 μl |
Extension Primer mix | 0.9400 μl | 108.3 μl | |
iPLEX enzyme | 1x | 0.0410 μl | 4.7 μl |
Total Volume | 2.0000 μl | 230.4 μl |
Table 3. SNP Extension cocktail The volume of each reagent for the SNP extension step for one reaction and 96 reactions with a 20% overhang.
Table 4. SNP genotypes for each species. 28S IGS-540, 28S IGS-649 and 01073-213 are located on the X chromosome, 04679-157 is on chromosome 2 and 10313-052 is on chromosome 3. Hybrid genotype (heterozygote) is highlighted in yellow. Fixed variant in A. coluzzii is marked in light blue and fixed variant in A. gambiae is marked in dark blue.
Table 5. SNP genotypes for kdr. Two SNPs KDR#1 and KDR#2 constitute a genotype for the 1014th codon of para gene. The first base is invariable so it is not included in the assay. Susceptible homozygous genotypes are marked in blue. The most resistant L1014F homozygote genotype is marked in dark blue. Yellow color indicates heterozygote. The L1014S, intermediately resistant genotype, is shown in orange.
Table 6. SNP genotypes for P. falciparum and P. vivax. Total 6 SNPs are used to determine the presence of malaria parasite. Pl-0041, Pl-1269 and Pl-1549 contain P. falciparum specific alleles (G, T and T respectively). Simultaneous amplification of the three alleles indicates presence of relatively intact P. falciparum DNA in sample DNA. Amplification of alternative alleles (A for Pl-0041 and Pl-1269) indicates presence of other Plasmodium species (e.g., P. vivax, P. ovale or P. malariae) in the sample. Pf-1549 is located in regions with many more P. falciparum specific flanking regions thus this SNP is only amplified when P. falciparum is present. Pl-0071, Pl-1245 and Pl-0157 contain P. vivax specific alleles (G, G and T respectively). Amplification of alternative alleles (A, A and C respectively) indicates presence of other Plasmodium DNA in the sample. Uninfected mosquitoes will have no amplification on all 6 markers.
Table 7. SNP genotypes for 7 hosts: Human, Chicken, Cow, Dog, Goat, Pig and Sheep. “NC” stands for “No Call” (no amplification) in the following table. Note that some non-specific amplification may occur at some loci, but not for all the SNPs of a particular host. Please click here to view a larger version of this figure.
The MDA is composed of five major steps: PCR amplification, shrimp alkaline phosphatase (SAP) reaction, SNP extension, extension product conditioning, and matrix-assisted laser desorption/ionization – time of flight (MALDI-TOF) mass spectrometry33-37. The first PCR amplification step amplifies DNA flanking each SNP so that enough template DNA will be available at the SNP extension step. The SAP reaction neutralizes unused dNTPs which can interfere with the following SNP extension step. SNP extension involves a single extension primer per SNP locus and mass-modified terminator nucleotides. The 3’-end modified nucleotides prevent subsequent nucleotides from being incorporated into the extension primer. Thus the mass of the single base indicates the genotype of the corresponding SNP.
The most critical step in ensuring the success of this approach is having a good dataset of existing polymorphisms in target organisms. We used well characterized SNPs for species identification (N>900) 10,12,38-40 and kdr (N>1,000)15,18. The species specific SNPs for Plasmodium were identified from sequence alignment of the cytochrome b sequences of 123 P. falciparum, 97 P. vivax, 11 P. malariae and 18 P. ovale isolate sequences from Africa which were available on Genbank. Cytochrome b sequences from 7 host species (human, chicken, cow, dog, goat, pig and sheep) were collected from Genbank for host-specific SNP identification. Based on this large body of sequence data, we were able to design a robust diagnostic assay using an assay designer software.
The number of markers each reaction can accommodate is determined by the biochemical compatibility of the primer mix given a tolerance to primer dimer potential (0.9 of 0-1 scale). Authors used the default value (1; most strict) for false priming potential. Relaxation of this parameter can potentially increase the number of SNPs that can be assayed in a single reaction in addition to the SNPs included in this study.
The PCR amplification steps are robust and typically do not require adjustment from the initial assay design. The SNP extension mix (Table 3), however, may need modification in relative quantity of each primer using mass spectrometry to ensure a sufficient signal to noise ratio is achieved for each primer. The provided SNP extension primer recipe is the result of these adjustments. Compatibility among extension primers may limit the total number of SNPs that can be multiplex in a single reaction. Since the reaction volume is small (5-8 µl), the sample plate needs to be properly sealed to prevent evaporation during amplification steps.
This platform can simultaneously score up to 35 SNPs per reaction, while other SNP genotyping platforms can accommodate over one thousand SNPs per reaction41. With the advances in genome sequencing technology, one can acquire millions of SNPs using next-generation sequencing 29,42,43. However, the technique used here provides an ideal application for studies that require genotyping a relatively small number of markers for many (100s-1,000s) individuals. Additional discussion of the design constraints of different SNP genotyping platform is covered in previous studies 41.
The MDA is aimed at characterizing epidemiologically important traits of malaria vectors and provides an ideal protocol for medium-throughput SNP genotyping assays analyzing thousands of mosquitoes collected from natural populations. The MDA presented here provides (1) over a 90% reduction in labor time, (2) a 60% reduction in reagent costs, and (3) reduction in false positives/negatives by utilizing multiple markers. The proposed assay can enhance epidemiological studies by greatly facilitating data collection. Also, it can improve genome-wide association studies by characterizing the mosquito samples with respect to population origin, insecticide resistance, parasite susceptibility and host choice. Finally, this MDA may provide inspiration and a template for designing other multiplex assays using a MALDI-TOF based SNP genotyping platform.
The authors have nothing to disclose.
We thank Drs. Anthony Cornel and Laura Norris at UC Davis and Dr. Katharina Kreppel at the University of Glasgow for providing mosquito specimens from Tanzania. We thank Ms. Smita Das and Dr. Douglas Norris from Johns Hopkins School of Public Health for sharing mosquito samples from Zambia. We also thank Mr. Lee V. Millon at the Veterinary Genetics Laboratory for training on assay design. This work was supported by the National Institute of Health grant R01AI 078183 and R21AI062929.
Name of Material/ Equipment | Company | Catalog Number | Comments/Description |
MassARRAY Analyzer Compact | Sequenom | MT9 | MALDI-TOF mass spectrometry for genomic applications to analyze nucleic acids. |
MassARRAY Nanodispenser | Sequenom | RS1000 | Transfers completed iPLEX reaction products to the SpectroCHIP |
iPLEX Gold Genotyping Reagent Set | Sequenom | 10158 | Reagents used for iPLEX assay including SAP kit. |