Next Article in Journal
Four New Species of the Genus Eoneureclipsis (Trichoptera: Psychomyiidae) from China Inferred from Morphology and DNA Barcodes
Next Article in Special Issue
Location and Creation of Nest Sites for Ground-Nesting Bees in Apple Orchards
Previous Article in Journal
Not All Field Margins Are Equally Useful: Effects of the Vegetation Structure of Margins on Cereal Aphids and Their Natural Enemies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Development of Nuclear DNA Markers for Applications in Genetic Diversity Study of Oil Palm-Pollinating Weevil Populations

by
Fairuz Farhana Mohd Rodzik
1,2,*,
Nurshazwani Amalina Sudirman
1,2,
Chee-Keng Teh
1,
Ai-Ling Ong
1,
Huey-Ying Heng
1,
Salmah Yaakop
2,
Norfarhan Mohd-Assaad
3,4,
Meilina Ong-Abdullah
5,
Nabeel Ata
5,
Samsudin Amit
1,
Burhanuddin Saragih
1,6,
David Ross Appleton
1 and
Harikrishna Kulaveerasingam
1
1
Sime Darby Plantation R&D Centre, Serdang 43400, Selangor, Malaysia
2
Centre for Insect Systematics, Department of Biological Science and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia
3
Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia
4
Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
5
Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board (MPOB), 6 Persiaran Institusi, Bandar Baru Bangi, Kajang 43000, Selangor, Malaysia
6
Minamas Research Centre Pekanbaru, Jalan Baru Bakal, Tualang Timur, Kecamatan Tualang, Kabupaten Siak, Perawang 28772, Provinsi Riau, Indonesia
*
Author to whom correspondence should be addressed.
Insects 2023, 14(2), 157; https://doi.org/10.3390/insects14020157
Submission received: 10 November 2022 / Revised: 19 December 2022 / Accepted: 20 December 2022 / Published: 3 February 2023

Abstract

:

Simple Summary

The main pollinator for the African oil palm, Elaeis guineensis, in Malaysia and Indonesia is Elaeidobius kamerunicus. The weevil species is not native to these countries, but it was introduced from Cameroon, West Africa, in 1981 to improve pollination efficiency, thus improving bunch formation and yield. Forty years after the weevil introduction, recent reductions in bunch yield reported in Malaysia caused by poor bunch formation could be associated with a decrease in pollination efficiency due to the founder effect. Several genetic diversity studies of weevil populations based on morphological and mitochondrial markers have been carried out; however, the studies did not provide sufficient evidence for explaining the genetic variation, particularly at the intra-species level. This study aims to develop a set of robust E. kamerunicus-specific nuclear DNA markers to directly assess the genetic diversity of weevil populations. The marker development lays the foundation for future applications by extending the survey into larger areas where the oil palm is cultivated to obtain a more comprehensive understanding of the genetic diversity and inbreeding occurrence status of E. kamerunicus in these introduced regions. This could facilitate sustainable genetic monitoring and conservation planning of E. kamerunicus, especially in non-native oil palm-growing countries.

Abstract

The oil palm-pollinating weevil (Elaeidobius kamerunicus Faust) was introduced from Cameroon, West Africa, to Malaysia in 1981, and subsequently, to other oil palm-growing countries as well. This study aims to develop a set of robust E. kamerunicus-specific nuclear DNA markers to directly assess the genetic diversity of the weevil populations. A total of 19,148 SNP and 223,200 SSR were discovered from 48 weevils representing three origins (Peninsular Malaysia, Sabah, and Riau) using RAD tag sequencing. Subsequent filtering steps further reduced these to 1000 SNP and 120 SSR. The selected 220 SNP exhibited a polymorphism information content (PIC) of 0.2387 (±0.1280), and 8 SSR had the PIC of 0.5084 (±0.1928). These markers were found to show sufficient polymorphism, making it possible to assign 180 weevils into three major clusters from Ghana, Cameroon, and Southeast Asia (mainly in Malaysia and Indonesia). These DNA markers successfully confirmed the Cameroon origin of the Southeast Asian cluster. However, the presence of null alleles in the SSR markers, due to limited flexibility of the probe design on the short RAD tags, led to an underestimation of heterozygosity within the populations. Hence, the developed SNP markers turned out to be more efficient than the SSR markers in the genetic diversity assessment of the E. kamerunicus populations. The genetic information provides useful insight into developing guidelines for the genetic monitoring and conservation planning of E. kamerunicus.

1. Introduction

Oil palm, Elaeis guineensis Jacq., originated from West Africa. The earliest record of its introduction to Southeast Asia is found regarding the planting of four dura palms in Bogor Botanical Garden, Indonesia, in 1848 [1]. Currently, descendants of these palms have become the major planting material of Southeast Asia and Oceania [2], supplying about 34.3% of the total palm oil trade [3]. The oil crop is monoecious, producing male and female inflorescences in an alternating cycle to promote cross pollination among palms for fruit bunch development, mainly assisted by weevils under genus Elaeidobius in West Africa [4]. However, the native pollinator was unavailable in Southeast Asia when the oil palm was first planted in the region. A lower than 60% fruit-to-bunch ratio indicates inefficient pollination, which can cause a significant yield loss of fresh fruit bunches (FFB) [5]. In the late 1970s, Thrips hawaiiensis was identified in Malaysia as an alternative pollinator to the native weevil, but the species was far less efficient and usually absent from young plantings [4]. Hence, the commercial plantings were mainly hand pollinated, which was extremely costly and laborious [6]. The oil palm industry was left with no other option but to consider introducing the native pollinator from Cameroon to Malaysia [7].
The species Elaeidobius kamerunicus Faust, 1898 (Coleoptera: Curculionidae) was chosen after the species was studied to be the most predominant oil palm pollinating weevil in Cameroon. The species is also well adapted to both wet and hot seasons, and has a high pollen carrying capacity among the other Elaeidobius species [7]. In addition, host-specificity testing found that the species was unable to complete its life cycle on any plant species in the country other than the oil palm; thus, it was deemed safe to be introduced [8]. Elaeidobius kamerunicus is holometabolous, completing its development on post anthesis oil palm male inflorescence [9,10]. Adult female oviposit eggs on the male oil palm inflorescence, and the larva emerge and undergo perfect metamorphosis. The adult (weevil), attracted to the strong aniseed smell produced by male flowers at anthesis, visits the male flowers as a food source (nectar) and oviposits its eggs. Pollen grains are attached to the adult weevils’ body, mostly on the elytra, but some on the ventral part of the thorax and abdomen and under the elytra [4]. The weevil visit the female flowers by deception, as they are attracted to the same aniseed smell given off by male flower at the receptive stage. This results in the transference of pollens from the male to the female flowers (pollination) [11].
In 1981, the E. kamerunicus species in the form of pupae was imported from Plantations Pamol du Cameroon, Lobe Estate, Cameroon, to Malaysia [12]. A total of 3000 adult weevils were reared and first released into two plantations in Malaysia in 1981; the introduction was a huge success. The rapid establishment of weevils throughout the plantations led to significant improvements in fruit set and oil yield [6,12,13]. Hand pollination was then terminated, with an estimated saving of MYR 50 million per year during the 1980s [6]. The release of weevil species extended to Indonesia, Papua New Guinea, Thailand, and Colombia revolutionized the entire palm oil industry [2,14].
A declining trend of FFB, however, has occasionally been reported in Malaysia since the 1990s. Low rainfall and over-pruning, causing a reduction in the formation of female inflorescences, could be contributing factors to this trend. In addition, the descendant population of introduced E. kamerunicus probably suffered a bottleneck effect that weakened their adaptability to local climate change, making them vulnerable to parasitic nematode infection [15,16,17,18]. This may directly reduce the pollination ability of the insect. Many studies on the genetic diversity of E. kamerunicus populations were conducted to determine the presence of a bottleneck effect, but multivariate analysis based on 15 morphological characteristics of E. kamerunicus mainly explained the variation between male and female insects, rather than between geographical regions [19]. Due to strong influences from the environment, some of these morphologic traits could mask the genetic variations [20]; therefore, the founder effect still remains uncertain.
Hence, geneticists started adopting molecular markers to directly measure the genetic diversity of this important weevil population. Mitochondrial marker COI, COII and a nuclear gene fragment of arginine kinase (AK) were used in estimating the phylogeography of the E. kamerunicus populations [21]. The South American and Asian populations were found to have originated from West and Central Tropical Africa [21]. However, mitochondrial markers alone may not be sufficient to detect genetic polymorphism for an intra-species population; hence, the use of a nuclear marker was suggested [22]. Nuclear markers, which include single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers, are able to detect allele heterozygosity information, hence more useful than mitochondrial markers in insects’ genetic diversity studies [23]. Simple sequence repeats (SSR) have the advantage of being multi-allelic (having a higher marker polymorphism than SNP). In addition, SSR can be genotyped using a standard PCR-based method, thus possessing a simpler start-up requirement. On the other hand, SNP is bi-allelic, which means that it contains less genetic information per locus than SSR markers, as presented in [24]. Nevertheless, it has the capacity to be used in a high-throughput system to genotype a high number of SNP markers with a large sample size [25].
We hypothesize that the designed novel DNA markers will be effective to measure the inbreeding effect among the Malaysia populations, as well as to estimate the genetic diversity of the other introduced region (Indonesia), and to contrast with this effect with those of the origin countries (Cameroon and Ghana). Hence, the aim of this study was to develop two robust E. kamerunicus-specific nuclear marker systems, i.e., single nucleotide polymorphisms (SNP) and simple sequence repeats (SSR), using RAD (restriction site associated DNA) tag sequencing, and then to assess the effectiveness and applicability of the two marker sets to capture the expected genetic patterns in the population.

2. Materials and Methods

2.1. Sample Collection and Genomic DNA Extraction

A total of 48 individual weevils originating from Malaysia (40 individual weevils representing northern, central, southern Peninsular Malaysia, as well as 5 individual weevils from Sabah) and Indonesia (3 individual weevils from Riau) were collected for marker discovery and development. Tissues from the head and membranous part of the hind wings of each weevil were sampled for DNA extraction using Analytik JenaTM Tick DNA kit (Analytik Jena, Berlin, Germany), according to the manufacturer’s protocol. This was to ensure good quality DNA for the RAD tag sequencing and to minimize the risk of cross-contamination from other organisms, particularly nematodes, which are usually present in the abdominal segment of weevils. Subsequently, the evaluation of marker applicability for genetic diversity studies was performed, based on a larger set of 180 weevils representing two native populations (from Cameroon and Ghana) and 10 introduced populations in Malaysia and Indonesia (Table 1); each population consists of 15 individuals. The genomic DNA was extracted using Sbeadex™ Mini kit (LGC Genomics, Hoddesdon, UK) in an oKtopure™ (LGC Genomics, Hoddesdon, UK). Standard protocols, with the modification of an additional washing step using 80% ethanol (150 µL) and 25 µL of Proteinase-K, were performed. The quality of the extracted DNA was quantified using a Microplate reader (ThermoFisher Scientific, Massachusetts, USA).

2.2. RAD Tag Sequencing and Sequence Alignment

The genomic DNA of each individual weevil was digested using the Msll restriction enzyme. Digested DNA fragments were subjected to 150 bases paired-end sequencing using Illumina NextSeq500 V2 platform (Illumina Inc., California, USA). Standard pre-processing steps were taken after the de-multiplexing of the library groups using Illumina bcl2fastq 2.17.1.14 software (Illumina Inc., California, USA). Raw sequences per sample were obtained by the de-multiplexing of library groups into samples, according to their inline barcodes (with a maximum of 2 mismatches or Ns) and verification of the restriction site. All reads were processed by an adaptor clipper, and reads with a final length shorter than 20 bases were discarded as well. The next step was quality trimming by removing reads containing Ns and at the 3′-end to get the average Phred quality score (Q > 20) within a window size of 10 bases. Processed sequence reads were pooled and clustered using the CD-HIT-EST program at 95% similarity cut-off [26]. The alignment of the subsampled quality-trimmed reads against the cluster sequences was performed using BWA version 0.7.12 [27]. Variant discovery and genotyping were then performed using Freebayes v1.0.2-16 [28] by following the parameters set to minimum base quality of 10, a minimum coverage of 5, a mismatch base quality threshold of 10, a minimum supporting allele qsum of 10, and a read mismatch limit of 3. These combined variants, scored in a variant call format (vcf) file, were re-filtered for the read count, ensuring that each locus showed at least 8 reads and a minor allele frequency >5%. Lastly, the genotypes must be observed in at least 32 out of 48 samples.

2.3. SNP Discovery and Genotyping

An in-house script was used to check the availability of minimal 75 bp upstream and downstream flanking sequences of each SNP. The fastacmd program [29] under BLAST [30] was used to extract the SNP markers with adequate flanking sequences. Remapping of these markers on the tag sequence pool was performed using BLASTN with an e-value threshold of 1e-05 and 95% identity over a minimum of 140 bp to ensure high specificity to the targeted SNP markers. In addition, unique markers were then mapped against the protein (NR) database and the bacteria complete genome from the NCBI database, using BLASTN with a maximum e-value of 1e-05. The list of bacteria can be found at ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/bacteria/assembly_summary.txt (accessed on 26 January 2018). This step was performed to remove possible contaminants from weevil-specific sequences. In addition, as E. kamerunicus belongs to the order Coleoptera, those identified markers that were mapped to genomes of beetle species (Aethina tumida, Agriotes lineatus, Anoplophora glabripennis, Biphyllus lunatus, Carabus granulatus, Dascillus cervinus, Dendroctonus ponderosae, Julodis onopordi, Leptinotarsa decemlineata, Onthophagus taurus, Scarabaeus laticollis, and Tribolium castaneum) in the NR database were selected for this study. Additionally, those unmapped markers with a >90% call rate calculated in the PLINK program [31] were selected to eventually make up the 1000 SNP markers. To validate these candidate SNP markers, 75 bp upstream and downstream from an SNP were probed to genotype a total of 180 weevil samples using SeqSNPTM Genotyping by Sequencing (LGC Genomics, Berlin, Germany. Only SNP with a >95% call rate were retained for subsequent analysis [32].

2.4. SSR Discovery and Genotyping

FullSSR suite v1.0 [33] was adopted to identify SSR. Unique contigs that contained the SSR motifs were confirmed by aligning all-against-all BLASTN at the e-value threshold of 1e-25 to the cluster sequences. Only those assured contigs harboring SSR motifs with > 8X coverage in BAM files were then imported into the Primer Premier 6 program (PREMIER Biosoft, US) for primer design. The design complied with a primer length ranging from 18 bases to 23 bases, target PCR products ranging from 100 bp to 280 bp, and a melting temperature (Tm) ranging from 54 °C to 60 °C; the maximum Tm difference between the forward and the reverse primer was set at 0.5 °C. A total of 120 primer pairs with an M13-tailed forward primer for subsequent amplicon dye-labeling [34] were synthesized (IDT, Singapore). A total of 8 weevil samples originated from 8 populations—Cameroon (CAM), South Riau (SR), North Sarawak (NS), South Sarawak (SS), North Peninsular Malaysia (NPM), Central Peninsular Malaysia (CPM), South Peninsular Malaysia (SPM) and Sabah (SBH)—were genotyped to test for amplifiability and polymorphism of these 120 SSR markers. The PCR-based genotyping was carried out in 10 μL reactions consist of approximately 10 ng of genomic DNA, 7 µL 1x MyTaq Bioline buffer, 0.05 µM of forward and reverse primer, and 0.1 µM of fluorescence dye (FAM, HEX, NED or PET; Applied Biosystem, US) using Biometra PCR Thermocycler (Analytik Jena, Berlin, Germany). The PCR program was set at 95 °C for 60 s, 35 cycles of 95 °C for 15 s, and an annealing temperature of 54–58 °C for 15 s, 72 °C for 10 s, and 72 °C for 8 min. Genotypic data were generated in an ABI 3730 DNA Analyzer and Gene Mapper 7 program (Applied Biosystems, Massachusetts, USA). The amplicon of each polymorphic SSR was validated through single pass Sanger Sequencing using a genetic analyzer (Applied Biosystems, Massachusetts, USA).

2.5. Genetic Diversity and Clustering Analysis

Only SNP and SSR markers with a call rate >95% were included in the genetic diversity analyses [32]. At the marker level, genetic diversity parameters, including major allele frequency (MAF), mean number of alleles (NA), expected heterozygosity (He), observed heterozygosity (Ho), polymorphism information content (PIC) [35], and inbreeding coefficient (Fis) [36], were generated in the PowerMarker v3.25 program [37] for SSR and SNP, respectively. Same parameters of each marker system were also estimated at the population level, i.e., the assayed 12 weevil populations (Table 1). In the PowerMarker v3.25 program, genetic relatedness among 12 assayed weevil populations was subsequently estimated based on pairwise Nei’s genetic distance (D) [38] to construct a phenogram using the unweighted pair group method with arithmetic average (UPGMA) approach. These phenograms were visualized in the TreeView program [39]. In addition, principal component analyses (PCA) were performed on 180 individual weevils in R packages, namely adegenet [40], Hiefstat [41] and Pegas v 3.2.4 [42], following the references for Population Genetics in R [43]. The frequency of the null alleles (NAF) of SSR was further investigated in MICRO-CHECKER v2.2.3 [44] using the Brookfield 1 method for frequency estimation [45]. Allelic frequencies of SSR markers were corrected using the same program, followed by the re-estimation of SSR-based Fis across the populations using the PowerMarker v3.25 program [37].

3. Results

3.1. SNP Discovery and Marker Informativeness

The RAD tag sequencing produced a total of 190,055,454 reads, with an average of 1.98 million reads per weevil, out of which 1,911,233 sequences were clustered and used for marker discovery. The number of SNPs obtained in each filtering step is shown in Figure 1. The initial number of SNPs discovered in 48 weevil samples was 19,148 SNP. For probe design, the number was reduced to 8354 SNP with the length of the flanking sequences of at least 75 bp upstream and downstream. Only 4360 SNP were found to be unique, and the number subsequently dropped to 2746 SNP that had no hit to the protein database and bacterial genome from the NCBI database. From this pool, as well as an additional 46 SNPs with good hits to the NR database, a total of 1000 SNP were eventually shortlisted and deposited in the European Variation Archive (EVA) database. Nevertheless, out of these 1000 SNPs with a mean call rate of 0.53% (±0.4322), only 220 SNPs with a call rate > 95% (Table S1) were selected for genetic diversity analyses. The informativeness of the selected 220 SNPs was first evaluated using mean PIC per SNP = 0.2387 (±0.1280), ranging from 0.0000 to 0.3750. Furthermore, the mean Ho per SNP was 0.2917 (±0.2221), which was close to the mean He per SNP = 0.2982 (±0.1742). In this study, NA = 435 were detected, with the average of 1.9773 alleles per SNP (Table S1).

3.2. SSR Discovery and Marker Informativeness

A total of 223,200 SSR motifs were identified from the contigs, with an average length of 121 bp within the range between 103 bp and 320 bp. Only 645 SSR motifs were flanked with a sufficient length of sequences for primer designs. The number was further reduced to 120 sequences due to a good predicted match of primer pairs. The 120 designed primers were tested in a subset of 8 samples representing 1 native and 7 introduced populations for amplifiability and polymorphism tests. The results show that 30 SSR consistently produced amplicons with the expected size, with only 8 SSR found to be polymorphic (Table 2), while the remaining 22 monomorphic SSR are shown in Table S2. In addition, these polymorphic SSR, which were successfully validated based on single pass sequencing results (Supplementary Figures S1–S8), were subjected to full genotyping, with 180 weevil samples representing 12 populations, as shown in Table 1.
A total of 50 alleles were detected in the sample pool, and the mean NA per SSR was 6.25 (±3.2) alleles, ranging from 4 to 13 alleles (Table 2). Moreover, the mean Ho per SSR = 0.2747 ±0.1614 was lower than the mean He per SSR = 0.5550 ±0.1934. The PIC per SSR was between the range of 0.2540 and 0.8024, with the mean of 0.5084 (±0.1928). All eight SSR markers indicated a significant deviation from the Hardy–Weinberg equilibrium (HWE) at p = 0.05 threshold (Table S3), but no strong linkage disequilibrium (LD) was detected among them (r2 < 0.5) (Table S4).
Unfortunately, null alleles were detected in all polymorphic SSR, with an NAF (null allele frequency) ranging from 0.0341 to 0.3361 and the mean of 0.1903 (±0.1243) (Table 2). There are three classes of NAF: negligible (NAF < 0.05), moderate (0.05 ≤ NAF < 0.20), or large (NAF ≥ 0.20 [46]. Thus, two SSR, namely SDPek_R0022 (0.0341) and SDPek_R0142 (0.0390), exhibited low NAF, followed by SDPek_R0139 (0.0970) and SDPek_R0064 (0.1495), with moderate frequencies. The remaining four SSR showed a high incidence of null alleles (NAF > 0.2).

3.3. Genetic Diversity Analysis

The genetic diversity of 12 assayed weevil populations was analyzed using 220 SNP and 8 SSR, respectively, to compare the applicability of both marker systems. For SNP, MAF of the CAM population (0.8222) and Ghana (GHN) population (0.9199) were higher than 0.7974 (±0.0409), which is the MAF mean (Table 3). This also resulted in a lower NA mean (CAM = 1.6500 and GHN = 1.3318), Ho (CAM = 0.2417 and GHN = 0.0992), and PIC (CAM = 0.1810 and GHN = 0.0833) in both native populations, compared to the mean values across populations (mean NA = 1.7690 ± 0.1535, Ho = 0.2914 ± 0.0649, and PIC = 0.2102 ± 0.0425). Interestingly, these four diversity parameters were comparable among the introduced populations and higher than the mean values across populations.
In Table 3, using SNP, all populations recorded a negative Fis, ranging from −0.1146 (Central West Kalimantan, CWK) to −0.0244 (CAM), except for the GHN population, with 0.0330. The SSR markers, however, indicated the opposite trend of Fis , in which all 12 populations showed positive values in the range between 0.2931 (South Sarawak, SS) to 0.5857 (North Peninsular Malaysia, NPM). As this was observed due to the presence of null alleles in some markers, allelic frequencies were then adjusted for re-estimation of SSR-based Fis. After the adjustment, the SSR-based Fis were reduced, ranging from −0.0246 (CAM) to 0.1173 (SS). Particularly, the SBH (−0.0024), CWK (−0.0030), and CAM (−0.0246) populations recorded negative Fis values (Table 3). As expected, all assayed populations carried more SSR alleles (mean NA ≥ 3.0000) than did the SNP, leading to a lower MAF mean (0.6370 ± 0.0665) (Table 3). However, the GHN population still retained the highest MAF, with 0.7542, and the lowest Ho, with 0.1500. More alleles per SSR conferred better segregations of Ho and PIC across the populations. For instance, the CWK and SBH populations indicated the highest Ho, with 0.3417, and the highest PIC, with 0.5142, respectively.

3.4. Genetic Distance Estimate and Clustering Analysis

In general, the mean pairwise Nei’s genetic distance (D) among the 12 assayed populations calculated using SNP and SSR was 0.0329 and 0.1330, respectively. The lowest genetic relatedness between the GHN population and the CAM population was revealed by SNP (D = 0.1452) and SSR (D = 0.3841). However, the closest related population pair detected in both marker systems varied: NPM-CPM pair (D = 0.011) in SNP and CPM-SR pair (D = 0.0367) in SSR. The genetic relatedness among populations was subsequently visualized in a UPGMA phenogram, based on D estimates (Figure 2). Both SNP and SSR mutually assigned the assayed 12 populations into 3 major divisions (2 lineages and 1 cluster): (i) GHN, (ii) CAM, and (iii) introduced populations in Malaysia and Indonesia (Figure 2A,C). Apparently, the introduced cluster was more genetically related to the native CAM than that of the native GHN. Besides similarity clustering, PCA plots (Figure 2B,D) further explained the resolution of both marker systems. The assayed SNP panel indicated a much higher resolution by tightly isolating the three clusters (Figure 2B), compared to assaying with the SSR panel (Figure 2D). The assayed 220 SNP provided a higher resolution than did the 8 SSR markers, with a reduced within-group variation and an increased between-group distance.

4. Discussion

4.1. Nuclear DNA Marker Development and Informativeness

RAD tag sequencing is a cost-effective and rapid method for discovering DNA markers, especially for non-model species with no prior genome references [47,48,49,50,51]. The method enabled the identification of 19,148 SNP and 223,200 SSR motifs from 48 weevils. Normally, SNPs were expected to be more abundant throughout the genome of most species compared to SSR [52,53], but this was not observed in this study. The identification of SNP is dependent on the presence of nucleotide variants across individuals; therefore, overrepresentation of the introduced populations in Malaysia and Indonesia might limit the genetic polymorphism. Hence, the inclusion of a wider collection of native populations is recommended. For example, in the SNP discovery to study the population structure of the Asian longhorned beetle, the longhorned beetle samples from a wide coverage area in China and Korea were used [54].
Although discovery of SSR is more feasible, even with a single reference genome, only 0.28% of the identified SSR in this study were suitable for primer design. This was due to short generated RAD tag sequences, which usually spanned between 150 bp and 300 bp [55], thereby limiting the availability of flanking sequence for primer designs. A similar experience was reported in a previous study: only 237 out of 94,851 reads with SSR motif (0.24%) of an endangered yew species, Taxus florinii, were suitable for primer design [56]. In that study, the generated RAD tag sequences were on average 141 bp. Besides, in the absence of the E. kamerunicus reference genome, the identification of weevil-specific SNPs and SSRs, based on cross-mapping with reference genomes of the order Coleoptera, was found to be promising. A similar SNP marker discovery using RAD tag sequencing for a coconut leaf beetle without a reference genome was reported in [57]. As an example in plants, , the conserved DNA markers of the oil palm has successfully improved the genome assemblies in two other economically important palm species: the coconut and date palms [58].
Despite the reduced number of DNA markers, 220 SNP and 8 SSR were found to be informative. Markers with PIC values ≥0.5 are usually considered as highly informative, 0.25–0.5 as moderate, and <0.25 as low [35]. For bi-allelic SNP, the maximum PIC value per marker can only achieve 0.345 [59]. This explains why multi-allelic SSRs generally exhibited a higher PIC than that of the assayed SNP. The constrained informativeness of SNP, nevertheless, can always be compensated by using a large number of SNPs with high genotyping throughput. In this study, the NA of SNP = 435 was observed to be 8.7 times higher than the NA of SSR at 50.

4.2. Applications of Weevil-Specific SNP and SSR in Genetic Diversity Assessment

Both SNP and SSR mutually assigned 12 assayed populations into three major divisions: 2 lineages namely (i) GHN, (ii) CAM, and a cluster of (iii) introduced populations in Malaysia and Indonesia. With a higher mean NA of multi-allelic SSR, the marker system had more advantages in explaining within-cluster variation than inter-clusters (Figure 2C,D). Previous studies reported that SSR was far more informative than SNP for population structure inference, where a randomly chosen set of SSRs would have 4–12 times greater informativeness than a random chosen set of SNPs [60,61]. For example, a total of 20 SSR markers were reported as useful for measuring gene heterozygosity and elucidating the population genetic structure of the adzuki bean weevil [62]. However, as mentioned earlier, SNP has a higher genotyping throughput compared to SSR, to compensate for the shortcoming. This feature enabled the assayed 220 SNP to confer better PCA clustering among the three groups, with reduced within-group variation and increased between-group distance. Similarly, a total of 239 SNP of red mangrove species effectively provided greater resolution on the population structure compared to the 8 SSR markers [63].
From the UPGMA phenogram, the cluster of the introduced populations in Malaysia and Indonesia was more closely related to CAM than GHN, which corresponded to the introduction event of the weevil from Cameroon to Southeast Asia. Furthermore, separate genetic clustering between Ghana, Central Africa, and Cameroon, West Africa, is in agreement with that obtained in a previous study reporting the difference in mitochondrial genetic clustering of E. kamerunicus between Central and West Africa [21].
Interestingly, 11 weevil populations indicated excessive SNP-based heterozygosity (Fis < 0), suggesting an absence of inbreeding in these populations. These findings might further support those of previous studies. Morphometric measurements of E. kamerunicus in Malaysia and Indonesia were not significantly different from those of the native Liberia weevils [64]. The population abundance of E. kamerunicus has also been reported to be stable [65]. It is worth noting that both SNP and SSR revealed unexpectedly poorer PIC and Ho in native GHN and CAM populations compared to the introduced populations, leading to a homozygosity deficit (Fis > 0) in the CAM population. One possible reason for this was non-random sampling, where these samples were collected from a limited number of palms, thereby not representing the heterozygosity of the real populations. Hence, random sampling from a wider range of the native population is recommended in future studies.
On the other hand, SSR-based Fis showed the total opposite results, with all populations recording positive values, suggesting excess homozygosity which can be partly due to the presence of null alleles [66]. Unfortunately, null alleles with high NAF > 0.2 were present in half of the assayed SSR (Table 2). Despite the reduction in the adjusted SSR-based Fis, most values remained positive across the populations (Table 3). A null allele event, which was first reported in 1993, can occur when alleles fail to amplify due to the mutated annealing sites of SSR primers, leading to incorrect scoring of two different alleles as one allele [67]. This means that the genotyping of the samples may appear homozygous rather than heterozygous for the null allele. Certainly, underestimation of the heterozygosity parameter, such as Wright’s inbreeding estimates, can arise [68,69,70]. For instance, the estimated Fis of the oriental fruit moth populations derived from 10 SSR with high NAF was reported to be two to three times higher than the estimation derived from another set of 10 SSR with low NAF [71]. In this study, it is suggested that the existence of null allele in SSR has led to overestimation of homozygosity of assayed weevil populations, hence contributing to the positive values of SSR-based Fis. Consequently, the generated positive Fis does not correctly represent inbreeding occurrence in both native and introduced E. kamerunicus populations.
The presence of null alleles predicates the need to redesign the primers [70], but that was not feasible in this study. The length of the flanking region of RAD tag-SSR was extremely limited. Therefore, RAD tag-derived SNP was clearly more advantageous over RAD tag-derived SSR markers. The sequencing of the weevil genome as a reference is recommended to minimize the presence of null alleles, as illustrated in the Asian honeybee [72]. The SSR markers developed in this study can be deployed across other oil palm-pollinating weevil, especially from the Elaeidobius genus, i.e., E. plagiatus, E. singularis, E. bilineatus, E. subvittatus, and E. spatilifer. Closely related species would be expected to share more conserved sequence sites [73,74]. If the sequence is conserved between the model and the species of interest, the primers that flank the conserve regions will have a higher probability of amplifying DNA fragments in the species of interest [73]. Many successful cross-taxa applications of DNA markers, such as planthopper Laodelphax striatellus, with its related taxa [75], Diabrotica beetles [76], and oil palm Elaeis guineensis, with its close relative E. oleifera [77], were reported.
These 220 informative SNP markers can be deployed as a PCR-based genotyping method, such as Seq-SNP (LGC Genomics, Berlin, Germany), Kompetitive Allele Specific PCR (KASP) (LGC Genomics, Hoddesdon, UK) [78] and TaqMan® (Applied Biosystem, Massachusetts, USA) to obtain consistent and high call rates for more reliable results in future studies. In addition, PCR-based SNP markers can be cost-effective when genotyping a large number of samples. The Seq-SNP genotyping takes 3 days and costs USD 7 for each DNA sample genotyped with 163 SNPs in 261 cucumber varieties [25]. This feature is crucial for other researchers to extend similar surveys in different areas of interest and eventually, to form a comprehensive genetic diversity profile of weevil populations across Africa and Southeast Asia.

5. Conclusions

The discovery of nuclear DNA markers for E. kamerunicus using RAD tag sequencing was promising. The applicability of informative SNP in the genetic diversity assessment of weevil populations was observed to be superior to SSR. The main limitation of SSR is the null allele, which is common in RAD tag sequences. The presence of null allele in SSR markers from this study has led to an upward bias in inbreeding coefficient estimates. The inbreeding coefficient that was estimated using SNP-based data has recorded negative values in introduced E. kamerunicus populations indicating inbreeding is not being observed for now. For further confirmation, we propose expanding the survey to more populations across Malaysia, Indonesia, and Papua New Guinea using the developed specific nuclear DNA markers. Collaborative studies can be performed among industry players in oil palm-growing countries. This would obtain a more comprehensive genetic diversity concerning weevil populations in these regions. The information will be used to create guidelines regarding conservation planning and sustainability monitoring of this important pollinator species, from the estate to the national level.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/insects14020157/s1, Figures S1–S8: Validation of 8 polymorphic SSR markers through Sanger sequencing; Table S1: List of 220 SNP markers with primer sequences; Table S2: List of 22 monomorphic markers with primer sequences; Table S3: Hardy–Weinberg Equilibrium (HWE) of 8 SSR markers; Table S4: Pairwise calculation of linkage disequilibrium (LD) for 8 SSR markers.

Author Contributions

Conceptualization, C.-K.T. and A.-L.O.; methodology, F.F.M.R., N.A.S., H.-Y.H., N.A., S.A. and B.S.; validation, F.F.M.R., H.-Y.H. and M.O.-A.; formal analysis, F.F.M.R., N.A.S. and H.-Y.H.; investigation, F.F.M.R., N.A.S., A.-L.O., H.-Y.H., S.Y., M.O.-A., N.A., S.A. and B.S.; resources, M.O.-A., B.S., S.Y., S.A., N.A., N.M.-A., D.R.A. and H.K.; data curation, F.F.M.R., N.A.S. and H.-Y.H.; software, A.-L.O. and N.M.-A.; writing—original draft preparation, F.F.M.R., N.A.S. and H.-Y.H.; writing—review and editing, C.-K.T., A.-L.O., S.Y., N.M.-A., M.O.-A., N.A., S.A., B.S., D.R.A. and H.K.; visualization, F.F.M.R., N.A.S., C.-K.T., A.-L.O. and H.-Y.H.; supervision, C.-K.T., A.-L.O., S.Y., N.M.-A., D.R.A. and H.K.; project administration, F.F.M.R. and C.-K.T.; funding acquisition, D.R.A., H.K. and C.-K.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was fully supported by internal grants from Sime Darby Plantation, Malaysia.

Data Availability Statement

The datasets generated for this study can be found in the European Nucleotide Archive (ENA) with Accession No. GCA_945836925. A list of selected SNP markers has been deposited in the European Variation Archive (EVA), with Accession No. PRJEB55362.

Acknowledgments

We would like to acknowledge the contribution of the Crop Protection Section, the Sime Darby Plantation R&D Centre, the Advanced Biotechnology and Breeding Centre, MPOB, Crop Protection Applied Agricultural Resources (AAR), and the Minamas Research Centre, Indonesia, for sample collection, as well as input for discussion during data interpretation. We would also like to thank the Molecular Breeding Lab and the Plant Functional Lab for lending equipment for lab processes.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pamin, K.A. A hundred and fifty years of oil palm in Indonesia: From the Bogor Botanical Garden to the industry. In Proceedings of the International Oil Palm Conference ‘Commodity of the Past, Today and the Future’, Bali, Indonesia, 23–25 September 1998; pp. 3–23. [Google Scholar]
  2. Corley, R.H.; Tinker, P. Chapter 1. The Origin and Development of the Oil Palm Industry. In The Oil Palm, 4th ed.; Blackwell Science Ltd.: Oxford, UK, 2003; pp. 1–26. [Google Scholar]
  3. Parveez, G.K.A. Oil palm economic performance in Malaysia and R&D progress in 2020. J. Oil Palm Res. 2021, 33, 181–214. [Google Scholar] [CrossRef]
  4. Syed, R.A. Studies on oil palm pollination by insects. Bull. Entomol. Res. 1979, 69, 213–224. [Google Scholar] [CrossRef]
  5. Hardon, J. Assisted pollination in oil palm: A review. In Advances in Oil Palm Cultivation; Wastie, R.L., Earp, D.A., Eds.; Incorporated Society of Planters: Kuala Lumpur, Malaysia, 1973; pp. 184–195. [Google Scholar]
  6. Wahid, M.B. Developments of the Oil Palm Pollinator, Elaiedobius kamerunicus in Malaysia. Palm Oil Dev. 1984, 2, 1–3. [Google Scholar]
  7. Syed, R. Insect Pollination of Oil Palm: Feasibility of Introducing Elaeidobius Spp. into Malaysia. In Oil Palm in the Eighties. A Report of the Proceedings of the International Conference on Oil Palm in the Eighties; Kuala Lumpur (Malaysia) Inc. Society of Planters: Kuala Lumpur, Malaysia, 1982; pp. 263–289. [Google Scholar]
  8. Kang, S.; Zam, A. Quarantine aspects of the introduction into Malaysia of an oil palm insect pollinator. In Proceedings of the International Conference on Plant Protection in the Tropics, Kuala Lumpur, Malaysia, 1–4 March 1982; pp. 615–626. [Google Scholar]
  9. Tuo, Y.; Koua, H.K.; Hala, N. Biology of Elaeidobius kamerunicus and Elaeidobius plagiatus (Coleoptera: Curculionidae) main pollinators of oil palm in West Africa. Eur. J. Sci. Res. 2011, 49, 426–432. [Google Scholar]
  10. Siswanto; Soetopo, D. Population of oil palm pollinator insect (Elaeidobius kamerunicus Faust.) at PTP Nusantara VIII Cisalak Baru, Rangkasbitung-Banten. IOP Conf. Ser. Earth Environ. Sci. 2020, 418, 12045. [Google Scholar] [CrossRef]
  11. Corley, R.H.V.; Tinker, P.B. The Oil Palm, 5th ed.; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2015. [Google Scholar]
  12. Syed, R.A.; Law, I.H.; Corley, R.H. Insect pollination of oil palm: Introduction, establishment and pollinating efficiency of Elaeidobius kamerunicus in Malaysia. Planter 1982, 58, 547–561. [Google Scholar]
  13. Mariau, D.; Genty, P. IRHO contribution to the study of oil palm insect pollinators in Africa, South America and Indonesia. Oléagineux 1988, 43, 233–240. [Google Scholar]
  14. Moslim, R.; Kamarudin, N. Comparative Traits of Pollinating Weevils and Factors Affecting Its Population in Malaysia. In Task Force Oil Palm Pollinating Weevil and Fruit Set; Malaysian Palm Oil Board: Bangi, Selangor, Malaysia, 2016. [Google Scholar]
  15. Donough, C.; Chew, K.; Law, I. Effect of fruit set on OER and KER-results from studies at pamol estates. Plant 1996, 72, 203–219. [Google Scholar]
  16. Rao, V.; Law, I. The problem of poor fruitset in parts of East Malaysia. Plant 1998, 74, 463–483. [Google Scholar]
  17. Ming, K.S. The Elaeidobius kamerunicus story. Plant 1999, 75, 143–150. [Google Scholar]
  18. Krantz, G.W.; Poinar, G.O. Mites, nematodes and the multimillion dollar weevil. J. Nat. Hist. 2004, 38, 135–141. [Google Scholar] [CrossRef]
  19. Nasir, D.M.; Mamat, N.-S.; Abdul Muneim, N.A.; Ong-Abdullah, M.; Abd Latip, N.F.B.; Su, S.; Hazmi, I.R.B. Morphometric Analysis of the Oil Palm Pollinating Weevil, Elaeidobius kamerunicus (Faust, 1878) (Coleoptera: Curculionidae) from Oil Palm Plantations in Malaysia. J. Entomol. Res. Soc. 2020, 22, 275–291. [Google Scholar] [CrossRef]
  20. Sukhodolskaya, R. Intraspecific Body Size Variation in Ground Beetles (Coleoptera, Carabidae) in Urban-Suburban-Rural-Natural Gradient. Acta Biol. Univ. Daugavp 2013, 13, 121–128. [Google Scholar]
  21. Haran, J.; Ndzana Abanda, R.F.X.; Benoit, L.; Bakoumé, C.; Beaudoin-Ollivier, L. Multilocus phylogeography of the world populations of Elaeidobius kamerunicus (Coleoptera, Curculionidae), pollinator of the palm Elaeis guineensis. Bull. Entomol. Res. 2020, 110, 654–662. [Google Scholar] [CrossRef]
  22. Ballard, J.W.O.; Whitlock, M.C. The incomplete natural history of mitochondria. Mol. Ecol. 2004, 13, 729–744. [Google Scholar] [CrossRef]
  23. Qin, Y.; Buahom, N.; Krosch, M.N.; Du, Y.; Wu, Y.; Malacrida, A.R. Genetic diversity and population structure in Bactrocera correcta (Diptera: Tephritidae) inferred from mtDNA cox1 and microsatellite markers. Sci. Rep. 2016, 6, 38476. [Google Scholar] [CrossRef]
  24. Yang, X.; Xu, Y.; Shah, T.; Li, H.; Han, Z.; Li, J.; Yan, J. Comparison of SSRs and SNPs in assessment of genetic relatedness in maize. Genetica 2011, 139, 1045–1054. [Google Scholar] [CrossRef]
  25. Zhang, J.; Yang, J.; Zhang, L.; Luo, J.; Zhao, H.; Zhang, J.; Wen, C. A new SNP genotyping technology Target SNP-seq and its application in genetic analysis of cucumber varieties. Sci. Rep. 2020, 10, 5623. [Google Scholar] [CrossRef] [Green Version]
  26. Li, W.; Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22, 1658–1659. [Google Scholar] [CrossRef]
  27. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef]
  28. Garrison, E.P.; Marth, G.T. Haplotype-based variant detection from short-read sequencing. arXiv 2012, arXiv:1207.3907. [Google Scholar]
  29. The NCBI C++ Toolkit; National Center for Biotechnology Information, U.S. National Library of Medicine: Bethesda, MD, USA, 2004.
  30. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  31. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [PubMed]
  32. Melville, J.; Haines, M.L.; Boysen, K.; Hodkinson, L.; Kilian, A.; Smith Date, K.L.; Potvin, D.A.; Parris, K.M. Identifying hybridization and admixture using SNPs: Application of the DArTseq platform in phylogeographic research on vertebrates. R. Soc. Open Sci. 2017, 4, 161061. [Google Scholar] [CrossRef] [PubMed]
  33. Metz, S.; Cabrera, J.M.; Rueda, E.; Giri, F.; Amavet, P. FullSSR: Microsatellite Finder and Primer Designer. Adv. Bioinform. 2016, 2016, 6040124. [Google Scholar] [CrossRef]
  34. Schuelke, M. An economic method for the fluorescent labeling of PCR fragments A poor man ’ s approach to genotyping for research and high-throughput diagnostics. Nat. Biotechnol. 2000, 18, 233–234. [Google Scholar] [CrossRef] [PubMed]
  35. Botstein, D.; White, R.L.; Skolnick, M.; Davis, R.W. Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am. J. Hum. Genet. 1980, 32, 314–331. [Google Scholar] [PubMed]
  36. Wright, S. The interpretation of population structure by F-statistics with special regard to systems of mating. Evolution 1965, 19, 395–420. [Google Scholar] [CrossRef]
  37. Liu, K.; Muse, S.V. PowerMarker: An integrated analysis environment for genetic marker analysis. Bioinform. Appl. Notes 2005, 21, 2128–2129. [Google Scholar] [CrossRef] [Green Version]
  38. Nei, M.; Tajima, F.; Tateno, Y. Accuracy of estimated phylogenetic trees from molecular data. J. Mol. Evol. 1983, 19, 153–170. [Google Scholar] [CrossRef]
  39. Page, R.D.M. Tree View: An application to display phylogenetic trees on personal computers. Bioinformatics 1996, 12, 357–358. [Google Scholar] [CrossRef]
  40. Jombart, T. adegenet: A R package for the multivariate analysis of genetic markers. Bioinformatics 2008, 24, 1403–1405. [Google Scholar] [CrossRef]
  41. Goudet, J. HIERFSTAT, a package for R to compute and test hierarchical F-statistics. Mol. Ecol. Notes 2005, 4, 184–186. [Google Scholar] [CrossRef]
  42. Paradis, E. Pegas: An R Package for Population Genetics with an Integrated-Modular Approach. Bioinformatics 2010, 26, 419–420. [Google Scholar] [CrossRef] [PubMed]
  43. Kamvar, Z.N.; López-Uribe, M.M.; Coughlan, S.; Grünwald, N.J.; Lapp, H.; Manel, S. Developing educational resources for population genetics in R: An open and collaborative approach. Mol. Ecol. Resour. 2017, 17, 120–128. [Google Scholar] [CrossRef] [PubMed]
  44. Van Oosterhout, C.; Hutchinson, W.F.; Wills, D.P.M.; Shipley, P. MICRO-CHECKER: Software for identifying and correcting genotyping errors in microsatellite data. Mol. Ecol. Notes 2004, 4, 535–538. [Google Scholar] [CrossRef]
  45. Brookfield, J.F. A simple new method for estimating null allele frequency from heterozygote deficiency. Mol. Ecol. 1996, 5, 453–455. [Google Scholar] [CrossRef] [PubMed]
  46. Chapuis, M.-P.; Estoup, A. Microsatellite Null Alleles and Estimation of Population Differentiation. Mol. Biol. Evol. 2007, 24, 621–631. [Google Scholar] [CrossRef] [PubMed]
  47. Peterson, B.K.; Weber, J.N.; Kay, E.H.; Fisher, H.S.; Hoekstra, H.E. Double Digest RADseq: An Inexpensive Method for De Novo SNP Discovery and Genotyping in Model and Non-Model Species. PLoS ONE 2012, 7, e37135. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Miller, M.R.; Dunham, J.P.; Amores, A.; Cresko, W.A.; Johnson, E.A. Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers. Genome Res. 2007, 17, 240–248. [Google Scholar] [CrossRef] [PubMed]
  49. Baird, N.A.; Etter, P.D.; Atwood, T.S.; Currey, M.C.; Shiver, A.L.; Lewis, Z.A.; Selker, E.U.; Cresko, W.A.; Johnson, E.A. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE 2008, 3, e3376. [Google Scholar] [CrossRef]
  50. Ogden, R.; Gharbi, K.; Mugue, N.; Martinsohn, J.; Senn, H.; Davey, J.W.; Pourkazemi, M.; McEwing, R.; Eland, C.; Vidotto, M.; et al. Sturgeon conservation genomics: SNP discovery and validation using RAD sequencing. Mol. Ecol. 2013, 22, 3112–3123. [Google Scholar] [CrossRef]
  51. Pavinato, V.A.C.; Margarido, G.R.A.; Wijeratne, A.J.; Wijeratne, S.; Meulia, T.; Souza, A.P.; Michel, A.P.; Zucchi, M.I. Restriction site associated DNA (RAD) for de novo sequencing and marker discovery in sugarcane borer, Diatraea saccharalis Fab. (Lepidoptera: Crambidae). Mol. Ecol. Resour. 2017, 17, 454–465. [Google Scholar] [CrossRef]
  52. Guichoux, E.; Lagache, L.; Wagner, S.; Chaumeil, P.; Léger, P.; Lepais, O.; Lepoittevin, C.; Malausa, T.; Revardel, E.; Salin, F.; et al. Current trends in microsatellite genotyping. Mol. Ecol. Resour. 2011, 11, 591–611. [Google Scholar] [CrossRef]
  53. Chen, W.; Hou, L.; Zhang, Z.; Pang, X.; Li, Y. Genetic Diversity, Population Structure, and Linkage Disequilibrium of a Core Collection of Ziziphus jujuba Assessed with Genome-wide SNPs Developed by Genotyping-by-sequencing and SSR Markers. Front. Plant Sci. 2017, 8, 575. [Google Scholar] [CrossRef]
  54. Cui, M.; Wu, Y.; Javal, M.; Giguère, I.; Roux, G.; Andres, J.A.; Keena, M.; Shi, J.; Wang, B.; Braswell, E.; et al. Genome-scale phylogeography resolves the native population structure of the Asian longhorned beetle, Anoplophora glabripennis (Motschulsky). Evol. Appl. 2022, 15, 934–953. [Google Scholar] [CrossRef] [PubMed]
  55. Andrews, K.R.; Good, J.M.; Miller, M.R.; Luikart, G.; Paul, A.; Sciences, W.; Avenue, O.S.; Station, L.B.; Group, W.G.; Studies, E. Harnessing the power of RADseq for ecological and evolutionary genomics. Nat. Rev. Genet. 2016, 17, 81–92. [Google Scholar] [CrossRef]
  56. Qin, H.; Yang, G.; Provan, J.; Liu, J.; Gao, L. Using MiddRAD-seq data to develop polymorphic microsatellite markers for an endangered yew species. Plant Divers. 2017, 39, 294–299. [Google Scholar] [CrossRef]
  57. Chen, Z.; Wang, G.; Li, M.; Peng, Z.; Ali, H.; Xu, L.; Gurr, G.M.; Hou, Y. Development of Single Nucleotide Polymorphism (SNP) Markers for Analysis of Population Structure and Invasion Pathway in the Coconut Leaf Beetle Brontispa longissima (Gestro) Using Restriction Site-Associated DNA (RAD) Genotyping in Southern China. Insects 2020, 11, 230. [Google Scholar] [CrossRef] [Green Version]
  58. Ong, A.-L.; Teh, C.-K.; Mayes, S.; Massawe, F.; Appleton, D.R.; Kulaveerasingam, H. An Improved Oil Palm Genome Assembly as a Valuable Resource for Crop Improvement and Comparative Genomics in the Arecoideae Subfamily. Plants 2020, 9, 1476. [Google Scholar] [CrossRef]
  59. Hildebrand, C.E.; Torney, D.C.; Wagner, R.P. Informativeness of Polymorphic DNA Markers. Los Alamos Sci. 1992, 20, 100–102. [Google Scholar]
  60. Rosenberg, N.A.; Li, L.M.; Ward, R.; Pritchard, J.K. Informativeness of genetic markers for inference of ancestry. Am. J. Hum. Genet. 2003, 73, 1402–1422. [Google Scholar] [CrossRef] [PubMed]
  61. Liu, N.; Chen, L.; Wang, S.; Oh, C.; Zhao, H. Comparison of single-nucleotide polymorphisms and microsatellites in inference of population structure. BMC Genet. 2005, 6 (Suppl. 1), S26. [Google Scholar] [CrossRef]
  62. Duan, C.; Li, D.; Sun, S.; Wang, X.; Zhu, Z. Rapid Development of Microsatellite Markers for Callosobruchus chinensis Using Illumina Paired-End Sequencing. PLoS ONE 2014, 9, e95458. [Google Scholar] [CrossRef]
  63. Hodel, R.G.J.; Chen, S.; Payton, A.C.; McDaniel, S.F.; Soltis, P.; Soltis, D.E. Adding loci improves phylogeographic resolution in red mangroves despite increased missing data: Comparing microsatellites and RAD-Seq and investigating loci filtering. Sci. Rep. 2017, 7, 17598. [Google Scholar] [CrossRef] [PubMed]
  64. Latip, N.F.A.; Ghani, I.A.; Hazmi, I.R.; Cik Mohd Rizuan Zainal Abidin, D.S. Morphometric Comparison of the Oil Palm Pollinator Elaeidobius kamerunicus Faust (Coleoptera: Curculionidae) from Malaysia, Indonesia, and Liberia. Coleopt. Bull. 2019, 73, 746–756. [Google Scholar] [CrossRef]
  65. Nurul Fatihah, A.L.; Muhamad Fahmi, M.H.; Luqman, H.A.; Syarifah Nadiah, S.M.D.; Teo, T.M.; Izfa Riza, H.; Idris, A.B. Effects of Rainfall, Number of Male Inflorescences and Spikelets on the Population Abundance of Elaeidobius kamerunicus (Coleoptera: Curculionidae). Sains Malays. 2019, 48, 15–21. [Google Scholar] [CrossRef]
  66. Pompanon, F.; Bonin, A.; Bellemain, E.; Taberlet, P. Genotyping errors: Causes, consequences and solutions. Nat. Rev. Genet. 2005, 6, 847–859. [Google Scholar] [CrossRef]
  67. Callen, D.F.; Thompson, A.D.; Shen, Y.; Phillips, H.A.; Richards, R.I.; Mulley, J.C.; Sutherland, G.R. Incidence and origin of “null” alleles in the (AC)n microsatellite markers. Am. J. Hum. Genet. 1993, 52, 922–927. [Google Scholar]
  68. Chybicki, I.J.; Oleksa, A.; Burczyk, J. Increased inbreeding and strong kinship structure in Taxus baccata estimated from both AFLP and SSR data. Heredity 2011, 107, 589–600. [Google Scholar] [CrossRef]
  69. Kelly, A.C.; Mateus-Pinilla, N.E.; Douglas, M.; Douglas, M.; Shelton, P.; Novakofski, J. Microsatellites behaving badly: Empirical evaluation of genotyping errors and subsequent impacts on population studies. Genet. Mol. Res. 2011, 10, 2534–2553. [Google Scholar] [CrossRef]
  70. Wu, X.; Wang, L.; Zhang, D.; Wen, Y. Microsatellite null alleles affected population genetic analyses: A case study of Maire yew (Taxus chinensis var. mairei). J. For. Res. 2019, 24, 230–234. [Google Scholar] [CrossRef]
  71. Song, W.; Cao, L.-J.; Wang, Y.-Z.; Li, B.-Y.; Wei, S.-J. Novel microsatellite markers for the oriental fruit moth Grapholita molesta (Lepidoptera: Tortricidae) and effects of null alleles on population genetics analyses. Bull. Entomol. Res. 2017, 107, 349–358. [Google Scholar] [CrossRef] [PubMed]
  72. Liu, L.; Qin, M.; Yang, L.; Song, Z.; Luo, L.; Bao, H.; Ma, Z.; Zhou, Z.; Xu, J. A genome-wide analysis of simple sequence repeats in Apis cerana and its development as polymorphism markers. Gene 2017, 599, 53–59. [Google Scholar] [CrossRef]
  73. Wang, M.; Barkley, N.; Jenkins, T. Microsatellite Markers in Plants and Insects. Part I: Applications of Biotechnology. Genes Genomes Genom. 2009, 3, 54–67. [Google Scholar]
  74. Ding, S.; Wang, S.; He, K.; Jiang, M.; Li, F. Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects. BMC Genom. 2017, 18, 848. [Google Scholar] [CrossRef]
  75. Liu, Y.-D.; Zhang, B.; Hou, M. Thirteen microsatellite loci for Laodelphax striatellus and cross amplification in related taxa. Entomol. Sci. 2014, 17. [Google Scholar] [CrossRef]
  76. Waits, E.R.; Stolz, U.W.E. Polymorphic microsatellite loci from northern and Mexican corn rootworms (Insecta: Coleoptera: Chrysomelidae) and cross-amplification with other Diabrotica spp. Mol. Ecol. Resour. 2008, 8, 707–709. [Google Scholar] [CrossRef]
  77. Billotte, N.; Risterucci, A.M.; Barcelos, E.; Noyer, J.L.; Amblard, P.; Baurens, F.C. Development, characterisation, and across-taxa utility of oil palm (Elaeis guineensis Jacq.) microsatellite markers. Genome 2001, 44, 413–425. [Google Scholar] [CrossRef]
  78. Alvarez-Fernandez, A.; Bernal, M.J.; Fradejas, I.; Martin Ramírez, A.; Md Yusuf, N.A.; Lanza, M.; Hisam, S.; Pérez de Ayala, A.; Rubio, J.M. KASP: A genotyping method to rapid identification of resistance in Plasmodium falciparum. Malar. J. 2021, 20, 16. [Google Scholar] [CrossRef]
Figure 1. A flowchart of SNP filtration steps. The number in brackets at each filtration step represents the number of markers.
Figure 1. A flowchart of SNP filtration steps. The number in brackets at each filtration step represents the number of markers.
Insects 14 00157 g001
Figure 2. Phenogram and genetic clustering of 180 E. kamerunicus collected from 12 populations using (A,B) 220 SNP and (C,D) 8 SSR. Both SNP and SSR mutually assigned the assayed 12 populations into 3 major divisions (2 lineages and 1 cluster): (i) GHN, (ii) CAM, and (iii) introduced populations in Malaysia and Indonesia. GHN = Ghana, CAM = Cameroon, SS = South Sarawak, SR = South Riau, NR = North Riau, CPM = Central Peninsular Malaysia, NPM = North Peninsular Malaysia, CWK = Central West Kalimantan, SK = South Kalimantan, SPM = South Peninsular Malaysia, NS = North Sarawak, SBH = Sabah, East Malaysia.
Figure 2. Phenogram and genetic clustering of 180 E. kamerunicus collected from 12 populations using (A,B) 220 SNP and (C,D) 8 SSR. Both SNP and SSR mutually assigned the assayed 12 populations into 3 major divisions (2 lineages and 1 cluster): (i) GHN, (ii) CAM, and (iii) introduced populations in Malaysia and Indonesia. GHN = Ghana, CAM = Cameroon, SS = South Sarawak, SR = South Riau, NR = North Riau, CPM = Central Peninsular Malaysia, NPM = North Peninsular Malaysia, CWK = Central West Kalimantan, SK = South Kalimantan, SPM = South Peninsular Malaysia, NS = North Sarawak, SBH = Sabah, East Malaysia.
Insects 14 00157 g002
Table 1. Localities of 12 assayed E. kamerunicus populations in Malaysia, Indonesia, Ghana, and Cameroon.
Table 1. Localities of 12 assayed E. kamerunicus populations in Malaysia, Indonesia, Ghana, and Cameroon.
No.PopulationPop. CodeEstateLatitudeLongitudeSample Size
Native Population
1CameroonCAMBuea5.29165329.420852415
2GhanaGHNKusi6.1104472−0.267379515
Introduced population
3South Riau, IndonesiaSRBhumireksa Nusasejati0.1172028103.605408315
4North Riau, IndonesiaNRMenggala2.8770875103.728783215
5Central West Kalimantan, IndonesiaCWKSekunyir−2.4329821112.00017215
6South Kalimantan, IndonesiaSKAngsana3.6063617115.589720615
7North Peninsular MalaysiaNPMSungai Dingin5.3678468100.706380315
8Central Peninsular MalaysiaCPMWest3.4032204101.401307315
9South Peninsular MalaysiaSPMYong Peng2.0160711102.996010915
10South Sarawak, East MalaysiaSSSessang1.8433571111.210160715
11North Sarawak, East MalaysiaNSBayu3.3757040113.410991515
12Sabah, East MalaysiaSBHTiger4.4069250117.817230915
Total180
Table 2. Eight polymorphic SSR and their marker characteristics.
Table 2. Eight polymorphic SSR and their marker characteristics.
No.SSR MarkerPrimer Sequence (5′-3′)Repeat MotifSize Range (bp)Ta (°C)MAFNANGPICNAF
1SDPek_R0022F: GCCTATAATAGACCGTTTGG;
R: GTGAAGAACATTGTAACATCTC
(AAC)6124–136560.43066.0015.000.66510.0341
2SDPek_R0032F: CGCTCTCCTCCTCATTATCA;
R: TCTCTTGCATCTAGGTAACG
(AG)6168–174540.73065.008.000.39370.2875
3SDPek_R0064F:GTCACCAATAAGTTCCAAAGC;
R: GTGGTGTTGGTGAACCTGAT
(CTGCA)4185–200560.83894.007.000.25400.1495
4SDPek_R0079F: TATTTGGATGTATTTCGGTTTG;
R: GCGAGTATTTGTAGCGATC
(T)10146–150540.39399.0017.000.66950.2875
5SDPek_R0082F: TCTCAAGGTGGCTCTCAT;
R: CACATCTATCCGCACTACA
(T)13117–130560.296113.0027.000.80240.2917
6SDPek_R0139F: CTCCAGTTATAGTACCACAATG;
R: CGACTCGGCTCTTGTATT
(AG)6131–139540.53895.0011.000.54010.0970
7SDPek_R0142F: CCTAAATAAGGACCACCCTA;
R: CGACCTGTTAGCCTCTAC
(AT)5132–140540.55594.006.000.42940.0390
8SDPek_R0147F: TTGGTTCTATCGAGTAATGC;
R: GCAATTTACCTCCAATGACA
(ATG)5144–156540.79444.008.000.31310.3361
Mean0.57246.2512.380.50840.1903
Standard deviation0.19823.27.090.19280.1243
Ta—annealing temperature, MAF—major allele frequency, PIC—polymorphism information content, NA—number of alleles, NG—number of genotypes, NAF—null allele frequency (null allele frequency is based on Brookfield 1 estimates [45].
Table 3. Genetic diversity parameters of 180 samples in 12 populations using 220 SNP and 8 SSR.
Table 3. Genetic diversity parameters of 180 samples in 12 populations using 220 SNP and 8 SSR.
Pop. CodeNSNPSSR
MAFMean NAHeHoPICFisMAFMean NAHeHoPICFisAdjusted Fis
SR150.78491.80000.27950.31250.2226−0.08370.68333.25000.42360.25000.37470.43810.0110
NS150.77481.84090.29040.31810.2312−0.06080.60003.50000.48780.30830.44540.39730.0501
CAM150.82221.65000.22820.24170.1810−0.02440.65423.00000.44970.25830.39450.4534 −0.0246
CWK150.77641.84550.29080.33390.2316−0.11460.56253.37500.52830.34170.47320.3831−0.0030
GHN150.91991.33180.10350.09920.08330.03300.75423.12500.33750.15000.31310.57890.1042
NR150.78601.83640.27990.30730.2236−0.06370.64173.25000.45080.28330.39310.40090.1470
NPM150.78041.84090.28860.32220.2306−0.08220.55003.87500.55250.24170.50370.58570.0667
SK150.77591.83640.29080.31790.2313−0.05880.66253.50000.44310.30830.40490.33500.1154
SS150.79471.79550.26840.30310.2147−0.09550.64233.37500.45510.33630.39880.29310.1173
SBH150.78451.84090.28030.30550.2241−0.05530.52503.87500.56500.25830.51420.5667−0.0024
CPM150.77421.84090.29500.33190.2352−0.09110.69583.37500.42420.25830.38080.41980.0392
SPM150.79471.78640.26730.30340.2131−0.10080.67233.12500.43080.30360.38120.32690.0623
Mean0.79741.76900.26360.29140.2102−0.06650.63703.38540.46240.27480.41480.43160.0569
Standard Dev0.04090.15350.05350.06490.04250.03970.06650.27420.06320.05180.05850.0992 0.0554
MAF—major allele frequency, N—number of analyzed individual, Mean NA = mean number of allele, He—expected heterozygosity, Ho—observed heterozygosity, PIC—polymorphism information content, Fis—fixation index in the population, adjusted Fis—adjusted fixation index in the population.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mohd Rodzik, F.F.; Sudirman, N.A.; Teh, C.-K.; Ong, A.-L.; Heng, H.-Y.; Yaakop, S.; Mohd-Assaad, N.; Ong-Abdullah, M.; Ata, N.; Amit, S.; et al. Development of Nuclear DNA Markers for Applications in Genetic Diversity Study of Oil Palm-Pollinating Weevil Populations. Insects 2023, 14, 157. https://doi.org/10.3390/insects14020157

AMA Style

Mohd Rodzik FF, Sudirman NA, Teh C-K, Ong A-L, Heng H-Y, Yaakop S, Mohd-Assaad N, Ong-Abdullah M, Ata N, Amit S, et al. Development of Nuclear DNA Markers for Applications in Genetic Diversity Study of Oil Palm-Pollinating Weevil Populations. Insects. 2023; 14(2):157. https://doi.org/10.3390/insects14020157

Chicago/Turabian Style

Mohd Rodzik, Fairuz Farhana, Nurshazwani Amalina Sudirman, Chee-Keng Teh, Ai-Ling Ong, Huey-Ying Heng, Salmah Yaakop, Norfarhan Mohd-Assaad, Meilina Ong-Abdullah, Nabeel Ata, Samsudin Amit, and et al. 2023. "Development of Nuclear DNA Markers for Applications in Genetic Diversity Study of Oil Palm-Pollinating Weevil Populations" Insects 14, no. 2: 157. https://doi.org/10.3390/insects14020157

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop