Analysis of Genetic Diversity in Indian Isolates of Rhipicephalus microplus Based on Bm86 Gene Sequence

The control of cattle tick, Rhipicephalus microplus, is focused on repeated use of acaricides. However, due to growing acaricide resistance and residues problem, immunization of animals along with limited use of effective acaricides is considered a suitable option for the control of tick infestations. To date, more than fifty vaccine candidates have been identified and tested worldwide, but two vaccines were developed using the extensively studied candidate, Bm86. The main reason for limited vaccine commercialization in other countries is genetic diversity in the Bm86 gene leading to considerable variation in vaccine efficacy. India, with 193.46 million cattle population distributed in 28 states and 9 union territories, is suffering from multiple tick infestation dominated by R. microplus. As R. microplus has developed multi-acaricide resistance, an efficacious vaccine may provide a sustainable intervention for tick control. Preliminary experiments revealed that the presently available commercial vaccine based on the BM86 gene is not efficacious against Indian strain. In concert with the principle of reverse vaccinology, genetic polymorphism of the Bm86 gene within Indian isolates of R. microplus was studied. A 578 bp conserved nucleotide sequences of Bm86 from 65 R. microplus isolates collected from 9 Indian states was sequenced and revealed 95.6–99.8% and 93.2–99.5% identity in nucleotides and amino acids sequences, respectively. The identities of nucleotides and deduced amino acids were 94.7–99.8% and 91.8–99.5%, respectively, between full-length sequence (orf) of the Bm86 gene of IVRI-I strain and published sequences of vaccine strains. Six nucleotides deletion were observed in Indian Bm86 sequences. Four B-cell epitopes (D519-K554, H563-Q587, C598-T606, T609-K623), which are present in the conserved region of the IVRI-I Bm86 sequence, were selected. The results confirm that the use of available commercial Bm86 vaccines is not a suitable option against Indian isolates of R. microplus. A country-specific multi-epitope Bm86 vaccine consisting of four specific B-cell epitopes along with candidate molecules, subolesin and tropomyosin in chimeric/co-immunization format may provide a sustainable option for implementation in an integrated tick management system.


Introduction
India houses the largest cattle population (193.46 million) in the world [1] and is also the highest producer of milk [2]. However, the per capita productivity is low due to multiple reasons. Among the various reasons, tick infestation is an important contributor Vaccines 2021, 9,194. https://doi.org/10.3390/vaccines9030194 https://www.mdpi.com/journal/vaccines Vaccines 2021, 9, 194 2 of 15 to the low-level of animal productivity. Among the 109 species of ticks reported from India, Rhipicephalus microplus is a widely distributed species that infests livestock, wildlife, and zoo animals and also causes significant losses to cattle production [3]. This species inhabits India, South East Asia, Central and South America, northern and eastern Australia, eastern and southern Africa, Madagascar, the Mascarene Islands, New Caledonia, and French Polynesia [4,5]. Besides causing a significant reduction in weight gain and milk production, R. microplus also transmits Babesia bigemina, B. bovis, Anaplasmamarginalein the Indian subcontinent [6]. As per the United Nations Food and Agriculture Organization (FAO) report, 80% of the world's cattle population is exposed to tick infestation and has an estimated impact of US$7.30/head/year [7]. In India, the cost of controlling ticks and tick-borne diseases (TTBDs) has been estimated at US$498.7 million/annum [8].
The most widely adopted method for tick control is the repeated use of different classes of acaricides. However, indiscriminate use of chemical acaricides "on" and "off" the hosts has led to the emergence and establishment of acaricide-resistant tick populations throughout tropical and subtropical regions of the world [9][10][11][12], including India [13,14]. Besides adding to environmental pollution, the acaricide residues also contaminate milk and meat products [15][16][17][18][19].
Initial studies using native Bm86 as an immunogen showed significant efficacy against heterologous tick species and later, TickGARD (Hoechst Animal Health; Australia), TickGARD PLUS (Intervet Australia) in Australia, Gavac TM (Heber Biotec; Havana, Cuba) in Latin American countries and BovimuneIxovac (BOVIMUNE IXOVAC) in Mexico was developed and commercialized. The efficacy and benefits of using the anti-tick vaccine as a component of integrated tick management are well established [20][21][22]. The Bm86 based commercial vaccine provided significant efficacy against some tick strains [23][24][25], and a reduction of the incidence of bovine babesiosis has already been reported [26]. However, the commercial vaccines have shown variable efficacy of 0 to 91% in different geographical areas [20,[27][28][29][30][31][32][33][34][35][36][37][38][39][40][41][42], and this has been considered as one of the impediments of commercialization of the vaccine in wide geographical areas. One of the many reasons for the variable efficacy of a recombinant protein-based vaccine is the variability of the Bm86 amino acid sequence between reference strains used to produce the recombinant vaccines and the field strains [43]. A variation greater than 2.8% in the amino acid sequence of the protein expressed would be sufficient to confer variable efficacy [27]. Partial protection in earlier pen trials [44] with the Cuban rBm86 vaccine, Gavac in India indicated variation in the Indian Bm86 gene sequence. However, investigations have not been conducted to identify the level of diversity in the Bm86 gene within the Indian strains of R. microplus and how this compares to the globally available strains. This information is crucial before exploring the possibility of using the Bm86 gene for the development of an effective vaccine against Indian cattle tick. Thus, mapping of the Bm86 variability in strains of interest and prediction of B-cell epitope sequences between Indian and several previously characterized strains, including one commercial tick strain, is targeted in the present study as a guide for the development of effective Bm86-based vaccines for India and other countries.

Materials and Methods
Workflow of the current study mentioned in Supplementary Figure S1.

Tick Samples
The R. microplus IVRI-I strain (registration No. NBAII/IVRI/BM/1/1998) was maintained in the Entomology laboratory, Division of Parasitology, ICAR-Indian Veterinary Research Institute, was used as the reference sample. The reference ticks (N = 6) (generation 54) were used for the generation of full-length Bm86 gene sequence. For the Bm86 gene sequence diversity study, male and female R. microplus were collected from cross-bred (Bos taurus × B. indicus), native Indian breeds of cattle and from buffaloes of 65 districts across India (Supplementary Table S1). The tick isolates were collected between January 2018 to December 2019.

Study Area
Nine states belonging to different agro-climatic zones were selected for sample collection (Supplementary Figure S2). The number of districts from each state was selected based on cattle population [1], tick infestation level and incidence of tick-borne diseases (TBDs). From each district, a pooled sample of about 100-150 ticks were collected following a stratified random sampling procedure and was designated as an isolate. The collected tick samples were cleaned, morphologically identified as R. microplus using standard key [30] and stored at −80 • C.

RNA Isolation and cDNA Synthesis
Three engorged female ticks were randomly picked from each field isolate and reference tick strain (IVRI-I), weighed and stored at −80 • C. The ticks were triturated in 2 mL of Trizol ® (Thermo Fisher Scientific, Waltham, MA, USA) reagent (1 mL/100 mg tissue). The tick lysate was centrifuged at 10,000 rpm for 5 min; the supernatant was mixed with chloroform (0.2 mL/1 mL of Trizol), incubated at room temperature after vigorous shaking. The aqueous phase was mixed with absolute isopropyl alcohol and centrifuged at 13,000 rpm for 15 min at 4 • C. The supernatant was discarded, and the pellet was washed with 70% ethanol. The RNA pellet was air-dried and allowed to dissolve in DEPC treated water after heating the tube at 55 • C for 15 min. The RNA was stored at −80 • C. The cDNA was synthesized from extracted RNA using a first-strand cDNA synthesis kit (Thermo Fisher Scientific, Waltham, MA, USA) following the manufacturer's instructions.

Amplification of Full-Length IVRI-I Bm86 (orf) Gene, Cloning and Sequencing
To amplify the Bm86 cDNA target sequence, a pair of primers were designed using Primer 3 software (https://primer3plus.com/cgi-bin/dev/primer3plus.cgi (accessed on 6 February 2021)). Primers were designed based on the sequence of R. microplus reference strain Yeerongpilly (NCBI accession number M29321) as a template. The full-length Bm86orf was amplified using the forward (5 -ATG CGT GGC ATC GCT TTA TT-3 ; nucleotides 33-52) and reverse (5 -GTT TAG CCC AAC TAT CTT TAT TTG ACA TC-3 ; nucleotides 1985-1964) primers. The 25 µL PCR was optimized with the following components: 2.5 µL of 10× DreamTaq green PCR buffer, 1 µL 25 mM MgCl 2 , 0.5 µL of 10 mM dNTPs (Thermo Fisher Scientific, Waltham, MA, USA), 1 µL 50 ng cDNA, 10 µM of each primer, 0.3 µL DreamTaq DNA polymerase (5 U/µL) (Thermo Fisher Scientific, Waltham, MA, USA) and sterile nuclease-free water in sufficient quantity to make up the volume. The PCR (Veriti 96-well thermal cycler, Applied Biosystems, Foster City, CA, USA) condition was: 4 min at 95 • C followed by 35 cycles of denaturing step of 30 s at 95 • C, an annealing step of 40 s at 62.5 • C, an extension step of 2 min at 72 • C and a final extension step of 72 • C for 15 min. The band representing the 1953 bp Bm86 cDNA amplicon was excised and purified using a QIAquick gel extraction kit (Qiagen, Hilden, Germany). The purified PCR products were cloned in a pTZ57R/T cloning vector (Thermo Fisher Scientific, Waltham, MA, USA). Preparation of competent cells and transformation was carried out using Transform Aid TM Bacterial transformation kit (Thermo Fisher Scientific, Waltham, MA, USA) following manufacturer's protocol. Identification of positive clones was based on blue-white colony screening and colony PCR. Subcultures of positive clones were outsourced for double-stranded sequencing to the Department of Biochemistry, Delhi University, New Delhi. Generated sequences were analyzed, annotated and submitted to GenBank (NCBI, Bethesda, MD, USA).
Vaccines 2021, 9, 194 4 of 15 The PCR conditions were optimized as; 4 min at 94 • C followed by 35 cycles of a denaturing step of 30 s at 94 • C, an annealing step of 30 s at 52 • C, an extension step of 1 min at 72 • C and a final extension step of 72 • C for 15 min. A total of 195 PCR were performed (65 × 3: three reactions per cDNA sample). The band representing 578 bp Bm86 cDNA amplicon was excised, purified and cloned as mentioned above. The positive clones (five in each isolate) were outsourced for single-stranded sequencing. Each generated sequence was analyzed, annotated and submitted to GenBank.

Phylogenetic Analysis
Nucleotide and amino acid sequences were aligned with ClustalW, BioEdit software (Version 7.0.5.3) (BioEdit Limited, Manchester, England), and phylogenetic analysis was performed using neighbor-joining and maximum-likelihood methods and based on the Pdistance and Jones-Taylor-Thornton (JTT) model. Phylogenetic and molecular evolutionary analyses were conducted using MEGA X [31]. Bootstrap analysis was conducted using 1000 replicates to assess the reliability of inferred tree topologies.
First, the prediction of linear B-cell epitopes was carried out using the IEDB web server Bepipred 2.0 (National Institute of Allergy and Infectious Diseases, Bethesda, MD, USA). For each FASTA input sequence, a prediction score for each amino acid was obtained. To determine potential B-cell linear epitopes, we utilized the recommended cutoff of 0.5 [45], where an average score of at least nine consecutive amino acids were used for determining the cutoff. Sequences with a Bepipred score above 0.5 were considered as potential linear B-cell epitopes and analyzed by VaxiJen, the first server used for alignment-independent prediction of protective antigens. It was developed to allow antigens classification based on the physicochemical properties of proteins without recourse to sequence alignment. Bacterial, viral, parasite and tumor protein datasets were used to derive models for the prediction of whole protein antigenicity with prediction accuracy from 70% to 89% [45,46]. To evaluate the antigenicity of predicted epitopes, we utilized the default cutoff (0.5), suggested to parasite antigens. Therefore, sequences with a Bepipred score above 0.5 and a VaxiJen score above 0.5 were considered potential linear B-cell epitopes and evaluated for specificity.

Evaluation of Degree of Conservation of Linear B-Cell Epitopes
Sequences identified as potential linear B-cell epitopes were aligned to amino acid sequences of IVRI-I Bm86 for comparison with reference sequences, Yeerongpilly

Sequence Analysis
The full-length and partial targeted sequences of the Bm86 gene were amplified as 1953 bp and 578 bp, respectively, without any nonspecific reactions (Supplementary Figure S3). One of the objectives of the study was to measure the level of polymorphism between Indian (IVRI-I) Bm86 gene with worldwide published full-length Bm86 gene sequences. The amino acid sequence identity matrix (Table 1) revealed that the Indian (IVRI-I) Bm86 protein has 93.2% homology (6.76% polymorphism) and 92.7% (7.22% polymorphism) with the Yeerongpilly (TickGARD TM ) and Camcord (Cuba) (GAVAC) vaccine strains, respectively. The multiple sequence alignment (MSA) analysis showed (Supplementary Figure S4) that the specific amino acids of IVRI-I Bm86 differs from the Yeerongpilly vaccine strain at 44 loci, including 42 substitutions and 2 deletions (186, 187) and also differs from Camcord (Cuba) vaccine strain at 44 loci (44 substitutions) (Supplementary Table S2).   Figure 1 shows the phylogenetic tree of the IVRI-I nucleotide sequence with 9 other sequences from different isolates using the neighbor-joining method based on the P-distance model. Two main clades, A and B, were formed. Clade A consists of two subclades (A1, A2). The A1 is formed by three sequences of Zapata 1 (USA), Mexico and Mozambique. Subclade A2 corresponds to two sequences of Yeerongpilly and XJNJ (China). Clade B is formed by 4 sequences. It forms two subclades: one small B1 and a large B2. The subclade B1 is formed by the IVRI-I sequence, and the subclade B2 is mainly represented by two sequences of Thailand. It is observed that the IVRI-I sequence is closely related to two Thailand isolates (M1 and M2) and Hidalgo isolate (USA) rather than the Yeerongpilly, China and other strains. Whereas, Yeerongpilly vaccine strain is closer to the Chinese isolate and forms a separate sister group and more distant to Indian and Thailand isolates (Figure 1). The nucleotide sequences identity matrix revealed that the IVRI-I isolate had 95% homology with that of the Cuban and Australian isolates, respectively (Table 2). closely related to two Thailand isolates (M1 and M2) and Hidalgo isolate (USA) rathe than the Yeerongpilly, China and other strains. Whereas, Yeerongpilly vaccine strain i closer to the Chinese isolate and forms a separate sister group and more distant to India and Thailand isolates (Figure 1). The nucleotide sequences identity matrix revealed tha the IVRI-I isolate had 95% homology with that of the Cuban and Australian isolates, re spectively (Table 2).      When the same Bm86 sequences of the deduced amino acid were compared using a maximum-likelihood tree based on Jones-Taylor-Thornton (JTT) model (Figure 2), two clades (C, D) are formed. The clade C consists of two subclades (C1, C2). The C1 subclade is formed by three sequences of Yeerongpilly, XJNJ (China) and Mozambique. The subclade C2 consists of two sequences of Zapata1 (USA) and Mexico. The clade D is formed by 4 sequences and divided into two subclades (D1, D2). The subclade D1 is formed by Hidalgo (USA) sequence, and IVRI-I and Thailand M1, M2 are together formed subclade D2. The IVRI-I strain was closely related to two Thailand isolates (M1, M2), while Hidalgo isolates of the USA formed a single cluster (Figure 2). The Chinese isolates were closely related to the Yeerongpilly reference sequence and formed a single clade. Here again, except for, Hidalgo isolate, the remaining Bm86 sequences from USA were arranged in a single clade and clustered with the China-Yeerongpilly clade. The Bm86 amino acid identity matrix (  The other objective of the present study was to assess the level of polymorphism i conserved Bm86 sequences of different Indian and in other isolates. Analysis of partia sequences of the Bm86 generated from different Indian isolates revealed 95.6% to 99.8% and 93.2% to 99.5% identity in nucleotides and amino acids sequences, respectively Phylogenetic comparison of the sixty-five Indian sequences with the published sequence from Thailand, China, Mozambique and Brazil (Campo Grande) formed two distinc clades (E and F). The clade F is formed by the conserved sequences of China, Mozam bique, Brazil (Campo Grande) and the Dausa sequence of India. Clade E consists of th highest number of conserved sequences; however, most of them do not form any ident fiable subclades due to high diversity. The cade E is subdivided into subclade E1 and E2 The subclade E1 is formed by forty-three conserved sequences of Indian states. Interes ingly, the Gujarat sequences were arranged in a group (red color box) (Figure 3). Th subclade E2 is formed by twenty-two conserved sequences of different Indian isolate and four Bm86 sequences of Thailand isolates. On comparing the 578 bp sequence o sixty-five Indian isolates with a sequence of IVRI-I, few specific amino acid changes mutations were observed. The conserved Bm86 Indian isolates showed a minimum (1) t maximum (11) different amino acid substitutions/mutations (Supplementary Table S3 Analysis of the state-wise share of total amino acid substitutions revealed that Rajastha state is contributing a maximum share of (20%) substitutions/mutations and acting as th geographical hot spot of the Bm86 mutations. The state-wise share of amino acid muta tions (geographical hot spots) in conserved Bm86 is as follows, Rajasthan (20%)> Maha rashtra and Haryana (15%)> Madhya Pradesh and Gujarat (12%) > Uttarakhand and As sam (9%)> Uttar Pradesh and Punjab (4%) (Supplementary Figure S5). The other objective of the present study was to assess the level of polymorphism in conserved Bm86 sequences of different Indian and in other isolates. Analysis of partial sequences of the Bm86 generated from different Indian isolates revealed 95.6% to 99.8% and 93.2% to 99.5% identity in nucleotides and amino acids sequences, respectively. Phylogenetic comparison of the sixty-five Indian sequences with the published sequences from Thailand, China, Mozambique and Brazil (Campo Grande) formed two distinct clades (E and F). The clade F is formed by the conserved sequences of China, Mozambique, Brazil (Campo Grande) and the Dausa sequence of India. Clade E consists of the highest number of conserved sequences; however, most of them do not form any identifiable subclades due to high diversity. The cade E is subdivided into subclade E1 and E2. The subclade E1 is formed by forty-three conserved sequences of Indian states. Interestingly, the Gujarat sequences were arranged in a group (red color box) (Figure 3). The subclade E2 is formed by twenty-two conserved sequences of different Indian isolates and four Bm86 sequences of Thailand isolates. On comparing the 578 bp sequence of sixty-five Indian isolates with a sequence of IVRI-I, few specific amino acid changes/ mutations were observed. The conserved Bm86 Indian isolates showed a minimum (1) to maximum (11) different amino acid substitutions/mutations (Supplementary Table S3). Analysis of the state-wise share of total amino acid substitutions revealed that Rajasthan state is contributing a maximum share of (20%) substitutions/mutations and acting as the geographical hot spot of the

Discussion
In India, due to problems associated with tick infestations in animals and the everincreasing problem of selection and establishment of acaricide-resistant tick populations, the demand for alternative control strategies, including an anti-tick vaccine, is very high. Identification of vaccine targets is key to the success of any vaccine, and genetic homogeneity of the identified candidate antigen(s) is to be assured before further experimentation. Accordingly, the present study was designed to evaluate the variation in the Bm86 gene sequence of Indian R. microplus strains. The sequencing analysis of IVRI-I Bm86 revealed that the Bm86 gene is 648 amino acids long with two amino acid mutations. In contrast, most of the other reference sequences are 650 amino acids long. Similarly, amino acid deletions were also seen in the Hidalgo isolate of the USA and Chennai isolate of India.
The phylogenetic analysis of nucleotide and amino acid sequence of IVRI-I Bm86 with published reference Bm86 sequences revealed that IVRI-I Bm86 is evolutionarily closely related to Thailand isolates and distant to commercial vaccine strains (Yeerongpilly, Camcord, Mexico and others). This may be due to the geographical location of Thailand isolates, which are closer than other countries, and these data are in agreement with the previous observations by Kaewmongkol and coworkers' [29]. The sequence identity matrix analysis showed that the IVRI-I Bm86 protein has 93.2% homology (6.76% divergence) and 92.7% (7.22% divergence) with the Yeerongpilly (TickGARD TM ) and Camcord (Cuba) (GAVAC) vaccine strains, respectively. The divergence level of more than 2.8% has been reported as a limiting factor in the variation of efficacy of the Bm86-based vaccines [27,29]. The sequence divergence data validates the earlier observation in which 44.5% and 25.1% efficacy against R. microplus (IVRI-I strain) and Hyalomma anatolicum (IVRI-II strain), respectively, was recorded in a pen trial using commercial Cuban Bm86 vaccine [44]. The high diversity of IVRI-I Bm86 and low efficacy of commercial Mexican Bm86 vaccine in India showed that there is a strong need foran Indian-specific Bm86 vaccine.
Multiple sequence analysis of 578 bp IVRI-I conserved sequences with 65 Indian field isolates revealed 95.6 to 99.8% and 93.2 to 99.5% identity in nucleotides and amino acids sequences, respectively (Supplementary Table S4).
The analysis of the state-wise total number of substitutions/mutation (presented in pie chart form) (Supplementary Figure S5) revealed that Rajasthan state contributes the maximum share (20%) and Uttar Pradesh and Punjab states contribute the minimum share (4%). India is a highly diversified country in terms of geography, climatic conditions, and cattle breeds. A significant level of polymorphism among Indian Bm86 may have resulted from the adaptation of the tick species to different climatic conditions and cattle breeds. The R. microplus isolates from the various regions have undergone different environmental pressures, and these may have influenced the physiological, morphological and genetic variations among these isolates.
Due to the diversity in full-length IVRI-I Bm86 gene sequences and in conserved Bm86 sequence of different field isolates, the development of the Bm86 antigen-based vaccine using the entire Bm86 sequence under Indian conditions may not give maximum protection against R. microplus. Instead of using a whole antigen vaccine, epitope-based vaccines have advantages, such as safety, specificity, and low production cost. For example, due to its significant efficacy, WHO has approved a multi-epitope-based malaria vaccine (RTS, S (Mosquirix™)) for human use [47]. Accordingly, in the present study, the IVRI-I Bm86 antigen was screened for B-cell epitopes.
Nine liner B-cell epitopes were identified after screening through Bepipred and Vax-iJen servers. Four epitopes (D519-K554, H563-Q587, C598-T606, T609-K623), which are present in the conserved region of the IVRI-I Bm86 sequence, were selected. The similarity percentage of these epitopes with published Bm86 reference sequences was in the range from 86.6 to 100%, while with conserved Bm86 Indian field isolates, it was from 88 to 100%. The analysis of the impact of substitution/mutations in antigenicity/immunogenicity of B-cell epitope based on VaxiJen scores revealed that the amino acid deletion of G177-D224 in IVRI-I Bm86 epitope increased the epitope antigenicity. The in silico analysis of the impact of deletions and substitutions/mutations on the antigenicity of B-cell epitopes provided an idea of the predicted efficacy of the vaccine.
In the related fields of vaccine research, similar results have been reported. For example, initially, the apical membrane antigen 1 (AMA-1) was proposed as the most suitable subunit vaccine candidate for apicomplexan parasites, including Eimeria tenella [33], E. maxima [34], and P. falciparum [35][36][37]. However, high allelic diversity, with more than 60 polymorphic amino acids, has limited the development of an AMA-1-based P. falciparum vaccine [36][37][38][39]. In India, while exploring the possibility of developing vaccines against E. tenella, P. falciparum and Theileria annulata, high single nucleotide polymorphism (SNP) haplotype diversity in south Indian isolates of E. tenella compared to north Indian isolates [40] was noticed. High allelic sequence variation in merozoite surface antigen-1 (MSA-1) between 98 Indian isolates of P. falciparum [41], P. falciparumPfg377 gametocyte gene in 122 field isolates [42] and Tams1 gene of T. annulata parasite [48] were observed and vaccine development work was reoriented accordingly. The results of the present study and the lesson learned from the earlier experiments suggest that the Indian Bm86 protein sequence is showing high polymorphism. The B-cell epitope analysis and diversity study revealed four India-specific B-cell epitopes (D519-K554, H563-K606, C598-K606, T609-K623), which were common and highly similar in all the isolates collected across the country. The current study also identified six B-cell epitopes (T18-D45, K319-K501,  D519-K554, H563-K606, C598-K606, T609-K623), which were common in all the commercial Bm86 vaccine strains, including the Indian Bm86 sequence. These B-cell epitopes will be helpful in designing future universal multi-B-cell-epitope-based Bm86 vaccine. The four India-specific Bm86 B-cell epitopes along with other tick vaccine molecules viz., subolesin [49,50], tropomyosin [51] in chimeric vaccine/ co-vaccination format may be suitable for R. microplus management under Indian conditions.

Conclusions
A significant level of polymorphism in the full-length Bm86 gene and 65 conserved Bm86 sequences were found in R. microplus populations collected from 9 states of India. Based on the present sequence diversity study and previous in vivo pen trials, data showed that commercial vaccines based on whole antigen Bm86 vaccines might not be suitable under Indian conditions. Future studies should be on the diversity study on India-specific Bm86 B-cell epitope sequences. Sampling sites should include other states that were not part of the current work. Additionally, studies should aim to develop India-specific multi-B-cell epitope-based chimeric/cocktail/co-vaccination strategies using computational technologies.
Supplementary Materials: The following are available online at https://www.mdpi.com/2076-3 93X/9/3/194/s1, Figure S1: Flowchart of study methodology. Figure S2: An outline map showing district wise sample collection sites. Figure Figure S4: Multiple align sequence analysis of full-length IVRI-I Bm86 gene with published vaccine strains; green box depicting the amino acid deletion in Indian Bm86 sequence; red box showing the Bm86 conservation sequence. Figure S5: Pie chart representing the percentage of total amino acid share of each state (each state conserved Bm86 compared to IVRI-I Bm86 conserved sequence). Table S1: Location of Rhipicephalus microplus engorged female tick samples collected across India. Table S2: Specific amino acid substitutions/mutations of full-length Indian (IVRI-I) Bm86 gene with respect to commercial vaccine strains. Table S3: Specific amino acid substitutions/mutations in different Indian isolates conservation sequences with respect to Indian (IVRI-I) Bm86 conservation sequence. Table S4: The percentage similarity of conserved IVRI-I Bm86 B-cell epitopes with Indian Bm86 conserved field isolates B-cell epitopes. The levels of amino acid similarity were classified as low (75-85%; green cells), medium (85-95%; yellow cells) and high (95-100%; red cells). Table S5: Table S5

Data Availability Statement:
The data sets used and/or analyzed during the present study are available from the corresponding authors on reasonable request.