Next Article in Journal
Research Progress on Sex Pheromone Receptors in Insects
Previous Article in Journal
Evaluation of High-Efficacy Insecticides and Control Efficacy Using UAV Application Against the Sea Buckthorn Fruit Fly, Rhagoletis batava obscuriosa
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-View Pareto Optimization for Minimal-Diagnostic-Set Identification of Disease Vectors

by
Nuofei Lin
1,†,
Jingjing Wang
2,†,
Yixiang Qian
1,
Li Wei
1,
Hongxia Liu
2,*,
Bo Dai
1,*,
Songlin Zhuang
1 and
Dawei Zhang
1
1
Engineering Research Center of Optical Instrument and System, The Ministry of Education, Shanghai Key Laboratory of Modern Optical System, University of Shanghai for Science and Technology, Shanghai 200093, China
2
Department of Infectious Disease Control, Shanghai Municipal Center for Disease Control and Prevention, Shanghai 201107, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Insects 2026, 17(4), 381; https://doi.org/10.3390/insects17040381
Submission received: 28 January 2026 / Revised: 21 March 2026 / Accepted: 27 March 2026 / Published: 1 April 2026
(This article belongs to the Section Insect Pest and Vector Management)

Simple Summary

Accurate identification of disease vectors such as mosquitoes and flies is essential for preventing vector-borne disease outbreaks. However, distinguishing morphologically similar species is challenging and typically requires expert examination of fine diagnostic traits. Existing AI approaches often depend on large, multi-perspective image datasets, which are resource-intensive to acquire. To address this, we developed MVP-Net, an intelligent system that maintains high accuracy with limited data by learning a set of key anatomical views. Using regionally collected fly and mosquito datasets from routine surveillance in Shanghai, the model retained comparable classification performance using only 5 and 2 views for flies and mosquitoes, respectively. This approach reduces image acquisition effort and computational cost and may support regional auxiliary identification workflows.

Abstract

Accurate identification of disease vectors is crucial for public health, yet distinguishing morphologically similar species demands significant taxonomic expertise and data resources. This study proposes MVP-Net, an AI-driven framework designed to extract a minimal sufficient set of diagnostic anatomical views from multi-view imagery for efficient identification. The framework was evaluated on regionally collected datasets of Calyptratae (8 views) and Culicidae (11 views) from routine surveillance in Shanghai. Under all-view fusion, MVP-Net achieved Top-1 accuracies of 87.04% for Calyptratae and 100% for Culicidae. After Pareto-based view optimization, the required input was reduced to 5 views for Calyptratae and 2 views for Culicidae, lowering computational cost by 37.49% and 81.82%, respectively, while retaining comparable classification performance (86.11% for the recommended Calyptratae configuration and 100% for the recommended Culicidae configuration). These results show that MVP-Net can reduce view redundancy while preserving comparable identification performance within the current Shanghai surveillance setting, providing a practical approach for optimizing regional multi-view auxiliary identification workflows.

1. Introduction

The escalating global burden of vector-borne diseases, driven by rapid urbanization and climate variability, has created an imperative for efficient and scalable disease vector surveillance [1,2]. Across the disciplines of public health, medicine, and forensic entomology, the precise identification of Diptera, particularly species within the families Culicidae (mosquitoes) and Calyptratae (flies), is paramount for effective vector control and epidemiological modeling [3,4,5]. However, taxonomic identification at the species level remains a bottleneck. It relies on expert examination of minute, highly localized morphological structures, such as wing venation patterns, thoracic setae arrangement (chaetotaxy), and leg banding [6,7]. As the global shortage of trained taxonomists constrains manual processing, computer vision empowered by deep learning (DL) has emerged as a transformative tool to bridge the gap between biological data acquisition and taxonomic classification [1,8].
Despite the success of convolutional neural networks (CNNs) [9] and Vision Transformers (ViTs) [10] in general object recognition, insect species identification remains a challenging fine-grained visual classification (FGVC) task [11,12]. A substantial body of research has focused on identifying insects from field images or automated light traps [13,14,15,16]. While effective for population monitoring, these in-the-wild approaches often rely on low-resolution global features that are susceptible to background noise [17]. They frequently fail to capture the subtle diagnostic traits prioritized by entomologists, which are crucial for distinguishing sibling species (e.g., Lucilia sericata and Lucilia cuprina) [18,19]. Recent studies utilizing high-definition lab imagery have demonstrated that focusing on specific body parts (e.g., wings) can significantly outperform whole-body classification, as rigid structures provide stable features unaffected by abdominal distortion [20,21].
To emulate the expert taxonomist’s multi-perspective examination, advanced imaging setups increasingly adopt multi-view strategies, capturing specimens from dorsal, lateral, and ventral angles to overcome occlusion [22,23]. However, simply fusing features from all available views introduces significant data redundancy and computational overhead, which is prohibitive for real-time deployment [24]. Furthermore, not all morphological views contribute equally to discrimination; for instance, the dorsal thorax is critical for distinguishing Aedes species, while wing venation is definitive for Culex [21]. Indiscriminate fusion of irrelevant views can lead to negative transfer, where noise dilutes the signal from diagnostic traits [25,26]. Therefore, a central challenge in modern computational entomology is to transition from “processing everything” to “learning what matters.”
Recent advancements in differentiable programming and multi-objective optimization (MOO) offer a pathway to address this efficiency–accuracy trade-off. Techniques such as differentiable Top-k selection and optimal transport allow neural networks to learn to rank and select the most informative image patches or views in an end-to-end manner [27,28,29]. Moreover, the selection of optimal feature subsets can be rigorously modeled using Pareto optimization, which seeks to identify a set of “Pareto-optimal” solutions where classification accuracy is maximized while computational cost is minimized [30,31]. This paradigm enables the discovery of efficient architectures that are both accurate and lightweight, filtering out redundant information effectively [32].
To address this problem, we propose MVP-Net (Multi-View Pareto Network), a multi-view identification framework designed to select compact sets of diagnostically informative anatomical parts from multi-view image data. The framework was evaluated on two datasets collected through approximately one year of routine surveillance conducted by the Shanghai CDC system, covering the major mosquito and fly taxa under local routine monitoring. These datasets include a primary Calyptratae dataset comprising 12 species and 512 specimens, and an auxiliary Culicidae dataset comprising 5 species and 363 specimens. Under all-view fusion, MVP-Net achieved Top-1 accuracies of 87.04% for Calyptratae, exceeding the best single-view baseline by 3.71%, and 100% for Culicidae. By integrating a Hard-Concrete gating mechanism with Pareto optimization, the required number of input views was reduced from 8 to 5 for Calyptratae and from 11 to 2 for Culicidae, corresponding to reductions in computational cost of 37.49% and 81.82%, respectively, while maintaining comparable classification performance. These results suggest that multi-view morphological identification can be made more efficient through task-specific view selection, supporting the application of the proposed framework for regional auxiliary identification of vector insects.

2. Materials and Methods

2.1. Dataset Acquisition and Problem Formulation

To validate the proposed multi-view identification framework, two distinct fine-grained datasets were constructed to represent different taxonomic challenges. The primary dataset serves as the core case study, focusing on Calyptratae, while an auxiliary dataset was compiled for Culicidae to examine whether the same framework can be applied to a second vector group under a comparable imaging and analysis pipeline.
All specimens were obtained through routine vector surveillance conducted across various administrative districts of Shanghai, China, with geographic and temporal coverage summarized in Table A1 and Table A4. The procedures for specimen collection, euthanasia, and preparation followed conventional practices employed in CDC-based vector surveillance. Fly specimens were collected either directly using sweep nets or through cage trapping, whereas mosquito specimens were primarily collected using mosquito aspirators. Following collection, live specimens were euthanized with ether or ammonia water and immediately prepared as pinned specimens. In accordance with routine surveillance specimen preparation practices, each pin was inserted vertically through the right side of the mesonotum. Only fresh and intact specimens were retained for imaging and analysis; damaged specimens were discarded during preparation. The prepared specimens were stored in dry specimen boxes and photographed as soon as possible after collection, typically within three months. The species included in the two datasets were selected from the Culicidae and Calyptratae taxa that are common in Shanghai and relevant to routine local surveillance (see Figure 1 and Figure A4 for representative examples).
Prior to digitization, specimen identification and image annotation were carried out through a collaborative workflow. Experienced taxonomic experts first defined the diagnostic morphological structures to be photographed, based on practical identification experience and relevant taxonomic references [33,34]. For the Calyptratae dataset, eight diagnostic morphological structures, including the basicosta and hypopleura, were selected (Figure 2). Frontline vector surveillance personnel, who had received basic training in locating the required anatomical parts, then acquired images according to the predefined diagnostic views, using the routine microscopic equipment and imaging settings available in the respective regional surveillance units (Table A3). Species identification was primarily performed by two taxonomic experts. In cases where their identifications were inconsistent, an additional taxonomic expert was invited to review the specimen and provide the final determination. This process yielded a total of 512 verified specimens representing 12 species across eight genera. These were partitioned into 404 training samples and 108 validation samples (approximately 4:1 ratio), with the detailed species-wise distribution provided in Table A2.
For the auxiliary Culicidae dataset, 363 specimens spanning 5 species were compiled. In this dataset, 11 specific anatomical parts were defined (Figure A4). These were partitioned into 289 training and 74 validation samples, with the specific species-wise distribution detailed in Table A5.

2.2. MVP-Net Architecture and Learning Strategy

To reconcile the critical trade-off between identification accuracy and computational efficiency, MVP-Net is proposed as a unified framework that integrates feature extraction, multi-view synthesis, and differentiable subset selection. Implementation was carried out in Python (version 3.10.19) using the PyTorch (version 2.9.0+cu126) library, with all model training and performance evaluations performed on a workstation equipped with an Intel Core i9-9940X CPU (3.30 GHz), 32 GB of RAM, and an NVIDIA GeForce RTX 4090 GPU.

2.2.1. Feature Extraction and Transformer-Based Fusion

Discriminative representations are first extracted from each independent morphological view using a shared-weight encoder. A ConvNeXt-V2-Pico backbone [35], initialized with ImageNet [36] weights, is employed as the feature extractor, followed by an ML-Decoder head [37]. The ML-Decoder utilizes learnable queries to aggregate spatial features into a compact view-embedding vector, denoted as vᵢ.
To preserve the structural context of anatomical parts, a learnable view-specific positional encoding (PEᵢ) is injected into the extracted features before fusion:
e i = v i + P E i
where eᵢ serves as the input token for the subsequent fusion module. These tokens are processed via a Transformer-based fusion module utilizing multi-head self-attention [38] with 8 heads. This mechanism allows the model to dynamically assign attention weights to local features based on their relevance to the global classification task, effectively handling occlusion and view-specific noise.
The backbone is fine-tuned using the AdamW optimizer [39] with an initial learning rate of 1 × 10−6 and a weight decay of 5 × 10−3, following a cosine annealing warm restart schedule. All input images were resized to 224 × 224 pixels prior to being fed into the network during training, whereas in practical applications, the neural network can adapt to images of arbitrary size. During training, data augmentation techniques, including random flipping, rotation (±60°), color jittering, and random erasing, were applied to enhance robustness against image variability (Figure A1). The learning curves for the single-view feature extraction stage and the multi-view fusion stage are shown in Figure A2, indicating stable convergence throughout the training process.

2.2.2. Differentiable View Selection via Pareto Optimization

To autonomously identify the minimal sufficient set of views, the model aims to minimize the number of active inputs, which corresponds to optimizing the L0 norm. Since the standard L0 norm is non-differentiable, a continuous relaxation based on the Hard-Concrete distribution [40] is implemented.
For each view i, a binary gate variable zᵢ ∈ {0, 1} is sampled from a rectified distribution parameterized by a learnable logit αᵢ. To enable gradient-based optimization, the reparameterization trick is applied:
z i = clip σ log u log 1 u + α i β ζ γ + γ , 0 , 1
where u U 0,1 is a uniform random noise, σ is the sigmoid function, and β is the temperature parameter annealed from 0.67 to 0.1 during training. The stretching parameters are set to ζ =   1.1 and γ =   0.1 to allow for exact zero/one sampling. The gated output is then computed as z i e i .
The network is trained for 300 epochs using a compound objective function:
L t o t a l = L C E + λ c o n t r a s t L c o n t r a s t + λ s p a r s i t y L L 0
where
(1)
LCE is the standard cross-entropy loss.
(2)
Lcontrast denotes the supervised contrastive loss [41] with a temperature parameter τ = 0.1. The weighting coefficient λcontrast is set to 0.2 to enhance the separation of sibling species.
(3)
L L 0 represents the expected L0 norm (computational cost), calculated as the sum of the gate probabilities:
L L 0 = i = 1 N σ α i β log γ ζ
To construct the Pareto frontier, a grid search is performed over λsparsity ∈ {0.1, 0.2, …, 0.5}. The optimal configuration is then selected to minimize the view count while maintaining accuracy within a 1% margin of the all-view baseline.

2.3. Software Implementation for Offline Identification

To facilitate the application of the proposed framework in routine use scenarios, a customized desktop application was developed in Python to integrate the Pareto-optimized multi-view identification pipeline. The graphical user interface (GUI) was built using PySide6 (version 6.6.1), providing functionalities for specimen management and image uploading (Figure A3). The software was designed for offline local operation, allowing users to upload images corresponding to the selected morphological views described in Section 2.2. These images are automatically preprocessed by the backend and subsequently passed to the local ConvNeXt–Transformer network for inference. To enable convenient use on standard laboratory computers, the software environment, trained models, and required dependencies were packaged as a standalone executable (.exe) for the Windows 10/11 operating system using PyInstaller (version 6.12.0).
Upon completion of the analysis, the software displays the top-3 predicted species along with their respective confidence scores on the interface to assist decision-making. Comprehensive identification logs, encompassing the specimen identifier and file paths of the specific morphological views, are automatically generated and exported as comma-separated values (CSV) documents.

3. Results

3.1. Performance Benchmarking: Single-View vs. Multi-View Fusion

Systematic evaluation on the 12-species Calyptratae validation set demonstrates a consistent performance advantage of multi-view fusion over individual morphological recognition. Among the eight independent views, the mesonotum achieves the highest individual classification accuracy of 83.33% (Figure 3A), serving as the single-view baseline. The Transformer-based fusion model, synthesizing features from all eight views, attains a Top-1 accuracy of 87.04%. This represents an absolute improvement of 3.71 percentage points over the strongest single-view result, confirming that cross-view morphological synthesis is essential for capturing discriminative traits that are otherwise occluded in isolated perspectives.
Species-wise analysis indicates that the impact of multi-view fusion is heterogeneous across different taxa (Figure 3B). For specific disease vectors such as Musca sorbens and Muscina stabulans, the fusion strategy yields a net gain of two correctly identified samples each, resolving ambiguities inherent in dorsal-only observations. However, evidence of negative interference is also observed in certain categories; for instance, in the cases of Lucilia cuprina and Calliphora nigribarbis, the fusion of all eight views results in a marginal decrease of one correctly predicted sample compared to the mesonotum baseline. This suggests that indiscriminate feature stacking can introduce localized informational redundancy, which potentially dilutes the diagnostic signals of specific anatomical structures.
Furthermore, the requirement for high-precision multi-view classification on resource-constrained hardware led to the selection of the ConvNeXt-V2-Pico backbone (≈9.1 M parameters). This weight-sharing configuration is intentional, it prunes unnecessary parameter growth across multiple inputs and redirects the available computational capacity toward the Transformer fusion layer to model complex dependencies. Consequently, the architecture navigates these relationships at a cost of just 1.41 GFLOPs per view. Ultimately, such a design ensures that robust taxonomic identification is achieved without compromising inference speed, so as to facilitate high-performance deployment on resource-constrained devices with strict power and memory limitations.

3.2. Identification of Pareto-Optimal Subsets and Efficiency Gains

The evolution of gating probabilities during the 300-epoch training trajectory illustrates the model’s ability to autonomously prioritize a minimal sufficient set of diagnostic parts (Figure 4A). Anatomical views such as the mesonotum, abdomen, and hypopleura exhibit rapidly increasing retention probabilities, converging toward high-priority status. Conversely, structures including the head, wings, and basicosta are consistently suppressed (Figure 4B), reflecting their lower informational density for fine-grained differentiation within this taxonomic scope. This divergence demonstrates that the differentiable gating mechanism successfully identifies the most discriminative feature combinations while discarding redundant views.
The relationship between the number of views, identification accuracy, and computational overhead is characterized by the Pareto frontier (Figure 4C, Table 1). While the all-view configuration achieves the highest accuracy (87.04%) with a computational cost of 11.31 GFLOPs, the recommended configuration (5-part) identified at lambda λ = 0.4 represents the optimal efficiency-to-performance saturation point. This configuration achieves 86.11% accuracy, maintaining the performance drop within a critical 1% margin (0.93%) relative to the all-view baseline. This recommended configuration is prioritized over the aggressive configuration (3-part) because the latter causes accuracy to drop to 84.26%, representing a 2.78% loss that falls below the predefined threshold for acceptable accuracy.
The superiority of the recommended strategy is further validated by the radar chart (Figure 4D) and Table 1. The recommended configuration represents a strategic peak where the marginal accuracy gain from adding further views (6 through 8) is negligible compared to the 37.49% reduction in theoretical computational cost achieved through part pruning (from 11.31 G to 7.07 G). Species-level confusion analysis (Figure 4E) demonstrates that this optimized subset preserves core diagnostic features, with accuracy losses confined to a single misclassification between the sibling species M. domestica and M. sorbens. In specific cases like Lucilia illustris, the recommended model actually exhibits a marginal improvement (+1 sample) relative to the baseline, suggesting that removing redundant inputs can occasionally mitigate localized classification interference.
To test real-world performance, the identification software (Figure A3) was run on a basic laptop with an Intel Core i7-5500U CPU (2.4 GHz) and 8 GB of RAM. Even without a dedicated GPU, the system achieved real-time inference. This indicates that the lower computational cost (FLOPs) allows the model to work efficiently on common hardware for field use.

3.3. Performance Evaluation on the Culicidae Dataset

The framework was further evaluated on the Culicidae dataset to verify its performance across different disease vector taxa. Baseline benchmarking of eleven individual morphological views (Figure 5) shows that the pre-spiracular and post-spiracular setae provides the highest single-view accuracy at 97.18%. In comparison, the integrated multi-view fusion achieves a 100% identification accuracy, representing a performance gain of +2.82%.
Under the recommended λ configuration, the gating mechanism identifies an optimized subset of diagnostic parts (Figure 6A). The selection process consistently prioritizes the prespiracular and post-spiracular setae and lateral abdomen, while suppressing redundant views such as the wings and legs. The resulting 2-part configuration (Figure 6B) maintains the 100% accuracy ceiling while reducing theoretical computational costs (FLOPs) by 81.82% relative to the 11-view baseline (Table 2). Multi-dimensional evaluation (Figure 6C) confirms that this optimized subset achieves an ideal balance between classification precision and resource efficiency.
The confusion matrix (Figure 6D) confirms that the 2-view model achieves absolute precision across all 5 species. These results indicate that the framework can autonomously determine efficient diagnostic part sets for diverse vector groups while maintaining high classification performance.

4. Discussion

4.1. AI Part Selection and Morphological Consistency

The identification of a minimal sufficient set through MVP-Net suggests that accurate taxonomic differentiation may be achieved without exhaustive morphological coverage, provided that informative diagnostic structures are retained. For Calyptratae, the model consistently prioritizes the mesonotum, abdomen, and hypopleura, which matches the diagnostic characters traditionally used in taxonomic keys. The mesonotum is particularly important as it integrates core diagnostic features, including coloration (e.g., metallic luster), chaetotaxy patterns (such as the arrangement and size of setae), and thoracic markings (e.g., characteristic longitudinal stripes), which collectively serve as the foundation for fly classification. This correspondence indicates that the differentiable gating mechanism is able to select anatomically meaningful views and provides a degree of biological interpretability for the reduced-view configuration.

4.2. Task Dependence and Regional Applicability of Minimal View Sets

The samples used in this study were obtained from approximately one year of routine surveillance conducted by the Shanghai CDC system and encompass the major mosquito and fly taxa under routine monitoring. In the 12-species Calyptratae task, the 8-view configuration achieved an accuracy of 87.04%, while reducing the input to 5 views preserved performance within a margin of 0.93%. In contrast, further reduction to 3 views resulted in a decrease in accuracy to 84.26%. These findings suggest that, within the current dataset context, view reduction for Calyptratae is subject to a practical lower threshold. Accordingly, the objective of this framework is not to minimize the number of views at any cost, but rather to identify a compact subset that maintains performance comparable to that of the all-view configuration. Under this premise, the 5-view configuration appears to offer a balanced solution, retaining most of the discriminative information while reducing computational cost by 37.49%.
For the Culicidae dataset, 100% accuracy was achieved using only two views, and the included specimens cover the major mosquito taxa routinely monitored by the Shanghai CDC system. Within the current Shanghai surveillance context, this two-view configuration appears sufficient for identifying the major mosquito taxa under routine monitoring.
These findings indicate that the optimal subset of diagnostic parts is task-dependent. Within the current Shanghai surveillance context, the reduced-view configurations identified for both Calyptratae and Culicidae appear sufficient for routine identification purposes. However, should the framework be extended to broader surveillance scenarios involving a larger number of species or greater morphological similarity, the relative importance of individual anatomical parts, as well as the size and composition of the minimal diagnostic subset, may vary accordingly.

4.3. Practical Robustness and Future Extension

The datasets were generated under conventional CDC collection and simple specimen-preparation procedures, and the images were acquired using routine microscopes from different regional units with varying configurations, including differences in color temperature and image resolution (Table A3). Before being analyzed by MVP-Net, the images were also subjected to data augmentation, including illumination-related perturbation and occlusion simulation (Figure A1). Under these conditions, useful classification performance was still maintained after resizing the inputs to 224 × 224, suggesting some potential for extension to less specialized imaging hardware, although this would still require dedicated validation under new acquisition conditions.
The present study focused on Pareto optimization of minimal view subsets for species identification within the collected taxa and did not incorporate sex as a prediction target. This remains an important limitation for vector surveillance, because biological sex is directly related to epidemiological significance. Future work should therefore examine whether sex recognition accuracy can be incorporated as an additional objective into the current Pareto-optimization framework, so that species identification, sex recognition, and minimal view selection can be jointly optimized.

5. Conclusions

This study evaluated whether multi-view morphological identification of mosquito and fly taxa under routine surveillance in Shanghai could be simplified while preserving comparable classification performance. Using microscopy-based multi-view datasets of Calyptratae and Culicidae collected through approximately one year of routine surveillance by the Shanghai CDC system, MVP-Net identified compact subsets of diagnostically informative anatomical views while reducing image acquisition burden and computational cost.
For the 12-species Calyptratae dataset, the recommended configuration reduced the required input from 8 views to 5, lowered computational cost by 37.49%, and maintained 86.11% accuracy, remaining within 0.93% of the all-view configuration. For the 5-species Culicidae dataset, the required input was reduced from 11 views to 2, with an 81.82% reduction in computational cost while maintaining 100% accuracy for the species included in the current dataset. These findings indicate that, within the taxa routinely monitored in Shanghai, comparable identification performance can be achieved using a reduced set of anatomical views, although the number and composition of the selected views vary according to the specific taxonomic task.
Overall, these results support the application of MVP-Net as a regional auxiliary identification framework for mosquito and fly taxa under routine surveillance in Shanghai. Rather than minimizing the number of views at any cost, the framework provides a way to identify compact view subsets that preserve most of the discriminative information needed for identification under the current dataset setting. Future work should expand the regional and taxonomic coverage of the datasets and examine whether sex recognition accuracy can be incorporated as an additional objective into the current Pareto-optimization framework.

Author Contributions

N.L., B.D., H.L., J.W. and Y.Q. initiated the project. B.D., H.L., S.Z. and D.Z. supervised the project. N.L., B.D., J.W., Y.Q. and L.W. designed the multi-view identification framework. N.L. and Y.Q. developed the differentiable part importance mechanism and the Pareto-optimization strategy. N.L., J.W. and Y.Q. implemented the neural network architectures and carried out model training and validation. N.L., Y.Q., J.W., L.W., H.L. and B.D. performed the data analysis. N.L., B.D., H.L., J.W., Y.Q. and D.Z. wrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by grants from the National Natural Science Foundation of China (62475157), AI for Science Program, Shanghai Municipal Commission of Economy and Informatization (RZ-RGZN-01-25-0957) and Science and Technology Commission of Shanghai Municipality (24DX2800300).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

We extend our sincere gratitude to the personnel from the Shanghai Municipal Center for Disease Control and Prevention, the Pudong Airport Customs, and the CDC units of the Minhang, Qingpu, Yangpu, Jiading, Chongming, Putuo, and Fengxian Districts. Their dedication to field surveillance, specimen collection, taxonomic identification, and microscopic imaging was fundamental to this research.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
FLOPsFloating point operations
DLDeep learning
CNNsConvolutional neural networks
ViTsVision transformers
MOOMulti-objective optimization
PEPositional encoding
CECross-entropy
GUIGraphical user interface
CDCCenter for Disease Control and Prevention
CSVComma-separated values

Appendix A

Table A1. District and month of the Calyptratae species included in the Shanghai routine surveillance dataset.
Table A1. District and month of the Calyptratae species included in the Shanghai routine surveillance dataset.
SpeciesDistrict CoverageMonth CoverageTotal
Aldrichina grahami71039
Boettcherisca peregrina8758
Calliphora nigribarbis2211
Chrysomya megacephala8755
Fannia prisca6722
Lucilia cuprina6837
Lucilia illustris81038
Lucilia sericata8762
Muscina angustifrons7840
Musca domestica8767
Musca sorbens8732
Muscina stabulans8851
Total810512
Note: District coverage indicates the number of administrative districts in Shanghai from which each species was collected, and month coverage indicates the number of collection months represented in the dataset.
Table A2. Dataset split distribution of Calyptratae species.
Table A2. Dataset split distribution of Calyptratae species.
SpeciesTrainingValidationTotal
Aldrichina grahami31839
Boettcherisca peregrina461258
Calliphora nigribarbis8311
Chrysomya megacephala441155
Fannia prisca17522
Lucilia cuprina29837
Lucilia illustris30838
Lucilia sericata491362
Muscina angustifrons32840
Musca domestica531467
Musca sorbens25732
Muscina stabulans401151
Total404108512
Table A3. Microscope specifications across representative geographic collection regions in Shanghai city.
Table A3. Microscope specifications across representative geographic collection regions in Shanghai city.
SiteMicroscope ModelSensorResolution
(W × H/MP)
Color
Temperature
PutuoLeica SAPO (Leica, Wetzlar, Germany)Leica Flexacam C1 (Leica, Wetzlar, Germany)3840 × 2160/8.3 MP5500 K
QingpuLeica S9i (Leica, Wetzlar, Germany)Integrated S9i (Leica, Wetzlar, Germany)3648 × 2736/10.0 MP4500 K
FengxianOlympus SZ61 (Olympus, Tokyo, Japan)External CMOS (Olympus, Tokyo, Japan)2592 × 1944/5.0 MPVariable
CustomsOlympus CX31 (Olympus, Tokyo, Japan)Nikon D300 (Nikon, Tokyo, Japan)4288 × 2848/12.3 MP3200–3400 K
Figure A1. Data augmentation examples generated from a head image of Aldrichina grahami.
Figure A1. Data augmentation examples generated from a head image of Aldrichina grahami.
Insects 17 00381 g0a1
Figure A2. Learning curves on the Calyptratae dataset. (A) Training and validation loss curves for the single-view feature extraction stage (Stage 1). (B) Cross-entropy loss ( L C E ) curves for the multi-view fusion stage (Stage 2). Note: While the fusion network is optimized using a compound objective function ( L t o t a l ), only the classification component ( L C E ) is visualized here to explicitly demonstrate the convergence of the discriminative task.
Figure A2. Learning curves on the Calyptratae dataset. (A) Training and validation loss curves for the single-view feature extraction stage (Stage 1). (B) Cross-entropy loss ( L C E ) curves for the multi-view fusion stage (Stage 2). Note: While the fusion network is optimized using a compound objective function ( L t o t a l ), only the classification component ( L C E ) is visualized here to explicitly demonstrate the convergence of the discriminative task.
Insects 17 00381 g0a2
Figure A3. Graphical user interface of the offline multi-view identification software. The software provides a centralized dashboard for specimen management and offline inference in resource-constrained environments. Users can select model types, configure storage paths, and upload images corresponding to the Pareto-optimized morphological views. Input images are automatically processed by the local ConvNeXt-Transformer engine to display the top-3 predicted species with confidence scores and generate standardized CSV identification logs.
Figure A3. Graphical user interface of the offline multi-view identification software. The software provides a centralized dashboard for specimen management and offline inference in resource-constrained environments. Users can select model types, configure storage paths, and upload images corresponding to the Pareto-optimized morphological views. Input images are automatically processed by the local ConvNeXt-Transformer engine to display the top-3 predicted species with confidence scores and generate standardized CSV identification logs.
Insects 17 00381 g0a3

Appendix B

Figure A4. Representative images of the Culicidae (A) and diagnostic morphological structures (B).
Figure A4. Representative images of the Culicidae (A) and diagnostic morphological structures (B).
Insects 17 00381 g0a4
Table A4. District and month of the Culicidae species included in the Shanghai routine surveillance dataset.
Table A4. District and month of the Culicidae species included in the Shanghai routine surveillance dataset.
SpeciesDistrict CoverageMonth CoverageTotal
Culex tritaeniorhynchus8579
Anopheles sinensis6677
Culex pipiens pallens8570
Aedes albopictus8470
Armigeres subalbatus8567
Total88363
Table A5. Dataset split distribution of Culicidae species.
Table A5. Dataset split distribution of Culicidae species.
SpeciesTrainingValidationTotal
Culex tritaeniorhynchus631679
Anopheles sinensis611677
Culex pipiens pallens561470
Aedes albopictus561470
Armigeres subalbatus531467
Total28974363

Appendix C

To comprehensively evaluate the performance of different view selection strategies and the Pareto-optimized configurations, five multi-dimensional metrics are defined. These metrics, visualized in the radar charts (Figure 4D and Figure 6C), quantify the trade-off between taxonomic accuracy and computational efficiency.
The following variables are used in the definitions: A c c s u b s e t and A c c b a s e denote the Top-1 accuracy of the selected subset and the all-view baseline, respectively; A c c s i n g l e represents the accuracy of the best-performing single-view model; N s u b s e t and N t o t a l are the number of selected views and the total available views; F L O P s s u b s e t and F L O P s t o t a l refer to the computational costs of the subset and the baseline.
1.
Accuracy preservation (AP)
Measures the extent to which the optimized subset maintains the performance of the all-view baseline:
A P = A c c s u b s e t A c c b a s e
2.
Marginal contribution (MC)
Quantifies the average accuracy gain per selected part relative to the most discriminative single view:
M C = A c c s u b s e t A c c s i n g l e N s u b s e t
3.
Resource efficiency (RE)
Evaluates the classification performance achieved per unit of computational cost:
R E = A c c s u b s e t F L O P s s u b s e t
4.
Part reduction (PR)
Represents the proportion of redundant morphological views eliminated from the input set:
P R = 1 N s u b s e t N t o t a l
5.
Deployment advantage (DA)
A composite index measuring the synergy between computational savings and accuracy retention, reflecting the model’s suitability for resource-constrained edge devices:
D A = 1 F L O P s s u b s e t F L O P s t o t a l × A c c s u b s e t A c c b a s e

References

  1. Romanello, M.; di Napoli, C.; Drummond, P.; Green, C.; Kennard, H.; Lampard, P.; Scamman, D.; Arnell, N.; Ayeb-Karlsson, S.; Ford, L.B.; et al. The 2023 report of the Lancet Countdown on health and climate change: The imperative for a health-centred response in a world facing irreversible harms. Lancet 2023, 402, 2346–2394. [Google Scholar] [CrossRef] [PubMed]
  2. Kraemer, M.U.G.; Reiner, R.C.; Brady, O.J.; Messina, J.P.; Gilbert, M.; Pigott, D.M.; Yi, D.; Johnson, K.; Earl, L.; Marczak, L.B.; et al. Past and future spread of the arbovirus vectors Aedes aegypti and Aedes albopictus. Nat. Microbiol. 2019, 4, 854–863. [Google Scholar] [CrossRef] [PubMed]
  3. Ong, S.Q.; Nair, G.; Yusof, U.K.; Ahmad, H. Community-based mosquito surveillance: An automatic mosquito-on-human-skin recognition system with a deep learning algorithm. Pest Manag. Sci. 2022, 78, 4092–4104. [Google Scholar] [CrossRef] [PubMed]
  4. Bai, S.; Shi, L.; Yang, K. Deep learning in disease vector image identification. Pest Manag. Sci. 2025, 81, 527–539. [Google Scholar] [CrossRef] [PubMed]
  5. Gohe, A.K.; Kottek, M.J.; Buettner, R.; Penava, P. Classifying forensically important flies using deep learning to support pathologists and rescue teams during forensic investigations. PLoS ONE 2024, 19, e0314533. [Google Scholar] [CrossRef] [PubMed]
  6. Valan, M.; Makonyi, K.; Maki, A.; Vondráček, D.; Ronquist, F. Automated Taxonomic Identification of Insects with Expert-Level Accuracy Using Effective Feature Transfer from Convolutional Networks. Syst. Biol. 2019, 68, 876–895. [Google Scholar] [CrossRef] [PubMed]
  7. Sauer, F.G.; Werny, M.; Nolte, K.; Villacañas de Castro, C.; Becker, N.; Kiel, E.; Lühken, R. A convolutional neural network to identify mosquito species (Diptera: Culicidae) of the genus Aedes by wing images. Sci. Rep. 2024, 14, 3094. [Google Scholar] [CrossRef] [PubMed]
  8. Fujisawa, T.; Noguerales, V.; Meramveliotakis, E.; Papadopoulou, A.; Vogler, A. Image-based taxonomic classification of bulk insect biodiversity samples using deep learning and domain adaptation. Syst. Entomol. 2023, 48, 387–401. [Google Scholar] [CrossRef]
  9. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  10. Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S. An Image is Worth 16 × 16 Words: Transformers for Image Recognition at Scale. In Proceedings of the International Conference on Learning Representations (ICLR), Virtual Event, 3–7 May 2021. [Google Scholar]
  11. Wei, X.; Song, Y.; Mac Aodha, O.; Wu, J.; Peng, Y.; Tang, J.; Yang, J.; Belongie, S. Fine-Grained Image Analysis with Deep Learning: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 8927–8948. [Google Scholar] [CrossRef] [PubMed]
  12. Chang, D.; Tong, Y.; Du, R.; Hospedales, T.; Song, Y.-Z.; Ma, Z. An Erudite Fine-Grained Visual Classification Model. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 18–22 June 2023; pp. 7268–7277. [Google Scholar]
  13. Li, W.; Wang, D.; Li, M.; Gao, Y.; Wu, J.; Wang, X. Classification and detection of insects from field images using deep learning for smart pest management: A systematic review. Ecol. Inform. 2021, 66, 101460. [Google Scholar] [CrossRef]
  14. Bjerge, K.; Nielsen, J.B.; Sepstrup, M.V.; Helsing-Nielsen, F.; Høye, T.T. An Automated Light Trap to Monitor Moths (Lepidoptera) Using Computer Vision-Based Tracking and Deep Learning. Sensors 2021, 21, 343. [Google Scholar] [CrossRef] [PubMed]
  15. Chiranjeevi, S.; Saadati, M.; Deng, Z.K.; Koushik, J.; Jubery, T.Z.; Mueller, D.S.; O’Neal, M.; Merchant, N.; Singh, A.; Singh, A.K.; et al. InsectNet: Real-time identification of insects using an end-to-end machine learning pipeline. PNAS Nexus 2025, 4, pgae575. [Google Scholar] [CrossRef] [PubMed]
  16. Jain, A.; Cunha, F.; Bunsen, M.J.; Cañas, J.S.; Pasi, L.; Pinoy, N.; Helsing, F.; Russo, J.; Botham, M.; Sabourin, M.; et al. Insect Identification in the Wild: The AMI Dataset. In Proceedings of the European Conference on Computer Vision (ECCV), Milan, Italy, 29 September–4 October 2024. [Google Scholar]
  17. Schmidt, J.; Stober, S.; Denzler, J.; Bodesheim, P. Saccadic Vision for Fine-Grained Visual Classification. arXiv 2025, arXiv:2509.15688. [Google Scholar] [CrossRef]
  18. Nguyen, H.; Truong, T.; Nguyen, X.B.; Dowling, A.; Li, X.; Luu, K. Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 17–21 June 2024; pp. 21945–21955. [Google Scholar]
  19. Motta, D.; Santos, A.A.B.; Winkler, I.; Machado, B.A.S.; Pereira, D.A.D.I.; Cavalcanti, A.M.; Fonseca, E.O.L.; Kirchner, F.; Badaró, R. Application of convolutional neural networks for classification of adult mosquitoes in the field. PLoS ONE 2019, 14, e0210829. [Google Scholar] [CrossRef] [PubMed]
  20. Nolte, K.; Sauer, F.G.; Baumbach, J.; Kollmannsberger, P.; Lins, C.; Lühken, R. Robust mosquito species identification from diverse body and wing images using deep learning. Parasit. Vectors 2024, 17, 372. [Google Scholar] [CrossRef] [PubMed]
  21. Qian, Y.; Xiao, Z.; Deng, Z. Fine-grained crop pest classification based on multi-scale feature fusion and mixed attention mechanisms. Front. Plant Sci. 2025, 16, 1500571. [Google Scholar] [CrossRef] [PubMed]
  22. Hou, Y.; Gould, S.; Zheng, L. Learning to Select Views for Efficient Multi-View Understanding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 17–21 June 2024; pp. 20135–20144. [Google Scholar]
  23. An, J.; Du, Y.; Hong, P.; Liu, J.; Sun, Y.; Li, Y. Insect recognition based on complementary features from multiple views. Sci. Rep. 2023, 13, 2966. [Google Scholar] [CrossRef] [PubMed]
  24. Lan, Z.; Mao, W.; Li, H.; Wang, L.; Wang, T.; Fan, H.; Yoshie, O. BFA: Best-Feature-Aware Fusion for Multi-View Fine-grained Manipulation. arXiv 2025, arXiv:2502.11161. [Google Scholar] [CrossRef]
  25. Cordonnier, J.; Mahendran, A.; Dosovitskiy, A.; Weissenborn, D.; Uszkoreit, J.; Unterthiner, T. Differentiable Patch Selection for Image Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 2351–2360. [Google Scholar]
  26. Yan, X.; Jin, Z.; Han, F.; Ye, Y. Differentiable Information Bottleneck for Deterministic Multi-view Clustering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 17–21 June 2024; pp. 26320–26329. [Google Scholar]
  27. Petersen, F.; Kuehne, H.; Borgelt, C.; Deussen, O. Differentiable Top-k Classification Learning. In Proceedings of the International Conference on Machine Learning (ICML), Baltimore, MD, USA, 17–23 July 2022; pp. 17656–17668. [Google Scholar]
  28. Xie, Y.; Dai, H.; Chen, M.; Dai, B.; Zhao, T.; Zha, H.; Wei, W.; Pfister, T. Differentiable Top-k with Optimal Transport. In Advances in Neural Information Processing Systems (NeurIPS); Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F., Lin, H., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2020; Volume 33, pp. 20520–20531. [Google Scholar]
  29. Betello, F.; Purificato, A.; Vineis, V.; Tolomei, G.; Silvestri, F. One Search Fits All: Pareto-Optimal Eco-Friendly Model Selection. arXiv 2025, arXiv:2505.01468. [Google Scholar]
  30. Qian, C.; Yu, Y.; Tang, K. Pareto Optimization for Subset Selection: Theories and Practical Algorithms. In Proceedings of the Genetic and Evolutionary Computation Conference Companion (GECCO ’25), Malaga, Spain, 14–18 July 2025; pp. 1592–1616. [Google Scholar]
  31. Lomurno, E.; Samele, S.; Matteucci, M.; Ardagna, D. Pareto-Optimal Progressive Neural Architecture Search. In Proceedings of the Genetic and Evolutionary Computation Conference Companion (GECCO ’21), Lille, France, 10–14 July 2021; pp. 1726–1734. [Google Scholar]
  32. Dimitriadis, N.; Frossard, P.; Fleuret, F. Pareto Low-Rank Adapters: Efficient Multi-Task Learning with Preferences. arXiv 2024, arXiv:2407.08056. [Google Scholar]
  33. Lu, B.L.; Wu, H.Y. Classification and Identification of Important Medical Insects of China; Henan Science and Technology Publishing: Zhengzhou, China, 2003. [Google Scholar]
  34. Fan, Z.D. Key to the Common Flies of China; China Science Publishing: Beijing, China, 1992. [Google Scholar]
  35. Woo, S.; Debnath, S.; Hu, R.; Chen, X.; Liu, Z.; Kweon, I.S.; Xie, S. ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 18–22 June 2023; pp. 16133–16142. [Google Scholar]
  36. Ridnik, T.; Ben-Baruch, E.; Payer, A.; Loell, A.; Klaiman, E.; Noy, A.; Zelnik-Manor, L.; Friedman, I. ML-Decoder: Scalable and Versatile Classification Head. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 4–8 January 2022; pp. 375–384. [Google Scholar]
  37. Deng, J.; Dong, W.; Socher, R.; Li, L.-J.; Li, K.; Fei-Fei, L. ImageNet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Miami, FL, USA, 20–25 June 2009; pp. 248–255. [Google Scholar]
  38. Loshchilov, I.; Hutter, F. Decoupled Weight Decay Regularization. In Proceedings of the International Conference on Learning Representations (ICLR), New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
  39. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is All you Need. In Advances in Neural Information Processing Systems; Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: Long Beach, CA, USA, 2017; Volume 30. [Google Scholar]
  40. Louizos, C.; Welling, M.; Kingma, D.P. Learning Sparse Neural Networks through L0 Regularization. In Proceedings of the International Conference on Learning Representations (ICLR), Vancouver, BC, Canada, 30 April–3 May 2018. [Google Scholar]
  41. Khosla, P.; Teterwak, P.; Wang, C.; Sarna, A.; Tian, Y.; Isola, P.; Maschinot, A.; Liu, C.; Krishnan, D. Supervised Contrastive Learning. In Advances in Neural Information Processing Systems; Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F., Lin, H., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2020; Volume 33, pp. 18661–18673. [Google Scholar]
Figure 1. Representative images of the Calyptratae. The dataset includes 12 species of Calyptratae.
Figure 1. Representative images of the Calyptratae. The dataset includes 12 species of Calyptratae.
Insects 17 00381 g001
Figure 2. Representative diagnostic morphological structures captured for the primary fly dataset. Aldrichina grahami is shown as an example illustrating the 8 targeted morphological views.
Figure 2. Representative diagnostic morphological structures captured for the primary fly dataset. Aldrichina grahami is shown as an example illustrating the 8 targeted morphological views.
Insects 17 00381 g002
Figure 3. Comparison of single-view classification performance and multi-view fusion. (A) Top-1 accuracy obtained from individual morphological views and the multi-view fusion model. (B) Species-wise change in the number of correctly classified samples (Δ), with n denoting the number of validation samples for each species.
Figure 3. Comparison of single-view classification performance and multi-view fusion. (A) Top-1 accuracy obtained from individual morphological views and the multi-view fusion model. (B) Species-wise change in the number of correctly classified samples (Δ), with n denoting the number of validation samples for each species.
Insects 17 00381 g003
Figure 4. Performance analysis and Pareto-optimized part selection. (A,B) Evolution of gating probabilities (A) and selection dynamics (B) for eight morphological views over 300 training epochs under the lambda threshold corresponding to the recommended configuration, where P(z = 1) represents the retention probability of each view and the dashed line indicates the selection threshold under this lambda constraint. (C) Trade-off between validation accuracy, number of selected parts (1 to 8), and computational cost (FLOPs), with the asterisk marking the optimized configuration. (D) Radar chart evaluating multi-dimensional performance metrics across different selection strategies. (E) Confusion matrix for the recommended 5-view configuration (left), and the species-wise change in the number of correctly classified samples relative to the all-view baseline (right).
Figure 4. Performance analysis and Pareto-optimized part selection. (A,B) Evolution of gating probabilities (A) and selection dynamics (B) for eight morphological views over 300 training epochs under the lambda threshold corresponding to the recommended configuration, where P(z = 1) represents the retention probability of each view and the dashed line indicates the selection threshold under this lambda constraint. (C) Trade-off between validation accuracy, number of selected parts (1 to 8), and computational cost (FLOPs), with the asterisk marking the optimized configuration. (D) Radar chart evaluating multi-dimensional performance metrics across different selection strategies. (E) Confusion matrix for the recommended 5-view configuration (left), and the species-wise change in the number of correctly classified samples relative to the all-view baseline (right).
Insects 17 00381 g004
Figure 5. Comparison of single-view classification performance and multi-view fusion.
Figure 5. Comparison of single-view classification performance and multi-view fusion.
Insects 17 00381 g005
Figure 6. Pareto-optimized part selection for the Culicidae dataset. (A) Evolution of gating probabilities for 11 morphological views under the recommended λ configuration. (B) Trade-off between validation accuracy, number of selected parts, and computational cost (FLOPs). (C) Radar chart of multi-dimensional performance metrics for the recommended 2-part configuration. (D) Confusion matrix for the recommended configuration.
Figure 6. Pareto-optimized part selection for the Culicidae dataset. (A) Evolution of gating probabilities for 11 morphological views under the recommended λ configuration. (B) Trade-off between validation accuracy, number of selected parts, and computational cost (FLOPs). (C) Radar chart of multi-dimensional performance metrics for the recommended 2-part configuration. (D) Confusion matrix for the recommended configuration.
Insects 17 00381 g006
Table 1. Quantitative analysis of Pareto-optimization strategies for Calyptratae.
Table 1. Quantitative analysis of Pareto-optimization strategies for Calyptratae.
λ Number of PartsAccuracy (%)Accuracy Drop (%)FLOPs (G)FLOPs Saved (%)Configuration
0.1, 0.2887.040.0011.310.00All-view
0.3787.040.009.9012.47Reduced
0.4586.110.937.0737.49Recommended
0.5384.262.784.2462.51Aggressive
0.6183.333.711.4187.53Single-view
Note: For clarity, the table uses “all-view” to denote the configuration using all captured morphological views, “reduced” to denote intermediate Pareto-derived settings, “recommended” to denote the preferred operating point highlighted in the main analysis, and “aggressive” to denote the more strongly compressed setting with a clearer accuracy trade-off.
Table 2. Quantitative analysis of Pareto-optimization strategies for Culicidae.
Table 2. Quantitative analysis of Pareto-optimization strategies for Culicidae.
λ Number of PartsAccuracy (%)Accuracy Drop (%)FLOPs (G)FLOPs Saved (%)Configuration
0.0011100.000.0049.560.00All-view
0.059100.000.0040.5518.20Reduced
0.108100.000.0036.0427.28Reduced
0.153100.000.0013.5272.72Reduced
0.202100.000.009.0181.82Recommended
0.25197.182.824.5087.53Single-view
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lin, N.; Wang, J.; Qian, Y.; Wei, L.; Liu, H.; Dai, B.; Zhuang, S.; Zhang, D. Multi-View Pareto Optimization for Minimal-Diagnostic-Set Identification of Disease Vectors. Insects 2026, 17, 381. https://doi.org/10.3390/insects17040381

AMA Style

Lin N, Wang J, Qian Y, Wei L, Liu H, Dai B, Zhuang S, Zhang D. Multi-View Pareto Optimization for Minimal-Diagnostic-Set Identification of Disease Vectors. Insects. 2026; 17(4):381. https://doi.org/10.3390/insects17040381

Chicago/Turabian Style

Lin, Nuofei, Jingjing Wang, Yixiang Qian, Li Wei, Hongxia Liu, Bo Dai, Songlin Zhuang, and Dawei Zhang. 2026. "Multi-View Pareto Optimization for Minimal-Diagnostic-Set Identification of Disease Vectors" Insects 17, no. 4: 381. https://doi.org/10.3390/insects17040381

APA Style

Lin, N., Wang, J., Qian, Y., Wei, L., Liu, H., Dai, B., Zhuang, S., & Zhang, D. (2026). Multi-View Pareto Optimization for Minimal-Diagnostic-Set Identification of Disease Vectors. Insects, 17(4), 381. https://doi.org/10.3390/insects17040381

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop