Wide + Tiles Vision Transformer Framework for Smartphone-Based Grassland Biomass Prediction in Heterogeneous Field Conditions
Abstract
1. Introduction
2. Materials and Methods
- Development of digital maps of pasture lands by administrative districts;
- Selection and thematic processing of remote sensing data to identify classification units by types of pasture vegetation;
- Preparation of test plots with presumed types of pasture vegetation and preliminary analysis of the study area;
- Conducting ground-based geobotanical surveys according to the approved methodology within test plots by districts;
- Development of scientifically grounded livestock grazing load standards by districts based on spatial analysis of pasture vegetation types.
3. Results
3.1. Ablation Study
3.2. Error Analysis Across Biomass Ranges
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Theau, J.; Lauzier-Hudon, É.; Aubé, L.; Devillers, N. Estimation of forage biomass and vegetation cover in grasslands using UAV imagery. PLoS ONE 2021, 16, e0245784. [Google Scholar] [CrossRef] [PubMed]
- da Silva, R.C.E.; Tommaselli, A.M.G.; Imai, N.N.; Martins-Neto, R.P.; da Silva da Silveira, D.; Moro, E. Dry mass grassland estimation using UAV ultra-wide RGB images. ISPRS Ann. 2024, X-3, 69–75. [Google Scholar] [CrossRef]
- Spaeth, K.E.; Weltz, M.A.; Nesbit, J.; Qi, J.; Rutherford, W.A.; Williams, C.J.; Toledo, D.; Newingham, B.A.; Iskakova, G.; Kussainova, M.; et al. Rangeland resource assessment in the Aqmola region of Kazakhstan. Rangel. Ecol. Manag. 2025, 98, 389–398. [Google Scholar] [CrossRef]
- Zhao, B.; Hiller, J.; Awada, T.; Wardlow, B.; Erickson, G.; Shi, Y. Forage biomass estimation using UAV-based remote sensing and machine learning. Ecol. Inform. 2025, 90, 103361. [Google Scholar] [CrossRef]
- Zhang, H.; Sun, Y.; Chang, L.; Qin, Y.; Chen, J.; Qin, Y.; Du, J.; Yi, S.; Wang, Y. Estimation of grassland canopy height and aboveground biomass at the quadrat scale using UAV. Remote Sens. 2018, 10, 851. [Google Scholar] [CrossRef]
- Grüner, E.; Astor, T.; Wachendorf, M. Biomass prediction of heterogeneous temperate grasslands using an SfM approach based on UAV imaging. Agronomy 2019, 9, 54. [Google Scholar] [CrossRef]
- Lussem, U.; Bolten, A.; Menne, J.; Gnyp, M.L.; Schellberg, J.; Bareth, G. Estimating biomass in temperate grassland with high resolution canopy surface models from UAV-based RGB images and vegetation indices. J. Appl. Remote Sens. 2019, 13, 034525. [Google Scholar] [CrossRef]
- Bendig, J.; Bolten, A.; Bennertz, S.; Broscheit, J.; Eichfuss, S.; Bareth, G. Estimating Biomass of Barley Using Crop Surface Models (CSMs) Derived from UAV-Based RGB Imaging. Remote Sens. 2014, 6, 10395–10412. [Google Scholar] [CrossRef]
- Viljanen, N.; Honkavaara, E.; Näsi, R.; Hakala, T.; Niemeläinen, O.; Kaivosoja, J. A novel machine learning method for estimating biomass of grass swards using a photogrammetric canopy height model. Agriculture 2018, 8, 70. [Google Scholar] [CrossRef]
- Acorsi, M.G.; das Dores Abati Miranda, F.; Martello, M.; Smaniotto, D.A.; Sartor, L.R. Estimating biomass of black oat using UAV-based RGB imaging. Agronomy 2019, 9, 344. [Google Scholar] [CrossRef]
- Guan, Q.; Jiang, M.; Du, W.; Chen, X.; Yan, B. Integrating UAV Visible and Multispectral Imagery to Assess Grazing-Induced Vegetation Responses in Sandy Grasslands. Front. Plant Sci. 2025, 16, 1730583. [Google Scholar] [CrossRef] [PubMed]
- Zhang, H.; Tang, Z.; Wang, B.; Meng, B.; Qin, Y.; Sun, Y.; Lv, Y.; Zhang, J.; Yi, S. A non-destructive method for rapid acquisition of grassland aboveground biomass for satellite ground verification using UAV RGB images. Glob. Ecol. Conserv. 2022, 33, e01999. [Google Scholar] [CrossRef]
- Schreiber, L.V.; Amorim, J.G.A.; Guimarães, L.; Matos, D.M.; da Costa, C.M.; Parraga, A. Aboveground biomass wheat estimation: Deep learning with UAV-based RGB images. Appl. Artif. Intell. 2022, 36, 2055392. [Google Scholar] [CrossRef]
- Castro, W.; Junior, J.M.; Polidoro, C.; Osco, L.P.; Gonçalves, W.; Rodrigues, L.; Santos, M.; Jank, L.; Barrios, S.; Valle, C.; et al. Deep learning applied to phenotyping of biomass in forages with UAV-based RGB imagery. Sensors 2020, 20, 4802. [Google Scholar] [CrossRef] [PubMed]
- Karila, K.; Oliveira, R.A.; Ek, J.; Kaivosoja, J.; Koivumäki, N.; Korhonen, P.; Niemeläinen, O.; Nyholm, L.; Näsi, R.; Pölönen, I.; et al. Estimating grass sward quality and quantity parameters using drone remote sensing with deep neural networks. Remote Sens. 2022, 14, 2692. [Google Scholar] [CrossRef]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An image is worth 16×16 words: Transformers for image recognition at scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Alvarez-Mendoza, C.I.; Guzman, D.; Casas, J.; Bastidas, M.; Polanco, J.; Valencia-Ortiz, M.; Montenegro, F.; Arango, J.; Ishitani, M.; Selvaraj, M.G. Predictive modeling of above-ground biomass in Brachiaria pastures from satellite and UAV imagery. Remote Sens. 2022, 14, 5870. [Google Scholar] [CrossRef]
- Liu, W.; Xu, C.; Zhang, Z.; De Boeck, H.; Wang, Y.; Zhang, L.; Xu, X.; Zhang, C.; Chen, G.; Xu, C. Machine learning-based grassland aboveground biomass estimation and its response to climate variation in Southwest China. Front. Ecol. Evol. 2023, 11, 1146850. [Google Scholar] [CrossRef] [PubMed]
- Dong, W.; Mitchard, E.T.A.; Yu, H.; Hancock, S.; Ryan, C.M. Forest aboveground biomass estimation using GEDI and earth observation data through attention-based deep learning. arXiv 2023, arXiv:2311.03067. [Google Scholar]
- Tian, X.; Li, J.; Zhang, F.; Zhang, H.; Jiang, M. Forest aboveground biomass estimation using multisource remote sensing data and deep learning algorithms. Remote Sens. 2024, 16, 1074. [Google Scholar] [CrossRef]
- Fan, X.; He, G.; Zhang, W.; Long, T.; Zhang, X.; Wang, G.; Sun, G.; Zhou, H.; Shang, Z.; Song, X. Sentinel-2 Images Based Modeling of Grassland Above-Ground Biomass Using Random Forest Algorithm: A Case Study on the Tibetan Plateau. Remote Sens. 2022, 14, 5321. [Google Scholar] [CrossRef]
- Kalmani, V.H.; Gadekar, P.R.; Adamuthe, A.C. Deep learning-based RGB image modelling for multi-component pasture biomass estimation in precision grazing. Agric. Sci. Dig. 2026; in press.
- Nakajima, K.; Saito, K.; Tsujimoto, Y.; Takai, T.; Mochizuki, A.; Yamaguchi, T.; Ibrahim, A.; Mairoua, S.G.; Andrianary, B.H.; Katsura, K.; et al. Robustness of RGB image-based estimation for rice above-ground biomass using multi-location datasets. Smart Agric. Technol. 2025, 11, 100998. [Google Scholar] [CrossRef]
- Kan, X.; Lu, Z.; Zhang, Y.; Zhu, L.; Lim Kam Sian, K.T.C.; Wang, J.; Liu, X.; Zhou, Z.; Cao, H. DSRSS-Net: Improved-Resolution Snow Cover Mapping from FY-4A Satellite Images Using the Dual-Branch Super-Resolution Semantic Segmentation Network. Remote Sens. 2023, 15, 4431. [Google Scholar] [CrossRef]
- Dhawi, F.; Ghafoor, A.; Almousa, N.; Ali, S.; Alqanbar, S. Predictive modelling employing machine learning, CNNs, and smartphone RGB images for non-destructive biomass estimation of pearl millet (Pennisetum glaucum). Front. Plant Sci. 2025, 16, 1594728. [Google Scholar] [CrossRef] [PubMed]
- Liao, Q.; Wang, D.; Haling, R.; Liu, J.; Li, X.; Plomecka, M.; Robson, A.; Pringle, M.; Pirie, R.; Walker, M.; et al. Estimating pasture biomass from top-view images: A dataset for precision agriculture. arXiv 2025, arXiv:2510.22916. [Google Scholar]
- Woodrow, L.; Carter, J.; Fraser, G.; Barnetson, J. Using Continuous Output Neural Nets to Estimate Pasture Biomass from Digital Photographs in Grazing Lands. AgriEngineering 2023, 5, 1051–1067. [Google Scholar] [CrossRef]
- Mandal, M. Fusion Complexity Inversion: Why Simpler Cross View Modules Outperform SSMs and Cross View Attention Transformers for Pasture Biomass Regression. arXiv 2026, arXiv:2603.07819. [Google Scholar]
- Rueda-Ayala, V.P.; Peña, J.M.; Höglind, M.; Bengochea-Guevara, J.M.; Andújar, D. Comparing UAV-based technologies and RGB-D reconstruction methods for plant height and biomass monitoring on grass ley. Sensors 2019, 19, 535. [Google Scholar] [CrossRef] [PubMed]
- Vahidi, M.; Shafian, S.; Thomas, S.; Maguire, R. Pasture biomass estimation using ultra-high-resolution RGB UAV images and deep learning. Remote Sens. 2023, 15, 5714. [Google Scholar] [CrossRef]
- Grüner, E.; Wachendorf, M.; Astor, T. The potential of UAV-borne spectral and textural information for predicting aboveground biomass and nitrogen fixation in legume-grass mixtures. PLoS ONE 2020, 15, e0234703. [Google Scholar] [CrossRef] [PubMed]
- Siméoni, O.; Vo, H.V.; Seitzer, M.; Baldassarre, F.; Oquab, M.; Jose, C.; Khalidov, V.; Szafraniec, M.; Yi, S.; Ramamonjisoa, M.; et al. DINOv3. arXiv 2025, arXiv:2508.10104. [Google Scholar]
- Caron, M.; Touvron, H.; Misra, I.; Jegou, H.; Mairal, J.; Bojanowski, P.; Joulin, A. Emerging properties in self-supervised vision transformers. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In Advances in Neural Information Processing Systems (NeurIPS); Curran Associates, Inc.: Red Hook, NY, USA, 2017. [Google Scholar]
- Woo, S.; Debnath, S.; Hu, R.; Chen, X.; Liu, Z.; Kweon, I.S.; Xie, S. ConvNeXt V2: Co-designing and scaling ConvNets with masked autoencoders. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023. [Google Scholar]
- Arystanov, A.; Karabkina, N.; Sagin, J.; Nurguzhin, M.; King, R.; Bekseitova, R. Use of Indices Applied to Remote Sensing for Establishing Winter–Spring Cropping Areas in the Republic of Kazakhstan. Sustainability 2024, 16, 7548. [Google Scholar] [CrossRef]
- Kabzhanova, G.; Arystanova, R.; Bissembayev, A.; Arystanov, A.; Sagin, J.; Nasiyev, B.; Kurmasheva, A. Remote Sensing Applications for Pasture Assessment in Kazakhstan. Agronomy 2025, 15, 526. [Google Scholar] [CrossRef]
- Arystanov, A.; Sagin, J.; Karabkina, N.; Arystanova, R.; Yermekov, F.; Kabzhanova, G.; Bekseitova, R.; Aktymbayeva, A.; Kutymova, N. Automatic Classification of Agricultural Crops Using Sentinel-2 Data in the Rainfed Zone of Southern Kazakhstan. Agronomy 2025, 15, 2040. [Google Scholar] [CrossRef]
- Arystanov, A.; Sagin, J.; Kabzhanova, G.; Sarsekova, D.; Bekseitova, R.; Molzhigitova, D.; Balkozha, M.; Yeleuova, E.; Satvaldiyev, B. Winter Cereal Re-Sowing and Land-Use Sustainability in the Foothill Zones of Southern Kazakhstan Based on Sentinel-2 Data. Sustainability 2026, 18, 1053. [Google Scholar] [CrossRef]
- Sarsekova, D.; Sagin, J.; Perzadayeva, A.; Arystanova, R.; Arystanov, A.; Kezheneva, A.; Jumassultanova, S.; Satybaldiyeva, G.; Ospangaliyev, A. Farmers’ Land Sustainability Improvement with Soil, Geology, and Water Retention Assessment in North Kazakhstan. Sustainability 2026, 18, 1316. [Google Scholar] [CrossRef]
- Arystanov, A.; Arystanova, R.; Boribay, E.; Sagin, J.; Karabkina, N.; Sarsekova, D.; Perzadayeva, A.; Munaitpassova, A.; Yelikbayeva, S.; Tleubekuly, E. Interannual Dynamics of Fallow Land Extent in North Kazakhstan Based on Sentinel-2 Data for the Recent Period (2021–2025). Agronomy 2026, 16, 1008. [Google Scholar] [CrossRef]
- Mukanov, Y.; Arystanova, R.; Sagin, J.; Samarkhanov, K.; Usmanov, T.; Baisholanov, S.; Arystanov, A.; Koshim, A.; Duisebek, B.; Zhukenova, A. A Statistical Analysis of Multi-Decadal Trends in Temperature, Precipitation and Drought Indices in Eastern and Southeastern Kazakhstan Between 1981 and 2023. Agronomy 2026, 16, 1097. [Google Scholar] [CrossRef]
- Kyrgyzbay, K.; Usmanov, T.; Sagin, J.; Duisebek, B.; Arystanova, R.; Kulbekova, S.; Utepov, A.; Amanzholova, R. Spatial Assessment of Flood Susceptibility in the Abai Region, Kazakhstan. Water 2026, 18, 817. [Google Scholar] [CrossRef]
- Varthani, S.; Donaghy, D.J.; Kenyon, P.R.; Sneddon, N.W.; Cartmill, A.D. Measuring Herbage Mass: A Review. Agronomy 2025, 15, 2264. [Google Scholar] [CrossRef]
- Huber, P.J. Robust Estimation of a Location Parameter. Ann. Math. Stat. 1964, 35, 73–101. [Google Scholar] [CrossRef]











| Augmentation | Description | Parameters | p |
|---|---|---|---|
| Geometric Transformations | |||
| RandomScale + PadBlack | Randomly scales the image and fills empty areas with black color | scale ∈ [0.90, 1.00] | 0.20 |
| RandomResizedCrop | Randomly crops a region of the image and resizes it | scale ∈ [0.85, 1.00]; ratio ∈ [0.90, 1.10] | 1.00 |
| HorizontalFlip | Horizontal flipping | – | 0.50 |
| VerticalFlip | Vertical flipping | – | 0.50 |
| RandomRotate90 | Rotation by multiples of 90° | – | 0.50 |
| Rotate | Small-angle arbitrary rotation | limit ∈ [−3°, +3°] | 0.50 |
| Photometric Transformations | |||
| RandomBrightnessContrast | Adjusts brightness and contrast | Δbrightness, Δcontrast ≤ 0.15 | 0.60 |
| HueSaturationValue | Adjusts hue, saturation, and value | hue ≤ 8; sat ≤ 18; val ≤ 12 | 0.35 |
| CLAHE | Enhances local contrast | clip_limit ∈ [1, 3]; tile = 8 × 8 | 0.15 |
| Sharpen | Sharpens the image | – | 0.10 |
| GaussNoise | Adds Gaussian noise | – | 0.15 |
| Plant-Specific Spectral Transformations | |||
| EnhanceGreenLAB | Enhances the green channel in LAB color space | a-scale = 0.70 | 0.20 |
| GreenChannelEmphasis | Computes green index (g/(r + b)) | – | 0.08 |
| ChannelShuffle | Randomly permutes RGB channels | – | 0.10 |
| Regularization Transformations | |||
| RandomErasing | Randomly removes a region (filled with noise) | erase_area ∈ [1%, 12%] | 0.25 |
| Subset | Administrative Districts | Number of Districts |
|---|---|---|
| Training | Sarkand; Eskeldi; Korday; Karatal; Alakol; Merki; Koksu; Ryskulov; Aksu; Shu; Talas; Panfilov; Zhambyl; Kazygurt; Keles; Aral; Maktaaral; Sarysu; Zhanakorgan; Tolebi; Shardara; Ordabasy; Tulkibas; Zhetysai; Baidibek; Karmakshi; Kazaly; Syrdarya; Shieli; Sozak; Zhalagash; Turkestan | 32 |
| Validation | Zhualy; Baizak; Kerbulak; Moiynkum; Saryagash; Sauran; Sairam; Otyrar | 8 |
| Model | Input | Grid | Window/Tile, px | MAE | RMSE | R2 |
|---|---|---|---|---|---|---|
| DINOv3 | original | 3 × 3 | 1024/896 | 0.774 | 1.186 | 0.732 |
| DINOv3 | masked | 3 × 3 | 1024/896 | 0.7 79 | 1.183 | 0.733 |
| DINOv3 | cropped | 2 × 3 | 1024/896 | 0.794 | 1.190 | 0.731 |
| DINOv3 | cropped + strong aug | 2 × 3 | 1024/992 | 0.826 | 1.245 | 0.705 |
| DINOv3 | Original-tiles | 3 × 3 | 896 | 0.794 | 1.268 | 0.696 |
| DINOv3 | Original-wide | 1024 | 0.978 | 1.305 | 0.676 | |
| DINOv3 | original (small) | 3 × 3 | 896/640 | 0.904 | 1.384 | 0.671 |
| ConvNeXtV2 | cropped | 2 × 3 | 1024/896 | 0.951 | 1.425 | 0.614 |
| Model | Input | Grid | Window/Tile, px | MAE | RMSE | R2 |
|---|---|---|---|---|---|---|
| DINOv3 (SSL) | original | 3 × 3 | 1024/896 | 0.896 | 1.454 | 0.598 |
| DINOv3 | cropped + bin cls | 2 × 3 | 1024/896 | 0.868 | 1.407 | 0.623 |
| Model | Input/Features | Train Samples | Validation Samples | MAE, c/ha | RMSE, c/ha | R2 | Relative MAE, % |
|---|---|---|---|---|---|---|---|
| Random Forest | RGB statistics + vegetation indices + GLCM texture features + 3 × 3 spatial features | 956 | 240 | 1.495 | 2.227 | 0.082 | 40.4 |
| XGBoost | RGB statistics + vegetation indices + GLCM texture features + 3 × 3 spatial features | 956 | 240 | 1.472 | 2.232 | 0.078 | 39.8 |
| CatBoost | RGB statistics + vegetation indices + GLCM texture features + 3 × 3 spatial features | 956 | 240 | 1.461 | 2.223 | 0.085 | 39.5 |
| Biomass Range | Number of Samples | Mean Observed Biomass, c/ha | MAE, c/ha | RMSE, c/ha | Relative MAE, % |
|---|---|---|---|---|---|
| Low biomass (0–2 c/ha) | 70 | 1.754 | 0.281 | 0.422 | 16.0 |
| Medium biomass (>2–6 c/ha) | 132 | 3.159 | 0.786 | 1.109 | 24.9 |
| High biomass (>6 c/ha) | 38 | 7.829 | 1.672 | 2.059 | 21.4 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Arystanova, R.; Zeinulla, D.; Kabzhanova, G.; Bissembayev, A.; Bekseitova, R.; Sarsekova, D.; Saule, B.; Arystanov, A.; Sagin, J.; Nurtay, M. Wide + Tiles Vision Transformer Framework for Smartphone-Based Grassland Biomass Prediction in Heterogeneous Field Conditions. Agriculture 2026, 16, 1401. https://doi.org/10.3390/agriculture16131401
Arystanova R, Zeinulla D, Kabzhanova G, Bissembayev A, Bekseitova R, Sarsekova D, Saule B, Arystanov A, Sagin J, Nurtay M. Wide + Tiles Vision Transformer Framework for Smartphone-Based Grassland Biomass Prediction in Heterogeneous Field Conditions. Agriculture. 2026; 16(13):1401. https://doi.org/10.3390/agriculture16131401
Chicago/Turabian StyleArystanova, Ranida, Darkhan Zeinulla, Gulnara Kabzhanova, Anuarbek Bissembayev, Roza Bekseitova, Dani Sarsekova, Bakhbayeva Saule, Asset Arystanov, Janay Sagin, and Margulan Nurtay. 2026. "Wide + Tiles Vision Transformer Framework for Smartphone-Based Grassland Biomass Prediction in Heterogeneous Field Conditions" Agriculture 16, no. 13: 1401. https://doi.org/10.3390/agriculture16131401
APA StyleArystanova, R., Zeinulla, D., Kabzhanova, G., Bissembayev, A., Bekseitova, R., Sarsekova, D., Saule, B., Arystanov, A., Sagin, J., & Nurtay, M. (2026). Wide + Tiles Vision Transformer Framework for Smartphone-Based Grassland Biomass Prediction in Heterogeneous Field Conditions. Agriculture, 16(13), 1401. https://doi.org/10.3390/agriculture16131401

