Investigating Sibilant Fricative Representation in Bangla Telemedicine Speech: A Cost-Aware Sampling Rate Optimization Study
Abstract
1. Introduction
2. Materials and Methods
2.1. Study Design
2.2. Speech Data Collection and Recording Protocol
2.3. Audio Preprocessing and ASR Evaluation Pipeline
2.4. Computational Latency and Bandwidth Cost Measurement
2.5. Acoustic Analysis for Sibilant Characterization
2.6. Frame-Level Sibilant Likelihood Estimation
2.7. Sampling Rate-Dependent ASR Performance Analysis
2.8. Low-Pass Filtering Control Analysis
2.9. Elbow-Point Detection: Identifying Diminishing Returns
2.10. Pareto Frontier Analysis: Balancing Accuracy and Bandwidth
2.11. Composite Scoring and Minimum Acceptable Trade-Off Selection
2.12. Ethical Considerations
3. Results
3.1. Sibilant-Related Acoustic Measures
3.2. Sampling Rate Optimization
3.3. Low-Pass Filtered High-Rate Control Condition
3.4. Elbow-Point Detection: Identifying Diminishing Returns
3.5. Pareto Frontier Analysis: Balancing Accuracy and Bandwidth
3.6. Weighted Scoring Model: Composite Ranking of Configurations
3.7. Minimum Acceptable Trade-Off (MAT): Cost-Efficient Near-Optiomal Accuracy
4. Discussion
4.1. Interpretation of Sibilant Acoustic Cues
4.2. Sampling Rate-Dependent Accuracy Gains and Diminishing Returns
4.3. Isolating the Contribution of Extended High-Frequency Information
4.4. Elbow Point Identification of the Accuracy-Efficiency Trade-Off
4.5. Pareto-Optimal Balance Between Accuracy and Bandwidth
4.6. Composite Ranking and Minimum Acceptable Trade-Off Selection
4.7. Recommended Sampling Rate for Bangla Medical Telehealth ASR
4.8. Limitations
4.9. Comparison with Prior Work
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| ASR | Automatic Speech Recognition |
| WER | Word Error Rate |
| MAT | Minimum Acceptable Trade-off |
| LRL | Low-Resource Language |
| LPF | Low-Pass Filter |
| HF | High Frequency |
| IQR | Interquartile Range |
References
- Magueresse, A.; Carles, V.; Heetderks, E. Low-resource languages: A review of past work and future challenges. arXiv 2020, arXiv:2006.07264. [Google Scholar] [CrossRef]
- Kibria, S.; Samin, A.M.; Kobir, M.H.; Rahman, M.S.; Selim, M.R.; Iqbal, M.Z. Bangladeshi Bangla speech corpus for automatic speech recognition research. Speech Commun. 2022, 136, 84–97. [Google Scholar] [CrossRef]
- Aiman, U.; Islam, M.N.; Chowdhury, M.H.; Rahman, M.S.; Habib, M.T.; Hasan, M. BRADS and BRWDS: Multipurpose audio and text datasets for automatic Bangla regional speech recognition. Data Brief 2025, 63, 112177. [Google Scholar] [CrossRef] [PubMed]
- Hossain, S.; Rihan, M.R.; Imtiaz, A.; Boni, P.; Gomes, D. Enhancing Bangla local speech-to-text conversion using fine-tuning Wav2vec 2.0 with OpenSLR and self-compiled datasets through transfer learning. In 7th IEOM Bangladesh International Conference on Industrial Engineering and Operations Management; IEOM Society International: Southfield, MI, USA, 2024; Volume 20240161. [Google Scholar] [CrossRef]
- Nandi, R.N.; Menon, M.; Muntasir, T.; Sarker, S.; Muhtaseem, Q.S.; Islam, M.T.; Chowdhury, S.; Alam, F. Pseudo-labeling for domain-agnostic Bangla automatic speech recognition. In Proceedings of the First Workshop on Bangla Language Processing (BLP-2023); Association for Computational Linguistics: Stroudsburg, PA, USA, 2023; pp. 152–162. [Google Scholar]
- Rakib, F.R.; Dip, S.S.; Alam, S.; Tasnim, N.; Shihab, M.I.; Ansary, M.N.; Hossen, S.M.; Meghla, M.H.; Mamun, M.; Sadeque, F.; et al. Ood-speech: A large Bengali speech recognition dataset for out-of-distribution benchmarking. arXiv 2023, arXiv:2305.09688. [Google Scholar]
- Rakib, M.; Hossain, M.I.; Mohammed, N.; Rahman, F. Bangla-wave: Improving Bangla automatic speech recognition utilizing n-gram language models. In Proceedings of the 12th International Conference on Software and Computer Applications; Association for Computing Machinery: New York, NY, USA, 2023; pp. 297–301. [Google Scholar]
- Just, S.A.; Elvevåg, B.; Pandey, S.; Nenchev, I.; Bröcker, A.L.; Montag, C.; Morgan, S.E. Moving beyond word error rate to evaluate automatic speech recognition in clinical samples: Lessons from research into schizophrenia-spectrum disorders. Psychiatry Res. 2025, 352, 116690. [Google Scholar] [CrossRef] [PubMed]
- Mani, A.; Palaskar, S.; Konam, S. Towards understanding ASR error correction for medical conversations. In Proceedings of the First Workshop on Natural Language Processing for Medical Conversations; Association for Computational Linguistics: Stroudsburg, PA, USA, 2020; pp. 7–11. [Google Scholar]
- Klusty, M.A.; Logan, W.V.; Armstrong, S.E.; Mullen, A.D.; Leach, C.N.; Calvert, K.; Talbert, J.; Bumgardner, V.C. Toward automated clinical transcriptions. AMIA Summits Transl. Sci. Proc. 2025, 2025, 235–241. [Google Scholar] [PubMed]
- Salloum, W.; Edwards, E.; Ghaffarzadegan, S.; Suendermann-Oeft, D.; Miller, M. Crowdsourced continuous improvement of medical speech recognition. In AAAI Workshops; AAAI Press: Washington, DC, USA, 2017. [Google Scholar]
- Gonçalves, Y.T.; Alves, J.V.; Sá, B.A.; da Silva, L.N.; de Macedo, J.A.; da Silva, T.L. MedTalkAI: Assisted anamnesis creation with automatic speech recognition. In Simpósio Brasileiro de Banco de Dados (SBBD); SBC: Porto Alegre, Brazil, 2024; pp. 83–88. [Google Scholar]
- Kodish-Wachs, J.; Agassi, E.; Kenny, P., III; Overhage, J.M. A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech. AMIA Annu. Symp. Proc. 2018, 2018, 683. [Google Scholar] [PubMed]
- O’Kane, R.; Stonehouse-Smith, D.; Ota, L.C.; Patel, R.; Johnson, N.; Slipper, C.; Seehra, J.; Papageorgiou, S.N.; Cobourne, M.T. Transcription accuracy of automatic speech recognition for orthodontic clinical records. J. Dent. Res. 2025, 00220345251382452. [Google Scholar] [CrossRef] [PubMed]
- Jongman, A.; Wayland, R.; Wong, S. Acoustic characteristics of English fricatives. J. Acoust. Soc. Am. 2000, 108, 1252–1263. [Google Scholar] [CrossRef] [PubMed]
- Maniwa, K.; Jongman, A.; Wade, T. Acoustic characteristics of clearly spoken English fricatives. J. Acoust. Soc. Am. 2009, 125, 3962–3973. [Google Scholar] [CrossRef] [PubMed]
- Guo, Z.C.; Chandrasekaran, B. Extended high-frequency cues to phoneme recognition: Insights from ASR. In Proceedings of the Interspeech 2025; International Speech Communication Association: Grenoble, France, 2025; pp. 1038–1042. [Google Scholar]
- Kozierski, P.; Sadalla, T.; Drgas, S.; Dabrowski, A.; Giemacki, W. Polish Whispery Speech Recognition—Minimum Sampling Frequency; IEEE: New York, NY, USA, 2017; pp. 611–615. [Google Scholar] [CrossRef]
- Hokking, R.; Woraratpanya, K. A hybrid of fractal code descriptor and harmonic pattern generator for improving speech recognition of different sampling rates. In Recent Advances in Information and Communication Technology 2017; IC2IT 2017; Advances in Intelligent Systems and Computing; Meesad, P., Sodsee, S., Unger, H., Eds.; Springer: Berlin/Heidelberg, Germany, 2018; Volume 566. [Google Scholar] [CrossRef]
- Bauerecker, H.; Nadeu, C.; Padrell, J. On the advantage of frequency-filtering features for speech recognition with variable sampling frequencies: Experiments with speechdatcar databases. In 8th European Conference on Speech Communication and Technology (EUROSPEECH 2003–INTERSPEECH 2003); International Speech Communication Association: Grenoble, France, 2003; pp. 869–872. [Google Scholar]
- Liu, F.H.; Picheny, M. On variable sampling frequencies in speech recognition. In Proceedings of the 5th International Conference on Spoken Language Processing; International Speech Communication Association: Grenoble, France, 1998. [Google Scholar]
- Nadeu, C.; Tolos, M. Recognition experiments with the SpeechDat-Car Aurora Spanish database using 8 kHz- and 16 kHz-sampled signals. In IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU’01); IEEE: New York, NY, USA, 2001; pp. 135–138. [Google Scholar]
- Ssnderson, C.; Paliwal, K.K. Effect of different sampling rates and feature vector sizes on speech recognition performance. In Proceedings of the IEEE TENCON’97; IEEE: New York, NY, USA, 1997; Volume 1, pp. 161–164. [Google Scholar]
- Hirsch, H.G.; Hellwig, K.; Dobler, S. Speech recognition at multiple sampling rates. In Proceedings of the EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event; International Speech Communication Association: Grenoble, France, 2001; pp. 1837–1840. [Google Scholar]
- Guo, Z.C.; Chandrasekaran, B. Extended high frequencies improve phoneme recognition: Evidence from automatic speech recognition in spatial speech mixtures. J. Acoust. Soc. Am. 2025, 158, 3365–3377. [Google Scholar] [CrossRef] [PubMed]
- Roberts, P.J.; Reetz, H.; Lahiri, A. Corpus-testing a fricative discriminator; or, just how invariant is this invariant? In 15th Annual Conference of the International Speech Communication Association; International Speech Communication Association: Grenoble, France, 2014; pp. 189–192. [Google Scholar]
- Steiner, I.M.A. Observations on the Dynamic Control of an Articulatory Synthesizer Using Speech Production Data. Ph.D. Thesis, Karlsruhe Institute of Technology, Karlsruhe, Germany, 2010. [Google Scholar] [CrossRef]
- Hokking, R.; Woraratpanya, K.; Kuroki, Y. Speech recognition of different sampling rates using fractal code descriptor. In 13th International Joint Conference on Computer Science and Software Engineering (JCSSE); IEEE: New York, NY, USA, 2016; pp. 1–5. [Google Scholar]
- Shadle, C.H.; Chen, W.R.; Koenig, L.L.; Preston, J.L. Refining and extending measures for fricative spectra, with special attention to the high-frequency range. J. Acoust. Soc. Am. 2023, 154, 1932–1944. [Google Scholar] [CrossRef] [PubMed]
- Ahmed, A.; Inoue, S.; Kai, E.; Nakashima, N.; Nohara, Y. Portable health clinic: A pervasive way to serve the unreached community for preventive healthcare. In International Conference on Distributed, Ambient, and Pervasive Interactions; Springer: Berlin/Heidelberg, Germany, 2013; pp. 265–274. [Google Scholar]
- Valin, J.M.; Vos, K.; Terriberry, T.B. RFC 6716; Definition of the Opus Audio Codec; Internet Engineering Task Force: Wilmington, DE, USA, 2012; Available online: https://tools.ietf.org/html/rfc6716 (accessed on 25 February 2026).
- Monson, B.B.; Hunter, E.J.; Lotto, A.J.; Story, B.H. The perceptual significance of high-frequency energy in the human voice. Front. Psychol. 2014, 5, 587. [Google Scholar] [CrossRef] [PubMed]
- Forrest, K.; Weismer, G.; Milenkovic, P.; Dougall, R.N. Statistical analysis of word-initial voiceless obstruents: Preliminary data. J. Acoust. Soc. Am. 1988, 84, 115–123. [Google Scholar] [CrossRef] [PubMed]
- Kong, Y.Y.; Mullangi, A.; Kokkinakis, K. Classification of fricative consonants for speech enhancement in hearing devices. PLoS ONE 2014, 9, e95001. [Google Scholar] [CrossRef] [PubMed]
- Ramirez, J.; Górriz, J.M.; Segura, J.C. Voice Activity Detection. Fundamentals and Speech Recognition System Robustness. In Robust Speech Recognition and Understanding; Grimm, M., Kroschel, K., Eds.; I-Tech Education and Publishing: Rijeka, Croatia, 2007; pp. 1–22. [Google Scholar]








| File | Mean Spectral Centroid (Hz) | Mean Spectral Flatness | Mean HF Energy Ratio | Likely Sibilant (%) | Possible Sibilant (%) | Likely Noise (%) | Possible Noise (%) | Unclear (%) |
|---|---|---|---|---|---|---|---|---|
| 1.wav | 1848.223 | 0.131 | 0.053 | 13.381 | 11.053 | 0.003 | 4.961 | 75.495 |
| 2.wav | 2052.420 | 0.159 | 0.065 | 15.334 | 13.676 | 0.024 | 13.064 | 70.772 |
| 3.wav | 2455.571 | 0.223 | 0.086 | 21.260 | 18.793 | 0.123 | 29.383 | 59.295 |
| 4.wav | 2279.628 | 0.212 | 0.079 | 22.857 | 14.090 | 0.002 | 0.647 | 63.049 |
| 5.wav | 2251.891 | 0.191 | 0.070 | 16.377 | 18.411 | 0.004 | 27.373 | 64.520 |
| 6.wav | 2006.801 | 0.146 | 0.063 | 14.819 | 12.467 | 0.009 | 0.983 | 72.714 |
| 7.wav | 1961.270 | 0.178 | 0.067 | 21.365 | 10.505 | 0.003 | 0.107 | 68.130 |
| 8.wav | 2182.456 | 0.194 | 0.067 | 17.379 | 18.413 | 0.001 | 24.211 | 63.946 |
| 9.wav | 2295.956 | 0.184 | 0.064 | 17.086 | 16.954 | 0.028 | 26.026 | 64.986 |
| 10.wav | 2089.816 | 0.160 | 0.059 | 13.879 | 15.507 | 0.168 | 20.544 | 69.885 |
| 11.wav | 2560.026 | 0.200 | 0.075 | 16.072 | 22.242 | 1.646 | 26.995 | 60.316 |
| 12.wav | 3018.359 | 0.287 | 0.109 | 25.231 | 27.668 | 0.019 | 43.224 | 45.737 |
| 13.wav | 2459.080 | 0.217 | 0.078 | 21.203 | 18.540 | 0.001 | 31.632 | 59.265 |
| 14.wav | 2945.454 | 0.304 | 0.122 | 32.794 | 22.905 | 0.001 | 4.490 | 44.283 |
| 15.wav | 2208.959 | 0.147 | 0.058 | 11.967 | 16.210 | 0.173 | 19.653 | 70.579 |
| 16.wav | 2028.531 | 0.127 | 0.052 | 10.604 | 13.692 | 0.029 | 15.575 | 74.709 |
| 17.wav | 2300.926 | 0.139 | 0.053 | 11.054 | 16.990 | 6.150 | 11.756 | 70.871 |
| 18.wav | 2156.108 | 0.119 | 0.052 | 9.706 | 15.578 | 1.534 | 13.286 | 73.008 |
| 19.wav | 2206.857 | 0.126 | 0.046 | 9.906 | 16.646 | 3.797 | 12.088 | 72.233 |
| 20.wav | 1905.613 | 0.105 | 0.049 | 8.470 | 11.743 | 0.073 | 11.358 | 78.530 |
| File Name | Sibilant Frames Count | Sibilant Score | Centroid Component | Flatness Component | HF Energy Component | Sibilant Energy Component | ZCR Component | Skewness Component | Energy Component |
|---|---|---|---|---|---|---|---|---|---|
| 1.wav | 6811 | 0.6096 | 0.8241 | 0.6871 | 0.7594 | 0.6769 | 0.5199 | 0.0377 | 0.0458 |
| 2.wav | 6020 | 0.6037 | 0.8148 | 0.7227 | 0.7726 | 0.6262 | 0.4803 | 0.0550 | 0.0308 |
| 3.wav | 6509 | 0.5974 | 0.8128 | 0.7558 | 0.7776 | 0.5747 | 0.4498 | 0.0615 | 0.0190 |
| 4.wav | 9191 | 0.6268 | 0.8120 | 0.7771 | 0.7786 | 0.6678 | 0.5286 | 0.0479 | 0.0185 |
| 5.wav | 2566 | 0.5889 | 0.8270 | 0.7375 | 0.7640 | 0.5498 | 0.4431 | 0.0577 | 0.0268 |
| 6.wav | 4025 | 0.6092 | 0.7780 | 0.7178 | 0.7420 | 0.7073 | 0.4763 | 0.0791 | 0.0355 |
| 7.wav | 1793 | 0.6533 | 0.8230 | 0.7834 | 0.8095 | 0.7819 | 0.5041 | 0.0300 | 0.0162 |
| 8.wav | 5056 | 0.5971 | 0.8140 | 0.7219 | 0.7338 | 0.5948 | 0.5142 | 0.0494 | 0.0912 |
| 9.wav | 19,148 | 0.5892 | 0.8197 | 0.7098 | 0.7292 | 0.5628 | 0.5090 | 0.0687 | 0.0725 |
| 10.wav | 2094 | 0.5912 | 0.8188 | 0.7170 | 0.7524 | 0.5902 | 0.4507 | 0.0680 | 0.0247 |
| 11.wav | 4167 | 0.5713 | 0.8149 | 0.6869 | 0.7278 | 0.5225 | 0.4434 | 0.0938 | 0.0710 |
| 12.wav | 5621 | 0.5800 | 0.8033 | 0.7546 | 0.7577 | 0.5079 | 0.4170 | 0.0941 | 0.0426 |
| 13.wav | 3925 | 0.5922 | 0.8209 | 0.7360 | 0.7567 | 0.5322 | 0.5096 | 0.0733 | 0.0518 |
| 14.wav | 3064 | 0.6100 | 0.7594 | 0.7871 | 0.7784 | 0.6209 | 0.4811 | 0.1044 | 0.0235 |
| 15.wav | 6023 | 0.5714 | 0.7957 | 0.6623 | 0.7185 | 0.5588 | 0.4675 | 0.0978 | 0.0740 |
| 16.wav | 4981 | 0.5793 | 0.8046 | 0.6538 | 0.7164 | 0.6010 | 0.4765 | 0.0830 | 0.0793 |
| 17.wav | 4838 | 0.5597 | 0.7885 | 0.6170 | 0.6873 | 0.5572 | 0.4716 | 0.1177 | 0.1029 |
| 18.wav | 3066 | 0.5552 | 0.7783 | 0.5552 | 0.6474 | 0.6009 | 0.5221 | 0.1038 | 0.1718 |
| 19.wav | 5377 | 0.5568 | 0.8035 | 0.5858 | 0.6386 | 0.5941 | 0.4899 | 0.0935 | 0.1193 |
| 20.wav | 3310 | 0.5741 | 0.8025 | 0.6299 | 0.6969 | 0.6146 | 0.4990 | 0.0723 | 0.0608 |
| Filename & Time Stamp | Bangla | English Meaning | Fricative (s) Present |
|---|---|---|---|
| 1.wav (2 min 15 s–2 min 18 s) | শুকনো কাশি | Dry Cough | শ (/ʃ/) |
| 6.wav (52 s–56 s) | ওষুধ | Medicine | ষ (/ʂ/) |
| 14.wav (15 s–18 s) | শারীরিক অসুস্থতা | Physical Discomfort | শ (/ʃ/), স (/s/) |
| 17.wav (24 s–29 s) | প্রেসার বেশি | High Pressure | স (/s/), শ (/ʃ/) |
| Sampling Rate (Hz) | Global WER | Latency Median (s) | Latency IQR (s) | Payload Median (kbps) |
|---|---|---|---|---|
| 8000 | 0.3383 | 2.0558 | 0.8758 | 132.0235 |
| 8500 | 0.3036 | 3.4907 | 1.4915 | 136.0235 |
| 15,250 | 0.2570 | 3.5705 | 1.5301 | 244.0235 |
| 15,500 | 0.2463 | 3.5877 | 1.5819 | 248.0235 |
| 15,750 | 0.2492 | 3.6042 | 1.5241 | 252.0235 |
| 16,000 | 0.2505 | 3.6562 | 1.5315 | 256.0235 |
| 16,250 | 0.2478 | 3.7584 | 1.5453 | 260.0235 |
| 16,500 | 0.2433 | 3.5727 | 1.4933 | 264.0235 |
| 16,750 | 0.2437 | 3.8609 | 1.4934 | 268.0235 |
| 17,000 | 0.2420 | 3.6006 | 1.5008 | 272.0235 |
| 17,250 | 0.2341 | 3.7912 | 1.5355 | 276.0235 |
| 17,500 | 0.2346 | 3.8089 | 1.5675 | 280.0235 |
| 17,750 | 0.2411 | 3.7927 | 1.4872 | 284.0235 |
| 18,000 | 0.2444 | 3.9073 | 1.5756 | 288.0235 |
| 18,250 | 0.2357 | 3.7327 | 1.4722 | 292.0235 |
| 18,500 | 0.2389 | 4.0662 | 1.6441 | 296.0235 |
| 18,750 | 0.2320 | 3.5826 | 1.4947 | 297.0235 |
| 19,000 | 0.2420 | 3.8670 | 1.5573 | 304.0235 |
| 19,250 | 0.2400 | 3.9875 | 1.5398 | 308.0235 |
| 19,500 | 0.2378 | 4.1998 | 1.4575 | 312.0235 |
| 19,750 | 0.2333 | 4.1171 | 1.4909 | 318.0235 |
| 20,000 | 0.2342 | 3.9819 | 1.5807 | 324.0235 |
| 21,000 | 0.2379 | 4.0645 | 1.4763 | 336.0235 |
| 25,000 | 0.2317 | 4.1946 | 1.4781 | 400.0235 |
| 26,000 | 0.2394 | 4.0909 | 1.5471 | 416.0235 |
| 27,000 | 0.2400 | 4.0993 | 1.5345 | 432.0235 |
| 31,000 | 0.2352 | 4.0316 | 1.5869 | 496.0235 |
| 32,000 | 0.2312 | 4.1119 | 1.5223 | 512.0235 |
| Sampling Rate (Hz) | Effective Bandwidth | Global WER | Latency Median (s) | Latency IQR (s) | Payload Median kbps |
|---|---|---|---|---|---|
| 32,000 | <=8 kHz | 0.2516 | 3.4115 | 1.6257 | 512.0235 |
| Sampling Rate (Hz) | Global WER | Latency Median (s) | Payload Median (kbps) | Weighted Score |
|---|---|---|---|---|
| 18,750 | 0.2320 | 3.5826 | 297.0235 | 0.2266 |
| 17,250 | 0.2341 | 3.7912 | 276.0235 | 0.2458 |
| 17,500 | 0.2346 | 3.8089 | 280.0235 | 0.2526 |
| 18,250 | 0.2357 | 3.7327 | 292.0235 | 0.2583 |
| 17,000 | 0.2420 | 3.6006 | 272.0235 | 0.2712 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Paul, P.; Bouh, M.M.; Shah, M.V.; Hossain, F.; Ahmed, A. Investigating Sibilant Fricative Representation in Bangla Telemedicine Speech: A Cost-Aware Sampling Rate Optimization Study. Signals 2026, 7, 44. https://doi.org/10.3390/signals7030044
Paul P, Bouh MM, Shah MV, Hossain F, Ahmed A. Investigating Sibilant Fricative Representation in Bangla Telemedicine Speech: A Cost-Aware Sampling Rate Optimization Study. Signals. 2026; 7(3):44. https://doi.org/10.3390/signals7030044
Chicago/Turabian StylePaul, Prajat, Mohamed Mehfoud Bouh, Manan Vinod Shah, Forhad Hossain, and Ashir Ahmed. 2026. "Investigating Sibilant Fricative Representation in Bangla Telemedicine Speech: A Cost-Aware Sampling Rate Optimization Study" Signals 7, no. 3: 44. https://doi.org/10.3390/signals7030044
APA StylePaul, P., Bouh, M. M., Shah, M. V., Hossain, F., & Ahmed, A. (2026). Investigating Sibilant Fricative Representation in Bangla Telemedicine Speech: A Cost-Aware Sampling Rate Optimization Study. Signals, 7(3), 44. https://doi.org/10.3390/signals7030044

