Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (54)

Search Parameters:
Keywords = multi-bit quantizer

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 1723 KB  
Article
Edge-Ready Romanian Language Models: Training, Quantization, and Deployment
by T. A. Diac, P. F. de Viana, A. F. Neagoe, A. Oprea, M. C. Raportaru and A. Nicolin-Żaczek
AI 2026, 7(2), 61; https://doi.org/10.3390/ai7020061 - 6 Feb 2026
Viewed by 214
Abstract
We present RoBaseLM-S (125 M) and RoBaseLM-M (260 M), two compact Romanian decoder-only language models trained from scratch on a 4.3 B-token curated corpus. Architecturally, they follow a modern LLaMA-style recipe with pre-norm RMSNorm, rotary position embeddings, SwiGLU feed-forward blocks, grouped-query attention, and [...] Read more.
We present RoBaseLM-S (125 M) and RoBaseLM-M (260 M), two compact Romanian decoder-only language models trained from scratch on a 4.3 B-token curated corpus. Architecturally, they follow a modern LLaMA-style recipe with pre-norm RMSNorm, rotary position embeddings, SwiGLU feed-forward blocks, grouped-query attention, and 4 k-token context windows. We release both full-precision (FP16) and post-training 5-bit (Q5_K_M) checkpoints in GGUF format for lightweight local inference. The 5-bit variants fit under 500 MB and generate text in real time on a Jetson Nano 4 GB, enabling fully offline Romanian text generation on consumer-grade edge hardware. We evaluate the models intrinsically (multi-domain perplexity across news, literary prose, poetry, and heterogeneous web text) and extrinsically (LaRoSeDa sentiment classification and RO-STS sentence similarity). Relative to Romanian GPT-2–style baselines at similar parameter scales, RoBaseLM-S and RoBaseLM-M reduce perplexity substantially, e.g., from 30.7 to 15.9 on our held-out news split. The 5-bit post-training quantized checkpoints remain within FP16 performance across all reported tasks. To our knowledge, these are the first Romanian small language models explicitly optimized for long-context inference, post-training quantization, and low-power on-device deployment. Full article
(This article belongs to the Topic Challenges and Solutions in Large Language Models)
Show Figures

Figure 1

39 pages, 3706 KB  
Article
Performance Assessment of DL for Network Intrusion Detection on a Constrained IoT Device
by Armin Mazinani, Daniele Antonucci, Luca Davoli and Gianluigi Ferrari
Future Internet 2026, 18(1), 34; https://doi.org/10.3390/fi18010034 - 7 Jan 2026
Viewed by 295
Abstract
This work investigates the deployment of Deep Learning (DL) models for network intrusion detection on resource-constrained IoT devices, using the public CICIoT2023 dataset. In particular, we consider the following DL models: Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Recurrent Neural Network (RNN), [...] Read more.
This work investigates the deployment of Deep Learning (DL) models for network intrusion detection on resource-constrained IoT devices, using the public CICIoT2023 dataset. In particular, we consider the following DL models: Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Recurrent Neural Network (RNN), Convolutional Neural Network (CNN), Temporal Convolutional Network (TCN), Multi-Layer Perceptron (MLP). Bayesian optimization is employed to fine-tune the models’ hyperparameters and ensure reliable performance evaluation across both binary (2-class) and multi-class (8-class, 34-class) intrusion detection. Then, the computational complexity of each DL model is analyzed—in terms of the number of Multiply–ACCumulate operations (MACCs), RAM usage, and inference time—through the STMicroelectronics Cube.AI Analyzer tool, with models being deployed on an STM32H7S78-DK board. To assess the practical deployability of the considered DL models, a trade-off score (balancing classification accuracy and computational efficiency) is introduced: according to this score, our experimental results indicate that MLP and TCN outperform the other models. Furthermore, Post-Training Quantization (PTQ) to 8-bit integer precision is applied, allowing the model size to be reduced by more than 90% with negligible performance degradation. This demonstrates the effectiveness of quantization in optimizing DL models for real-world deployment on resource-constrained IoT devices. Full article
Show Figures

Graphical abstract

29 pages, 3695 KB  
Article
Multi-Objective Parameter Stochastic Optimization Method for Time-Delayed Integration Optical Remote Sensing System Used for Kelvin Wake Imaging
by Mingzhu Song, Lizhou Li, Xuechan Zhao and Junsheng Wang
Appl. Sci. 2025, 15(21), 11307; https://doi.org/10.3390/app152111307 - 22 Oct 2025
Cited by 1 | Viewed by 437
Abstract
When using optical remote sensing methods for Kelvin wake imaging, the imaging is affected by sea-surface stochastic fluctuation, imaging noise, and weak reflectivity contrast, resulting in weak wake image signals. In order to better obtain wake optical remote sensing images, this article proposes [...] Read more.
When using optical remote sensing methods for Kelvin wake imaging, the imaging is affected by sea-surface stochastic fluctuation, imaging noise, and weak reflectivity contrast, resulting in weak wake image signals. In order to better obtain wake optical remote sensing images, this article proposes a multi-objective parameter stochastic optimization method for a time-delayed integration optical remote sensing imaging system. By constructing the wake imaging mechanism framework integrating a hydrodynamic model, rough sea surface probability and statistics model, and Time-Delay Integration Charge-Coupled Device (TDI-CCD) imaging link model, a stochastic multi-objective optimization model with constraints is established. The multi-objective function of this model is specifically defined as follows: maximizing the digital number difference between the crest and trough of Kelvin wakes in imaging results, maximizing the F number, minimizing the integration stages, and minimizing the quantization bits. Meanwhile, a two-stage solution method based on sample average approximation (SAA), branch and bound method (B&B), and the complex method is designed. The model can be used to obtain optimized design results for remote sensing imaging parameters, providing theoretical and methodological support for the design of remote sensing imaging systems. Numerical simulation results show that the optimized parameter combination can achieve clear imaging of the Kelvin wake, and the core indicators meet the design requirements. Full article
Show Figures

Figure 1

15 pages, 577 KB  
Article
Optimal Feedback Rate Analysis in Downlink Multi-User Multi-Antenna Systems with One-Bit ADC Receivers over Randomly Modeled Dense Cellular Networks
by Moonsik Min, Sungmin Lee and Tae-Kyoung Kim
Mathematics 2025, 13(20), 3312; https://doi.org/10.3390/math13203312 - 17 Oct 2025
Viewed by 623
Abstract
Stochastic geometry provides a powerful analytical framework for evaluating interference-limited cellular networks with randomly deployed base stations (BSs). While prior studies have examined limited channel state information at the transmitter (CSIT) and low-resolution analog-to-digital converters (ADCs) separately, their joint impact in multi-user multiple-input [...] Read more.
Stochastic geometry provides a powerful analytical framework for evaluating interference-limited cellular networks with randomly deployed base stations (BSs). While prior studies have examined limited channel state information at the transmitter (CSIT) and low-resolution analog-to-digital converters (ADCs) separately, their joint impact in multi-user multiple-input multiple-output (MIMO) systems remains largely unexplored. This paper investigates a downlink cellular network in which BSs are distributed according to a homogeneous Poisson point process (PPP), employing zero-forcing beamforming (ZFBF) with limited feedback, and receivers are equipped with one-bit ADCs. We derive a tractable approximation for the achievable spectral efficiency that explicitly accounts for both the quantization error from limited feedback and the receiver distortion caused by coarse ADCs. Based on this approximation, we determine the optimal feedback rate that maximizes the net spectral efficiency. Our analysis reveals that the optimal number of feedback bits scales logarithmically with the channel coherence time but its absolute value decreases due to coarse quantization. Simulation results validate the accuracy of the proposed approximation and confirm the predicted scaling behavior, demonstrating its effectiveness for interference-limited multi-user MIMO networks. Full article
Show Figures

Figure 1

17 pages, 4081 KB  
Article
Neural Network-Based Atlas Enhancement in MPEG Immersive Video
by Taesik Lee, Kugjin Yun, Won-Sik Cheong and Dongsan Jun
Mathematics 2025, 13(19), 3110; https://doi.org/10.3390/math13193110 - 29 Sep 2025
Viewed by 910
Abstract
Recently, the demand for immersive videos has surged with the expansion of virtual reality, augmented reality, and metaverse technologies. As an international standard, moving picture experts group (MPEG) has developed MPEG immersive video (MIV) to efficiently transmit large-volume immersive videos. The MIV encoder [...] Read more.
Recently, the demand for immersive videos has surged with the expansion of virtual reality, augmented reality, and metaverse technologies. As an international standard, moving picture experts group (MPEG) has developed MPEG immersive video (MIV) to efficiently transmit large-volume immersive videos. The MIV encoder generates atlas videos to convert extensive multi-view videos into low-bitrate formats. When these atlas videos are compressed using conventional video codecs, compression artifacts often appear in the reconstructed atlas videos. To address this issue, this study proposes a feature-extraction-based convolutional neural network (FECNN) to reduce the compression artifacts during MIV atlas video transmission. The proposed FECNN uses quantization parameter (QP) maps and depth information as inputs and consists of shallow feature extraction (SFE) blocks and deep feature extraction (DFE) blocks to utilize layered feature characteristics. Compared to the existing MIV, the proposed method improves the Bjontegaard delta bit-rate (BDBR) by −4.12% and −6.96% in the basic and additional views, respectively. Full article
(This article belongs to the Special Issue Coding Theory and the Impact of AI)
Show Figures

Figure 1

23 pages, 4446 KB  
Article
A Modular Framework for RGB Image Processing and Real-Time Neural Inference: A Case Study in Microalgae Culture Monitoring
by José Javier Gutiérrez-Ramírez, Ricardo Enrique Macias-Jamaica, Víctor Manuel Zamudio-Rodríguez, Héctor Arellano Sotelo, Dulce Aurora Velázquez-Vázquez, Juan de Anda-Suárez and David Asael Gutiérrez-Hernández
Eng 2025, 6(9), 221; https://doi.org/10.3390/eng6090221 - 2 Sep 2025
Cited by 1 | Viewed by 1050
Abstract
Recent progress in computer vision and embedded systems has facilitated real-time monitoring of bioprocesses; however, lightweight and scalable solutions for resource-constrained settings remain limited. This work presents a modular framework for monitoring Chlorella vulgaris growth by integrating RGB image processing with multimodal sensor [...] Read more.
Recent progress in computer vision and embedded systems has facilitated real-time monitoring of bioprocesses; however, lightweight and scalable solutions for resource-constrained settings remain limited. This work presents a modular framework for monitoring Chlorella vulgaris growth by integrating RGB image processing with multimodal sensor fusion. The system incorporates a Logitech C920 camera and low-cost pH and temperature sensors within a compact photobioreactor. It extracts RGB channel statistics, luminance, and environmental data to generate a 10-dimensional feature vector. A feedforward artificial neural network (ANN) with ReLU activations, dropout layers, and SMOTE-based data balancing was trained to classify growth phases: lag, exponential, and stationary. The optimized model, quantized to 8 bits, was deployed on an ESP32 microcontroller, achieving 98.62% accuracy with 4.8 ms inference time and a 13.48 kB memory footprint. Robustness analysis confirmed tolerance to geometric transformations, though variable lighting reduced performance. Principal component analysis (PCA) retained 95% variance, supporting the discriminative power of the features. The proposed system outperformed previous vision-only methods, demonstrating the advantages of multimodal fusion for early detection. Limitations include sensitivity to lighting and validation limited to a single species. Future directions include incorporating active lighting control and extending the model to multi-species classification for broader applicability. Full article
(This article belongs to the Special Issue Artificial Intelligence for Engineering Applications, 2nd Edition)
Show Figures

Figure 1

20 pages, 1633 KB  
Article
A Digital Simulation Model of Broadband Phased Array RF System and Its Application
by Jia Ding, Huaizong Shao, Jianxing Lv and Fake Ding
Sensors 2025, 25(13), 4133; https://doi.org/10.3390/s25134133 - 2 Jul 2025
Viewed by 849
Abstract
The design and application of broadband phased array RF links is a complex and highly precise endeavor. To achieve optimal performance, it is essential to compare and validate multiple schemes during the system design phase. Utilizing simulation models to simulate system structures and [...] Read more.
The design and application of broadband phased array RF links is a complex and highly precise endeavor. To achieve optimal performance, it is essential to compare and validate multiple schemes during the system design phase. Utilizing simulation models to simulate system structures and validate parameters can effectively reduce research and development time and costs. This article takes the broadband phased array RF system (RFS04) currently being developed by Nanhu Laboratory as a reference and constructs a behavioral-level signal simulation model. Through this model, the antenna pattern of RFS04 was generated, and the relationship between beam pointing accuracy and delay quantization bit number was analyzed. The 3 dB beam coverage range of the 18 GHz antenna array was calculated, and the synthesis scheme of multi-phased arrays was explored. Additionally, the correspondence between the angle measurement accuracy and signal-to-noise ratio of the RFS04 system was analyzed. This article also measured the delay module parameters of the RF system and developed a correction strategy for the delay control scheme. Through simulation calculations and laboratory testing, it has been proven that this strategy can effectively improve delay accuracy. After applying the modified delay control scheme to the RFS04 simulation model, the beam pointing accuracy during phased array antenna scanning was significantly enhanced. The model research and integrated simulation software construction of the broadband phased array RF system provide an efficient and accurate simulation tool for system design and optimization. Full article
(This article belongs to the Section Communications)
Show Figures

Figure 1

23 pages, 11002 KB  
Article
Exploring the Inherent Variability of Economically Fabricated ZnO Devices Towards Physical Unclonable Functions for Secure Authentication
by Savvas Ermeidis, Dimitrios Tassis, George P. Papageorgiou, Stavros G. Stavrinides and Eleni Makarona
Micromachines 2025, 16(6), 627; https://doi.org/10.3390/mi16060627 - 26 May 2025
Viewed by 785
Abstract
Meeting the rising need for secure authentication in IoT and Industry 4.0, this work presents chemically synthesized ZnO nanostructured homojunctions as powerful and scalable physical unclonable functions (PUFs). By leveraging intrinsic variability from Li doping and the stochastic hydrothermal growth process, we systematically [...] Read more.
Meeting the rising need for secure authentication in IoT and Industry 4.0, this work presents chemically synthesized ZnO nanostructured homojunctions as powerful and scalable physical unclonable functions (PUFs). By leveraging intrinsic variability from Li doping and the stochastic hydrothermal growth process, we systematically identified electrical parameters offering outstanding variability, stability, and reproducibility. ZnO devices outperform commercial diodes by delivering richer parameter diversity, lower costs, and superior environmental sustainability. Pushing beyond traditional approaches, we introduce multi-level quantization for boosted accuracy and entropy, demonstrate the normal distribution of challenge candidate parameters to support a novel method under development, and extract multiple parameters (8–10) per device instead of relying on a single-bit output. Parameter optimization and selection are performed upfront through a rigorous assessment of variability and inter-correlation, maximizing uniqueness and reliability. Thanks to their strong scalability and eco-friendliness, ZnO-based homojunctions emerge as a dynamic, future-proof platform for building low-cost, high-security, and sustainable digital identity systems. Full article
(This article belongs to the Section D:Materials and Processing)
Show Figures

Figure 1

16 pages, 3139 KB  
Article
Adaptive Threshold Wavelet Denoising Method and Hardware Implementation for HD Real-Time Processing
by Xuhui Wang and Jizhong Zhao
Electronics 2025, 14(11), 2130; https://doi.org/10.3390/electronics14112130 - 23 May 2025
Cited by 4 | Viewed by 2274
Abstract
To meet the demands of real-time and high-definition (HD) image processing applications, denoising methods must be both computationally efficient and hardware friendly. Traditional image denoising techniques are typically simple, fast, and resource-efficient but often fall short in terms of denoising performance and adaptability. [...] Read more.
To meet the demands of real-time and high-definition (HD) image processing applications, denoising methods must be both computationally efficient and hardware friendly. Traditional image denoising techniques are typically simple, fast, and resource-efficient but often fall short in terms of denoising performance and adaptability. This paper proposes an adjustable-threshold denoising method along with a corresponding hardware implementation designed to support the real-time processing of large-array images commonly used in image signal processors (ISPs). The proposed technique employs a LeGall 5/3 wavelet with a row-transform structure and multilevel decomposition. A 2D Pyramid VisuShrink thresholding algorithm is introduced, where the threshold is derived from the median value of the HH sub-band using a multi-stage segmentation approach. To further optimize performance, a quantization strategy with fixed-point parameter design is applied to minimize storage requirements and computational errors. A specialized hardware architecture is developed to enable the real-time denoising of 4K images while adhering to constraints on speed and resource utilization. The architecture incorporates a finite state machine (FSM) and a reusable median calculation unit to efficiently share threshold-related storage and computational resources. The system is implemented and verified on an FPGA, achieving real-time performance at a maximum frequency of 230 MHz. It supports flexible input data formats with resolutions up to 4096×4096 pixels and 16-bit depth. Comprehensive comparisons with other real-time denoising methods demonstrate that the proposed approach consistently achieves better PSNR and SSIM across various noise levels and image sizes. In addition to delivering improved denoising accuracy, the hardware implementation offers advantages in processing speed and resource efficiency while supporting a wide range of large-array images. Full article
Show Figures

Figure 1

30 pages, 34873 KB  
Article
Text-Guided Synthesis in Medical Multimedia Retrieval: A Framework for Enhanced Colonoscopy Image Classification and Segmentation
by Ojonugwa Oluwafemi Ejiga Peter, Opeyemi Taiwo Adeniran, Adetokunbo MacGregor John-Otumu, Fahmi Khalifa and Md Mahmudur Rahman
Algorithms 2025, 18(3), 155; https://doi.org/10.3390/a18030155 - 9 Mar 2025
Cited by 6 | Viewed by 2651
Abstract
The lack of extensive, varied, and thoroughly annotated datasets impedes the advancement of artificial intelligence (AI) for medical applications, especially colorectal cancer detection. Models trained with limited diversity often display biases, especially when utilized on disadvantaged groups. Generative models (e.g., DALL-E 2, Vector-Quantized [...] Read more.
The lack of extensive, varied, and thoroughly annotated datasets impedes the advancement of artificial intelligence (AI) for medical applications, especially colorectal cancer detection. Models trained with limited diversity often display biases, especially when utilized on disadvantaged groups. Generative models (e.g., DALL-E 2, Vector-Quantized Generative Adversarial Network (VQ-GAN)) have been used to generate images but not colonoscopy data for intelligent data augmentation. This study developed an effective method for producing synthetic colonoscopy image data, which can be used to train advanced medical diagnostic models for robust colorectal cancer detection and treatment. Text-to-image synthesis was performed using fine-tuned Visual Large Language Models (LLMs). Stable Diffusion and DreamBooth Low-Rank Adaptation produce images that look authentic, with an average Inception score of 2.36 across three datasets. The validation accuracy of various classification models Big Transfer (BiT), Fixed Resolution Residual Next Generation Network (FixResNeXt), and Efficient Neural Network (EfficientNet) were 92%, 91%, and 86%, respectively. Vision Transformer (ViT) and Data-Efficient Image Transformers (DeiT) had an accuracy rate of 93%. Secondly, for the segmentation of polyps, the ground truth masks are generated using Segment Anything Model (SAM). Then, five segmentation models (U-Net, Pyramid Scene Parsing Network (PSNet), Feature Pyramid Network (FPN), Link Network (LinkNet), and Multi-scale Attention Network (MANet)) were adopted. FPN produced excellent results, with an Intersection Over Union (IoU) of 0.64, an F1 score of 0.78, a recall of 0.75, and a Dice coefficient of 0.77. This demonstrates strong performance in terms of both segmentation accuracy and overlap metrics, with particularly robust results in balanced detection capability as shown by the high F1 score and Dice coefficient. This highlights how AI-generated medical images can improve colonoscopy analysis, which is critical for early colorectal cancer detection. Full article
Show Figures

Figure 1

15 pages, 1132 KB  
Article
Optimizing Multi-View CNN for CAD Mechanical Model Classification: An Evaluation of Pruning and Quantization Techniques
by Victor Pinto, Verusca Severo and Francisco Madeiro
Electronics 2025, 14(5), 1013; https://doi.org/10.3390/electronics14051013 - 3 Mar 2025
Cited by 3 | Viewed by 1568
Abstract
In the realm of product design and development, efficient retrieval and reuse of 3D CAD models are vital for optimizing workflows and minimizing redundant efforts. Manual labeling of CAD models, while traditional, is labor-intensive and prone to inconsistency, highlighting the need for automated [...] Read more.
In the realm of product design and development, efficient retrieval and reuse of 3D CAD models are vital for optimizing workflows and minimizing redundant efforts. Manual labeling of CAD models, while traditional, is labor-intensive and prone to inconsistency, highlighting the need for automated classification systems. Multi-view convolutional neural networks (MVCNNs) offer an automated solution by leveraging 2D projections to represent 3D objects, balancing high classification accuracy with computational efficiency. Despite their effectiveness, the computational demands of MVCNNs pose challenges in large-scale CAD applications. This study investigates the use of optimization strategies, precisely pruning and quantization, in the scenario of MVCNN applied to the classification of 3D CAD mechanical models. By using different pruning and quantization strategies, we evaluate trade-offs between classification accuracy, execution time, and memory usage. In our evaluation of pruning and quantization techniques, 8-bit quantization reduced the memory used by the model from 83.78 MB to 21.01 MB, with accuracy only slightly decreasing from 93.83% to 93.59%. When applying 25% structured pruning, the model’s memory usage was reduced to 47.16 MB, execution time decreased from 133 to 97 s, and accuracy decreased to 92.14%. A combined approach of 25% pruning and 8-bit quantization achieved even better resource efficiency, with memory usage at 11.86 MB, execution time at 99 s, and accuracy at 92.06%. This combination of pruning and quantization leads to efficient MVCNN model optimization, balancing resource usage and classification performance, which is especially relevant in large-scale applications. Full article
Show Figures

Figure 1

20 pages, 581 KB  
Article
Low-Resolution Quantized Precoding for Multiple-Input Multiple-Output Dual-Functional Radar–Communication Systems Used for Target Sensing
by Xiang Feng, Zhongqing Zhao, Jiongshi Wang, Jian Wang, Zhanfeng Zhao and Zhiquan Zhou
Remote Sens. 2025, 17(2), 198; https://doi.org/10.3390/rs17020198 - 8 Jan 2025
Cited by 1 | Viewed by 1217
Abstract
Dual-functional radar–communication systems are extensively employed for the detection and control of unmanned aerial vehicle groups and play crucial roles in scenario monitoring. In this study, we address the downlink precoding problem in large-scale multi-user multiple-input multiple-output dual-function radar–communication systems equipped with low-resolution [...] Read more.
Dual-functional radar–communication systems are extensively employed for the detection and control of unmanned aerial vehicle groups and play crucial roles in scenario monitoring. In this study, we address the downlink precoding problem in large-scale multi-user multiple-input multiple-output dual-function radar–communication systems equipped with low-resolution quantized digital-to-analog converters. To tackle this issue, we develop a weighted optimization framework that minimizes the mean squared error between the transmitted symbols and their estimates while satisfying specific radar performance requirements. Due to the complexity introduced by discrete constraints, we decompose the original problem into three sub-problems to reduce computational burden. Furthermore, we propose a dynamic projection refinement algorithm within the alternating direction method of multiplier framework to efficiently solve these sub-problems. Numerical experiments demonstrate that our proposed method outperforms existing state-of-the-art techniques, particularly in terms of bit error rate in low signal-to-noise ratio scenarios. Full article
Show Figures

Figure 1

29 pages, 5099 KB  
Article
Configurable Multi-Layer Perceptron-Based Soft Sensors on Embedded Field Programmable Gate Arrays: Targeting Diverse Deployment Goals in Fluid Flow Estimation
by Tianheng Ling, Chao Qian, Theodor Mario Klann, Julian Hoever, Lukas Einhaus and Gregor Schiele
Sensors 2025, 25(1), 83; https://doi.org/10.3390/s25010083 - 26 Dec 2024
Cited by 4 | Viewed by 1571
Abstract
This study presents a comprehensive workflow for developing and deploying Multi-Layer Perceptron (MLP)-based soft sensors on embedded FPGAs, addressing diverse deployment objectives. The proposed workflow extends our prior research by introducing greater model adaptability. It supports various configurations—spanning layer counts, neuron counts, and [...] Read more.
This study presents a comprehensive workflow for developing and deploying Multi-Layer Perceptron (MLP)-based soft sensors on embedded FPGAs, addressing diverse deployment objectives. The proposed workflow extends our prior research by introducing greater model adaptability. It supports various configurations—spanning layer counts, neuron counts, and quantization bitwidths—to accommodate the constraints and capabilities of different FPGA platforms. The workflow incorporates a custom-developed, open-source toolchain ElasticAI.Creator that facilitates quantization-aware training, integer-only inference, automated accelerator generation using VHDL templates, and synthesis alongside performance estimation. A case study on fluid flow estimation was conducted on two FPGA platforms: the AMD Spartan-7 XC7S15 and the Lattice iCE40UP5K. For precision-focused and latency-sensitive deployments, a six-layer, 60-neuron MLP accelerator quantized to 8 bits on the XC7S15 achieved an MSE of 56.56, an MAPE of 1.61%, and an inference latency of 23.87 μs. Moreover, for low-power and energy-constrained deployments, a five-layer, 30-neuron MLP accelerator quantized to 8 bits on the iCE40UP5K achieved an inference latency of 83.37 μs, a power consumption of 2.06 mW, and an energy consumption of just 0.172 μJ per inference. These results confirm the workflow’s ability to identify optimal FPGA accelerators tailored to specific deployment requirements, achieving a balanced trade-off between precision, inference latency, and energy efficiency. Full article
(This article belongs to the Section Intelligent Sensors)
Show Figures

Figure 1

19 pages, 4840 KB  
Article
High-Quality Image Compression Algorithm Design Based on Unsupervised Learning
by Shuo Han, Bo Mo, Jie Zhao, Junwei Xu, Shizun Sun and Bo Jin
Sensors 2024, 24(20), 6503; https://doi.org/10.3390/s24206503 - 10 Oct 2024
Cited by 3 | Viewed by 5325
Abstract
Increasingly massive image data is restricted by conditions such as information transmission and reconstruction, and it is increasingly difficult to meet the requirements of speed and integrity in the information age. To solve the urgent problems faced by massive image data in information [...] Read more.
Increasingly massive image data is restricted by conditions such as information transmission and reconstruction, and it is increasingly difficult to meet the requirements of speed and integrity in the information age. To solve the urgent problems faced by massive image data in information transmission, this paper proposes a high-quality image compression algorithm based on unsupervised learning. Among them, a content-weighted autoencoder network is proposed to achieve image compression coding on the basis of a smaller bit rate to solve the entropy rate optimization problem. Binary quantizers are used for coding quantization, and importance maps are used to achieve better bit allocation. The compression rate is further controlled and optimized. A multi-scale discriminator suitable for the generative adversarial network image compression framework is designed to solve the problem that the generated compressed image is prone to blurring and distortion. Finally, through training with different weights, the distortion of each scale is minimized, so that the image compression can achieve a higher quality compression and reconstruction effect. The experimental results show that the algorithm model can save the details of the image and greatly compress the memory of the image. Its advantage is that it can expand and compress a large number of images quickly and efficiently and realize the efficient processing of image compression. Full article
(This article belongs to the Topic Applications in Image Analysis and Pattern Recognition)
Show Figures

Figure 1

17 pages, 3827 KB  
Article
Direction of Arrival Estimation Based on DNN and CNN
by Wu Cao, Wen Ren, Zhenyu Zhang, Weiqiang Huang, Jun Zou and Guangzu Liu
Electronics 2024, 13(19), 3866; https://doi.org/10.3390/electronics13193866 - 29 Sep 2024
Cited by 4 | Viewed by 2514
Abstract
The accuracy of Direction of Arrival (DOA) estimation primarily depends on the precision of the data. When the receiver uses a low-precision analog-to-digital converter (ADC), traditional DOA estimation algorithms exhibit poor accuracy. To face the challenge of multi-target DOA estimation in scenarios with [...] Read more.
The accuracy of Direction of Arrival (DOA) estimation primarily depends on the precision of the data. When the receiver uses a low-precision analog-to-digital converter (ADC), traditional DOA estimation algorithms exhibit poor accuracy. To face the challenge of multi-target DOA estimation in scenarios with low-precision ADC quantized sampling, this paper proposes a novel DOA estimation algorithm for quantized signals based on classification problems. A deep learning network was constructed using Deep Neural Networks (DNNs) and Convolutional Neural Networks (CNNs), divided into the quantized signal recovery framework and the DOA estimation framework. The DNN network is utilized to recover signals that have undergone low-precision quantization, while the CNN network addresses the classification problem to estimate the DOA from received data with an unknown number of signal sources. A comprehensive analysis of the impact of signal-to-noise ratio (SNR), the number of array elements, and the number of quantization bits on the proposed algorithm was conducted. Simulation results indicate that the proposed algorithm exhibits superior DOA estimation performance in low-precision scenarios, characterized by reduced computational complexity, thereby facilitating real-time DOA estimation. Full article
Show Figures

Figure 1

Back to TopTop