Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (280)

Search Parameters:
Keywords = Discrete Cosine Transform

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 6785 KiB  
Article
Spatiality–Frequency Domain Video Forgery Detection System Based on ResNet-LSTM-CBAM and DCT Hybrid Network
by Zihao Liao, Sheng Hong and Yu Chen
Appl. Sci. 2025, 15(16), 9006; https://doi.org/10.3390/app15169006 - 15 Aug 2025
Abstract
As information technology advances, digital content has become widely adopted across diverse fields such as news broadcasting, entertainment, commerce, and forensic investigation. However, the availability of sophisticated multimedia editing tools has significantly increased the risk of video and image forgery, raising serious concerns [...] Read more.
As information technology advances, digital content has become widely adopted across diverse fields such as news broadcasting, entertainment, commerce, and forensic investigation. However, the availability of sophisticated multimedia editing tools has significantly increased the risk of video and image forgery, raising serious concerns about content authenticity at both societal and individual levels. To address the growing need for robust and accurate detection methods, this study proposes a novel video forgery detection model that integrates both spatial and frequency-domain features. The model is built on a ResNet-LSTM framework enhanced by a Convolutional Block Attention Module (CBAM) for spatial feature extraction, and further incorporates Discrete Cosine Transform (DCT) to capture frequency domain information. Comprehensive experiments were conducted on several mainstream benchmark datasets, encompassing a wide range of forgery scenarios. The results demonstrate that the proposed model achieves superior performance in distinguishing between authentic and manipulated videos. Additional ablation and comparative studies confirm the contribution of each component in the architecture, offering deeper insight into the model’s capacity. Overall, the findings support the proposed approach as a promising solution for enhancing the reliability of video authenticity analysis under complex conditions. Full article
Show Figures

Figure 1

28 pages, 11876 KiB  
Article
Improved WTCN-Informer Model Based on Frequency-Enhanced Channel Attention Mechanism and Wavelet Convolution: Prediction of Remaining Service Life of Ion Etcher Cooling System
by Tingyu Ma, Jiaqi Liu, Panfeng Xu, Yan Song and Xiaoping Bai
Sensors 2025, 25(16), 4883; https://doi.org/10.3390/s25164883 - 8 Aug 2025
Viewed by 243
Abstract
Etching has become a critical step in semiconductor wafer fabrication, and its importance in semiconductor manufacturing highlights the fact that it directly determines the ability of the fab to produce high-process products, as well as the application performance of the chip. While the [...] Read more.
Etching has become a critical step in semiconductor wafer fabrication, and its importance in semiconductor manufacturing highlights the fact that it directly determines the ability of the fab to produce high-process products, as well as the application performance of the chip. While the health of the etcher is a concern, especially for the cooling system, accurately predicting the remaining useful life (RUL) of the etcher cooling system is a critical task. Predictive maintenance (PDM) can be used to monitor the basic condition of the equipment by learning from historical data, and it can help solve the task of RUL prediction. In this paper, we propose the FECAM-WTCN-Informer model, which first obtains a new WTCN structure by inserting wavelet convolution into the TCN, and then combines the discrete cosine transform (DCT) and channel attention mechanism into the temporal neural network (TCN). Multidimensional feature extraction of time series data can be realized, and the processed features are input into the Informer network for prediction. Experimental results show that the method is significantly more accurate in terms of overall prediction performance (MSE, RMSE, and MAE), compared with other state-of-the-art methods, and is suitable for solving the problem of predictive maintenance of etching machine cooling systems. Full article
(This article belongs to the Section Industrial Sensors)
Show Figures

Graphical abstract

23 pages, 3561 KiB  
Article
Chaos-Based Color Image Encryption with JPEG Compression: Balancing Security and Compression Efficiency
by Wei Zhang, Xue Zheng, Meng Xing, Jingjing Yang, Hai Yu and Zhiliang Zhu
Entropy 2025, 27(8), 838; https://doi.org/10.3390/e27080838 - 6 Aug 2025
Viewed by 259
Abstract
In recent years, most proposed digital image encryption algorithms have primarily focused on encrypting raw pixel data, often neglecting the integration with image compression techniques. Image compression algorithms, such as JPEG, are widely utilized in internet applications, highlighting the need for encryption methods [...] Read more.
In recent years, most proposed digital image encryption algorithms have primarily focused on encrypting raw pixel data, often neglecting the integration with image compression techniques. Image compression algorithms, such as JPEG, are widely utilized in internet applications, highlighting the need for encryption methods that are compatible with compression processes. This study introduces an innovative color image encryption algorithm integrated with JPEG compression, designed to enhance the security of images susceptible to attacks or tampering during prolonged transmission. The research addresses critical challenges in achieving an optimal balance between encryption security and compression efficiency. The proposed encryption algorithm is structured around three key compression phases: Discrete Cosine Transform (DCT), quantization, and entropy coding. At each stage, the algorithm incorporates advanced techniques such as block segmentation, block replacement, DC coefficient confusion, non-zero AC coefficient transformation, and RSV (Run/Size and Value) pair recombination. Extensive simulations and security analyses demonstrate that the proposed algorithm exhibits strong robustness against noise interference and data loss, effectively meeting stringent security performance requirements. Full article
(This article belongs to the Section Multidisciplinary Applications)
Show Figures

Figure 1

19 pages, 3436 KiB  
Article
An Improved Wind Power Forecasting Model Considering Peak Fluctuations
by Shengjie Yang, Jie Tang, Lun Ye, Jiangang Liu and Wenjun Zhao
Electronics 2025, 14(15), 3050; https://doi.org/10.3390/electronics14153050 - 30 Jul 2025
Viewed by 264
Abstract
Wind power output sequences exhibit strong randomness and intermittency characteristics; traditional single forecasting models struggle to capture the internal features of sequences and are highly susceptible to interference from high-frequency noise and predictive accuracy is still notably poor at the peaks where the [...] Read more.
Wind power output sequences exhibit strong randomness and intermittency characteristics; traditional single forecasting models struggle to capture the internal features of sequences and are highly susceptible to interference from high-frequency noise and predictive accuracy is still notably poor at the peaks where the power curve undergoes abrupt changes. To address the poor fitting at peaks, a short-term wind power forecasting method based on the improved Informer model is proposed. First, the temporal convolutional network (TCN) is introduced to enhance the model’s ability to capture regional segment features along the temporal dimension, enhancing the model’s receptive field to address wind power fluctuation under varying environmental conditions. Next, a discrete cosine transform (DCT) is employed for adaptive modeling of frequency dependencies between channels, converting the time series data into frequency domain representations to extract its frequency features. These frequency domain features are then weighted using a channel attention mechanism to improve the model’s ability to capture peak features and resist noise interference. Finally, the Informer generative decoder is used to output the power prediction results, this enables the model to simultaneously leverage neighboring temporal segment features and long-range inter-temporal dependencies for future wind-power prediction, thereby substantially improving the fitting accuracy at power-curve peaks. Experimental results validate the effectiveness and practicality of the proposed model; compared with other models, the proposed approach reduces MAE by 9.14–42.31% and RMSE by 12.57–47.59%. Full article
(This article belongs to the Special Issue Digital Intelligence Technology and Applications)
Show Figures

Figure 1

19 pages, 3841 KiB  
Article
An Improved Chosen Plaintext Attack on JPEG Encryption
by Junhui He, Kaitian Gu, Yihan Huang, Yue Li and Xiang Chen
J. Sens. Actuator Netw. 2025, 14(4), 72; https://doi.org/10.3390/jsan14040072 - 14 Jul 2025
Viewed by 479
Abstract
Format-compatible encryption can be used to ensure the security and privacy of JPEG images. Recently, a JPEG image encryption method proved to be secure against known plaintext attacks by employing an adaptive encryption key, which depends on the histogram of the number of [...] Read more.
Format-compatible encryption can be used to ensure the security and privacy of JPEG images. Recently, a JPEG image encryption method proved to be secure against known plaintext attacks by employing an adaptive encryption key, which depends on the histogram of the number of non-zero alternating current coefficients (ACC) in Discrete Cosine Transform (DCT) blocks. However, this scheme has been demonstrated to be vulnerable to chosen-plaintext attacks (CPA) based on the run consistency of MCUs (RCM) between the original image and the encrypted image. In this paper, an improved CPA scheme is proposed. The method of incrementing run-length values instead of permutation is utilized to satisfy the uniqueness of run sequences of different minimum coded units (MCUs). The experimental results show that the proposed method can successfully recover the outlines of plaintext images from the encrypted images, even with lower-quality factors. Full article
Show Figures

Figure 1

22 pages, 7562 KiB  
Article
FIGD-Net: A Symmetric Dual-Branch Dehazing Network Guided by Frequency Domain Information
by Luxia Yang, Yingzhao Xue, Yijin Ning, Hongrui Zhang and Yongjie Ma
Symmetry 2025, 17(7), 1122; https://doi.org/10.3390/sym17071122 - 13 Jul 2025
Viewed by 392
Abstract
Image dehazing technology is a crucial component in the fields of intelligent transportation and autonomous driving. However, most existing dehazing algorithms only process images in the spatial domain, failing to fully exploit the rich information in the frequency domain, which leads to residual [...] Read more.
Image dehazing technology is a crucial component in the fields of intelligent transportation and autonomous driving. However, most existing dehazing algorithms only process images in the spatial domain, failing to fully exploit the rich information in the frequency domain, which leads to residual haze in the images. To address this issue, we propose a novel Frequency-domain Information Guided Symmetric Dual-branch Dehazing Network (FIGD-Net), which utilizes the spatial branch to extract local haze features and the frequency branch to capture the global haze distribution, thereby guiding the feature learning process in the spatial branch. The FIGD-Net mainly consists of three key modules: the Frequency Detail Extraction Module (FDEM), the Dual-Domain Multi-scale Feature Extraction Module (DMFEM), and the Dual-Domain Guidance Module (DGM). First, the FDEM employs the Discrete Cosine Transform (DCT) to convert the spatial domain into the frequency domain. It then selectively extracts high-frequency and low-frequency features based on predefined proportions. The high-frequency features, which contain haze-related information, are correlated with the overall characteristics of the low-frequency features to enhance the representation of haze attributes. Next, the DMFEM utilizes stacked residual blocks and gradient feature flows to capture local detail features. Specifically, frequency-guided weights are applied to adjust the focus of feature channels, thereby improving the module’s ability to capture multi-scale features and distinguish haze features. Finally, the DGM adjusts channel weights guided by frequency information. This smooths out redundant signals and enables cross-branch information exchange, which helps to restore the original image colors. Extensive experiments demonstrate that the proposed FIGD-Net achieves superior dehazing performance on multiple synthetic and real-world datasets. Full article
(This article belongs to the Section Computer)
Show Figures

Figure 1

31 pages, 6761 KiB  
Article
Improved Modulation Classification Based on Hough Transforms of Constellation Diagrams Using CNN for the UWA-OFDM Communication System
by Mohamed A. Abdel-Moneim, Mohamed K. M. Gerwash, El-Sayed M. El-Rabaie, Fathi E. Abd El-Samie, Khalil F. Ramadan and Nariman Abdel-Salam
Eng 2025, 6(6), 127; https://doi.org/10.3390/eng6060127 - 14 Jun 2025
Viewed by 465
Abstract
The Automatic Modulation Classification (AMC) for underwater acoustic signals enables more efficient utilization of the acoustic spectrum. Deep learning techniques significantly improve classification performance. Hence, they can be applied in AMC work to improve the underwater acoustic (UWA) communication. This paper is based [...] Read more.
The Automatic Modulation Classification (AMC) for underwater acoustic signals enables more efficient utilization of the acoustic spectrum. Deep learning techniques significantly improve classification performance. Hence, they can be applied in AMC work to improve the underwater acoustic (UWA) communication. This paper is based on the adoption of Hough Transform (HT) and Edge Detection (ED) to enhance modulation classification, especially for a small dataset. Deep neural models based on basic Convolutional Neural Network (CNN), Visual Geometry Group-16 (VGG-16), and VGG-19 trained on constellation diagrams transformed using HT are adopted. The objective is to extract features from constellation diagrams projected onto the Hough space. In addition, we use Orthogonal Frequency Division Multiplexing (OFDM) technology, which is frequently utilized in UWA systems because of its ability to avoid multipath fading and enhance spectrum utilization. We use an OFDM system with the Discrete Cosine Transform (DCT), Cyclic Prefix (CP), and equalization over the UWA communication channel under the effect of estimation errors. Seven modulation types are considered for classification, including Phase Shift Keying (PSK) and Quadrature Amplitude Modulation (QAM) (2/8/16-PSK and 4/8/16/32-QAM), with a Signal-to-Noise Ratio (SNR) ranging from −5 to 25 dB. Simulation results indicate that our CNN model with HT and ED at perfect channel estimation, achieves a 94% classification accuracy at 10 dB SNR, outperforming benchmark models by approximately 40%. Full article
Show Figures

Figure 1

31 pages, 5729 KiB  
Article
Signal-Induced Heap Transform-Based QR-Decomposition and Quantum Circuit for Implementing 3-Qubit Operations
by Artyom M. Grigoryan, Alexis Gomez, Isaac Espinoza and Sos S. Agaian
Information 2025, 16(6), 466; https://doi.org/10.3390/info16060466 - 30 May 2025
Cited by 1 | Viewed by 508
Abstract
This article presents a novel approach to the decomposition of unitary operations for 3-qubit systems by 28 controlled rotations and no permutations. The QR decomposition is described, which is based on the concept of the discrete signal-induced heap transform (DsiHT) and its quantum [...] Read more.
This article presents a novel approach to the decomposition of unitary operations for 3-qubit systems by 28 controlled rotations and no permutations. The QR decomposition is described, which is based on the concept of the discrete signal-induced heap transform (DsiHT) and its quantum analogue. This transform is generated by a given signal and may use different paths, or orders, of processing the data, and, among them, one can find paths that allow one to construct efficient quantum circuits for implementing multi-qubit unitary gates. The case of real unitary matrices is considered. The proposed approach is described in detail, and quantum circuits are presented for computing 3-qubit operations. This approach allowed us to write simple Qiskit codes to implement the decomposition of 3-qubit operations. Examples with quantum circuits for the quantum 3-qubit quantum cosine and Hartley transforms are described. Full article
(This article belongs to the Section Information Theory and Methodology)
Show Figures

Figure 1

44 pages, 12058 KiB  
Article
Harmonizer: A Universal Signal Tokenization Framework for Multimodal Large Language Models
by Amin Amiri, Alireza Ghaffarnia, Nafiseh Ghaffar Nia, Dalei Wu and Yu Liang
Mathematics 2025, 13(11), 1819; https://doi.org/10.3390/math13111819 - 29 May 2025
Viewed by 1374
Abstract
This paper introduces Harmonizer, a universal framework designed for tokenizing heterogeneous input signals, including text, audio, and video, to enable seamless integration into multimodal large language models (LLMs). Harmonizer employs a unified approach to convert diverse, non-linguistic signals into discrete tokens via its [...] Read more.
This paper introduces Harmonizer, a universal framework designed for tokenizing heterogeneous input signals, including text, audio, and video, to enable seamless integration into multimodal large language models (LLMs). Harmonizer employs a unified approach to convert diverse, non-linguistic signals into discrete tokens via its FusionQuantizer architecture, built on FluxFormer, to efficiently capture essential signal features while minimizing complexity. We enhance features through STFT-based spectral decomposition, Hilbert transform analytic signal extraction, and SCLAHE spectrogram contrast optimization, and train using a composite loss function to produce reliable embeddings and construct a robust vector vocabulary. Experimental validation on music datasets such as E-GMD v1.0.0, Maestro v3.0.0, and GTZAN demonstrates high fidelity across 288 s of vocal signals (MSE = 0.0037, CC = 0.9282, Cosine Sim. = 0.9278, DTW = 12.12, MFCC Sim. = 0.9997, Spectral Conv. = 0.2485). Preliminary tests on text reconstruction and UCF-101 video clips further confirm Harmonizer’s applicability across discrete and spatiotemporal modalities. Rooted in the universality of wave phenomena and Fourier theory, Harmonizer offers a physics-inspired, modality-agnostic fusion mechanism via wave superposition and interference principles. In summary, Harmonizer integrates natural language processing and signal processing into a coherent tokenization paradigm for efficient, interpretable multimodal learning. Full article
Show Figures

Figure 1

23 pages, 14842 KiB  
Article
Information-Guided Diffusion Model for Downscaling Land Surface Temperature from SDGSAT-1 Remote Sensing Images
by Jianxin Wang, Zhitao Fu, Bohui Tang and Jianhui Xu
Remote Sens. 2025, 17(10), 1669; https://doi.org/10.3390/rs17101669 - 9 May 2025
Viewed by 729
Abstract
Land Surface Temperature (LST) is a parameter retrieved through the thermal infrared band of remote sensing satellites, and it is a crucial parameter in various climate and environmental models. Compared to other multispectral bands, the thermal infrared bands have lower spatial resolution, which [...] Read more.
Land Surface Temperature (LST) is a parameter retrieved through the thermal infrared band of remote sensing satellites, and it is a crucial parameter in various climate and environmental models. Compared to other multispectral bands, the thermal infrared bands have lower spatial resolution, which limits their practical applications. Taking the Heihe River Basin in China as a case study, this research focuses on LST data retrieved from the SDGSAT-1 using the three-channel split-window algorithm. In this paper, we propose a novel approach, the Information-Guided Diffusion Model (IGDM), and apply it to downscale the SDGSAT-1 LST image. The results indicate that the downscaling accuracy of the SDGSAT-1 LST image using the proposed IGDM model outperforms that of Linear, Enhanced Deep Super-Resolution Network (EDSR), Super-Resolution Convolutional Neural Network (SRCNN), Discrete Cosine Transform and Local Spatial Attention (DCTLSA), and Denoising Diffusion Probabilistic Models (DDPM). Specifically, the RMSE of IGDM is reduced by 55.16%, 51.29%, 48.39%, 52.88%, and 17.18%. By incorporating auxiliary information, particularly when using NDVI and NDWI as auxiliary inputs, the performance of the IGDM model is significantly improved. Compared to DDPM, the RMSE of IGDM decreased from 0.666 to 0.574, MAE dropped from 0.517 to 0.376, and PSNR increased from 38.55 to 40.27. Overall, the results highlight the effectiveness of the auxiliary information-guided SDGSAT-1 LST downscaling diffusion model in generating high-resolution remote sensing LST data. Additionally, the study reveals the spatial feature impact of different auxiliary information in LST downscaling and the variations in features across different regions and temperature ranges. Full article
Show Figures

Figure 1

10 pages, 1800 KiB  
Article
Automatic Focusing of Off-Axis Digital Holographic Microscopy by Combining the Discrete Cosine Transform Sparse Dictionary with the Edge Preservation Index
by Zhaoliang Liu, Peizhen Qiu and Yupei Zhang
Optics 2025, 6(2), 17; https://doi.org/10.3390/opt6020017 - 6 May 2025
Viewed by 716
Abstract
Automatic focusing is a crucial research issue for achieving high-quality reconstructed images in digital holographic microscopy. This paper proposes an automatic focusing method that combines the discrete cosine transform (DCT) sparse dictionary with edge preservation index (EPI) criteria for off-axis digital holographic microscopy. [...] Read more.
Automatic focusing is a crucial research issue for achieving high-quality reconstructed images in digital holographic microscopy. This paper proposes an automatic focusing method that combines the discrete cosine transform (DCT) sparse dictionary with edge preservation index (EPI) criteria for off-axis digital holographic microscopy. Specifically, within a predefined search range, Fresnel transform is utilized to reconstruct the off-axis digital hologram, yielding reconstruction images at various reconstruction distances. Synchronously, the DCT sparse dictionary is employed to reduce speckle noise, and the EPI is calculated between the denoised image and original image. The value of EPI is used as an indicator for assessing the focal position. A single-peak focusing curve is obtained within the search range 10 mm, with a step size of 0.1 mm. Once the optimal focus position is determined, a focused and noise-reduced reconstructed image can be simultaneously achieved. Full article
Show Figures

Figure 1

26 pages, 7054 KiB  
Article
An Ensemble of Convolutional Neural Networks for Sound Event Detection
by Abdinabi Mukhamadiyev, Ilyos Khujayarov, Dilorom Nabieva and Jinsoo Cho
Mathematics 2025, 13(9), 1502; https://doi.org/10.3390/math13091502 - 1 May 2025
Viewed by 1264
Abstract
Sound event detection tasks are rapidly advancing in the field of pattern recognition, and deep learning methods are particularly well suited for such tasks. One of the important directions in this field is to detect the sounds of emotional events around residential buildings [...] Read more.
Sound event detection tasks are rapidly advancing in the field of pattern recognition, and deep learning methods are particularly well suited for such tasks. One of the important directions in this field is to detect the sounds of emotional events around residential buildings in smart cities and quickly assess the situation for security purposes. This research presents a comprehensive study of an ensemble convolutional recurrent neural network (CRNN) model designed for sound event detection (SED) in residential and public safety contexts. The work focuses on extracting meaningful features from audio signals using image-based representation, such as Discrete Cosine Transform (DCT) spectrograms, Cocheagrams, and Mel spectrograms, to enhance robustness against noise and improve feature extraction. In collaboration with police officers, a two-hour dataset consisting of 112 clips related to four classes of emotional sounds, such as harassment, quarrels, screams, and breaking sounds, was prepared. In addition to the crowdsourced dataset, publicly available datasets were used to broaden the study’s applicability. Our dataset contains 5055 audio files of different lengths totaling 14.14 h and strongly labeled data. The dataset consists of 13 separate sound categories. The proposed CRNN model integrates spatial and temporal feature extraction by processing these spectrograms through convolution and bi-directional gated recurrent unit (GRU) layers. An ensemble approach combines predictions from three models, achieving F1 scores of 71.5% for segment-based metrics and 46% for event-based metrics. The results demonstrate the model’s effectiveness in detecting sound events under noisy conditions, even with a small, unbalanced dataset. This research highlights the potential of the model for real-time audio surveillance systems using mini-computers, offering cost-effective and accurate solutions for maintaining public order. Full article
(This article belongs to the Special Issue Advanced Machine Vision with Mathematics)
Show Figures

Figure 1

21 pages, 5433 KiB  
Article
Efficient Implementation of Matrix-Based Image Processing Algorithms for IoT Applications
by Sorin Zoican and Roxana Zoican
Appl. Sci. 2025, 15(9), 4947; https://doi.org/10.3390/app15094947 - 29 Apr 2025
Viewed by 522
Abstract
This paper analyzes implementation approaches of matrix-based image processing algorithms. As an example, an image processing algorithm that provides both image compression and image denoising using random sample consensus and discrete cosine transform is analyzed. Two implementations are illustrated: one using the Blackfin [...] Read more.
This paper analyzes implementation approaches of matrix-based image processing algorithms. As an example, an image processing algorithm that provides both image compression and image denoising using random sample consensus and discrete cosine transform is analyzed. Two implementations are illustrated: one using the Blackfin processor with 32-bit fixed-point representation and the second using the convolutional neural network (CNN) accelerator in the MAX78000 microcontroller. Implementation with Blackfin can be considered a classic approach, in C language, possible on all existing microcontrollers. This implementation is improved by using two cores. The proposed implementation with the CNN accelerator is a new approach that effectively uses the dedicated accelerator for convolutional neural networks, with better results than a classical implementation. The execution time of matrix-based image processing algorithms can be reduced by using CNN accelerators already integrated in some modern microcontrollers to implement artificial intelligence algorithms. The proposed method uses CNN in a different way, not for artificial intelligence algorithms, but for matrix calculations using CNN resources effectively while maintaining the accuracy of the calculations. A comparison of these two implementations and the validation using MATLAB with 64 bits floating point representation are conducted. The obtained performance is good both in terms of quality of reconstructed image and execution time, and the performance differences between the infinite precision implementation and the finite precision implementation are small. The CNN accelerator implementation, based on matrix multiplication implemented using CNN, has a better performance suitable for Internet of Things applications. Full article
Show Figures

Figure 1

36 pages, 21603 KiB  
Article
Forensic Joint Photographic Experts Group (JPEG) Watermarking for Disk Image Leak Attribution: An Adaptive Discrete Cosine Transform–Discrete Wavelet Transform (DCT-DWT) Approach
by Belinda I. Onyeashie, Petra Leimich, Sean McKeown and Gordon Russell
Electronics 2025, 14(9), 1800; https://doi.org/10.3390/electronics14091800 - 28 Apr 2025
Viewed by 1053
Abstract
This paper presents a novel forensic watermarking method for digital evidence distribution in non-cloud environments. The approach addresses the critical need for the secure sharing of Joint Photographic Experts Group (JPEG) images in forensic investigations. The method utilises an adaptive Discrete Cosine Transform–Discrete [...] Read more.
This paper presents a novel forensic watermarking method for digital evidence distribution in non-cloud environments. The approach addresses the critical need for the secure sharing of Joint Photographic Experts Group (JPEG) images in forensic investigations. The method utilises an adaptive Discrete Cosine Transform–Discrete Wavelet Transform (DCT-DWT) domain technique to embed a 64-bit watermark in both stand-alone JPEGs and those within forensic disk images. This occurs without alterations to disk structure or complications to the chain of custody. The system implements uniform secure randomisation and recipient-specific watermarks to balance security with forensic workflow efficiency. This work presents the first implementation of forensic watermarking at the disk image level that preserves structural integrity and enables precise leak source attribution. It addresses a critical gap in secure evidence distribution methodologies. The evaluation occurred on extensive datasets: 1124 JPEGs in a forensic disk image, 10,000 each of BOSSBase 256 × 256 and 512 × 512 greyscale images, and 10,000 COCO2017 coloured images. The results demonstrate high imperceptibility with average Peak Signal-to-Noise Ratio (PSNR) values ranging from 46.13 dB to 49.37 dB across datasets. The method exhibits robust performance against geometric attacks with perfect watermark recovery (Bit Error Rate (BER) = 0) for rotations up to 90° and scaling factors between 0.6 and 1.5. The approach maintains compatibility with forensic tools like Forensic Toolkit FTK and Autopsy. It performs effectively under attacks including JPEG compression (QF ≥ 60), filtering, and noise addition. The technique achieves high feature match ratios between 0.684 and 0.690 for a threshold of 0.70, with efficient processing times (embedding: 0.0347 s to 0.1187 s; extraction: 0.0077 s to 0.0366 s). This watermarking technique improves forensic investigation processes, particularly those that involve sensitive JPEG files. It supports leak source attribution, preserves evidence integrity, and provides traceability throughout forensic procedures. Full article
(This article belongs to the Special Issue Advances in Cyber-Security and Machine Learning)
Show Figures

Figure 1

20 pages, 4930 KiB  
Article
Light Field Super-Resolution via Dual-Domain High-Frequency Restoration and State-Space Fusion
by Zhineng Zhang, Tao Yan, Hao Huang, Jinsheng Liu, Chenglong Wang and Cihang Wei
Electronics 2025, 14(9), 1747; https://doi.org/10.3390/electronics14091747 - 25 Apr 2025
Viewed by 612
Abstract
The current light field super-resolution methods mainly face the following challenges: difficulty in handling redundant information in light fields; heavy reliance on the spatial domain to recover details; and insufficient interaction of spatial and angular features. We propose a novel light field super-resolution [...] Read more.
The current light field super-resolution methods mainly face the following challenges: difficulty in handling redundant information in light fields; heavy reliance on the spatial domain to recover details; and insufficient interaction of spatial and angular features. We propose a novel light field super-resolution (LF-SR) network, termed DHSFNet, which effectively enhances super-resolution performance from a dual-domain perspective, encompassing both the frequency and spatial domains. Our DHSFNet contains three key points. (1) A local sparse angular attention module (LSAA) is proposed to selectively capture relationships between adjacent sub-views using geometric prior information to reduce computational complexity. (2) We design a dual-domain high-frequency restoration sub-network, with a frequency-domain branch using mask-guided multi-scale discrete cosine transform (DCT) restoration and a spatial-domain branch employing multi-scale cross-attention to recover texture details. (3) A Mamba-based fusion module (MF) is introduced to efficiently facilitate global spatial–angular interaction, which achieves linear complexity and outperforms Transformer-based methods in both accuracy and speed. Comprehensive experiments conducted on three benchmark datasets demonstrate the superior performance of our method in the LF-SR task. Full article
Show Figures

Figure 1

Back to TopTop