MDPI Contact

MDPI AG
St. Alban-Anlage 66,
4052 Basel, Switzerland
Support contact
Tel. +41 61 683 77 34
Fax: +41 61 302 89 18

For more contact information, see here.

Advanced Search

You can use * to search for partial matches.

Search Results

16 articles matched your search query. Search Parameters:
Keywords = spectrogram

Matches by word:

SPECTROGRAM (16)

View options
order results:
result details:
results per page:
Articles per page View Sort by
Displaying article 1-50 on page 1 of 1.
Export citation of selected articles as:
Open AccessArticle Simulations on Monitoring and Evaluation of Plasticity-Driven Material Damage Based on Second Harmonic of S0 Mode Lamb Waves in Metallic Plates
Materials 2017, 10(7), 827; doi:10.3390/ma10070827
Received: 13 May 2017 / Revised: 9 July 2017 / Accepted: 12 July 2017 / Published: 19 July 2017
Viewed by 142 | PDF Full-text (4075 KB) | HTML Full-text | XML Full-text
Abstract
In this study, a numerical approach—the discontinuous Meshless Local Petrov-Galerkin-Eshelby Method (MLPGEM)—was adopted to simulate and measure material plasticity in an Al 7075-T651 plate. The plate was modeled in two dimensions by assemblies of small particles that interact with each other through bonding
[...] Read more.
In this study, a numerical approach—the discontinuous Meshless Local Petrov-Galerkin-Eshelby Method (MLPGEM)—was adopted to simulate and measure material plasticity in an Al 7075-T651 plate. The plate was modeled in two dimensions by assemblies of small particles that interact with each other through bonding stiffness. The material plasticity of the model loaded to produce different levels of strain is evaluated with the Lamb waves of S0 mode. A tone burst at the center frequency of 200 kHz was used as excitation. Second-order nonlinear wave was extracted from the spectrogram of a signal receiving point. Tensile-driven plastic deformation and cumulative second harmonic generation of S0 mode were observed in the simulation. Simulated measurement of the acoustic nonlinearity increased monotonically with the level of tensile-driven plastic strain captured by MLPGEM, whereas achieving this state by other numerical methods is comparatively more difficult. This result indicates that the second harmonics of S0 mode can be employed to monitor and evaluate the material or structural early-stage damage induced by plasticity. Full article
(This article belongs to the Special Issue Structural Health Monitoring for Aerospace Applications 2017)
Figures

Figure 1

Open AccessArticle Deep Visual Attributes vs. Hand-Crafted Audio Features on Multidomain Speech Emotion Recognition
Computation 2017, 5(2), 26; doi:10.3390/computation5020026
Received: 31 March 2017 / Revised: 25 May 2017 / Accepted: 27 May 2017 / Published: 1 June 2017
Viewed by 234 | PDF Full-text (5164 KB) | HTML Full-text | XML Full-text
Abstract
Emotion recognition from speech may play a crucial role in many applications related to human–computer interaction or understanding the affective state of users in certain tasks, where other modalities such as video or physiological parameters are unavailable. In general, a human’s emotions may
[...] Read more.
Emotion recognition from speech may play a crucial role in many applications related to human–computer interaction or understanding the affective state of users in certain tasks, where other modalities such as video or physiological parameters are unavailable. In general, a human’s emotions may be recognized using several modalities such as analyzing facial expressions, speech, physiological parameters (e.g., electroencephalograms, electrocardiograms) etc. However, measuring of these modalities may be difficult, obtrusive or require expensive hardware. In that context, speech may be the best alternative modality in many practical applications. In this work we present an approach that uses a Convolutional Neural Network (CNN) functioning as a visual feature extractor and trained using raw speech information. In contrast to traditional machine learning approaches, CNNs are responsible for identifying the important features of the input thus, making the need of hand-crafted feature engineering optional in many tasks. In this paper no extra features are required other than the spectrogram representations and hand-crafted features were only extracted for validation purposes of our method. Moreover, it does not require any linguistic model and is not specific to any particular language. We compare the proposed approach using cross-language datasets and demonstrate that it is able to provide superior results vs. traditional ones that use hand-crafted features. Full article
Figures

Figure 1

Open AccessArticle Performance Comparison of Time-Frequency Distributions for Estimation of Instantaneous Frequency of Heart Rate Variability Signals
Appl. Sci. 2017, 7(3), 221; doi:10.3390/app7030221
Received: 20 December 2016 / Revised: 17 February 2017 / Accepted: 23 February 2017 / Published: 27 February 2017
Viewed by 412 | PDF Full-text (1147 KB) | HTML Full-text | XML Full-text
Abstract
The instantaneous frequency (IF) of a non-stationary signal is usually estimated from a time-frequency distribution (TFD). The IF of heart rate variability (HRV) is an important parameter because the power in a frequency band around the IF can be used for the interpretation
[...] Read more.
The instantaneous frequency (IF) of a non-stationary signal is usually estimated from a time-frequency distribution (TFD). The IF of heart rate variability (HRV) is an important parameter because the power in a frequency band around the IF can be used for the interpretation and analysis of the respiratory rate but also for a more accurate analysis of heart rate (HR) signals. In this study, we compare the performance of five states of the art kernel-based time-frequency distributions (TFDs) in terms of their ability to accurately estimate the IF of HR signals. The selected TFDs include three widely used fixed kernel methods: the modified B distribution, the S-method and the spectrogram; and two adaptive kernel methods: the adaptive optimal kernel TFD and the recently developed adaptive directional TFD. The IF of the respiratory signal, which is usually easier to estimate as the respiratory signal is a mono-component with small amplitude variations with time, is used as a reference to examine the accuracy of the HRV IF estimates. Experimental results indicate that the most reliable estimates are obtained using the adaptive directional TFD in comparison to other commonly used methods such as the adaptive optimal kernel TFD and the modified B distribution. Full article
Figures

Figure 1

Open AccessArticle Micro-Doppler Based Classification of Human Aquatic Activities via Transfer Learning of Convolutional Neural Networks
Sensors 2016, 16(12), 1990; doi:10.3390/s16121990
Received: 19 September 2016 / Revised: 7 November 2016 / Accepted: 18 November 2016 / Published: 24 November 2016
Cited by 2 | Viewed by 610 | PDF Full-text (2193 KB) | HTML Full-text | XML Full-text
Abstract
Accurate classification of human aquatic activities using radar has a variety of potential applications such as rescue operations and border patrols. Nevertheless, the classification of activities on water using radar has not been extensively studied, unlike the case on dry ground, due to
[...] Read more.
Accurate classification of human aquatic activities using radar has a variety of potential applications such as rescue operations and border patrols. Nevertheless, the classification of activities on water using radar has not been extensively studied, unlike the case on dry ground, due to its unique challenge. Namely, not only is the radar cross section of a human on water small, but the micro-Doppler signatures are much noisier due to water drops and waves. In this paper, we first investigate whether discriminative signatures could be obtained for activities on water through a simulation study. Then, we show how we can effectively achieve high classification accuracy by applying deep convolutional neural networks (DCNN) directly to the spectrogram of real measurement data. From the five-fold cross-validation on our dataset, which consists of five aquatic activities, we report that the conventional feature-based scheme only achieves an accuracy of 45.1%. In contrast, the DCNN trained using only the collected data attains 66.7%, and the transfer learned DCNN, which takes a DCNN pre-trained on a RGB image dataset and fine-tunes the parameters using the collected data, achieves a much higher 80.3%, which is a significant performance boost. Full article
(This article belongs to the Special Issue Non-Contact Sensing)
Figures

Figure 1

Open AccessCommunication Eventogram: A Visual Representation of Main Events in Biomedical Signals
Bioengineering 2016, 3(4), 22; doi:10.3390/bioengineering3040022
Received: 19 August 2016 / Revised: 15 September 2016 / Accepted: 18 September 2016 / Published: 22 September 2016
Cited by 2 | Viewed by 1071 | PDF Full-text (1309 KB) | HTML Full-text | XML Full-text
Abstract
Biomedical signals carry valuable physiological information and many researchers have difficulty interpreting and analyzing long-term, one-dimensional, quasi-periodic biomedical signals. Traditionally, biomedical signals are analyzed and visualized using periodogram, spectrogram, and wavelet methods. However, these methods do not offer an informative visualization of main
[...] Read more.
Biomedical signals carry valuable physiological information and many researchers have difficulty interpreting and analyzing long-term, one-dimensional, quasi-periodic biomedical signals. Traditionally, biomedical signals are analyzed and visualized using periodogram, spectrogram, and wavelet methods. However, these methods do not offer an informative visualization of main events within the processed signal. This paper attempts to provide an event-related framework to overcome the drawbacks of the traditional visualization methods and describe the main events within the biomedical signal in terms of duration and morphology. Electrocardiogram and photoplethysmogram signals are used in the analysis to demonstrate the differences between the traditional visualization methods, and their performance is compared against the proposed method, referred to as the “eventogram” in this paper. The proposed method is based on two event-related moving averages that visualizes the main time-domain events in the processed biomedical signals. The traditional visualization methods were unable to find dominant events in processed signals while the eventogram was able to visualize dominant events in signals in terms of duration and morphology. Moreover, eventogram-based detection algorithms succeeded with detecting main events in different biomedical signals with a sensitivity and positive predictivity >95%. The output of the eventogram captured unique patterns and signatures of physiological events, which could be used to visualize and identify abnormal waveforms in any quasi-periodic signal. Full article
Figures

Figure 1

Open AccessArticle Mobile Diagnostics Based on Motion? A Close Look at Motility Patterns in the Schistosome Life Cycle
Diagnostics 2016, 6(2), 24; doi:10.3390/diagnostics6020024
Received: 2 February 2016 / Revised: 8 April 2016 / Accepted: 23 May 2016 / Published: 17 June 2016
Viewed by 1175 | PDF Full-text (6117 KB) | HTML Full-text | XML Full-text | Supplementary Files
Abstract
Imaging at high resolution and subsequent image analysis with modified mobile phones have the potential to solve problems related to microscopy-based diagnostics of parasitic infections in many endemic regions. Diagnostics using the computing power of “smartphones” is not restricted by limited expertise or
[...] Read more.
Imaging at high resolution and subsequent image analysis with modified mobile phones have the potential to solve problems related to microscopy-based diagnostics of parasitic infections in many endemic regions. Diagnostics using the computing power of “smartphones” is not restricted by limited expertise or limitations set by visual perception of a microscopist. Thus diagnostics currently almost exclusively dependent on recognition of morphological features of pathogenic organisms could be based on additional properties, such as motility characteristics recognizable by computer vision. Of special interest are infectious larval stages and “micro swimmers” of e.g., the schistosome life cycle, which infect the intermediate and definitive hosts, respectively. The ciliated miracidium, emerges from the excreted egg upon its contact with water. This means that for diagnostics, recognition of a swimming miracidium is equivalent to recognition of an egg. The motility pattern of miracidia could be defined by computer vision and used as a diagnostic criterion. To develop motility pattern-based diagnostics of schistosomiasis using simple imaging devices, we analyzed Paramecium as a model for the schistosome miracidium. As a model for invasive nematodes, such as strongyloids and filaria, we examined a different type of motility in the apathogenic nematode Turbatrix, the “vinegar eel.” The results of motion time and frequency analysis suggest that target motility may be expressed as specific spectrograms serving as “diagnostic fingerprints.” Full article
(This article belongs to the Special Issue Mobile Diagnosis)
Open AccessArticle Chord Recognition Based on Temporal Correlation Support Vector Machine
Appl. Sci. 2016, 6(5), 157; doi:10.3390/app6050157
Received: 11 February 2016 / Revised: 6 May 2016 / Accepted: 6 May 2016 / Published: 19 May 2016
Viewed by 593 | PDF Full-text (1977 KB) | HTML Full-text | XML Full-text
Abstract
In this paper, we propose a method called temporal correlation support vector machine (TCSVM) for automatic major-minor chord recognition in audio music. We first use robust principal component analysis to separate the singing voice from the music to reduce the influence of the
[...] Read more.
In this paper, we propose a method called temporal correlation support vector machine (TCSVM) for automatic major-minor chord recognition in audio music. We first use robust principal component analysis to separate the singing voice from the music to reduce the influence of the singing voice and consider the temporal correlations of the chord features. Using robust principal component analysis, we expect the low-rank component of the spectrogram matrix to contain the musical accompaniment and the sparse component to contain the vocal signals. Then, we extract a new logarithmic pitch class profile (LPCP) feature called enhanced LPCP from the low-rank part. To exploit the temporal correlation among the LPCP features of chords, we propose an improved support vector machine algorithm called TCSVM. We perform this study using the MIREX’09 (Music Information Retrieval Evaluation eXchange) Audio Chord Estimation dataset. Furthermore, we conduct comprehensive experiments using different pitch class profile feature vectors to examine the performance of TCSVM. The results of our method are comparable to the state-of-the-art methods that entered the MIREX in 2013 and 2014 for the MIREX’09 Audio Chord Estimation task dataset. Full article
(This article belongs to the Special Issue Audio Signal Processing) Printed Edition available
Figures

Open AccessArticle Frequency Management for Electromagnetic Continuous Wave Conductivity Meters
Sensors 2016, 16(4), 490; doi:10.3390/s16040490
Received: 2 December 2015 / Revised: 25 February 2016 / Accepted: 29 February 2016 / Published: 7 April 2016
Cited by 2 | Viewed by 725 | PDF Full-text (9890 KB) | HTML Full-text | XML Full-text
Abstract
Ground conductivity meters use electromagnetic fields for the mapping of geological variations, like the determination of water amount, depending on ground layers, which is important for the state analysis of embankments. The VLF band is contaminated by numerous natural and artificial electromagnetic interference
[...] Read more.
Ground conductivity meters use electromagnetic fields for the mapping of geological variations, like the determination of water amount, depending on ground layers, which is important for the state analysis of embankments. The VLF band is contaminated by numerous natural and artificial electromagnetic interference signals. Prior to the determination of ground conductivity, the meter’s working frequency is not possible, due to the variable frequency of the interferences. Frequency management based on the analysis of the selected band using track-before-detect (TBD) algorithms, which allows dynamical frequency changes of the conductivity of the meter transmitting part, is proposed in the paper. Naive maximum value search, spatio-temporal TBD (ST-TBD), Viterbi TBD and a new algorithm that uses combined ST-TBD and Viterbi TBD are compared. Monte Carlo tests are provided for the numerical analysis of the properties for a single interference signal in the considered band, and a new approach based on combined ST-TBD and Viterbi algorithms shows the best performance. The considered algorithms process spectrogram data for the selected band, so DFT (Discrete Fourier Transform) could be applied for the computation of the spectrogram. Real–time properties, related to the latency, are discussed also, and it is shown that TBD algorithms are feasible for real applications. Full article
(This article belongs to the Section Physical Sensors)
Open AccessArticle A Complexity-Based Approach for the Detection of Weak Signals in Ocean Ambient Noise
Entropy 2016, 18(3), 101; doi:10.3390/e18030101
Received: 17 December 2015 / Revised: 4 March 2016 / Accepted: 8 March 2016 / Published: 18 March 2016
Cited by 3 | Viewed by 848 | PDF Full-text (3390 KB) | HTML Full-text | XML Full-text
Abstract
There are numerous studies showing that there is a constant increase in the ocean ambient noise level and the ever-growing demand for developing algorithms for detecting weak signals in ambient noise. In this study, we utilize dynamical and statistical complexity to detect the
[...] Read more.
There are numerous studies showing that there is a constant increase in the ocean ambient noise level and the ever-growing demand for developing algorithms for detecting weak signals in ambient noise. In this study, we utilize dynamical and statistical complexity to detect the presence of weak ship noise embedded in ambient noise. The ambient noise and ship noise were recorded in the South China Sea. The multiscale entropy (MSE) method and the complexity-entropy causality plane (C-H plane) were used to quantify the dynamical and statistical complexity of the measured time series, respectively. We generated signals with varying signal-to-noise ratio (SNR) by varying the amplification of a ship signal. The simulation results indicate that the complexity is sensitive to change in the information in the ambient noise and the change in SNR, a finding that enables the detection of weak ship signals in strong background ambient noise. The simulation results also illustrate that complexity is better than the traditional spectrogram method, particularly effective for detecting low SNR signals in ambient noise. In addition, complexity-based MSE and C-H plane methods are simple, robust and do not assume any underlying dynamics in time series. Hence, complexity should be used in practical situations. Full article
(This article belongs to the Special Issue Computational Complexity)
Open AccessArticle Exposure to Air Ions in Indoor Environments: Experimental Study with Healthy Adults
Int. J. Environ. Res. Public Health 2015, 12(11), 14301-14311; doi:10.3390/ijerph121114301
Received: 18 June 2015 / Revised: 12 October 2015 / Accepted: 5 November 2015 / Published: 10 November 2015
Viewed by 895 | PDF Full-text (715 KB) | HTML Full-text | XML Full-text | Supplementary Files
Abstract
Since the beginning of the 20th century there has been a scientific debate about the potential effects of air ions on biological tissues, wellbeing and health. Effects on the cardiovascular and respiratory system as well as on mental health have been described. In
[...] Read more.
Since the beginning of the 20th century there has been a scientific debate about the potential effects of air ions on biological tissues, wellbeing and health. Effects on the cardiovascular and respiratory system as well as on mental health have been described. In recent years, there has been a renewed interest in this topic. In an experimental indoor setting we conducted a double-blind cross-over trial to determine if higher levels of air ions, generated by a special wall paint, affect cognitive performance, wellbeing, lung function, and cardiovascular function. Twenty healthy non-smoking volunteers (10 female, 10 male) participated in the study. Levels of air ions, volatile organic compounds and indoor climate factors were determined by standardized measurement procedures. Air ions affected the autonomous nervous system (in terms of an increase of sympathetic activity accompanied by a small decrease of vagal efferent activity): In the test room with higher levels of air ions (2194/cm3 vs. 1038/cm3) a significantly higher low to high frequency ratio of the electrocardiography (ECG) beat-to-beat interval spectrogram was found. Furthermore, six of nine subtests of a cognitive performance test were solved better, three of them statistically significant (verbal factor, reasoning, and perceptual speed), in the room with higher ion concentration. There was no influence of air ions on lung function and on wellbeing. Our results indicate slightly activating and cognitive performance enhancing effects of a short-term exposure to higher indoor air ion concentrations. Full article
Open AccessArticle A New Reassigned Spectrogram Method in Interference Detection for GNSS Receivers
Sensors 2015, 15(9), 22167-22191; doi:10.3390/s150922167
Received: 2 June 2015 / Revised: 22 August 2015 / Accepted: 24 August 2015 / Published: 2 September 2015
Viewed by 801 | PDF Full-text (1655 KB) | HTML Full-text | XML Full-text
Abstract
Interference detection is very important for Global Navigation Satellite System (GNSS) receivers. Current work on interference detection in GNSS receivers has mainly focused on time-frequency (TF) analysis techniques, such as spectrogram and Wigner–Ville distribution (WVD), where the spectrogram approach presents the TF resolution
[...] Read more.
Interference detection is very important for Global Navigation Satellite System (GNSS) receivers. Current work on interference detection in GNSS receivers has mainly focused on time-frequency (TF) analysis techniques, such as spectrogram and Wigner–Ville distribution (WVD), where the spectrogram approach presents the TF resolution trade-off problem, since the analysis window is used, and the WVD method suffers from the very serious cross-term problem, due to its quadratic TF distribution nature. In order to solve the cross-term problem and to preserve good TF resolution in the TF plane at the same time, in this paper, a new TF distribution by using a reassigned spectrogram has been proposed in interference detection for GNSS receivers. This proposed reassigned spectrogram method efficiently combines the elimination of the cross-term provided by the spectrogram itself according to its inherent nature and the improvement of the TF aggregation property achieved by the reassignment method. Moreover, a notch filter has been adopted in interference mitigation for GNSS receivers, where receiver operating characteristics (ROCs) are used as metrics for the characterization of interference mitigation performance. The proposed interference detection method by using a reassigned spectrogram is evaluated by experiments on GPS L1 signals in the disturbing scenarios in comparison to the state-of-the-art TF analysis approaches. The analysis results show that the proposed interference detection technique effectively overcomes the cross-term problem and also keeps good TF localization properties, which has been proven to be valid and effective to enhance the interference Sensors 2015, 15 22168 detection performance; in addition, the adoption of the notch filter in interference mitigation has shown a significant acquisition performance improvement in terms of ROC curves for GNSS receivers in jamming environments. Full article
(This article belongs to the Section Remote Sensors)
Open AccessArticle Time-Frequency Feature Representation Using Multi-Resolution Texture Analysis and Acoustic Activity Detector for Real-Life Speech Emotion Recognition
Sensors 2015, 15(1), 1458-1478; doi:10.3390/s150101458
Received: 16 September 2014 / Accepted: 1 December 2014 / Published: 14 January 2015
Cited by 7 | Viewed by 1481 | PDF Full-text (1190 KB) | HTML Full-text | XML Full-text
Abstract
The classification of emotional speech is mostly considered in speech-related research on human-computer interaction (HCI). In this paper, the purpose is to present a novel feature extraction based on multi-resolutions texture image information (MRTII). The MRTII feature set is derived from multi-resolution texture
[...] Read more.
The classification of emotional speech is mostly considered in speech-related research on human-computer interaction (HCI). In this paper, the purpose is to present a novel feature extraction based on multi-resolutions texture image information (MRTII). The MRTII feature set is derived from multi-resolution texture analysis for characterization and classification of different emotions in a speech signal. The motivation is that we have to consider emotions have different intensity values in different frequency bands. In terms of human visual perceptual, the texture property on multi-resolution of emotional speech spectrogram should be a good feature set for emotion classification in speech. Furthermore, the multi-resolution analysis on texture can give a clearer discrimination between each emotion than uniform-resolution analysis on texture. In order to provide high accuracy of emotional discrimination especially in real-life, an acoustic activity detection (AAD) algorithm must be applied into the MRTII-based feature extraction. Considering the presence of many blended emotions in real life, in this paper make use of two corpora of naturally-occurring dialogs recorded in real-life call centers. Compared with the traditional Mel-scale Frequency Cepstral Coefficients (MFCC) and the state-of-the-art features, the MRTII features also can improve the correct classification rates of proposed systems among different language databases. Experimental results show that the proposed MRTII-based feature information inspired by human visual perception of the spectrogram image can provide significant classification for real-life emotional recognition in speech. Full article
(This article belongs to the Section Physical Sensors)
Open AccessArticle The Feature Extraction Based on Texture Image Information for Emotion Sensing in Speech
Sensors 2014, 14(9), 16692-16714; doi:10.3390/s140916692
Received: 9 June 2014 / Revised: 24 August 2014 / Accepted: 29 August 2014 / Published: 9 September 2014
Cited by 13 | Viewed by 1546 | PDF Full-text (1377 KB) | HTML Full-text | XML Full-text
Abstract
In this paper, we present a novel texture image feature for Emotion Sensing in Speech (ESS). This idea is based on the fact that the texture images carry emotion-related information. The feature extraction is derived from time-frequency representation of spectrogram images. First, we
[...] Read more.
In this paper, we present a novel texture image feature for Emotion Sensing in Speech (ESS). This idea is based on the fact that the texture images carry emotion-related information. The feature extraction is derived from time-frequency representation of spectrogram images. First, we transform the spectrogram as a recognizable image. Next, we use a cubic curve to enhance the image contrast. Then, the texture image information (TII) derived from the spectrogram image can be extracted by using Laws’ masks to characterize emotional state. In order to evaluate the effectiveness of the proposed emotion recognition in different languages, we use two open emotional databases including the Berlin Emotional Speech Database (EMO-DB) and eNTERFACE corpus and one self-recorded database (KHUSC-EmoDB), to evaluate the performance cross-corpora. The results of the proposed ESS system are presented using support vector machine (SVM) as a classifier. Experimental results show that the proposed TII-based feature extraction inspired by visual perception can provide significant classification for ESS systems. The two-dimensional (2-D) TII feature can provide the discrimination between different emotions in visual expressions except for the conveyance pitch and formant tracks. In addition, the de-noising in 2-D images can be more easily completed than de-noising in 1-D speech. Full article
(This article belongs to the Section Physical Sensors)
Open AccessArticle An FPGA-Based Rapid Wheezing Detection System
Int. J. Environ. Res. Public Health 2014, 11(2), 1573-1593; doi:10.3390/ijerph110201573
Received: 30 December 2013 / Revised: 24 January 2014 / Accepted: 24 January 2014 / Published: 29 January 2014
Cited by 7 | Viewed by 1793 | PDF Full-text (1657 KB) | HTML Full-text | XML Full-text
Abstract
Wheezing is often treated as a crucial indicator in the diagnosis of obstructive pulmonary diseases. A rapid wheezing detection system may help physicians to monitor patients over the long-term. In this study, a portable wheezing detection system based on a field-programmable gate array
[...] Read more.
Wheezing is often treated as a crucial indicator in the diagnosis of obstructive pulmonary diseases. A rapid wheezing detection system may help physicians to monitor patients over the long-term. In this study, a portable wheezing detection system based on a field-programmable gate array (FPGA) is proposed. This system accelerates wheezing detection, and can be used as either a single-process system, or as an integrated part of another biomedical signal detection system. The system segments sound signals into 2-second units. A short-time Fourier transform was used to determine the relationship between the time and frequency components of wheezing sound data. A spectrogram was processed using 2D bilateral filtering, edge detection, multithreshold image segmentation, morphological image processing, and image labeling, to extract wheezing features according to computerized respiratory sound analysis (CORSA) standards. These features were then used to train the support vector machine (SVM) and build the classification models. The trained model was used to analyze sound data to detect wheezing. The system runs on a Xilinx Virtex-6 FPGA ML605 platform. The experimental results revealed that the system offered excellent wheezing recognition performance (0.912). The detection process can be used with a clock frequency of 51.97 MHz, and is able to perform rapid wheezing classification. Full article
Open AccessArticle A Novel Voice Sensor for the Detection of Speech Signals
Sensors 2013, 13(12), 16533-16550; doi:10.3390/s131216533
Received: 3 November 2013 / Revised: 26 November 2013 / Accepted: 27 November 2013 / Published: 2 December 2013
Viewed by 1383 | PDF Full-text (817 KB) | HTML Full-text | XML Full-text
Abstract
In order to develop a novel voice sensor to detect human voices, the use of features which are more robust to noise is an important issue. Voice sensor is also called voice activity detection (VAD). Due to that the inherent nature of the
[...] Read more.
In order to develop a novel voice sensor to detect human voices, the use of features which are more robust to noise is an important issue. Voice sensor is also called voice activity detection (VAD). Due to that the inherent nature of the formant structure only occurred on the speech spectrogram (well-known as voiceprint), Wu et al. were the first to use band-spectral entropy (BSE) to describe the characteristics of voiceprints. However, the performance of VAD based on BSE feature was degraded in colored noise (or voiceprint-like noise) environments. In order to solve this problem, we propose the two-dimensional part-band energy entropy (TD-PBEE) parameter based on two variables: part-band partition number upon frequency index and long-term window size upon time index to further improve the BSE-based VAD algorithm. The two variables can efficiently represent the characteristics of voiceprints on each critical frequency band and use long-term information for noisy speech spectrograms, respectively. The TD-PBEE parameter can be regarded as a PBEE parameter over time. First, the strength of voiceprints can be partly enhanced by using four entropies applied to four part-bands. We can use the four part-band energy entropies for describing the voiceprints in detail. Due to the characteristics of non-stationary for speech and various noises, we will then use long-term information processing to refine the PBEE, so the voice-like noise can be distinguished from noisy speech through the concept of PBEE with long-term information. Our experiments show that the proposed feature extraction with the TD-PBEE parameter is quite insensitive to background noise. The proposed TD-PBEE-based VAD algorithm is evaluated for four types of noises and five signal-to-noise ratio (SNR) levels. We find that the accuracy of the proposed TD-PBEE-based VAD algorithm averaged over all noises and all SNR levels is better than that of other considered VAD algorithms. Full article
(This article belongs to the Section Physical Sensors)
Open AccessArticle Radio Frequency Interference Detection and Mitigation Algorithms Based on Spectrogram Analysis
Algorithms 2011, 4(4), 239-261; doi:10.3390/a4040239
Received: 26 August 2011 / Revised: 3 October 2011 / Accepted: 13 October 2011 / Published: 25 October 2011
Cited by 7 | Viewed by 3029 | PDF Full-text (1428 KB) | HTML Full-text | XML Full-text
Abstract
Radio Frequency Interference (RFI) detection and mitigation algorithms based on a signal’s spectrogram (frequency and time domain representation) are presented. The radiometric signal’s spectrogram is treated as an image, and therefore image processing techniques are applied to detect and mitigate RFI by two-dimensional
[...] Read more.
Radio Frequency Interference (RFI) detection and mitigation algorithms based on a signal’s spectrogram (frequency and time domain representation) are presented. The radiometric signal’s spectrogram is treated as an image, and therefore image processing techniques are applied to detect and mitigate RFI by two-dimensional filtering. A series of Monte-Carlo simulations have been performed to evaluate the performance of a simple thresholding algorithm and a modified two-dimensional Wiener filter. Full article

Years

Subjects

Refine Subjects

Journals

All Journals Refine Journals

Article Types

Refine Types

Countries

All Countries Refine Countries
Back to Top