Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (39)

Search Parameters:
Keywords = noise silencer

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 1588 KiB  
Article
An Eye-Tracking Study on Text Comprehension While Listening to Music: Preliminary Results
by Georgia Andreou and Maria Gkantaki
Appl. Sci. 2025, 15(7), 3939; https://doi.org/10.3390/app15073939 - 3 Apr 2025
Viewed by 2097
Abstract
The aim of the present study was to examine the effect of background music on text comprehension using eye-tracking technology. Ten Greek undergraduate students read four texts under the following four reading conditions: preferred music, non-preferred music, café noise, and in silence. Eye [...] Read more.
The aim of the present study was to examine the effect of background music on text comprehension using eye-tracking technology. Ten Greek undergraduate students read four texts under the following four reading conditions: preferred music, non-preferred music, café noise, and in silence. Eye movements were tracked to assess visual patterns, while reading performance and attitudes were also evaluated. The results showed that fixation measures remained stable across conditions, suggesting that early visual processing is not significantly influenced by auditory distractions. However, reading performance significantly declined under non-preferred music, highlighting its disruptive impact on cognitive processing. Participants also reported greater difficulty and fatigue in this condition, consistent with an increased cognitive load. In contrast, preferred music and silence were associated with enhanced understanding, confidence, and immersion, café noise also had a moderate but manageable effect on reading outcomes. These findings underscore the importance of tailoring reading environments to individual preferences in order to optimize reading performance and engagement. Future research studies should focus on the effects of different musical attributes, such as tempo and genre, and use more complex reading tasks, in order to better understand how auditory stimuli interact with cognitive load and visual processing. Full article
(This article belongs to the Special Issue Latest Research on Eye Tracking Applications)
Show Figures

Figure 1

14 pages, 3662 KiB  
Article
Study on the Design of the Gear Pair and Flow Characteristics of Circular-Arc Gear Pumps
by Geqiang Li, Yunda Liu, Weifeng Han, Donglin Li, Shuai Wang and Zhenchao Hao
Appl. Sci. 2025, 15(7), 3911; https://doi.org/10.3390/app15073911 - 2 Apr 2025
Viewed by 479
Abstract
Compared with traditional gear pumps, circular-arc gear pumps have the advantages of silence and small flow pulsation, but the theory of design is underdeveloped. This paper presents a design method for gear pumps with circular-arc helical gear pairs, and the influence mechanism of [...] Read more.
Compared with traditional gear pumps, circular-arc gear pumps have the advantages of silence and small flow pulsation, but the theory of design is underdeveloped. This paper presents a design method for gear pumps with circular-arc helical gear pairs, and the influence mechanism of flow characteristics is studied. First, a model of the gear pair is established, and a design method for the gear pair is proposed. Second, a CFD model is demonstrated, and the influences of the tooth profile parameters (tooth number, modules, and pressure angle) on the flow characteristics are analyzed. Finally, the significance of the influencing factors is analyzed. The results show that when the stagger angle of the two ends of the arc helical gear pair is an integral multiple of π/Z, there is no flow pulsation, and there is little noise. The tooth number and modules are positively correlated with the flow rate and flow pulsation, among which the modules have the most significant influence. The flow rate of the gear pump increases by 4–5 L/min for every 0.2 increase in the modules. The pressure angle and flow rate show a negative correlation trend, but the influence is insignificant. The flow rate is less than 1 L/min for every 2° change in the pressure angle. This paper provides a theoretical basis and reference value for the gear pair design of gear pumps. Full article
(This article belongs to the Section Mechanical Engineering)
Show Figures

Figure 1

17 pages, 7407 KiB  
Article
Low-Frequency Sound Absorption in Air Pipelines and Water-Filled Pipelines Based on Metamaterials
by Botao Liu, Zigeng Liu, Ang Li, Bo Zheng and Shengchun Liu
Appl. Sci. 2025, 15(7), 3820; https://doi.org/10.3390/app15073820 - 31 Mar 2025
Cited by 1 | Viewed by 634
Abstract
Due to the significant impedance mismatch between water and air, two types of pipeline silencers designed for the same target frequency with different filling materials often have significantly different thickness. This increases the difficulty in designing silencers for multiple pipelines with different filling [...] Read more.
Due to the significant impedance mismatch between water and air, two types of pipeline silencers designed for the same target frequency with different filling materials often have significantly different thickness. This increases the difficulty in designing silencers for multiple pipelines with different filling materials in narrow spaces. The paper reports a metamaterial design paradigm based on a Helmholtz cavity for low-frequency sound absorption for both air pipelines and water-filled pipelines. An asymmetric absorption metamaterial with coupled Helmholtz resonators is proposed to reduce the impact of low-frequency noise in air pipelines. By coupling the absorption mode and reflection mode, the asymmetric absorption metamaterial with a thickness of 71 mm achieves 95.6% absorption at 403 Hz. The tunable absorbing performance in broadband is confirmed by a finite element simulation. Additionally, a composite metamaterial constructed with HRs associated with a rubber layer is proposed for low-frequency broadband sound absorption in water-filled pipelines. An average absorptance of above 0.8 is achieved over the range of 380–508 Hz by coupling four basic composite metamaterial units with a thickness of 31.5 mm. The proposed design paradigm can reduce the complexity of designing multiple pipelines silencers with different filling materials because the muffler should have similar thickness in the same paradigm. Full article
(This article belongs to the Section Acoustics and Vibrations)
Show Figures

Figure 1

15 pages, 981 KiB  
Perspective
Mourning for Silence: Bereavement and Tinnitus—A Perspective
by Dirk De Ridder, Berthold Langguth and Winfried Schlee
J. Clin. Med. 2025, 14(7), 2218; https://doi.org/10.3390/jcm14072218 - 25 Mar 2025
Viewed by 781
Abstract
Tinnitus is defined as the conscious awareness of a tonal or composite noise for which there is no identifiable corresponding external acoustic source, which becomes tinnitus disorder when the phantom sound is associated with suffering and/or disability. There is only limited knowledge about [...] Read more.
Tinnitus is defined as the conscious awareness of a tonal or composite noise for which there is no identifiable corresponding external acoustic source, which becomes tinnitus disorder when the phantom sound is associated with suffering and/or disability. There is only limited knowledge about the time course of tinnitus disorder. Bereavement science has identified four different trajectories: resilience, recovery, chronic, and delayed. The question arises whether these four trajectories exist in tinnitus as well if one considers tinnitus as the loss of silence (at will). To verify whether these four trajectories exist, short-term tinnitus progression was analyzed retrospectively using an Ecological Momentary Assessment (EMA) approach, extracting the data from patients who started using the TrackYourTinnitus (TYT) app (version 1, Ulm University, 2013) from the start of their tinnitus perception. Four patients were identified retrospectively via the TYT app with acute tinnitus, and the bereavement trajectories were reconstructed based on EMA. In conclusion, this perspective suggests that the four known bereavement trajectories may exist in tinnitus, and prospective evaluations of larger samples are warranted to confirm or disprove this analogy between bereavement and tinnitus, in which tinnitus is conceived as the loss of (controllable) silence. Full article
(This article belongs to the Section Otolaryngology)
Show Figures

Figure 1

26 pages, 5471 KiB  
Article
CFD Simulation Analysis of a Diesel Generator Exhaust Muffler and Performance-Based Optimization
by Kursat Tanriver
Processes 2025, 13(3), 887; https://doi.org/10.3390/pr13030887 - 18 Mar 2025
Viewed by 1385
Abstract
This study addresses a gap in the literature by simultaneously optimizing noise reduction and structural integrity in silencer design. A novel silencer model offering advantages over conventional designs was developed. In the first phase, the initial optimization model was developed to minimize noise [...] Read more.
This study addresses a gap in the literature by simultaneously optimizing noise reduction and structural integrity in silencer design. A novel silencer model offering advantages over conventional designs was developed. In the first phase, the initial optimization model was developed to minimize noise levels while considering environmental constraints. CFD analyses using NEC Acostix and Ex-Tuner calculated noise, temperature, and pressure, followed by validation in SimScale. The results demonstrated improved efficiency in reducing pressure loss and noise. A silencer with Ø 800 mm diameter, 3000 mm length, and Ø 355.6 mm exhaust connection was manufactured and tested, achieving 96 dB(A) at 1 m, with front and side measurements of 85.1 dB(A) and 74.4 dB(A), respectively. In the second phase, fastener durability in silencer support plates was optimized using tensile tests and FEA in Ansys R19.2. Parametric analyses for M4-M20 bolts were conducted, and regression analysis in Minitab (Minitab Statistical Software Version 21.1) showed 97.74% accuracy. An objective function was developed using curve fitting. The second optimization problem, incorporating design constraints, was solved using the interior-point and Lagrange multipliers methods. This study provides a foundation for silencer design, ensuring both structural reliability and noise control. Future research will explore performance eco-friendly solutions across varying generator power levels. Full article
(This article belongs to the Section Energy Systems)
Show Figures

Figure 1

24 pages, 2016 KiB  
Article
Acoustic Transmission Loss of a Cylindrical Silencer Filled with Multilayer Poroelastic Materials Based on Mode-Matching Method
by Haesang Yang and Woojae Seong
J. Mar. Sci. Eng. 2024, 12(11), 2109; https://doi.org/10.3390/jmse12112109 - 20 Nov 2024
Viewed by 863
Abstract
The efficacy of silencers in reducing piping noise is contingent upon the specific installation and operating environment. Among the various forms of silencers, the acoustic characteristics of dissipative silencers with sound-absorbing materials attached internally exist in an area that is difficult to explain [...] Read more.
The efficacy of silencers in reducing piping noise is contingent upon the specific installation and operating environment. Among the various forms of silencers, the acoustic characteristics of dissipative silencers with sound-absorbing materials attached internally exist in an area that is difficult to explain by existing theories. This is dependent upon the specific type and placement of the attached sound-absorbing materials. This paper presents a methodology for calculating the acoustic transmission loss (TL) of a cylindrical silencer filled with a multilayer poroelastic material, employing the mode-matching method. To describe the numerical process of treating waves propagating within a poroelastic material and determine the modes in accordance with the boundary conditions necessary for analyzing the acoustic performance of the silencer, the Biot model and the Johnson–Champoux–Allard–Lafarge model were employed. The obtained modes were utilized to calculate the acoustic TL of silencers filled with single, double, and triple layers of poroelastic materials. In particular, the results obtained for the single layer were validated by comparing them with the results of a finite element analysis, and the results obtained for multiple layers with the same material were validated by comparing them with the equivalent single-layer results. Moreover, the results of the numerical calculations of the acoustic TLs of the silencer for three distinct types of poroelastic materials, including those with varying degrees of frame rigidity or softness, were compared, and the acoustic characteristics were analyzed in relation to the intrinsic properties of the materials and their arrangement. It is anticipated that the methodology presented in this paper will facilitate the design of silencers using poroelastic materials in accordance with the specific requirements of users or designers by allowing for a comprehensive consideration of the thickness of layers and the arrangement of materials. Full article
(This article belongs to the Topic Advances in Underwater Acoustics and Aeroacoustics)
Show Figures

Figure 1

18 pages, 7231 KiB  
Article
Transmission Loss Characteristics of Dual Cavity Impedance Composite Mufflers for Non-Planar Wave Cavity Resonance
by Yizhe Huang, Bojin Yan, Huizhen Zhang, Chenlin Wang, Jun Wang, Zhifu Zhang, Qibai Huang and Xin Zhan
Appl. Sci. 2024, 14(16), 6879; https://doi.org/10.3390/app14166879 - 6 Aug 2024
Cited by 2 | Viewed by 1291
Abstract
In conventional gasoline automobiles, the engine powers the air conditioning system and engine noise can somewhat mask the noise and vibration of the air conditioning system. In pure electric vehicles, however, the absence of an engine makes the air conditioning system’s noise more [...] Read more.
In conventional gasoline automobiles, the engine powers the air conditioning system and engine noise can somewhat mask the noise and vibration of the air conditioning system. In pure electric vehicles, however, the absence of an engine makes the air conditioning system’s noise more noticeable, concentrated in a limited frequency range at constant speeds. As a result, aerodynamic noise from the air conditioning system is a primary noise source in electric vehicles. Pipeline silencers are the main method for reducing this noise. The current silencer design uses plane wave acoustic theory but when cavity modal resonance occurs, the transmission loss error is relatively high. This article addresses the issue of non-planar wave cavity resonance, studying the cavity modal of a muffler using the finite element method to reveal the transmission loss under cavity mode resonance. A dual cavity expansion structure of an impedance composite muffler is proposed, with sound-absorbing materials placed in the cavity to enhance acoustic performance. The analysis of the transmission loss characteristics of the impedance composite muffler provides a theoretical basis for noise control in pure electric vehicle air conditioning systems. Full article
Show Figures

Figure 1

18 pages, 7475 KiB  
Data Descriptor
BELMASK—An Audiovisual Dataset of Adversely Produced Speech for Auditory Cognition Research
by Cleopatra Christina Moshona, Frederic Rudawski, André Fiebig and Ennes Sarradj
Data 2024, 9(8), 92; https://doi.org/10.3390/data9080092 - 24 Jul 2024
Viewed by 1979
Abstract
In this article, we introduce the Berlin Dataset of Lombard and Masked Speech (BELMASK), a phonetically controlled audiovisual dataset of speech produced in adverse speaking conditions, and describe the development of the related speech task. The dataset contains in total 128 min of [...] Read more.
In this article, we introduce the Berlin Dataset of Lombard and Masked Speech (BELMASK), a phonetically controlled audiovisual dataset of speech produced in adverse speaking conditions, and describe the development of the related speech task. The dataset contains in total 128 min of audio and video recordings of 10 German native speakers (4 female, 6 male) with a mean age of 30.2 years (SD: 6.3 years), uttering matrix sentences in cued, uninstructed speech in four conditions: (i) with a Filtering Facepiece P2 (FFP2) mask in silence, (ii) without an FFP2 mask in silence, (iii) with an FFP2 mask while exposed to noise, (iv) without an FFP2 mask while exposed to noise. Noise consisted of mixed-gender six-talker babble played over headphones to the speakers, triggering the Lombard effect. All conditions are readily available in face-and-voice and voice-only formats. The speech material is annotated, employing a multi-layer architecture, and was originally conceptualized to be used for the administration of a working memory task. The dataset is stored in a restricted-access Zenodo repository and is available for academic research in the area of speech communication, acoustics, psychology and related disciplines upon request, after signing an End User License Agreement (EULA). Full article
Show Figures

Figure 1

19 pages, 3360 KiB  
Article
ATC-SD Net: Radiotelephone Communications Speaker Diarization Network
by Weijun Pan, Yidi Wang, Yumei Zhang and Boyuan Han
Aerospace 2024, 11(7), 599; https://doi.org/10.3390/aerospace11070599 - 22 Jul 2024
Cited by 1 | Viewed by 2118
Abstract
This study addresses the challenges that high-noise environments and complex multi-speaker scenarios present in civil aviation radio communications. A novel radiotelephone communications speaker diffraction network is developed specifically for these circumstances. To improve the precision of the speaker diarization network, three core modules [...] Read more.
This study addresses the challenges that high-noise environments and complex multi-speaker scenarios present in civil aviation radio communications. A novel radiotelephone communications speaker diffraction network is developed specifically for these circumstances. To improve the precision of the speaker diarization network, three core modules are designed: voice activity detection (VAD), end-to-end speaker separation for air–ground communication (EESS), and probabilistic knowledge-based text clustering (PKTC). First, the VAD module uses attention mechanisms to separate silence from irrelevant noise, resulting in pure dialogue commands. Subsequently, the EESS module distinguishes between controllers and pilots by levying voice print differences, resulting in effective speaker segmentation. Finally, the PKTC module addresses the issue of pilot voice print ambiguity using text clustering, introducing a novel flight prior knowledge-based text-related clustering model. To achieve robust speaker diarization in multi-pilot scenarios, this model uses prior knowledge-based graph construction, radar data-based graph correction, and probabilistic optimization. This study also includes the development of the specialized ATCSPEECH dataset, which demonstrates significant performance improvements over both the AMI and ATCO2 PROJECT datasets. Full article
Show Figures

Figure 1

15 pages, 3524 KiB  
Article
ChickenSense: A Low-Cost Deep Learning-Based Solution for Poultry Feed Consumption Monitoring Using Sound Technology
by Ahmad Amirivojdan, Amin Nasiri, Shengyu Zhou, Yang Zhao and Hao Gan
AgriEngineering 2024, 6(3), 2115-2129; https://doi.org/10.3390/agriengineering6030124 - 9 Jul 2024
Cited by 6 | Viewed by 2924
Abstract
This research proposes a low-cost system consisting of a hardware setup and a deep learning-based model to estimate broiler chickens’ feed intake, utilizing audio signals captured by piezoelectric sensors. The signals were recorded 24/7 for 19 consecutive days. A subset of the raw [...] Read more.
This research proposes a low-cost system consisting of a hardware setup and a deep learning-based model to estimate broiler chickens’ feed intake, utilizing audio signals captured by piezoelectric sensors. The signals were recorded 24/7 for 19 consecutive days. A subset of the raw data was chosen, and events were labeled in two classes, feed-pecking and non-pecking (including singing, anomaly, and silence samples). Next, the labeled data were preprocessed through a noise removal algorithm and a band-pass filter. Then, the spectrogram and the signal envelope were extracted from each signal and fed as inputs to a VGG-16-based convolutional neural network (CNN) with two branches for 1D and 2D feature extraction followed by a binary classification head to classify feed-pecking and non-pecking events. The model achieved 92% accuracy in feed-pecking vs. non-pecking events classification with an f1-score of 91%. Finally, the entire raw dataset was processed utilizing the developed model, and the resulting feed intake estimation was compared with the ground truth data from scale measures. The estimated feed consumption showed an 8 ± 7% mean percent error on daily feed intake estimation with a 71% R2 score and 85% Pearson product moment correlation coefficient (PPMCC) on hourly intake estimation. The results demonstrate that the proposed system estimates broiler feed intake at each feeder and has the potential to be implemented in commercial farms. Full article
(This article belongs to the Special Issue The Future of Artificial Intelligence in Agriculture)
Show Figures

Figure 1

21 pages, 14655 KiB  
Article
Acoustic Pressure Amplification through In-Duct Sonic Black Holes
by Cédric Maury, Teresa Bravo, Muriel Amielh and Daniel Mazzoni
Appl. Sci. 2024, 14(11), 4699; https://doi.org/10.3390/app14114699 - 29 May 2024
Cited by 5 | Viewed by 1756
Abstract
Acoustic detection of machinery defaults from in-duct measurements is of practical importance in many areas, such as the health assessment of turbines in ventilation systems or engine testing in the surface and air transport sectors. This approach is, however, impeded by the low [...] Read more.
Acoustic detection of machinery defaults from in-duct measurements is of practical importance in many areas, such as the health assessment of turbines in ventilation systems or engine testing in the surface and air transport sectors. This approach is, however, impeded by the low signal-to-noise ratio (SNR) observed in such environments. In this study, it is proposed to exploit the slow sound effect of Sonic Black Hole (SBH) ducted silencers to enhance the sensing of incident pulse acoustic signals with low SNR. It is found from transfer matrix and finite element modelling that fully opened SBH silencers with perforated skin interfaces are able to substantially enhance an incident pulse amplitude while channeling an air flow. We demonstrate that the graded depths of the SBH cavities provide rainbow spectral decomposition and amplification of the incident pulse frequency components, provided that impedance matching, slow sound, and critically coupled conditions are met. In-duct experiments showed the ability of a 3D printed SBH silencer to simultaneously enhance acoustic sensing and fully trap the pulse spectral components in the SBH cavities in the presence of a low-speed flow. This study opens up new avenues for the development of dual-purpose silencers designed for acoustic monitoring and noise control in duct systems without obstructing the air flow. Full article
(This article belongs to the Section Acoustics and Vibrations)
Show Figures

Figure 1

16 pages, 2509 KiB  
Article
Effect of the Exterior Traffic Noises on the Sound Environment Evaluation in Office Spaces with Different Interior Noise Conditions
by Boya Yu, Yuying Chai and Chao Wang
Appl. Sci. 2024, 14(7), 3017; https://doi.org/10.3390/app14073017 - 3 Apr 2024
Cited by 2 | Viewed by 1879
Abstract
The present study focuses on the impact of exterior traffic noises on sound environment evaluation in office spaces, considering their interaction with interior noises. There were three interior noise conditions: silence, air-conditioner noise, and irrelevant speech noise. Six exterior traffic noises (road, maglev, [...] Read more.
The present study focuses on the impact of exterior traffic noises on sound environment evaluation in office spaces, considering their interaction with interior noises. There were three interior noise conditions: silence, air-conditioner noise, and irrelevant speech noise. Six exterior traffic noises (road, maglev, tram, metro, conventional inter-city train, and high-speed train) were merged with interior noise clips to create the combined noise stimuli. Forty subjects participated in the experiment to assess the acoustic environment in office spaces exposed to multiple noises. The results showed that both interior and exterior noise significantly affected acoustic comfort and noise disturbance. As for the exterior traffic noise, both the traffic noise source and the noise level were found to be influential on both attributes. More temporally fluctuating traffic noises, such as high-speed train noise, were found to have a greater negative effect on subjective evaluations. Meanwhile, the interior noise source was also found to influence evaluations of the sound environment. Compared to the single traffic noise condition, irrelevant speech noise significantly increased the negative impact of traffic noises, while the air-conditioner noise had a neutral effect. In addition, participants in offices with speech noise were less sensitive to the traffic noise level. Full article
(This article belongs to the Special Issue Recent Advances in Soundscape and Environmental Noise)
Show Figures

Figure 1

17 pages, 2465 KiB  
Article
Behavioral and Cortical Activation Changes in Children Following Auditory Training for Dichotic Deficits
by Deborah Moncrieff and Vanessa Schmithorst
Brain Sci. 2024, 14(2), 183; https://doi.org/10.3390/brainsci14020183 - 17 Feb 2024
Cited by 1 | Viewed by 2255
Abstract
We report changes following auditory rehabilitation for interaural asymmetry (ARIA) training in behavioral test performance and cortical activation in children identified with dichotic listening deficits. In a one group pretest–posttest design, measures of dichotic listening, speech perception in noise, and frequency pattern identification [...] Read more.
We report changes following auditory rehabilitation for interaural asymmetry (ARIA) training in behavioral test performance and cortical activation in children identified with dichotic listening deficits. In a one group pretest–posttest design, measures of dichotic listening, speech perception in noise, and frequency pattern identification were assessed before and 3 to 4.5 months after completing an auditory training protocol designed to improve binaural processing of verbal material. Functional MRI scans were also acquired before and after treatment while participants passively listened in silence or to diotic or dichotic digits. Significant improvements occurred after ARIA training for dichotic listening and speech-in-noise tests. Post-ARIA, fMRI activation increased during diotic tasks in anterior cingulate and medial prefrontal regions and during dichotic tasks, decreased in the left precentral gyrus, right-hemisphere pars triangularis, and right dorsolateral and ventral prefrontal cortices, regions known to be engaged in phonologic processing and working memory. The results suggest that children with dichotic deficits may benefit from the ARIA program because of reorganization of cortical capacity required for listening and a reduced need for higher-order, top-down processing skills when listening to dichotic presentations. Full article
(This article belongs to the Special Issue Neurofunctional Basis of Language Processing)
Show Figures

Figure 1

15 pages, 6592 KiB  
Article
Selective Block of Upregulated Kv1.3 Potassium Channels in ON-Bipolar Cells of the Blind Retina Enhances Optogenetically Restored Signaling
by Giulia Schilardi, Jakub Kralik and Sonja Kleinlogel
Int. J. Mol. Sci. 2023, 24(18), 14207; https://doi.org/10.3390/ijms241814207 - 18 Sep 2023
Cited by 4 | Viewed by 1782
Abstract
Loss of photoreceptors in retinal degenerative diseases also impacts the inner retina: bipolar cell dendrites retract, neurons rewire, and protein expression changes. ON-bipolar cells (OBCs) represent an attractive target for optogenetic vision restoration. However, the above-described maladaptations may negatively impact the quality of [...] Read more.
Loss of photoreceptors in retinal degenerative diseases also impacts the inner retina: bipolar cell dendrites retract, neurons rewire, and protein expression changes. ON-bipolar cells (OBCs) represent an attractive target for optogenetic vision restoration. However, the above-described maladaptations may negatively impact the quality of restored vision. To investigate this question, we employed human post-mortem retinas and transgenic rd1_Opto-mGluR6 mice expressing the optogenetic construct Opto-mGluR6 in OBCs and carrying the retinal degeneration rd1 mutation. We found significant changes in delayed rectifier potassium channel expression in OBCs of degenerative retinas. In particular, we found an increase in Kv1.3 expression already in early stages of degeneration. Immunohistochemistry localized Kv1.3 channels specifically to OBC axons. In whole-cell patch-clamp experiments, OBCs in the degenerated murine retina were less responsive, which could be reversed by application of the specific Kv1.3 antagonist Psora-4. Notably, Kv1.3 block significantly increased the amplitude and kinetics of Opto-mGluR6-mediated light responses in OBCs of the blind retina and increased the signal-to-noise ratio of light-triggered responses in retinal ganglion cells. We propose that reduction in Kv1.3 activity in the degenerated retina, either by pharmacological block or by KCNA3 gene silencing, could improve the quality of restored vision. Full article
Show Figures

Figure 1

17 pages, 1125 KiB  
Article
Investigations on the Optimal Estimation of Speech Envelopes for the Two-Stage Speech Enhancement
by Yanjue Song and Nilesh Madhu
Sensors 2023, 23(14), 6438; https://doi.org/10.3390/s23146438 - 16 Jul 2023
Cited by 2 | Viewed by 1734
Abstract
Using the source-filter model of speech production, clean speech signals can be decomposed into an excitation component and an envelope component that is related to the phoneme being uttered. Therefore, restoring the envelope of degraded speech during speech enhancement can improve the intelligibility [...] Read more.
Using the source-filter model of speech production, clean speech signals can be decomposed into an excitation component and an envelope component that is related to the phoneme being uttered. Therefore, restoring the envelope of degraded speech during speech enhancement can improve the intelligibility and quality of output. As the number of phonemes in spoken speech is limited, they can be adequately represented by a correspondingly limited number of envelopes. This can be exploited to improve the estimation of speech envelopes from a degraded signal in a data-driven manner. The improved envelopes are then used in a second stage to refine the final speech estimate. Envelopes are typically derived from the linear prediction coefficients (LPCs) or from the cepstral coefficients (CCs). The improved envelope is obtained either by mapping the degraded envelope onto pre-trained codebooks (classification approach) or by directly estimating it from the degraded envelope (regression approach). In this work, we first investigate the optimal features for envelope representation and codebook generation by a series of oracle tests. We demonstrate that CCs provide better envelope representation compared to using the LPCs. Further, we demonstrate that a unified speech codebook is advantageous compared to the typical codebook that manually splits speech and silence as separate entries. Next, we investigate low-complexity neural network architectures to map degraded envelopes to the optimal codebook entry in practical systems. We confirm that simple recurrent neural networks yield good performance with a low complexity and number of parameters. We also demonstrate that with a careful choice of the feature and architecture, a regression approach can further improve the performance at a lower computational cost. However, as also seen from the oracle tests, the benefit of the two-stage framework is now chiefly limited by the statistical noise floor estimate, leading to only a limited improvement in extremely adverse conditions. This highlights the need for further research on joint estimation of speech and noise for optimum enhancement. Full article
(This article belongs to the Special Issue Machine Learning and Signal Processing Based Acoustic Sensors)
Show Figures

Figure 1

Back to TopTop