Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (19)

Search Parameters:
Keywords = music document processing

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
35 pages, 24993 KB  
Article
Sensory Heritage Is Vital for Sustainable Cities: A Case Study of Soundscape and Smellscape at Wong Tai Sin
by PerMagnus Lindborg, Lok Him Lam, Yui Chung Kam and Ran Yue
Sustainability 2025, 17(16), 7564; https://doi.org/10.3390/su17167564 - 21 Aug 2025
Viewed by 2872
Abstract
Sensory heritage encompasses culturally valued practices, rituals, and everyday activities experienced through the senses. While sight often dominates, hearing and smelling are generally more immersive and pervasive. Soundscape research is a well-established field within urban studies; however, smellscape remains insufficiently recognised. This study [...] Read more.
Sensory heritage encompasses culturally valued practices, rituals, and everyday activities experienced through the senses. While sight often dominates, hearing and smelling are generally more immersive and pervasive. Soundscape research is a well-established field within urban studies; however, smellscape remains insufficiently recognised. This study is part of Multimodal Hong Kong, a project aimed at documenting sensory cultural heritage across the city by capturing the complex interplay between soundscape, smellscape, urban experiences, everyday activities, and memory. We investigated the multisensory environment at Wong Tai Sin Temple through acoustic measurements and perceptual ratings of soundscape and smellscape across 197 locations within and around the site. Additionally, semi-structured interviews were conducted with visitors (N = 54, 15,015 words of transcript), which were analysed using content analysis and natural language processing. The results indicate that elevated noise levels mainly arise from human voices and pipe music within the temple compound, as well as traffic noise in the surrounding area. The smell of incense dominates near the temple altars, whereas natural, grassy odours prevail in the adjacent park. Interview responses confirm that incense burning constitutes a traditional religious practice forming a distinctive olfactory marker for Chinese temples, but it is also perceived as having adverse health implications. This study contributes to the growing body of sensory heritage research, underscoring the importance of both soundscape and smellscape in fostering culturally inclusive, vibrant, and sustainable urban environments. Full article
(This article belongs to the Special Issue Urban Noise Control, Public Health and Sustainable Cities)
Show Figures

Figure 1

12 pages, 1171 KB  
Article
Is Pupil Response to Speech and Music in Toddlers with Cochlear Implants Asymmetric?
by Amanda Saksida, Marta Fantoni, Sara Ghiselli and Eva Orzan
Audiol. Res. 2025, 15(4), 108; https://doi.org/10.3390/audiolres15040108 - 14 Aug 2025
Cited by 1 | Viewed by 1106 | Correction
Abstract
Background: Ear advantage (EA) reflects hemispheric asymmetries in auditory processing. While a right-ear advantage (REA) for speech and a left-ear advantage (LEA) for music are well documented in typically developing individuals, it is unclear how these patterns manifest in young children with cochlear [...] Read more.
Background: Ear advantage (EA) reflects hemispheric asymmetries in auditory processing. While a right-ear advantage (REA) for speech and a left-ear advantage (LEA) for music are well documented in typically developing individuals, it is unclear how these patterns manifest in young children with cochlear implants (CIs). This study investigated whether pupillometry could reveal asymmetric listening efforts in toddlers with bilateral CIs when listening to speech and music under monaural stimulation. Methods: Thirteen toddlers (mean age = 36.2 months) with early bilateral CIs participated. Pupillary responses were recorded during passive listening to speech and music stimuli, presented in quiet or with background noise. Each child was tested twice, once with only the left CI active and once with only the right CI active. Linear mixed-effects models assessed the influence of session (left/right CI), signal type (speech/music), and background noise. Results: A significant interaction between session and signal type was observed (p = 0.047). Speech elicited larger pupil sizes when processed through the left CI, while music showed no significant lateralized effects. Age and speech therapy frequency moderated pupil responses in speech and music trials, respectively. Conclusions: Pupillometry reveals subtle asymmetric listening effort in young CI users depending on the listening ear, suggesting early emerging functional lateralization despite sensory deprivation and device-mediated hearing. Full article
(This article belongs to the Section Hearing)
Show Figures

Figure 1

29 pages, 21077 KB  
Article
Precise Recognition of Gong-Che Score Characters Based on Deep Learning: Joint Optimization of YOLOv8m and SimAM/MSCAM
by Zhizhou He, Yuqian Zhang, Liumei Zhang and Yuanjiao Hu
Electronics 2025, 14(14), 2802; https://doi.org/10.3390/electronics14142802 - 11 Jul 2025
Cited by 1 | Viewed by 970
Abstract
In the field of music notation recognition, while the recognition technology for common notation systems such as staff notation has become quite mature, the recognition techniques for traditional Chinese notation systems like guqin tablature (jianzipu) and Kunqu opera gongchepu remain relatively underdeveloped. As [...] Read more.
In the field of music notation recognition, while the recognition technology for common notation systems such as staff notation has become quite mature, the recognition techniques for traditional Chinese notation systems like guqin tablature (jianzipu) and Kunqu opera gongchepu remain relatively underdeveloped. As an important carrier of China’s thousand-year musical culture, the digital preservation and inheritance of Kunqu opera’s Gongche notation hold significant cultural value and practical significance. By addressing the unique characteristics of Gongche notation, this study overcomes the limitations of Western staff notation recognition technologies. By constructing a deep learning model adapted to the morphology of Chinese character-style notation symbols, it provides technical support for establishing an intelligent processing system for Chinese musical documents, thereby promoting the innovative development and inheritance of traditional music in the era of artificial intelligence. This paper has constructed the LGRC2024 (Gong-che notation based on Lilu Qu Pu) dataset. It has also employed data augmentation operations such as image translation, rotation, and noise processing to enhance the diversity of the dataset. For the recognition of Gong-che notation, the YOLOv8 model was adopted, and the network performances of its lightweight (n) and medium-weight (m) versions were compared and analyzed. The superior-performing YOLOv8m was selected as the basic model. To further improve the model’s performance, SimAM, Triplet Attention, and Multi-scale Convolutional Attention Module (MSCAM) were introduced to optimize the model. The experimental results show that the accuracy of the basic YOLOv8m model increased from 65.9% to 78.2%. The improved models based on YOLOv8m achieved recognition accuracies of 80.4%, 81.8%, and 83.6%, respectively. Among them, the improved model with the MSCAM module demonstrated the best performance in all aspects. Full article
(This article belongs to the Special Issue New Trends in AI-Assisted Computer Vision)
Show Figures

Figure 1

30 pages, 5602 KB  
Review
A Comprehensive Review on Document Image Binarization
by Bilal Bataineh, Mohamed Tounsi, Nuha Zamzami, Jehan Janbi, Waleed Abdel Karim Abu-ain, Tarik AbuAin and Shaima Elnazer
J. Imaging 2025, 11(5), 133; https://doi.org/10.3390/jimaging11050133 - 26 Apr 2025
Cited by 7 | Viewed by 6382
Abstract
In today’s digital age, the conversion of hardcopy documents into digital formats is widespread. This process involves electronically scanning and storing large volumes of documents. These documents come from various sources, including records and reports, camera-captured text and screen snapshots, official documents, newspapers, [...] Read more.
In today’s digital age, the conversion of hardcopy documents into digital formats is widespread. This process involves electronically scanning and storing large volumes of documents. These documents come from various sources, including records and reports, camera-captured text and screen snapshots, official documents, newspapers, medical reports, music scores, and more. In the domain of document analysis techniques, an essential step is document image binarization. Its goal is to eliminate unnecessary data from images and preserve only the text. Despite the existence of multiple techniques for binarization, the presence of degradation in document images can hinder their efficacy. The objective of this work is to provide an extensive review and analysis of the document binarization field, emphasizing its importance and addressing the challenges encountered during the image binarization process. Additionally, it provides insights into techniques and methods employed for image binarization. The current paper also introduces benchmark datasets for evaluating binarization accuracy, model training, evaluation metrics, and the effectiveness of recent methods. Full article
(This article belongs to the Section Document Analysis and Processing)
Show Figures

Figure 1

24 pages, 896 KB  
Article
The Ubimus Plugging Framework: Deploying FPGA-Based Prototypes for Ubiquitous Music Hardware Design
by Damián Keller, Aman Jagwani and Victor Lazzarini
Computers 2025, 14(4), 155; https://doi.org/10.3390/computers14040155 - 21 Apr 2025
Viewed by 2082
Abstract
The emergent field of embedded computing presents a challenging scenario for ubiquitous music (ubimus) design. Available tools demand specific technical knowledge—as exemplified in the techniques involved in programming integrated circuits of configurable logic units, known as field-programmable gate arrays (FPGAs). Low-level hardware description [...] Read more.
The emergent field of embedded computing presents a challenging scenario for ubiquitous music (ubimus) design. Available tools demand specific technical knowledge—as exemplified in the techniques involved in programming integrated circuits of configurable logic units, known as field-programmable gate arrays (FPGAs). Low-level hardware description languages used for handling FPGAs involve a steep learning curve. Hence, FPGA programming offers a unique challenge to probe the boundaries of ubimus frameworks as enablers of fast and versatile prototyping. State-of-the-art hardware-oriented approaches point to the use of high-level synthesis as a promising programming technique. Furthermore, current FPGA system-on-chip (SoC) hardware with an associated onboard general-purpose processor may foster the development of flexible platforms for musical signal processing. Taking into account the emergence of an FPGA-based ecology of tools, we introduce the ubimus plugging framework. The procedures employed in the construction of a modular- synthesis library based on field-programmable gate array hardware, ModFPGA, are documented, and examples of musical projects applying key design principles are discussed. Full article
Show Figures

Figure 1

24 pages, 13064 KB  
Article
An Ontological Model for the Representation of Vallenato as Cultural Heritage in a Context-Aware System
by María Antonia Diaz-Mendoza, Emiro De-La-Hoz-Franco, Jorge Eliecer Gómez Gómez and Raúl Ramírez-Velarde
Heritage 2023, 6(8), 5648-5671; https://doi.org/10.3390/heritage6080297 - 31 Jul 2023
Cited by 2 | Viewed by 3409
Abstract
The traditional Colombian vallenato was declared Intangible Cultural Heritage of Humanity by UNESCO on 1 December 2015 with urgency for it to be safeguarded, which led the government of Colombia in the head of the Ministry of Culture and the vallenato music cluster [...] Read more.
The traditional Colombian vallenato was declared Intangible Cultural Heritage of Humanity by UNESCO on 1 December 2015 with urgency for it to be safeguarded, which led the government of Colombia in the head of the Ministry of Culture and the vallenato music cluster to develop a safeguarding plan that contains different activities, among which stands out a platform for the management of vallenato through educational processes. In this sense, this document proposes an ontological model for the representation of vallenato as cultural heritage in a context-aware system called Vallenatic. The ontology was developed using the NeOn methodology, designed in the Protégé software, and has 15 concepts (Vallenata Songs, Artist, Devices, Persons, Environment, Cultural Managers, Interface, Location, MOOC, Learning Object, Profile, Preference, Net, Time Cultural, Event, Cultural_Sites). The model was evaluated by means of nine (9) situations described in natural language and SWRL; this language was used since it allows expressing OWL concepts in combination with RuleML. The proposed model can be used for other musical genres that have the recognition of cultural and intangible heritage, such as the Spanish flamenco, Argentine Tango, Mexican Mariachi music, Peruvian scissors dance, Brazilian capoeira, Dominican bachata, Jamaican reggae, among others. Full article
Show Figures

Figure 1

15 pages, 369 KB  
Review
Conceptual Cartography for the Systematic Study of Music Education Based on ICT or EdTech
by Antonio León-Garrido, Julio Manuel Barroso-Osuna and Carmen Llorente-Cejudo
Societies 2022, 12(5), 136; https://doi.org/10.3390/soc12050136 - 30 Sep 2022
Cited by 5 | Viewed by 3533
Abstract
Music and its study have always been present among people. Its learning is significant, as it provides benefits and helps in the acquisition of many abilities and skills. However, didactic, methodological, and pedagogical changes have begun to appear that nurture and provide new [...] Read more.
Music and its study have always been present among people. Its learning is significant, as it provides benefits and helps in the acquisition of many abilities and skills. However, didactic, methodological, and pedagogical changes have begun to appear that nurture and provide new challenges to their learning. Fully adapting to the 21st century and abiding by the great demand for technologies, we have seen the rise of ICT (Information and Communication Technologies), which have also been conceived as educational technology (EdTech), when applied to education. Due to these reasons, the need to conduct a systematic literature review in four databases has arisen to find out whether the use of technology in music education helps to facilitate the teaching-learning process of students. Evidence from this research has been collected using concept mapping to organize the training process. Finally, it is relevant to comment that evidence has been found and verified that the use of Edtech helps in the learning of Music Education; given that, in various documents, it is observed that they increase motivation, musical-technological thinking, critical thinking, creativity, musical practice, and musical improvisation and that they give rise to fun, playful, enjoyable, and stimulating learning. Full article
(This article belongs to the Special Issue Digital Transformation: Social and Educational Perspective)
19 pages, 4449 KB  
Article
Optical Medieval Music Recognition Using Background Knowledge
by Alexander Hartelt and Frank Puppe
Algorithms 2022, 15(7), 221; https://doi.org/10.3390/a15070221 - 22 Jun 2022
Cited by 5 | Viewed by 3069
Abstract
This paper deals with the effect of exploiting background knowledge for improving an OMR (Optical Music Recognition) deep learning pipeline for transcribing medieval, monophonic, handwritten music from the 12th–14th century, whose usage has been neglected in the literature. Various types of background knowledge [...] Read more.
This paper deals with the effect of exploiting background knowledge for improving an OMR (Optical Music Recognition) deep learning pipeline for transcribing medieval, monophonic, handwritten music from the 12th–14th century, whose usage has been neglected in the literature. Various types of background knowledge about overlapping notes and text, clefs, graphical connections (neumes) and their implications on the position in staff of the notes were used and evaluated. Moreover, the effect of different encoder/decoder architectures and of different datasets for training a mixed model and for document-specific fine-tuning based on an extended OMR pipeline with an additional post-processing step were evaluated. The use of background models improves all metrics and in particular the melody accuracy rate (mAR), which is based on the insert, delete and replace operations necessary to convert the generated melody into the correct melody. When using a mixed model and evaluating on a different dataset, our best model achieves without fine-tuning and without post-processing a mAR of 90.4%, which is raised by nearly 30% to 93.2% mAR using background knowledge. With additional fine-tuning, the contribution of post-processing is even greater: the basic mAR of 90.5% is raised by more than 50% to 95.8% mAR. Full article
(This article belongs to the Special Issue Machine Understanding of Music and Sound)
Show Figures

Figure 1

24 pages, 1301 KB  
Article
EEG Hyperscanning and Qualitative Analysis of Moments of Interest in Music Therapy for Stroke Rehabilitation—A Feasibility Study
by Gerhard Tucek, Clemens Maidhof, Julia Vogl, Astrid Heine, Matthias Zeppelzauer, Nikolaus Steinhoff and Jörg Fachner
Brain Sci. 2022, 12(5), 565; https://doi.org/10.3390/brainsci12050565 - 27 Apr 2022
Cited by 12 | Viewed by 5952
Abstract
Interdisciplinary research into the underlying neural processes of music therapy (MT) and subjective experiences of patients and therapists are largely lacking. The aim of the current study was to assess the feasibility of newly developed procedures (including electroencephalography/electrocardiography hyperscanning, synchronous audio–video monitoring, and [...] Read more.
Interdisciplinary research into the underlying neural processes of music therapy (MT) and subjective experiences of patients and therapists are largely lacking. The aim of the current study was to assess the feasibility of newly developed procedures (including electroencephalography/electrocardiography hyperscanning, synchronous audio–video monitoring, and qualitative interviews) to study the personal experiences and neuronal dynamics of moments of interest during MT with stroke survivors. The feasibility of our mobile setup and procedures as well as their clinical implementation in a rehabilitation centre and an acute hospital ward were tested with four phase C patients. Protocols and interviews were used for the documentation and analysis of the feasibility. Recruiting patients for MT sessions was feasible, although data collection on three consecutive weeks was not always possible due to organisational constraints, especially in the hospital with acute ward routines. Research procedures were successfully implemented, and according to interviews, none of the patients reported any burden, tiredness, or increased stress due to the research procedures, which lasted approx. 3 h (ranging from 135 min to 209 min) for each patient. Implementing the research procedures in a rehabilitation unit with stroke patients was feasible, and only small adaptations were made for further research. Full article
Show Figures

Figure 1

31 pages, 2727 KB  
Article
Towards a Comprehensive Account of Rhythm Processing Issues in Developmental Dyslexia
by Tamara Rathcke and Chia-Yuan Lin
Brain Sci. 2021, 11(10), 1303; https://doi.org/10.3390/brainsci11101303 - 30 Sep 2021
Cited by 10 | Viewed by 4716
Abstract
Developmental dyslexia is typically defined as a difficulty with an individual’s command of written language, arising from deficits in phonological awareness. However, motor entrainment difficulties in non-linguistic synchronization and time-keeping tasks have also been reported. Such findings gave rise to proposals of an [...] Read more.
Developmental dyslexia is typically defined as a difficulty with an individual’s command of written language, arising from deficits in phonological awareness. However, motor entrainment difficulties in non-linguistic synchronization and time-keeping tasks have also been reported. Such findings gave rise to proposals of an underlying rhythm processing deficit in dyslexia, even though to date, evidence for impaired motor entrainment with the rhythm of natural speech is rather scarce, and the role of speech rhythm in phonological awareness is unclear. The present study aimed to fill these gaps. Dyslexic adults and age-matched control participants with variable levels of previous music training completed a series of experimental tasks assessing phoneme processing, rhythm perception, and motor entrainment abilities. In a rhythm entrainment task, participants tapped along to the perceived beat of natural spoken sentences. In a phoneme processing task, participants monitored for sonorant and obstruent phonemes embedded in nonsense strings. Individual sensorimotor skills were assessed using a number of screening tests. The results lacked evidence for a motor impairment or a general motor entrainment difficulty in dyslexia, at least among adult participants of the study. Instead, the results showed that the participants’ performance in the phonemic task was predictive of their performance in the rhythmic task, but not vice versa, suggesting that atypical rhythm processing in dyslexia may be the consequence, but not the cause, of dyslexic difficulties with phoneme-level encoding. No evidence for a deficit in the entrainment to the syllable rate in dyslexic adults was found. Rather, metrically weak syllables were significantly less often at the center of rhythmic attention in dyslexic adults as compared to neurotypical controls, with an increased tendency in musically trained participants. This finding could not be explained by an auditory deficit in the processing of acoustic-prosodic cues to the rhythm structure, but it is likely to be related to the well-documented auditory short-term memory issue in dyslexia. Full article
(This article belongs to the Special Issue Motor Speech Disorders and Prosody)
Show Figures

Figure 1

23 pages, 3880 KB  
Article
Applying Automatic Translation for Optical Music Recognition’s Encoding Step
by Antonio Ríos-Vila, Miquel Esplà-Gomis, David Rizo, Pedro J. Ponce de León and José M. Iñesta
Appl. Sci. 2021, 11(9), 3890; https://doi.org/10.3390/app11093890 - 25 Apr 2021
Cited by 12 | Viewed by 3867
Abstract
Optical music recognition is a research field whose efforts have been mainly focused, due to the difficulties involved in its processes, on document and image recognition. However, there is a final step after the recognition phase that has not been properly addressed or [...] Read more.
Optical music recognition is a research field whose efforts have been mainly focused, due to the difficulties involved in its processes, on document and image recognition. However, there is a final step after the recognition phase that has not been properly addressed or discussed, and which is relevant to obtaining a standard digital score from the recognition process: the step of encoding data into a standard file format. In this paper, we address this task by proposing and evaluating the feasibility of using machine translation techniques, using statistical approaches and neural systems, to automatically convert the results of graphical encoding recognition into a standard semantic format, which can be exported as a digital score. We also discuss the implications, challenges and details to be taken into account when applying machine translation techniques to music languages, which are very different from natural human languages. This needs to be addressed prior to performing experiments and has not been reported in previous works. We also describe and detail experimental results, and conclude that applying machine translation techniques is a suitable solution for this task, as they have proven to obtain robust results. Full article
(This article belongs to the Special Issue Advances in Music Reading Systems)
Show Figures

Figure 1

10 pages, 257 KB  
Article
Miskâsowin—Returning to the Body, Remembering What Keeps Us Alive
by Moe Clark, Kenna Aviles-Betel, Catherine Richardson and Zeina Allouche
Genealogy 2021, 5(2), 34; https://doi.org/10.3390/genealogy5020034 - 1 Apr 2021
Cited by 1 | Viewed by 4437
Abstract
The nêhiyawêwin (Plains Cree language) Cree word, miskâsowin, relates to the sacred teachings of Treaty Elders of Saskatchewan as a concept pertaining to wellness of “finding one’s sense of belonging”—a process integral in the aftermath of colonial disruption. Métis educator and performance artist [...] Read more.
The nêhiyawêwin (Plains Cree language) Cree word, miskâsowin, relates to the sacred teachings of Treaty Elders of Saskatchewan as a concept pertaining to wellness of “finding one’s sense of belonging”—a process integral in the aftermath of colonial disruption. Métis educator and performance artist Moe Clark offers an approach to healing and well-being, which is imparted through movement, flux and through musical and performance-based engagement. Moe works with tools of embodiment in performance and circle work contexts, including song creation, collaborative performance, participatory youth expression and land-based projects as healing art. She shares her process for re-animating these relationships to land, human kin, and other-than-human kin through breath-work, creative practice and relationality as part of a path to wholeness. The authors document Moe’s approach to supporting the identity, growth, healing and transformation of others. Full article
(This article belongs to the Special Issue Landin’ the Spirit: Indigenous Knowledge on Healing and Wellbeing)
21 pages, 4396 KB  
Article
Evaluation of the Dissemination of the South African 24-Hour Movement Guidelines for Birth to 5 Years
by Catherine E. Draper, Takana M. Silubonde, Gudani Mukoma and Esther M. F. van Sluijs
Int. J. Environ. Res. Public Health 2021, 18(6), 3071; https://doi.org/10.3390/ijerph18063071 - 17 Mar 2021
Cited by 7 | Viewed by 4258
Abstract
South Africa (SA) launched their 24-h movement guidelines for birth to five years in December 2018. The guideline dissemination plan adopted a “train-the-trainer” strategy through dissemination workshops with community-based organisations (CBOs) working in early childhood development. The aim of this paper is to: [...] Read more.
South Africa (SA) launched their 24-h movement guidelines for birth to five years in December 2018. The guideline dissemination plan adopted a “train-the-trainer” strategy through dissemination workshops with community-based organisations (CBOs) working in early childhood development. The aim of this paper is to: (1) document this dissemination process; and (2) report on the feasibility of implementing the dissemination workshops, the acceptability of the workshops (and guidelines) for different end-user groups, and the extent to which CBO representatives disseminated the guidelines to end-users. Fifteen workshops were held in seven of SA’s nine provinces with a total of 323 attendees. Quantitative and qualitative findings (n = 281) indicate that these workshops were feasible for community-based dissemination of the guidelines and that this method of dissemination was acceptable to CBOs and end-users. Findings from follow-up focus groups (6 groups, n = 28 participants) indicate that the guidelines were shared with end-users of CBOs who participated in the focus groups. An additional musical storytelling resource, the “Woza, Mntwana” song, was well-received by participants; sharing via WhatsApp was believed to be the most effective way to disseminate this song. These findings confirm the feasibility and acceptability of culturally appropriate and context-specific community-based dissemination of behavioural guidelines in low-income settings. Full article
Show Figures

Figure 1

23 pages, 4532 KB  
Project Report
The Design of Musical Instruments for Grey Parrots: An Artistic Contribution toward Auditory Enrichment in the Context of ACI
by Reinhard Gupfinger and Martin Kaltenbrunner
Multimodal Technol. Interact. 2020, 4(2), 16; https://doi.org/10.3390/mti4020016 - 3 May 2020
Cited by 5 | Viewed by 6296
Abstract
One particular approach in the context of Animal Computer Interaction (ACI) is auditory enrichment for captive wild animals. Here we describe our research and the methodology used to design musical instruments and interfaces aimed at providing auditory enrichment for grey parrots living in [...] Read more.
One particular approach in the context of Animal Computer Interaction (ACI) is auditory enrichment for captive wild animals. Here we describe our research and the methodology used to design musical instruments and interfaces aimed at providing auditory enrichment for grey parrots living in captivity. The paper is divided into three main phases: a project review and classification, sonic experiments at the parrot shelter and the design of musical instruments. The overview of recent projects that involve animals in the interaction and music-generation process highlights the costs and benefits of projects of this kind and provides insights into current technologies in this field and the musical talents of animals. Furthermore, we document a series of sonic experiments conducted at a parrot shelter to develop acoustically enriched environments through the use of musical instruments. These investigations were intended to provide a better understanding of how grey parrots communicate through sound, perceive and respond to auditory stimuli and possibly generate sound and music through the usage of technological devices. Based on the cognitive, physiological, and auditory abilities of grey parrots, and their intrinsic interest in sonic and physical interactions, we finally developed and tested various interactive instrument prototypes and here we present our design results for auditory enrichment in the context of ACI and artistic research. Full article
(This article belongs to the Special Issue Animal Centered Computing: Enriching the Lives of Animals)
Show Figures

Figure 1

25 pages, 6680 KB  
Review
Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions
by Alaa Sulaiman, Khairuddin Omar and Mohammad F. Nasrudin
J. Imaging 2019, 5(4), 48; https://doi.org/10.3390/jimaging5040048 - 12 Apr 2019
Cited by 88 | Viewed by 13389
Abstract
In this era of digitization, most hardcopy documents are being transformed into digital formats. In the process of transformation, large quantities of documents are stored and preserved through electronic scanning. These documents are available from various sources such as ancient documentation, old legal [...] Read more.
In this era of digitization, most hardcopy documents are being transformed into digital formats. In the process of transformation, large quantities of documents are stored and preserved through electronic scanning. These documents are available from various sources such as ancient documentation, old legal records, medical reports, music scores, palm leaf, and reports on security-related issues. In particular, ancient and historical documents are hard to read due to their degradation in terms of low contrast and existence of corrupted artefacts. In recent times, degraded document binarization has been studied widely and several approaches were developed to deal with issues and challenges in document binarization. In this paper, a comprehensive review is conducted on the issues and challenges faced during the image binarization process, followed by insights on various methods used for image binarization. This paper also discusses the advanced methods used for the enhancement of degraded documents that improves the quality of documents during the binarization process. Further discussions are made on the effectiveness and robustness of existing methods, and there is still a scope to develop a hybrid approach that can deal with degraded document binarization more effectively. Full article
Show Figures

Figure 1

Back to TopTop