Skip Content
You are currently on the new version of our website. Access the old version .

8,002 Results Found

  • Review
  • Open Access
1,064 Views
30 Pages

23 May 2025

Visual computing in medicine involves handling the generation, acquisition, processing, analysis, exploration, visualization, and interpretation of medical visual information. Machine learning has become a prominent tool for data analytics and proble...

  • Article
  • Open Access
19 Citations
7,491 Views
15 Pages

Visual Saliency Prediction Based on Deep Learning

  • Bashir Ghariba,
  • Mohamed S. Shehata and
  • Peter McGuire

12 August 2019

Human eye movement is one of the most important functions for understanding our surroundings. When a human eye processes a scene, it quickly focuses on dominant parts of the scene, commonly known as a visual saliency detection or visual attention pre...

  • Article
  • Open Access
2,036 Views
12 Pages

12 July 2024

Visual reinforcement learning is important in various practical applications, such as video games, robotic manipulation, and autonomous navigation. However, a major challenge in visual reinforcement learning is the generalization to unseen environmen...

  • Article
  • Open Access
6 Citations
3,963 Views
14 Pages

Visual encoding models are important computational models for understanding how information is processed along the visual stream. Many improved visual encoding models have been developed from the perspective of the model architecture and the learning...

  • Feature Paper
  • Article
  • Open Access
11 Citations
5,460 Views
15 Pages

Spontaneous Learning of Visual Structures in Domestic Chicks

  • Orsola Rosa-Salva,
  • József Fiser,
  • Elisabetta Versace,
  • Carola Dolci,
  • Sarah Chehaimi,
  • Chiara Santolin and
  • Giorgio Vallortigara

6 August 2018

Effective communication crucially depends on the ability to produce and recognize structured signals, as apparent in language and birdsong. Although it is not clear to what extent similar syntactic-like abilities can be identified in other animals, r...

  • Article
  • Open Access
1 Citations
3,274 Views
28 Pages

A Visual Mining Approach to Improved Multiple- Instance Learning

  • Sonia Castelo,
  • Moacir Ponti and
  • Rosane Minghim

27 November 2021

Multiple-instance learning (MIL) is a paradigm of machine learning that aims to classify a set (bag) of objects (instances), assigning labels only to the bags. This problem is often addressed by selecting an instance to represent each bag, transformi...

  • Article
  • Open Access
15 Citations
6,009 Views
19 Pages

Collaborative Learning Communities for Sustainable Employment through Visual Tools

  • Rodrigo Martín-García,
  • Carmen López-Martín and
  • Raquel Arguedas-Sanz

24 March 2020

Higher education institutions must enable students to acquire skills and capacities that prepare them for working life and enhance their employability. This will lead to an applied learning- and teaching-enhancement-oriented sustainable Higher Educat...

  • Article
  • Open Access
4 Citations
3,238 Views
22 Pages

18 August 2021

In this paper, we focus on the challenges of training efficiency, the designation of reward functions, and generalization in reinforcement learning for visual navigation and propose a regularized extreme learning machine-based inverse reinforcement l...

  • Review
  • Open Access
460 Views
45 Pages

2 February 2026

Localization and mapping remain critical challenges for Unmanned Ground Vehicles (UGVs) operating in unstructured natural environments, such as forests and agricultural fields. While Visual SLAM (VSLAM) and Visual–Inertial SLAM (VI-SLAM) have m...

  • Brief Report
  • Open Access
2 Citations
4,204 Views
11 Pages

A Deep Learning Approach to Measure Visual Function in Zebrafish

  • Manjiri Patil,
  • Annabel Birchall,
  • Hammad Syed,
  • Vanessa Rodwell,
  • Ha-Jun Yoon,
  • William H. J. Norton and
  • Mervyn G. Thomas

9 June 2025

Visual behaviour in zebrafish, often measured by the optokinetic reflex (OKR), serves as a valuable model for studying aspects of human neurological and ocular diseases and for conducting therapeutic or toxicology assays. Traditional methods for OKR...

  • Review
  • Open Access
29 Citations
11,537 Views
34 Pages

Review of Visual Simultaneous Localization and Mapping Based on Deep Learning

  • Yao Zhang,
  • Yiquan Wu,
  • Kang Tong,
  • Huixian Chen and
  • Yubin Yuan

25 May 2023

Due to the limitations of LiDAR, such as its high cost, short service life and massive volume, visual sensors with their lightweight and low cost are attracting more and more attention and becoming a research hotspot. As the hardware computation powe...

  • Article
  • Open Access
2,175 Views
20 Pages

Rebalancing in Supervised Contrastive Learning for Long-Tailed Visual Recognition

  • Jiahui Lv,
  • Jun Lei,
  • Jun Zhang,
  • Chao Chen and
  • Shuohao Li

In real-world visual recognition tasks, long-tailed distribution is a pervasive challenge, where the extreme class imbalance severely limits the representation learning capability of deep models. Although supervised learning has demonstrated certain...

  • Article
  • Open Access
2 Citations
2,358 Views
18 Pages

Visual Analytics Using Machine Learning for Transparency Requirements

  • Samiha Fadloun,
  • Khadidja Bennamane,
  • Souham Meshoul,
  • Mahmood Hosseini and
  • Kheireddine Choutri

13 July 2023

Problem solving applications require users to exercise caution in their data usage practices. Prior to installing these applications, users are encouraged to read and comprehend the terms of service, which address important aspects such as data priva...

  • Article
  • Open Access
4 Citations
3,402 Views
20 Pages

28 December 2021

This paper presents the construction of a new objective method for estimation of visual perceiving quality. The proposal provides an assessment of image quality without the need for a reference image or a specific distortion assumption. Two main proc...

  • Article
  • Open Access
31 Citations
6,566 Views
23 Pages

16 October 2020

Audio-visual emotion recognition aims to distinguish human emotional states by integrating the audio and visual data acquired in the expression of emotions. It is crucial for facilitating the affect-related human-machine interaction system by enablin...

  • Article
  • Open Access
8 Citations
4,782 Views
18 Pages

Unsupervised Deep Learning-Based RGB-D Visual Odometry

  • Qiang Liu,
  • Haidong Zhang,
  • Yiming Xu and
  • Li Wang

6 August 2020

Recently, deep learning frameworks have been deployed in visual odometry systems and achieved comparable results to traditional feature matching based systems. However, most deep learning-based frameworks inevitably need labeled data as ground truth...

  • Article
  • Open Access
21 Citations
6,841 Views
29 Pages

Visual Tracking Based on Extreme Learning Machine and Sparse Representation

  • Baoxian Wang,
  • Linbo Tang,
  • Jinglin Yang,
  • Baojun Zhao and
  • Shuigen Wang

22 October 2015

The existing sparse representation-based visual trackers mostly suffer from both being time consuming and having poor robustness problems. To address these issues, a novel tracking method is presented via combining sparse representation and an emergi...

  • Article
  • Open Access
4 Citations
1,964 Views
14 Pages

Dynamic Learning Rate of Template Update for Visual Target Tracking

  • Da Li,
  • Song Li,
  • Qin Wei,
  • Haoxiang Chai and
  • Tao Han

23 April 2023

The trackers based on discriminative correlation filter (DCF) have achieved remarkable performance in visual target tracking in recent years. Since the targets are usually affected by various factors such as deformation, rotation, motion blur and so...

  • Review
  • Open Access
33 Citations
16,826 Views
29 Pages

Visual Simultaneous Localization and Mapping (VSLAM) has been a hot topic of research since the 1990s, first based on traditional computer vision and recognition techniques and later on deep learning models. Although the implementation of VSLAM metho...

  • Article
  • Open Access
7 Citations
3,281 Views
17 Pages

Visual Active Learning for Labeling: A Case for Soundscape Ecology Data

  • Liz Huancapaza Hilasaca,
  • Milton Cezar Ribeiro and
  • Rosane Minghim

29 June 2021

Labeling of samples is a recurrent and time-consuming task in data analysis and machine learning and yet generally overlooked in terms of visual analytics approaches to improve the process. As the number of tailored applications of learning models in...

  • Article
  • Open Access
6 Citations
3,574 Views
19 Pages

A Deep Learning-Based Visual Map Generation for Mobile Robot Navigation

  • Carlos A. García-Pintos,
  • Noé G. Aldana-Murillo,
  • Emmanuel Ovalle-Magallanes and
  • Edgar Martínez

6 June 2023

Visual map-based robot navigation is a strategy that only uses the robot vision system, involving four fundamental stages: learning or mapping, localization, planning, and navigation. Therefore, it is paramount to model the environment optimally to p...

  • Review
  • Open Access
27 Citations
11,571 Views
30 Pages

12 June 2023

This article provides a detailed review of recent advances in audio-visual speech recognition (AVSR) methods that have been developed over the last decade (2013–2023). Despite the recent success of audio speech recognition systems, the problem...

  • Article
  • Open Access
11 Citations
4,866 Views
23 Pages

UBUMonitor: An Open-Source Desktop Application for Visual E-Learning Analysis with Moodle

  • Raúl Marticorena-Sánchez,
  • Carlos López-Nozal,
  • Yi Peng Ji,
  • Carlos Pardo-Aguilar and
  • Álvar Arnaiz-González

An inherent requirement of teaching using online learning platforms is that the teacher must analyze student activity and performance in relation to course learning objectives. Therefore, all e-learning environments implement a module to collect such...

  • Article
  • Open Access
28 Citations
8,093 Views
15 Pages

A Visual Dashboard to Track Learning Analytics for Educational Cloud Computing

  • Diana M. Naranjo,
  • José R. Prieto,
  • Germán Moltó and
  • Amanda Calatrava

4 July 2019

Cloud providers such as Amazon Web Services (AWS) stand out as useful platforms to teach distributed computing concepts as well as the development of Cloud-native scalable application architectures on real-world infrastructures. Instructors can benef...

  • Communication
  • Open Access
27 Citations
8,201 Views
13 Pages

Leveraging Deep Learning for Visual Odometry Using Optical Flow

  • Tejas Pandey,
  • Dexmont Pena,
  • Jonathan Byrne and
  • David Moloney

12 February 2021

In this paper, we study deep learning approaches for monocular visual odometry (VO). Deep learning solutions have shown to be effective in VO applications, replacing the need for highly engineered steps, such as feature extraction and outlier rejecti...

  • Article
  • Open Access
4 Citations
5,697 Views
16 Pages

Joint Prior Learning for Visual Sensor Network Noisy Image Super-Resolution

  • Bo Yue,
  • Shuang Wang,
  • Xuefeng Liang,
  • Licheng Jiao and
  • Caijin Xu

26 February 2016

The visual sensor network (VSN), a new type of wireless sensor network composed of low-cost wireless camera nodes, is being applied for numerous complex visual analyses in wild environments, such as visual surveillance, object recognition, etc. Howev...

  • Article
  • Open Access
5 Citations
2,002 Views
20 Pages

Visual Prompt Learning of Foundation Models for Post-Disaster Damage Evaluation

  • Fei Zhao,
  • Chengcui Zhang,
  • Runlin Zhang and
  • Tianyang Wang

8 May 2025

In response to the urgent need for rapid and precise post-disaster damage evaluation, this study introduces the Visual Prompt Damage Evaluation (ViPDE) framework, a novel contrastive learning-based approach that leverages the embedded knowledge withi...

  • Feature Paper
  • Review
  • Open Access
3 Citations
3,200 Views
38 Pages

14 October 2025

Deep learning has emerged as a powerful tool in computational neuroscience, enabling the modeling of complex neural processes and supporting data-driven insights into brain function. However, the non-transparent nature of many deep learning models li...

  • Review
  • Open Access
52 Citations
27,118 Views
38 Pages

Deep Learning for Automated Visual Inspection in Manufacturing and Maintenance: A Survey of Open- Access Papers

  • Nils Hütten,
  • Miguel Alves Gomes,
  • Florian Hölken,
  • Karlo Andricevic,
  • Richard Meyes and
  • Tobias Meisen

Quality assessment in industrial applications is often carried out through visual inspection, usually performed or supported by human domain experts. However, the manual visual inspection of processes and products is error-prone and expensive. It is...

  • Article
  • Open Access
13 Citations
5,167 Views
12 Pages

20 January 2023

Maker education that incorporates computational thinking streamlines learning and helps familiarize learners with recent advances in science and technology. Computational thinking (CT) is a vital core capability that anyone can learn. CT can be learn...

  • Article
  • Open Access
9 Citations
4,570 Views
22 Pages

27 January 2021

When driving, people make decisions based on current traffic as well as their desired route. They have a mental map of known routes and are often able to navigate without needing directions. Current published self-driving models improve their perform...

  • Article
  • Open Access
1 Citations
2,125 Views
15 Pages

As a substitute for human arms, underwater vehicle dual-manipulator systems (UVDMSs) have attracted the interest of global researchers. Visual servoing is an important tool for the positioning and tracking control of UVDMSs. In this paper, a reinforc...

  • Article
  • Open Access
2 Citations
2,534 Views
16 Pages

13 July 2024

Zero-shot learning (ZSL) enables models to recognize categories not encountered during training, which is crucial for categories with limited data. Existing methods overlook efficient temporal modeling in multimodal data. This paper proposes a Tempor...

  • Article
  • Open Access
4 Citations
3,128 Views
22 Pages

Multi-Corpus Learning for Audio–Visual Emotions and Sentiment Recognition

  • Elena Ryumina,
  • Maxim Markitantov and
  • Alexey Karpov

15 August 2023

Recognition of emotions and sentiment (affective states) from human audio–visual information is widely used in healthcare, education, entertainment, and other fields; therefore, it has become a highly active research area. The large variety of...

  • Article
  • Open Access
53 Citations
8,285 Views
27 Pages

Airborne Visual Detection and Tracking of Cooperative UAVs Exploiting Deep Learning

  • Roberto Opromolla,
  • Giuseppe Inchingolo and
  • Giancarmine Fasano

7 October 2019

The performance achievable by using Unmanned Aerial Vehicles (UAVs) for a large variety of civil and military applications, as well as the extent of applicable mission scenarios, can significantly benefit from the exploitation of formations of vehicl...

  • Article
  • Open Access
1 Citations
1,433 Views
20 Pages

E-InMeMo: Enhanced Prompting for Visual In-Context Learning

  • Jiahao Zhang,
  • Bowen Wang,
  • Hong Liu,
  • Liangzhi Li,
  • Yuta Nakashima and
  • Hajime Nagahara

Large-scale models trained on extensive datasets have become the standard due to their strong generalizability across diverse tasks. In-context learning (ICL), widely used in natural language processing, leverages these models by providing task-speci...

  • Article
  • Open Access
77 Citations
7,406 Views
26 Pages

Breast cancer is a serious threat to women. Many machine learning-based computer-aided diagnosis (CAD) methods have been proposed for the early diagnosis of breast cancer based on histopathological images. Even though many such classification methods...

  • Article
  • Open Access
2 Citations
2,272 Views
22 Pages

13 June 2024

In the field of visualization, understanding users’ analytical reasoning is important for evaluating the effectiveness of visualization applications. Several studies have been conducted to capture and analyze user interactions to comprehend thi...

  • Article
  • Open Access
29 Citations
7,711 Views
20 Pages

Digital Education and Artistic-Visual Learning in Flexible University Environments: Research Analysis

  • Mariana-Daniela González-Zamar,
  • Emilio Abad-Segura,
  • Antonio Luque de la Rosa and
  • Eloy López-Meneses

22 October 2020

The constant development of digital technologies has allowed living in a digital environment based on connections, also transforming the context of the educational process. Experiences show that digital technologies have influenced the way of learnin...

  • Article
  • Open Access
2 Citations
4,187 Views
18 Pages

2 August 2019

Traditional supervised learning is dependent on the label of the training data, so there is a limitation that the class label which is not included in the training data cannot be recognized properly. Therefore, zero-shot learning, which can recognize...

  • Article
  • Open Access
3 Citations
4,165 Views
19 Pages

18 October 2018

Visual object tracking is a fundamental research area in the field of computer vision and pattern recognition because it can be utilized by various intelligent systems. However, visual object tracking faces various challenging issues because tracking...

  • Article
  • Open Access
12 Citations
4,268 Views
22 Pages

A Deep Learning Ensemble Method to Visual Acuity Measurement Using Fundus Images

  • Jin Hyun Kim,
  • Eunah Jo,
  • Seungjae Ryu,
  • Sohee Nam,
  • Somin Song,
  • Yong Seop Han,
  • Tae Seen Kang,
  • Woongsup Lee,
  • Seongjin Lee and
  • Seunghwan Lee
  • + 2 authors

21 March 2022

Visual acuity (VA) is a measure of the ability to distinguish shapes and details of objects at a given distance and is a measure of the spatial resolution of the visual system. Vision is one of the basic health indicators closely related to a person&...

  • Article
  • Open Access
3 Citations
3,413 Views
24 Pages

Infrared Image Generation Based on Visual State Space and Contrastive Learning

  • Bing Li,
  • Decao Ma,
  • Fang He,
  • Zhili Zhang,
  • Daqiao Zhang and
  • Shaopeng Li

14 October 2024

The preparation of infrared reference images is of great significance for improving the accuracy and precision of infrared imaging guidance. However, collecting infrared data on-site is difficult and time-consuming. Fortunately, the infrared images c...

  • Article
  • Open Access
3,286 Views
22 Pages

Hybrid Deep Learning Framework for Eye-in-Hand Visual Control Systems

  • Adrian-Paul Botezatu,
  • Andrei-Iulian Iancu and
  • Adrian Burlacu

This work proposes a hybrid deep learning-based framework for visual feedback control in an eye-in-hand robotic system. The framework uses an early fusion approach in which real and synthetic images define the training data. The first layer of a ResN...

  • Article
  • Open Access
898 Views
18 Pages

The Role of the Visual Versus Verbal Modality in Learning Novel Verbs

  • Maria Luisa Lorusso,
  • Laura Pigazzini,
  • Laura Zampini,
  • Michele Burigo,
  • Martina Caccia,
  • Anna Milani and
  • Massimo Molteni

Background/Objectives: Verbs are considered to be more abstract than nouns, as they represent actions, states, and events, which are less tangible, more flexible in their meaning and thus less univocally specified. It has been suggested that children...

  • Article
  • Open Access
24 Citations
4,495 Views
23 Pages

7 July 2021

Recently, decreasing energy consumption under the premise of building comfort has become a popular topic, especially visual comfort. Existing research on visual comfort lacks a standard of how to select indicators. Moreover, studies on individual vis...

  • Article
  • Open Access
6 Citations
2,568 Views
13 Pages

5 August 2022

Risky driving behavior seriously affects the driver’s ability to react, execute and judge, which is one of the major causes of traffic accidents. The timely and accurate identification of the driving status of drivers is particularly important,...

  • Article
  • Open Access
14 Citations
4,495 Views
14 Pages

3 June 2023

Visual servoing is a control method that utilizes image feedback to control robot motion, and it has been widely applied in unmanned aerial vehicle (UAV) motion control. However, due to field-of-view (FOV) constraints, visual servoing still faces cha...

  • Article
  • Open Access
2 Citations
3,063 Views
16 Pages

14 September 2023

With the development of multimedia systems in wireless environments, the rising need for artificial intelligence is to design a system that can properly communicate with humans with a comprehensive understanding of various types of information in a h...

  • Article
  • Open Access
5 Citations
4,448 Views
19 Pages

12 April 2022

Visual odometry is the task of estimating the trajectory of the moving agents from consecutive images. It is a hot research topic both in robotic and computer vision communities and facilitates many applications, such as autonomous driving and virtua...

of 161