Special Issue "Multimodal Signal, Image and Video Analysis and Application"

A special issue of Electronics (ISSN 2079-9292). This special issue belongs to the section "Artificial Intelligence".

Deadline for manuscript submissions: 10 August 2022 | Viewed by 1008

Special Issue Editors

Dr. Qin Zou
E-Mail Website
Guest Editor
School of Computer Science, Wuhan University, Wuhan 430072, China
Interests: computer vision; pattern recognition; machine learning
Prof. Dr. Xinggang Wang
E-Mail Website
Guest Editor
School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan 430074, China
Interests: computer vision and machine learning, especially data/label- and computation-efficient deep learning for visual recognition
Special Issues, Collections and Topics in MDPI journals
Dr. Long Chen
E-Mail Website
Guest Editor
School of Mobile Information Engineering, Sun Yat-Sen University, Guangzhou 510275, China
Interests: autonomous vehicle perception system; imaging and point clouds processing and machine learning

Special Issue Information

Dear Colleagues,

With the increasing requirement of fine-scale representation of the physical world, various data capturing techniques have been developed, which leads to a wide range of modalities for signals, images and videos. It is of particular importance to study the use of multimodal information to understand events in real-world application scenarios, e.g., object detection using RGBD and LiDAR data, surface defect inspection using optical and radar data, mural painting analysis using X-ray and multi-spectrum data, information retrieval using images and texts, etc. This Special Issue aims at promoting cutting-edge research in multimodal information processing and offers a timely collection of works to benefit researchers in both academia and industry. We welcome high-quality original submissions addressing either theoretical or practical issues. Topics of interests include, but are not limited to:

  • Multimodal data acquisition techniques;
  • Multimodal data storage and indexing techniques;
  • Modeling, representation, and learning with multimodal data;
  • Multimodal data fusion;
  • Cross-modal data mining and retrieval;
  • Visualization and interpretation techniques for multimodal data;
  • Machine learning with multimodal data;
  • Open issues for multimodal data analysis and application.

Dr. Qin Zou
Prof. Dr. Xinggang Wang
Dr. Long Chen
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Electronics is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2000 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • multimodal data
  • image processing
  • video processing
  • machine learning
  • information retrieval
  • data fusion

Published Papers (3 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

Article
Data Augmentation Based on Generative Adversarial Network with Mixed Attention Mechanism
Electronics 2022, 11(11), 1718; https://doi.org/10.3390/electronics11111718 - 27 May 2022
Viewed by 117
Abstract
Some downstream tasks often require enough data for training in deep learning, but it is formidable to acquire data in some particular fields. Generative Adversarial Network has been extensively used in data augmentation. However, it still has problems of unstable training and low [...] Read more.
Some downstream tasks often require enough data for training in deep learning, but it is formidable to acquire data in some particular fields. Generative Adversarial Network has been extensively used in data augmentation. However, it still has problems of unstable training and low quality of generated images. This paper proposed Data Augmentation Based on Generative Adversarial Network with Mixed Attention Mechanism (MA-GAN) to solve those problems. This method can generate consistent objects or scenes by correlating the remote features in the image, thus improving the ability to create details. Firstly, the channel-attention and the self-attention mechanism are added into the generator and discriminator. Then, the spectral normalization is introduced into the generator and discriminator so that the parameter matrix satisfies the Lipschitz constraint, thus improving the stability of the model training process. By qualitative and quantitative evaluations on small-scale benchmarks (CelebA, MNIST, and CIFAR-10), the experimental results show that the proposed method performs better than other methods. Compared with WGAN-GP (Improved Training of Wasserstein GANs) and SAGAN (Self-Attention Generative Adversarial Networks), the proposed method contributes to higher classification accuracy, indicating that this method can effectively augment the data of small samples. Full article
(This article belongs to the Special Issue Multimodal Signal, Image and Video Analysis and Application)
Show Figures

Figure 1

Article
Multi-Site and Multi-Scale Unbalanced Ship Detection Based on CenterNet
Electronics 2022, 11(11), 1713; https://doi.org/10.3390/electronics11111713 - 27 May 2022
Viewed by 102
Abstract
Object detection plays an essential role in the computer vision domain, especially the machine learning-based approach, which has developed rapidly in the past decades. However, the development of convolutional neural networks in the marine field is relatively slow, such as in ship classification [...] Read more.
Object detection plays an essential role in the computer vision domain, especially the machine learning-based approach, which has developed rapidly in the past decades. However, the development of convolutional neural networks in the marine field is relatively slow, such as in ship classification and tracking. In this paper, ship detection is considered as a central point classification and regression task but discards the non-maximum suppression operation. We first improved the deep layer aggregation network to enhance the feature extraction capability of tiny targets, then reduced the number of parameters through the lightweight convolution module, and finally employed a unique activation function to enhance the nonlinearity of the model. By doing this, the improved network not only suits unbalanced sample ratios in classifying, but is more robust in scenarios where both the number and resolution of samples are unstable. Experimental results demonstrate that the proposed approach obtains outstanding performance and especially suits tiny object detection compared with current advanced methods. Furthermore, in contrast to the original CenterNet framework, the mAP of the proposed approach increased by 5.6%. Full article
(This article belongs to the Special Issue Multimodal Signal, Image and Video Analysis and Application)
Show Figures

Figure 1

Article
A Novel Path Voting Algorithm for Surface Crack Detection
Electronics 2022, 11(3), 501; https://doi.org/10.3390/electronics11030501 - 08 Feb 2022
Viewed by 359
Abstract
Path voting is a widely used technique for line structure detection in images. Traditional path voting, based on minimal-path, is performed to track paths based on how seeds grow. The former requires to set a starting point and an end point. Thus, the [...] Read more.
Path voting is a widely used technique for line structure detection in images. Traditional path voting, based on minimal-path, is performed to track paths based on how seeds grow. The former requires to set a starting point and an end point. Thus, the performance of minimal-path path voting depends on the initialization. However, high-quality initialization often requires human interaction, which limits its applications in practice. In this paper, a fully automatic path voting method has been proposed and applied for crack detection. The proposed path voting is performed to segment images, which partitions an image patch along the potential crack path and integrates the path to form a crack probability map. After path voting, crack seeds are sampled and modeled into a graph, and the edge weights are assigned using an attraction field algorithm. Finally, cracks are extracted by using spanning tree and tree pruning algorithms. Experimental results demonstrate that the proposed path voting approach can effectively infer the cracks from 2D optic images and 3D depth images. Full article
(This article belongs to the Special Issue Multimodal Signal, Image and Video Analysis and Application)
Show Figures

Figure 1

Back to TopTop