MDPI - Publisher of Open Access Journals

18 pages, 2469 KiB

Open AccessArticle

A Next-Best-View Method for Complex 3D Environment Exploration Using Robotic Arm with Hand-Eye System

by Michal Dobiš, Jakub Ivan, Martin Dekan, František Duchoň, Andrej Babinec and Róbert Málik

Appl. Sci. 2025, 15(14), 7757; https://doi.org/10.3390/app15147757 - 10 Jul 2025

Viewed by 239

The ability to autonomously generate up-to-date 3D models of robotic workcells is critical for advancing smart manufacturing, yet existing Next-Best-View (NBV) methods often rely on paradigms ill-suited for the fixed-base manipulators found in dynamic industrial environments. To address this gap, this paper proposes [...] Read more.

The ability to autonomously generate up-to-date 3D models of robotic workcells is critical for advancing smart manufacturing, yet existing Next-Best-View (NBV) methods often rely on paradigms ill-suited for the fixed-base manipulators found in dynamic industrial environments. To address this gap, this paper proposes a novel NBV method for the complete exploration of a 6-DOF robotic arm’s workspace. Our approach integrates collision-based information gain metric, a potential field technique to generate candidate views from exploration frontiers, and a tunable fitness function to balance information gain with motion cost. The method was rigorously tested in three simulated scenarios and validated on a physical industrial robot. Results demonstrate that our approach successfully maps the majority of the workspace in all setups, with a balanced weighting strategy proving most effective for combining exploration speed and path efficiency, a finding confirmed in the real-world experiment. We conclude that our method provides a practical and robust solution for autonomous workspace mapping, offering a flexible, training-free approach that advances the state-of-the-art for on-demand 3D model generation in industrial robotics. Full article

(This article belongs to the Special Issue Smart Manufacturing and Industry 4.0, 2nd Edition)

► Show Figures

Figure 1

27 pages, 1144 KiB

Open AccessArticle

DICTION: DynamIC robusT whIte bOx Watermarking Scheme for Deep Neural Networks

by Reda Bellafqira and Gouenou Coatrieux

Appl. Sci. 2025, 15(13), 7511; https://doi.org/10.3390/app15137511 - 4 Jul 2025

Viewed by 308

Abstract

Deep neural network (DNN) watermarking is a suitable method for protecting the ownership of deep learning (DL) models. It secretly embeds an identifier within the model, which can be retrieved by the owner to prove ownership. In this paper, we first provide a [...] Read more.

Deep neural network (DNN) watermarking is a suitable method for protecting the ownership of deep learning (DL) models. It secretly embeds an identifier within the model, which can be retrieved by the owner to prove ownership. In this paper, we first provide a unified framework for white-box DNN watermarking schemes that encompasses current state-of-the-art methods and outlines their theoretical inter-connections. Next, we introduce DICTION, a new white-box dynamic robust watermarking scheme derived from this framework. Its main originality lies in a generative adversarial network (GAN) strategy where the watermark extraction function is a DNN trained as a GAN discriminator, while the target model acts as a GAN generator. DICTION can be viewed as a generalization of DeepSigns, which, to the best of our knowledge, is the only other dynamic white-box watermarking scheme in the literature. Experiments conducted on four benchmark models (MLP, CNN, ResNet-18, and LeNet) demonstrate that DICTION achieves a zero bit error rate (BER) while maintaining model accuracy within 0.5% of the baseline. DICTION shows superior robustness, tolerating up to 95% weight pruning compared to 80% for existing methods, and it demonstrates complete resistance to fine-tuning and overwriting attacks where competing methods fail, with a BER of >0.3. Full article

(This article belongs to the Special Issue Security and Privacy in Artificial Intelligence: Technology, Applications, and Challenges)

► Show Figures

Figure 1

13 pages, 461 KiB

Open AccessArticle

How Immunization Information Systems Inform Age-Based HPV Vaccination Recommendations in the United States: A Mixed-Methods Study

by Nadja A. Vielot, Isabelle K. Bucklin, Kristy Westfall, Deanna Kepka, Gregory Zimet and Sherri Zorn

Vaccines 2025, 13(7), 716; https://doi.org/10.3390/vaccines13070716 - 30 Jun 2025

Viewed by 413

Abstract

Background: Immunization information systems (IISs) in the United States forecast vaccine due dates, which can inform when providers recommend vaccines to patients. IIS forecasting for HPV vaccination at 9 years, the minimum age of licensure, and when vaccination is likely most effective [...] Read more.

Background: Immunization information systems (IISs) in the United States forecast vaccine due dates, which can inform when providers recommend vaccines to patients. IIS forecasting for HPV vaccination at 9 years, the minimum age of licensure, and when vaccination is likely most effective is not documented or well-understood. Methods: We documented characteristics of HPV vaccination forecasts in jurisdictional IISs through Internet searches and requests to immunization program managers. Next, we conducted focus groups with stakeholders from seven jurisdictions to elucidate their processes for determining and implementing HPV vaccination forecasts. Results: Forecast data were available from 49 out of 64 CDC-funded jurisdictions, of which 14 (29%) recommended HPV vaccination at age 9 and 35 (71%) recommended HPV vaccination starting at ages 11 through to 15. Jurisdictions that recommended HPV vaccination at age 9 cited the positions of the American Cancer Society and American Academy of Pediatrics and reported little or no provider opposition to this recommendation. Jurisdictions reported variable flexibility in programming their forecasts. Those that changed their HPV vaccination forecast from 11 to 9 years did so easily while some experienced limitations. Other jurisdictions adhered strictly to the CDC’s routine recommendation at age 11–12 years and would only update the forecast in tandem with updated CDC guidance. The impact of IISs and electronic health record interoperability on how providers view and utilize IIS forecasting is unclear. Conclusions: Jurisdictions can share best practices for forecasting at 9 and future studies can evaluate the effects of forecasting age on the vaccination rates, providing evidence for nationwide vaccination recommendations. Full article

(This article belongs to the Special Issue Improving HPV Vaccination Coverage: Current Issues, Future Prospects and Strategies)

► Show Figures

Figure 1

23 pages, 4734 KiB

Open AccessArticle

Optimal Viewpoint Assistance for Cooperative Manipulation Using D-Optimality

by Kyosuke Kameyama, Kazuki Horie and Kosuke Sekiyama

Sensors 2025, 25(10), 3002; https://doi.org/10.3390/s25103002 - 9 May 2025

Viewed by 612

Abstract

This study proposes a D-optimality-based viewpoint selection method to improve visual assistance for a manipulator by optimizing camera placement. The approach maximizes the information gained from visual observations, reducing uncertainty in object recognition and localization. A mathematical framework utilizing D-optimality criteria is developed [...] Read more.

This study proposes a D-optimality-based viewpoint selection method to improve visual assistance for a manipulator by optimizing camera placement. The approach maximizes the information gained from visual observations, reducing uncertainty in object recognition and localization. A mathematical framework utilizing D-optimality criteria is developed to determine the most informative camera viewpoint in real time. The proposed method is integrated into a robotic system where a mobile robot adjusts its viewpoint to support the manipulator in grasping and placing tasks. Experimental evaluations demonstrate that D-optimality-based viewpoint selection improves recognition accuracy and task efficiency. The results suggest that optimal viewpoint planning can enhance perception robustness, leading to better manipulation performance. Although tested in structured environments, the approach has the potential to be extended to dynamic or unstructured settings. This research contributes to the integration of viewpoint optimization in vision-based robotic cooperation, with promising applications in industrial automation, service robotics, and human–robot collaboration. Full article

(This article belongs to the Special Issue Advanced Research in Intelligent Autonomous Mobile Robots System, Learning and Control)

► Show Figures

Graphical abstract

11 pages, 1240 KiB

Open AccessArticle

Profile of 50 m Sprinting: The Influence of Carbon-Plated Spikes on Maximum-Velocity Performance

by Krzysztof Mackala, Michal Krzysztofik, Adrian Weber, Dariusz Mroczek and Adam Zajac

Sensors 2025, 25(7), 1979; https://doi.org/10.3390/s25071979 - 22 Mar 2025

Viewed by 858

Abstract

The main goal of this study was to determine whether the type of spike can influence the final sprint result by comparing step by step the kinematics of four 50-m sprints. Twelve well-trained junior sprinters (ages 17–19) from the Polish National Team (ranging [...] Read more.

The main goal of this study was to determine whether the type of spike can influence the final sprint result by comparing step by step the kinematics of four 50-m sprints. Twelve well-trained junior sprinters (ages 17–19) from the Polish National Team (ranging from 100 to 400 m) participated in the study, with personal bests in the 100-m sprint of 10.70 ± 0.19 s. The OptoJump Next-Microgate sensor measurement system (Optojump, Bolzano, Italy) was used to measure the essential kinematic sprinting variables. Following the sprint distance, photocells were placed on the track at the start, at 10 m, at 20 m, at 30 m, and at the finish (50 m). Fifty-meter sprints were completed alternately, two with classic and two with the carbon-plated spikes. For every sprinter, the order in which the spikes were chosen was randomized. To better understand the problem of variability in kinematic parameters, in addition to the actual statistics, the profile analysis process was applied. The analysis of the four 50 m sprints did not show significant differences between the kinematic parameters considering runs in both the classic Nike and carbon-plated Nike ZoomX Flymax spikes. It may be suggested that spikes’ sole bending stiffness may not affect short-distance (up to 50–60 m) sprinting performance. From a practical point of view, training focused on maximum speed development can be carried out with both classic and carbon-plated spikes. Finally, our experiment can guide the preparation of a research methodology that assesses the effect of carbon-plated spikes on prolonged sprinting, e.g., 200–400 m. Full article

(This article belongs to the Special Issue Sensors Technologies for Measurements and Signal Processing)

► Show Figures

Figure 1

20 pages, 2696 KiB

Open AccessArticle

See-Then-Grasp: Object Full 3D Reconstruction via Two-Stage Active Robotic Reconstruction Using Single Manipulator

by Youngtaek Hong, Jonghyeon Kim, Geonho Cha, Eunwoo Kim and Kyungjae Lee

Appl. Sci. 2025, 15(1), 272; https://doi.org/10.3390/app15010272 - 30 Dec 2024

Viewed by 1719

Abstract

In this paper, we propose an active robotic 3D reconstruction methodology for achieving full object 3D reconstruction. Existing robotic 3D reconstruction approaches often struggle to cover the entire view space of the object or reconstruct occluded regions, such as the bottom or back [...] Read more.

In this paper, we propose an active robotic 3D reconstruction methodology for achieving full object 3D reconstruction. Existing robotic 3D reconstruction approaches often struggle to cover the entire view space of the object or reconstruct occluded regions, such as the bottom or back side. To address these limitations, we introduce a two-stage robotic active 3D reconstruction pipeline, named See-Then-Grasp (STG), that employs a robot manipulator for direct interaction with the object. The manipulator moves toward the points with the highest uncertainty, ensuring efficient data acquisition and rapid reconstruction. Our method expands the view space of the object to include the entire perspective, including occluded areas, making the previous fixed view candidate approach time-consuming for identifying uncertain regions. To overcome this, we propose a gradient-based next best view pose optimization method that efficiently identifies uncertain regions, enabling faster and more effective reconstruction. Our method optimizes the camera pose based on an uncertainty function, allowing it to identify the most uncertain regions in a short time. Through experiments with synthetic objects, we demonstrate that our approach effectively addresses the next best view selection problem, achieving significant improvements in computational efficiency while maintaining high-quality 3D reconstruction. Furthermore, we validate our method on a real robot, showing that it enables full 3D reconstruction of real-world objects. Full article

(This article belongs to the Special Issue Advances in Robotics and Autonomous Systems)

► Show Figures

Figure 1

29 pages, 30892 KiB

Open AccessArticle

A Generalized Voronoi Diagram-Based Segment-Point Cyclic Line Segment Matching Method for Stereo Satellite Images

by Li Zhao, Fengcheng Guo, Yi Zhu, Haiyan Wang and Bingqian Zhou

Remote Sens. 2024, 16(23), 4395; https://doi.org/10.3390/rs16234395 - 24 Nov 2024

Viewed by 877

Abstract

Matched line segments are crucial geometric elements for reconstructing the desired 3D structure in stereo satellite imagery, owing to their advantages in spatial representation, complex shape description, and geometric computation. However, existing line segment matching (LSM) methods face significant challenges in effectively addressing [...] Read more.

Matched line segments are crucial geometric elements for reconstructing the desired 3D structure in stereo satellite imagery, owing to their advantages in spatial representation, complex shape description, and geometric computation. However, existing line segment matching (LSM) methods face significant challenges in effectively addressing co-linear interference and the misdirection of parallel line segments. To address these issues, this study proposes a “continuous–discrete–continuous” cyclic LSM method, based on the Voronoi diagram, for stereo satellite images. Initially, to compute the discrete line-point matching rate, line segments are discretized using the Bresenham algorithm, and the pyramid histogram of visual words (PHOW) feature is assigned to the line segment points which are detected using the line segment detector (LSD). Next, to obtain continuous matched line segments, the method combines the line segment crossing angle rate with the line-point matching rate, utilizing a soft voting classifier. Finally, local point-line homography models are constructed based on the Voronoi diagram, filtering out misdirected parallel line segments and yielding the final matched line segments. Extensive experiments on the challenging benchmark, WorldView-2 and WorldView-3 satellite image datasets, demonstrate that the proposed method outperforms several state-of-the-art LSM methods. Specifically, the proposed method achieves F1-scores that are 6.22%, 12.60%, and 18.35% higher than those of the best-performing existing LSM method on the three datasets, respectively. Full article

(This article belongs to the Section Remote Sensing Image Processing)

► Show Figures

Figure 1

22 pages, 6640 KiB

Open AccessArticle

Efficient 3D Exploration with Distributed Multi-UAV Teams: Integrating Frontier-Based and Next-Best-View Planning

by André Ribeiro and Meysam Basiri

Drones 2024, 8(11), 630; https://doi.org/10.3390/drones8110630 - 31 Oct 2024

Cited by 1 | Viewed by 1976

Abstract

Autonomous exploration of unknown environments poses many challenges in robotics, particularly when dealing with vast and complex landscapes. This paper presents a novel framework tailored for distributed multi-robot systems, harnessing the 3D mobility capabilities of Unmanned Aerial Vehicles (UAVs) equipped with advanced LiDAR [...] Read more.

Autonomous exploration of unknown environments poses many challenges in robotics, particularly when dealing with vast and complex landscapes. This paper presents a novel framework tailored for distributed multi-robot systems, harnessing the 3D mobility capabilities of Unmanned Aerial Vehicles (UAVs) equipped with advanced LiDAR sensors for the rapid and effective exploration of uncharted territories. The proposed approach uniquely integrates the robustness of frontier-based exploration with the precision of Next-Best-View (NBV) planning, supplemented by a distance-based assignment cooperative strategy, offering a comprehensive and adaptive strategy for these systems. Through extensive experiments conducted across distinct environments using up to three UAVs, the efficacy of the exploration planner and cooperative strategy is rigorously validated. Benchmarking against existing methods further underscores the superiority of the proposed approach. The results demonstrate successful navigation through complex 3D landscapes, showcasing the framework’s capability in both single- and multi-UAV scenarios. While the benefits of employing multiple UAVs are evident, exhibiting reductions in exploration time and individual travel distance, this study also reveals findings regarding the optimal number of UAVs, particularly in smaller and wider environments. Full article

(This article belongs to the Special Issue Recent Advances in UAV Navigation)

► Show Figures

Figure 1

11 pages, 575 KiB

Open AccessProceeding Paper

A Comprehensive Asset Management Plan for the Bridges of Ontario for 2023–2025

by Chowdhury Sakib-Uz-Zaman, Md Ainul Kabir and Golam Kabir

Eng. Proc. 2024, 76(1), 36; https://doi.org/10.3390/engproc2024076036 - 22 Oct 2024

Cited by 1 | Viewed by 642

Abstract

Bridges, like any other infrastructure, deteriorate over time, and the more they degrade, the more expensive it becomes to perform maintenance activities on them. Therefore, it is important to predict their deterioration and plan intervention programs from economic and functionality points of view. [...] Read more.

Bridges, like any other infrastructure, deteriorate over time, and the more they degrade, the more expensive it becomes to perform maintenance activities on them. Therefore, it is important to predict their deterioration and plan intervention programs from economic and functionality points of view. While deterministic models are used to predict the deterioration of a structure, they do not consider any maintenance activities carried out beforehand. In this paper, we adjusted the deterioration model to overcome those limitations and planned an intervention program for the bridges of Ontario for 2023–2025. To achieve this, we cleaned the dataset; formulated the models; ran simulations with multiple deterministic and stochastic models to find the best one; adjusted the model equation to account for various levels of previous maintenance works; formulated the intervention action requirement for the next three years by developing an algorithm to reflect previous maintenance trends and their effects on the condition ratings; prepared an equalized work requirement for each year; and prioritized the action requirements based on a proposed risk matrix. Finally, an exhaustive plan with an action requirement for all the bridges was prepared for 2023–2025. The proposed plan will help decision makers to take proper action to maintain the serviceability of the bridges. Full article

(This article belongs to the Proceedings of 1st International Conference on Industrial, Manufacturing, and Process Engineering (ICIMP-2024))

► Show Figures

Figure 1

19 pages, 2162 KiB

Open AccessEssay

A Futures Perspective on Learning and Teaching in Higher Education: An Essay on Best and Next Practices

by Nada Jarni and David Gurr

Trends High. Educ. 2024, 3(3), 793-811; https://doi.org/10.3390/higheredu3030045 - 12 Sep 2024

Cited by 1 | Viewed by 3745

Abstract

Higher education is a sector that can be slow to change, yet there are significant pressures on universities and other providers to change. Learning and teaching are central to what higher education does, and pressures, such as the switch to remote learning during [...] Read more.

Higher education is a sector that can be slow to change, yet there are significant pressures on universities and other providers to change. Learning and teaching are central to what higher education does, and pressures, such as the switch to remote learning during the pandemic and the increasing use of generative AI, are causing a reconsideration about good learning and teaching. This essay provides a futures framework to explore best and next practices in learning and teaching in higher education. Four important and influential papers and reviews are used to consider past and current views of good teaching and learning in higher education. From these, six evidence-informed teaching practices are described as examples of current best-practice views, and then these are developed into possible, plausible, probable, and preferred next practices. This essay provides a stimulus for practitioners and researchers to adopt a futures mindset for thinking about the development of teaching and learning in higher education. Full article

► Show Figures

Figure 1

17 pages, 16026 KiB

Open AccessArticle

ARM4CH: A Methodology for Autonomous Reality Modelling for Cultural Heritage

by Nikolaos Giakoumidis and Christos-Nikolaos Anagnostopoulos

Sensors 2024, 24(15), 4950; https://doi.org/10.3390/s24154950 - 30 Jul 2024

Cited by 2 | Viewed by 1252

Abstract

Nowadays, the use of advanced sensors, such as terrestrial, mobile 3D scanners and photogrammetric imaging, has become the prevalent practice for 3D Reality Modeling (RM) and the digitization of large-scale monuments of Cultural Heritage (CH). In practice, this process is heavily related to [...] Read more.

Nowadays, the use of advanced sensors, such as terrestrial, mobile 3D scanners and photogrammetric imaging, has become the prevalent practice for 3D Reality Modeling (RM) and the digitization of large-scale monuments of Cultural Heritage (CH). In practice, this process is heavily related to the expertise of the surveying team handling the laborious planning and time-consuming execution of the 3D scanning process tailored to each site’s specific requirements and constraints. To minimize human intervention, this paper proposes a novel methodology for autonomous 3D Reality Modeling of CH monuments by employing autonomous robotic agents equipped with the appropriate sensors. These autonomous robotic agents are able to carry out the 3D RM process in a systematic, repeatable, and accurate approach. The outcomes of this automated process may also find applications in digital twin platforms, facilitating secure monitoring and the management of cultural heritage sites and spaces, in both indoor and outdoor environments. The main purpose of this paper is the initial release of an Industry 4.0-based methodology for reality modeling and the survey of cultural spaces in the scientific community, which will be evaluated in real-life scenarios in future research. Full article

(This article belongs to the Special Issue Sensors and Advanced Sensing Techniques for Computer Vision Applications)

► Show Figures

Figure 1

16 pages, 9644 KiB

Open AccessArticle

FF3D: A Rapid and Accurate 3D Fruit Detector for Robotic Harvesting

by Tianhao Liu, Xing Wang, Kewei Hu, Hugh Zhou, Hanwen Kang and Chao Chen

Sensors 2024, 24(12), 3858; https://doi.org/10.3390/s24123858 - 14 Jun 2024

Cited by 6 | Viewed by 1857

Abstract

This study presents the Fast Fruit 3D Detector (FF3D), a novel framework that contains a 3D neural network for fruit detection and an anisotropic Gaussian-based next-best view estimator. The proposed one-stage 3D detector, which utilizes an end-to-end 3D detection network, shows superior accuracy [...] Read more.

This study presents the Fast Fruit 3D Detector (FF3D), a novel framework that contains a 3D neural network for fruit detection and an anisotropic Gaussian-based next-best view estimator. The proposed one-stage 3D detector, which utilizes an end-to-end 3D detection network, shows superior accuracy and robustness compared to traditional 2D methods. The core of the FF3D is a 3D object detection network based on a 3D convolutional neural network (3D CNN) followed by an anisotropic Gaussian-based next-best view estimation module. The innovative architecture combines point cloud feature extraction and object detection tasks, achieving accurate real-time fruit localization. The model is trained on a large-scale 3D fruit dataset and contains data collected from an apple orchard. Additionally, the proposed next-best view estimator improves accuracy and lowers the collision risk for grasping. Thorough assessments on the test set and in a simulated environment validate the efficacy of our FF3D. The experimental results show an AP of 76.3%, an AR of 92.3%, and an average Euclidean distance error of less than 6.2 mm, highlighting the framework’s potential to overcome challenges in orchard environments. Full article

(This article belongs to the Special Issue Applications of Intelligent Robots: Sensing, Interaction, Navigation and Control Systems)

► Show Figures

Figure 1

19 pages, 2537 KiB

Open AccessArticle

Use of Real-World FHIR Data Combined with Context-Sensitive Decision Modeling to Guide Sentinel Biopsy in Melanoma

by Catharina Lena Beckmann, Georg Lodde, Jessica Swoboda, Elisabeth Livingstone and Britta Böckmann

J. Clin. Med. 2024, 13(11), 3353; https://doi.org/10.3390/jcm13113353 - 6 Jun 2024

Cited by 2 | Viewed by 1528

Abstract

Background: To support clinical decision-making at the point of care, the “best next step” based on Standard Operating Procedures (SOPs) and actual accurate patient data must be provided. To do this, textual SOPs have to be transformed into operable clinical algorithms and [...] Read more.

Background: To support clinical decision-making at the point of care, the “best next step” based on Standard Operating Procedures (SOPs) and actual accurate patient data must be provided. To do this, textual SOPs have to be transformed into operable clinical algorithms and linked to the data of the patient being treated. For this linkage, we need to know exactly which data are needed by clinicians at a certain decision point and whether these data are available. These data might be identical to the data used within the SOP or might integrate a broader view. To address these concerns, we examined if the data used by the SOP is also complete from the point of view of physicians for contextual decision-making. Methods: We selected a cohort of 67 patients with stage III melanoma who had undergone adjuvant treatment and mainly had an indication for a sentinel biopsy. First, we performed a step-by-step simulation of the patient treatment along our clinical algorithm, which is based on a hospital-specific SOP, to validate the algorithm with the given Fast Healthcare Interoperability Resources (FHIR)-based data of our cohort. Second, we presented three different decision situations within our algorithm to 10 dermatooncologists, focusing on the concrete patient data used at this decision point. The results were conducted, analyzed, and compared with those of the pure algorithmic simulation. Results: The treatment paths of patients with melanoma could be retrospectively simulated along the clinical algorithm using data from the patients’ electronic health records. The subsequent evaluation by dermatooncologists showed that the data used at the three decision points had a completeness between 84.6% and 100.0% compared with the data used by the SOP. At one decision point, data on “patient age (at primary diagnosis)” and “date of first diagnosis” were missing. Conclusions: The data needed for our decision points are available in the FHIR-based dataset. Furthermore, the data used at decision points by the SOP and hence the clinical algorithm are nearly complete compared with the data required by physicians in clinical practice. This is an important precondition for further research focusing on presenting decision points within a treatment process integrated with the patient data needed. Full article

(This article belongs to the Section Dermatology)

► Show Figures

Figure 1

14 pages, 29789 KiB

Open AccessArticle

Multi-View Metal Parts Pose Estimation Based on a Single Camera

by Chen Chen and Xin Jiang

Sensors 2024, 24(11), 3408; https://doi.org/10.3390/s24113408 - 25 May 2024

Cited by 2 | Viewed by 1388

Abstract

Pose estimation of metal parts plays a vital role in industrial grasping areas. It is challenging to obtain complete point clouds of metal parts because of their reflective properties. This study introduces an approach for recovering the 6D pose of CAD-known metal parts [...] Read more.

Pose estimation of metal parts plays a vital role in industrial grasping areas. It is challenging to obtain complete point clouds of metal parts because of their reflective properties. This study introduces an approach for recovering the 6D pose of CAD-known metal parts from images captured by a single RGB camera. The proposed strategy only requires RGB images without depth information. The core idea of the proposed method is to use multiple views to estimate the metal parts’ pose. First, the pose of metal parts is estimated in the first view. Second, ray casting is employed to simulate additional views with the corresponding status of the metal parts, enabling the calculation of the camera’s next best viewpoint. The camera, mounted on a robotic arm, is then moved to this calculated position. Third, this study integrates the known camera transformations with the poses estimated from different viewpoints to refine the final scene. The results of this work demonstrate that the proposed method effectively estimates the pose of shiny metal parts. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

15 pages, 2379 KiB

Open AccessArticle

Enhancing Apple Cultivar Classification Using Multiview Images

by Silvia Krug and Tino Hutschenreuther

J. Imaging 2024, 10(4), 94; https://doi.org/10.3390/jimaging10040094 - 17 Apr 2024

Cited by 1 | Viewed by 2154

Abstract

Apple cultivar classification is challenging due to the inter-class similarity and high intra-class variations. Human experts do not rely on single-view features but rather study each viewpoint of the apple to identify a cultivar, paying close attention to various details. Following our previous [...] Read more.

Apple cultivar classification is challenging due to the inter-class similarity and high intra-class variations. Human experts do not rely on single-view features but rather study each viewpoint of the apple to identify a cultivar, paying close attention to various details. Following our previous work, we try to establish a similar multiview approach for machine-learning (ML)-based apple classification in this paper. In our previous work, we studied apple classification using one single view. While these results were promising, it also became clear that one view alone might not contain enough information in the case of many classes or cultivars. Therefore, exploring multiview classification for this task is the next logical step. Multiview classification is nothing new, and we use state-of-the-art approaches as a base. Our goal is to find the best approach for the specific apple classification task and study what is achievable with the given methods towards our future goal of applying this on a mobile device without the need for internet connectivity. In this study, we compare an ensemble model with two cases where we use single networks: one without view specialization trained on all available images without view assignment and one where we combine the separate views into a single image of one specific instance. The two latter options reflect dataset organization and preprocessing to allow the use of smaller models in terms of stored weights and number of operations than an ensemble model. We compare the different approaches based on our custom apple cultivar dataset. The results show that the state-of-the-art ensemble provides the best result. However, using images with combined views shows a decrease in accuracy by 3% while requiring only 60% of the memory for weights. Thus, simpler approaches with enhanced preprocessing can open a trade-off for classification tasks on mobile devices. Full article

(This article belongs to the Special Issue Computer Vision and Deep Learning: Trends and Applications (2nd Edition))

► Show Figures

Figure 1

Search Results (75)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (75)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI