Next Article in Journal
Smart Sensors for Healthcare and Medical Applications
Previous Article in Journal
WON-OCDMA System Based on MW-ZCC Codes for Applications in Optical Wireless Sensor Networks
Previous Article in Special Issue
Fabrication of Ultraviolet Photodetectors Based on Fe-Doped ZnO Nanorod Structures
Open AccessArticle

High-Availability Computing Platform with Sensor Fault Resilience

Department of Computer Science and Information Engineering, National Central University, Taoyuan 320, Taiwan
Institute for Information Industry, Taipei 106, Taiwan
Author to whom correspondence should be addressed.
Sensors 2021, 21(2), 542;
Received: 21 December 2020 / Revised: 5 January 2021 / Accepted: 11 January 2021 / Published: 13 January 2021
(This article belongs to the Special Issue Selected Papers from TIKI IEEE ICICE 2019& ICASI 2020)
Modern computing platforms usually use multiple sensors to report system information. In order to achieve high availability (HA) for the platform, the sensors can be used to efficiently detect system faults that make a cloud service not live. However, a sensor may fail and disable HA protection. In this case, human intervention is needed, either to change the original fault model or to fix the sensor fault. Therefore, this study proposes an HA mechanism that can continuously provide HA to a cloud system based on dynamic fault model reconstruction. We have implemented the proposed HA mechanism on a four-layer OpenStack cloud system and tested the performance of the proposed mechanism for all possible sets of sensor faults. For each fault model, we inject possible system faults and measure the average fault detection time. The experimental result shows that the proposed mechanism can accurately detect and recover an injected system fault with disabled sensors. In addition, the system fault detection time increases as the number of sensor faults increases, until the HA mechanism is degraded to a one-system-fault model, which is the worst case as the system layer heartbeating. View Full-Text
Keywords: failover; high availability; sensor fault; fault detection and recovery; liveness detection failover; high availability; sensor fault; fault detection and recovery; liveness detection
Show Figures

Figure 1

MDPI and ACS Style

Lee, Y.-L.; Arizky, S.N.; Chen, Y.-R.; Liang, D.; Wang, W.-J. High-Availability Computing Platform with Sensor Fault Resilience. Sensors 2021, 21, 542.

AMA Style

Lee Y-L, Arizky SN, Chen Y-R, Liang D, Wang W-J. High-Availability Computing Platform with Sensor Fault Resilience. Sensors. 2021; 21(2):542.

Chicago/Turabian Style

Lee, Yen-Lin; Arizky, Shinta N.; Chen, Yu-Ren; Liang, Deron; Wang, Wei-Jen. 2021. "High-Availability Computing Platform with Sensor Fault Resilience" Sensors 21, no. 2: 542.

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

Search more from Scilit
Back to TopTop