Survey and Future Directions of Fault-Tolerant Distributed Computing on Board Spacecraft
Journal contribution posted on 2016-10-26, 15:27, authored by Muhammad Fayyaz, Tanya Vladimirova
Current and future space missions demand highly reliable on-board computing systems that are also capable of high-performance data processing. At present, no single computing scheme satisfies both the high-reliability requirement and the high-performance requirement. The aim of this paper is to review existing systems and to offer a new approach to the problem. The first part of the paper presents a detailed survey of fault-tolerant distributed computing systems for space applications. Fault types and assessment criteria for fault-tolerant systems are introduced, and redundancy schemes for distributed systems are analyzed. The state of the art in fault-tolerant distributed systems is reviewed, and the limitations of current approaches are discussed. The second part of the paper proposes a new fault-tolerant distributed computing platform with wireless links among the computing nodes. Novel algorithms that enable important aspects of the architecture are introduced, such as time-slot-priority adaptive fault-tolerant channel access and fault-tolerant distributed computing using task migration.
Citation: Advances in Space Research, 2016, In Press
Author affiliation: College of Science and Engineering, Department of Engineering
Version: AM (Accepted Manuscript)