posted on 2016-10-26, 15:27authored byMuhammad Fayyaz, Tanya Vladimirova
Current and future space missions demand highly reliable on-board computing systems, which are capable of carrying out high-performance data processing. At present, no single computing scheme satisfies both, the highly reliable operation requirement and the high-performance computing requirement. The aim of this paper is to review existing systems and offer a new approach to addressing the problem. In the first part of the paper, a detailed survey of fault-tolerant distributed computing systems for space applications is presented. Fault types and assessment criteria for fault-tolerant systems are introduced. Redundancy schemes for distributed systems are analyzed. A review of the state-of-the-art on fault-tolerant distributed systems is presented and limitations of current approaches are discussed. In the second part of the paper, a new fault-tolerant distributed computing platform with wireless links among the computing nodes is proposed. Novel algorithms, enabling important aspects of the architecture, such as time slot priority adaptive fault-tolerant channel access and fault-tolerant distributed computing using task migration are introduced.
History
Citation
Advances in Space Research, 2016, In Press
Author affiliation
/Organisation/COLLEGE OF SCIENCE AND ENGINEERING/Department of Engineering
Version
AM (Accepted Manuscript)
Published in
Advances in Space Research
Publisher
Elsevier for 1. Committee on Space Research (COSPAR) 2. Pergamon