Fault-Tolerant Search Algorithms

Title Fault-Tolerant Search Algorithms
Author Ferdinando Cicalese
Publisher Springer Science & Business Media
Total Pages 218
Release 2013-11-29
Genre Computers
ISBN 3642173276

Why a book on fault-tolerant search algorithms? Searching is one of the fundamental problems in computer science. Time and again algorithmic and combinatorial issues originally studied in the context of search find application in the most diverse areas of computer science and discrete mathematics. On the other hand, fault-tolerance is a necessary ingredient of computing. Due to their inherent complexity, information systems are naturally prone to errors, which may appear at any level: as imprecisions in the data, bugs in the software, or transient or permanent hardware failures. This book provides a concise, rigorous and up-to-date account of different approaches to fault-tolerance in the context of algorithmic search theory. Thanks to their basic structure, search problems offer insights into how fault-tolerant techniques may be applied in various scenarios. In the first part of the book, a paradigmatic model for fault-tolerant search is presented, the Ulam-Rényi problem. Following a didactic approach, the author takes the reader on a tour of Ulam-Rényi problem variants of increasing complexity. In the context of this basic model, fundamental combinatorial and algorithmic issues in the design of fault-tolerant search procedures are discussed. The algorithmic efficiency achievable is analyzed with respect to the statistical nature of the error sources, and the amount of information on which the search algorithm bases its decisions. In the second part of the book, more general models of faults and fault-tolerance are considered. Special attention is given to the application of fault-tolerant search procedures to specific problems in distributed computing, bioinformatics and computational learning. This book will be of special value to researchers from the areas of combinatorial search and fault-tolerant computation, but also to researchers in learning and coding theory, databases, and artificial intelligence. Only basic training in discrete mathematics is assumed.
Parts of the book can be used as the basis for specialized graduate courses on combinatorial search, or as supporting material for a graduate or undergraduate course on error-correcting codes.

Fault-Tolerant Parallel Computation

Title Fault-Tolerant Parallel Computation
Author Paris Christos Kanellakis
Publisher Springer Science & Business Media
Total Pages 203
Release 2013-03-09
Genre Computers
ISBN 1475752105

Fault-Tolerant Parallel Computation presents recent advances in algorithmic ways of introducing fault-tolerance in multiprocessors under the constraint of preserving efficiency. The difficulty associated with combining fault-tolerance and efficiency is that the two have conflicting means: fault-tolerance is achieved by introducing redundancy, while efficiency is achieved by removing redundancy. This monograph demonstrates how in certain models of parallel computation it is possible to combine efficiency and fault-tolerance and shows how it is possible to develop efficient algorithms without concern for fault-tolerance, and then correctly and efficiently execute these algorithms on parallel machines whose processors are subject to arbitrary dynamic fail-stop errors. The efficient algorithmic approaches to multiprocessor fault-tolerance presented in this monograph make a contribution towards bridging the gap between the abstract models of parallel computation and realizable parallel architectures. Fault-Tolerant Parallel Computation presents the state of the art in algorithmic approaches to fault-tolerance in efficient parallel algorithms. The monograph synthesizes work that was presented in recent symposia and published in refereed journals by the authors and other leading researchers. This is the first text that takes the reader on the grand tour of this new field summarizing major results and identifying hard open problems. This monograph will be of interest to academic and industrial researchers and graduate students working in the areas of fault-tolerance, algorithms and parallel computation and may also be used as a text in a graduate course on parallel algorithmic techniques and fault-tolerance.

Fault-Tolerant Message-Passing Distributed Systems

Title Fault-Tolerant Message-Passing Distributed Systems
Author Michel Raynal
Publisher Springer
Total Pages 459
Release 2018-09-08
Genre Computers
ISBN 3319941410

This book presents the most important fault-tolerant distributed programming abstractions and their associated distributed algorithms, in particular in terms of reliable communication and agreement, which lie at the heart of nearly all distributed applications. These programming abstractions, distributed objects or services, allow software designers and programmers to cope with asynchrony and the most important types of failures such as process crashes, message losses, and malicious behaviors of computing entities, widely known under the term "Byzantine fault-tolerance". The author introduces these notions in an incremental manner, starting from a clear specification, followed by algorithms which are first described intuitively and then proved correct. The book also presents impossibility results in classic distributed computing models, along with strategies, mainly failure detectors and randomization, that allow us to enrich these models. In this sense, the book constitutes an introduction to the science of distributed computing, with applications in all domains of distributed systems, such as cloud computing and blockchains. Each chapter comes with exercises and bibliographic notes to help the reader approach, understand, and master the fascinating field of fault-tolerant distributed computing.

Fault-Tolerance Techniques for High-Performance Computing

Title Fault-Tolerance Techniques for High-Performance Computing
Author Thomas Herault
Publisher Springer
Total Pages 325
Release 2015-07-01
Genre Computers
ISBN 3319209434

This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as ABFT. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems.

Methods, Models and Tools for Fault Tolerance

Title Methods, Models and Tools for Fault Tolerance
Author Michael Butler
Publisher Springer Science & Business Media
Total Pages 350
Release 2009-03-26
Genre Computers
ISBN 3642008666

The growing complexity of modern software systems makes it increasingly difficult to ensure the overall dependability of software-intensive systems. Mastering system complexity requires design techniques that support clear thinking and rigorous validation and verification. Formal design methods together with fault-tolerant design techniques help to achieve this. Therefore, there is a clear need for methods that enable rigorous modeling and the development of complex fault-tolerant systems. This book is an outcome of the workshop on Methods, Models and Tools for Fault Tolerance, MeMoT 2007, held in conjunction with the 6th international conference on Integrated Formal Methods, iFM 2007, in Oxford, UK, in July 2007. The authors of the best workshop papers were asked to enhance and expand their work, and a number of well-established researchers working in the area contributed invited chapters in addition. From the 15 refereed and revised papers presented, 12 are versions reworked from the workshop and 3 papers are invited. The articles are organized in four topical sections on: formal reasoning about fault-tolerant systems and protocols; fault tolerance: modelling in B; fault tolerance in system development process; and fault-tolerant applications.

Tools and Algorithms for the Construction and Analysis of Systems

Title Tools and Algorithms for the Construction and Analysis of Systems
Author Tomáš Vojnar
Publisher Springer
Total Pages 413
Release 2019-04-03
Genre Computers
ISBN 3030174654

This book is Open Access under a CC BY licence. The LNCS 11427 and 11428 proceedings set constitutes the proceedings of the 25th International Conference on Tools and Algorithms for the Construction and Analysis of Systems, TACAS 2019, which took place in Prague, Czech Republic, in April 2019, held as part of the European Joint Conferences on Theory and Practice of Software, ETAPS 2019. The total of 42 full and 8 short tool demo papers presented in these volumes was carefully reviewed and selected from 164 submissions. The papers are organized in topical sections as follows: Part I: SAT and SMT, SAT solving and theorem proving; verification and analysis; model checking; tool demo; and machine learning. Part II: concurrent and distributed systems; monitoring and runtime verification; hybrid and stochastic systems; synthesis; symbolic verification; and safety and fault-tolerant systems.

Bio-Inspired Fault-Tolerant Algorithms for Network-on-Chip

Title Bio-Inspired Fault-Tolerant Algorithms for Network-on-Chip
Author Muhammad Athar Javed Sethi
Publisher CRC Press
Total Pages 212
Release 2020-03-17
Genre Computers
ISBN 1000048055

Network-on-Chip (NoC) addresses the communication requirements of the different nodes on a System-on-Chip. Bio-inspired algorithms improve bandwidth utilization, maximize throughput, and reduce end-to-end latency and inter-flit arrival time. This book presents in-depth information on bio-inspired algorithms that solve real-world problems, focusing on fault-tolerant algorithms inspired by the biological brain and implemented on NoC. It further documents bio-inspired algorithms in general and, more specifically, their use in the design of NoC. It gives an exhaustive review and analysis of the NoC architectures developed during the last decade according to various parameters. Key Features: covers bio-inspired solutions pertaining to Network-on-Chip (NoC) design, solving real-world examples; includes bio-inspired NoC fault-tolerant algorithms with detailed coding examples; lists fault-tolerant algorithms with detailed examples; reviews basic concepts of NoC; discusses NoC architectures developed to date.