Application-Layer Fault-Tolerance Protocols

Author: De Florio, Vincenzo
Publisher: IGI Global
Total Pages: 378
Release: 2009-01-31
Genre: Computers
ISBN: 160566183X

"This book increases awareness of the need for application-level fault-tolerance (ALFT) through introduction of problems and qualitative analysis of solutions"--Provided by publisher.

Software Engineering of Fault Tolerant Systems

Author: P. Pelliccione
Publisher: World Scientific
Total Pages: 293
Release: 2007
Genre: Computers
ISBN: 9812778861

In architecting dependable systems, fault tolerance is required to improve overall system robustness. Many methods have been proposed to this end, but the solutions are usually considered late, during the design and implementation phases of the software life-cycle (e.g., Java and Windows NT exception handling), which reduces the effectiveness of error and fault handling. Since the system design typically models only the normal behaviour of the system while ignoring exceptional behaviour, the implementation is unable to handle abnormal events. Consequently, the system may fail in unexpected ways due to faults. It has been argued that fault tolerance management during the entire life-cycle improves overall system robustness, and that different classes of threats need to be identified and dealt with at each distinct phase of software development, depending on the abstraction level of the software system being modelled. This book builds on this trend and investigates how fault tolerance mechanisms can be applied when engineering a software system. In particular, it identifies the new problems arising in this area, introduces the new models to be applied at different abstraction levels, defines methodologies for model-driven engineering of such systems, and outlines the new technologies and the validation and verification environments supporting this.

Methods, Models and Tools for Fault Tolerance

Author: Michael Butler
Publisher: Springer
Total Pages: 350
Release: 2009-03-03
Genre: Computers
ISBN: 3642008674

The growing complexity of modern software systems increases the difficulty of ensuring the overall dependability of software-intensive systems. Complexity of environments in which systems operate, high dependability requirements that systems have to meet, as well as the complexity of infrastructures on which they rely make system design a true engineering challenge. Mastering system complexity requires design techniques that support clear thinking and rigorous validation and verification. Formal design methods help to achieve this. Coping with complexity also requires architectures that are tolerant of faults and of unpredictable changes in environment. This issue can be addressed by fault-tolerant design techniques. Therefore, there is a clear need of methods enabling rigorous modelling and development of complex fault-tolerant systems. This book addresses such acute issues in developing fault-tolerant systems as:

– Verification and refinement of fault-tolerant systems
– Integrated approaches to developing fault-tolerant systems
– Formal foundations for error detection, error recovery, exception and fault handling
– Abstractions, styles and patterns for rigorous development of fault tolerance
– Fault-tolerant software architectures
– Development and application of tools supporting rigorous design of dependable systems
– Integrated platforms for developing dependable systems
– Rigorous approaches to specification and design of fault tolerance in novel computing systems

The editors of this book were involved in the EU (FP-6) project RODIN (Rigorous Open Development Environment for Complex Systems), which brought together researchers from the fault tolerance and formal methods communities. In 2007 RODIN organized the MeMoT workshop held in conjunction with the Integrated Formal Methods 2007 Conference at Oxford University.

Fault-Tolerant Parallel and Distributed Systems

Author: Dimiter R. Avresky
Publisher: Springer Science & Business Media
Total Pages: 396
Release: 2012-12-06
Genre: Computers
ISBN: 1461554497

The most important use of computing in the future will be in the context of the global "digital convergence" where everything becomes digital and everything is inter-networked. The application will be dominated by storage, search, retrieval, analysis, exchange and updating of information in a wide variety of forms. Heavy demands will be placed on systems by many simultaneous requests. And, fundamentally, all this shall be delivered at much higher levels of dependability, integrity and security. Increasingly, large parallel computing systems and networks are providing unique challenges to industry and academia in dependable computing, especially because of the higher failure rates intrinsic to these systems. The challenge in the last part of this decade is to build a system that is both inexpensive and highly available. A machine cluster built of commodity hardware parts, with each node running an OS instance and a set of applications extended to be fault resilient, can satisfy the new stringent high-availability requirements. The focus of this book is to present recent techniques and methods for implementing fault-tolerant parallel and distributed computing systems. Section I, Fault-Tolerant Protocols, considers basic techniques for achieving fault-tolerance in communication protocols for distributed systems, including synchronous and asynchronous group communication, static total causal ordering protocols, and fail-aware datagram service that supports communications by time.

Fault-Tolerance Techniques for Spacecraft Control Computers

Author: Mengfei Yang
Publisher: John Wiley & Sons
Total Pages: 374
Release: 2017-01-23
Genre: Computers
ISBN: 1119107407

Comprehensive coverage of all aspects of fault tolerance techniques oriented to space applications

• Written by an experienced expert who has worked on fault tolerance for the Chinese space program for almost three decades
• Provides a systematic text on cutting-edge fault tolerance techniques for spacecraft control computers, with emphasis on practical engineering knowledge
• Presents fundamental and advanced theories and technologies in a logical and easy-to-understand manner
• Beneficial to readers inside and outside the area of space applications

Fault Tolerance Within Session Initiation Protocol

Author: Ekta Khurana
Release: 2005

The Session Initiation Protocol (SIP) is an application-layer signaling protocol designed for session establishment between end users. SIP sessions are established based on a request-response paradigm between the client and server components of the user agent. Proxy servers may exist within the network, serving as a bridge between two user agents that are either within different domains or are unregistered and unaware of each other's location. These entities, however, are not always reliable and may encounter faults. This thesis first identifies the locations within a SIP session where faults can occur and their effect on the system. The current mechanisms used to handle these faults are then analyzed, and it is determined that user agents are forced to recover from faults by restarting the session. Finally, a novel approach is proposed which detects and recovers from faults dynamically.

Fault-Tolerance Techniques for High-Performance Computing

Author: Thomas Herault
Publisher: Springer
Total Pages: 325
Release: 2015-07-01
Genre: Computers
ISBN: 3319209434

This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as ABFT. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems.

Fault Tolerance

Author: Peter A. Lee
Publisher: Springer Science & Business Media
Total Pages: 326
Release: 2012-12-06
Genre: Computers
ISBN: 370918990X

The production of a new version of any book is a daunting task, as many authors will recognise. In the field of computer science, the task is made even more daunting by the speed with which the subject and its supporting technology move forward. Since the publication of the first edition of this book in 1981 much research has been conducted, and many papers have been written, on the subject of fault tolerance. Our aim then was to present for the first time the principles of fault tolerance together with current practice to illustrate those principles. We believe that the principles have (so far) stood the test of time and are as appropriate today as they were in 1981. Much work on the practical applications of fault tolerance has been undertaken, and techniques have been developed for ever more complex situations, such as those required for distributed systems. Nevertheless, the basic principles remain the same.