REPLICATION VS. FAILURE PREVENTION: HOW TO BOOST SERVICE AVAILABILITY?

Felix Salfner¹ and Katinka Wolter²

International Computer Science Institute, Berkeley¹
Humboldt-Universität zu Berlin²

salfner@icsi.berkeley.edu


Abstract

This paper provides a first analysis of the effectiveness of simple server replication vs. failure prevention in non-high-availability applications. We analyze service availability for a system with N servers, where each server is modeled as a finite queue subject to failures. A Petri net analysis suggests that duplicating the server is the most effective way to improve service availability, and that further gains are best achieved by combining replication with failure prevention.