Achieving the Full Potential of State Machine Replication: High Throughput, Low Latency, Constant Availability
- 👤 Speaker: Julian Moraru, Carnegie Mellon University
- 📅 Date & Time: Thursday 13 March 2014, 10:00 - 11:00
- 📍 Venue: Small Lecture Theatre, Microsoft Research Ltd, 21 Station Road, Cambridge, CB1 2FB
Abstract
Redundancy through state replication is the primary mechanism for achieving fault tolerance in distributed systems. State machine replication (SMR) is used extensively both within datacenters, where machine failures are common and must be tolerated, and in the wide-area, to ensure that data is close to all the clients that access it, and to guard against data loss and service unavailability caused by datacenter outages. ! Today, the SMR protocol of choice in systems where performance and availability are critical is Paxos. Paxos does not depend on external failure detectors or reconfiguration services to tolerate the failure of a minority of replicas, and therefore, in theory, systems using Paxos have high availability. However, because of the need to optimize for high performance, the elegance of the core protocol does not fully extend to practical implementations. ! This work aims to plant practical SMR implementation aspects in a firm theoretical ground, and thus to enable SMR designs that achieve high throughput through near-perfect load balancing, near-optimal request processing latency (especially in the wide area), and high performance robustness when confronted with failures and slow replicas. The talk will focus on Egalitarian Paxos, a new variant of the Paxos protocol. In EPaxos, all replicas perform the same functions simultaneously to ensure load balancing, constant availability, and low commit latency. We will also show the benefits of in-depth exploration of other aspects of state machine replication—aspects heretofore belonging only to the realm of practical optimizations—such as time leases.
Series This talk is part of the Microsoft Research Cambridge, public talks series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- Guy Emerson's list
- Interested Talks
- Microsoft Research Cambridge, public talks
- ndk22's list
- ob366-ai4er
- Optics for the Cloud
- personal list
- PMRFPS's
- rp587
- School of Technology
- Small Lecture Theatre, Microsoft Research Ltd, 21 Station Road, Cambridge, CB1 2FB
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Julian Moraru, Carnegie Mellon University
Thursday 13 March 2014, 10:00-11:00