In this talk, I will present our recent work on a new speedy and fault-tolerant replicated multi-core transactional database system, Rolis. Rolis’s aim is to mask the high cost of replication by ensuring that cores are always doing useful work and not waiting for each other or for other replicas. Rolis achieves this by not mixing the multi-core concurrency control with multi-machine replication, as is traditionally done by systems that use Paxos to replicate the transaction commit protocol. Instead, Rolis takes an “execute-replicate-replay” approach. Rolis first speculatively executes the transaction on the leader machine and then replicates the per-thread transaction log to the followers using a novel protocol that leverages independent Paxos instances to avoid coordination, while still allowing followers to safely replay. The execution, replication, and replay are carefully designed to be scalable and have nearly zero coordination overhead across cores.
Please email for a
Weihai is a second-year CS PhD student at Stony Brook University and currently working on a multi-core and fault-tolerant distributed system.