A collection of research papers on Distributed Systems focusing on the way we could build large scale clusters without heavy coordination between nodes.
- Consistency and Replication
- Conflict-Free Replicated Data Types
- Failure detection
- Scheduling
- Topology
- Peer to Peer
- Language and Tools Support
- Consistency Without Borders - Peter Alvaro, Peter Bailis, Neil Conway, Joseph M. Hellerstein
- Anna: A KVS For Any Scale - Chenggang Wu, Jose M. Faleiro, Yihan Lin, Joseph M. Hellerstein
- Putting Consistency Back into Eventual Consistency - Valter Balegas, Sérgio Duarte, Carla Ferreira, Rodrigo Rodrigues, Nuno Preguiça, Mahsa Najafzadeh, Marc Shapiro
- Don't Settle for Eventual: Scalable Causal Consistency for Wide-Area Storage with COPS - Wyatt Lloyd, Michael J. Freedman, Michael Kaminsky, and David G. Andersen
- Cure: Strong semantics meets high availability and low latency - Deepthi Devaki Akkoorath, Alejandro Tomsic, Manuel Bravo, Zhongmiao Li, Tyler Crain, Annette Bieniusa, Nuno Preguiça, Marc Shapiro
- Bolt-On Causal Consistency - Peter Bailis, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica
- The Potential Dangers of Causal Consistency and an Explicit Solution - Peter Bailis, Alan Fekete, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica
- Geo-Replication: Fast If Possible, Consistent If Necessary - Valter Balegas, Cheng Li, Mahsa Najafzadeh, Daniel Porto, Allen Clement, Sérgio Duarte, Carla Ferreira, Johannes Gehrke, João Leitão, Nuno Preguiça, et al.
- Concise Server-Wide Causality Management for Eventually Consistent Data Stores - Ricardo Gonçalves, Paulo Sérgio Almeida, Carlos Baquero, Victor Fonte
- Conflict-free Replicated Data Types - Marc Shapiro, Nuno Preguiça, Carlos Baquero, Marek Zawirski
- Delta State Replicated Data Types - Paulo Sérgio Almeida, Ali Shoker, Carlos Baquero
- Omega: flexible, scalable schedulers for large compute clusters - Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, John Wilkes
- Sparrow: Distributed, Low Latency Scheduling - Kay Ousterhout, Patrick Wendell, Matei Zaharia, Ion Stoica
- Slot Scheduling: General-Purpose Multiprocessor Scheduling for Heterogeneous Workloads - Brandon Hall
- Paired Gang Scheduling - Yair Wiseman, Drog G. Feitelson
- Lock-free and Wait-free Slot Scheduling Algorithms - Pooja Aggarwal, Smruti R. Sarangi
- Failure Detectors for Large-Scale Distributed Systems - Naohiro Hayashibara, Adel Cherif, Takuya Katayama
- 2W-FD: A Failure Detector Algorithm with QoS - Alejandro Z. Tomsic, Pierre Sens, Joao Coelho Garcia, Luciana Arantes, Julien Sopena
- The φ Accrual Failure Detector - Naohiro Hayashibara, Xavier Défago, Rami Yared, Takuya Katayama
- A Scalable and Efficient Self-Organizing Failure Detector for Grid Applications - Yuuki Horita, Kenjiro Taura, Takashi Chikayama
- T-man: Gossip-based Overlay Topology Management - Márk Jelasity, Ozalp Babaoglu
- A fair comparison of gossip algorithms over large-scale random topologies - Ruijing Hu, Julien Sopena, Luciana Arantes, Pierre Sens, Isabelle Demeure
- A Survey and Comparison of Peer-to-Peer Overlay Network Schemes - Eng Keong Lua, Jon Crowcroft, Marcelo Pias, Ravi Sharma and Steven Lim
- Lasp: A Language for Distributed, Coordination-Free Programming - Christopher Meiklejohn, Peter Van Roy