Technology
I’ve been pseudo-collecting academic or white papers on products and technology for awhile. I’ve never had a good way to track them, and thought maybe creating a page where these are all consolidated would be helpful. Maybe this already exists somewhere. I’ve not seen it if it does. Sorted in reverse time order (most recent at top)
Database and Storage
- Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases (2017)
- Spanner, TrueTime & The CAP Theorem (2017)
- Spanner: Google’s Globally-Distributed Database (2013)
- CAP Twelve Years Later: How the “Rules” Have Changed (2012)
- Bigtable: A Distributed Storage System for Structured Data (2006)
- Rules of Thumb in Data Engineering (2000)
Data Processing
- Kafka: a Distributed Messaging System for Log Processing (2011)
- MapReduce: Simplified Data Processing on Large Clusters (2004)
Fault Tolerance
- Harvest, Yield and Scalable Tolerant Systems (1999)
- Time, Clocks, and the Ordering of Events in a Distributed System (1978)
File System
- The Google File System (2003)
Site Reliability Engineering (SRE)
Containers