Open hhow09 opened 1 year ago
https://interviewready.io/blog/white-papers-worth-reading-for-software-engineers
Google File System: https://static.googleusercontent.com/media/research.google.com/en//archive/gfs-sosp2003.pdf
Map Reduce Big Data Algorithm: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/16cb30b4b92fd4989b8619a61752a2387c6dd474.pdf
BigTable NoSQL Document Store: https://static.googleusercontent.com/media/research.google.com/en//archive/bigtable-osdi06.pdf
Colossus Next Gen File Store: https://cloud.google.com/blog/products/storage-data-transfer/a-peek-behind-colossus-googles-file-system
Megastore
Large Object Store: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/36971.pdf
Monarch Time Series DB: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/d84ab6c93881af998de877d0070a706de7bec6d8.pdf
Chubby Distributed Lock Management: https://static.googleusercontent.com/media/research.google.com/en//archive/chubby-osdi06.pdf
Spanner Distributed Database: https://static.googleusercontent.com/media/research.google.com/en//archive/spanner-osdi2012.pdf
Dapper Tracing System: https://static.googleusercontent.com/media/research.google.com/en//archive/papers/dapper-2010-1.pdf
Borg Cluster Management: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/43438.pdf
Zanzibar Authentication System: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/10683a8987dbf0c6d4edcafb9b4f05cc9de5974a.pdf
Pregel Graph Processing: https://15799.courses.cs.cmu.edu/fall2013/static/papers/p135-malewicz.pdf
Amazon Aurora DB Architecture: https://web.stanford.edu/class/cs245/readings/aurora.pdf
Dynamo DB NoSQL Database: https://www.allthingsdistributed.com/files/amazon-dynamo-sosp2007.pdf
Foundation DB NewSQL database: https://www.foundationdb.org/files/fdb-paper.pdf
Monolith Embedding in real time: https://arxiv.org/pdf/2209.07663.pdf
Gorilla Time series DB: https://www.vldb.org/pvldb/vol8/p1816-teller.pdf
Cassandra NoSQL DB: https://www.cs.cornell.edu/projects/ladis2009/papers/lakshman-ladis2009.pdf
FlexiRaft Distributed Concensus Tradeoffs: https://www.cidrdb.org/cidr2023/papers/p83-yadav.pdf
Memcache In-memory Cache: https://www.usenix.org/system/files/conference/nsdi13/nsdi13-final170_update.pdf
Millisampler Network Sampling: https://dl.acm.org/doi/pdf/10.1145/3517745.3561430
TAO Graph Database: https://www.usenix.org/system/files/conference/atc13/atc13-bronson.pdf
MineSweeper - Root Cause Analysis: https://arxiv.org/pdf/2010.09974.pdf
Prophet - Forecasting at Scale: https://peerj.com/preprints/3190.pdf
ShardManager: https://research.facebook.com/file/245575980870853/Shard-Manager-A-Generic-Shard-Management-Framework-for-Geo-distributed-Applications.pdf
Hadoop File System: https://storageconference.us/2010/Papers/MSST/Shvachko.pdf
Kafka Event Bus: https://notes.stephenholiday.com/Kafka.pdf
Flink: https://asterios.katsifodimos.com/assets/publications/flink-deb.pdf
https://interviewready.io/blog/white-papers-worth-reading-for-software-engineers
Google
Google File System: https://static.googleusercontent.com/media/research.google.com/en//archive/gfs-sosp2003.pdf
Map Reduce Big Data Algorithm: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/16cb30b4b92fd4989b8619a61752a2387c6dd474.pdf
BigTable NoSQL Document Store: https://static.googleusercontent.com/media/research.google.com/en//archive/bigtable-osdi06.pdf
Colossus Next Gen File Store: https://cloud.google.com/blog/products/storage-data-transfer/a-peek-behind-colossus-googles-file-system
Megastore
Large Object Store: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/36971.pdf
Monarch Time Series DB: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/d84ab6c93881af998de877d0070a706de7bec6d8.pdf
Chubby Distributed Lock Management: https://static.googleusercontent.com/media/research.google.com/en//archive/chubby-osdi06.pdf
Spanner Distributed Database: https://static.googleusercontent.com/media/research.google.com/en//archive/spanner-osdi2012.pdf
Dapper Tracing System: https://static.googleusercontent.com/media/research.google.com/en//archive/papers/dapper-2010-1.pdf
Borg Cluster Management: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/43438.pdf
Zanzibar Authentication System: https://storage.googleapis.com/pub-tools-public-publication-data/pdf/10683a8987dbf0c6d4edcafb9b4f05cc9de5974a.pdf
Pregel Graph Processing: https://15799.courses.cs.cmu.edu/fall2013/static/papers/p135-malewicz.pdf
Amazon
Amazon Aurora DB Architecture: https://web.stanford.edu/class/cs245/readings/aurora.pdf
Dynamo DB NoSQL Database: https://www.allthingsdistributed.com/files/amazon-dynamo-sosp2007.pdf
Misc
Foundation DB NewSQL database: https://www.foundationdb.org/files/fdb-paper.pdf
Monolith Embedding in real time: https://arxiv.org/pdf/2209.07663.pdf
Meta
Gorilla Time series DB: https://www.vldb.org/pvldb/vol8/p1816-teller.pdf
Cassandra NoSQL DB: https://www.cs.cornell.edu/projects/ladis2009/papers/lakshman-ladis2009.pdf
FlexiRaft Distributed Concensus Tradeoffs: https://www.cidrdb.org/cidr2023/papers/p83-yadav.pdf
Memcache In-memory Cache: https://www.usenix.org/system/files/conference/nsdi13/nsdi13-final170_update.pdf
Millisampler Network Sampling: https://dl.acm.org/doi/pdf/10.1145/3517745.3561430
TAO Graph Database: https://www.usenix.org/system/files/conference/atc13/atc13-bronson.pdf
MineSweeper - Root Cause Analysis: https://arxiv.org/pdf/2010.09974.pdf
Prophet - Forecasting at Scale: https://peerj.com/preprints/3190.pdf
ShardManager: https://research.facebook.com/file/245575980870853/Shard-Manager-A-Generic-Shard-Management-Framework-for-Geo-distributed-Applications.pdf
Apache
Hadoop File System: https://storageconference.us/2010/Papers/MSST/Shvachko.pdf
Kafka Event Bus: https://notes.stephenholiday.com/Kafka.pdf
Flink: https://asterios.katsifodimos.com/assets/publications/flink-deb.pdf