Skip to content

Latest commit

 

History

History
198 lines (149 loc) · 7.74 KB

internals.markdown

File metadata and controls

198 lines (149 loc) · 7.74 KB
layout title group id
page
Internals
navigation
internals

Internals

This page contains resources for current faculty and students.

Wiki

Go to dada.cs.washington.edu/mediawiki/index.php/Database_Group.

Mailing Lists

UW-only lists:

  • uw-db: main general-interest database list
  • uw-db-local: internal main general-interest database list
  • db-students: mailing list for database students
  • uw-systems: main general-interest systems list
  • ai-hackers: main general-interest AI list
  • nwds: Northwest Database Society (NWDS) list

Important external mailing lists:

DBWORLD: general conference and database-interest announcements. Subscribe by sending "subscribe" to [email protected].

General Exam Reading List

Basic Textbooks

  • H. Garcia-Molina, J. Ullman and J. Widom. Database Systems: The Complete Book.
    • Chapter 3: Design Theory for Relational Databases.

Datalog

Data Integration

Data Models

Data Warehousing and Mining

Mathematical Foundations

  • S. Abiteboul, R. Hull, V. Vianu. Foundations of Databases.
    • Chapter 8 (Functional and Join Dependency); Chapter 9 (Inclusion Dependency), except proofs.
    • Chapter 6, Sections 6.2 (Global Optimizations) and 6.4 (Computing with Acyclic Joins).
  • J. Ullman. Database and Knowledge Base Systems, vol. I.
    • Chapter 3 (Logic as a Data Model).
  • J. Ullman. Database and Knowledge Base Systems, vol. II.
    • Chapters 12 (Top-Down Evaluation), 13 (Magic Sets), 14 (Containment).
  • M. Y. Vardi. Constraint Satisfaction and Database Theory: A Tutorial. PODS 2000, pp. 76-5.

Parallel and Distributed Databases

  • T. Oszu, P. Valduriez. Principles of Distributed Database Systems, 2nd ed.
    • Chapter 4 (Distributed Database Systems), pp. 82-99, skimming basic terminology; Chapter 5 (Distributed Database Design), pp. 107-154, skimming examples, algorithms, and Section 5.4.3; Chapter 13 (Parallel Database Systems), pp. 420-452.
  • D. Kossman. [The State of the Art in Distributed Query Processing.] (http://portal.acm.org/citation.cfm?id=371598) ACM Computing Surveys 32(4), 2000, pp. 418-469.
  • Jeffrey Dean and Sanjay Ghemawat. [MapReduce: Simplified Data Processing on Large Clusters.] (http://research.google.com/archive/mapreduce.html) OSDI'04: Sixth Symposium on Operating System Design and Implementation.

Query Execution

  • G. Graefe. Query Evaluation Techniques for Large Databases. ACM Computing Surveys 25(2), 1993, pp. 73-170.
    • (Sections 1-5)
  • A. Ailamaki, D. J. DeWitt, M. D. Hill, D. A. Wood. DBMSs on a Modern Processor: Where Does Time Go? VLDB 1999.

Query Optimization

  • S. Chaudhuri. An Overview of Query Optimization in Relational Systems. PODS 1998.
  • P. Selinger, M. Astrahan, D. Chamberlin, R. Lorie, T. Price. Access Path Selection in a Relational Database Management System. SIGMOD 1979.

Ranking and Information Retrieval

  • Ronald Fagin, Amnon Lotem, Moni Naor. Optimal aggregation algorithms for middleware. JCSS 66(4): 614-656 (2003)

Sampling, Approximate Query Processing, Database Statistics

  • Graham Cormode, Minos N. Garofalakis, Peter J. Haas, Chris Jermaine. Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches. Foundations and Trends in Databases 4(1-3): 1-294 (2012)
    • Chapters 2 and 3

Stream Processing

  • Brian Babcock, Shivnath Babu, Mayur Datar, Rajeev Motwani, Jennifer Widom. Models and Issues in Data Stream Systems. PODS 2002: 1-16.
  • D. Abadi, D. Carney, U. Cetintemel, M. Cherniack, C. Convey, S. Lee, M. Stonebraker, N. Tatbul, and S. Zdonik. Aurora: A New Model and Architecture for Data Stream Management. In VLDB Journal (12)2, 2003.

Systems

Transaction Processing

  • P. A. Bernstein, E. Newcomer. Principles of Transaction Processing, 2nd ed.
    • Chapter 1 (Introduction), Chapter 9 (Two-Phase Commit), Chapter 6 (Locking) --the new version, Chapter 8 (Database System Recovery).
  • Michael J. Franklin. [Concurrency Control and Recovery.] (http://zoo.cs.yale.edu/classes/cs637/franklin97concurrency.pdf) The Handbook of Computer Science and Engineering, A. Tucker, ed., CRC Press, Boca Raton, 1997.
  • Rick Cattell. [Scalable SQL and NoSQL Data Stores.] (http://cattell.net/datastores/Datastores.pdf) SIGMOD Record, December 2010.
  • M. Stonebraker, S. Madden, D.J. Abadi, S. Harizopoulos, N. Hachem, and P. Helland. [The end of an Architectural Era: (It’s Time for a Complete Rewrite.)] (http://nms.csail.mit.edu/~stavros/pubs/hstore.pdf) VLDB ’07: Proceedings of the 33rd international conference on Very large data bases, 2007, pp. 1150-1160.
  • James C. Corbett, Jeffrey Dean, Michael Epstein, Andrew Fikes, Christopher Frost, JJ Furman, Sanjay Ghemawat, Andrey Gubarev, Christopher Heiser, Peter Hochschild, Wilson Hsieh, Sebastian Kanthak, Eugene Kogan, Hongyi Li, Alexander Lloyd, Sergey Melnik, David Mwaura, David Nagle, Sean Quinlan, Rajesh Rao, Lindsay Rolig, Yasushi Saito, Michal Szymaniak, Christopher Taylor, Ruth Wang, and Dale Woodford. [Spanner: Google's Globally-Distributed Database.] (http://research.google.com/archive/spanner.html) OSDI'12: Tenth Symposium on Operating System Design and Implementation