This is an advanced course on principles of database systems. Main topics include overview of the relational data model and relational query languages; recursive queries, query processing and optimization; database design, dependencies, normal forms, and the chase procedure. You will also see how some of these topics are applied in the design of a system for information integration.
Main Reference: Foundations of Databases by S. Abiteboul, R. Hull and V. Vianu (Addison-Wesley 1995, ISBN: 0-201-53771-0)
Tentative Syllabus:
| Topic | References |
| History, background on the relational model, relational query languages (1-2 lectures) | Chapters 1 - 3, 4.1 - 4.2 |
| Optimization of conjunctive queries (1-2 lectures) | Chapter 6.2 Optimal Implementation of Conjunctive Queries in Relational Data Bases Optimization of Real Conjunctive Queries |
| Dependencies (2-3 lectures) | Chapters 8.1 - 8.3 |
| Normal forms (1 lecture) | Chapter 11.2 |
| Chase with embedded dependencies, query optimization and logical implication with chase (2-3 lectures) | Chapters 8.4, 10 A Proof Procedure for Data Dependencies A Theory of Data Dependencies: A Survey |
| Information integration | |
| Invited speaker: Laura Haas, IBM Almaden Research Center Title of talk: Beauty and the Beast: The Theory and Practice of Information Integration E2-506, Monday November 5, 11am-12 noon |
Beauty and the Beast: The Theory and Practice of Information Integration Model Management 2.0—Manipulating Richer Mappings |
| Overview of some papers related to an information integration system | |
| 1. Translating Web Data, the Clio System (1 lecture) |
The Clio Project: Managing Heterogeneity
Translating Web Data
Clio Grows Up: From Research Prototype to Industrial Tool |
| 2. Various visual programming tools for mappings, Nested Mappings (1 lecture) |
Nested Mappings: Schema Mappings Reloaded Altova MapForce DataDirect StylusStudio |
3. Making information integration systems more usable
|
Debugging Schema Mappings with Routes |
| 4. The theory behind relational data exchange (1 lecture) | Data Exchange: Semantics and Query Answering |
| Invited talk: High Performance Storage at Los Alamos National Lab
E2-506, Tuesday November 20, 3PM |
|