|
DIAS
|
SPRING 2010
Advanced Topics in Database Systems
Download the Course Poster
Course Objectives
Database systems today are more ubiquitous than ever. With the explosion of information, new requirements for fast and reliable data management have emerged and database systems support the backend of most imternat applications. Managing terabytes of information is now routine, petabytes a frequent necessity, and exabytes around the corner. On top of that, database systems now operate on hardware that has intelligence on its own, makes decisions and predicts what the software will do. All of the above open new horizons for research on database system design and performance; this course covers system and performance issues in today's database system design. Topics include query processing and optimization, concurrency control, smart and efficient benchmarking, modern query processing algorithms for internet applications, interaction between the database software and the underlying hardware, and other topics related to database system performance. The course is intended for students who (a) have taken database courses before and want to know what is new and sizzling in the database community; or (b) are looking for a Ph.D. topic; or (3) are already involved in a project that needs database expertise, and want to learn more about it. The incarnation of the course each semester it is offered will differ than that of other semesters in (a) the material, which will be adapted to new topics and (b) the structure, which will be adapted according to the students' needs and the instructor's experience.
Content
In the Spring of 2010 offering of the course we will read and discuss papers from two major areas of data management systems research:
- Database performance and scalability issues on modern hardware platforms. We will discuss query processing algorithms for deep memory hierarchies and aggressively parallel systems based on multicore chips, as well as storage and the new, popular flash devices.
- Data management support for scientific applications. Except for their sheer size, scientific dataset pose several new challenges to data management. We will discuss recent research on (a) methods for automation of physical database design, i.e., schema and indexes, as well as (b) incorporation of physical models and specialized data structures (such as tetrahedral meshes) as first-class citizens into the database.
The tentative list of topics is as follows:
- Schema Mapping/Matching
- Data Provenance
- Peer-to-peer DBs/Mobile DBs
- Data Streams/Sensor DBs
- Cloud Computing / Data Warehousing
- Column Stores
- Automated Physical Design
We are of course also open to suggestions from course participants. The course will consist from papers which we will all read and have presented in the class, either by the instructor or by the students.
There will be a list of project ideas given out, but students can suggest and work on their own ideas with potentials for advancing the state of the art.
Required prior knowledge
basic database management systems (undergraduate level, e.g. B-trees, joins, sorting, etc.), C/C++ (depending on the choice of project), data structures and algorithms
Course URL
Updates to the program and all course material are posted on the web page of the course at http://moodle.epfl.ch/course/view.php?id=4541.
(much more general) EDIC entry of the course description: Advanced Topics in Database Systems [en]
Keywords
database management systems, chip multiprocessors, cache hierarchies, flash disks, scientific databases, automated database design
FALL 2008
Advanced Databases
Objectives
This course is intended for those students who aim at being capable of working on new database applications using advanced up to date technology. It covers a wide spectrum of new technologies related to data management.
Content
1. Object-oriented & Object-relational database management systems (DBMSs). Case study: Oracle
2. Database optimization
3. Databases in a distributed environment: distributed databases, federated databases, multidatabases. Case study
4. Database design in cooperative systems: database integration
5. DB and the web
6. DB & XML
7. DB & Ontologies
8. Spatio-temporal information systems
9. Mobility and Location-Based Services
10. Data Warehousing, Data Mining (time permitting)
Required prior knowledge
Databases
Type of teaching
Ex cathedra; exercises in class; projects
Note
Course URL: Updates to the program and all course material are posted on the web page of the course at www.moodle.com.
(much more general) EDIC entry of the course description: Advanced Databases [en]
Bibliography
1. Database Systems - The Complete Book, H.Garcia-Molina, J.D.Ullman, J,Widom, Prentice Hall, 2002
2. The Object Data Standard: ODMG 3.0 (The Morgan Kaufmann Series in Data Management Systems),by Rick Catell (Editor), 2000
3. Principles of Distributed Database Systems, M.T.Ozsu, P.Valduriez, 2nd Edition, Prentice-Hall, 1999
4. Handbook On Ontologies, S.Staab, R.Studer (Editors), Springer, 2004
5. Conceptual Modeling for Traditional and Spatio-Temporal Applications - The MADS Approach, C.Parent, S. Spaccapietra, E. Zimanyi, Springer, 2006
6. J.Ullman,J.Widom: "A First Course in Database Systems", Prentice Hall Int., 1997
7. R. Elmasri & S. Navathe: " Fundamentals of Database Systems ", Benjamin-Cummings, 3rd edition, 2000
8. C. Date: " An introduction to database systems " Addison Wesley, vol. 1-2, 7th edition, 2000
|