CS 530 Database Systems II (FSS 2015)

Organization

  • Lecturer: Prof. Dr. Rainer Gemulla
  • Tutors: Marius Eich, Fisnik Kastrati
  • Type of course: Lecture and practical exercises (6 ECTS points)
  • Lecture: Tuesday, 12:00-13:30, B6 - A 101
  • Tutorial: Wednesday, 8:30-10:00, A5 - B 243
  • Evaluation: Final exam or oral examination, regular exercises
  • Prerequisites: Database Systems I or equivalent

Content

This lecture introduces the fundamental concepts and computational paradigms of large-scale data management and Big Data. We first look at data warehouses and parallel databases and then proceed to modern Big Data technologies. The course covers concepts, algorithms, and system issues; accompanying exercises provide hands-on experience.

Tentative list of topics:

  • Parallel and distributed databases
  • Data Warehousing
  • MapReduce and its ecosystem
  • XML
  • NoSQL
  • Streams
  • Graphs

Lecture Notes

  • 00 Organization (pdf, updated: 17.2.)
  • 01 Introduction (pdf)
  • 02 Parallel and distributed database systems (pdf, updated: 26.5.)
  • Extra: User defined functions (pdf)
  • 03 Data Warehousing (pdf, updated: 1.6.)
  • 04 MapReduce (pdf, updated: 5.5.)
  • 05 Pig (pdf)
  • 06 NoSQL (pdf, updated: 26.5.)
  • 07 Wrapup (pdf)

Exercises

Literature

  • H. Garcia-Molina, J. D. Ullman, J. Widom
    Database Systems: The Complete Book
    Prentice Hall, 2nd ed., 2008
  • A. Kemper, A. Eickler
    Datenbanksysteme: Eine Einführung (German only)
    De Gruyter Oldenbourg, 9th ed., 2013
  • T. Özsu, P. Valduriez
    Principles of Distributed Database Systems
    Springer, 3rd ed., 2011
  • T. White
    Hadoop – The Definitive Guide
    O’Reilly, 3rd ed., 2012
  • J. Lin, C. Dyer
    Data-Intensive Text Processing with MapReduce
    Morgan and Claypool, 1st ed., 2010
  • C. Strauch
    NoSQL databases
    Stuttgart Media University, 2011
  • E. Redmond, J. R. Wilson
    Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL Movement
    Pragmatic Bookshelf, 1st ed., 2012
  • P. J. Sadalage, M. Fowler
    NoSQL Distilled
    Addison-Wesley, 2012
  • More in lecture notes