Seminar on Non-Traditional Data Management (NoSQL and more)

Winter Semester 2012

Dr. Rainer Gemulla
Dr.-Ing. Sebastian Michel

Seminar regulations

Place/time of the Seminar

  • Place: Room 001, Building E1.7
  • Time: Monday, 16:15-18:00

How to Register?

Once the list of papers (talks) is available, send us an email stating your name, your matriculation number, and a list of at least 3 preferred papers. The deadline for registration is Sep 16, 2012.
Please send the email to Rainer Gemulla and Sebastian Michel, with [dbseminar2012] in the subject. After the deadline, we will let you know within a couple of days whether or not you have got a place in the seminar and, if so, which topic has been assigned to you.

Attending the kickoff meeting is mandatory.


In this seminar, you will:
  • Read, understand, and explore scientific literature
  • Summarize a current research topic in a concise report (5 pages)
  • Give a full presentation about your topic (45 minutes)
  • Give a flash presentation about your topic (5 minutes)
  • Moderate a scientific discussion about the topic of one of your fellow students

Requirements for the Certificate

  • Prepare a 45-minute presentation about your topic to introduce it to your fellow students.
  • Make a first appointment with your tutor (who will be announced along with the topics) to discuss the outline of your presentation at least 4 weeks in advance of your presentation. You are responsible for scheduling meetings with your tutor.
  • Point out advantages or potential weaknesses of the work covered in your presentation. If you are unsure about what to present, talk to your tutor. Note that even though relevant presentations may be available on the web, we expect you to prepare your own slides (which may, of course, be inspired by the original slides). Send your slides to your tutor and discuss them at least 2 weeks before your talk; otherwise, your talk may be canceled.
  • Each presentation is followed by approximately 15 minutes of discussion. The discussion is moderated by a second student. The moderator's role is to provide interesting input (such as observations, questions, related work) for the discussion and, in general, to enable a constructive discussion. A preliminary version of the presenter's slides will be sent to the moderator on the Thursday before the presentation.
  • Three weeks after your talk, submit a short report (not longer than 5 pages) about your topic. The report should concisely summarize the article and point out strengths and weaknesses.
  • In our last meeting, give a 5-minute flash presentation about your topic. As before, discuss your slides with your tutor at least 2 weeks before the presentation.
  • Attend all presentations, not just your own. If you are ill, let us know in advance.
  • Actively participate in the discussions.
  • Slides, presentations, and reports must be prepared in English.
  • Your final grade is influenced by: your oral presentations, your knowledge about your topic (e.g., as shown in the discussion after your presentation), your performance as a moderator, your general participation in the seminar, and your written report.

Contents of the Seminar

Recent topics in non-traditional data management, with emphasis on what is commonly known as the NoSQL direction.


  • October 22, 2012: Kick-off meeting

  • November 12, 2012: NoDB
    • Presenter: Marc Schmit
    • Moderator: Sviat Bogachev
    • Tutor: Sebastian Michel
    • Ioannis Alagiannis, Renata Borovica, Miguel Branco, Stratos Idreos, Anastasia Ailamaki: NoDB: efficient query execution on raw data files. SIGMOD 2012:241-252

  • November 19, 2012: CrowdDB
    • Presenter: Patrick Bender
    • Moderator: Kanimozhi Duraikannu
    • Tutor: Rainer Gemulla
    • Michael J. Franklin, Donald Kossmann, Tim Kraska, Sukriti Ramesh, Reynold Xin: CrowdDB: answering queries with crowdsourcing. SIGMOD 2011:61-72

  • November 26, 2012: Graphs
    • Presenter: Prashant Yadava
    • Moderator: Patrick Bender
    • Tutor: Rainer Gemulla
    • Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin, Joseph M. Hellerstein: Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud. PVLDB 2012.

  • December 03, 2012: Modern architectures
    • Presenter: Edit Kapcari
    • Moderator: Prashant Yadava
    • Tutor: Rainer Gemulla
    • Changkyu Kim, Jatin Chhugani, Nadathur Satish, Eric Sedlar, Anthony Nguyen, Tim Kaldewey, Victor Lee, Scott Brandt, Pradeep Dubey: FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs

  • December 10, 2012: NoSQL
    • Presenter: Sviat Bogachev
    • Moderator: Patrick Commercon
    • Tutor: Sebastian Michel
    • Jason Baker, Chris Bond, James Corbett, J. J. Furman, Andrey Khorlin, James Larson, Jean-Michel Leon, Yawei Li, Alexander Lloyd, Vadim Yushprakh: Megastore: Providing Scalable, Highly Available Storage for Interactive Services. CIDR 2011:223-234

  • December 17, 2012: RDF
    • Presenter: Kanimozhi Duraikannu
    • Moderator: Sarah Reszow
    • Tutor: Rainer Gemulla
    • Jiewen Huang, Daniel J. Abadi, Kun Ren: Scalable SPARQL Querying of Large RDF Graphs. PVLDB 4(11): 1123-1134 (2011)

  • January 07, 2013: Rapid scalability
    • Presenter: Nina Istomina
    • Moderator: Marc Schmit
    • Tutor: Sebastian Michel
    • Michael Armbrust, Kristal Curtis, Tim Kraska, Armando Fox, Michael J. Franklin, David A. Patterson: PIQL: Success-Tolerant Query Processing in the Cloud. PVLDB 2012.

  • January 14, 2013: Query interfaces
    • Presenter: Yannick Baron
    • Moderator: Nina Istomina
    • Tutor: Sebastian Michel
    • Lukas Blunschi, Claudio Jossen, Donald Kossmann, Magdalini Mori, Kurt Stockinger: SODA: Generating SQL for Business Users. PVLDB 5(10): 932-943 (2012)

  • January 21, 2013: Canceled

  • January 28, 2013: Canceled

  • February 4, 2013: Summary of the seminar with 5-minute flash talks.