Multimedia Similarity Search (Thomas Seidl)

Monday, March 6, 2017, 13:30 - 17:00
The tutorial presents current methods for similarity search in multimedia databases. It covers a variety of data types, such as color images and videos as well as time series, trajectories, and geometric shapes. The technical scope ranges from object representations through similarity models and efficient algorithms for similarity search to data mining tasks such as clustering and classification.
The similarity models covered include those that operate directly on object representations, as for example in kernel-based methods, as well as embeddings of complex objects into vector spaces. The algorithms for similarity search draw on database techniques such as multi-step query processing, approximations, and index structures. In the area of data mining, the tutorial covers techniques that build on the similarity models mentioned above.

Scalable Data Management: An In-Depth Tutorial on NoSQL Data Stores (Felix Gessert, Wolfram Wingerath, Norbert Ritter)

Tuesday, March 7, 2017, 09:00 - 12:30
The unprecedented scale at which data is consumed and generated today has created a large demand for scalable data management and given rise to non-relational, distributed “NoSQL” database systems. Two central problems triggered this process: 1) vast amounts of user-generated content in modern applications, with the resulting request loads and data volumes, and 2) the desire of the developer community to employ problem-specific data models for storage and querying. To address these needs, various data stores have been developed by both industry and research, arguing that the era of one-size-fits-all database systems is over. The heterogeneity and sheer number of these systems – now commonly referred to as NoSQL data stores – make it increasingly difficult to select the most appropriate system for a given application. Therefore, these systems are frequently combined in polyglot persistence architectures to leverage each system in its respective sweet spot.
This tutorial gives an in-depth survey of the most relevant NoSQL databases to provide a comparative classification and highlight open challenges. To this end, we analyze the approach of each system to derive its scalability, availability, consistency, data modeling and querying characteristics. We present how each system’s design is governed by a central set of trade-offs over irreconcilable system properties. We then cover recent research results in distributed data management to illustrate that some shortcomings of NoSQL systems could already be solved in practice, whereas other NoSQL data management problems pose interesting and unsolved research challenges. Going beyond earlier tutorials, we explicitly address how the quickly emerging topic of processing and storing massive amounts of data in real time can be handled by different types of real-time data management systems.