28–29 May 2013
CERN
Europe/Zurich timezone

Array Analytics - Concepts, Codes, and Challenges

28 May 2013, 17:00
20m
60/6-015 - Room Georges Charpak (Room F) (CERN)

Speaker

Peter Baumann (Jacobs University)

Description

(This talk is proposed for the Array Analytics session) After a long period of neglect by database research, arrays are now recognized as a core data structure in science and engineering, and indeed as a main representative of Big Data in these domains. However, array data and access to them are only part of the picture: today's requirements on server-side processing capabilities are high and often transcend classical query language concepts. Array Database research is therefore not bound to traditional database conceptualizations; it is tightly intertwined with related domains such as image and signal processing, statistics, supercomputing, and visualization, which justifies the more general characterization as Array Analytics. This recognition has sparked research and implementations such as rasdaman, SciQL, SciDB, and PostGIS Raster. In our talk, we give an overview of the field of Array Analytics from a database perspective. We address formalisms, discuss different conceptualizations such as "array-as-table" and "array-as-attribute", exemplify array querying and optimization, and present architectural approaches to array storage and processing. Real-life use cases illustrate the relevance of the field. Finally, we inspect standardization efforts on Array Analytics. In doing so, we identify open issues and research directions.
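
As a rough illustration of the "array-as-table" and "array-as-attribute" conceptualizations named in the abstract, the following minimal Python sketch models the same small array both ways and runs an equivalent range selection on each. It is not taken from the talk or from any of the systems mentioned; the data, the names `cells_table` and `images_table`, and both helper functions are hypothetical and chosen only for this example.

```python
# Hypothetical 2 x 3 array used only for illustration.
grid = [
    [1, 2, 3],
    [4, 5, 6],
]

# "Array-as-table" (assumed reading): every cell becomes one row
# (x, y, value) of a relation.
cells_table = [
    {"x": x, "y": y, "value": v}
    for x, row in enumerate(grid)
    for y, v in enumerate(row)
]

# "Array-as-attribute" (assumed reading): the whole array is a single
# attribute value inside one row of an ordinary table.
images_table = [{"id": 1, "pixels": grid}]


def subset_as_table(rows, y_lo, y_hi):
    """Select cell values whose y index lies in [y_lo, y_hi] via a row filter."""
    return [r["value"] for r in rows if y_lo <= r["y"] <= y_hi]


def subset_as_attribute(rows, image_id, y_lo, y_hi):
    """Slice the array stored in the 'pixels' attribute of the matching row."""
    for r in rows:
        if r["id"] == image_id:
            return [row[y_lo:y_hi + 1] for row in r["pixels"]]
    return None


print(subset_as_table(cells_table, 1, 2))
print(subset_as_attribute(images_table, 1, 1, 2))
```

In this sketch the table form returns a flat list of matching cell values, while the attribute form returns a smaller array that keeps its two-dimensional shape; that difference is one reason the two conceptualizations may lead to different query and storage designs.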

Author

Peter Baumann (Jacobs University)
