Indico has been upgraded to version 3.1. Details in the SSB
Nov 4 – 8, 2019
Adelaide Convention Centre
Australia/Adelaide timezone

COFFEA - Columnar Object Framework For Effective Analysis

Nov 5, 2019, 12:15 PM
Hall G (Adelaide Convention Centre)

Hall G

Adelaide Convention Centre

Oral Track 6 – Physics Analysis Track 6 – Physics Analysis


Nick Smith (Fermi National Accelerator Lab. (US))


The COFFEA Framework provides a new approach to HEP analysis, via columnar operations, that improves time-to-insight, scalability, portability, and reproducibility of analysis. It is implemented with the Python programming language and commodity big data technologies such as Apache Spark and NoSQL databases. To achieve this suite of improvements across many use cases, COFFEA takes a factorized approach, separating the analysis implementation and data delivery scheme. All analysis operations are implemented using the NumPy or awkward-array packages which are wrapped to yield user code whose purpose is quickly intuited. Various data delivery schemes are wrapped into a common front-end which accepts user inputs and code, and returns user defined outputs. We will present published results from analysis of CMS data using the COFFEA framework along with a discussion of metrics and the user experience of arriving at those results with columnar analysis.

Consider for promotion No

Primary authors

CMS Collaboration Nick Smith (Fermi National Accelerator Lab. (US))

Presentation materials