24th International Conference on Computing in High Energy & Nuclear Physics

Name: 24th International Conference on Computing in High Energy & Nuclear Physics
Start: 2019-11-04T08:00:00+10:30
End: 2019-11-08T13:00:00+10:30
Location: Adelaide Convention Centre

Australia/Adelaide timezone

Open data provenance and reproducibility: a case study from publishing CMS open data

5 Nov 2019, 14:00

15m

Riverbank R1 (Adelaide Convention Centre)

Oral, Track 8 – Collaboration, Education, Training and Outreach

Speaker: Tibor Simko (CERN)

In this paper we present the latest CMS open data release published on the CERN Open Data portal. Samples of raw datasets, as well as collision and simulated datasets, were released together with detailed information about data provenance. The data production chain covers the necessary compute environments, the configuration files and the computational procedures used in each data production step. We describe the data curation techniques used to obtain and publish the data provenance information, and we study the possibility of reproducing parts of the released data using the publicly available information. The present work demonstrates the usefulness of releasing selected samples of raw and primary data, so that the information about the data production chain is complete for general data scientists and other non-specialists interested in using particle physics data for education or research purposes.

Primary authors: CMS Collaboration and CERN IT; Tibor Simko (CERN)

Presentation materials: chep2019-opendata-cms-slides.pdf
