8โ€“12 Sept 2025
Hamburg, Germany
Europe/Berlin timezone

Applying Transaction Processing Techniques to Large Scale Analysis

Not scheduled
30m
Hamburg, Germany

Hamburg, Germany

Poster Track 1: Computing Technology for Physics Research Poster session with coffee break

Speaker

Benjamin Galewsky (Univ. Illinois at Urbana Champaign (US))

Description

Data analysts working with large datasets require absolute certainty that each file is processed exactly once. ServiceX addresses this challenge by using well established transaction processing architectures. This system implements a fully transactional workflow powered by PostgreSQL and RabbitMQ, ensuring data integrity throughout the processing pipeline. This presentation details both the infrastructure design that enables these transactional guarantees and the techniques we used to identify and remediate intermittent transaction leaks, resulting in a reliable system that operates consistently at scale.

Significance

This talk shows how the application of tools and techniques from the broader computer science field can improve the productivity of physics analysis.

Author

Benjamin Galewsky (Univ. Illinois at Urbana Champaign (US))

Co-author

Peter Onyisi (University of Texas at Austin (US))

Presentation materials

There are no materials yet.