2nd S2I2 HEP/CS Workshop
Princeton University
The worldwide particle physics community is currently planning upgrades to the Large Hadron Collider (LHC) at CERN in Geneva. The LHC today already uses a worldwide distributed computing grid to meet the needs of thousands of scientists to process and analyze some of the world's largest scientific datasets. The upgrades being planned will increase data volumes by more than two orders of magnitude and require significantly more complex data and analysis techniques.
This 2nd S2I2 HEP/CS workshop aims to bring together a diverse set of attendees from the high energy physics (HEP) and computer science (CS) communities to understand how the two communities could work together in the context of a future NSF Software Institute aimed at supporting particle physics research over the long term. We will build on the discussions which took place at the the first S2I2 HEP/CS workshop and take a fresh look at planned HEP and computer science research and brainstorm about engaging specific areas of effort, perspectives, synergies and expertise of mutual benefit to HEP and CS communities, especially as it relates to a future NSF Software Institute for HEP.
Discussions and sessions include Science Practices & Policies, Sociology and Community Issues, Machine Learning, Software Life Cycle / Software Engineering / Software/Data/Workflow Preservation & Reproducibility, Scalable Platforms, Data Management, Access, Distribution, Organization, Data Intensive Analysis Tools and Techniques, Visualization, Data Streaming and Training, Education, Professional Development, Advancement.
The meeting rooms at Princeton are:
- Monday (plenary) - Lewis Library 120 (Vidyo link)
- Tuesday (parallel sessions) - McDonnell 103, Jadwin Hall A06, Jadwin Hall 475, Jadwin Hall 111
- Wednesday (plenary) - Lewis Library 138 (Vidyo link)
Useful links:
Sponsors
This event is being organised in part by the S2I2-HEP Conceptualization project, including travel support for some participants. The S2I2-HEP project is supported by National Science Foundation grants ACI-1558216, ACI-1558219, and ACI-1558233.
-
-
12:00 PM
Lunch
-
1
Workshop Introduction Lewis Library 120
Lewis Library 120
-
2
The S2I2-HEP Conceptualization Project Lewis Library 120
Lewis Library 120
Speaker: Peter Elmer (Princeton University (US)) -
3
Summary of first S2I2 HEP/CS workshop at NCSA/UIUC Lewis Library 120
Lewis Library 120
Speaker: Mark Neubauer (Univ. Illinois at Urbana-Champaign (US)) -
4
Building Communities with the Open Science Grid Lewis Library 120
Lewis Library 120
Speaker: Frank Wuerthwein (Univ. of California San Diego (US)) -
5
Software and Programming Support for Computational Research at Princeton Lewis Library 120
Lewis Library 120
Speaker: Ian Cosden (Princeton University) -
6
Program Director Perspectives on the High-Energy Physics Institute Lewis Library 120
Lewis Library 120
Speakers: Rajiv Ramnath (National Science Foundation), Vipin Chaudhary (National Science Foundation) -
2:50 PM
Coffee Break
-
7
Science Practices & Policies, Sociology and Community Issues (Intro) Lewis Library 120
Lewis Library 120
-
8
Understanding Scientific Collaboration Lewis Library 120
Lewis Library 120
Speaker: Charlotte Lee (University of Washington) -
9
Collaborations and Communities in High Energy Physics Lewis Library 120
Lewis Library 120
Speaker: Michael David Sokoloff (University of Cincinnati (US)) -
10
The Large Synoptic Survey Telescope (LSST) Lewis Library 120
Lewis Library 120
Speaker: Robert Lupton (Princeton University) -
11
Center for Research in Open Source Software (CROSS) Lewis Library 120
Lewis Library 120
Speaker: Carlos Maltzahn (University of California - Santa Cruz) -
12
Scientific Computing in the Clouds Lewis Library 120
Lewis Library 120
Speaker: Karan Bhatia (Google) -
5:30 PM
Reception - Prospect House
-
12:00 PM
-
-
13
Parallel Session - Data Management, Access and Organisation / Data Streaming Jadwin Hall 475
Jadwin Hall 475
-
a) IntroductionSpeakers: Oliver Gutsche (Fermi National Accelerator Lab. (US)), Tanu Malik (Depaul)
-
-
14
Parallel Session - Machine Learning, Algorithms Jadwin Hall 111
Jadwin Hall 111
-
a) IntroductionSpeaker: Sergei Gleyzer (University of Florida (US))
-
b) Lightning Talk: Emerging trends in software for statistics and machine learningSpeaker: Kyle Stuart Cranmer (New York University (US))
-
c) Lightning Talk: Kalman Filter based Tracking ReconstructionSpeaker: Dr Matthieu Lefebvre (Princeton University (US))
-
d) Lightning Talk: ML for Pattern Recognition in HEPSpeaker: Paolo Calafiura (Lawrence Berkeley National Lab. (US))
-
e) Lightning Talk: Optimization of distributed systems from the network point of view using machine learningSpeaker: Harvey Newman (California Institute of Technology (US))
-
f) Lighnint Talk: End-to-end reconstructon and classification in HEP with deep learningSpeaker: Michael Andrews (Carnegie-Mellon University (US))
-
g) Lighning talk: Data Management and MLSpeaker: Valentin Y Kuznetsov (Cornell University (US))
-
i) Lighning Talk: Learnings from Industry: Tooling, Learning from Massive Datasets, & Software Quality @SoundCloudSpeaker: Meghan Kane
-
j) Lightning Talk: Machine Learning As a ServiceSpeaker: Ilija Vukotic (University of Chicago (US))
-
k) Lightning Talk: Neural Network Optimization for PhysicsSpeaker: Fernanda Psihas (Indiana University)
-
l) Lightning Talk: Edward: A library for probabilistic modeling, inference, and criticismSpeaker: Dustin Tran (Columbia University)
-
-
15
Parallel Session - Software Life Cycle / Software Engineering McDonnell Hall 103
McDonnell Hall 103
-
a) IntroductionSpeakers: Elizabeth Sexton-Kennedy (Fermi National Accelerator Lab. (US)), Jeffrey Carver (University of Alabama)
-
b) Lightning Talk: An Update on Software CitationSpeaker: Daniel S. Katz (University of Illinois)
-
c) Lightning Talk: Software AmoebasSpeakers: Douglas Thain, Douglas Thain (University of Notre Dame)
-
d) Lightning Talk: Generating All the Things: Using Code Generation to Transform Scientific Knowledge to Software ArtifactsSpeaker: Spencer Smith
-
-
10:30 AM
Coffee Break
-
16
Parallel Session - Data Management, Access and Organisation / Data Streaming Jadwin Hall 475
Jadwin Hall 475
-
a) Discussion
-
-
17
Parallel Session - Machine Learning, Algorithms Jadwin Hall 111
Jadwin Hall 111
-
a) Discussion
-
-
18
Parallel Session - Software Life Cycle / Software Engineering McDonnell Hall 103
McDonnell Hall 103
-
a) Discussion
-
b) Lightning Talk: Static Analysis ToolsSpeaker: Christopher Jones (Fermi National Accelerator Lab. (US))
-
-
12:30 PM
Lunch
-
19
Parallel Session - Data Intensive Analysis Tools & Visualization Jadwin Hall 111
Jadwin Hall 111
-
b) Lightning Talk: Constructing a ROOT-less workflow with python and HDF5Speaker: Matthew Bellis (Siena College)
-
c) Lightning Talk: Machine learning pipelines with Spark MLSpeaker: Dr Alexey Svyatkovskiy (Princeton University)
-
d) Lightning Talk: XENON1T, Open Source and PythonSpeaker: Christopher Tunnell (Enrico Fermi Institute-University of Chicago-Unknown)
-
e) Lightning Talk: Volumetric image analysis and visualization problems in neuroimagingSpeaker: Lawrence Frank (UCSD)
-
20
Parallel Session - Scalable Platforms Jadwin Hall A06
Jadwin Hall A06
-
a) IntroductionSpeakers: Douglas Thain (University of Notre Dame), Robert William Gardner Jr (University of Chicago (US))
-
b) Lightning Talk: Corralling Heterogeneous SystemsSpeaker: Gordon Watts (University of Washington (US))
-
c) Lightning Talk: Distributed Scalable Platforms Issues and ApproachesSpeaker: Harvey Newman (California Institute of Technology (US))
-
d) Lightning Talk: Services at the EdgeSpeaker: Robert William Gardner Jr (University of Chicago (US))
-
e) Lightning Talk: Towards 1000x with Heterogeneous, Programmable Hardware DatacenterSpeaker: Anton Burtsev (University of California - Irvine)
-
f) Lightning Talk: Non-determinism in applications at the exascale: impact on debugging and numerical reproducibilitySpeaker: Michela Taufer (University of Delaware)
-
g) Lightning Talk: Exploiting node-level parallelism at exascaleSpeaker: Prof. Sunita Chandrasekaran (University of Delaware)
-
h) Lightning Talk: Missing AbstractionsSpeaker: Daniel S. Katz (University of Illinois)
-
i) Lightning Talk: Do we need distributed computing?Speaker: Kaushik De (University of Texas at Arlington (US))
-
j) Lightning Talk: Developing non-LHC software and our Services-based infrastructure dreamSpeaker: Christopher Tunnell (Enrico Fermi Institute-University of Chicago-Unknown)
-
k) Lightning Talk: Fundamental Problems of Distributed SystemsSpeaker: Douglas Thain
-
l) Lightning Talk: Learnings from Industry: Tooling, Datasets, Productivity, & Software Quality @SoundCloudSpeaker: Meghan Kane (SoundCloud)
-
-
21
Parallel Session - Software/Data/Workflow Preservation & Reproducibility Jadwin Hall 475
Jadwin Hall 475
-
a) IntroductionSpeakers: Carlos Maltzahn (University of California - Santa Cruz), Mike Hildreth (University of Notre Dame (US))
-
b) Lightning Talk: Non-determinism in applications at the exascale: impact on debugging and numerical reproducibilitySpeaker: Michela Taufer (University of Delaware)
-
c) Lightning Talk: Recast, Reana, and HepData: infrastructure for reproducibility and reinterpretationSpeaker: Lukas Alexander Heinrich (New York University (US))
-
d) Lightning Talk: The Popper FrameworkSpeaker: Carlos Maltzahn (University of California - Santa Cruz)
-
-
3:00 PM
Coffee Break
-
22
Parallel Session - Data Intensive Analysis Tools, Visualization Jadwin Hall 111
Jadwin Hall 111
-
a) Discussion
-
-
23
Parallel Session - Scalable Platforms Jadwin Hall A06
Jadwin Hall A06
-
a) Discussion
-
-
24
Parallel Session - Software/Data/Workflow Preservation & Reproducibility Jadwin Hall 475
Jadwin Hall 475
-
a) Swift Workflows at ArgonneSpeaker: Justin Wozniak (Argonne National Lab)
-
b) CI for High Level Science GoalsSpeaker: Kyle Stuart Cranmer (New York University (US))
-
c) Discussion
-
-
13
-
-
25
Training, Education, Professional Development, Advancement Lewis Library 138
Lewis Library 138
-
26
Physics Analysis Training Model at the CMS Experiment Lewis Library 138
Lewis Library 138
Speaker: Sudhir Malik (University of Puerto Rico (PR)) -
27
Discussion - Training Lewis Library 138
Lewis Library 138
-
28
Summary - Software Life Cycle / Software Engineering Lewis Library 138
Lewis Library 138
Speakers: Elizabeth Sexton-Kennedy (Fermi National Accelerator Lab. (US)), Jeffrey Carver (University of Alabama) -
29
Summary - Software/Data/Workflow Preservation & Reproducibility Lewis Library 138
Lewis Library 138
Speakers: Carlos Maltzahn (University of California - Santa Cruz), Mike Hildreth (University of Notre Dame (US)) -
30
Summary - Machine Learning, Algorithms Lewis Library 138
Lewis Library 138
Speaker: Sergei Gleyzer (University of Florida (US)) -
10:30 AM
Coffee Break
-
31
Summary - Data Intensive Analysis Tools, Visualization Lewis Library 138
Lewis Library 138
Speaker: Fernanda Psihas (Indiana University) -
32
Summary - Scalable Platforms Lewis Library 138
Lewis Library 138
Speakers: Douglas Thain (University of Notre Dame), Robert William Gardner Jr (University of Chicago (US)) -
33
Summary - Data Management, Access and Organisation/Data Streaming Lewis Library 138
Lewis Library 138
Speakers: Oliver Gutsche (Fermi National Accelerator Lab. (US)), Tanu Malik (Depaul) -
34
Discussion - Next Steps Lewis Library 138
Lewis Library 138
-
35
Closeout Lewis Library 138
Lewis Library 138
-
1:00 PM
Take-Away Lunch
-
25