US ATLAS Computing Facility (Possible Topical)
Facilities Team Google Drive Folder
Zoom information
Meeting ID: 993 2967 7148
Meeting password: 452400
Invite link: https://umich.zoom.us/j/99329677148?pwd=c29ObEdCak9wbFBWY2F2Rlo4cFJ6UT09
-
-
13:00
→
13:05
WBS 2.3 Facility Management News 5mSpeakers: Alexei Klimentov (Brookhaven National Laboratory (US)), Dr Shawn Mc Kee (University of Michigan (US))
-
13:05
→
13:10
OSG-LHC 5mSpeakers: Brian Hua Lin (University of Wisconsin), Matyas Selmeci
- Release: frontier-squid-6.14 in the coming weeks
- Friendly reminder that we only test our software against the latest OS releases (https://osg-htc.org/docs/release/supported_platforms/)
- Questions about grid software and full chain vs leaf certs?
- OSG School 2026 applications open https://osg-htc.org/school-2026/
-
13:10
→
13:30
WBS 2.3.1: Tier1 CenterConvener: Alexei Klimentov (Brookhaven National Laboratory (US))
-
13:10
Tier-1 Infrastructure 5mSpeaker: Jason Smith
- 13:15
-
13:20
Storage 5mSpeakers: Carlos Fernando Gamboa (Department of Physics-Brookhaven National Laboratory (BNL)-Unkno), Carlos Fernando Gamboa (Brookhaven National Laboratory (US))
- 13:25
-
13:10
-
13:30
→
13:40
WBS 2.3.2 Tier2 Centers
Updates on US Tier-2 centers
Conveners: Fred Luehring (Indiana University (US)), Rafael Coelho Lopes De Sa (University of Massachusetts (US)) -
13:40
→
13:50
WBS 2.3.3 Heterogenous Integration and Operations
HIOPS
Convener: Rui Wang (Argonne National Laboratory (US))-
13:40
HPC Operations 5mSpeaker: Rui Wang (Argonne National Laboratory (US))
-
13:45
Integration of Complex Workflows on Heterogeneous Resources 5mSpeaker: Doug Benjamin (Brookhaven National Laboratory (US))
-
13:40
-
13:50
→
14:10
WBS 2.3.4 Analysis FacilitiesConvener: Wei Yang (SLAC National Accelerator Laboratory (US))
-
13:50
Analysis Facilities - BNL 5mSpeaker: Qiulan Huang (Brookhaven National Laboratory (US))
-
User space cleanup: Viviana provided a start storage policy to refer
-
Prepared a python code and testing about email notification to inactive users automatically regarding to the inactive account policy
-
- Ofer, Rob, Tom working with Giordon to set up new AF benchmarking monitor on BNL OpenShift
-
-
13:55
Analysis Facilities - SLAC 5mSpeaker: Wei Yang (SLAC National Accelerator Laboratory (US))
-
14:00
Analysis Facilities - Chicago 5mSpeaker: Fengping Hu (University of Chicago (US))
Jupyter Notebook Services Updates
-
Image Rationalization: Consolidated and reorganized notebook images, reducing
ml_platformvariants (e.g., conda, Julia) to simplify the user experience and streamline maintenance. -
Unified Monitoring Framework: Launched three interlinked dashboards covering JupyterLab, Coffea-Casa, and BinderHub services.
-
Cluster-Level Visibility: High-level view of server health, resource allocation trends, and GPU utilization across environments.
-
User Analytics: Per-user usage metrics to identify heavy usage patterns and support capacity planning.
-
Infrastructure Efficiency: Pod-level observability to optimize resource allocation and improve overall service efficiency.
-
-
13:50
-
14:10
→
14:30
WBS 2.3.5 Continuous OperationsConveners: Ivan Glushkov (Brookhaven National Laboratory (US)), Ofer Rind (Brookhaven National Laboratory)
- T3 LOCALGROUPDISKs - we do not take care of them but users are using them (and failing, Nevis).
- The particular problem was solved (cert problem)
- Restarting FTS4 tests now.
- Load tests with FT transfers in the beginning of March.
- Manchester will be the first production side to switch to FTS4
- Stopping IPv4
- on LHCONE for AGLT2 on March 10
- on LHCOPN for PIC on February 23
-
14:10
ADC Operations, US Cloud Operations: Site Issues, Tickets & ADC Ops News 5mSpeaker: Kaushik De (University of Texas at Arlington (US))
-
Very nice presentation of US mini-DC results at WLCG DOMA General earlier today (link)
- AI email report for SWT2 from Kaushik.
-
-
14:15
Services DevOps 5mSpeaker: Ilija Vukotic (University of Chicago (US))
-
14:20
Facility R&D 5mSpeaker: Robert William Gardner Jr (University of Chicago (US))
-
Facility R&D Biweekly (notes): updates on RP1, SENSE, HTCondor-related development,...
-
-
14:25
Cybersecurity plan(s) 5mSpeakers: Robert William Gardner Jr (University of Chicago (US)), Shigeki Misawa (Brookhaven National Laboratory (US))
- T3 LOCALGROUPDISKs - we do not take care of them but users are using them (and failing, Nevis).
-
14:30
→
14:40
AOB 10m
-
13:00
→
13:05