

# **ATLAS TDAQ upgrades for Phase-2**

## **F.Pastore (RHUL) on behalf of the ATLAS Collaboration**







# **The High-Luminosity LHC**













# ATLAS at High-Luminosity L = 7.5x10<sup>34</sup> /cm<sup>2</sup>/s

- 60<sup>111</sup> 200 collisions per bunch crossing (every 25 ns)
- ~ 10 000 particles per event
- Mostly low p<sub>T</sub> particles due to low transfer energy interactions





**HL-LHC ti event in ATLAS ITK** at <u>=200



# **ATLAS for Phase-II**

New Front End and Back End electronics for <u>calorimeter</u> and trigger

**New Front End electronics for the** muon spectrometer and new trigger

- Upgrades of detectors to be more robust against pileup
- Completely new Trigger/DAQ architecture (this talk)

New full silicon <u>Inner Tracker</u> (ITk) - sensors/mechanics/Front End New <u>High-Granularity Timing detector</u> (HGTD) for pile-up mitigation

Upgrades on Front End (FE) electronics towards new hardware triggers







# increasing the trigger rates



2024

A

# **ATLAS TDAQ architecture for Phase-II**



ACAT 2024

"ATLAS TDAQ upgrades for Phase-2"

astore

Ľ

- Level-0 (L0) trigger (hardware)
  - 40 MHz  $\Rightarrow$  1 MHz
  - 10 µs latency
- Event Filter trigger (software)
  - 1 MHz  $\Rightarrow$  10 kHz
- DAQ: Readout and Dataflow
  - 5 TB/s data throughput
- TDR Amendment in 2022 [ATLAS-TDR-029-ADD-1]
- Technical Design Report (TDR) in 2017 [ATLAS-TDR-029]
  - Re-design of the Event Filter architecture based on COTs





# L0 trigger for Phase-II



- Identify muon and calorimeter L0 trigger objects (TOBs) in local regions (Region-of-interest)
- TOBs are collected into the Global Trigger which implements full granularity offline-like reconstruction @40MHz
- Central Trigger Processor (CTP) provides hardware trigger decision, sent to the Readout and the Event Filter
- Technology for custom build components
  - ATCA-based architecture
  - FPGA processing
    data I/O with optical links 2.5-25 Gb/s



# LO Calorimeter trigger

- Dedicated boards to identify calorimeter objects, improved resolution thanks to full granularity alorimeter information (for both EM and HAD)
  - additionally improved by use of Global Trigger board (see next)
- Four boards for Feature Extraction
  - e(lectron)FEX, j(et)FEX, g(lobal)FEX already installed in Run 3
    - Retaining hardware with significantly updated firmware
  - f(orward)FEX: new board for forward electrons  $|\eta|>2.5$  and jets  $|\eta|>3.3$  me
    - Schematic capture & layout completed, prototype soon ready



- New optical connections to Global Trigger (fFOX, gFOX, FOX++)
  - Custom designed, final design approval this year















# LO Muon trigger

- Extended to include all muon detectors (in addition to legacy RPC) & TGC & NSW
  - New trigger processor for endcap New Small Wheel (NSW-TP) lacksquare
  - New trigger processor for MDT precision chambers (MDT-TP) with improved p<sub>T</sub> resolution on muons identified by RPC and TGC
    - Final Design Review in 2023, pre-production has started lacksquare
- Data is streamed out and the entire trigger logic is moved offdetector (Sector Logic board)
  - New common board for all technologies
  - 2nd prototype layout started
- **Integration tests in full swing (more connectivity to test)** 
  - Including on-detector boards, MDT-TP, Sector Logic and readout (FELIX)



## MDT-TP Command Module



## Sector-Logic prototype







# LO Global Trigger

## **Process L0 TOBs + more high-granularity calorimeter**

- Offline-like algorithms to refine identification of muons, calorimeters (topological-clusters), jets (anti- $k_T$ ) and pile-up subtraction
- With topological functionality as in Run 3

## Farm of boards with same hardware platform (Global Common Module) for different functionalities:

- Data aggregation and time-multiplexing per bunch-crossing (MUX)
- Processing algorithms for each event in Global Event Processors (GEP)  $\bullet$
- Send GEP outputs to the Central Trigger Processor (gCTPi) ullet

## • Status

- Preliminary design review of the board in 2023, early tests on prototype (2 Xilinx VP1802)
- Firmware progressing, review passed for critical algorithms (tau and pile-up suppression)
- Slice test (including MUX, GEP and gCTPi functionality) in fall 2024



## LO Global v3 prototype





# LO Central Trigger





ore

Pas

## Takes final L0 decision, applying vetos and pre-scales, and drives the timing signals

## New Central Trigger Processor (CTP) board for

- More trigger inputs: 512 to 1024
- More bandwidth, more complex criteria
- Preliminary design expected this spring

## Muon-to-CTP-Interface MuCTPI to remove overlaps between muons and calculate multiplicities

• Reuse Run 3 board with upgraded firmware

## Trigger, Timing and Control (TTC) system network, distributed via new Local Trigger Interface (LTI) modules

- Preliminary design in 2023, new prototype underway
- Tests on going of phase stability for UltraScale+ GTHe4

## More details in A. Koulouris' talk



## • FELIX (Front-End Link eXchange)

- preparation
- lacksquare



# Readout



## New common interface between detectors and DAQ, with commodity servers and switched network

• 4.6 TB/s aggregate throughput across whole system

• PCIe cards with single FPGA, custom optical links for front-end

Already running in Run 3 for new detectors and trigger components

# • Data Handler collects data fragments for detector-specific

## FELIX prototypes upgrade for Run 4

• 2nd prototype (FLX-182): Xilinx VM1802, PCIe Gen4, 24 links up to 25 Gb/s + TTC interface - 50 boards production for integration tests

3rd prototype (FLX-155): Xilinx VP1552, PCIe Gen5, up to 48 links. Prototype design approved for production

• Firmware is mature and expandable

Final Design Review by the end of 2024







# **Dataflow and network**



## Dataflow aggregates data (full-event-building)) buffered for Event Filter and sent to permanent storage

- Software prototype, based on Run 3, plus optimisations to support required 12 kHz event rate per Event Filter rack
- Preliminary design in 2023, large scale tests in Run 3 system

## Networking capabilities expanded, new simulation models [link]

Controlled with traffic-shaping and sufficient buffers

## Persistent-storage buffer prototype using <u>DAOS</u> (opensource high-performance distributed storage system)

Alternative to Run 3 application-based design





# **Dataflow and network**



## **Dataflow** aggregates data (full-event-building) buffered for Event Filter and sent to permanent storage

- Software prototype, based on Run 3, plus optimisations to support required 12 kHz event rate per Event Filter rack
- Preliminary design in 2023, large scale tests in Run 3  $\bullet$ system

## **Networking** capabilities expanded, new simulation models [link]

Controlled with traffic-shaping and sufficient buffers

## Persistent-storage buffer prototype using <u>DAOS</u> (opensource high-performance distributed storage system)

Alternative to Run 3 application-based design





# **Event Filter**



- **FPGAs**
- - CPU: x8 speed-up with fast tracking
  - GPU: x12 speed-up on topological calorimeter cell clustering
  - Many ML approaches (GNN, CNN, RNN) look promising
    - Forum dedicated to ML on FPGA



2024

ACAT

## Software trigger with full event-building @1MHz

## Farm of heterogeneous commodity processors, w/offline-like algorithms, possibly with accelerators (CPU + GPUs and/or

## Feasibility demonstrated with algorithms on different platforms

## Building demonstrators, to investigate use of accelerators, with final technology choice in 2025. Mainly driven by tracking needs



# **EF Tracking**

- **Project dedicated to assess the best technology choice for Event Filter**
- **Ongoing R&D on accelerators, comparing CPU, GPU and FPGA** 
  - Support for common GPU language/API to all ATLAS experiment
  - Settled FPGA family on AMD/Xilinx
- Demonstrators for test slices of tracking algorithms on multiple technologies (CPU+GPU+FPGA) for commodity boards
  - Split into track seeding, pattern recognition, track fitting, ambiguity removal
  - Including NN options (eg GNN)
  - Exploring use of High Level Synthesis (HLS)
- Interfaces to ATLAS software via A Common Tracking Software (ACTS)
  - Experiment independent toolkit for tracking

















# **Online software**

**ACAT 2024** 

TDAQ upgrades for Phase-2"

"ATLAS

Pastore

Ľ





## Gluing whole T/DAQ system together for common configuration, control and monitoring

## **Prototype based on Kubernetes as farm orchestrator**

Open-source platform to automate deployment, management and scaling with containerised applications

## Tested successfully on Run 3 farm, with 2600+ nodes

Scaling the cluster size is still a research topic [OpenAl]

• Very few in the world working with a cluster of this size

## **Being integrated within ATLAS Finite State Machine (Run Control)**









Very active area of Phase-2 upgrades in the ATLAS Trigger and DAQ systems

- Very active area of Phase-2 upgrades in the ATLAS Trigger and DAQ systems
- Level-0 systems focusing on finalising design and progressing to integration with prototypes that already exist
  - With major effort in firmware development  $\bullet$

- Very active area of Phase-2 upgrades in the ATLAS Trigger and DAQ systems
- Level-0 systems focusing on finalising design and progressing to integration with prototypes that already exist
  - With major effort in firmware development  $\bullet$
- **Readout and DAQ** have developed prototypes, now under testing
  - Adopting latest products available on the market ullet

- Very active area of Phase-2 upgrades in the ATLAS Trigger and DAQ systems
- Level-0 systems focusing on finalising design and progressing to integration with prototypes that already exist
  - With major effort in firmware development  $\bullet$
- **Readout and DAQ have developed prototypes, now under testing** 
  - Adopting latest products available on the market lacksquare
- **Event Filter software trigger studying best options for use of accelerators** 
  - Extensive R&D for both algorithms and hardware choices

- Very active area of Phase-2 upgrades in the ATLAS Trigger and DAQ systems
- Level-0 systems focusing on finalising design and progressing to integration with prototypes that already exist
  - With major effort in firmware development
- **Readout and DAQ have developed prototypes, now under testing** 
  - Adopting latest products available on the market ullet
- **Event Filter software trigger studying best options for use of accelerators** 
  - Extensive R&D for both algorithms and hardware choices
- Physics and performance studies ongoing to ensure performance goals met
  - Enabling new trigger algorithms and developing updated Run 4 trigger menu

- Very active area of Phase-2 upgrades in the ATLAS Trigger and DAQ systems
- Level-0 systems focusing on finalising design and progressing to integration with prototypes that already exist
  - With major effort in firmware development
- **Readout and DAQ have developed prototypes, now under testing** 
  - Adopting latest products available on the market ullet
- **Event Filter software trigger studying best options for use of accelerators** 
  - Extensive R&D for both algorithms and hardware choices
- Physics and performance studies ongoing to ensure performance goals met
  - Enabling new trigger algorithms and developing updated Run 4 trigger menu

## **Detailed integration plan, from now through to installation and commissioning**

# ACAT 2024

22nd International Workshop on Advanced Computing Analysis Techniques in Physics Research

## 11 – 15 March 2024 | Stony Brook, New York

Charles B Wang Center – Stony Brook University Abstract Submission by January 20th, 2024 Website: https://acat2024.sdcc.bnl.gov/ Email: acat-loc2024@cern.ch





International Advisory Committee: David Britton (Glasgow Univ, UK) Chair, Gang Chen (IHEP, China), Latifa Elouadrhiri (Jefferson Lab, USA), Ian Fisk (FII, USA), Maria Girone (CERN, Switzerland), Gudrun Heinrich (KIT, Germany), Gregor Kasieczka (UHH, Germany), Doris Kim (Soongsil Univ, Korea), Jerome Lauret (BNL, USA), Daniel Maitre (Durham, UK), Axel Naumann (CERN, Switzerland), Jennifer Ngadiuba (FNAL, USA), Fons Rademakers (CERN, Switzerland), Liz Sexton-Kennedy (Fermilab, USA), Lucia Silvestris (INFN Bari, Italy), Liliana Teodorescu, (Brunel Univ, UK), Claudio Torres (USM, Chile), Gordon T. Watts (Univ. Washington, USA), Monique Werlen (LAPTH, France)

Local Organizing Committee: Christopher Bee (SBU), Alan Calder (SBU) Co-chair, Ian Fisk (FII), Ann DeGregorio Gardner (SBU), Robert Harrison (SBU), John Hobbs (SBU), Jerome Lauret (BNL) Chair, Meifeng Lin (BNL), Liz Mogavero (BNL), Karin Muscarella (SBU), Michael Poat (BNL), Danielle Zimmerman (SBU), Michael Zingale (SBU)

