

# A 65 nm Data Concentration ASIC for the CMS **Outer Tracker Detector Upgrade at HL-LHC**

L.Caponetto<sup>1</sup>, G.Galbit<sup>1</sup>, <u>B.Nodari<sup>1</sup></u>, S.Scarfi<sup>2</sup>, S.Viret<sup>1</sup>

<sup>1</sup>Institut de Physique Nucléaire de Lyon (IPNL), IN2P3, CNRS – Lyon (France)  $^{2}$  CERN



Next generation of chips at High-Luminosity LHC (HL-LHC) will be exposed to extremely high levels of radiation and particle rates. In the so-called Phase II upgrade, the CMS experiment will need a completely new tracker detector: the main goal is to make sure that physics performance stay stable with respect to the current detector.

It will be included in the first level of the trigger chain, which runs at 40 MHz. To ensure tracker data extraction at 40 MHz, a new type of detection module has been designed.



 $\blacktriangleright$  Future tracker module (aka  $p_{T}$ -module) detection area is a superposition of 2 silicon sensors. A charged particle crossing a module creates two correlated clusters, which are

### Concentrator integrated circuit (CIC)

- > A front-end chip common to PS and 2S modules of the future Phase-II CMS Outer Tracker.
- ▶ It collects the digital data coming from 8 upstream FE chips (MPAs<sup>1</sup> or CBCs), it formats the signal in data packets containing the trigger information from 8 bunch crossings and the raw data from events passing the first trigger level, and finally it transmits them to a IpGBT<sup>2</sup> chip.
- It receives inputs from 48 lines at 320 Mbps with 2 different data formats.
- It sends outputs through 7 lines at 320/640 Mbps using one single data format.
- The design and implementation is in a 65nm CMOS technology.
- Chip core works at 2 different input voltages: either at 1.0 V (PS) or 1.2 V (2S).
- > A first prototype of CIC will be submitted by the end of September 2018.
- The second run of CIC (including SEU hardened design) is foreseen by the end of 2019.

#### CIC role in the readout chain

- > 2 data streams are generated in the Front-End hybrid for the Back-End:
- L1 data: stream of frames responding to the L1-accept trigger signal. Each frame aggregates the hit cluster data received from all FE chips. Frame size is flexible and depends on the number of hits in the module. The sustainable capacity (per CIC chip and per L1-accept event) is 254 clusters (@ PS) and 127 clusters (@ 2S). TRIGGER data: sequential stream of 8 BX long blocks. Each block contains the Trigger data aggregated from the 8 FE chips associated to one CIC. The CIC can select and forward up to 40 stubs among the 192 potential input stubs. CIC provides to the readout chain an extra factor 10 of data reduction, by grouping data over time (8 BX blocks) and space (8 input chips). CIC ASIC uniforms the data format for the Back-End.
- reconstructed within the module to form a stub.
- $\triangleright$  Stub width is called the *bend*. Bend is linked to the particle transverse momentum ( $p_T$ ). For track trigger, only low bend stubs are necessary.
- Reducing data rate by one order of magnitude, stubs make tracker data extraction @40MHz possible.

### **2** front-end flavours:

Tracker will be populated with 2 pT-module types: PS (pixels/strips) and 2S (strips/strips).



► FE readout chains are differents (changes are the FE-ASICs, In hybrids, data formats). CIC ASIC is the only shared component. ► There will be 2 CIC chips per module.





#### Architecture



- Phase Aligner: analog IP block used at each input data channel incoming from the 6 lines from each MPA/CBC front-end. Is required in order to synchronize the signal with the internal clock (320 MHz).
  - <u>CIC\_Core</u>: based on two independent data paths, working in parallel:
    - Trigger data path: treatment of data payload produced @ 40MHz (deserialization, word alignment, stub selection) and output frame creation). This is the information necessary to the L1 track trigger.

NC

3

Ŭ

Top

00

0

2

WEPP

- <u>L1 data path</u>: treatment of raw tracker data produced @ 750kHz max. (sparsification in the case of CBC, storage) in FIFOs, output frame creation). This data payload is sent on request each time an L1-accept signal is received.
- SystemManager: manages the clocks generation (40 MHz, 320 MHz, 640 MHz), clock gating, reset distribution, command decoding from the fast control frame.
- SlowControl: manages the communication via I<sup>2</sup>C protocol for control and monitoring of the system. It contains the I<sup>2</sup>C slave and the internal slow control registers.

<u>IPs blocks</u>: Phase aligner (SMU Univ. and CERN), SLVS Drivers and receivers<sup>3</sup> (INFN and Univ. Bergamo and Pavia), CMOS I/O pads and I<sup>2</sup>C Slave (CERN), ESD protections (SOFICS).

| TRIGGER DATA PATH:                                                                                                                                                                                                                                                                                                                                                                                                                                                  | L1 DATA PATH:                                                                                                                                                                                                                                                                                                                                                                                          |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <ul> <li>Trigger readout data is synchronous across all the detector with respect to the LHC BX clock.</li> <li>CIC output trigger frame contains data from: <ul> <li>64 input CBC frames or 32 input MPA frames.</li> </ul> </li> <li>Differents configurable output formats: <ul> <li>320 Mbps or 640 Mbps via 5 or 6 (configurable) output lines;</li> <li>Z information (MPA), bend on 3 bits (MPA), 4 bits (CBC) or no bend forwarding;</li> </ul> </li> </ul> | <ul> <li>L1 readout data is asynchronous between different modules and CIC internal processing starts after reception of the L1-accept signal.</li> <li>Each FE block handles frame reception independently from the others.</li> <li>L1 output formatter merges the data that corresponds to the same event.</li> <li>The readout data from CBC (2S module) is sparsified in the CIC ASIC.</li> </ul> |
| It the output trame is too short to send all the stubs                                                                                                                                                                                                                                                                                                                                                                                                              |                                                                                                                                                                                                                                                                                                                                                                                                        |

Physical design



Process TSMC 65nm LP 1p7m4x1z1u metal stack. Wire bond with AP RDL. ~372k standard cells.





## Standalone Testbench

- Test design functionalities within a script-based framework (python).
- It performs the comparison between data stream from CMS simulation environment with the CIC model output after the phase alignment and data treatment.





Back-annotated simulations are performed in 3 corners: MAX (ss, 0.9V, -40°C), TYP (tt, 1V, 25°C), MIN (ff, 1.32V, 0°C) and 2 test cases: PS and 2S.



- Results show that all functionalities are met.
- System level Testbench for the validation of the full acquisition chain, developed at CERN (S.Scarfi's poster).

We want to thank D.Ceresa, A.Caratelli, K.Kloukinas for their contribution in this project.

lowest |bend| stubs are transmitted in priority.

optimized to reduce the sequential comparisons.

Stubs sorting network is based on comparators/switchers

[1] A 65 nm pixel readout ASIC with quick transverse momentum discrimination capabilities for the CMS Tracker at HL-LHC, D.Ceresa et Al., JINST, Vol 11, Jan '16. [2] https://espace.cern.ch/GBTProject/LpGBT/Specifications/LpGbtxSpecifications.pdf [3] Design of low-power, low-voltage, differential I/O links for High Energy Physics applications, G. Traversi et AI, JINST Vol. 10, Jan. '15.

#### b.nodari@ipnl.in2p3.fr