



# Introduction to Data Acquisition



ISOTDAQ 2015: 6th International School of Trigger and Data Acquisition

Rio de Janeiro, 28 Jan 2015

Andrea.Negri@pv.infn.it



#### Acknowledgment



- Lecture inherited from Wainer Vandelli
  - Material and ideas have been taken from:
    - CERN Summer Student lectures, N.Neufeld and C.Gaspar
    - the "Physics data acquisition and analysis" lessons given by R.Ferrari at the University of Parma, Italy
- Errors and flaws are mine





#### Outline

- Introduction
  - What is DAQ?
  - Overall framework
- Basic DAQ concepts
  - Digitization, Latency
  - Deadtime, De-randomization
- Scaling up
  - Readout and Event Building
  - Buses vs Network
- Do it yourself







- Data AcQuisition is not an exact science
- DAQ is an alchemy of physics, electronics, networking, hacking and experience

- ..., money and manpower matter as well

- Aim of this lesson is to introduce the basic DAQ concept avoiding as many technological details as possible
  - The following lectures will cover these aspects



I'll mostly refer to DAQ in High-Energy Physics



#### Overview

 Overall the main role of T & DAQ is to process the signals generated in a detector and saving the interesting information on a permanent storage





### **Trigger & DAQ**



- Trigger
  - Either selects interesting events or rejects boring ones, in real time
  - i.e. with minimal controlled latency
    - time it takes to form and distribute its decision
- DAQ
  - Gathers data produced by detectors: Readout
  - Possibly feeds several trigger levels
  - Forms complete events: Event Building
  - Stores event data: Data Logging
  - Provides Run Control, Configuration and Monitoring facilities

Data

Flow

# Trigger, DAQ and Controls





#### Outline



- Introduction
  - What is DAQ?
  - Overall framework
- Basic DAQ concepts
  - Digitization, Latency
  - Deadtime, De-randomization
- Scaling up
  - Readout and Event Building
  - Buses vs Network
- Do it yourself



# Basic DAQ: periodic trigger

- Es: measure temperature at a fixed frequency
  - ADC performs analog to digital conversion, digitization (our front-end electronics)
  - CPU does readout and processing
- System clearly limited by the time τ to process an "event"
  ADC conversion + CPU processing + Storage
- The DAQ maximum sustainable rate is simply the inverse of  $\tau$ , e.g.:

-  $\tau = 1 \text{ ms } \rightarrow R = 1/\tau = 1 \text{ kHz}$ 



# Basic DAQ: periodic trigger

TRIGGER

- Es: measure temperature at a fixed frequency
  - ADC performs analog to digital conversion, digitization (our front-end electronics)
  - CPU does readout and processing
- System clearly limited by the time τ to process an "event"
  - ADC conversion + CPU processing + Storage
- The DAQ maximum sustainable rate is simply the inverse of  $\tau$ , e.g.:
  - $\tau = 1 \text{ ms } \rightarrow R = 1/\tau = 1 \text{ kHz}$



F

ADC

Processing

disk

# Basic DAQ: periodic trigger

TRIGGER

- Es: measure temperature at a fixed frequency
  - ADC performs analog to digital conversion, digitization (our front-end electronics)
  - CPU does readout and processing
- System clearly limited by the time τ to process an "event"
  - ADC conversion + CPU processing + Storage
- The DAQ maximum sustainable rate is simply the inverse of  $\tau$ , e.g.:

$$\tau = 1 \text{ ms} \rightarrow \text{R} = 1/\tau = 1 \text{ kHz}$$



BS

ш

ADC

Processing

disk

#### Andrea.Negri@unipv.it

#### 12

#### Basic DAQ: "real" trigger

- Events asynchronous and unpredictable
  - E.g.: beta decay studies
- A physics trigger is needed
  - delay compensate for trigger latency
  - **Discriminator**: generate an output signal only if amplitude of input pulse is grater than a certain threshold





Intro to DAO

- Events asynchronous and unpredictable
  - E.g.: beta decay studies
- A physics trigger is needed
  - delay compensate for trigger latency
  - Discriminator: generate an output signal only if amplitude of input pulse is grater than a certain threshold





#### CERN

## **Basic DAQ: "real" trigger**

- Stochastic process
  - Fluctuations in time between events
- Let's assume for example
  - a process rate f = 1 kHz, i.e.  $\lambda$  = 1 ms





#### CERN

## Basic DAQ: "real" trigger

- Stochastic process
  - Fluctuations in time between events
- Let's assume for example
  - a process rate f = 1 kHz, i.e.  $\lambda$  = 1 ms





#### CERN

## Basic DAQ: "real" trigger

 Stochastic process  $\lambda = 1 \text{ ms}$  Fluctuations in time between events • Let's assume for example TRIGGER - a process rate f = 1 kHz, i.e.  $\lambda$  = 1 ms delay - and, as before,  $\tau = 1$  ms 1.0  $\lambda = 1 ms$ start ADC Probability of time (in ms) 0.8 between events for average BS decay rate of f=1kHz  $\rightarrow \lambda$ =1ms 0.6 ш odf What if a trigger is 0.4 interrupt created when the Processing 0.2 system is busy? 0.0 disk 1 2 3 Δ 5 6 8 Time between events (ms)

f = 1 kHz

#### state (Q) by signals applied to the control inputs (SET, CLEAR)



- E.g.: using an AND port and a latch (flip-flop)

a bistable circuit that changes

processing

while the system is busy in

• **Busy logic** avoids triggers

BS

П

delay

f = 1 kHz

 $\lambda = 1 \text{ ms}$ 

TRIGGER

AND

# **Basic DAQ: "real" trigger**

#### 0.6 odf 0.4

Time between events (ms)



0.2

0.0

## **Basic DAQ: "real" trigger**

Intro to DAO

- **Busy logic** avoids triggers while the system is busy in processing
  - E.g.: using an AND port and a latch (flip-flop)
    - a bistable circuit that changes state (Q) by signals applied to the control inputs (SET, CLEAR)
- Which (average) DAQ rate ● 1.0 can we achieve now? 0.8 Reminder: with a clock trigger

and  $\tau = 1$  ms the limit is 1 kHz







- Definitions
  - **f** average rate of physics phenomenon (input)
  - v average rates of DAQ (output)
  - τ: deadtime, the time the system requires to process an event, without being able to handle other triggers
  - the probability that DAQ is busy  $P[busy] = v \tau$
  - the probability that DAQ is free  $P[free] = 1 v \tau$

• Therefore:

$$v = f P[free] \Rightarrow v = f(1-v\tau) \Rightarrow v = \frac{f}{1+f\tau} < f$$

$$\epsilon = \frac{N_{saved}}{N_{tot}} = \frac{1}{1+f\tau} < 100\%$$





- Definitions
  - f average rate of physics phenomenon (input)
  - v average rates of DAQ (output)
  - τ: deadtime, the time the system requires to process an event, without being able to handle other triggers
  - the probability that DAQ is busy  $P[busy] = v \tau$
  - the probability that DAQ is free  $P[free] = 1 v \tau$
- Therefore:

$$v = f P[free] \Rightarrow v = f(1-v\tau) \Rightarrow v = \frac{f}{1+f\tau} < f$$

$$\epsilon = \frac{N_{saved}}{N_{tot}} = \frac{1}{1+f\tau} < 100\%$$

 So, in our specific example

$$\begin{array}{c|c} f=1\,kHz \rightarrow \\ \tau=1\,ms \end{array} \begin{array}{|c|c|} \nu=500\,Hz \\ \epsilon=50\,\% \end{array}$$



- DAQ rate always < physics rate</li>
- Efficiency always < 100%</li>





21



- In order to obtain  $\epsilon{\sim}100\%$  ( i.e.: v~f )  $~\to$  ft << 1  $\to$  t <<  $\lambda$ 

- E.g.:  $\epsilon \sim 99\%$  for f = 1 kHz  $~\rightarrow~\tau <$  0.1 ms  $\rightarrow$  1/ $\tau >$  100 kHz
- To cope with the input signal fluctuations, we have to **over-design** our DAQ system by a factor 10.
- How can we mitigate this effect?



- In order to obtain  $\epsilon{\sim}100\%$  ( i.e.:  $\nu{\sim}f$  )  $~\rightarrow$  fr << 1  $\rightarrow$   $\tau$  <<  $\lambda$ 
  - E.g.:  $\epsilon \sim 99\%$  for f = 1 kHz  $~\rightarrow~\tau <$  0.1 ms  $\rightarrow$  1/ $\tau >$  100 kHz

 To cope with the input signal fluctuations, we have to **over-design** our DAQ system by a factor 10.

• How can we mitigate this effect?





- In order to obtain  $\epsilon{\sim}100\%$  ( i.e.:  $\nu{\sim}f$  )  $~\to f\tau << 1 \to \tau << \lambda$ 
  - E.g.:  $\epsilon {\sim} 99\%$  for f = 1 kHz  $~\rightarrow~\tau <$  0.1 ms  $\rightarrow 1/\tau >$  100 kHz
  - To cope with the input signal fluctuations, we have to **over-design** our DAQ system **by a factor 10!**
- How can we mitigate this effect?





- The effect of the queue depends on its depth

disk



- Efficiency vs traffic intensity ( $\rho = \tau / \lambda$ ) for different queue depths
  - $\rho$  > 1, the system is overloaded
  - $\rho$  << 1, the output is over-designed
  - $\rho$  ~ 1, using a queue, high efficiency can be obtained even w/ moderate depth
- Analytic calculation possible for very simple systems only
  - Otherwise MonteCarlo simulation is required

#### Andrea.Negri@unipv.it

#### 28

#### **De-randomization summary**

Intro to DAO

- Almost 100% efficiency with minimal deadtime achievable if
  - ADC is able to operate at rate >> f
  - Data processing and storing operate at a rate ~ f
- The FIFO decouples the low latency front-end from the data processing
  - Minimize the amount of "unnecessary" fast components
- Could the delay be replaced with a "FIFO"?
  - Analog pipelines → Heavily used in LHC DAQs



#### Andrea.Negri@unipv.it

# De-randomization summary

- Almost 100% efficiency with minimal deadtime achievable if
  - ADC is able to operate at rate >> f
  - Data processing and storing operate at a rate ~ f
- The FIFO decouples the low latency front-end from the data processing
  - Minimize the amount of "unnecessary" fast components
- Could the delay be replaced with a "FIFO"?
  - Analog pipelines, heavily used in LHC DAQs

Intro to DAO







- Particle collisions are synchronous
  - So, do we still need de-randomization buffers?
- Trigger rejects uninteresting events
  - Good events are unpredictable
- Even if collisions are synchronous, the time distribution of triggers is random
  - De-randomization is still needed





#### Outline



- Introduction
  - What is DAQ?
  - Overall framework
- Basic DAQ concepts
  - Digitization, Latency
  - Deadtime, De-randomization
- Scaling up
  - Readout and Event Building
  - Buses vs Network
- Do it yourself

























# Adding more channels



• Buffering usually needed at every level





- Reading out data or building events out of many channels requires many components
  - In the design of our hierarchical data-collection system, we have better define "building blocks"
  - Eg: readout crates, event building nodes, daq slices ...



- How to organize the interconnections inside the building blocks and between building blocks?
  - Two main classes: **bus** or **network** 
    - Warning: buses and network are generic concepts that can be easily confused with their most common implementations





- Reading out data or building events out of many channels requires many components
  - In the design of our hierarchical data-collection system, we have better define "building blocks"
  - Eg: readout crates, event building nodes, daq slices ...



- How to organize the interconnections inside the building blocks and between building blocks?
  - Two main classes: **bus** or **network** 
    - Warning: buses and network are generic concepts that can be easily confused with their most common implementations





#### Buses



• Examples: VME, PCI, SCSI, Parallel ATA, ...

- local, external, crate, long distance, ...

- Devices connected via a shared bus
  - Bus  $\rightarrow$  group of electrical lines
- Sharing implies arbitration
  - Devices can be **master** or **slave**
  - Devices can be addresses (uniquely identified) on the bus









## **Bus facts**



- Simple :-)
  - Fixed number of lines (bus-width)
  - Devices have to follow well defined interfaces
    - Mechanical, electrical, communication, ...
- Scalability issues :-(
  - Bus bandwidth is shared among all the devices
  - Maximum bus width is limited
  - Maximum bus frequency is inversely proportional to the bus length
  - Maximum number of devices depends on bus length
  - On the long term, other "effects" might limit the scalability of your system



#### **Bus facts**





# On the long term, other "effects" might limit the scalability of your system



- Telephone, Ethernet, Infiniband, ...
- All devices are **equal** 
  - Devices <u>communicate directly</u> with each other sending messages
  - No arbitration, simultaneous communications
- In switched networks, switches move messages between sources and destinations
  - Find the right path
  - Handle **congestions** (two messages with the same destination at the same time)
    - The key is .... buffering











### Network



- Networks scale well
  - They are the backbones of LHC DAQ systems







- Introduction
  - What is DAQ?
  - Overall framework
- Basic DAQ concepts
  - Digitization, Latency
  - Deadtime, De-randomization
- Scaling up
  - Readout and Event Building
  - Buses vs Network
- Do it yourself









- Study the trigger properties
  - Periodic or stochastic, continuous or bunched
- Consider the needed efficiency
  - It is good to keep operation margins, but avoid over-sizing
- Identify the fluctuation sources and size adequate buffering mechanisms
  - Watch out: (deterministic) complex systems introduce fluctuations: multi-threaded software, network communications, ...
- An adequate buffer is not a huge buffer
  - Makes your system less stable and responsive, prone to divergences and oscillations. Overall it decreases reliability





- Keep it simple, keep under control the number of free parameters without losing flexibility
  - Have you ever heard about SUSY phase-space scans? Do you really want something like that for your DAQ system?
- Problems require perseverance
  - Be careful, a rare little glitch in your DAQ might be the symptom of a major issue with your data
- In any case, ...





- Keep it simple, keep under control the number of free parameters without losing flexibility
  - Have you ever heard about SUSY phase-space scans? Do you really want something like that for your DAQ system?

DON'T PANIC

- Problems require perseverance
  - Be careful, a rare little glitch in your DAQ might be the symptom of a major issue with your data
- In any case, ...

Andrea.Negri@unipv.it