ACAT 2019

Name: ACAT 2019
Start: 2019-03-10T19:00:00+01:00
End: 2019-03-15T13:45:00+01:00
Location: Steinmatte conference center

10–15 Mar 2019

Steinmatte conference center

Europe/Zurich timezone

Need Help?

Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System

12 Mar 2019, 16:50

20m

Steinmatte Plenary

Oral Track 1: Computing Technology for Physics Research Track 1: Computing Technology for Physics Research

Oskar Wyszynski (CERN)

Network monitoring is of great importance for every data acquisition system (DAQ), it ensures stable and uninterrupted data flow. However, when using standard tools such as Icinga, often homogeneity of the DAQ hardware is not exploited.
We will present the application of machine learning techniques to detect anomalies among network devices as well as connection instabilities. The former exploits homogeneity of network hardware to detect device anomalies such as too high CPU or memory utilization, and consequently uncover a pre-failure state. The latter algorithm learns to distinguish between port speed instabilities caused by, e.g. failing transceiver or fiber, and speed changes due to scheduled system reboots.
All the algorithms described are implemented in the DAQ network of the ATLAS experiment.

Oskar Wyszynski (CERN) Eukeni Pozo Astigarraga (CERN)

ATL-COM-DAQ-2019-028c.pdf

ACAT 2019

Need Help?

Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System

Steinmatte Plenary

Speaker

Description

Primary authors

Presentation materials

Choose timezone

ACAT 2019

Need Help?

Speaker

Description

Primary authors

Presentation materials

Share this page

Direct link

Social networks

Calendaring