9–13 Jul 2018
Sofia, Bulgaria
Europe/Sofia timezone

The GridKa Tape System: monitoring and failure analysis

10 Jul 2018, 16:00
1h

National Culture Palace, Boulevard "Bulgaria", 1463 NDK, Sofia, Bulgaria
Poster Track 4 - Data Handling Posters

Speaker

Mr Dorin Lobontu (Karlsruhe Institut of Technology)

Description

A tape system usually comprises many tape drives, several thousand or even tens of thousands of cartridges, robots, software applications, and the machines that run these applications. All of these components can log failures and statistical data. However, correlating these logs is a laborious and ambiguous process, and a wrong interpretation can easily lead to a wrong decision. A single defective drive or cartridge can silently put the data on many other cartridges at risk, so it is extremely important to discover problems as early as possible. The longer it takes to identify and isolate a defective component, the more extensive the damage. An effective monitoring system is essential for making the right decision at the right time. But how effective is the monitoring system? Is there any off-the-shelf software that can do the whole job for you? This paper gives an insight into the tape drive and tape cartridge failures we have experienced over the years at GridKa and the procedures we have developed to keep the data on tape as safe as possible.

Primary authors

Mr Dorin Lobontu (Karlsruhe Institut of Technology)
Dr Doris Ressmann (KIT)
Martin Beitzinger (Karlsruher Institut for Technology)
Andreas Heiss (KIT - Karlsruhe Institute of Technology (DE))
Andreas Petzold (KIT - Karlsruhe Institute of Technology (DE))
Karin Schaefer (Karlsruher Institut for Technology)
