21–25 May 2012
New York City, NY, USA
US/Eastern timezone

Next generation WLCG File Transfer Service (FTS)

24 May 2012, 17:00
25m
Eisner & Lubin Auditorium (Kimmel Center)

Eisner & Lubin Auditorium

Kimmel Center

Parallel Distributed Processing and Analysis on Grids and Clouds (track 3) Distributed Processing and Analysis on Grids and Clouds

Speaker

Mr Zsolt Molnár (CERN)

Description

LHC experiments at CERN and worldwide utilize WLCG resources and middleware components to perform distributed computing tasks. One of the most important tasks is reliable file replication. It is a complex problem, suffering from transfer failures, disconnections, transfer duplication, server and network overload, differences in storage systems, etc. To address these problems, EMI and gLite have provided the independent File Transfer Service (FTS) and Grid File Access Library (GFAL) tools. Their development started almost a decade ago, in the meantime, requirements in data management have changed - the old architecture of FTS and GFAL cannot keep support easily these changes. Technology has also been progressing: FTS and GFAL do not fit into the new paradigms (cloud, messaging, for example). To be able to serve the next stage of LHC data collecting (from 2013), we need a new generation of  these tools: FTS 3 and GFAL 2. We envision a service requiring minimal configuration, which can dynamically adapt to the state of its resources (endpoints (SE-s)  and network) and which offers an SE-centric configuration model for administrators to use where necessary. Main problems that we solve: scalability problems of the static channel model, resource and network states are not taken into account when scheduling transfer jobs, supporting multiple database backends (Oracle, MySQL), multiple transfer and control protocols. The new system will be much easier to configure and manage. FTS 3 will provide transparent job submission and monitoring features based on messaging. We report on design and prototype experience.  We provide an overview about how FTS fits into the more general suite of data management utilities provided by the EMI project.

Summary

We present our plans and prototyping experience about the next generation WLCG FTS and GFAL tools. We address the problems that the actual tools face with: static channel model, configuration and scalability problems, etc. We present the solution we proposed and the design of the new tools as well. We also overview how FTS fits into the more general suite of data management utilities provided by the EMI project.

Student? Enter 'yes'. See http://goo.gl/MVv53 no

Primary author

Mr Zsolt Molnár (CERN)

Co-authors

Presentation materials