27 September 2004 to 1 October 2004
Interlaken, Switzerland
Europe/Zurich timezone

Session

Distributed Computing Services

7
27 Sept 2004, 14:00
Interlaken, Switzerland

Interlaken, Switzerland

Conveners

Distributed Computing Services

  • Conrad Steenberg (CalTech)

Distributed Computing Services

  • Rob Kennedy (FNAL)

Distributed Computing Services

  • O. Smirnova (Lund University, Sweden)

Presentation materials

There are no materials yet.

  1. M. Branco (CERN)
    27/09/2004, 14:00
    Track 4 - Distributed Computing Services
    oral presentation
    As part of the ATLAS Data Challenges 2 (DC2), an automatic production system was introduced and with it a new data management component. The data management tools used for previous Data Challenges were built as separate components from the existing Grid middleware. These tools relied on a database of its own which acted as a replica catalog. With the extensive use of Grid technology...
    Go to contribution page
  2. M. Ernst (DESY)
    27/09/2004, 14:20
    Track 4 - Distributed Computing Services
    oral presentation
    The LHC needs to achieve reliable high performance access to vastly distributed storage resources across the network. USCMS has worked with Fermilab-CD and DESY-IT on a storage service that was deployed at several sites. It provides Grid access to heterogeneous mass storage systems and synchronization between them. It increases resiliency by insulating clients from storage and network...
    Go to contribution page
  3. L. Lueking (FERMILAB)
    27/09/2004, 14:40
    Track 4 - Distributed Computing Services
    oral presentation
    A high performance system has been assembled using standard web components to deliver database information to a large number (thousands?) of broadly distributed clients. The CDF Experiment at Fermilab is building processing centers around the world imposing a high demand load on their database repository. For delivering read-only data, such as calibrations, trigger information and run...
    Go to contribution page
  4. Dirk Duellmann
    27/09/2004, 15:00
    Track 4 - Distributed Computing Services
    oral presentation
    While there are differences among the LHC experiments in their views of the role of databases and their deployment, there is relatively widespread agreement on a number of principles: 1. Physics codes will need access to database-resident data. The need for database access is not confined to middleware and services: physics-related data will reside in databases. 2. ...
    Go to contribution page
  5. O. Smirnova (Lund University, Sweden)
    27/09/2004, 15:20
    Track 4 - Distributed Computing Services
    oral presentation
    The NorduGrid middleware, ARC, has integrated support for querying and registering to Data Indexing services such as the Globus Replica Catalog and Globus Replica Location Server. This support allows one to use these Data Indexing services for for example brokering during job-submission, automatic registration of files and many other things. This integrated support is complemented by a...
    Go to contribution page
  6. J-P. Baud (CERN)
    27/09/2004, 15:40
    Track 4 - Distributed Computing Services
    oral presentation
    LCG-2 is the collective name for the set of middleware released for use on the LHC Computing Grid in December 2003. This middleware, based on LCG-1, had already several improvements in the Data Management area. These included the introduction of the Grid File Access Library(GFAL), a POSIX-like I/O Interface, along with MSS integration via the Storage Resource...
    Go to contribution page
  7. A. Hanushevsky (SLAC)
    27/09/2004, 16:30
    Track 4 - Distributed Computing Services
    oral presentation
    As the BaBar experiment shifted its computing model to a ROOT-based framework, we undertook the development of a high-performance file server as the basis for a fault-tolerant storage environment whose ultimate goal was to minimize job failures due to server failures. Capitalizing on our five years of experience with extending Objectivity's Advanced Multithreaded Server (AMS), elements...
    Go to contribution page
  8. E. Hjort (LAWRENCE BERKELEY LABORATORY)
    27/09/2004, 16:50
    Track 4 - Distributed Computing Services
    oral presentation
    The STAR experiment utilizes two major computing facilities for its data processing needs - the RCF at Brookhaven and the PDSF at LBNL/NERSC. The sharing of data between these facilities utilizes data grid services for file replication, and the deployment of these services was accomplished in conjunction with the Particle Physics Data Grid (PPDG). For STAR's 2004 run it will be...
    Go to contribution page
  9. Ofer RIND
    27/09/2004, 17:10
    Track 4 - Distributed Computing Services
    oral presentation
    Providing Grid applications with effective access to large volumes of data residing on a multitude of storage systems with very different characteristics prompted the introduction of storage resource managers (SRM). Their purpose is to provide consistent and efficient wide-area access to storage resources unconstrained by their particular implementation (tape, large disk arrays,...
    Go to contribution page
  10. C. CIOFFI (Oxford University)
    27/09/2004, 17:30
    Track 4 - Distributed Computing Services
    oral presentation
    The LHCb experiment needs to store all the information about the datasets and their processing history of recorded data resulting from particle collisions at the LHC collider at CERN as well as of simulated data. To achieve this functionality a design based on data warehousing techniques was chosen, where several user-services can be implemented and optimized individually without...
    Go to contribution page
  11. K. Nienartowicz (CERN)
    27/09/2004, 17:50
    Track 4 - Distributed Computing Services
    oral presentation
    Data management is one of the cornerstones in the distributed production computing environment that the EGEE project aims to provide for a European e-Science infrastructure. We have designed a set of services based on previous experience in other Grid projects, trying to address the requirements of our user communities. In this paper we summarize the most fundamental requirements and...
    Go to contribution page
  12. R. Kennedy (FERMI NATIONAL ACCELERATOR LABORATORY)
    27/09/2004, 18:10
    Track 4 - Distributed Computing Services
    oral presentation
    SAMGrid is the shared data handling framework of the two large Fermilab Run II collider experiments: DZero and CDF. In production since 1999 at D0, and since mid-2004 at CDF, the SAMGrid framework has been adapted over time to accommodate a variety of storage solutions and configurations, as well as the differing data processing models of these two experiments. This has been...
    Go to contribution page
  13. Maria Girone
    29/09/2004, 14:00
    Track 4 - Distributed Computing Services
    oral presentation
    This presentation will summarise the deployment experience gained with POOL during the first larger LHC experiments data challenges performed. In particular we discuss the storage access performance and optimisations, the integration issues with grid middleware services such as the LCG Replica Location Service (RLS) and the LCG Replica Manager and experience with the POOL proposed...
    Go to contribution page
  14. E. Laure (CERN)
    29/09/2004, 14:20
    Track 4 - Distributed Computing Services
    oral presentation
    The aim of the EGEE (Enabling Grids for E-Science in Europe) is to create a reliable and dependable European Grid infrastructure for e-Science. The objective of the Middleware Re-engineering and Integration Research Activity is to provide robust middleware components, deployable on several platforms and operating systems, corresponding to the core Grid services for resource access, data...
    Go to contribution page
  15. C. Steenberg (California Institute of Technology)
    29/09/2004, 14:40
    Track 4 - Distributed Computing Services
    oral presentation
    Clarens enables distributed, secure and high-performance access to the worldwide data storage, compute, and information Grids being constructed in anticipation of the needs of the Large Hadron Collider at CERN. We report on the rapid progress in the development of a second server implementation in the Java language, the evolution of a peer-to-peer network of Clarens servers, and general...
    Go to contribution page
  16. Birger KOBLITZ (CERN)
    29/09/2004, 15:00
    Track 4 - Distributed Computing Services
    oral presentation
    The ARDA project was started in April 2004 to support the four LHC experiments (ALICE, ATLAS, CMS and LHCb) in the implementation of individual production and analysis environments based on the EGEE middleware. The main goal of the project is to allow a fast feedback between the experiment and the middleware development teams via the construction and the usage of end-to-end...
    Go to contribution page
  17. F. Rademakers (CERN)
    29/09/2004, 15:20
    Track 4 - Distributed Computing Services
    oral presentation
    The ALICE experiment and the ROOT team have developed a Grid-enabled version of PROOF that allows efficient parallel processing of large and distributed data samples. This system has been integrated with the ALICE-developed AliEn middleware. Parallelism is implemented at the level of each local cluster for efficient processing and at the Grid level, for optimal workload management of...
    Go to contribution page
  18. T. Barrass (CMS, UNIVERSITY OF BRISTOL)
    29/09/2004, 15:40
    Track 4 - Distributed Computing Services
    oral presentation
    CMS currently uses a number of tools to transfer data which, taken together, form the basis of a heterogenous datagrid. The range of tools used, and the directed, rather than optimised nature of CMS recent large scale data challenge required the creation of a simple infrastructure that allowed a range of tools to operate in a complementary way. The system created comprises a...
    Go to contribution page
  19. 29/09/2004, 16:30
    Track 4 - Distributed Computing Services
    oral presentation
    SAM was developed as a data handling system for Run II at Fermilab. SAM is a collection of services, each described by metadata. The metadata are modeled on a relational database, and implemented in ORACLE. SAM, originally deployed in production for the D0 Run II experiment, has now been also deployed at CDF and is being commissioned at MINOS. This illustrates that the metadata...
    Go to contribution page
  20. E. Neilsen (FERMI NATIONAL ACCELERATOR LABORATORY)
    29/09/2004, 16:50
    Track 4 - Distributed Computing Services
    oral presentation
    The lattice gauge theory community produces large volumes of data. Because the data produced by completed computations form the basis for future work, the maintenance of archives of existing data and metadata describing the provenance, generation parameters, and derived characteristics of that data is essential not only as a reference, but also as a basis for future work. Development of...
    Go to contribution page
  21. Richard Mount (SLAC)
    29/09/2004, 17:10
    Track 4 - Distributed Computing Services
    oral presentation
  22. 30/09/2004, 14:00
    Track 4 - Distributed Computing Services
    oral presentation
    The ATLAS experiment uses a tiered data Grid architecture that enables possibly overlapping subsets, or replicas, of the original set to be located across the ATLAS collaboration. The full set of experiment data is located at a single Tier 0 site, and then subsets of the data are located at national Tier 1 sites, smaller subsets at smaller regional Tier 2 sites, and so on. In order to...
    Go to contribution page
  23. Jerome LAURET (BROOKHAVEN NATIONAL LABORATORY)
    30/09/2004, 14:20
    Track 4 - Distributed Computing Services
    oral presentation
    While many success stories can be told as a product of the Grid middleware developments, most of the existing systems relying on workflow and job execution are based on integration of self-contained production systems interfacing with a given scheduling component or portal, or directly uses the base component of the Grid middleware (globus-job-run, globus-job-submit). However, such systems...
    Go to contribution page
  24. R. Cavanaugh (UNIVERSITY OF FLORIDA)
    30/09/2004, 14:40
    Track 4 - Distributed Computing Services
    oral presentation
    A grid consists of high-end computational, storage, and network resources that, while known a priori, are dynamic with respect to activity and availability. Efficient co-scheduling of requests to use grid resources must adapt to this dynamic environment while meeting administrative policies. We discusses the necessary requirements of such a scheduler and introduce a distributed...
    Go to contribution page
  25. 30/09/2004, 15:00
    Track 4 - Distributed Computing Services
    oral presentation
    The R-GMA (Relational Grid Monitoring Architecture) was developed within the EU DataGrid project, to bring the power of SQL to an information and monitoring system for the grid. It provides producer and consumer services to both publish and retrieve information from anywhere within a grid environment. Users within a Virtual Organization may define their own tables dynamically into...
    Go to contribution page
  26. M. Sgaravatto (INFN Padova)
    30/09/2004, 15:20
    Track 4 - Distributed Computing Services
    oral presentation
    Resource management and scheduling of distributed, data-driven applications in a Grid environment are challenging problems. Although significant results were achieved in the past few years, the development and the proper deployment of generic, reliable, standard components present issues that still need to be completely solved. Interested domains include workload management,...
    Go to contribution page
  27. B K. Kim (UNIVERSITY OF FLORIDA), M. Mambelli (University of Chicago)
    30/09/2004, 15:40
    Track 4 - Distributed Computing Services
    oral presentation
    Grid computing involves the close coordination of many different sites which offer distinct computational and storage resources to the Grid user community. The resources at each site need to be monitored continuously. Static and dynamic site information need to be presented to the user community in a simple and efficient manner. This paper will present both the design and...
    Go to contribution page
  28. I. Legrand (CALTECH)
    30/09/2004, 16:30
    Track 4 - Distributed Computing Services
    oral presentation
    The MonALISA (MONitoring Agents in A Large Integrated Services Architecture) system is a scalable Dynamic Distributed Services Architecture which is based on the mobile code paradigm. An essential part of managing a global system, like the Grids, is a monitoring system that is able to monitor and track the many site facilities, networks, and all the task in progress, in real time....
    Go to contribution page
  29. N. De Bortoli (INFN - NAPLES (ITALY))
    30/09/2004, 16:50
    Track 4 - Distributed Computing Services
    oral presentation
    GridICE is a monitoring service for the Grid, it measures significant Grid related resources parameters in order to analyze usage, behavior and performance of the Grid and/or to detect and notify fault situations, contract violations, and user-defined events. In its first implementation, the notification service relies on a simple model based on a pre-defined set of events. The growing...
    Go to contribution page
  30. D. Smith (STANFORD LINEAR ACCELERATOR CENTER)
    30/09/2004, 17:10
    Track 4 - Distributed Computing Services
    oral presentation
    The BaBar experiment has migrated its event store from an objectivity based system to a system using ROOT-files, and along with this has developed a new bookkeeping design. This bookkeeping now combines data production, quality control, event store inventory, distribution of BaBar data to sites and user analysis in one central place, and is based on collections of data stored as...
    Go to contribution page
  31. M. Sanchez-Garcia (UNIVERSITY OF SANTIAGO DE COMPOSTELA)
    30/09/2004, 17:30
    Track 4 - Distributed Computing Services
    oral presentation
    The LHCb Data Challenge 04 includes the simulation of over 200 M simulated events using distributed computing resources on N sites and extending along 3 months. To achieve this goal a dedicated Production grid (DIRAC) has been deployed. We will present the Job Monitoring and Accounting services developed to follow the status of the production along its way and to evaluate the results at...
    Go to contribution page
  32. E. Efstathiadis (BROOKHAVEN NATIONAL LABORATORY)
    30/09/2004, 17:50
    Track 4 - Distributed Computing Services
    oral presentation
    As a PPDG cross-team joint project, we proposed to study, develop, implement and evaluate a set of tools that allow Meta-Schedulers to take advantage of consistent information (such as information needed for complex decision making mechanisms) across both local and/or Grid Resource Management Systems (RMS). We will present and define the requirements and schema by which one can...
    Go to contribution page
  33. A. TSAREGORODTSEV (CNRS-IN2P3-CPPM, MARSEILLE)
    30/09/2004, 18:10
    Track 4 - Distributed Computing Services
    oral presentation
    DIRAC is the LHCb distributed computing grid infrastructure for MC production and analysis. Its architecture is based on a set of distributed collaborating services. The service decomposition broadly follows the ARDA project proposal, allowing for the possibility of interchanging the EGEE/ARDA and DIRAC components in the future. Some components developed outside the DIRAC project are...
    Go to contribution page
Building timetable...