Conveners
Distributed Computing Services
- Conrad Steenberg (CalTech)
Distributed Computing Services
- Rob Kennedy (FNAL)
Distributed Computing Services
- O. Smirnova (Lund University, Sweden)
-
M. Branco (CERN)27/09/2004, 14:00Track 4 - Distributed Computing Servicesoral presentationAs part of the ATLAS Data Challenges 2 (DC2), an automatic production system was introduced and with it a new data management component. The data management tools used for previous Data Challenges were built as separate components from the existing Grid middleware. These tools relied on a database of its own which acted as a replica catalog. With the extensive use of Grid technology...Go to contribution page
-
M. Ernst (DESY)27/09/2004, 14:20Track 4 - Distributed Computing Servicesoral presentationThe LHC needs to achieve reliable high performance access to vastly distributed storage resources across the network. USCMS has worked with Fermilab-CD and DESY-IT on a storage service that was deployed at several sites. It provides Grid access to heterogeneous mass storage systems and synchronization between them. It increases resiliency by insulating clients from storage and network...Go to contribution page
-
L. Lueking (FERMILAB)27/09/2004, 14:40Track 4 - Distributed Computing Servicesoral presentationA high performance system has been assembled using standard web components to deliver database information to a large number (thousands?) of broadly distributed clients. The CDF Experiment at Fermilab is building processing centers around the world imposing a high demand load on their database repository. For delivering read-only data, such as calibrations, trigger information and run...Go to contribution page
-
Dirk Duellmann27/09/2004, 15:00Track 4 - Distributed Computing Servicesoral presentationWhile there are differences among the LHC experiments in their views of the role of databases and their deployment, there is relatively widespread agreement on a number of principles: 1. Physics codes will need access to database-resident data. The need for database access is not confined to middleware and services: physics-related data will reside in databases. 2. ...Go to contribution page
-
O. Smirnova (Lund University, Sweden)27/09/2004, 15:20Track 4 - Distributed Computing Servicesoral presentationThe NorduGrid middleware, ARC, has integrated support for querying and registering to Data Indexing services such as the Globus Replica Catalog and Globus Replica Location Server. This support allows one to use these Data Indexing services for for example brokering during job-submission, automatic registration of files and many other things. This integrated support is complemented by a...Go to contribution page
-
J-P. Baud (CERN)27/09/2004, 15:40Track 4 - Distributed Computing Servicesoral presentationLCG-2 is the collective name for the set of middleware released for use on the LHC Computing Grid in December 2003. This middleware, based on LCG-1, had already several improvements in the Data Management area. These included the introduction of the Grid File Access Library(GFAL), a POSIX-like I/O Interface, along with MSS integration via the Storage Resource...Go to contribution page
-
A. Hanushevsky (SLAC)27/09/2004, 16:30Track 4 - Distributed Computing Servicesoral presentationAs the BaBar experiment shifted its computing model to a ROOT-based framework, we undertook the development of a high-performance file server as the basis for a fault-tolerant storage environment whose ultimate goal was to minimize job failures due to server failures. Capitalizing on our five years of experience with extending Objectivity's Advanced Multithreaded Server (AMS), elements...Go to contribution page
-
E. Hjort (LAWRENCE BERKELEY LABORATORY)27/09/2004, 16:50Track 4 - Distributed Computing Servicesoral presentationThe STAR experiment utilizes two major computing facilities for its data processing needs - the RCF at Brookhaven and the PDSF at LBNL/NERSC. The sharing of data between these facilities utilizes data grid services for file replication, and the deployment of these services was accomplished in conjunction with the Particle Physics Data Grid (PPDG). For STAR's 2004 run it will be...Go to contribution page
-
Ofer RIND27/09/2004, 17:10Track 4 - Distributed Computing Servicesoral presentationProviding Grid applications with effective access to large volumes of data residing on a multitude of storage systems with very different characteristics prompted the introduction of storage resource managers (SRM). Their purpose is to provide consistent and efficient wide-area access to storage resources unconstrained by their particular implementation (tape, large disk arrays,...Go to contribution page
-
C. CIOFFI (Oxford University)27/09/2004, 17:30Track 4 - Distributed Computing Servicesoral presentationThe LHCb experiment needs to store all the information about the datasets and their processing history of recorded data resulting from particle collisions at the LHC collider at CERN as well as of simulated data. To achieve this functionality a design based on data warehousing techniques was chosen, where several user-services can be implemented and optimized individually without...Go to contribution page
-
K. Nienartowicz (CERN)27/09/2004, 17:50Track 4 - Distributed Computing Servicesoral presentationData management is one of the cornerstones in the distributed production computing environment that the EGEE project aims to provide for a European e-Science infrastructure. We have designed a set of services based on previous experience in other Grid projects, trying to address the requirements of our user communities. In this paper we summarize the most fundamental requirements and...Go to contribution page
-
R. Kennedy (FERMI NATIONAL ACCELERATOR LABORATORY)27/09/2004, 18:10Track 4 - Distributed Computing Servicesoral presentationSAMGrid is the shared data handling framework of the two large Fermilab Run II collider experiments: DZero and CDF. In production since 1999 at D0, and since mid-2004 at CDF, the SAMGrid framework has been adapted over time to accommodate a variety of storage solutions and configurations, as well as the differing data processing models of these two experiments. This has been...Go to contribution page
-
Maria Girone29/09/2004, 14:00Track 4 - Distributed Computing Servicesoral presentationThis presentation will summarise the deployment experience gained with POOL during the first larger LHC experiments data challenges performed. In particular we discuss the storage access performance and optimisations, the integration issues with grid middleware services such as the LCG Replica Location Service (RLS) and the LCG Replica Manager and experience with the POOL proposed...Go to contribution page
-
E. Laure (CERN)29/09/2004, 14:20Track 4 - Distributed Computing Servicesoral presentationThe aim of the EGEE (Enabling Grids for E-Science in Europe) is to create a reliable and dependable European Grid infrastructure for e-Science. The objective of the Middleware Re-engineering and Integration Research Activity is to provide robust middleware components, deployable on several platforms and operating systems, corresponding to the core Grid services for resource access, data...Go to contribution page
-
C. Steenberg (California Institute of Technology)29/09/2004, 14:40Track 4 - Distributed Computing Servicesoral presentationClarens enables distributed, secure and high-performance access to the worldwide data storage, compute, and information Grids being constructed in anticipation of the needs of the Large Hadron Collider at CERN. We report on the rapid progress in the development of a second server implementation in the Java language, the evolution of a peer-to-peer network of Clarens servers, and general...Go to contribution page
-
Birger KOBLITZ (CERN)29/09/2004, 15:00Track 4 - Distributed Computing Servicesoral presentationThe ARDA project was started in April 2004 to support the four LHC experiments (ALICE, ATLAS, CMS and LHCb) in the implementation of individual production and analysis environments based on the EGEE middleware. The main goal of the project is to allow a fast feedback between the experiment and the middleware development teams via the construction and the usage of end-to-end...Go to contribution page
-
F. Rademakers (CERN)29/09/2004, 15:20Track 4 - Distributed Computing Servicesoral presentationThe ALICE experiment and the ROOT team have developed a Grid-enabled version of PROOF that allows efficient parallel processing of large and distributed data samples. This system has been integrated with the ALICE-developed AliEn middleware. Parallelism is implemented at the level of each local cluster for efficient processing and at the Grid level, for optimal workload management of...Go to contribution page
-
T. Barrass (CMS, UNIVERSITY OF BRISTOL)29/09/2004, 15:40Track 4 - Distributed Computing Servicesoral presentationCMS currently uses a number of tools to transfer data which, taken together, form the basis of a heterogenous datagrid. The range of tools used, and the directed, rather than optimised nature of CMS recent large scale data challenge required the creation of a simple infrastructure that allowed a range of tools to operate in a complementary way. The system created comprises a...Go to contribution page
-
29/09/2004, 16:30Track 4 - Distributed Computing Servicesoral presentationSAM was developed as a data handling system for Run II at Fermilab. SAM is a collection of services, each described by metadata. The metadata are modeled on a relational database, and implemented in ORACLE. SAM, originally deployed in production for the D0 Run II experiment, has now been also deployed at CDF and is being commissioned at MINOS. This illustrates that the metadata...Go to contribution page
-
E. Neilsen (FERMI NATIONAL ACCELERATOR LABORATORY)29/09/2004, 16:50Track 4 - Distributed Computing Servicesoral presentationThe lattice gauge theory community produces large volumes of data. Because the data produced by completed computations form the basis for future work, the maintenance of archives of existing data and metadata describing the provenance, generation parameters, and derived characteristics of that data is essential not only as a reference, but also as a basis for future work. Development of...Go to contribution page
-
Richard Mount (SLAC)29/09/2004, 17:10Track 4 - Distributed Computing Servicesoral presentation
-
30/09/2004, 14:00Track 4 - Distributed Computing Servicesoral presentationThe ATLAS experiment uses a tiered data Grid architecture that enables possibly overlapping subsets, or replicas, of the original set to be located across the ATLAS collaboration. The full set of experiment data is located at a single Tier 0 site, and then subsets of the data are located at national Tier 1 sites, smaller subsets at smaller regional Tier 2 sites, and so on. In order to...Go to contribution page
-
Jerome LAURET (BROOKHAVEN NATIONAL LABORATORY)30/09/2004, 14:20Track 4 - Distributed Computing Servicesoral presentationWhile many success stories can be told as a product of the Grid middleware developments, most of the existing systems relying on workflow and job execution are based on integration of self-contained production systems interfacing with a given scheduling component or portal, or directly uses the base component of the Grid middleware (globus-job-run, globus-job-submit). However, such systems...Go to contribution page
-
R. Cavanaugh (UNIVERSITY OF FLORIDA)30/09/2004, 14:40Track 4 - Distributed Computing Servicesoral presentationA grid consists of high-end computational, storage, and network resources that, while known a priori, are dynamic with respect to activity and availability. Efficient co-scheduling of requests to use grid resources must adapt to this dynamic environment while meeting administrative policies. We discusses the necessary requirements of such a scheduler and introduce a distributed...Go to contribution page
-
30/09/2004, 15:00Track 4 - Distributed Computing Servicesoral presentationThe R-GMA (Relational Grid Monitoring Architecture) was developed within the EU DataGrid project, to bring the power of SQL to an information and monitoring system for the grid. It provides producer and consumer services to both publish and retrieve information from anywhere within a grid environment. Users within a Virtual Organization may define their own tables dynamically into...Go to contribution page
-
M. Sgaravatto (INFN Padova)30/09/2004, 15:20Track 4 - Distributed Computing Servicesoral presentationResource management and scheduling of distributed, data-driven applications in a Grid environment are challenging problems. Although significant results were achieved in the past few years, the development and the proper deployment of generic, reliable, standard components present issues that still need to be completely solved. Interested domains include workload management,...Go to contribution page
-
B K. Kim (UNIVERSITY OF FLORIDA), M. Mambelli (University of Chicago)30/09/2004, 15:40Track 4 - Distributed Computing Servicesoral presentationGrid computing involves the close coordination of many different sites which offer distinct computational and storage resources to the Grid user community. The resources at each site need to be monitored continuously. Static and dynamic site information need to be presented to the user community in a simple and efficient manner. This paper will present both the design and...Go to contribution page
-
I. Legrand (CALTECH)30/09/2004, 16:30Track 4 - Distributed Computing Servicesoral presentationThe MonALISA (MONitoring Agents in A Large Integrated Services Architecture) system is a scalable Dynamic Distributed Services Architecture which is based on the mobile code paradigm. An essential part of managing a global system, like the Grids, is a monitoring system that is able to monitor and track the many site facilities, networks, and all the task in progress, in real time....Go to contribution page
-
N. De Bortoli (INFN - NAPLES (ITALY))30/09/2004, 16:50Track 4 - Distributed Computing Servicesoral presentationGridICE is a monitoring service for the Grid, it measures significant Grid related resources parameters in order to analyze usage, behavior and performance of the Grid and/or to detect and notify fault situations, contract violations, and user-defined events. In its first implementation, the notification service relies on a simple model based on a pre-defined set of events. The growing...Go to contribution page
-
D. Smith (STANFORD LINEAR ACCELERATOR CENTER)30/09/2004, 17:10Track 4 - Distributed Computing Servicesoral presentationThe BaBar experiment has migrated its event store from an objectivity based system to a system using ROOT-files, and along with this has developed a new bookkeeping design. This bookkeeping now combines data production, quality control, event store inventory, distribution of BaBar data to sites and user analysis in one central place, and is based on collections of data stored as...Go to contribution page
-
M. Sanchez-Garcia (UNIVERSITY OF SANTIAGO DE COMPOSTELA)30/09/2004, 17:30Track 4 - Distributed Computing Servicesoral presentationThe LHCb Data Challenge 04 includes the simulation of over 200 M simulated events using distributed computing resources on N sites and extending along 3 months. To achieve this goal a dedicated Production grid (DIRAC) has been deployed. We will present the Job Monitoring and Accounting services developed to follow the status of the production along its way and to evaluate the results at...Go to contribution page
-
E. Efstathiadis (BROOKHAVEN NATIONAL LABORATORY)30/09/2004, 17:50Track 4 - Distributed Computing Servicesoral presentationAs a PPDG cross-team joint project, we proposed to study, develop, implement and evaluate a set of tools that allow Meta-Schedulers to take advantage of consistent information (such as information needed for complex decision making mechanisms) across both local and/or Grid Resource Management Systems (RMS). We will present and define the requirements and schema by which one can...Go to contribution page
-
A. TSAREGORODTSEV (CNRS-IN2P3-CPPM, MARSEILLE)30/09/2004, 18:10Track 4 - Distributed Computing Servicesoral presentationDIRAC is the LHCb distributed computing grid infrastructure for MC production and analysis. Its architecture is based on a set of distributed collaborating services. The service decomposition broadly follows the ARDA project proposal, allowing for the possibility of interchanging the EGEE/ARDA and DIRAC components in the future. Some components developed outside the DIRAC project are...Go to contribution page