CHEP04

Name: CHEP04
Start: 2004-09-27T08:30:00+02:00
End: 2004-10-01T18:00:00+02:00
Location: Interlaken, Switzerland

27 September 2004 to 1 October 2004

Interlaken, Switzerland

Europe/Zurich timezone

Session

Distributed Computing Services

27 Sept 2004, 14:00

Interlaken, Switzerland

Distributed Computing Services

Conrad Steenberg (CalTech)

Distributed Computing Services

Rob Kennedy (FNAL)

Distributed Computing Services

O. Smirnova (Lund University, Sweden)

There are no materials yet.

142. Don Quijote - Data Management for the ATLAS Automatic Production System

M. Branco (CERN)

27/09/2004, 14:00

Track 4 - Distributed Computing Services

oral presentation

As part of the ATLAS Data Challenges 2 (DC2), an automatic production system was introduced and with it a new data management component. The data management tools used for previous Data Challenges were built as separate components from the existing Grid middleware. These tools relied on a database of its own which acted as a replica catalog. With the extensive use of Grid technology...
Go to contribution page
190. Managed Data Storage and Data Access Services for Data Grids

M. Ernst (DESY)

27/09/2004, 14:20

Track 4 - Distributed Computing Services

oral presentation

The LHC needs to achieve reliable high performance access to vastly distributed storage resources across the network. USCMS has worked with Fermilab-CD and DESY-IT on a storage service that was deployed at several sites. It provides Grid access to heterogeneous mass storage systems and synchronization between them. It increases resiliency by insulating clients from storage and network...
Go to contribution page
204. FroNtier: High Performance Database Access Using Standard Web Components in a Scalable Multi-tier Architecture

L. Lueking (FERMILAB)

27/09/2004, 14:40

Track 4 - Distributed Computing Services

oral presentation

A high performance system has been assembled using standard web components to deliver database information to a large number (thousands?) of broadly distributed clients. The CDF Experiment at Fermilab is building processing centers around the world imposing a high demand load on their database repository. For delivering read-only data, such as calibrations, trigger information and run...
Go to contribution page
218. On Distributed Database Deployment for the LHC Experiments

Dirk Duellmann

27/09/2004, 15:00

Track 4 - Distributed Computing Services

oral presentation

While there are differences among the LHC experiments in their views of the role of databases and their deployment, there is relatively widespread agreement on a number of principles: 1. Physics codes will need access to database-resident data. The need for database access is not confined to middleware and services: physics-related data will reside in databases. 2. ...
Go to contribution page
253. Experiences with Data Indexing services supported by the NorduGrid middleware

O. Smirnova (Lund University, Sweden)

27/09/2004, 15:20

Track 4 - Distributed Computing Services

oral presentation

The NorduGrid middleware, ARC, has integrated support for querying and registering to Data Indexing services such as the Globus Replica Catalog and Globus Replica Location Server. This support allows one to use these Data Indexing services for for example brokering during job-submission, automatic registration of files and many other things. This integrated support is complemented by a...
Go to contribution page
278. Evolution of LCG-2 Data Management

J-P. Baud (CERN)

27/09/2004, 15:40

Track 4 - Distributed Computing Services

oral presentation

LCG-2 is the collective name for the set of middleware released for use on the LHC Computing Grid in December 2003. This middleware, based on LCG-1, had already several improvements in the Data Management area. These included the introduction of the Grid File Access Library(GFAL), a POSIX-like I/O Interface, along with MSS integration via the Storage Resource...
Go to contribution page
328. The Next Generation Root File Server

A. Hanushevsky (SLAC)

27/09/2004, 16:30

Track 4 - Distributed Computing Services

oral presentation

As the BaBar experiment shifted its computing model to a ROOT-based framework, we undertook the development of a high-performance file server as the basis for a fault-tolerant storage environment whose ultimate goal was to minimize job failures due to server failures. Capitalizing on our five years of experience with extending Objectivity's Advanced Multithreaded Server (AMS), elements...
Go to contribution page
334. Production mode Data-Replication framework in STAR using the HRM Grid

E. Hjort (LAWRENCE BERKELEY LABORATORY)

27/09/2004, 16:50

Track 4 - Distributed Computing Services

oral presentation

The STAR experiment utilizes two major computing facilities for its data processing needs - the RCF at Brookhaven and the PDSF at LBNL/NERSC. The sharing of data between these facilities utilizes data grid services for file replication, and the deployment of these services was accomplished in conjunction with the Particle Physics Data Grid (PPDG). For STAR's 2004 run it will be...
Go to contribution page
345. Storage Resource Managers at Brookhaven

Ofer RIND

27/09/2004, 17:10

Track 4 - Distributed Computing Services

oral presentation

Providing Grid applications with effective access to large volumes of data residing on a multitude of storage systems with very different characteristics prompted the introduction of storage resource managers (SRM). Their purpose is to provide consistent and efficient wide-area access to storage resources unconstrained by their particular implementation (tape, large disk arrays,...
Go to contribution page
392. File-Metadata Management System for the LHCb Experiment

C. CIOFFI (Oxford University)

27/09/2004, 17:30

Track 4 - Distributed Computing Services

oral presentation

The LHCb experiment needs to store all the information about the datasets and their processing history of recorded data resulting from particle collisions at the LHC collider at CERN as well as of simulated data. To achieve this functionality a design based on data warehousing techniques was chosen, where several user-services can be implemented and optimized individually without...
Go to contribution page
414. Data Management in EGEE

K. Nienartowicz (CERN)

27/09/2004, 17:50

Track 4 - Distributed Computing Services

oral presentation

Data management is one of the cornerstones in the distributed production computing environment that the EGEE project aims to provide for a European e-Science infrastructure. We have designed a set of services based on previous experience in other Grid projects, trying to address the requirements of our user communities. In this paper we summarize the most fundamental requirements and...
Go to contribution page
460. SAMGrid Integration of SRMs

R. Kennedy (FERMI NATIONAL ACCELERATOR LABORATORY)

27/09/2004, 18:10

Track 4 - Distributed Computing Services

oral presentation

SAMGrid is the shared data handling framework of the two large Fermilab Run II collider experiments: DZero and CDF. In production since 1999 at D0, and since mid-2004 at CDF, the SAMGrid framework has been adapted over time to accommodate a variety of storage solutions and configurations, as well as the differing data processing models of these two experiments. This has been...
Go to contribution page
383. Experience with POOL from the LCG Data Challenges of the three LHC experiments

Maria Girone

29/09/2004, 14:00

Track 4 - Distributed Computing Services

oral presentation

This presentation will summarise the deployment experience gained with POOL during the first larger LHC experiments data challenges performed. In particular we discuss the storage access performance and optimisations, the integration issues with grid middleware services such as the LCG Replica Location Service (RLS) and the LCG Replica Manager and experience with the POOL proposed...
Go to contribution page
247. Middleware for the next generation Grid infrastructure

E. Laure (CERN)

29/09/2004, 14:20

Track 4 - Distributed Computing Services

oral presentation

The aim of the EGEE (Enabling Grids for E-Science in Europe) is to create a reliable and dependable European Grid infrastructure for e-Science. The objective of the Middleware Re-engineering and Integration Research Activity is to provide robust middleware components, deployable on several platforms and operating systems, corresponding to the core Grid services for resource access, data...
Go to contribution page
184. The Clarens Grid-enabled Web Services Framework: Services and Implementation

C. Steenberg (California Institute of Technology)

29/09/2004, 14:40

Track 4 - Distributed Computing Services

oral presentation

Clarens enables distributed, secure and high-performance access to the worldwide data storage, compute, and information Grids being constructed in anticipation of the needs of the Large Hadron Collider at CERN. We report on the rapid progress in the development of a second server implementation in the Java language, the evolution of a peer-to-peer network of Clarens servers, and general...
Go to contribution page
305. Experiences with the gLite Grid Middleware

Birger KOBLITZ (CERN)

29/09/2004, 15:00

Track 4 - Distributed Computing Services

oral presentation

The ARDA project was started in April 2004 to support the four LHC experiments (ALICE, ATLAS, CMS and LHCb) in the implementation of individual production and analysis environments based on the EGEE middleware. The main goal of the project is to allow a fast feedback between the experiment and the middleware development teams via the construction and the usage of end-to-end...
Go to contribution page
430. Global Distributed Parallel Analysis using PROOF and AliEn

F. Rademakers (CERN)

29/09/2004, 15:20

Track 4 - Distributed Computing Services

oral presentation

The ALICE experiment and the ROOT team have developed a Grid-enabled version of PROOF that allows efficient parallel processing of large and distributed data samples. This system has been integrated with the ALICE-developed AliEn middleware. Parallelism is implemented at the level of each local cluster for efficient processing and at the Grid level, for optimal workload management of...
Go to contribution page
162. Software agents in data and workflow management

T. Barrass (CMS, UNIVERSITY OF BRISTOL)

29/09/2004, 15:40

Track 4 - Distributed Computing Services

oral presentation

CMS currently uses a number of tools to transfer data which, taken together, form the basis of a heterogenous datagrid. The range of tools used, and the directed, rather than optimised nature of CMS recent large scale data challenge required the creation of a simple infrastructure that allowed a range of tools to operate in a complementary way. The system created comprises a...
Go to contribution page
500. Housing Metadata for the Common Physicist Using a Relational Database

29/09/2004, 16:30

Track 4 - Distributed Computing Services

oral presentation

SAM was developed as a data handling system for Run II at Fermilab. SAM is a collection of services, each described by metadata. The metadata are modeled on a relational database, and implemented in ORACLE. SAM, originally deployed in production for the D0 Run II experiment, has now been also deployed at CDF and is being commissioned at MINOS. This illustrates that the metadata...
Go to contribution page
196. Lattice QCD Data and Metadata Archives at Fermilab and the International Lattice Data Grid

E. Neilsen (FERMI NATIONAL ACCELERATOR LABORATORY)

29/09/2004, 16:50

Track 4 - Distributed Computing Services

oral presentation

The lattice gauge theory community produces large volumes of data. Because the data produced by completed computations form the basis for future work, the maintenance of archives of existing data and metadata describing the provenance, generation parameters, and derived characteristics of that data is essential not only as a reference, but also as a basis for future work. Development of...
Go to contribution page
536. Huge Memory systems for data-intensive science

Richard Mount (SLAC)

29/09/2004, 17:10

Track 4 - Distributed Computing Services

oral presentation
69. Resource Predictors in HEP Applications

30/09/2004, 14:00

Track 4 - Distributed Computing Services

oral presentation

The ATLAS experiment uses a tiered data Grid architecture that enables possibly overlapping subsets, or replicas, of the original set to be located across the ATLAS collaboration. The full set of experiment data is located at a single Tier 0 site, and then subsets of the data are located at national Tier 1 sites, smaller subsets at smaller regional Tier 2 sites, and so on. In order to...
Go to contribution page
318. The STAR Unifid Meta-Scheduler project, a front end around evolving technologies for user analysis and data production.

Jerome LAURET (BROOKHAVEN NATIONAL LABORATORY)

30/09/2004, 14:20

Track 4 - Distributed Computing Services

oral presentation

While many success stories can be told as a product of the Grid middleware developments, most of the existing systems relying on workflow and job execution are based on integration of self-contained production systems interfacing with a given scheduling component or portal, or directly uses the base component of the Grid middleware (globus-job-run, globus-job-submit). However, such systems...
Go to contribution page
321. SPHINX: A Scheduling Middleware for Data Intensive Applications on a Grid

R. Cavanaugh (UNIVERSITY OF FLORIDA)

30/09/2004, 14:40

Track 4 - Distributed Computing Services

oral presentation

A grid consists of high-end computational, storage, and network resources that, while known a priori, are dynamic with respect to activity and availability. Efficient co-scheduling of requests to use grid resources must adapt to this dynamic environment while meeting administrative policies. We discusses the necessary requirements of such a scheduler and introduce a distributed...
Go to contribution page
417. Information and Monitoring Services within a Grid Environment

30/09/2004, 15:00

Track 4 - Distributed Computing Services

oral presentation

The R-GMA (Relational Grid Monitoring Architecture) was developed within the EU DataGrid project, to bring the power of SQL to an information and monitoring system for the grid. It provides producer and consumer services to both publish and retrieve information from anywhere within a grid environment. Users within a Virtual Organization may define their own tables dynamically into...
Go to contribution page
420. Pratical approaches to Grid workload and resource management in the EGEE project

M. Sgaravatto (INFN Padova)

30/09/2004, 15:20

Track 4 - Distributed Computing Services

oral presentation

Resource management and scheduling of distributed, data-driven applications in a Grid environment are challenging problems. Although significant results were achieved in the past few years, the development and the proper deployment of generic, reliable, standard components present issues that still need to be completely solved. Interested domains include workload management,...
Go to contribution page
490. Grid2003 Monitoring, Metrics, and Grid Cataloging System

B K. Kim (UNIVERSITY OF FLORIDA), M. Mambelli (University of Chicago)

30/09/2004, 15:40

Track 4 - Distributed Computing Services

oral presentation

Grid computing involves the close coordination of many different sites which offer distinct computational and storage resources to the Grid user community. The resources at each site need to be monitored continuously. Static and dynamic site information need to be presented to the user community in a simple and efficient manner. This paper will present both the design and...
Go to contribution page
89. MonALISA: An Agent Based, Dynamic Service System to Monitor, Control and Optimize Grid based Applications.

I. Legrand (CALTECH)

30/09/2004, 16:30

Track 4 - Distributed Computing Services

oral presentation

The MonALISA (MONitoring Agents in A Large Integrated Services Architecture) system is a scalable Dynamic Distributed Services Architecture which is based on the mobile code paradigm. An essential part of managing a global system, like the Grids, is a monitoring system that is able to monitor and track the many site facilities, networks, and all the task in progress, in real time....
Go to contribution page
274. Design and Implementation of a Notification Model for Grid Monitoring Events

N. De Bortoli (INFN - NAPLES (ITALY))

30/09/2004, 16:50

Track 4 - Distributed Computing Services

oral presentation

GridICE is a monitoring service for the Grid, it measures significant Grid related resources parameters in order to analyze usage, behavior and performance of the Grid and/or to detect and notify fault situations, contract violations, and user-defined events. In its first implementation, the notification service relies on a simple model based on a pre-defined set of events. The growing...
Go to contribution page
338. BaBar Bookkeeping - a distributed meta-data catalog of the BaBar event store.

D. Smith (STANFORD LINEAR ACCELERATOR CENTER)

30/09/2004, 17:10

Track 4 - Distributed Computing Services

oral presentation

The BaBar experiment has migrated its event store from an objectivity based system to a system using ROOT-files, and along with this has developed a new bookkeeping design. This bookkeeping now combines data production, quality control, event store inventory, distribution of BaBar data to sites and user analysis in one central place, and is based on collections of data stored as...
Go to contribution page
388. A Lightweight Monitoring and Accounting System for LHCb DC'04 Production

M. Sanchez-Garcia (UNIVERSITY OF SANTIAGO DE COMPOSTELA)

30/09/2004, 17:30

Track 4 - Distributed Computing Services

oral presentation

The LHCb Data Challenge 04 includes the simulation of over 200 M simulated events using distributed computing resources on N sites and extending along 3 months. To achieve this goal a dedicated Production grid (DIRAC) has been deployed. We will present the Job Monitoring and Accounting services developed to follow the status of the production along its way and to evaluate the results at...
Go to contribution page
393. Development and use of MonALISA high level monitoring services for the star unified Meta-Scheduler

E. Efstathiadis (BROOKHAVEN NATIONAL LABORATORY)

30/09/2004, 17:50

Track 4 - Distributed Computing Services

oral presentation

As a PPDG cross-team joint project, we proposed to study, develop, implement and evaluate a set of tools that allow Meta-Schedulers to take advantage of consistent information (such as information needed for complex decision making mechanisms) across both local and/or Grid Resource Management Systems (RMS). We will present and define the requirements and schema by which one can...
Go to contribution page
377. DIRAC - The Distributed MC Production and Analysis for LHCb

A. TSAREGORODTSEV (CNRS-IN2P3-CPPM, MARSEILLE)

30/09/2004, 18:10

Track 4 - Distributed Computing Services

oral presentation

DIRAC is the LHCb distributed computing grid infrastructure for MC production and analysis. Its architecture is based on a set of distributed collaborating services. The service decomposition broadly follows the ARDA project proposal, allowing for the possibility of interchanging the EGEE/ARDA and DIRAC components in the future. Some components developed outside the DIRAC project are...
Go to contribution page

Building timetable...

CHEP04

Session

Distributed Computing Services

Interlaken, Switzerland

Conveners

Distributed Computing Services

Distributed Computing Services

Distributed Computing Services

Presentation materials

Choose timezone

CHEP04

Conveners

Distributed Computing Services

Distributed Computing Services

Distributed Computing Services

Presentation materials