Homer Neal
(University of Michigan (US))
28/10/2013, 09:00
Miscellaneous
Dr
Shawn Mc Kee
(University of Michigan (US))
28/10/2013, 09:20
Miscellaneous
Dr
Dorian Kcira
(California Institute of Technology (US))
28/10/2013, 10:00
The Caltech Tier2 is a major site providing substantial and reliable computational and storage resources to CMS, combining production processing of simulated events, support for US CMS physics analysis, and computing, software systems, and network developments. Caltech continues to lead several key areas of LHC computing and software aimed at enabling grid-based data analysis, as well as...
Mr
Benjeman Meekhof
(University of Michigan)
28/10/2013, 10:15
Fall 2013 ATLAS Great Lakes Tier 2 site report covering recent network updates, work with AFS on ZFS, new provisioning with cobbler and cfengine, recent experiences with dCache issues, and the usual statistics/status information.
Dr
Ofer Rind
(Brookhaven National Laboratory), Dr
Tony Wong
(Brookhaven National Laboratory)
28/10/2013, 11:00
Brookhaven National Lab (BNL) will present the site report for the RHIC-ATLAS Computing Facility (RACF).
Erik Mattias Wadenstein
(Unknown)
28/10/2013, 11:15
Overview of recent developments in the distributed NDGF Tier1. Might include a closer look at running ATLAS computing on a couple of different HPC resources.
Wolfgang Friebel
(Deutsches Elektronen-Synchrotron (DE))
28/10/2013, 11:30
Fall 2013 DESY Site report
Ajit Kumar Mohapatra
(University of Wisconsin (US))
28/10/2013, 12:00
As a major WLCG/OSG T2 site, the University of Wisconsin Madison CMS T2 has provided very productive and reliable services for CMS Monte Carlo production/processing and large-scale global CMS physics analysis, using high throughput computing, a highly available storage system, and scalable distributed software systems. The close integration of the CMS specific T2 resources with that of the UW...
Ulf Tigerstedt
(CSC Oy)
28/10/2013, 14:00
In early 2013, CSC (the IT centre for science in Finland) powered up its new datacentre in Kajaani, 600 km north of its offices. The datacentre focuses on energy efficiency and cost-cutting, but the design and implementation were not easy.
Dr
Tony Wong
(Brookhaven National Laboratory)
28/10/2013, 14:25
The advent of cloud computing centers such as Amazon's EC2 and Google's Compute Engine has elicited comparisons with dedicated computing clusters. Discussions on appropriate usage of cloud resources (both academic and commercial) and costs have ensued. This presentation discusses a detailed analysis of the costs of operating and maintaining the RACF (RHIC and ATLAS Computing Facility)...
Dr
Tony Wong
(Brookhaven National Laboratory)
28/10/2013, 14:50
We describe a recent rack installation incident at the RACF and its effects on facility operations. A draft proposal to address safety issues will also be discussed.
Massimo Paladin
(CERN)
28/10/2013, 15:40
Basic IT Services
Jan Engels
(Deutsches Elektronen-Synchrotron (DE))
28/10/2013, 16:00
Starting in 2012, DESY has been extending its IT systems infrastructure to make use of the Puppet configuration management system developed by Puppet Labs. The main focus of this talk is to share the experience gained in this program of work and to summarize the current status and outlook of the Puppet infrastructure at the DESY site.
Andreas Petzold
(KIT - Karlsruhe Institute of Technology (DE))
29/10/2013, 09:00
Current status and latest news at GridKa, e.g.:
- Hardware status
- Storage systems
- Batch system
Sandy Philpott
(JLAB)
29/10/2013, 09:15
An update of high performance and scientific computing activities since the Spring 2012 meeting.
Dr
Chris Brew
(STFC - Science & Technology Facilities Council (GB))
29/10/2013, 09:30
An update from the UK GridPP Tier 2s
Dr
Michele Michelotto
(Universita e INFN (IT))
29/10/2013, 11:00
I have started measuring power consumption while running the HEP-SPEC06 benchmark. A few slides also cover the move from SL5 to SL6.
William Strecker-Kellogg
(Brookhaven National Lab)
29/10/2013, 14:00
Scheduling jobs with heterogeneous resource requirements to a pool of
computers with heterogeneous resources is a challenging task and Condor
is beginning to tackle this in the most generic form. Integrating
so-called partitionable slots (a batch resource able to be sliced along
a variety of dimensions, from RAM to CPUs, to disks or even GPUs) with
the rest of Condor's accounting and...
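As a rough illustration of the partitionable-slot idea mentioned in this abstract, the sketch below models a machine's unclaimed resources and carves a job-sized piece out of them. The class name, field names, and `carve` method are hypothetical and are not part of Condor's API; the sketch only shows the bookkeeping of slicing one resource pool along several dimensions.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class PartitionableSlot:
    """Hypothetical model of a machine's unclaimed resources (not a Condor API)."""
    cpus: int
    memory_mb: int
    disk_mb: int
    gpus: int = 0

    def carve(self, **request: int) -> Optional[dict]:
        """Subtract the requested resources and return them, or None if they do not fit."""
        known = {f.name for f in fields(self)}
        unknown = set(request) - known
        if unknown:
            raise ValueError(f"unknown resource(s): {sorted(unknown)}")
        # Reject the whole request if any single dimension is short.
        if any(getattr(self, name) < amount for name, amount in request.items()):
            return None
        # Otherwise shrink the remaining pool along every requested dimension.
        for name, amount in request.items():
            setattr(self, name, getattr(self, name) - amount)
        return dict(request)


slot = PartitionableSlot(cpus=8, memory_mb=16000, disk_mb=100000, gpus=1)
small_job = slot.carve(cpus=2, memory_mb=4000)   # fits: pool shrinks
gpu_heavy = slot.carve(cpus=1, gpus=2)           # does not fit: pool unchanged
```

A request that does not fit leaves the pool untouched, so a scheduler built on this toy model could simply try the next job in its queue.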
Andrew David Lahiff
(STFC - Science & Technology Facilities Council (GB))
29/10/2013, 14:30
The RAL Tier 1 maintains a batch farm with close to 10000 job slots that is used by all the LHC VOs as well as a number of smaller users. We have increasingly found that our existing batch system is unable to cope with the demands placed on it by our users. During the past year work has been carried out evaluating alternative technologies to our existing Torque/Maui batch system and preparing...
Sebastien Ceuterickx
(CERN)
29/10/2013, 16:00
With the dramatic increase of wireless-capable devices, Wi-Fi connectivity has become an essential network service on a par with the traditional cabled network. The evolution of the CERN Wi-Fi infrastructure will be presented, including the BYOD strategy and the integration of eduroam. The deployment considerations for large conference rooms and underground facilities will also be addressed.
Sebastien Ceuterickx
(CERN)
29/10/2013, 16:30
The latest changes to the CERN network infrastructure will be presented. This includes the deployment of IPv6, the network connectivity for the Data Centre extension at Wigner and the upgrade of the network infrastructure for Business Continuity. The second part of the talk will give an overview of the implementation of a new, safety-related wireless network (TETRA).
Shawn Mc Kee
(University of Michigan (US))
29/10/2013, 17:00
The WLCG infrastructure has evolved from its original restrictive network topology, based on the MONARC model, to a more interconnected system, where data movement between regions or countries does not necessarily need to involve T1 centers. While this evolution brought obvious advantages, especially in terms of flexibility for the LHC experiments' data management systems, it also raises the...
Dave Kelsey
(STFC - Science & Technology Facilities Council (GB))
30/10/2013, 09:00
An update on the activities of the group in IPv6 testing and planning since the Bologna meeting.
Mr
Romain Wartel
(CERN)
30/10/2013, 09:30
This presentation provides an update of the security landscape since the last meeting. It describes the main vectors of compromise in the academic community and presents interesting recent attacks. It also covers security risk management in general, as well as the security aspects of the current hot topics in computing, for example identity federation and virtualisation.
Bob Cowles
(Indiana University / CACR)
30/10/2013, 10:00
Scientific collaborations are evolving to a model where large, multi-dimensional data sets are analyzed in whole or in part by relatively small groups of researchers. These groups are often without the expertise and/or resources to develop and maintain a sophisticated IT infrastructure and represent the growing "long tail of science". The presentation will discuss the evolving structures and...
Dave Kelsey
(STFC - Science & Technology Facilities Council (GB))
30/10/2013, 11:00
There is much activity in the area of identity management for research communities. This talk will present the current status of this work and explore possible future options for WLCG and HEP more generally.
Kevin Hill
30/10/2013, 11:30
The Open Science Grid (OSG) has undergone some changes in its authentication model for user job submission. This talk will outline changes already implemented as well as future plans as they stand now, together with the use cases that motivated them.
Mr
Gabriele Carcassi
(Brookhaven National Laboratory (US))
30/10/2013, 14:00
When investigating a problem, one typically needs to gather and correlate information from disparate sources: operating system, batch system, data transfer tools, and so on. We investigate the use of Control System Studio to gather information from multiple places. When the appropriate hooks are created, this should allow the end user to create ad-hoc ways to mix and match data without the...
Vitor Emanuel Gomes Gouveia
(CERN)
30/10/2013, 15:00
The life cycle of ELFms (the Extremely Large Fabric management system) is reaching its end. This set of tools, provided to manage machines in the CERN Computer Centre, has reached end-of-life, and a new Configuration Management System is going to take its place.
The new Configuration Management System is going to change drastically the way we manage machines in the CERN Computer Centre...
Edward Simmonds
(Fermilab),
Tyler Parsons
(Fermilab)
30/10/2013, 16:00
Puppet has been in use by the Fermilab Experiments Facilities department to support computing for a variety of experiments for the last several years. This presentation will discuss our experience deploying, refining, and upgrading Puppet to scale to thousands of systems, across different experiments, servers, batch nodes, and workstations. This presentation will describe our efforts to...
Timothy Michael Skirvin
(F)
30/10/2013, 16:25
Over the past year, the USCMS-T1 project has decided to jump head-first into Puppet as our primary configuration management tool. We would like to talk about what's worked, what hasn't worked, and how we've been able to work with the other Fermilab teams to share our experiences without necessarily sharing a code base.
Fernando Moreno Pascual
(CERN)
30/10/2013, 16:50
The CERN Telephone service is moving towards unified communications.
New IP Phone devices connected to the Lync IP Phone Service enhance classic telephony by adding many features such as IM, presence, voice mailbox, call delegation, etc. The complete integration with Exchange allows the mailbox to be used as call log history, voice mailbox, etc.
In addition, connecting the IP Phone to your...
Dr
Amit Chattopadhyay
(Western Digital Corporation)
31/10/2013, 09:00
The reliability of hard disk drives (HDD) has been quantified historically by a mean time to failure (MTTF), or an annualized failure rate (AFR), defined at a specified operating temperature, and an assumed functional duty cycle. We provide justification for replacing the ambiguous concept of duty cycle with the readily quantifiable “workload”, which is defined as the total amount of data...
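For readers unfamiliar with the two metrics named in this abstract, the sketch below converts between MTTF and AFR. It assumes a constant failure rate (an exponential lifetime model) and a drive powered on for all 8760 hours of the year; both are assumptions made for illustration and are not taken from the talk, which argues that workload should also be considered.

```python
import math

HOURS_PER_YEAR = 8760  # 365 days * 24 h, assuming the drive is always powered on


def afr_from_mttf(mttf_hours: float) -> float:
    """Annualized failure rate implied by an MTTF, assuming a constant failure rate."""
    return 1.0 - math.exp(-HOURS_PER_YEAR / mttf_hours)


def mttf_from_afr(afr: float) -> float:
    """Inverse of afr_from_mttf under the same constant-failure-rate assumption."""
    return -HOURS_PER_YEAR / math.log(1.0 - afr)


# Illustrative input only: an MTTF of one million hours.
example_afr = afr_from_mttf(1_000_000)
```

For large MTTF values the result is close to the simpler ratio 8760 / MTTF, which is the approximation often quoted on data sheets.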
German Cancio Melia
(CERN)
31/10/2013, 09:45
The goal of the HEPiX Bit Preservation Working Group is to share ideas, practices and experience on bit stream preservation activities across sites providing long-term and large-scale archive services. Different aspects should be covered, such as: technology used for long-term archiving, definition of reliability, mitigation of data loss risks, monitoring/verification of the archive contents,...
Lisa Ann Giacchetti
(Fermi National Accelerator Lab. (US))
31/10/2013, 10:15
The CMS T1 facility at Fermilab manages many tens of petabytes of data for CMS. This talk will present some historical information on the solutions used to store this data as well as information on the new solutions we are in the process of implementing and how we got to where we are now.
Dr
Arne Wiebalck
(CERN)
31/10/2013, 11:15
This will be a follow-up of the discussions about OpenAFS and IPv6 we had in Bologna, in particular summarizing input from potential developers on timelines/prices/development models, as well as conclusions from the survey conducted to understand the needs of the HEPiX community regarding the lack of IPv6 in OpenAFS.
Derrick Brashear
(Y)
31/10/2013, 11:30
A status report on OpenAFS with a focus on:
* 2013 Security Vulnerabilities
  - OPENAFS-SA-2013-001: Buffer overflows in OpenAFS fileserver
  - OPENAFS-SA-2013-002: Buffer overflow in OpenAFS ptserver
  - OPENAFS-SA-2013-003: Brute force DES attack permits compromise of AFS cell
  - OPENAFS-SA-2013-004: vos -encrypt doesn't encrypt connection data
*...
Derrick Brashear
(Y),
Jeffrey Altman
(Your File System Inc.)
31/10/2013, 12:00
YFS is a Software Defined Storage solution for secure private, public and hybrid cloud storage deployments. YFS 1.0 clients and servers have a dual protocol stack, providing next-generation file system capabilities while maintaining backward compatibility with IBM AFS 3.6 and OpenAFS clients and servers.
This talk will highlight the enhanced capabilities of YFS 1.0 vs OpenAFS 1.6.5 including...
Dr
Alexander Moibenko
(Fermi National Accelerator Laboratory)
31/10/2013, 14:00
Enstore is a tape-based Mass Storage System originally designed for Run II Tevatron experiments at FNAL (CDF, D0). Over the years it has proven to be a reliable and scalable data archival and delivery solution that meets the diverse requirements of a variety of applications, including US CMS Tier 1, High Performance Computing, Intensity Frontier experiments, as well as data backups. Data intensive...
Dr
Patrick Fuhrmann
(DESY)
31/10/2013, 14:30
This presentation is intended to bring HEP storage administrators up to speed with ongoing dCache developments and activities.
In the context of WLCG, we will report on our collaboration with the xRootd folks in terms of federated storage and monitoring,
our efforts to support the strict separation of CMS between disk and tape storage endpoints,
and we hope to have the first results on...
Lincoln Bryant
(University of Chicago (US))
31/10/2013, 15:55
This talk will cover our deployment of Ceph, a highly-scalable next-generation distributed filesystem, at Midwest Tier 2 to back-end various projects. We'll talk about our experience deploying Ceph, performance benchmarks, and some thoughts about where we would like to go next.
Thomas Oulevey
(CERN)
31/10/2013, 16:20
This talk will show how we used Koji to give the different IT teams flexibility. It will also cover the building of Red Hat packages for Scientific Linux CERN and other Red Hat add-ons. Finally, we will state the limitations of Koji and some workarounds we found.
Pat Riehecky
(Fermilab)
31/10/2013, 16:45
This presentation will provide an update on the current status of Scientific Linux, descriptions for some possible future goals, and allow a chance for users to provide feedback on its direction.
Gerard Bernabeu Altayo
(F)
01/11/2013, 09:50
In 2010, Fermilab initiated the FermiCloud project to deliver a dynamic and scalable Infrastructure-as-a-Service (IaaS) capability using open source cloud computing frameworks to support the needs of the Fermilab scientific communities. A collaboration of personnel from Fermilab and the Korea Institute of Science and Technology Information (KISTI) has focused significant work over the past 18...
Jaroslava Schovancova
(Brookhaven National Laboratory (US))
01/11/2013, 10:30
The PanDA (Production ANd Distributed Analysis) system has been developed by ATLAS to meet the experiment's requirements for a data-driven workload management system for production and distributed analysis processing capable of operating at LHC data processing scale. After 7 years of impressively successful PanDA operation in ATLAS there are also other experiments which can benefit from PanDA in...
Andrew David Lahiff
(STFC - Science & Technology Facilities Council (GB))
01/11/2013, 10:55
Even with the growing interest in cloud computing, grid-based submission to traditional batch systems is still the primary way for the experiments to run jobs at WLCG sites. Integrating a batch system with virtualised worker nodes on a cloud potentially offers sites many benefits. At RAL we have recently investigated making opportunistic use of a private StratusLab cloud when it has unused...
Ian Collier
(UK Tier1 Centre)
01/11/2013, 11:20
In the last three years the CernVM Filesystem (CernVM-FS) has transformed the distribution of experiment software to WLCG grid sites. CernVM-FS removes the need for local installation jobs and performant software at sites; in addition, it often improves performance at the same time. Furthermore, the use of CernVM-FS standardizes the computing environment across the grid and removes the need for...
Mr
Troy Dawson
(Red Hat)
01/11/2013, 11:45
OpenShift has three offerings: Origin, Online, and Enterprise. Now you can enjoy the benefits of PaaS in the public cloud, or on your own cloud.
I will be showing OpenShift Origin, set up locally. What features does it have for both admin and user? How will that help both labs and experiments?