Conveners
Basic IT services
- Tony Wong (Brookhaven National Laboratory)
- Helge Meinhard (CERN)
Basic IT services: Basic IT services
- James Botts (LBNL)
- Helge Meinhard (CERN)
- Tony Wong (Brookhaven National Laboratory)
Basic IT services: BoF on monitoring tools
- Helge Meinhard (CERN)
- James Botts (LBNL)
- Tony Wong (Brookhaven National Laboratory)
Wataru Takase
(KEK)
4/21/16, 2:00 PM
Basic IT Services
Although ElasticSearch and Kibana bring great monitoring platform, they lack access control feature by default. This means any user who can access to Kibana can retrieve any information from ElasticSearch. In CERN cloud service, a homemade ElasticSearch plugin has been deployed to restricts data access based on cloud user. It enables each user to have a separated dashboard for cloud usage....
Daniel Fernandez Rodriguez
(Universidad de Oviedo (ES))
4/21/16, 2:25 PM
Basic IT Services
During the past two years, CERN Cloud Infrastructure has been using an open source tool called Rundeck for automating routine operational procedures. The aim of this project was to provide the team with a common place for implemented workflows and jobs. Thanks to Rundeck we were able to delegate internal tasks to other teams without exposing internal procedures or credentials. In addition to...
Christopher Huhn
(GSI)
4/21/16, 2:50 PM
Basic IT Services
At Hepix Fall 2011 at Vencouver I gave a presentation about GSI's starting migration from CFengine to Chef configuration management.
This migration was a bumpier ride than initially expected (as usual?).
So now, 5 years later, I'd like to
- take a look back at our intentions for the migration,
- the difficulties we encountered,
- the current situation and issues still to be solved,
-...
Go Iwai
(KEK)
4/21/16, 3:15 PM
Basic IT Services
High Energy Accelerator Research Organization (KEK) plays a key role in particle physics experiments, as well as supporting the communities in Japanese universities. In order to ensure those important missions, KEK has two large-scale computer systems: the Supercomputer System (KEKSC) and the Central Computer System (KEKCC).
The KEKSC is mainly used by collaborative researches in theoretical...
Mohammed Daoudi
(CERN)
4/21/16, 4:10 PM
Basic IT Services
In the LHCb Online system we keep systems significantly beyond the warranty period, in some cases up to 7 or more years. We also have upgraded systems in large numbers with third party components (disks for instance). In this contribution give an overview of the various problems we encountered and how we overcome them. We discuss hardware problems, inhouse repairs and related load on the admin team.
Hristo Umaru Mohamed
(University of Cincinnati (US))
4/21/16, 4:35 PM
Basic IT Services
The LHCb experiment operates a large computing infrastructure with
more than 2000 servers, 300 virtual machines and 400 embedded systems.Many of the systems are operated diskless from NFS or iSCSI root-volumes. They are connected by more than 200 switches and routers. A large fraction of these systems are mission critical for the experiment and as such need to be constantly monitored.
The main...
Fabien Wernli
(CCIN2P3)
4/21/16, 5:00 PM
Basic IT Services
Many of today's opensource monitoring tools have grown to distributed, horizontally scaling solutions.
When designing a new infrastructure, choosing and configuring the right software stack to analyze and record logs and metrics can
admittedly still be a challenge, but we are no longer restricted to the vertically scaling rrdtool-type timeseries storage.
The real challenge is the amount of...