15–19 Oct 2012
Institute of High Energy Physics
Asia/Shanghai timezone

Session

Computing

16 Oct 2012, 11:00
C305 (Institute of High Energy Physics)

C305

Institute of High Energy Physics

19B YuquanLu Shijingshan Beijing China

Conveners

Computing

  • Michele Michelotto (Universita e INFN (IT))
  • Gilles Mathieu (CNRS)

Computing

  • Gilles Mathieu (CNRS)
  • Michele Michelotto (Universita e INFN (IT))

Computing

  • Michele Michelotto (Universita e INFN (IT))
  • Gilles Mathieu (CNRS)

Presentation materials

There are no materials yet.

  1. Steven Timm (Fermilab)
    16/10/2012, 11:00
    Computing & Batch Services
    Presentation
    The Condor Batch System has been used at Fermilab for a decade in the Run II Reprocessing and Analysis, the USCMS Tier 1 facility, and the FermiGrid General Purpose Grid Cluster. In this talk I present an overview of the operational stability, the scalabilty, and the best practices we have learned to build a 27,000 job slot campus grid using the Condor system.
    Go to contribution page
  2. Jerome Belleman (CERN)
    16/10/2012, 11:30
    Computing & Batch Services
    Presentation
    The CERN batch service runs a 60k CPU core cluster using Platform LSF. We present some of the challenges of running a service at this scale, and describe the current planning of how we aim to evolve the current system to a more dynamic, larger scale service. As part of this, we recently undertook a project of developing new monitoring tools and upgrading the batch accounting system; we...
    Go to contribution page
  3. Manfred Alef (Karlsruhe Institute of Technology (KIT))
    16/10/2012, 12:00
    Computing & Batch Services
    Presentation
    This talk describes the scheduled migration to another LRMS at GridKa: - Problems and limitations of the LRMS which is currently used at GridKa - Selection and tests of a new one - Configuration details, e.g. fair-share configurations, and experiences with a first sub-cluster which is already managed by the new LRMS
    Go to contribution page
  4. Aresh Vedaee (CC-IN2P3 - Centre de Calcul (FR))
    17/10/2012, 14:00
    Computing & Batch Services
    Presentation
    Report from the BOF session the day before
    Go to contribution page
  5. philippe olivero (CC-IN2P3)
    17/10/2012, 14:30
    Computing & Batch Services
    Presentation
    CC-IN2P3 has been running OGE for more than one year now. After describing the current context, I will report the difficulties encountered, solved or not, and the new enhancements we would like to get.
    Go to contribution page
  6. Andreas Haupt (Deutsches Elektronen-Synchrotron (DE))
    17/10/2012, 15:00
    Computing & Batch Services
    Presentation
    All the currently available Gridengine implementations don't provide any authenticated access with the default setup. This opens a big and easily exploitable security hole which might be considered severe especially in multi-community clusters. This talk will describe in detail the attack vector available in such setups. It will furthermore give a step-by-step guide to activate the...
    Go to contribution page
  7. Erik Mattias Wadenstein (Unknown)
    17/10/2012, 16:00
    Computing & Batch Services
    Presentation
    Many compute clusters in the nordics run Slurm, this includes the grid connected ones. This talk looks at the experience, which parts works well, what could use improvements, and some comparisons to other batch systems.
    Go to contribution page
  8. Dr Giacinto Donvito (INFN-Bari)
    17/10/2012, 16:30
    Computing & Batch Services
    Presentation
    We will show all the work done in order to install and configure the batch system itself together with the security configuration needed. In this presentation we will show the results of the deep testing that we have done on SLURM, in order to be sure that it will cover all the needed functionalities like: priorities, fairshare, limits, QoS, failover capabilities and others. We will report...
    Go to contribution page
Building timetable...