Oct 10 – 14, 2016
San Francisco Marriott Marquis
America/Los_Angeles timezone

Geographically distributed Batch System as a Service: the INDIGO-DataCloud approach exploiting HTCondor

Oct 12, 2016, 12:00 PM
GG C2 (San Francisco Mariott Marquis)


San Francisco Mariott Marquis

Oral Track 3: Distributed Computing Track 3: Distributed Computing


One of the challenges a scientific computing center has to face is to keep delivering a computational framework well consolidated within the community (i.e. the batch farm), while complying to modern computing paradigms. The aim is to ease system administration at all levels (from hardware to applications) and to provide a smooth end-user experience.
HTCondor is a LRMS widely used in the scientific community and it’s Cloud aware (i.e. it does not resent from a volatile environment where resources might dynamically change). Apache Mesos is a tool that allows to abstract computing resources away from the physical or virtual hosts and to deal with the entire computing infrastructure as a single pool of resources to be shared among services.
Within the INDIGO-DataCloud project, we adopted two different approaches to implement a PaaS-level, on-demand Batch Farm Service based on HTCondor and Mesos.
In the first approach, the various HTCondor daemons are packaged inside pre-configured Docker images and deployed as Long Running Service (LRS) through Marathon, profiting from its health checks and failover capabilities.
In the second approach, we have implemented an HTCondor framework for Mesos, that can be used by itself or as a component in the more complex INDIGO PaaS system in conjunction with an orchestration layer like i.e. Marathon. The new framework consists of a scheduler to implement HTCondor policies on the resource offers provided by the Mesos master and a dedicated executor to launch tasks on the slave nodes. The benefits of an ad-hoc framework are first of all a fine-grained level of control on the tasks the application is responsible for. Moreover, it is possible to implement the preferred authorization rules and roles for multi-tenancy and to define application-specific scaling rules. Application isolation and packetization are achieved with the Docker Containerizer module of Mesos.
The most difficult aspects of both approaches concern networking and storage. For instance, the usage of the shared port mode within HTCondor has been evaluated in order to avoid dynamically assigned ports; container-to-container communication and isolation have been addressed exploring solutions based on overlay networks (including e.g. the Calico Project implementation).
Finally, we have studied the possibility to deploy an HTCondor cluster that spans over different sites, also exploiting the Condor Connection Brokering (CCB) component that allows communication across a private network boundary or firewall, as in case of multi-site deployments.
Concerning the storage aspects, where factors such as scalability, performance and reliability have to be taken into account, we have explored the usage of CVMFS (using Parrot) and the integration with the INDIGO Data Services (Onedata, Dynafed, FTS).
In this contribution, we are going to describe and motivate our implementative choices and to show the results of the first tests performed.

Secondary Keyword (Optional) Computing middleware
Primary Keyword (Mandatory) Distributed workload management
Tertiary Keyword (Optional) Network systems and solutions

Primary authors

Alessandro Costantini (University of Perugia) Alessandro Italiano Italiano (INFN-CNAF) DIEGO MICHELOTTO (INFN - National Institute for Nuclear Physics) Davide Salomoni (Universita e INFN, Bologna (IT)) Doina Cristina Aiftimiei (INFN - CNAF, IFIN-HH) Giacinto Donvito (Universita e INFN, Bari (IT)) Luciano Gaido (Universita e INFN Torino (IT)) Marica Antonacci Matteo Panella (CNAF) Miguel Caballer (UPV) Sara Vallero (Universita e INFN Torino (IT)) Stefano Bagnasco (Universita e INFN Torino (IT)) Tommaso Boccali (Universita di Pisa & INFN (IT))

Presentation materials