10–14 Oct 2016
San Francisco Marriott Marquis
America/Los_Angeles timezone

Plancton: an opportunistic distributed computing project based on Docker containers

13 Oct 2016, 15:30
1h 15m
San Francisco Marriott Marquis

San Francisco Marriott Marquis

Poster Track 7: Middleware, Monitoring and Accounting Posters B / Break

Speaker

Matteo Concas (Universita e INFN Torino (IT))

Description

The computing power of most modern commodity computers is far from being fully exploited by standard usage patterns.

The work we present describes the development and setup of a virtual computing cluster based on Docker containers used as worker nodes. The facility is based on Plancton[1]: a lightweight fire-and-forget background service that spawns and controls a local pool of Docker containers on a host with free resources by constantly monitoring its CPU utilisation. Plancton is designed to release the resources allocated opportunistically whenever another demanding task is run by the host user, according to configurable thresholds: this is attained by killing a number of running containers.

The resources comprising the facility are a collection of heterogeneous non-dedicated Linux hosts ideally inside the same local network, with no guaranteed network bandwidth, made available by members of a collaboration or institute. The user has agreed to donate its spare CPU cycles and remains the administrator of the involved host. Since the system is based on Docker containers performance isolation and security are guaranteed through sandboxing.

Using a thin virtualization layer such as Docker has the effect of having containers that are started almost instantly upon request. We will show how fast startup and disposal of containers finally enables us to implement the formation of the opportunistic cluster in a headless fashion, where our containers are mere pilots.

As an example we are running pilot HTCondor containers automatically joining a given cluster and terminating right after executing a job or in a short while if no new job is available. Software is provided through CVMFS on Parrot, making the execution environment suitable for HEP jobs.

Finally we will show how the uncomplicated approach of Plancton to containers deployment makes it suitable for setting up dedicated computing facilities too, provided that the underlying use case is sufficiently simple.

[1] https://github.com/mconcas/plancton

Primary Keyword (Mandatory) Computing middleware
Secondary Keyword (Optional) Virtualization
Tertiary Keyword (Optional) Distributed workload management

Primary author

Matteo Concas (Universita e INFN Torino (IT))

Co-authors

Dario Berzano (CERN) Massimo Masera (Universita e INFN Torino (IT)) Maximiliano Puccio (Universita e INFN Torino (IT)) Sara Vallero (Universita e INFN Torino (IT)) Stefano Bagnasco (I.N.F.N. TORINO) Stefano Lusso (INFN-TO)

Presentation materials