Second K8s-HEP Meetup

Name: Second K8s-HEP Meetup
Start: 2020-12-01T09:00:00-06:00
End: 2020-12-02T13:45:00-06:00
Location: Zoom

1–2 Dec 2020

Zoom

America/Chicago timezone

Contact

lincolnb@uchicago.edu

Contribution List

1. Welcome! What's this?

Lincoln Bryant (University of Chicago (US)), Robert William Gardner Jr (University of Chicago (US))

01/12/2020, 09:00

block 1 - presentations

9. Fermilab Experience with OKD (OpenShift)

Anthony Tiradani (Fermilab)

01/12/2020, 09:10

block 1 - presentations

Fermilab has made the strategic decision to deploy OKD, the open source version of Red Hat OpenShift, for Kubernetes container management. We will discuss our experience so far with OKD and describe some of the challenges we faced deploying a variety of applications.

6. Debugging Kubernetes pod throughput with Calico CNI

Mr Thomas George Hartland (CERN)

01/12/2020, 09:30

block 1 - presentations

Exploring how the kubelet, with Calico as the CNI plugin,
depends on the performance of the Kubernetes API server
to be able to start pods quickly.

2. K8s autoscaling based on custom metrics. Two examples of application: CMSWEB and HTCondor in the CMS Analysis Facility@INFN

Tommaso Tedeschi (Universita e INFN, Perugia (IT))

01/12/2020, 09:50

block 1 - presentations

At the moment, Kubernetes only supports horizontal pod autoscaling based on predefined pod metrics (CPU and memory usage). Therefore, in order to achieve an actually green elastic cloud model (optimizing resource usage) a key point is to integrate this tool with autoscaling solutions based on custom metrics, and this requires the usage of third-party elements.
In this work we show the...

3. Lightweight integration of Kubernetes clusters for ATLAS batch processing

Fernando Harald Barreiro Megino (University of Texas at Arlington)

01/12/2020, 10:10

block 1 - presentations

The PanDA team has evaluated the possibility of native Kubernetes job submission in order to process ATLAS workloads and offer the possibility of immediate integration of major cloud computing providers. This model also offers a novel way to set up lightweight compute sites, without the need of setting up a Grid stack.

During the last year we have been running several queues at clusters...

10. Lazy Image Pulling with Stargz

Spyridon Trigazis (CERN)

01/12/2020, 10:30

block 1 - presentations

Container images allowed having reproducible environments and container orchestration lets users parallelize and create elaborate workflows with tools like Argo or just Kubernetes jobs. It is easy to create very large images and when parallelizing jobs, the time and cost of pulling container images can increase significantly. Golang developers proposed the seekable tar gunzip format...

14. Reproducible and Scalable workflows for SkyhookDM experimentation on Kubernetes

Jayjeet Chakraborty (University of California, Santa Cruz)

01/12/2020, 11:20

Overflow & Open Discussion

Preparing a Systems experiment environment requires setting up infrastructure, baselining the infrastructure, installing dependencies and tools, running experiments, and manually plotting results, which if done manually, is cumbersome and error-prone. This same scenario applies to researchers starting to experiment with Ceph or SkyhookDM, which is an extension for Ceph to run queries on...

16. Discussion

01/12/2020, 11:40

Overflow & Open Discussion

5. Multi Cluster / Cloud Kubernetes for GPU Evaluation

Ricardo Brito Da Rocha (CERN)

02/12/2020, 09:00

block 3 - presentations

GPUs are scarce resources in many of our centers, including CERN.

This talk will quickly describe a multi cloud deployment with the goal of evaluating the performance of different workloads in all GPUs offered by GCP, Azure and AWS.

It will include some details about setting up clusters and GPUs in each of these clouds, and some preliminary results.

4. Running a multi-tenant Kubernetes with GitOps

Brian Paul Bockelman (University of Wisconsin Madison (US))

02/12/2020, 09:20

block 3 - presentations

Starting in October 2020, the PATh project is making a concerted effort to transition the centrally-run OSG services (such as websites, software repositories, information services) from ad-hoc deployment models to Kubernetes.

To do so, we needed a Kubernetes "home" and an operational model! In this talk, we'll overview the work going on in the Tiger cluster at Morgridge, our current...

8. Overview of CMSWEB Cluster in Kubernetes

Muhammad Imran (National Centre for Physics (PK))

02/12/2020, 09:40

block 3 - presentations

The CMS experiment heavily relies on the CMSWEB cluster to host critical services for its operational needs. The cluster is deployed on virtual machines (VMs) from the CERN OpenStack cloud and is manually maintained by operators and developers. The release cycle is composed of several steps, from building RPMs, their deployment, validation, and integration tests. To enhance the sustainability...

7. Experience with K8s at Coffea-Casa AF@UNL

Carl Lundstedt (University of Nebraska Lincoln (US))

02/12/2020, 10:00

block 3 - presentations

In this contribution we would like to share our experience designing an Analysis Facility for the columnar analysis utilizing the analysis package COFFEA at University of Nebraska-Lincoln and to describe our adventure on deploying different workloads and services at UNL Kubernetes cluster (Jupyterhub with Traefik integration, HTCondor, ServiceX and other infrastructure deployments).

12. Test REANA Deployment at BNL

Christopher Henry Hollowell (Brookhaven National Laboratory (US))

02/12/2020, 10:20

block 3 - presentations

In this presentation we'll discuss our experiences deploying a test REANA instance on a k8s cluster at BNL.

11. What's new with SLATE?

lincoln bryant

02/12/2020, 10:40

block 3 - presentations

Will review progress over the past year with SLATE - including new containerized apps, storage provisioner, security policies for federated operations

13. Kubernetes at UVic

Ryan Taylor (University of Victoria (CA))

02/12/2020, 11:00

block 3 - presentations

I will describe Kubernetes cluster deployment at UVic, including batch computing and APEL accounting for ATLAS.

15. Packaging and using services in Kubernetes

Brian Hua Lin (University of Wisconsin - Madison)

02/12/2020, 11:20

block 3 - presentations

OSG lessons learned distributing service container images and experiences contributing to and deploying services with SLATE

17. Discussion

02/12/2020, 12:05

Overflow & Open Discussion

Choose timezone

Second K8s-HEP Meetup

Contact