Cloud Pre-Exercise: Glossary

Key Points

Introduction
  • Kubernetes is an orchestrator of containers. It is most useful when it is run in a cluster of computers.

  • Commercial K8s clusters are a good option for large computing needs.

  • We can run our containerized CMSSW jobs and subsequent analysis workflows in a K8s cluster.

Getting started with Kubectl
  • kubectl is the ruler of GKE

Getting started with Argo
  • Argo is a very useful tool for running workflows and parallel jobs

Storing a workflow output
  • With Kubernetes one can run workflows similar to a batch system

  • Open Data workflows can be run in a commercial cloud environment using modern tools

Create an Argo Workflow
  • With a simple but a tight yaml structure, a full-blown analysis can be performed using the Argo tool on a K8s cluster.

Cleaning up
  • With a couple commands it is easy to get back to square one.

Glossary

FIXME