CMS Open Data Workshop for Theorists

Fermilab LHC Physics Center (LPC)

Online

Sep 30 - Oct 2, 2020

8:30 am - 5:00 pm (US Central Time Zone)

Instructors: Matt Bellis, Edgar Carrera, Thomas Gaehtgens, Allan Da Silva Jales, Julie Hogan, Clemens Lange, Kati Lassila-Perini, Santeri Laurila, Adelina Lintuluoto, Tom McCauley, Sezen Sekmen, Jesse Thaler

Helpers: Asdrubal Cruz, Nada Mohamed, Nikolas Pervan, Farrah Simpson, Stefan Wunsch

General Information

In 2014, CMS released a significant amount of data through the CERN Open Data Portal, available for anyone to analyze as they see fit. In 2017, a theory group at MIT published two peer-reviewed papers using these data, prompting renewed discussion of how to make the open datasets easier for non-CMS analysts to work with. The goal of this workshop is to lower the threshold for access to these data for theorists and phenomenologists. Attendees will be led through the steps of logging in to the CERN virtual machines, running the CMS analysis software, and performing the basics of an analysis. All exercises will be hands-on, and participants should be prepared to dive into the data right away. Time will also be spent brainstorming with attendees about how the entire process of accessing and analyzing the data could be made more useful for the broader HEP community.

Please visit the official Indico site for the workshop.

Who is this workshop for? This workshop is primarily aimed at theoretical physicists with prior knowledge of collider physics and a strong interest in learning how to carry out an experimental analysis using CMS Open Data.

Where is this happening? This workshop will be held remotely over Zoom (connection details will be emailed to participants).

When is it happening? Sep 30 - Oct 2, 2020, US Central Time Zone (UTC-5).

Requirements: Participants must use a personal computer running macOS, Linux, or Windows on which they have administrative privileges. They should have a working virtual machine or Docker container environment set up as described in the pre-exercises.

Accessibility: We are dedicated to providing a positive and accessible learning environment for all. Please notify the instructors in advance of the workshop if you require any accommodations or if there is anything we can do to make this workshop more accessible to you.

Contact: Please email mbellis@siena.edu, ecarrera@cern.ch, or Katri.Lassila-Perini@cern.ch for more information.


Code of Conduct

Please adhere to Fermilab's Code of Conduct.



Schedule

All times are in the US Central Time Zone (UTC-5).
Help is available through our Mattermost channel and the CMS Open Data Forum.
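
Because every session time is given in US Central Time (UTC-5), remote participants in other regions may want to convert the schedule to their own time zone. The following is a minimal sketch, assuming Python 3.9 or later with the standard `zoneinfo` module and IANA time zone data available on the system. The helper name `to_local` and the target zone `Europe/Zurich` are illustrative choices, not part of the workshop material.

```python
from datetime import datetime
from zoneinfo import ZoneInfo

# US Central Time; on the workshop dates this zone is at UTC-5 (daylight time).
WORKSHOP_TZ = ZoneInfo("America/Chicago")


def to_local(date_str, time_str, target_tz):
    """Convert a schedule time given in US Central Time to target_tz.

    date_str is 'YYYY-MM-DD', time_str is 'HH:MM', and target_tz is an
    IANA zone name such as 'Europe/Zurich' (hypothetical example zone).
    """
    naive = datetime.strptime(f"{date_str} {time_str}", "%Y-%m-%d %H:%M")
    central = naive.replace(tzinfo=WORKSHOP_TZ)  # attach the workshop zone
    return central.astimezone(ZoneInfo(target_tz))


# Start of the first session on Wednesday, as seen from Zurich.
start = to_local("2020-09-30", "08:30", "Europe/Zurich")
print(start.strftime("%Y-%m-%d %H:%M %Z"))  # 2020-09-30 15:30 CEST
```

Using a named zone rather than a fixed UTC-5 offset lets the library handle daylight saving rules, which matters if the same helper is reused for dates outside the workshop.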

Pre-exercises

(Mandatory exercises must be completed before the start of the workshop)
Mandatory | 5 min | Orientation
Mandatory | 2 h | Virtual Machine or Docker container setup for CMS Open Data
Mandatory | 2 h | CMSSW fundamentals
Mandatory | 2.4 h | ROOT basics
Optional (external lesson) | The Unix Shell
Optional (external lesson) | Version Control with Git
Optional (external lesson) | Programming with Python

Wednesday

08:30-09:00 | Welcome and orientation | Organizers
09:00-09:45 | Live Presentation: Workshop Introduction | Kati Lassila-Perini
09:45-10:00 | Break
10:00-10:45 | Live Hands-on lesson: Dataset scouting | Matt Bellis
10:45-12:00 | Live Hands-on lesson: Trigger manipulation | Edgar Carrera
12:00-12:15 | Break
12:15-13:00 | Async Demo: Physics objects | (Julie Hogan)
13:00-14:30 | Lunch
14:30-16:00 | Async Hands-on lesson: Physics objects I | (Julie Hogan)
16:00-17:00 | Live Fermilab Colloquium: The Future is Open | Jesse Thaler

Thursday

08:30-10:00 | Live Hands-on lesson: Physics objects II | Julie Hogan
10:00-10:15 | Break
10:15-11:15 | Live Hands-on lesson: Pre-selection and skimming | Julie Hogan
11:15-11:45 | Live Hands-on lesson: Object ID and selection | Julie Hogan
11:45-12:00 | Break
12:00-13:00 | Live Hands-on lesson: Plotting and interpretation | Matt Bellis
13:00-14:30 | Lunch
14:30-15:15 | Async Demo: Luminosity and data quality | (Tom McCauley)
15:15-17:00 | Async Hands-on lesson: The tag and probe method | (Allan Da Silva Jales, Thomas Gaehtgens)

Friday

08:30-10:30 | Live Demo: CMS analysis in the cloud | Clemens Lange, Adelina Lintuluoto
10:30-10:45 | Break
10:45-11:45 | Live Hands-on lesson: CMS analysis in the cloud | Clemens Lange, Adelina Lintuluoto
11:45-12:00 | Break
12:00-13:00 | Live Hands-on lesson: Background estimation - the ABCD method | Santeri Laurila
13:00-14:30 | Lunch
14:30-15:00 | Live Featured Demo: ADL | Sezen Sekmen
15:00-17:00 | Wrap up and time for discussion | Everybody

Syllabus

Computing tools

  • CERN Open Data Portal
  • CMS Open Data
  • CMS Twiki
  • ROOT
  • Docker
  • Virtual Machines

CMSSW

  • Overview
  • Data Model
  • Analyzers
  • Configuration
  • Conditions Data

Analysis

  • Data and Simulation
  • Selection
  • Luminosity
  • Backgrounds
  • Systematics
  • Interpretation

Analysis examples

  • Higgs to tau tau