Dataset Scouting: Glossary

Key Points

Introduction
  • Finding the data is non-trivial, but all the information is on the portal

  • A careful understanding of the search options can help with finding what you need

Where are the datasets?
  • Use the filter selections in the left-hand sidebar of the CERN Open Data Portal to find datasets.

What data and Monte Carlo are available?
  • The collision data are directed to different datasets based on trigger decisions

  • The Monte Carlo datasets contain a specific simulated physics process

How to access metadata on the command line?
  • cernopendata-client is a command-line tool to download dataset files and metadata from the CERN Open Data portal.

What is in the datafiles?
  • It’s useful to sometimes inspect the files before diving into the full analysis

  • Some files may not have the information you’re looking for

Hands-on activity
  • The information is all there for you to find datasets you want for your analysis.

  • But it make take some poking around to find it.

  • Familiarizing yourself with the search options is time well-spent.

Glossary

FIXME