This lesson is being piloted (Beta version)

Dataset scouting: Glossary

Key Points

Introduction
  • Finding the data is non-trivial, but all the information is on the portal

  • A careful understanding of the search options can help with finding what you need

Where are the datasets?
  • The data and Monte Carlo are stored in directories with names that give you some insight as to what they contain
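
    To illustrate how a name can hint at the contents, here is a minimal sketch that splits a dataset name into its parts. It assumes names of the form /primary dataset/processing string/data tier; the helper names, the sample name, and the rule that simulated samples have a data tier ending in "SIM" are assumptions made for illustration and are not taken from this lesson.

```python
# Hypothetical helper: split a dataset name of the assumed form
# /<primary dataset>/<processing string>/<data tier> into its parts.
from typing import NamedTuple


class DatasetName(NamedTuple):
    primary_dataset: str  # trigger group (data) or simulated process (Monte Carlo)
    processing: str       # run period or campaign, plus processing version
    data_tier: str        # file format label, e.g. AOD or AODSIM


def parse_dataset_name(name: str) -> DatasetName:
    """Return the three slash-separated components of a dataset name."""
    parts = name.strip("/").split("/")
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected /primary/processing/tier, got {name!r}")
    return DatasetName(*parts)


def is_simulation(parsed: DatasetName) -> bool:
    """Assumption: simulated samples carry a data tier ending in 'SIM'."""
    return parsed.data_tier.upper().endswith("SIM")


# Illustrative name only; check the portal for the names that actually exist.
example = parse_dataset_name("/DoubleMu/Run2011A-12Oct2013-v1/AOD")
print(example.primary_dataset, example.processing, example.data_tier)
print("simulation" if is_simulation(example) else "collision data")
```

    Splitting the name this way makes it easy to group or filter a long list of datasets by trigger group, run period, or format before opening any files.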

What data and Monte Carlo are available?
  • The triggers are all given their own collision datasets

  • The Monte Carlo samples all have their own datasets

  • Navigating the Open Data Portal is the right way to find out what is available

What is in the datafiles?
  • It is sometimes useful to inspect the files before diving into the full analysis

  • Some files may not have the information you’re looking for

Break
  • Taking a break is good!

Hands-on activity
  • The information is all there for you to find the datasets you want for your analysis

  • But it may take some poking around to find it

  • Familiarizing yourself with the search options is time well spent

Offline challenge
  • It can take some time and effort to figure out which collision data and which simulation data are appropriate for your analysis

Glossary
