cms-dpoa / CMS_Trivia

0 stars 0 forks source link

Introduction

Datasets in the CERN open data portal are categorized (or labelled) according to the physics subfield they relate to. For example, the dataset /GluGluHToGG_M123_13TeV_amcatnloFXFX_pythia8/RunIIFall15MiniAODv2-PU25nsData2015v1_76X_mcRun2_asymptotic_v12-v1/MINIAODSIM belongs to the category: Higgs Physics/Standard Model. Assigning labels to each dataset is a tedious task because researchers sometimes don't follow the naming convention and the categories are not present in the dataset metadata. Until now, this categorization has been done "by-hand", by interviewing experts on the content of each dataset. The problem is that there are more than 600 datasets (and more to come) that have not yet been categorized. These datasets are thus not accessible for researchers who do not know the exact name of the dataset. The purpose of this game is to enable CERN researchers to easily assign a label to each unlabelled dataset in the open data collection, making it a less tedious task.

The game is vaguely inspired by the TV show: Who Wants to be a Millionaire.

Level 1

In this level, the players' reliability is assessed. This will help us choose a category for an unlabelled dataset in case various options are proposed by different players. Questions in this level consist on the classification of an already labelled dataset. For example the dataset /BlackHole_BH1_MD-2000_MBH-10000_n-2_TuneCUETP8M1_13TeV-blackmax/... corresponds to the category Exotica/Extra Dimensions. Two lifelines will be available:

Level 2

This level is the central part of the game. In this level, questions consist on the classification of unlabelled datasets. In this case, there will be no options and no lifelines (we don't know the true label). There's also the possibility to create a new category. How much we trust your answers will be based on your score in Level 1. By answering these questions you will help built a more complete CMS open data portal.

Thanks for playing!

How to build the project

You can easily build and run the project with docker. To do this run the following command in the main project directory:

docker-compose up --build

The docker image of the build is call frontend and the API is backend.