CERN, the European Organization for Nuclear Research, aims to publish all data recorded by the Large Hadron Collider to support the scientific community.
The Compact Muon Solenoid (CMS) detector of particles and the CERN Open Data team have been collaborating to publish all proton-proton collision data recorded by CMS at the Large Hadron Collider (LHC) since 2011. LHC data are unique and are of interest to the scientific and academic community. CMS has therefore committed to releasing its research data openly, with all data being made available 10 years after recording them; the embargo gives the scientists working on CMS adequate time to analyse the data themselves.
The first batch of data was published in 2016 and the team recently published a new batch of data. All research-quality data recorded by CMS during the first two years of LHC operation are now publicly available. The total data volume of this latest release is 96 terabytes.
Additional tools, workflows, and examples for analysing the data using cloud technologies are also provided. The data and related materials are available on the CERN Open Data portal, an open repository built using CERN’s home-grown and open-source software, Invenio.