cms-sw / cmssw

CMS Offline Software
http://cms-sw.github.io/
Apache License 2.0
1.07k stars 4.28k forks source link

Study additional decrease in SIM AOD size #39449

Open Dr15Jones opened 1 year ago

Dr15Jones commented 1 year ago

Studying a Run 3 pileup based workflow (11834.21) with 1000 events shows that the following branches take the bulk of the space on disk

Branch Relative %
recoTracks_generalTracks__RECO 32.6%
recoPFCandidates_particleFlow__RECO 16.1%
recoGenParticles_genParticles__HLT 4.6%

Applications of different branch structure, object thinning, and lossy compression strategies (described below) to those branches could allow us to decrease the SIM AOD size by > 15% depending on how much loss of information is acceptable.

makortel commented 5 months ago

-core

Do we consider this case closed?

I guess at this point it seems clear the lossy compression won't be going forward.