dmwm / DAS

Data Aggregation System

Strange issue with DAS search #4281

Closed abdollah110 closed 6 years ago

abdollah110 commented 6 years ago

Hello, when I run a query like the following, it does not show anything: https://cmsweb.cern.ch/das/request?view=list&limit=50&instance=prod%2Fglobal&input=dataset%3D%2FWJetsToLNu*%2FRunIIFall17MiniAOD-94X_mc2017_realistic_v10-v1%2FMINIAODSIM

It only shows me the dataset name when I type the entire name, like this: https://cmsweb.cern.ch/das/request?view=list&limit=50&instance=prod%2Fglobal&input=+%2FWJetsToLNu_TuneCP5_13TeV-madgraphMLM-pythia8%2FRunIIFall17MiniAOD-94X_mc2017_realistic_v10-v1%2FMINIAODSIM

This is really confusing, and it has taken a lot of my time to find the WJet sample.

Can you please fix this bug?

vkuznet commented 6 years ago

Hi, this is not a bug but rather a feature. By default, DAS shows only datasets with VALID status for pattern queries. But if you specify the complete dataset name, DAS will show the dataset regardless of its status.
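The rule described here can be illustrated with a toy model. This is only a sketch of the behaviour as stated in this comment, not DAS code: the `CATALOG` dictionary, the `find_datasets` helper and the second (VALID) dataset name are hypothetical and exist purely for illustration.

```python
from fnmatch import fnmatchcase

RELEASE = "RunIIFall17MiniAOD-94X_mc2017_realistic_v10-v1/MINIAODSIM"

# Hypothetical in-memory catalog: dataset name -> status.
# The first entry and its PRODUCTION status come from this thread;
# the second entry is invented so the example has one VALID dataset.
CATALOG = {
    f"/WJetsToLNu_TuneCP5_13TeV-madgraphMLM-pythia8/{RELEASE}": "PRODUCTION",
    f"/WJetsToLNu_HypotheticalSample/{RELEASE}": "VALID",
}


def find_datasets(query, status=None):
    """Toy lookup mimicking the default described above (assumption)."""
    if status is None:
        # Pattern queries default to VALID only; a complete dataset
        # name is shown regardless of its status.
        status = "VALID" if "*" in query else "*"
    return sorted(
        name
        for name, state in CATALOG.items()
        if fnmatchcase(name, query) and fnmatchcase(state, status)
    )


pattern = f"/WJetsToLNu*/{RELEASE}"
print(find_datasets(pattern))              # only the VALID dataset
print(find_datasets(pattern, status="*"))  # both datasets
```

In this model the PRODUCTION dataset is hidden from the pattern query until a status is given explicitly, which matches what the reporter observed.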

So in the first case, you specified a pattern and DAS does not show the dataset since it is in PRODUCTION state. Therefore, if you want to find your datasets regardless of their status, you should always use the query

dataset dataset=/WJetsToLNu*/RunIIFall17MiniAOD-94X_mc2017_realistic_v10-v1/MINIAODSIM status=*

or similar, where you specify the dataset pattern. The point is to supply the status=* parameter.
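For anyone who wants to generate such links rather than type them, here is a minimal sketch that builds a DAS web URL for a query string. The base URL and the `view`, `limit`, `instance` and `input` parameters are copied from the links in this thread; the helper name `das_query_url` is hypothetical and is not part of DAS.

```python
from urllib.parse import urlencode

# Base URL as it appears in the links posted in this thread.
DAS_URL = "https://cmsweb.cern.ch/das/request"


def das_query_url(query, instance="prod/global", limit=50, view="list"):
    """Return a DAS web URL for the given query (hypothetical helper)."""
    params = {"view": view, "limit": limit, "instance": instance, "input": query}
    # safe="*" leaves the wildcard unescaped; "/" and "=" are
    # percent-encoded and spaces become "+".
    return DAS_URL + "?" + urlencode(params, safe="*")


pattern = "/WJetsToLNu*/RunIIFall17MiniAOD-94X_mc2017_realistic_v10-v1/MINIAODSIM"

# Pattern query that lists datasets regardless of their status.
print(das_query_url(f"dataset dataset={pattern} status=*"))
```

Calling `das_query_url(f"dataset={pattern}")` reproduces the first link from the original report, so the only difference in the status-aware link is the extra `status=*` term in `input`.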

There is nothing to be fixed, and you should pay attention to what DAS shows. For your query dataset=/WJetsToLNu*/RunIIFall17MiniAOD-94X_mc2017_realistic_v10-v1/MINIAODSIM it prints the following message on the page:

"By default DAS show dataset with VALID status. To query all datasets regardless of their status please use dataset dataset=/WJetsToLNu*/RunIIFall17MiniAOD-94X_mc2017_realistic_v10-v1/MINIAODSIM status=* query or use proper status value, e.g. PRODUCTION"

which is exactly what I described above.

Best, Valentin.

abdollah110 commented 6 years ago

OK Thanks. Do you have any idea why this particular sample is in 'production' status and not 'valid'?

vkuznet commented 6 years ago

This usually happens when a dataset is not yet complete, e.g. it is still growing (new blocks and files are still being produced) or Tier-0 has not yet closed the dataset.

Since it is in PRODUCTION status, you had better ask the data-ops team why it remains in this state. Sometimes it is just an oversight (e.g. somebody forgot to close it).

abdollah110 commented 6 years ago

OK Thanks