issues
search
DUNE
/
dist-comp
Action items for DUNE distributed computing, and common scripts that are used.
2
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Close Rucio datasets when workflow stage is complete
#172
dougbenjamin
opened
21 hours ago
0
DUNE global pool not able to talk to OSG factory gfactory-2.opensciencegrid.org
#171
StevenCTimm
opened
1 week ago
1
Other changes for justIN 01.02
#170
Andrew-McNab-UK
opened
1 week ago
0
Numbered output datasets in justIN workflows
#169
Andrew-McNab-UK
opened
2 weeks ago
3
Improve justIN dashboard list pages
#168
Andrew-McNab-UK
opened
2 weeks ago
0
GPU support in justIN
#167
Andrew-McNab-UK
opened
2 weeks ago
0
Documentation updates
#166
hschellman
opened
3 weeks ago
0
Put in request NERSC ServiceNow to include the dune.osgstorage.org CVMFS
#165
StevenCTimm
opened
1 month ago
0
Accounting: All DUNE global pool schedd's not reporting to GRACC and hence not to APEL
#164
StevenCTimm
opened
1 month ago
0
Monitoring: Need to get classad outputs from the global pool schedd's into elasticsearch/kibana
#163
StevenCTimm
opened
1 month ago
0
Dune global pool, 100K jobs held
#162
StevenCTimm
opened
2 months ago
1
Workflows that can't find a FCL file continue submitting junk jobs indefinitely
#161
StevenCTimm
closed
2 weeks ago
1
Open a ticket with glideinwms developers re. high-memory jobs causing glideinwms to think there are free cores when there aren't.
#160
StevenCTimm
opened
2 months ago
2
DUNE global pool collector not seeing a large number of glideins reporting
#159
StevenCTimm
closed
2 months ago
1
Can't disable a site or a storage in JustIN
#158
StevenCTimm
closed
2 months ago
1
Need better error messages when making workflow requests
#157
calcuttj
closed
2 weeks ago
1
JustIN apparently marking large fractions of jobs stalled when it should not be
#156
StevenCTimm
closed
2 weeks ago
1
Set meeting with Leslie Groer and Canadians to discuss changes in job submission to CA_SFU and CA_VICTORIA
#155
StevenCTimm
closed
2 months ago
4
GPU matching does not seem to work for jobs in the DUNE global pool
#154
Andrew-McNab-UK
closed
1 month ago
5
Apptainer does not have kx509
#153
hschellman
opened
3 months ago
3
Get jobscripts from GitHub too
#152
Andrew-McNab-UK
closed
2 weeks ago
0
dunegpfrontend01 can't see jobs on Justin-prod-sched01.dune.hep.ac.uk or osgsub02.sdcc.bnl.gov
#151
StevenCTimm
closed
3 months ago
2
DUNE access to GPUs at UK_Manchester
#150
Andrew-McNab-UK
closed
3 months ago
2
UKRSDC Product 2.1.4
#149
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.1.3
#148
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.1.2
#147
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.1.1
#146
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.2.7
#145
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.2.6
#144
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.2.5
#143
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.2.4
#142
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.2.3
#141
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.2.2
#140
Andrew-McNab-UK
opened
3 months ago
0
UKRSDC Product 2.2.1
#139
Andrew-McNab-UK
opened
3 months ago
0
Per-RSE output datasets for workflows/stages
#138
Andrew-McNab-UK
closed
2 weeks ago
6
Consolidated datasets for justIN automated log.tgz files
#137
Andrew-McNab-UK
closed
2 weeks ago
4
Request that workflow_id be promoted to be a searchable key in the workflow metacat
#136
hschellman
closed
2 weeks ago
8
Modify the rucio/metacat upload in justIN to populate the DID's into the metacat dataset that is created.
#133
StevenCTimm
closed
2 weeks ago
3
Enable the pinning feature of justIN so that we can read files directly from FNAL_DCACHE
#132
StevenCTimm
opened
4 months ago
1
Implement `--monte-carlo X` for justin-test-jobscript
#134
calcuttj
closed
2 weeks ago
3
All AWT is red as of 09:31 Fermilab time--Bad proxy
#131
StevenCTimm
closed
4 months ago
1
All AWT writes to MANCHESTER and QMUL failing with error code 99
#130
StevenCTimm
closed
4 months ago
4
Consult glideinwms developers on why US_FNAL-FermiGrid is ramping up so slow
#129
StevenCTimm
opened
4 months ago
1
Change the way that rucio dataset is created
#135
StevenCTimm
closed
4 months ago
1
why schedd on Justin-prod-sched01 crashing and going in and out of the pool
#128
StevenCTimm
closed
4 months ago
1
Can we make it such that evicted jobs never try to get rescheduled again in the global pool?
#127
StevenCTimm
closed
4 months ago
1
HTCondor rescheduling of jobs
#126
Andrew-McNab-UK
opened
4 months ago
0
Can we change glideinwms pressure to do entry by entry rather than site by site
#125
StevenCTimm
opened
4 months ago
0
Various storage elements (including EOS) polling voms-admin again
#124
StevenCTimm
closed
2 months ago
4
Why DUNE_IT_INFN_CNAF not getting tested in AWT
#123
StevenCTimm
opened
4 months ago
5
Next