tapis-project / tapis-java

Texas Advanced Computing Center APIs
BSD 3-Clause "New" or "Revised" License
5 stars 6 forks source link

ICICLE access to TACC job histories #100

Open richcar58 opened 1 year ago

richcar58 commented 1 year ago

Use the data load script in the smart-scheduler repo to load job histories from 5 TACC HPC machines. Anonymize the sensitive data, export the anonymized data to csv files and compress the csv files.

Create a Tapis system, taccjobs, in the production icicle tenant and owned by the synthetic userid icicle. Use Tapis APIs to load the compressed csv files into the root directory of taccjobs. Share that directory publicly with all users in the icicle tenant.

As another icicle user, test file listing, single file download and directory download. Publicize how colleagues in ICICLE can access this data.