Add compute resource usage to processing.json

AllenNeuralDynamics / aind-data-schema

A library that defines AIND data schema and validates JSON.

MIT License

23 stars 17 forks source link

Add compute resource usage to processing.json #536

Closed camilolaiton closed 1 month ago

camilolaiton commented 1 year ago

It is important to track compute resources whenever we are running our scripts in order to optimize our algorithms. Therefore, I propose that we include that in the processing.json. @dyf @saskiad

These are some of the important info I am adding to the new versions of the SmartSPIM capsules:

Information about operative system, compute resources and its peak usage could be very important but could not be limited to only these. The usage of compute resources over time/step could also be very important to track.

saskiad commented 2 months ago

I think we should move on this - the idea has come up in a couple of conversations and I think implementing this would be helpful.

saskiad commented 2 months ago

need specs

dbirman commented 2 months ago

@camilolaiton is going to provide some more details on what we might want to include here, copying some possible things from above to discuss:

OS
CPU info
GPU info
cpu/gpu/ram usage snapshots over time
instance information (cpu/gpu/ram/memory)
instance costs? not sure how visible that is

camilolaiton commented 2 months ago

Personally, I'd like a schema that includes:

Operative System (e.g., Linux, Windows, Mint, MacOS, ...) and its version.
Machine architecture (e.g., x86_64, we should definitely include bit version).
Physical resources (e.g., number of CPU cores, RAM, shared memory compartment, number of GPUs, Disks).
Usage of CPU as a percentage over time (I don't think it should be that specific as % per core), same for GPUs and RAM. Maybe include SHM as well if you guys agree.

I think those are the most important ones for reproducibility and track our algorithms and used resources. I like @dbirman idea of instance costs but that'd work only if we register the type of machine that was used. I see it more as another script that based on this info could compute the money we spent.

saskiad commented 2 months ago

I think we should hold off on cost in the schema itself. We're making tools to track that - we definitely want eyes on it - but I have reservations about putting it directly into the schema. any thoughts @dyf ?

dbirman commented 2 months ago

I'll make a draft with the above, minus cost, and then we can iterate from there.

dbirman commented 2 months ago

~~(blocked waiting on https://github.com/AllenNeuralDynamics/aind-data-schema-models/pull/75)~~

dbirman commented 2 months ago

[x] Operating system enum
[x] Architecture enum
[x] ResourceTimestamp refactor (TimestampedValue in models?)

dbirman commented 2 months ago

~~Waiting on https://github.com/AllenNeuralDynamics/aind-data-schema-models/issues/77~~