Open CBroz1 opened 2 years ago
I posted to the DLC gitter to ask for input
Proposed refactoring of pose training structure:
Pose Training input:
  Identities: optional
    1+Identity: string
      name: string
      description: string
  Skeletons:
    1+Skeleton:
      id: string
      1+Node Labels: string
      1+Edges: optional
  Training Frames:
    1+Training Frame:
      Annotator: optional string
      Source Video: optional
        Path: relative filepath
        Frame Index: unsigned int
      Source Frame: optional filepath
      Instances:
        1+Links to identity: name
        1+Links to Skeleton: id
        1+Node Positions:
          x: float
          y: float
          z: optional float
          visible: bool
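For anyone who finds the outline easier to read as code, here is a rough sketch of the same hierarchy as plain Python dataclasses. This is not the ndx-pose or SLEAP API: every class name, field name, and the `validate` helper are hypothetical and only mirror the outline above. It also makes a few assumptions the outline does not state: edges are pairs of node indices, an instance's link to an identity may be omitted when no identities are defined, and each instance has one position per skeleton node.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass
class Identity:
    name: str
    description: str


@dataclass
class Skeleton:
    id: str
    node_labels: list[str]
    # Assumption: an edge is a pair of indices into node_labels.
    edges: Optional[list[tuple[int, int]]] = None


@dataclass
class NodePosition:
    x: float
    y: float
    visible: bool
    z: Optional[float] = None


@dataclass
class SourceVideo:
    path: str  # relative filepath
    frame_index: int  # unsigned in the proposal; checked in validate()


@dataclass
class Instance:
    skeleton_id: str  # link to Skeleton.id
    node_positions: list[NodePosition]
    # Assumption: optional, because Identities are optional at the top level.
    identity_name: Optional[str] = None  # link to Identity.name


@dataclass
class TrainingFrame:
    instances: list[Instance]
    annotator: Optional[str] = None
    source_video: Optional[SourceVideo] = None
    source_frame: Optional[str] = None  # filepath


@dataclass
class PoseTraining:
    skeletons: list[Skeleton]
    training_frames: list[TrainingFrame]
    identities: Optional[list[Identity]] = None

    def validate(self) -> list[str]:
        """Return a list of broken links and range errors (empty if none)."""
        errors: list[str] = []
        skeletons = {s.id: s for s in self.skeletons}
        names = {i.name for i in (self.identities or [])}
        for f_idx, frame in enumerate(self.training_frames):
            video = frame.source_video
            if video is not None and video.frame_index < 0:
                errors.append(f"frame {f_idx}: negative frame index")
            for inst in frame.instances:
                skel = skeletons.get(inst.skeleton_id)
                if skel is None:
                    errors.append(
                        f"frame {f_idx}: unknown skeleton {inst.skeleton_id!r}"
                    )
                elif len(inst.node_positions) != len(skel.node_labels):
                    # Assumption: one position per skeleton node.
                    errors.append(f"frame {f_idx}: node count mismatch")
                if inst.identity_name is not None and inst.identity_name not in names:
                    errors.append(
                        f"frame {f_idx}: unknown identity {inst.identity_name!r}"
                    )
        return errors


# Made-up example data, only to show how the pieces nest.
example = PoseTraining(
    identities=[Identity(name="mouse_1", description="example animal")],
    skeletons=[Skeleton(id="mouse", node_labels=["nose", "tail"], edges=[(0, 1)])],
    training_frames=[
        TrainingFrame(
            annotator="example_annotator",
            source_video=SourceVideo(path="videos/example.mp4", frame_index=0),
            instances=[
                Instance(
                    skeleton_id="mouse",
                    identity_name="mouse_1",
                    node_positions=[
                        NodePosition(x=1.0, y=2.0, visible=True),
                        NodePosition(x=3.0, y=4.0, visible=False),
                    ],
                )
            ],
        )
    ],
)
```

Calling `example.validate()` returns an empty list when every instance points at a known skeleton and identity. The check is only there to show how the "Links to" entries could be resolved by name and id.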
@roomrys mostly LGTM; the only thing I'd suggest is for Source Video and Source Frame: these should be ImageSeries (which gives the user the flexibility to choose between external and internal storage) and RGBImage data types, respectively.
With our default approach of writing the image frames directly to an h5py.Dataset, both should work as-is for now, while leaving room in the long term for other user-chosen methods of data storage.
Forked repo here with these specifications. Currently adding tests in SLEAP for writing/reading the proposed format. Subject to change and open to recommendations/concerns!
https://github.com/talmolab/ndx-pose-training/blob/main/spec/ndx-pose.extensions.yaml is looking good - should we open a draft PR to ndx-pose to continue the discussion about any specifics there?
We plan to ...