Custom data preprocessing

Hi, sorry for the late reply, I was traveling for the last few weeks. And thanks for your interest in our work!

For multi-view capture, I think you need to follow neural body and easymocap guidelines. After all, the datasets I used are created by their team. In principle, our preprocessing code should work with any easymocap output, still, you might need to do some tweaks yourself for custom datasets.

If your mean custom smartphone capture, then it's a bit complicated. The overall process is 1) extract person bounding boxes in each frame (I use openpose to get keypoints and compute bounding boxes), 2) get SMPL estimations using SPIN (with bounding boxes computed from stage 1), 3) get foreground masks with MODNet, 4) preprocess the raw SMPL estimations and raw masks so that ARAH dataloaders can load them, 5) train the model.

I could update a script for (4) with some guidelines, probably after the CVPR deadline. However, for other steps, there are too many dependencies which makes it really hard to have a unified codebase doing all of them. I would like to figure out a concise pipeline doing all the steps, but given the complexity of dependencies and other tasks at hand, I cannot promise any timeline for it...

taconite / arah-release

Custom data preprocessing #6