Closed alexrichardson21 closed 4 years ago
Or is there a way to build a more cohesive temporal fitting process instead of just the single frame?
You are right. The difficulty is that for face-only images, no good landmark predictor exists to constrain the neck pose (OpenPose also does a poor job in these cases). If you have a full-body image, you can fit SMPL-X (expressive body model) instead of FLAME (head only) using SMPLify-X. If you are only interested in the head, you can still crop the head-only vertices afterwards (which are basically the FLAME vertices). However, this will give you a much better neck.
Another option is to optimize not only a landmark loss but also a photometric loss (see e.g. this demo for details on how to do this). This provides more supervision than keypoints alone. And yes, one could certainly do a joint optimization over multiple consecutive frames of a video with some temporal regularization.
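To illustrate the temporal regularization idea, here is a minimal sketch, not the fitting code from this repository. It assumes you already have per-frame parameter estimates (e.g. neck pose from independent single-frame fits) and uses a plain quadratic data term as a stand-in for the real landmark or photometric loss. The function name `temporally_smooth` and the `weight` parameter are hypothetical.

```python
import numpy as np

def temporally_smooth(per_frame, weight=1.0):
    """Minimize sum_t ||x_t - y_t||^2 + weight * sum_t ||x_{t+1} - x_t||^2.

    per_frame: array of shape (T,) or (T, P) holding one estimate per frame.
    The quadratic data term is an assumption made for this sketch.
    """
    y = np.asarray(per_frame, dtype=float)
    num_frames = y.shape[0]
    # First-difference operator D with shape (T-1, T): (D x)_t = x_{t+1} - x_t
    D = np.diff(np.eye(num_frames), axis=0)
    # Setting the gradient to zero gives the linear system (I + weight * D^T D) x = y
    A = np.eye(num_frames) + weight * D.T @ D
    return np.linalg.solve(A, y)
```

In an actual joint fit the data term is non-linear, so you would add the same smoothness penalty on consecutive frames to the optimizer's loss instead of solving a linear system. A larger `weight` trades per-frame accuracy for less jitter.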
Interesting. I was using SMPLify-X before, but it runs significantly slower than FLAME fitting. I haven't looked into photometric loss yet, so that seems worth a try.
Hi Alex!
I wonder if you had a chance to look at any neck landmarks/positions?
I've noticed the neck tends to be a little off because the FLAME landmark embedding is missing any sort of neck landmark. If I wanted to add some OpenPose landmarks to the embedding, how would I find the barycentric coordinates for the new landmarks?
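On the barycentric coordinates part of the question, here is a minimal sketch of the standard computation. It assumes you have already picked the mesh triangle the new landmark should sit on, and that a landmark is stored as a face index plus three barycentric weights. The function names are hypothetical and not part of the FLAME code.

```python
import numpy as np

def barycentric_coords(p, a, b, c):
    """Barycentric weights (u, v, w) of point p w.r.t. triangle (a, b, c).

    If p is off the triangle's plane, the weights describe its
    orthogonal projection onto that plane.
    """
    v0, v1, v2 = b - a, c - a, p - a
    d00, d01, d11 = v0 @ v0, v0 @ v1, v1 @ v1
    d20, d21 = v2 @ v0, v2 @ v1
    denom = d00 * d11 - d01 * d01  # zero only for a degenerate triangle
    v = (d11 * d20 - d01 * d21) / denom
    w = (d00 * d21 - d01 * d20) / denom
    return np.array([1.0 - v - w, v, w])

def point_from_barycentric(weights, a, b, c):
    """Recover the 3D landmark position from weights and triangle corners."""
    return weights[0] * a + weights[1] * b + weights[2] * c
```

For each new landmark you would pick a 3D position on the template mesh, find the closest triangle, and store that triangle's index together with the three weights. Weights that all lie in [0, 1] mean the projected point falls inside the triangle.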