Closed Orig1n closed 10 months ago
Great question! This was indeed a challenge during our research. Although EG3D had published their code, they did not provide an explanation for the manual adjustments they made to their camera poses in their code. This was particularly problematic since our face alignment process was completely different. For the talking face task, we cropped the videos based solely on the bounding box calculated in the first frame, meaning that the subsequent frames were not aligned. Given this approach, determining how to model the rotation and translation was complex, especially when the EG3D camera convention was potentially misleading. We had no option but to manually adjust these parameters in the context of a talking face setting.
I'm confused when I read this function. Do the operations like
trans[2] += -10
,c *= 0.27 c[1] += 0.015 c[2] += 0.161
,K[0,0] = 2985.29/700 * focal / 1050 K[1,1] = 2985.29/700 * focal / 1050
andpose[:3, 3] = pose[:3, 3]/4.0 * 2.7
have any special meaning?