Hello, I am doing computer vision research, in which I want to create a simulation of people walking around the street. To be more specific, can we get:
The head pose (yaw, pitch, roll) of the person (compared to camera view)
The normalized position (x,y coordinate) of the person
If we can get that, where should i start?
Hello, I am doing computer vision research, in which I want to create a simulation of people walking around the street. To be more specific, can we get: