EricGuo5513 / text-to-motion

Official implementation for "Generating Diverse and Natural 3D Human Motions from Texts (CVPR2022)."
MIT License
428 stars 38 forks source link

Question about 263 dimensions #36

Closed 1311894932 closed 8 months ago

1311894932 commented 8 months ago

the paper says pose is defined by root angular vec(Y), linear vec(XZ), root height. And LOCAL joints positions(R^3j), velocities(R^3j) and rotations(R^6j) in root space, finally there is binary features obtained by thresholding the heel and toe joint velocities to emphasize the foot ground contacts(R^4). So i think the dimensions of pose is 4(root)+21*12+4(binary features)=260? where is the lost 3 dimensions?

Any response will be appreciated!!!

EricGuo5513 commented 8 months ago

Hi, each dimension of the pose feature vector looks like this:

Recover global angle and positions for rotation data

root_rot_velocity (B, seq_len, 1)# root_linear_velocity (B, seq_len, 2)# root_y (B, seq_len, 1)# ric_data (B, seq_len, (joint_num - 1)3)# rot_data (B, seq_len, (joint_num - 1)6)# local_velocity (B, seq_len, joint_num*3)# foot contact (B, seq_len, 4)

On Tue, 31 Oct 2023 at 07:49, 1311894932 @.***> wrote:

the paper says pose is defined by root angular vec(Y), linear vec(XZ), root height. And LOCAL joints positions(R^3j), velocities(R^3j) and rotations(R^6j) in root space, finally there is binary features obtained by thresholding the heel and toe joint velocities to emphasize the foot ground contacts(R^4). So i think the dimensions of pose is 4(root)+21*12+4(binary features)=260? where is the lost 3 dimensions?

Any response will be appreciated!!!

— Reply to this email directly, view it on GitHub https://github.com/EricGuo5513/text-to-motion/issues/36, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKRYNB5GVCQOXCACFYMI76LYCD6XJAVCNFSM6AAAAAA6XXLAQKVHI2DSMVQWIX3LMV43ASLTON2WKOZRHE3TANJRHE3DINI . You are receiving this because you are subscribed to this thread.Message ID: @.***>

1311894932 commented 8 months ago

Hello Dr.Guo! Thanks very much for replying amidst busy schedule !!!

------------------ 原始邮件 ------------------ 发件人: "EricGuo5513/text-to-motion" @.>; 发送时间: 2023年11月2日(星期四) 凌晨0:12 @.>; @.**@.>; 主题: Re: [EricGuo5513/text-to-motion] Question about 263 dimensions (Issue #36)

Hi, each dimension of the pose feature vector looks like this:

Recover global angle and positions for rotation data

root_rot_velocity (B, seq_len, 1)# root_linear_velocity (B, seq_len, 2)# root_y (B, seq_len, 1)# ric_data (B, seq_len, (joint_num - 1)3)# rot_data (B, seq_len, (joint_num - 1)6)# local_velocity (B, seq_len, joint_num*3)# foot contact (B, seq_len, 4)

On Tue, 31 Oct 2023 at 07:49, 1311894932 @.***> wrote:

> the paper says pose is defined by root angular vec(Y), linear vec(XZ), > root height. And LOCAL joints positions(R^3j), velocities(R^3j) and > rotations(R^6j) in root space, finally there is binary features obtained by > thresholding the heel and toe joint velocities to emphasize the foot ground > contacts(R^4). So i think the dimensions of pose is 4(root)+21*12+4(binary > features)=260? where is the lost 3 dimensions? > > Any response will be appreciated!!! > > — > Reply to this email directly, view it on GitHub > <https://github.com/EricGuo5513/text-to-motion/issues/36&gt;, or unsubscribe > <https://github.com/notifications/unsubscribe-auth/AKRYNB5GVCQOXCACFYMI76LYCD6XJAVCNFSM6AAAAAA6XXLAQKVHI2DSMVQWIX3LMV43ASLTON2WKOZRHE3TANJRHE3DINI&gt; > . > You are receiving this because you are subscribed to this thread.Message > ID: @.***> >

— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: @.***>