etched-ai / open-oasis

Inference script for Oasis 500M
MIT License

Camera action tokenization #9

Open jxiong21029 opened 2 hours ago

jxiong21029 commented 2 hours ago

Hello,

In the provided data, the camera actions have already been pre-quantized to integer values from 0-79 inclusive. How do raw mouse movements get converted to these values?

In the `one_hot_actions` method, the comment references another repository for the camera quantization, but that repository does not appear to be publicly available.

Thanks.

jxiong21029 commented 1 hour ago

Did you use VPT foveated quantization with camera_maxval=20, camera_binsize=0.5, and mu somewhere between 2.3 and 3.1?
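
To make the question concrete, here is a minimal sketch of the mu-law ("foveated") camera quantization I have in mind, written from my understanding of the VPT scheme. It is a guess at the approach, not the confirmed Oasis preprocessing. The function names are my own, and `mu=2.7` is just a placeholder picked from the 2.3 to 3.1 range above.

```python
import numpy as np


def discretize_camera(xy, maxval=20.0, binsize=0.5, mu=2.7):
    """Map raw camera deltas to integer bins with mu-law companding.

    Assumed parameters: maxval, binsize and mu are guesses, not values
    confirmed by the Oasis authors.
    """
    # Clip to the representable range, then normalize to [-1, 1].
    xy = np.clip(np.asarray(xy, dtype=np.float64), -maxval, maxval)
    norm = xy / maxval
    # Mu-law encoding: finer resolution near zero, coarser at the extremes.
    encoded = np.sign(norm) * np.log1p(mu * np.abs(norm)) / np.log1p(mu)
    encoded = encoded * maxval
    # Uniform binning of the companded value.
    return np.round((encoded + maxval) / binsize).astype(np.int64)


def undiscretize_camera(bins, maxval=20.0, binsize=0.5, mu=2.7):
    """Approximate inverse: map bin indices back to camera deltas."""
    encoded = np.asarray(bins, dtype=np.float64) * binsize - maxval
    norm = encoded / maxval
    # Inverse mu-law expansion.
    decoded = np.sign(norm) * ((1.0 + mu) ** np.abs(norm) - 1.0) / mu
    return decoded * maxval


if __name__ == "__main__":
    deltas = np.array([-20.0, -5.0, 0.0, 0.3, 5.0, 20.0])
    bins = discretize_camera(deltas)
    print(bins)
    print(undiscretize_camera(bins))
```

One thing I noticed: with `camera_maxval=20` and `camera_binsize=0.5` this formula produces indices 0 through 80 (81 bins, with zero movement at bin 40), which is one more than the 0-79 range in the provided data, so at least one of my guessed parameters or the rounding step is probably different.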