sambanova / generative_data_prep

Apache License 2.0

Zoltanc/category id rebalance bug fix #47

Closed snova-zoltanc closed 1 year ago

snova-zoltanc commented 1 year ago

Summary

A previous PR added an option to attach metadata recording the category ID of each token. That PR did not update the rebalancing stage to also rebalance the category ID tokens, so they could fall out of step with the tokens they describe. This PR fixes that.
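
To illustrate the kind of bug described above, here is a minimal sketch, not the repository's actual code. It assumes rebalancing redistributes sequences evenly across splits, and that each token sequence has a parallel list of category IDs. The function name `rebalance_with_category_ids` and the round-robin strategy are hypothetical, chosen only to show that both lists must be moved together.

```python
from typing import List, Tuple

Sequence = List[int]
Split = List[Sequence]


def rebalance_with_category_ids(
    token_splits: List[Split],
    category_splits: List[Split],
) -> Tuple[List[Split], List[Split]]:
    """Redistribute sequences round-robin across splits (hypothetical strategy).

    Each token sequence is paired with its category ID sequence before moving,
    so the two always land in the same split at the same position.
    """
    if len(token_splits) != len(category_splits):
        raise ValueError("token and category ID splits must have the same count")

    # Pair every token sequence with its category ID sequence.
    pairs = []
    for split_tokens, split_cats in zip(token_splits, category_splits):
        if len(split_tokens) != len(split_cats):
            raise ValueError("each split needs one category ID list per sequence")
        for tokens, cats in zip(split_tokens, split_cats):
            if len(tokens) != len(cats):
                raise ValueError("each token needs exactly one category ID")
            pairs.append((tokens, cats))

    num_splits = len(token_splits)
    new_tokens: List[Split] = [[] for _ in range(num_splits)]
    new_cats: List[Split] = [[] for _ in range(num_splits)]

    # Move the pair as a unit; moving only the tokens is the bug being fixed.
    for i, (tokens, cats) in enumerate(pairs):
        new_tokens[i % num_splits].append(tokens)
        new_cats[i % num_splits].append(cats)

    return new_tokens, new_cats
```

Pairing the sequences before redistributing them means the category IDs cannot be left behind in their original split, which is the failure mode this PR addresses.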

PR Checklist