Open BlinkDL opened 1 year ago
We're looking into this, stay tuned!
Thanks for the bug report, we've just fixed this. There was a mistake in the mapping between old and new parameter names that we've now fixed.
Great. How abt the configuration for 125M and 355M
Here are examples about how to load all the models, and example outputs: https://github.com/HazyResearch/H3/blob/main/examples/README.md
Hi I can run 1.3B using benchmark code here, but 2.7B is still not working (bad results) with the following params: