Hello there,
I would like to ask why there was a reduction to only 4096 params from the model it was built from.
And if I have the compute, wouldn't I be better off using the original model Chronos was based on, given the number of tokens?
However, I am guessing it would just be an empty model, right? The pro would be that I could perhaps use covariates.
Thanks for answering.