Closed qixuema closed 9 months ago
@qixuema oh yes indeed, should be fixed in the latest commit
i take it you must be seeing some positive results on your end?
@lucidrains I'm really sorry, but I haven't focused much on the triangle mesh, so I may not be able to answer how the results are regarding the mesh at the moment.
If you think it's necessary, I could consider conducting some tests on the mesh in the near future because, in any case, directly generating a mesh is an exciting research work.
Lastly, thank you very much for your work!
oh! no problem
assumed you had already gotten to the generation stage
in any event, thanks for finding this issue!
Thank you for your contributions to this project! Your work is greatly appreciated.
Hi Phil,
Thank you for your valuable contributions to this project!
I'm encountering a problem with handling long sequences here, where the maximum sequence length is set to 2048. The issue arises when processing a batch of code samples; if the iteration reaches the maximum sequence length without appending an
eos_token_id
at the end of all samples, the subsequent code block that depend on this are skipped.This results in the codes, inputted into later stages of the process, still containing an
eos_token_id
. This could potentially lead to errors in operations such asgather
that follow.Best regards, Xueqi