-
hello,可以分享下benchmark_batch_size.py这个脚本吗,谢谢
-
I have noticed that my fine-tuned versions of the phi-3.5-mini model generate incoherent content when exceeding an output length of 4096 tokens. I could reproduce this behaviour with the base-model as…
-
Unsloth: You have 1 CPUs. Using `safe_serialization` is 10x slower.
We shall switch to Pytorch saving, which will take 3 minutes and not 30 minutes.
To force `safe_serialization`, set it to `None` i…
-
Hi there Jake!
Just a quick question - I saw in the TransPhyloMulti paper that it should be possible to change the sampling proportion over time, but I just wanted to check whether the same thing c…
-
## Info
- [ACM article From Alibaba (2023)](https://doi.org/10.1145/3617327)
## Abstraction
- **Dynamic tensor shape**
- Unlike Image processing, language models such as LLM need to handle dyn…
-
Thank you for the work and for releasing its code.
Can you please give the steps on how to run the code for Partial Mesh Completion as mentioned in the paper?
-
# Background
The edge generation step ends with the `_meshes` property of the shape being filled with `SS2D_Mesh`[[1]] objects, which in turn consist of one or more `ArrayMesh` objects.
In `_draw()`…
mphe updated
2 months ago
-
### Prerequisites
- [X] I am using [the latest version of Locust](https://github.com/locustio/locust/releases/)
- [X] I am reporting a bug, not asking a question
### Description
I have a test scena…
-
### 🐛 Describe the bug
We are planning to turn on the inline_inbuilt_nn_modules flag to True by default. There are a few hiccups here and there, but we will turn on the flag and fix these issues late…
-
https://github.com/megvii-research/DPGN/blob/b9401115b43319544c07718890e115e7f4fc2a1d/main.py#L316
`self.pred_loss(query_node_pred_generation, query_label.long()).mean()`
For example, 5 way 1 sh…