-
**Describe the bug**
I have encountered several issues while attempting to combine a Mixture-of-Experts (MoE) technique with LoRA fine-tuning on the Llama 2 model using DeepSpeed. I am using DeepSpeed ze…
-
When working with database, messaging, or other high-level client libraries, applications can create (at least) two distinct layers of spans when both layers are instrumented:
- logical operation (suc…
-
https://arxiv.org/abs/1711.08324
(Submitted on 22 Nov 2017)
-
1. When the noise image output by the generator is added to the shot noise, should the noise values lie in the range 0 to 1? My guess is to use bl with wb in the camera parameters for deflation, as written in…
-
Hi,
I was trying out and benchmarking different single-device Mixture of Experts layer implementations, and I naively assumed vmap could be of help in this task. In some cases a simple vmap-based i…
-
Thank you so much for sharing!
Could you provide insights into the number of epochs required to achieve high-resolution, fine details during VQVAE training for 256x256 RGB images?
Additionally, …
-
I am attempting to use TFQ to do measurement noise mitigation by passing the measured bitstrings into a TensorFlow model. Ultimately, this is something I'd like to run on either physical or simulated …
-
Hi Dr. Xu,
Would you be able to share the pretrained model with me?
-
Hello,
First of all, thank you very much for the implementation you have done. It has saved us a lot of time, and it is really great work.
I am trying to do super-resolution on 64*64 to 256*256 imag…
-
The docs for DifferentialEquations.jl include a nice FAQ section with performance tips [here](https://docs.sciml.ai/stable/basics/faq/). Is there anything like that for common performance pitfalls fo…