-
In another issue, I wrote the following:
> I dream of a compiler that is powerful enough to let users focus solely on the *semantics* of a program (*what* is to be computed), while the compiler fig…
-
If you are submitting a bug report, please fill in the following details and use the tag [bug].
**Describe the bug**
The generations from huggingface model (LlamaForCausalLM) and HookedTransformer…
-
- see https://github.com/ObrienlabsDev/machine-learning/issues/10
## Use Cases
Tensor cores have 3.5x the performance on NVidia GPUs than cuda cores
### LLM and Generative AI
- https://github.…
-
### Feature request
PagedAttention has been a mainstream optimization technology for generation task based on LLMs. It has been supported by a lot of server engines, e.g., [vllm](https://github.co…
-
> PyTorch 是由 Facebook 开发,基于 **Torch** 开发,从并不常用的 Lua 语言转为 Python 语言开发的深度学习框架,Torch 是 TensorFlow 开源前非常出名的一个深度学习框架,而 PyTorch 在开源后由于其使用简单,动态计算图的特性得到非常多的关注,并且成为了 TensorFlow 的 最大竞争对手。目前其 Github 也有 2w8+ 关注。
…
-
Hi,
Thank you for this exciting work. I will try to explain a use case that I hope will be possible with functorch because the naive way is just extremely slow.
Setting:
The use case follows [thi…
-
https://github.com/tensorflow/tensorflow/issues/50401
Created: 2021-06-22T15:34:15Z
----------------------------------------------------------------------------------------------------
**System inf…
-
Draft for context changes
- Operators on stack
- Tensor name and pointer mapping for easier operator porting
- Easier breakpoint setting
- Extensibility for tensor object-pooling, persistent opera…
-
### Description
I use Ray in an HPC cluster. The cluster has InfiniBand which has low latency and high bandwidth. Ray is based on gRPC and data transferring uses gRPC, too. I can use IPoIB(Internet …
-
Colossal-AI may be used to reduce GPU Memory consumption during training (but may cause more training time).
https://github.com/hpcaitech/ColossalAI
Colossal-AI is used for acceleration of AIGC (A…