mlsys Search Results - Githubissues

125 results
for mlsys

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Oneflow-Inc/libai #340

运行tools/train.sh脚本报错：Check failed: num_device > 0 (0 vs. 0) …

> 使用的oneflow版本：0.8.0+cu102，使用的libai版本：最新commit 目前正在尝试利用oneflow-libai跑gpt-2，按照[tutorial](https://libai.readthedocs.io/en/latest/tutorials/get_started/quick_run.html)的指示，仅修改了dataset相关的配置信息，运行`bash to…

Sakura-gh updated 2 years ago
3
jasperzhong/read-papers-and-code #310

MLSys '22 | Accelerating Training and Inference of Graph Neu…

https://arxiv.org/pdf/2110.08450.pdf

jasperzhong updated 2 years ago
1
caiyinqiong/Semantic-Retrieval-Models #3

Recommend to use this tool to collect retrieval-related pape…

Hi, I am Gordon Lee. Sorry to bother you with this issue. Thanks for your excellent work on sematic-retrieval models. Recently, MLNLP and I have made a search tool to collect top-tier conference up…

Doragd updated 1 year ago
2
thuml/awesome-multi-task-learning #2

New MTL papers from our group

Hi there, We're a [MLSys group](https://guanh01.github.io/) working on multi-task learning at the University of Massachusetts Amherst. Recently we have some new works completed. Would you mind taki…

zhanglijun95 updated 2 years ago
3
ise-uiuc/nnsmith #33

[FEATURE TRACKING] Preparing and Enhancing Open Source Devel…

Before open-sourcing the nnsmith project, I want to simplify and standardize a bit the development and user accessibility. Below is a tracking list of TODOs for @ganler to make the repository a better…

ganler updated 2 years ago
14
UofT-EcoSystem/hfta #10

[Optim] No need to broadcastablize coefficients during __ini…

Technically speaking, `.view()` should not actually trigger a memory reformatting (at least for GPUs; XLA is relatively unclear as it brings in extra complexity); therefore, the tensors that contain t…

wangshangsam updated 2 years ago
1
tensorflow/data-validation #170

What was the motivation of selected drift/skew comparators?

Are there any publications/docs explaining the motivation behind using `jensen_shannon_divergence` / `infinity_norm` for the Data Drift / Training-Serving Skew detection? Since there are many approa…

michalbrys updated 2 years ago
2
jasperzhong/read-papers-and-code #166

MLSys '21 | Scaling Distributed Training with Adaptive Summa…

https://proceedings.mlsys.org/paper/2021/file/757b505cfd34c64c85ca5b5690ee5293-Paper.pdf 感觉应该发ICLR.

jasperzhong updated 3 years ago
2
brais-martinez/real2binary #4

Data-Driven Channel Rescaling

Firstly, greatly thanks for sharing you brilliant work! After reading the R2B paper, I got little confused about the data-driven rescaling. For BNN, one of the most significant benefit is to use on…

wu-hai updated 2 years ago
2
PeterSH6/paper-notes #11

MLSys '21 | CPR: Understanding and Improving Failure Toleran…

PDF: [https://research.fb.com/wp-content/uploads/2021/04/CPR-Understanding-and-Improving-Failure-Tolerant-Training-for-Deep-Learning-Recommendation-with-Partial-Recovery.pdf](https://research.fb.com/w…

PeterSH6 updated 3 years ago
3

上一页 1...7 8 9 10 11 12 13...13 下一页

125 results for mlsys

125 results
for mlsys