issues
search
gaow0007
/
code-reading
MIT License
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Online Workload Allocation and Energy Optimization in Large Language Model Inference Systems
#190
gaow0007
opened
5 months ago
0
BatchSampler: Sampling Mini-Batches for Contrastive Learning in Vision, Language, and Graphs
#189
gaow0007
opened
5 months ago
0
Pytorch-Memory-Utils
#188
gaow0007
opened
5 months ago
0
Data-Juicer: A Data-Centric Text Processing System for Large Language Models
#187
gaow0007
opened
1 year ago
0
Saturn: Optimized Training of Multiple Large Deep Learning Models
#186
gaow0007
opened
1 year ago
0
SPRIGHT: Extracting the Server from Serverless Computing! High-Performance EBPF-Based Event-Driven, Shared-Memory Processing
#185
gaow0007
opened
1 year ago
0
On Optimal Caching and Model Multiplexing for Large Model Inference
#184
gaow0007
opened
1 year ago
0
FECoM: A Step towards Fine-Grained Energy Measurement for Deep Learning
#183
gaow0007
opened
1 year ago
0
ProxyStore
#182
gaow0007
opened
1 year ago
0
python-azure-function-gpu
#181
gaow0007
opened
1 year ago
0
Tools-MMBench: Benchmarking End-to-End Multi-modal DNNs and Understanding Their Hardware-Software Implications
#180
gaow0007
opened
1 year ago
0
TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
#179
gaow0007
opened
1 year ago
0
ProPack: Executing Concurrent Serverless Functions Faster and Cheaper
#178
gaow0007
opened
1 year ago
0
Kairos: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
#177
gaow0007
opened
1 year ago
0
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
#176
gaow0007
opened
1 year ago
0
Copy is All You Need
#175
gaow0007
opened
1 year ago
0
ServerlessBench
#174
gaow0007
opened
1 year ago
0
ML.ENERGY Leaderboard
#173
gaow0007
opened
1 year ago
0
On Optimal Caching and Model Multiplexing for Large Model Inference
#172
gaow0007
opened
1 year ago
0
Augmenting Language Models with Long-Term Memory
#171
gaow0007
opened
1 year ago
0
Computron
#170
gaow0007
opened
1 year ago
0
FP8 Quantization: The Power of the Exponent
#169
gaow0007
opened
1 year ago
0
RAF: RAF Accelerates deep learning Frameworks
#168
gaow0007
opened
1 year ago
1
Profiling and Monitoring of Deep Learning Training Tasks
#167
gaow0007
opened
1 year ago
0
Characterizing Scaling and Transfer Learning of Neural Networks in SciML
#166
gaow0007
opened
1 year ago
0
How Can We Train Deep Learning Models Across Clouds and Continents? An Experimental Study
#165
gaow0007
opened
1 year ago
0
JAX MeZO: Fine-Tuning Language Models with Just Forward Passes
#164
gaow0007
opened
1 year ago
0
Algorithms for explaining machine learning models
#163
gaow0007
opened
1 year ago
0
Constrained Value-Aligned LLM via Safe RLHF
#162
gaow0007
opened
1 year ago
0
Tetris: Memory-efficient Serverless Inference through Tensor Sharing
#161
gaow0007
opened
1 year ago
0
QLoRA: Efficient Finetuning of Quantized LLMs
#160
gaow0007
opened
1 year ago
0
Blazing fast bulk data transfers between any cloud
#159
gaow0007
opened
1 year ago
0
Open-source data curation platform for LLMs
#158
gaow0007
opened
1 year ago
0
CES (Conformalized Early Stopping)
#157
gaow0007
opened
1 year ago
0
TinyNAS
#156
gaow0007
opened
1 year ago
1
FasterTransformer
#155
gaow0007
opened
1 year ago
0
Large Language Models with Parameter-Efficient Federated Finetuning in the Presence of Heterogeneous Instructions
#154
gaow0007
opened
1 year ago
0
IGB: Addressing The Gaps In Labeling, Features, Heterogeneity, and Size of Public Graph Datasets for Deep Learning Research
#153
gaow0007
opened
1 year ago
0
CausalSim: A Causal Framework for Unbiased Trace-Driven Simulation
#152
gaow0007
opened
1 year ago
0
Enable Fundamental Cacheability for Distributed Deep Learning Training
#151
gaow0007
opened
1 year ago
0
GC3: An Optimizing Compiler for GPU Collective Communication
#150
gaow0007
opened
1 year ago
1
Prediction of the Resource Consumption of Distributed Deep Learning Systems
#149
gaow0007
opened
1 year ago
0
Let's Wait Awhile - Datasets, Simulator, Analysis
#148
gaow0007
opened
1 year ago
0
MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters
#147
gaow0007
opened
1 year ago
0
The Power of Prediction: Microservice Auto Scaling via Workload Learning
#146
gaow0007
opened
1 year ago
0
PROFILING AND IMPROVING THE PYTORCH DATALOADER FOR HIGH-LATENCY STORAGE
#145
gaow0007
opened
1 year ago
0
Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
#144
gaow0007
opened
1 year ago
0
PerfSpect
#143
gaow0007
opened
1 year ago
0
Magicube: Efficient Quantized Sparse Matrix Operations on Tensor Cores
#142
gaow0007
opened
1 year ago
0
Towards Demystifying Serverless Machine Learning Training
#141
gaow0007
opened
1 year ago
0
Next