gaow0007 code-reading issues

gaow0007 / code-reading

MIT License

0 stars 0 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Online Workload Allocation and Energy Optimization in Large Language Model Inference Systems

#190 gaow0007 opened 5 months ago
0
BatchSampler: Sampling Mini-Batches for Contrastive Learning in Vision, Language, and Graphs

#189 gaow0007 opened 5 months ago
0
Pytorch-Memory-Utils

#188 gaow0007 opened 5 months ago
0
Data-Juicer: A Data-Centric Text Processing System for Large Language Models

#187 gaow0007 opened 1 year ago
0
Saturn: Optimized Training of Multiple Large Deep Learning Models

#186 gaow0007 opened 1 year ago
0
SPRIGHT: Extracting the Server from Serverless Computing! High-Performance EBPF-Based Event-Driven, Shared-Memory Processing

#185 gaow0007 opened 1 year ago
0
On Optimal Caching and Model Multiplexing for Large Model Inference

#184 gaow0007 opened 1 year ago
0
FECoM: A Step towards Fine-Grained Energy Measurement for Deep Learning

#183 gaow0007 opened 1 year ago
0
ProxyStore

#182 gaow0007 opened 1 year ago
0
python-azure-function-gpu

#181 gaow0007 opened 1 year ago
0
Tools-MMBench: Benchmarking End-to-End Multi-modal DNNs and Understanding Their Hardware-Software Implications

#180 gaow0007 opened 1 year ago
0
TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs

#179 gaow0007 opened 1 year ago
0
ProPack: Executing Concurrent Serverless Functions Faster and Cheaper

#178 gaow0007 opened 1 year ago
0
Kairos: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources

#177 gaow0007 opened 1 year ago
0
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

#176 gaow0007 opened 1 year ago
0
Copy is All You Need

#175 gaow0007 opened 1 year ago
0
ServerlessBench

#174 gaow0007 opened 1 year ago
0
ML.ENERGY Leaderboard

#173 gaow0007 opened 1 year ago
0
On Optimal Caching and Model Multiplexing for Large Model Inference

#172 gaow0007 opened 1 year ago
0
Augmenting Language Models with Long-Term Memory

#171 gaow0007 opened 1 year ago
0
Computron

#170 gaow0007 opened 1 year ago
0
FP8 Quantization: The Power of the Exponent

#169 gaow0007 opened 1 year ago
0
RAF: RAF Accelerates deep learning Frameworks

#168 gaow0007 opened 1 year ago
1
Profiling and Monitoring of Deep Learning Training Tasks

#167 gaow0007 opened 1 year ago
0
Characterizing Scaling and Transfer Learning of Neural Networks in SciML

#166 gaow0007 opened 1 year ago
0
How Can We Train Deep Learning Models Across Clouds and Continents? An Experimental Study

#165 gaow0007 opened 1 year ago
0
JAX MeZO: Fine-Tuning Language Models with Just Forward Passes

#164 gaow0007 opened 1 year ago
0
Algorithms for explaining machine learning models

#163 gaow0007 opened 1 year ago
0
Constrained Value-Aligned LLM via Safe RLHF

#162 gaow0007 opened 1 year ago
0
Tetris: Memory-efficient Serverless Inference through Tensor Sharing

#161 gaow0007 opened 1 year ago
0
QLoRA: Efficient Finetuning of Quantized LLMs

#160 gaow0007 opened 1 year ago
0
Blazing fast bulk data transfers between any cloud

#159 gaow0007 opened 1 year ago
0
Open-source data curation platform for LLMs

#158 gaow0007 opened 1 year ago
0
CES (Conformalized Early Stopping)

#157 gaow0007 opened 1 year ago
0
TinyNAS

#156 gaow0007 opened 1 year ago
1
FasterTransformer

#155 gaow0007 opened 1 year ago
0
Large Language Models with Parameter-Efficient Federated Finetuning in the Presence of Heterogeneous Instructions

#154 gaow0007 opened 1 year ago
0
IGB: Addressing The Gaps In Labeling, Features, Heterogeneity, and Size of Public Graph Datasets for Deep Learning Research

#153 gaow0007 opened 1 year ago
0
CausalSim: A Causal Framework for Unbiased Trace-Driven Simulation

#152 gaow0007 opened 1 year ago
0
Enable Fundamental Cacheability for Distributed Deep Learning Training

#151 gaow0007 opened 1 year ago
0
GC3: An Optimizing Compiler for GPU Collective Communication

#150 gaow0007 opened 1 year ago
1
Prediction of the Resource Consumption of Distributed Deep Learning Systems

#149 gaow0007 opened 1 year ago
0
Let's Wait Awhile - Datasets, Simulator, Analysis

#148 gaow0007 opened 1 year ago
0
MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters

#147 gaow0007 opened 1 year ago
0
The Power of Prediction: Microservice Auto Scaling via Workload Learning

#146 gaow0007 opened 1 year ago
0
PROFILING AND IMPROVING THE PYTORCH DATALOADER FOR HIGH-LATENCY STORAGE

#145 gaow0007 opened 1 year ago
0
Building a Performance Model for Deep Learning Recommendation Model Training on GPUs

#144 gaow0007 opened 1 year ago
0
PerfSpect

#143 gaow0007 opened 1 year ago
0
Magicube: Efficient Quantized Sparse Matrix Operations on Tensor Cores

#142 gaow0007 opened 1 year ago
0
Towards Demystifying Serverless Machine Learning Training

#141 gaow0007 opened 1 year ago
0