chufanchen
/
read-paper-and-code
0
stars
0
forks
issues
CoLLAs 2023 | Task-Agnostic Continual Reinforcement Learning: Gaining Insights and Overcoming Challenges
#84
chufanchen
opened
7 months ago
0
ICLR 2024 | TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models
#83
chufanchen
closed
7 months ago
2
DiLoCo: Distributed Low-Communication Training of Language Models
#82
chufanchen
opened
7 months ago
0
SoCC 2018 | Fast Distributed Deep Learning via Worker-adaptive Batch Sizing
#81
chufanchen
closed
7 months ago
2
ACSOS 2020 | Taming Resource Heterogeneity In Distributed ML Training With Dynamic Batching
#80
chufanchen
opened
7 months ago
0
SoCC 2020 | Semi-dynamic load balancing: efficient distributed learning in non-dedicated environments
#79
chufanchen
closed
7 months ago
4
SIGMOD '22 | NuPS: A Parameter Server for Machine Learning with Non-Uniform Parameter Access
#78
chufanchen
opened
7 months ago
0
Towards a Universal Decision Making Paradigm
#77
chufanchen
opened
7 months ago
0
ICLR 2021 | Reset-Free Lifelong Learning via Skill-Space Planning
#76
chufanchen
opened
7 months ago
0
ICML 2023 | Parameter-Level Soft-Masking for Continual Learning
#75
chufanchen
opened
7 months ago
2
ICLR 2023 | Continual Pre-training of Language Models
#74
chufanchen
opened
7 months ago
0
EMNLP 2023 | Sub-network Discovery and Soft-masking for Continual Learning of Mixed Tasks
#73
chufanchen
closed
7 months ago
3
ICML 2019 | Policy Consolidation for Continual Reinforcement Learning
#72
chufanchen
opened
7 months ago
0
An Empirical Model of Large-Batch Training
#71
chufanchen
opened
7 months ago
2
NeurIPS 2023 | Rewiring Neurons in Non-Stationary Environments
#70
chufanchen
opened
7 months ago
2
ICLR 2024 | MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning
#69
chufanchen
opened
7 months ago
0
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
#68
chufanchen
opened
7 months ago
0
OSDI '20 | A Unified Architecture for Accelerating Distributed DNN Training in Heterogeneous GPU/CPU Clusters
#67
chufanchen
closed
7 months ago
5
Large-scale Reinforcement Learning for Diffusion Models
#66
chufanchen
opened
7 months ago
0
Video as the New Language for Real-World Decision Making
#65
chufanchen
opened
7 months ago
0
ICML 2018 | Lipschitz Continuity in Model-based Reinforcement Learning
#64
chufanchen
opened
7 months ago
0
Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
#63
chufanchen
opened
7 months ago
0
ICLR 2024 | ODICE: Revealing the Mystery of DICE via Orthogonal-gradient Update
#62
chufanchen
opened
7 months ago
0
ICLR 2024 | Policy Rehearsing: Training Generalizable Policies for Reinforcement Learning
#61
chufanchen
opened
7 months ago
0
ICLR 2024 | Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
#60
chufanchen
opened
7 months ago
0
CoRR 2023 | ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
#59
chufanchen
opened
7 months ago
0
CoRR 2023 | Policy Optimization in RLHF: The Impact of Out-of-preference Data
#58
chufanchen
opened
7 months ago
0
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
#57
chufanchen
opened
7 months ago
0
Proximal Preference Optimization for Diffusion Models
#56
chufanchen
opened
7 months ago
0
ICLR 2023 | Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
#55
chufanchen
opened
7 months ago
0
MM '23 | Guided Image Synthesis via Initial Image Editing in Diffusion Model
#54
chufanchen
opened
7 months ago
0
ICLR 2024 | Training Diffusion Models with Reinforcement Learning
#53
chufanchen
opened
7 months ago
0
Diffusion Model Alignment Using Direct Preference Optimization
#52
chufanchen
opened
7 months ago
2
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey
#51
chufanchen
opened
7 months ago
0
ICLR 2022 | Learning a subspace of policies for online adaptation in Reinforcement Learning
#50
chufanchen
closed
7 months ago
1
SaLinA - A Flexible and Simple Library for Learning Sequential Agents (including Reinforcement Learning)
#49
chufanchen
opened
7 months ago
0
ICLR 2023 | Building a Subspace of Policies for Scalable Continual Learning
#48
chufanchen
opened
7 months ago
0
ICLR 2023 | Does Zero-shot Reinforcement Learning Exist?
#47
chufanchen
opened
7 months ago
0
AAMAS 2024 | Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
#46
chufanchen
opened
7 months ago
0
Unsupervised Zero-Shot RL via Functional Reward Representations
#45
chufanchen
opened
7 months ago
0
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
#44
chufanchen
opened
7 months ago
0
CoLLAs 2023 | The Effectiveness of World Models for Continual Reinforcement Learning
#43
chufanchen
opened
7 months ago
0
Harnessing Discrete Representations For Continual Reinforcement Learning
#42
chufanchen
opened
7 months ago
0
NeurIPS 2022 | Disentangling Transfer in Continual Reinforcement Learning
#41
chufanchen
closed
7 months ago
3
CoLLAs 2023 | Loss of Plasticity in Continual Deep Reinforcement Learning
#40
chufanchen
opened
7 months ago
0
NeurIPS 2023 | COOM: A Game Benchmark for Continual Reinforcement Learning
#39
chufanchen
opened
7 months ago
0
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
#38
chufanchen
opened
7 months ago
0
CDC 2023 | Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding
#37
chufanchen
opened
7 months ago
0
NeurIPS 2022 | Decentralized Training of Foundation Models in Heterogeneous Environments
#36
chufanchen
opened
7 months ago
0
ASPLOS 2020 | Heterogeneity-Aware Asynchronous Decentralized Training
#35
chufanchen
opened
7 months ago
0