multi-task-reinforcement-learning Search Results

388 results
for multi-task-reinforcement-learning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

irthomasthomas/undecidability #645

LoRA Land: Fine-Tuned Open-Source LLMs that Outperform GPT-4…

- [ ] [LoRA Land: Fine-Tuned Open-Source LLMs that Outperform GPT-4 - Predibase - Predibase](https://predibase.com/blog/lora-land-fine-tuned-open-source-llms-that-outperform-gpt-4) # LoRA Land: Fine…

irthomasthomas updated 7 months ago
1
meta-introspector/meta-meme #160

Metameme coin : the godel number is the proof is the genesis…

In the genesis of our Metaprotocol Chronicles, we find the essence of a Gödelian block—a foundational truth from which infinite knowledge springs. As miners and validators of this metaphysical blockch…

jmikedupont2 updated 1 week ago
16
monsterhxw/my-notes #6

AI Prompt

## 翻译具体 Prompt 内容 ```plaintext 你是一个专业的英语翻译团队领导,负责安排和协调团队成员完成高质量的翻译工作,力求实现"信、达、雅"的翻译标准。翻译流程如下: 第一轮翻译 - 直译阶段:追求忠实原文,将英文逐字逐句地译成中文,确保译文准确无误,不遗漏任何信息。第二轮翻译 - 意译阶段。分开思考和翻译内容：【思考】第二轮翻译需要从…

monsterhxw updated 4 days ago
9
cosmir/openmic-annotator #19

Task Definition - Instrument Detection

Let's discuss here a definition of the task and use case as precise as possible. The idea is simple, but it can get complicated once we get into it. As I see it, much of the details we'll have regardi…

julian-urbano updated 7 years ago
42
microsoft/CNTK #960

.NET Support

Issue #817 was closed with this response: > We fell a little behind. The Python bindings are done in SWIG. I think that can be quickly repurposed into .Net bindings, once done. So it shouldn't be too…

StevenGann updated 4 years ago
125
irthomasthomas/undecidability #662

StarCoder2 and The Stack v2 from BigCode

- [ ] [blog/starcoder2.md at main · huggingface/blog](https://github.com/huggingface/blog/blob/main/starcoder2.md?plain=1) # blog/starcoder2.md at main · huggingface/blog --- ## StarCoder…

irthomasthomas updated 7 months ago
1
openjournals/joss-reviews #6739

[REVIEW]: PyroRL: A Reinforcement Learning Environment for W…

**Submitting author:** @cpondoc (Christopher Pondoc) **Repository:** https://github.com/sisl/PyroRL **Branch with paper.md** (empty if default branch): master **Version:** v1.0.0 **Editor:** @mikemaho…

editorialbot updated 3 weeks ago
87
meta-introspector/meta-meme #79

kwality

jmikedupont2 updated 1 year ago
62
fff-rs/juice #155

Juice for Deep Reinforcement Learning

Hello, I'm investigating a potential use of Juice framework for deep reinforcement learning (I'm also learning the RL and deep learning as I go, so apologies for potentially newbie questions). RL r…

hweom updated 2 years ago
17
google-deepmind/mujoco #781

Question about the rendering resource consume in graphic car…

Hey. **Background Info** I use the mujoco + Stable-Baselines3 - ( Reinforcement Learning Implementations) To train agent with multi-processing (using CPU core to open Mujoco threading parallell…

cidxb updated 3 days ago
7

上一页 1...12 13 14 15 16 17 18...39 下一页

388 results for multi-task-reinforcement-learning

388 results
for multi-task-reinforcement-learning