policy-learning Search Results

1000+ results
for policy-learning

arXivTimes/arXivTimes #1261

Population Based Augmentation: Efficient Learning of Augment…

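## In a nutshell: A study that cuts the compute time of AutoAugment (#764), which automatically searches for the optimal data augmentation, by a factor of 1000. It treats the application of data augmentation as another kind of parameter and uses an evolutionary strategy (PBA) to keep the models/augmentations that produced good results. Because the model parameters are carried over, no recomputation is needed !…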

icoxfog417 updated 5 years ago
1
arXivTimes/arXivTimes #1228

MCP: Learning Composable Hierarchical Control with Multiplic…

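## In a nutshell: Proposes a method that combines multiple base policies to realize a variety of policies. Policies are usually combined additively, but this work combines them multiplicatively, integrating multiple policies into a single action distribution. The authors confirm that this makes complex continuous control tasks feasible. ### Paper link https://arxiv.org/ab…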

icoxfog417 updated 5 years ago
1
unizard/AwesomeArxiv #198

[2018.06.14] AutoAugment: Learning Augmentation Policies fro…

Institute: Google Brain URL: https://arxiv.org/pdf/1805.09501.pdf Keyword: Data Augmentation, AutoML, ReinforceLearning Interest: 5 Code: https://github.com/DeepVoltaire/AutoAugment GoogleBlog: h…

unizard updated 6 years ago
2
ray-project/ray #46311

CI test linux://rllib:learning_tests_multi_agent_cartpole_ap…

CI test **linux://rllib:learning_tests_multi_agent_cartpole_appo_multi_gpu** is consistently_failing. Recent failures: - https://buildkite.com/ray-project/postmerge/builds/5169#01905b51-30e3-4427-98…

can-anyscale updated 2 months ago
9
nus-cs2103-AY2425S1/pe-dev-response #3450

Hard-to-type command words

All command words are rather long and include dashes, which makes it time-consuming for the user to type, given that user will be typing these command words repeatedly and for every single time they u…

nus-pe-bot updated 2 weeks ago
1
OpenGVLab/InternImage #282

detection: trained on my own data (the official balloon dataset, a typical small-sample case) and the results are very poor

optimizer = dict(type='SGD', lr=0.05, momentum=0.9, weight_decay=0.0001) optimizer_config = dict(grad_clip=None) # learning policy lr_config = dict( policy='step', warmup='linear', w…

BoFan-tunning updated 2 months ago
4
PaloAltoNetworks/terraform-provider-prismacloudcompute #80

Unable to create CI Vulnerability rules

## Describe the bug I am using the terraform resource prismacloudcompute_ci_image_vulnerability_policy to provision CI image vulnerability rules; however, it is not working correctly with a loop. ## Expecte…

jhabikal21 updated 1 month ago
1
huggingface/lerobot #504

Porting HIL-SERL

# HIL-SERL in LeRobot --- On porting [HIL-SERL](https://hil-serl.github.io/) to LeRobot. This page will outline the minimal list of components and tasks that should be implemented in the LeRobot c…

michel-aractingi updated 2 weeks ago
2
Azure/bicep-registry-modules #3849

[AVM Module Issue]: associatedKeyVaultResourceId parameter f…

### Check for previous/existing GitHub issues - [x] I have checked for previous/existing GitHub issues ### Issue Type? Bug ### Module Name avm/res/machine-learning-services/workspace ### (Option…

DavidSP-Transparity updated 15 hours ago
4
shufangxun/LLaVA-MoD #5

CUDA OOM issues

Hello, I've been trying to qwen2 0.5B and tinyclip using the repository, but I'm running into CUDA OOM issues on the dense2dense distillation step. I'm running on 4 80GB A100s. I was wondering if I …

pumetu updated 2 weeks ago
3
