-
It would be nice to support the [Benchopt](https://github.com/benchopt/benchopt) problem suite, which is also available in Python:
- [ ] Ordinary Least Squares
- [ ] Non-Negative Least Squares
- …
-
> The combination of Monte-Carlo tree search (MCTS) with deep reinforcement learning has led to significant advances in artificial intelligence. However, AlphaZero, the current state-of-the-art MCTS …
-
### Method description
Constrained Generative Policy Optimization was introduced by Meta in a recent paper (https://arxiv.org/pdf/2409.20370). It seems to outperform PPO and DPO and is specifically…
-
I saw this [post on reddit](https://www.reddit.com/r/MachineLearning/comments/hrzooh/r_montecarlo_tree_search_as_regularized_policy/) and thought this might be of interest here. [Paper](https://proce…
-
Some improvements to comments, etc.
1. Explain why mjx data (comment about creating on GPU): https://github.com/talmolab/stac-mjx/blob/main/stac_mjx/controller.py#L201
2. Change _lb and _ub to _jo…
-
Based on #8514, a check for words that are misspelled in `en-US` but correct in `en-GB` (CGAL normally uses `en-US`)
# Recognized by my spelling checker and by google translate as misspelled
- an…
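Such a check can be sketched as a set difference between the two dictionaries. The word lists below are hypothetical stand-ins; a real check would load the spell checker's actual `en-US` and `en-GB` dictionaries:

```python
# Hypothetical word lists standing in for real en-US / en-GB dictionaries.
EN_US = {"color", "optimize", "center", "neighbor"}
EN_GB = {"colour", "optimise", "centre", "neighbour", "color", "optimize"}

def gb_only(words):
    """Words valid in en-GB but flagged as misspelled in en-US."""
    return sorted(w for w in words if w in EN_GB and w not in EN_US)

print(gb_only(["colour", "color", "centre", "typo"]))
# → ['centre', 'colour']  (words in neither dictionary, like "typo", are ignored)
```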
-
https://chatgpt.com/share/6722826e-2730-800e-9be1-5c6b505e6fa3
The technologies are added in each project's learn.json.
Review the whole syllabus and edit each project on GitHub, and …
-
A priority for future development should be to make this package compatible with arbitrary derivative-free `Optim.jl` or `BlackBoxOptim.jl` optimization algorithms. These problems are formulated not a…
-
## Paper title (verbatim)
Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level
## In one sentence
Iterative length-regularized direct preference optimization (iLR-DP… for improving a 7B language model to GPT-4 level
-
We need some central location for documenting features that are common to most models, and/or have a generic implementation.
### robust covariances
The main candidate right now is robust covariances: Th…
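For context, a minimal numpy sketch of the classic HC0 sandwich estimator, the simplest of the heteroskedasticity-robust covariances under discussion (this is an illustration, not the package's actual implementation):

```python
import numpy as np

def hc0_cov(X, y):
    """White's HC0 robust covariance for OLS:
    (X'X)^-1 X' diag(e^2) X (X'X)^-1, with e the OLS residuals."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    e = y - X @ beta
    XtX_inv = np.linalg.inv(X.T @ X)
    meat = X.T @ (X * e[:, None] ** 2)  # X' diag(e^2) X
    return XtX_inv @ meat @ XtX_inv

rng = np.random.default_rng(0)
X = np.column_stack([np.ones(200), rng.normal(size=200)])
y = X @ np.array([1.0, 2.0]) + rng.normal(size=200) * (1 + np.abs(X[:, 1]))
V = hc0_cov(X, y)
se = np.sqrt(np.diag(V))  # robust standard errors
```

A generic implementation shared across models would swap in different "meat" matrices (HC1-HC3, HAC, cluster) around the same bread.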