-
Run an experiment to evaluate the performance of a simulated annealing gradient descent (SA-GD) approach compared to traditional gradient descent (GD). The purpose of this experiment is to understand …
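Since the description is cut off, here is a minimal sketch of what such a comparison could look like, assuming the SA-GD variant injects annealed Gaussian noise into each update (the experiment's actual variant may differ; `gd`, `sa_gd`, and the toy objective are illustrative names):

```python
import numpy as np

def gd(grad, x0, lr=0.05, steps=500):
    """Plain gradient descent baseline."""
    x = x0.copy()
    for _ in range(steps):
        x -= lr * grad(x)
    return x

def sa_gd(grad, x0, lr=0.05, steps=500, t0=1.0, seed=0):
    """One common SA-GD variant: gradient steps plus Gaussian noise whose
    temperature decays geometrically, so the search is exploratory early
    and effectively greedy late."""
    rng = np.random.default_rng(seed)
    x = x0.copy()
    for k in range(steps):
        temp = t0 * 0.99 ** k  # annealing schedule
        x = x - lr * grad(x) + np.sqrt(2 * lr * temp) * rng.standard_normal(x.shape)
    return x

# Toy multimodal objective: plain GD from x0 = 2.0 settles into the local
# minimum near x ~ 1.46, while the annealed noise lets SA-GD escape toward
# the deeper minimum near x ~ -0.48 (stochastic, so not guaranteed).
f = lambda x: x**2 + 3 * np.sin(3 * x)
grad_f = lambda x: 2 * x + 9 * np.cos(3 * x)

x0 = np.array([2.0])
xg, xs = gd(grad_f, x0), sa_gd(grad_f, x0)
print(f"GD    -> x={xg[0]:+.3f}, f={f(xg)[0]:+.3f}")
print(f"SA-GD -> x={xs[0]:+.3f}, f={f(xs)[0]:+.3f}")
```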
-
### Description
The project aims to develop a reinforcement learning (RL) agent to optimize waste collection in a simulated environment, minimizing overflow events and improving efficiency.
Environment and State R…
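The rest of the environment description is truncated; as a stand-in, here is a toy sketch of what such a simulation could look like (all names, rates, and rewards below are invented for illustration):

```python
import numpy as np

class WasteCollectionEnv:
    """Toy stand-in for the simulated environment: n_bins bins fill
    stochastically each step; the agent picks one bin to empty.
    Reward favors emptying fuller bins and penalizes overflow events."""

    def __init__(self, n_bins=5, fill_rate=0.15, seed=0):
        self.n_bins, self.fill_rate = n_bins, fill_rate
        self.rng = np.random.default_rng(seed)
        self.reset()

    def reset(self):
        self.levels = self.rng.uniform(0.0, 0.5, self.n_bins)  # state: fill levels in [0, 1]
        return self.levels.copy()

    def step(self, action):
        reward = self.levels[action]            # emptying a fuller bin is worth more
        self.levels[action] = 0.0
        self.levels += self.rng.uniform(0.0, 2 * self.fill_rate, self.n_bins)
        overflows = self.levels > 1.0
        reward -= 5.0 * overflows.sum()         # heavy penalty per overflow event
        self.levels = np.clip(self.levels, 0.0, 1.0)
        return self.levels.copy(), reward

# Greedy baseline: always service the fullest bin.
env = WasteCollectionEnv()
state, total = env.reset(), 0.0
for _ in range(200):
    state, r = env.step(int(np.argmax(state)))
    total += r
print("greedy baseline return:", round(total, 2))
```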
-
The current strategy assigns exploitation and exploration weights to clusters in the following manner:
![image](https://user-images.githubusercontent.com/7997790/50731117-92257780-1122-11e9-940c-51…
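The embedded image is truncated, so as a generic illustration only: one common way to assign such weights is a UCB-style score, with the observed mean reward as the exploitation term and a visit-count bonus as the exploration term, normalized by a softmax. The actual scheme in the screenshot may differ:

```python
import numpy as np

def cluster_weights(mean_reward, pulls, c=1.0):
    """UCB-style cluster weights: exploitation term (observed mean reward)
    plus an exploration bonus that shrinks as a cluster is sampled more;
    a softmax turns the scores into sampling weights."""
    bonus = c * np.sqrt(np.log(pulls.sum() + 1) / (pulls + 1e-9))
    score = mean_reward + bonus
    w = np.exp(score - score.max())
    return w / w.sum()

mean_reward = np.array([0.9, 0.4, 0.6])
pulls = np.array([50, 5, 20])
print(cluster_weights(mean_reward, pulls))  # the rarely-sampled cluster gets boosted
```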
-
It seems that only the optimizer parameters of the sampled alive Gaussians are reset, while those of the dead Gaussians remain unchanged. May I ask the reason for this?
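For context on the mechanics being asked about: with a PyTorch Adam optimizer over per-Gaussian parameters, resetting the optimizer state for only a subset of rows looks like the sketch below (illustrative only, not the repository's actual code):

```python
import torch

# Reset Adam's running moments for a masked subset of rows
# (e.g. the sampled alive Gaussians), leaving the rest untouched.
means = torch.nn.Parameter(torch.randn(100, 3))
opt = torch.optim.Adam([means], lr=1e-3)

loss = (means ** 2).sum()
loss.backward()
opt.step()                           # populates opt.state[means]

sampled = torch.zeros(100, dtype=torch.bool)
sampled[:10] = True                  # pretend these Gaussians were resampled

state = opt.state[means]
state["exp_avg"][sampled] = 0.0      # first-moment reset
state["exp_avg_sq"][sampled] = 0.0   # second-moment reset
# Rows where `sampled` is False keep their momentum history.
```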
-
Here are ways that I see mlrMBO currently offering control over exploration vs exploitation for single-objective tuning:
- The infill criterion offers a discrete set of choices, each of which impli…
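To make the trade-off concrete outside of mlrMBO's R API: a confidence-bound infill criterion (mlrMBO's CB criterion exposes a comparable lambda knob) turns a single scalar into an exploration dial. The surrogate predictions below are made up:

```python
import numpy as np

def lower_confidence_bound(mean, sd, lam):
    """Generic confidence-bound infill criterion for minimization:
    small lam -> exploit the predicted mean,
    large lam -> chase model uncertainty (explore)."""
    return mean - lam * sd

# Surrogate mean/sd at five candidate points (invented numbers):
mean = np.array([0.20, 0.35, 0.30, 0.50, 0.25])
sd   = np.array([0.01, 0.30, 0.05, 0.40, 0.02])

for lam in (0.0, 1.0, 3.0):
    best = int(np.argmin(lower_confidence_bound(mean, sd, lam)))
    print(f"lambda={lam}: propose candidate {best}")
```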
-
- Exploitation is the right thing to do to maximize the expected reward on the one step, but exploration may produce the greater total reward in the long run.
- Reward is lower in the short run, dur…
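Both points are easy to reproduce with a small ε-greedy bandit (a sketch with invented arm means; see the multi-armed bandit link below): pure exploitation often locks onto a suboptimal arm, while a little exploration costs reward per step but wins on total reward:

```python
import numpy as np

def run_bandit(eps, true_means, steps=2000, seed=0):
    """epsilon-greedy on a Gaussian bandit: with probability eps pull a
    random arm (explore), otherwise pull the best estimate (exploit)."""
    rng = np.random.default_rng(seed)
    k = len(true_means)
    est, pulls, total = np.zeros(k), np.zeros(k), 0.0
    for _ in range(steps):
        a = rng.integers(k) if rng.random() < eps else int(np.argmax(est))
        r = rng.normal(true_means[a], 1.0)
        pulls[a] += 1
        est[a] += (r - est[a]) / pulls[a]   # incremental mean update
        total += r
    return total / steps

arms = [0.1, 0.5, 1.0, 0.3]                 # arm 2 is best
for eps in (0.0, 0.1):
    print(f"eps={eps}: average reward {run_bandit(eps, arms):.3f}")
```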
-
https://en.wikipedia.org/wiki/Multi-armed_bandit
-
With the models grouping introduced in {{3.34}}, the {{exploitation_ratio}} doesn’t apply as strictly as it did before.
Pre {{3.34}}, the exploitation ratio was dedicated to tuning of learn-rate on t…
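The note is cut off, but one hypothetical reading of a learn-rate-focused exploitation ratio is a budget split: that fraction of trials perturbs the incumbent best learn-rate locally, and the remainder samples the full range. All names and ranges below are invented for illustration:

```python
import random

def plan_trials(n_trials, exploitation_ratio, best_lr, seed=0):
    """Hypothetical budget split: the first exploitation_ratio fraction of
    trials fine-tunes the incumbent's learn-rate; the rest explore the
    full log-uniform range."""
    rng = random.Random(seed)
    trials = []
    for i in range(n_trials):
        if i < exploitation_ratio * n_trials:
            lr = best_lr * rng.uniform(0.5, 2.0)   # exploit: perturb incumbent
        else:
            lr = 10 ** rng.uniform(-5, -1)         # explore: global sample
        trials.append(round(lr, 6))
    return trials

print(plan_trials(10, 0.7, best_lr=3e-3))          # 7 exploit trials, 3 explore
```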