Closed lcassano closed 5 years ago
The threshold is set to 0.8 by default instead of 0.9 as the paper indicates "The summary ‘score’ computes the percentage of runs for which the average regret drops below 0.9 faster than the 2^N episodes expected by dithering."
def find_solution(df_in: pd.DataFrame, sweep_vars: Sequence[Text] = None, merge: bool = True, thresh: float = 0.8) -> pd.DataFrame:
The threshold is set to 0.8 by default instead of 0.9 as the paper indicates "The summary ‘score’ computes the percentage of runs for which the average regret drops below 0.9 faster than the 2^N episodes expected by dithering."
def find_solution(df_in: pd.DataFrame, sweep_vars: Sequence[Text] = None, merge: bool = True, thresh: float = 0.8) -> pd.DataFrame: