[Marked for deprecation. Please visit https://github.com/brain-score/language for the migrated project.] Benchmarking of language models using human neural and behavioral experiment data
We might also want to shift from the "ceiling" terminology to "group-level cross-correlation norm", "across-participants score normalization", or something along those lines, since "ceiling" is an inaccurate term.