Fix non-deterministic cache hash for MetricGrouping tasks by nuthalapativarun · Pull Request #1260 · huggingface/lighteval

nuthalapativarun · 2026-06-14T18:50:20Z

`LightevalTaskConfig.str` (used by `SampleCache._get_task_hash` to build the cache key) reprs each metric field directly. For `MetricGrouping` metrics, `corpus_level_fn` and `higher_is_better` are dicts whose values are callables, so `repr(...)` includes the function's memory address (e.g. `<function compute at 0x7f...>`). This makes the resulting cache hash non-deterministic across runs, breaking the cache for tasks like `leaderboard|truthfulqa:mc|0`.

This adds a small helper that recurses into dict values and renders callables by name (matching the existing handling for top-level metric callables), so the hash is stable across processes/runs.

Closes #1023

nuthalapativarun · 2026-06-23T15:43:42Z

Just checking in on this one - no urgency, but wanted to bring it back up in case it slipped through. Let me know if any changes are needed.

Fix non-deterministic cache hash for MetricGrouping tasks

956ee83

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix non-deterministic cache hash for MetricGrouping tasks#1260

Fix non-deterministic cache hash for MetricGrouping tasks#1260
nuthalapativarun wants to merge 1 commit into
huggingface:mainfrom
nuthalapativarun:fix/1023-cache-hash-metric-grouping

nuthalapativarun commented Jun 14, 2026

Uh oh!

nuthalapativarun commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

nuthalapativarun commented Jun 14, 2026

Uh oh!

nuthalapativarun commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant