Configuration ⚙️
Eval Set Configuration ⛭
The eval set configuration file allows you to set:
- name: display name
- max_concurrent_judge_tasks: how many judge tasks to run concurrently
- max_concurrent_llm_tasks: how many LLM tasks to run concurrently
- eval_metrics: list of metrics to evaluate. To learn about the available metrics, run simpleval metrics-explorer
- llm_as_a_judge_name: the model to use for the LLM as a judge. Run simpleval list-models to see available judges.
- llm_as_a_judge_model_id (optional): the model id to use for the llm_as_a_judge_name provider. Each judge comes with a default model id, but you can set your own. Run simpleval litellm-models-explorer to see available models for Lite LLM judges.
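To make the settings concrete, here is a minimal Python sketch that assembles these keys into a dictionary and prints it as JSON. The key names come from the list above; the eval set name, the concurrency values, the metric names, and the judge name are placeholder assumptions, so check simpleval metrics-explorer and simpleval list-models for the real options.

```python
import json

# Key names are taken from the settings list; every value is a placeholder.
eval_config = {
    "name": "my_eval_set",                         # hypothetical display name
    "max_concurrent_judge_tasks": 10,              # assumed value
    "max_concurrent_llm_tasks": 10,                # assumed value
    "eval_metrics": ["correctness", "relevancy"],  # assumed metric names
    "llm_as_a_judge_name": "open_ai",              # assumed judge name
    # "llm_as_a_judge_model_id" is optional and omitted here,
    # so the judge's default model id would apply.
}

# Serialize the dictionary to the JSON text you would put in the config file.
config_text = json.dumps(eval_config, indent=2)
print(config_text)
```

Leaving out llm_as_a_judge_model_id relies on the default model id that each judge ships with.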
Override Eval Set Configuration ⛭
If you want to override the configuration at the testcase level, you can do this for max_concurrent_judge_tasks and/or max_concurrent_llm_tasks by setting the override config element with the name of the testcase you want to override.
For example:
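A rough sketch of the idea in Python, assuming the override element maps a testcase name to the settings it overrides. That layout, the testcase name slow_testcase, the helper effective_setting, and all values are illustrative assumptions, not the tool's confirmed schema.

```python
# Assumed layout: "override" maps a testcase name to the settings it overrides.
config = {
    "max_concurrent_judge_tasks": 10,
    "max_concurrent_llm_tasks": 10,
    "override": {
        "slow_testcase": {                  # hypothetical testcase name
            "max_concurrent_llm_tasks": 2,  # only this setting is overridden
        },
    },
}


def effective_setting(config, testcase, key):
    """Return the testcase-level override for `key` if present, else the eval set value."""
    return config.get("override", {}).get(testcase, {}).get(key, config[key])


if __name__ == "__main__":
    for testcase in ("slow_testcase", "other_testcase"):
        print(testcase, effective_setting(config, testcase, "max_concurrent_llm_tasks"))
```

In this sketch, a testcase without an entry under override simply falls back to the eval set values.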
Global Configuration ⛭
The global configuration file allows you to override certain global settings.
Currently it supports setting the retry behavior for different models.
Create a global_config.json in your working directory and populate it as you wish. You should see an indication in the terminal that the global configuration file was loaded.
Retries Configuration ⛭
Warning
This is an experimental feature and is not yet fully supported.
You can override the behavior of the retry mechanism for a family of models.
This is useful if you keep hitting rate limits.
Supported models:
✅ bedrock_claude_sonnet (will work for any Bedrock model)
Retries use tenacity's wait_random_exponential function.
To learn more about the available options, see the function documentation.
With the default values, retrying stops after 5 attempts. The wait starts from an initial window of 2 seconds and grows exponentially by a factor of 2, with a minimum of 10 seconds and a maximum of 30 seconds.
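To see how these numbers interact, the sketch below computes the wait window for each attempt in plain Python. It is modeled on how tenacity's wait_random_exponential is generally understood to behave (a random wait between the minimum and an exponentially growing, capped upper bound). The function and parameter names here are illustrative assumptions, not simpleval's or tenacity's API.

```python
import random


def wait_upper_bound(attempt, initial=2.0, factor=2.0, min_wait=10.0, max_wait=30.0):
    """Upper bound (seconds) of the wait window after attempt number `attempt` (1-based)."""
    raw = initial * factor ** (attempt - 1)   # 2, 4, 8, 16, 32, ...
    return max(min_wait, min(raw, max_wait))  # clamp into [min_wait, max_wait]


def sample_wait(attempt, rng=random, min_wait=10.0, **kwargs):
    """Pick a random wait between the minimum and the current upper bound."""
    high = wait_upper_bound(attempt, min_wait=min_wait, **kwargs)
    return rng.uniform(min_wait, high)


if __name__ == "__main__":
    for attempt in range(1, 6):
        print(attempt, wait_upper_bound(attempt), round(sample_wait(attempt), 2))
```

Under these assumptions the upper bound stays at the 10 second minimum for the first three attempts, rises to 16 seconds on the fourth, and would be capped at 30 seconds from the fifth onward.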
Update the values and paste into your global config file.
The global config schema is implemented in: simpleval/global_config/retries.py