{
"object": "eval",
"id": "<string>",
"name": "<string>",
"data_source_config": {
"type": "custom"
},
"testing_criteria": null,
"created_at": 123,
"metadata": {}
}
Create the structure of an evaluation that can be used to test a model’s performance.
An evaluation is a set of testing criteria and a data source. After creating an evaluation, you can run it on different models and model parameters. Several types of graders and data sources are supported.
For more information, see the Evals guide.
NOTE: This Azure OpenAI API is in preview and subject to change.
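As a rough illustration, the pieces of a create request could be assembled as in the sketch below. It is not a reference client. The helper name `build_create_eval_request`, the `api-version` query parameter name, and the inner shapes of the `item_schema` and grader dictionaries are assumptions that this page does not confirm. Only the `aoai-evals: preview` header, the `v1` and `preview` version values, and the top-level body fields (`name`, `data_source_config`, `testing_criteria`, `metadata`) come from this page. Nothing is sent over the network, because no request URL is given here.

```python
import json


def build_create_eval_request(name, data_source_config, testing_criteria,
                              metadata=None, api_version="v1"):
    """Return (headers, params, body) for a create-eval call (hypothetical helper)."""
    if api_version not in ("v1", "preview"):
        raise ValueError("api_version must be 'v1' or 'preview'")
    headers = {
        "Content-Type": "application/json",
        "aoai-evals": "preview",  # required to enable the preview feature
    }
    # Assumption: the API version is passed as an `api-version` query parameter.
    params = {"api-version": api_version}
    body = {
        "name": name,
        "data_source_config": data_source_config,
        "testing_criteria": testing_criteria,
        "metadata": metadata or {},
    }
    return headers, params, body


headers, params, body = build_create_eval_request(
    name="support-bot-accuracy",
    data_source_config={
        "type": "custom",
        # Assumption: the schema of each data item is described under `item_schema`.
        "item_schema": {
            "type": "object",
            "properties": {"question": {"type": "string"},
                           "expected": {"type": "string"}},
        },
    },
    testing_criteria=[{
        # Assumption: illustrative grader shape, not confirmed by this page.
        "type": "string_check",
        "name": "exact match",
        "input": "{{sample.output_text}}",
        "reference": "{{item.expected}}",
        "operation": "eq",
    }],
    metadata={"team": "support"},
)
print(json.dumps(body, indent=2))
```

Keeping headers, query parameters, and body separate makes it straightforward to hand them to whichever HTTP client is in use once the resource URL is known.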
Enables access to AOAI Evals, a preview feature. This feature requires the 'aoai-evals' header to be set to 'preview'.
Allowed value: preview
The explicit Azure AI Foundry Models API version to use for this request. Defaults to v1 if not otherwise specified.
Allowed values: v1, preview
Status code: 201
The configuration for the data source used for the evaluation runs. Dictates the schema of the data used in the evaluation.
A list of graders for all eval runs in this group. Graders can reference variables in the data source using double curly brace notation, such as {{item.variable_name}}. To reference the model's output, use the sample namespace (i.e., {{sample.output_text}}).
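To make the double curly brace notation concrete, here is a minimal sketch of how such placeholders could be resolved against an `item` and a `sample` namespace. It only illustrates the notation and is not the service's actual templating logic. The function name `render_template` and the example field names `question` and `expected` are hypothetical.

```python
import re

# Matches {{namespace.path}} with optional whitespace inside the braces.
_PLACEHOLDER = re.compile(r"\{\{\s*([A-Za-z_][\w.]*)\s*\}\}")


def render_template(template, context):
    """Replace {{namespace.path}} placeholders with values from nested dicts."""
    def lookup(match):
        value = context
        for key in match.group(1).split("."):
            value = value[key]  # raises KeyError if the variable is missing
        return str(value)
    return _PLACEHOLDER.sub(lookup, template)


context = {
    "item": {"question": "2 + 2?", "expected": "4"},  # one row of the data source
    "sample": {"output_text": "4"},                   # the model's output
}
print(render_template("Q: {{item.question}} A: {{sample.output_text}}", context))
# prints: Q: 2 + 2? A: 4
```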
The name of the evaluation.
Set of up to 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and for querying objects via the API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
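The metadata limits above can be checked on the client before sending a request. The sketch below is one possible way to do that, treating 16 as an upper bound on the number of pairs, which is an interpretation of the text. The helper name `validate_metadata` is hypothetical, and the service may enforce these limits differently.

```python
def validate_metadata(metadata):
    """Return a list of problems; an empty list means the metadata fits the limits."""
    problems = []
    if len(metadata) > 16:
        problems.append(f"too many pairs: {len(metadata)} > 16")
    for key, value in metadata.items():
        if not isinstance(key, str) or not isinstance(value, str):
            problems.append(f"{key!r}: keys and values must be strings")
            continue
        if len(key) > 64:
            problems.append(f"key starting {key[:10]!r} exceeds 64 characters")
        if len(value) > 512:
            problems.append(f"value for {key!r} exceeds 512 characters")
    return problems


print(validate_metadata({"team": "support"}))  # prints: []
print(validate_metadata({"k": "x" * 513}))
# prints: ["value for 'k' exceeds 512 characters"]
```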
The request has succeeded.
An Eval object with a data source config and testing criteria. An Eval represents a task to be done for your LLM integration.
The object type.
Allowed value: eval
Unique identifier for the evaluation.
The name of the evaluation.
Configuration of data sources used in runs of the evaluation.
A list of testing criteria.
The Unix timestamp (in seconds) for when the eval was created.
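Since the creation time is a Unix timestamp in seconds, it can be converted to a readable UTC time with the standard library. The sketch below uses the value 123 from the example response. The helper name `created_at_to_datetime` is hypothetical.

```python
from datetime import datetime, timezone


def created_at_to_datetime(created_at):
    """Convert a Unix timestamp in seconds to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(created_at, tz=timezone.utc)


print(created_at_to_datetime(123).isoformat())
# prints: 1970-01-01T00:02:03+00:00
```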
Set of up to 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and for querying objects via the API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.