Introduction to benchmarking

How does benchmarking with Narev work?

There are three steps to getting your benchmark up and running.

Create a dataset and define a quality metric

This dataset contains all prompts used in the benchmark. Here, you also define how to evaluate responses.

Add variants to your benchmark

Variants are a combination of:

You can add as many variants as needed to determine the most cost efficient model.

Run the benchmark and interpret the results

Optional: publish benchmark to Hub

Narev Hub is a community of public benchmarks created by users.

⌘I