Managing projects in Validate

Tonic Validate supports the following types of projects:

  • Development projects

  • Production monitoring projects

About development projects

A Validate development project contains the results of Validate runs.

For a set of questions, which can be from a Validate benchmark, the run assesses the quality of the answers from your RAG system. If you also provide expected answers, then the run compares the answers from your RAG system against the provided answers. It also analyzes how the RAG system used additional context to answer the questions.

Each run generates an overall score and metrics.

For information about starting a run, go to Starting a Validate run.

For information about viewing run results, go to Viewing and managing runs.

For development projects, you can also use our Ragas integration, tonic_ragas_logger, which allows you to display Validate visualizations of Ragas results.

About production monitoring projects

A production monitoring project tracks the performance of a production RAG system.

You configure the RAG system to send to the production monitoring project:

  • The questions the RAG system receives

  • The answers it provided

  • The context it used to determine the answer

Validate then generates metrics for each question, and allows you track the RAG system performance over time. Note that production monitoring does not use Ragas.

For information about configuring your RAG system to send questions to a Validate production monitoring project, go to Configuring your RAG system to send questions to Validate.

For information about viewing the results, go to Viewing the metric scores and logged questions.

Displaying the list of projects

The Validate home page includes the list of projects.

For each project, the list displays:

  • The project name

  • For development projects, when the most recent run occurred

  • For development projects, a chart that maps the average overall score for each run over time

Displaying details for a project

To display the details for a project, click the project tile.

New development project without runs

For a new development project that does not have any runs, the project details page guides you through the required steps to create a run.

If you use tonic_ragas_logger to visualize Ragas results in Validate, then select Ragas as the logging framework.

If you use Validate runs to generate and visualize metrics, then select Validate as the logging framework.

Development project with runs

For a development project that has completed runs, the project details page displays the list of runs, and provides an overview of the scores across the runs and questions.

At the left is the list of runs for the project. From there, you can display details for the run results.

The tiles across the top contain the average overall score and average metrics scores for the most recent run.

By default, the graph displays the overall score across all of the runs over time. When you click a metric score tile, the graph updates to show the average metric score across the runs.

Below the graph is the list of questions in the project benchmark. For each question, the list shows the overall score for a month ago and for the most recent run.

To filter the question list, in the filter field, type text from the question.

When you click a question, the graph is updated to show the average overall or metric score across runs for that specific question. To deselect the question, click it again.

New production monitoring project

For a new production monitoring project that does not have any results, the project details page guides you through the required steps to set up a feed of questions from the RAG system to the project.

Production monitoring project with results

For a production monitoring project that has received questions, the project details page shows a set of overall scores based on the most recent questions that the project received.

The overall scores are followed by a timeline that shows changes in the average metric scores over a selected timeframe.

Below the timeline is the list of questions with metric scores. When you click a point in the timeline, the questions are filtered to display questions that were received during that time.

Creating a project

For a new project, you provide a name and select the project type.

To create a project:

  1. On the Validate home page, click Create a Project.

  2. In the Name field, type the name of the project, then click Next.

  1. Click the type of project to create, then click Save.

Validate displays the project details page.

Changing the project name

For an existing project, you can change the name.

To edit the project name:

  1. Either:

  2. On the Edit Project panel, in the Project Name field, type the new name for the project.

  1. Click Save.

Deleting a project

To delete a project, from the projects list:

  1. Click the options icon for the project.

  2. In the options menu, click Delete.

Last updated