Evaluator NeMo Platform SDK Resources
The nemo_evaluator_sdk package provides context-agnostic objects for defining metrics, datasets, evaluation configuration, and result handling.
When you want to execute those evaluations through the NeMo Platform Evaluator plugin, use the Evaluator SDK resource mounted on the nemo_platform SDK.
This page explains the NeMo Platform-specific objects used to run local plugin jobs, submit durable platform jobs, and retrieve evaluator job results.
Evaluator
The Evaluator resource is the sync SDK object for working with the Evaluator plugin on NeMo Platform.
It is accessed directly from a NeMoPlatform instance:
The primary execution methods are run and submit.
Use run when you want a local in-process plugin execution that returns a completed EvaluationResult.
Use submit when you want to create a durable remote platform job and manage the job lifecycle separately.
The dataset argument accepts inline rows, local dataset paths, local glob paths, and fileset references with optional fragment selectors. Use config for evaluator runtime settings. On calls that return results, use aggregate_fields to shape the aggregate scores. Pass target together with prompt_template when the evaluator should generate model or agent responses before scoring.
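The difference between run and submit can be illustrated with a small stand-in. This is a minimal sketch, not the real SDK: StubEvaluator, StubJob, the simplified EvaluationResult dataclass, and the "parallelism" config key are all hypothetical, and only the method names run and submit and the dataset and config arguments come from this page.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class EvaluationResult:
    """Hypothetical, simplified stand-in for the SDK's EvaluationResult."""
    row_count: int
    config: dict[str, Any] = field(default_factory=dict)


class StubJob:
    """Hypothetical stand-in for the job handle returned by submit."""

    def __init__(self, name: str, rows: list[dict[str, Any]]) -> None:
        self.name = name
        self.rows = rows


class StubEvaluator:
    """Hypothetical stand-in for Evaluator; not the real SDK class."""

    def __init__(self) -> None:
        self.jobs: dict[str, StubJob] = {}

    def run(self, dataset, config=None) -> EvaluationResult:
        # run: executes in-process and returns a finished result.
        return EvaluationResult(row_count=len(dataset), config=dict(config or {}))

    def submit(self, dataset, config=None) -> StubJob:
        # submit: only registers a job and returns its handle; no result yet.
        job = StubJob(name=f"job-{len(self.jobs) + 1}", rows=list(dataset))
        self.jobs[job.name] = job
        return job


# Inline rows are one of the dataset forms described above.
rows = [{"input": "2+2", "expected": "4"}, {"input": "3+3", "expected": "6"}]

evaluator = StubEvaluator()
result = evaluator.run(dataset=rows, config={"parallelism": 2})  # hypothetical config key
job = evaluator.submit(dataset=rows)

print(type(result).__name__, result.row_count)
print(type(job).__name__, job.name)
```

In the stand-in, run hands back a completed result object directly, while submit returns only a handle whose lifecycle is managed in later calls.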
run() arguments
submit() arguments
Run locally
Submit a platform job
AsyncEvaluator
The AsyncEvaluator resource provides the same Evaluator plugin surface for AsyncNeMoPlatform.
Async methods must be awaited:
AsyncEvaluator.run() and AsyncEvaluator.submit() accept the same arguments as the sync methods above.
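As a rough illustration of the awaiting requirement, the sketch below uses a hypothetical StubAsyncEvaluator in place of the real AsyncEvaluator. The return values (a plain dict and a job name string) are placeholders chosen for the example; only the method names run and submit come from this page.

```python
import asyncio
from typing import Any


class StubAsyncEvaluator:
    """Hypothetical stand-in for AsyncEvaluator; not the real SDK class."""

    async def run(self, dataset: list[dict[str, Any]]) -> dict[str, Any]:
        await asyncio.sleep(0)  # yield to the event loop, as real I/O would
        return {"rows": len(dataset)}

    async def submit(self, dataset: list[dict[str, Any]]) -> str:
        await asyncio.sleep(0)
        return "job-1"  # placeholder job name


async def main() -> tuple[bool, dict[str, Any], str]:
    evaluator = StubAsyncEvaluator()
    rows = [{"input": "2+2"}, {"input": "3+3"}]

    # Calling an async method without await only creates a coroutine object.
    pending = evaluator.run(dataset=rows)
    was_coroutine = asyncio.iscoroutine(pending)

    # Awaiting the coroutine actually executes it and yields the return value.
    result = await pending
    job_name = await evaluator.submit(dataset=rows)
    return was_coroutine, result, job_name


was_coroutine, result, job_name = asyncio.run(main())
print(was_coroutine, result, job_name)
```

Forgetting the await leaves an unexecuted coroutine rather than a result, which is the most common mistake when moving from the sync resource to the async one.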
EvaluatorJobResource
The EvaluatorJobResource is the sync job handle returned by Evaluator.submit.
You can also reconnect to an existing job with Evaluator.get_job_resource.
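A common pattern with a durable job is to keep only its identifier, reconnect later, and poll until the job finishes. The following is a sketch under stated assumptions: StubEvaluator and StubJob are hypothetical stand-ins, the status() and result() method names, the "running" and "completed" status strings, and the name parameter of get_job_resource are assumed for illustration, and wait_for_result is a helper written for this example rather than part of the SDK.

```python
import time
from typing import Any


class StubJob:
    """Hypothetical job handle that reports 'running' for a few polls."""

    def __init__(self, name: str, rows: int, polls_until_done: int = 2) -> None:
        self.name = name
        self._rows = rows
        self._remaining = polls_until_done

    def status(self) -> str:  # assumed method name
        if self._remaining > 0:
            self._remaining -= 1
            return "running"
        return "completed"

    def result(self) -> dict[str, Any]:  # assumed method name
        return {"job": self.name, "rows": self._rows}


class StubEvaluator:
    """Hypothetical stand-in for Evaluator; not the real SDK class."""

    def __init__(self) -> None:
        self._jobs: dict[str, StubJob] = {}

    def submit(self, dataset: list[dict[str, Any]]) -> StubJob:
        job = StubJob(name=f"job-{len(self._jobs) + 1}", rows=len(dataset))
        self._jobs[job.name] = job
        return job

    def get_job_resource(self, name: str) -> StubJob:
        # Reconnect by name; the real method's parameters are an assumption here.
        return self._jobs[name]


def wait_for_result(job, poll_interval: float = 0.01, timeout: float = 1.0):
    """Poll the job's status until it completes or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while job.status() != "completed":
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job.name} did not complete in {timeout}s")
        time.sleep(poll_interval)
    return job.result()


evaluator = StubEvaluator()
job_name = evaluator.submit(dataset=[{"input": "hi"}]).name  # keep only the name

job = evaluator.get_job_resource(job_name)  # reconnect later using the name
final = wait_for_result(job)
print(final)
```

Bounding the polling loop with a deadline keeps a stuck job from blocking the caller indefinitely.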
Some of the most useful methods and properties are described below.
AsyncEvaluatorJobResource
The AsyncEvaluatorJobResource is the async job handle returned by AsyncEvaluator.submit.
It mirrors EvaluatorJobResource, but its status and result methods must be awaited.