Manage Metrics
Instantiate the metric class you want to run and pass it with dataset and optional configuration to evaluator.run(...) or evaluator.submit(...).
Initialize the SDK
Create Metric Objects Inline
Metric objects are normal Python objects from nemo_evaluator_sdk.metrics.*. Keep them close to the evaluation code so the definition, dataset fields, and execution request stay in sync.
Use run for fast local execution while developing a metric. Use submit for durable remote execution through the platform job service.
Reuse a Metric Definition
Because metrics are inline objects, reuse is usually just a Python helper function or module-level factory.
Choose Metric Classes
Use the metric-specific pages for configuration details and examples:
Configure Runtime Parameters
Pass execution settings through the config argument.
For online evaluations, provide a model or agent target and use the online parameter classes described in Model Configuration and Agent Configuration.
Submit a Durable Job
Related Topics
- Metric Results - Work with
EvaluationResult, aggregate scores, and row scores - Manage Metric Jobs - Submit, monitor, reconnect to, and download job results
- Similarity Metrics - Configure exact match, F1, BLEU, ROUGE, and string/number checks