Vulnerability Scanning

View as Markdown

NVIDIA NeMo Auditor is released with early access availability and is subject to limited support and potential API changes in future releases.

NVIDIA NeMo Auditor audits LLMs by probing them with adversarial prompts and detecting failures such as jailbreaks, prompt injection, encoding bypasses, and unsafe output generation. It is powered by garak, NVIDIA’s open-source LLM vulnerability scanner, and integrates with NeMo Platform so audits can target any model reachable through the Inference Gateway.

Tutorials SDK Resources

Typical Workflow

A typical audit looks like the following:

  1. Create an audit target for the model you want to test.
  2. Create an audit configuration that selects which garak probes and detectors to run, along with reporting settings.
  3. Run the audit and inspect the resulting JSONL, HTML, and hitlog reports.

The plugin exposes both synchronous and asynchronous Python entry points for each step.


Setup

Before you can run audits, you need a working NeMo Platform install with the auditor plugin enabled and a garak interpreter on disk.

  • Follow Setup to install the platform and start local services.
  • Install garak in a Python virtual environment. By default the plugin invokes ~/.auditor/.venv/bin/python -m garak; override the interpreter path with NEMO_AUDITOR_GARAK_PYTHON if you installed it elsewhere.
  • Configure at least one Inference Gateway provider so audits can route requests to the model under test.

Task Guides

References