Tutorials
These tutorials cover the two user-facing surfaces of the Anonymizer plugin: the streaming preview workflow for iteration, and the run job for full datasets.
Library vs. Service
Anonymizer separates configuration (what to detect and how to replace it) from execution (where the work runs and how models are reached).
Part 1: Build the config (library)
Use anonymizer.config to define the rewrite or replacement strategy and detection options. This code is identical whether you run Anonymizer standalone or through the NeMo Platform service.
Part 2: Execute (platform)
Submit the config to the Anonymizer service. The plugin owns the request shape (PreviewRequest, AnonymizerRequest) so it can also describe the input source and model routing:
Service-Specific Considerations
When using Anonymizer as a NeMo Platform service:
Prerequisites
Complete Setup to install NeMo Platform, run nemo services run, and configure an inference provider. The root workspace includes the Anonymizer plugin, so nemo services run discovers it automatically and mounts /apis/anonymizer/... on the gateway — no separate plugin install step is needed. Verify the CLI is registered:
You should see validate, preview, and run command groups.
These tutorials route inference through an Inference Gateway provider, so a NeMo Platform cluster must be running before you preview or run a job. The examples reference the default NVIDIA Build provider created during setup.
nemo setup pre-configures a default/nvidia-build model provider during local startup.
This provider routes inference requests to models hosted on build.nvidia.com using the API base URL https://integrate.api.nvidia.com
and the NGC API key with Public API Endpoints permissions provided during deployment.
You can verify this provider exists by running nemo inference providers list --workspace default.
The tutorials in these docs use this provider for inference, but you can alternatively create your own and use it instead.
Upload an Input Fileset
sdk.anonymizer.preview, preview submit, and run submit reject local file paths, so the tutorials read from a fileset. Create a small CSV containing PII and upload it to a fileset named anonymizer-inputs:
The tutorials reference this file with fileset://{WORKSPACE}/anonymizer-inputs#anonymizer-input.csv.
Tutorials
Stream a small anonymized sample to iterate on AnonymizerConfig and model_configs. Covers sdk.anonymizer.preview, nemo anonymizer preview run / preview submit, and the NDJSON frame stream.
Run the full pipeline locally with nemo anonymizer run run or submit it to the Jobs worker with nemo anonymizer run submit. Load dataset.parquet, trace.parquet, and failed_records.json artifacts.