Platforms

Section 51.1

This section catalogs the third-party platforms that production LLM teams use to enforce safety, security, and governance policies. The landscape divides cleanly into three roles: moderation APIs (real-time classifiers that pre-screen user inputs and model outputs for harmful content), red-team platforms (tools that systematically probe a deployed LLM for jailbreaks and vulnerabilities), and compliance / governance services (SaaS that produces audit-ready evidence for the EU AI Act, NIST AI Risk Management Framework, and ISO/IEC 42001). For each role we name the dominant 2026 vendors, their pricing tier, and the deployment context where each one wins.

The Part X safety-platform landscape divides cleanly into three roles.
Figure 51.1.1: The Part X safety-platform landscape divides cleanly into three roles. Moderation APIs (OpenAI Moderation, Azure Content Safety) sit on the request path. Red-team platforms (Garak, PyRIT, Haize, Giskard) run offline probes. Compliance services (Credo AI, Fairly AI, ModelBench) translate eval evidence into audit reports for the EU AI Act and NIST AI RMF.

51.1.1 Moderation APIs

51.1.2 Red-team and adversarial-testing platforms

51.1.3 Compliance and governance

51.1.4 Comparing the platforms

Table 51.1.1a: Safety platforms (2026).
Platform Role Cost Best for
OpenAI Moderation Pre-classify input/output Free Quick safety net
Azure Content Safety Multi-modal classification Per-call Enterprise multi-modal
Garak Open red-team scanner Free CI vulnerability scans
PyRIT Open red-team automation Free Structured red-team campaigns
Credo AI Governance Enterprise SaaS EU AI Act compliance

What's Next?

In the next section, Section 51.2: Libraries & Frameworks, we build on the material covered here.

Further Reading

Security Platforms

Microsoft (2024). "Microsoft Security Copilot." microsoft.com/en-us/security/business/ai-machine-learning/microsoft-security-copilot. Reference enterprise LLM security platform.
Cloudflare (2024). "Cloudflare Workers AI." developers.cloudflare.com/workers-ai. Reference platform for LLM-edge security including prompt-injection scanning.