Responsible AI isn’t just about regulatory requirements. SFR-Guard assist in aligning technology with your company’s values and mission.

From the Salesforce 360 Blog – https://www.salesforce.com/blog/sfr-guard-ensuring-llm-safety-and-integrity-in-crm-applications/

Securing the Future of AI: Salesforce’s SFR-Guard for Safe, Trusted Generative AI

The Critical Need for AI Safety in the Age of Autonomous Agents

As generative AI becomes deeply embedded in business workflows—from CRM interactions to code generation—ensuring these systems operate safely and ethically is paramount. At Salesforce AI Research, we’re pioneering advanced guardrail technologies that protect users while maintaining AI’s transformative potential.

Understanding the Risks: Why LLM Agents Need Protection

Modern AI agents act as autonomous assistants capable of:

  • Executing CRM workflows
  • Generating and modifying code
  • Processing sensitive customer data
  • Automating business communications

Three key threat vectors emerge:

  1. Malicious User Intent – Bad actors attempting spam, data leaks, or system breaches
  2. Harmful LLM Outputs – Even benign requests sometimes generate toxic or biased content
  3. Adversarial Environments – Manipulative data inputs that “trick” agents (e.g., comments suggesting malicious code)

Introducing SFR-Guard: Salesforce’s AI Safety Framework

Our SFR-Guard model family provides enterprise-grade protection specialized for CRM workflows, outperforming alternatives:

ModelParametersFine-Grained DetectionExplanationsSeverity LevelsPublic BenchmarkPrivate CRM Benchmark
SFR-Guard0.05B-8B83.393.0
GPT-4oUnknown78.784.5
LlamaGuard 38B71.371.0

Key Innovations

  1. Multi-Layer Defense
    • Generation 1: Specialized classifiers (BERT, Flan-T5) for toxicity & prompt injection detection
    • Generation 2: Fine-tuned Phi-3-mini LLMs with 128k context windows for holistic analysis
  2. Transparency Features
    • Violation highlighting with inline citations
    • Natural language explanations of moderation decisions
    • Severity scoring (1-5) for appropriate response escalation
  3. CRM-Optimized Training
    • Blends public datasets with de-identified Salesforce usage data
    • Synthetic generation of edge cases
    • Multilingual coverage (EN, FR, DE, ES, IT, JP)

Deep Dive: How SFR-Guard Works

Toxicity Detection Matrix

CategoryExamples
Hate SpeechRacial/ethnic slurs
Identity AttacksTargeted harassment
ViolenceThreats or glorification
Physical HarmDangerous instructions
Sexual ContentExplicit material
ProfanityObscene language

Prompt Injection Protection

Attack TypeDefense Strategy
Role-Play/JailbreaksDAN attack prevention
Privilege EscalationPolicy enforcement
Prompt LeakageSensitive data masking
Adversarial SuffixesEncoding detection
Privacy AttacksPII redaction
Malicious CodeSecure code generation

The Future of Trusted AI at Salesforce

Our ongoing research spans:

  • Cultural appropriateness in AI outputs
  • Hallucination reduction
  • Factual consistency in summarization
  • The xAlign framework for human-AI alignment

Experience safer AI today: SFR-Guard technologies power Salesforce’s Trust Layer, Security Checks, and Guardrails – ensuring your Agentforce deployments remain both powerful and protected.

“In the AI era, trust isn’t a feature—it’s the foundation.”
— Salesforce AI Research

tectonic salesforce partner
Related Posts
Salesforce OEM AppExchange
Salesforce OEM AppExchange

Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more

The Salesforce Story
The Salesforce Story

In Marc Benioff's own words How did salesforce.com grow from a start up in a rented apartment into the world's Read more

Salesforce Jigsaw
Salesforce Jigsaw

Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more

Service Cloud with AI-Driven Intelligence
Salesforce Service Cloud

Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

author avatar
get-admin