← Back to search

Models tagged "llm-safety"

Sort by:

Found 2 models (showing 1-2)

lucataco/prompt-guard-86m

Detect prompt injection and jailbreak attempts in text inputs. Classify a string as benign, injection, and/or jailbreak...

text-classification • prompt-injection-detection • llm-safety • 31 runs

meta/llamaguard-7b

Moderate text prompts and assistant responses for safety and policy compliance. Accepts a user message (prompt) and/or a...

content-moderation • safety-classification • text-classification • 26 runs