
Found 15 models (showing 1-15)

lucataco/nsfw_video_detection
Detect NSFW content in videos. Accepts a video and returns a single label: "normal" or "nsfw", with adjustable strictnes...
Detect NSFW content in images. Accepts an image and returns a binary label of "nsfw" or "normal" for content moderation...
Detect NSFW content in images for moderation. Accepts an image and returns category flags and confidence scores for nudi...
Moderate images and prompts for NSFW, public figures, and copyright risk. Accepts an image and/or text prompt and return...
Moderate text-to-image prompts by scoring NSFW/toxicity on a 0–10 safety scale. Takes a text prompt and returns an integ...
Classify text prompts, model responses, and multiple images for safety policy compliance. Accepts text and a list of ima...
Moderate LLM conversations by classifying user prompts and assistant responses as SAFE or UNSAFE and listing violated po...
Classify text for safety policy compliance. Takes a user prompt and/or an assistant response and returns a safe/unsafe l...
Detect hate speech and toxic language in text. Accepts a text string and returns probability scores for toxicity, severe...
Classify images for safety violations across sexually_explicit, dangerous_content, and violence_gore policies. Takes an...
Moderate images and accompanying user messages by classifying safety risks. Takes an image and optional text input; outp...
Detect NSFW content in images for content moderation. Takes a single image as input and returns a binary label (safe or...
Detect NSFW content in images. Analyze an input image with Stable Diffusion’s content filter and return a structured mod...
Moderate text prompts and assistant responses for safety policy compliance. Accepts a user prompt and/or an assistant me...
Moderate images for safety and policy compliance. Accepts an image (optional custom prompt) and returns structured JSON...
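
For reference, a minimal sketch of calling the first (and only named) result, lucataco/nsfw_video_detection, through the Replicate Python client. The input key "video" and the sample clip URL are assumptions, not confirmed from this listing; check the model's input schema on Replicate before relying on them, and set REPLICATE_API_TOKEN in the environment.

    # Minimal sketch, assuming the model takes its clip under an input key named "video".
    import replicate

    output = replicate.run(
        "lucataco/nsfw_video_detection",
        input={"video": "https://example.com/sample-clip.mp4"},  # hypothetical URL
    )
    print(output)  # per the description above, a single label: "normal" or "nsfw"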