atrifat/topic-classification
Found 17 models
Classify the topic of tweets and short-form text. Accepts a text string and returns a ranked list of topic labels with c...
Classify sentiment from text input. Return a sentiment label (e.g., positive or negative) and a confidence score for the...
Detect whether text is AI- or human-written. Accepts text input and returns a likelihood score indicating the probabilit...
Classify text prompts for text-to-image systems and return a safety score from 0 (safe) to 10 (very NSFW/toxic). Accepts...
Classify text prompts and assistant responses for safety. Accepts a user message and/or assistant reply and outputs a sa...
Generate text from a prompt. Uses a LLaMA 1.0 13B language model fine-tuned for academic tasks such as text classificati...
Classify a text string as POSITIVE or NEGATIVE sentiment using a BERT-based binary classifier. Input: text. Output: sent...
Detect hate speech and toxic content in text. Accepts a text string and returns JSON scores for toxicity, severe_toxicit...
Classify text into user-provided categories and rank labels by confidence. Accepts a text sequence and a comma-separated...
Classify text into user-defined categories without training (zero-shot). Accepts an input text and a comma-separated lis...
Detect prompt injection and jailbreak attempts in text inputs. Classify a string as benign, injection, and/or jailbreak...
Moderate LLM prompts and responses for safety policy compliance. Accepts a user prompt and/or an assistant reply as text...
Classify sentiment of text as positive, negative, or neutral. Accepts a text input and returns an overall sentiment labe...
Moderate text prompts and assistant responses for safety and policy compliance. Accepts a user message (prompt) and/or a...
Classify sentiment in text. Accepts a text string and returns sentiment labels—positive, neutral, negative—with confiden...
Classify text against a custom safety policy. Provide a plain‑English policy and a text input, and receive a structured...
Classify sentiment of one or many texts and return positive/negative labels with confidence scores. Accepts a batch of t...