π€ Model π β π
glavin001/exllama-airoboros-7b-gpt4-1.4-gptq
Generate text from a prompt with a quantized 7B chat LLM optimized for fast inference via ExLlama (Airoboros-7B-GPT4-1.4...
Found 22 models (showing 21-22)
Generate text from a prompt with a quantized 7B chat LLM optimized for fast inference via ExLlama (Airoboros-7B-GPT4-1.4...
Generate chat-style text responses from prompts. Runs Metaβs Llama 2 7B Chat (GPTQ-quantized) for instruction following...