metadata

license: mit
language:
  - en
base_model:
  - microsoft/phi-4

Model Card for Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit

Model Overview

Model Name: Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit
Model Type: Instruction-tuned, Quantized GPT-4-based language model
Quantization: GPTQ 4-bit
Author: Satwik11
Hosted on: Hugging Face

Description

This model is a quantized version of the Microsoft phi-4 Instruct model, designed to deliver high performance while maintaining computational efficiency. By leveraging the GPTQ 4-bit quantization method, it enables deployment in environments with limited resources while retaining a high degree of accuracy.

The model is fine-tuned for instruction-following tasks, making it ideal for applications in conversational AI, question answering, and general-purpose text generation.

Key Features

Instruction-tuned: Fine-tuned to follow human-like instructions effectively.
Quantized for Efficiency: Uses GPTQ 4-bit quantization to reduce memory requirements and inference latency.
Pre-trained Base: Built on the Microsoft phi-4 framework, ensuring state-of-the-art performance on NLP tasks.

Use Cases

Chatbots and virtual assistants.
Summarization and content generation.
Research and educational applications.
Semantic search and knowledge retrieval.

Model Details

Architecture

Base Model: Microsoft phi-4
Quantization Technique: GPTQ (4-bit)
Language: English
Training Objective: Instruction-following fine-tuning