Satwik11's picture
Update README.md
0ba75d1 verified
|
raw
history blame
1.57 kB
metadata
license: mit
language:
  - en
base_model:
  - microsoft/phi-4

Model Card for Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit

Model Overview

Model Name: Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit
Model Type: Instruction-tuned, Quantized GPT-4-based language model
Quantization: GPTQ 4-bit
Author: Satwik11
Hosted on: Hugging Face

Description

This model is a quantized version of the Microsoft phi-4 Instruct model, designed to deliver high performance while maintaining computational efficiency. By leveraging the GPTQ 4-bit quantization method, it enables deployment in environments with limited resources while retaining a high degree of accuracy.

The model is fine-tuned for instruction-following tasks, making it ideal for applications in conversational AI, question answering, and general-purpose text generation.

Key Features

  • Instruction-tuned: Fine-tuned to follow human-like instructions effectively.
  • Quantized for Efficiency: Uses GPTQ 4-bit quantization to reduce memory requirements and inference latency.
  • Pre-trained Base: Built on the Microsoft phi-4 framework, ensuring state-of-the-art performance on NLP tasks.

Use Cases

  • Chatbots and virtual assistants.
  • Summarization and content generation.
  • Research and educational applications.
  • Semantic search and knowledge retrieval.

Model Details

Architecture

  • Base Model: Microsoft phi-4
  • Quantization Technique: GPTQ (4-bit)
  • Language: English
  • Training Objective: Instruction-following fine-tuning