Satwik11
/

Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit

4-bit precision

Model card Files Files and versions Community

Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit / README.md

Satwik11's picture

Update README.md

0ba75d1 verified 2 days ago

|

history blame contribute delete

1.57 kB

	---
	license: mit
	language:
	- en
	base_model:
	- microsoft/phi-4
	---
	# Model Card for Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit

	## Model Overview

	Model Name: Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit
	Model Type: Instruction-tuned, Quantized GPT-4-based language model
	Quantization: GPTQ 4-bit
	Author: Satwik11
	Hosted on: Hugging Face

	## Description

	This model is a quantized version of the Microsoft phi-4 Instruct model, designed to deliver high performance while maintaining computational efficiency. By leveraging the GPTQ 4-bit quantization method, it enables deployment in environments with limited resources while retaining a high degree of accuracy.

	The model is fine-tuned for instruction-following tasks, making it ideal for applications in conversational AI, question answering, and general-purpose text generation.

	## Key Features

	- Instruction-tuned: Fine-tuned to follow human-like instructions effectively.
	- Quantized for Efficiency: Uses GPTQ 4-bit quantization to reduce memory requirements and inference latency.
	- Pre-trained Base: Built on the Microsoft phi-4 framework, ensuring state-of-the-art performance on NLP tasks.

	## Use Cases

	- Chatbots and virtual assistants.
	- Summarization and content generation.
	- Research and educational applications.
	- Semantic search and knowledge retrieval.

	## Model Details

	### Architecture

	- Base Model: Microsoft phi-4
	- Quantization Technique: GPTQ (4-bit)
	- Language: English
	- Training Objective: Instruction-following fine-tuning