Triangle104 committed
Commit b26361f · verified · 1 Parent(s): 4d6b474

Update README.md

Files changed (1): README.md (+58, -0)
README.md CHANGED
@@ -28,6 +28,64 @@ base_model: microsoft/phi-4
  This model was converted to GGUF format from [`microsoft/phi-4`](https://huggingface.co/microsoft/phi-4) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
  Refer to the [original model card](https://huggingface.co/microsoft/phi-4) for more details on the model.
 
+ ---
+ Model details:
+ - Developers: Microsoft Research
+ - Description: phi-4 is a state-of-the-art open model built upon a blend of synthetic datasets, data from filtered public domain websites, and acquired academic books and Q&A datasets. The goal of this approach was to ensure that small capable models were trained with data focused on high quality and advanced reasoning. phi-4 underwent a rigorous enhancement and alignment process, incorporating both supervised fine-tuning and direct preference optimization, to ensure precise instruction adherence and robust safety measures.
+ - Architecture: 14B parameters, dense decoder-only Transformer model
+ - Inputs: Text, best suited for prompts in the chat format (see the prompt sketch after this list)
+ - Context length: 16K tokens
+ - GPUs: 1920 H100-80G
+ - Training time: 21 days
+ - Training data: 9.8T tokens
+ - Outputs: Generated text in response to input
+ - Dates: October 2024 – November 2024
+ - Status: Static model trained on an offline dataset with cutoff dates of June 2024 and earlier for publicly available data
+ - Release date: December 12, 2024
+ - License: MIT
+ ---
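
For reference, the upstream phi-4 model card describes a chat format built on `<|im_start|>`, `<|im_sep|>`, and `<|im_end|>` tokens. A minimal sketch of a formatted prompt (the system and user messages here are illustrative, not taken from the original card):

```
<|im_start|>system<|im_sep|>
You are a helpful assistant.<|im_end|>
<|im_start|>user<|im_sep|>
What is the capital of France?<|im_end|>
<|im_start|>assistant<|im_sep|>
```
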
  ## Use with llama.cpp
  Install llama.cpp through brew (works on Mac and Linux)
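
A hedged sketch of the usual workflow for GGUF-my-repo conversions; the repo and file names below (`Triangle104/phi-4-GGUF`, `phi-4-q4_k_m.gguf`) are placeholders, so substitute this repository's actual quantized artifact:

```bash
# Install the llama.cpp Homebrew formula (macOS and Linux)
brew install llama.cpp

# Run the CLI directly against a GGUF file hosted on the Hub.
# NOTE: repo and file names are placeholders for this repo's real artifact.
llama-cli --hf-repo Triangle104/phi-4-GGUF --hf-file phi-4-q4_k_m.gguf \
  -p "The meaning to life and the universe is"

# Or serve the same weights behind an OpenAI-compatible endpoint:
llama-server --hf-repo Triangle104/phi-4-GGUF --hf-file phi-4-q4_k_m.gguf -c 2048
```

The `--hf-repo`/`--hf-file` flags let llama.cpp download and cache the model from the Hub on first use, so no separate download step is needed.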