EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-directml

Performance Metrics

DirectML

We measured the performance of DirectML on AMD Ryzen 9 7940HS /w Radeon 78

Prompt Length Generation Length Average Throughput (tps)
128 128 -
128 256 -
128 512 -
128 1024 -
256 128 -
256 256 -
256 512 -
256 1024 -
512 128 -
512 256 -
512 512 -
512 1024 -
1024 128 -
1024 256 -
1024 512 -
1024 1024 -
Downloads last month
4
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API has been turned off for this model.

Collection including EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-directml