Microsoft.ML.OnnxRuntimeGenAI unknown Provider type web
#1
by
HannesPP
- opened
my try replace a existin phi-3 onnx with your model
what do I miss?
-Hannes
this flavor is for ort-web. If you want to run it with onnxruntime-genai you can try to replace 'web' in genai_config.json with 'cuda'.
The model uses MHA and fp32 outputs for logits - cuda normally would use GQA and fp16 so it is not the best possible model for cuda but it should work.