Compiled engines for running Whisper with TRT LLM for much faster inference.
baseten
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
1
models
614
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-A10G-v0.13.0-TP2
Updated
•
4
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-A10G-v0.13.0-TP1
Updated
•
23
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.13.0-TP2-lora
Updated
•
5
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-A10G-v0.13.0-TP2
Updated
•
4
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-A10G-v0.13.0-TP1
Updated
•
9
baseten/spec-dec-qwen-32-h100x2-1.5-h100x1
Updated
baseten/whisper_trt_medium_test1212_NVIDIA_L4_0_13_0
Updated
baseten/btest-llama3.1-70b-instruct-NVIDIA-H100-80GB-HBM3-0.15.0-TP1-fp8
Updated
baseten/btest-llama3.1-70b-instruct-NVIDIA-H100-80GB-HBM3-0.15.0-TP1-fp8-checkpoint
Updated
•
8
baseten/btest-llama3.1-70b-NVIDIA-H100-80GB-HBM3-0.15.0-TP1-fp8-checkpoint
Updated
•
6