SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 17 days ago • 128
Running on L4 • 1.54k • MagicQuill 🪶 Edit and enhance images with custom color and edge modifications
Running • 286 • Kokoro Text-to-Speech (WebGPU) 🗣 High-quality speech synthesis powered by Kokoro TTS