hysts

AI & ML interests

Computer Vision

Recent Activity

new activity 3 days ago

EXCAI/Diffusion-As-Shader:Apply for community grant: Academic project (gpu and storage)

liked a Space 3 days ago

EXCAI/Diffusion-As-Shader

updated a Space 4 days ago

hysts/age-estimation-APPA-REAL

hysts's activity

upvoted an article 4 days ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

6 days ago

• 57

upvoted an article 12 days ago

Article

FastRTC: The Real-Time Communication Library for Python

13 days ago

• 130

upvoted 3 articles 25 days ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

By

and 1 other •

27 days ago

• 26

Article

Open R1: Update #2

By

and 6 others •

27 days ago

• 197

Article

Open-R1: Update #1

By

and 7 others •

Feb 2

• 293

upvoted 4 articles about 1 month ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.14k

Article

The AI tools for Art Newsletter - Issue 1

Jan 31

• 70

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 795

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 417

upvoted a collection 6 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted a paper 10 months ago

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 88

upvoted a paper about 1 year ago

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 115

upvoted 8 papers over 1 year ago

ChatAnything: Facetime Chat with LLM-Enhanced Personas

Paper • 2311.06772 • Published Nov 12, 2023 • 35

Music ControlNet: Multiple Time-varying Controls for Music Generation

Paper • 2311.07069 • Published Nov 13, 2023 • 44

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

Paper • 2311.06783 • Published Nov 12, 2023 • 28

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Paper • 2311.04145 • Published Nov 7, 2023 • 35

Learning From Mistakes Makes LLM Better Reasoner

Paper • 2310.20689 • Published Oct 31, 2023 • 29

CapsFusion: Rethinking Image-Text Data at Scale

Paper • 2310.20550 • Published Oct 31, 2023 • 26

Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks

Paper • 2310.19909 • Published Oct 30, 2023 • 21

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

Paper • 2310.19512 • Published Oct 30, 2023 • 16