42 215

luokai

iamluokai

AI & ML interests

None yet

Recent Activity

liked a Space about 12 hours ago

lpiccinelli/UniK3D-demo

liked a model 3 days ago

Junfeng5/Liquid_V1_7B

iamluokai's activity

upvoted a collection 8 days ago

InternVL3

34 items • Updated 2 days ago • 49

upvoted a paper 15 days ago

SkyReels-A2: Compose Anything in Video Diffusion Transformers

Paper • 2504.02436 • Published 16 days ago • 35

upvoted a paper about 1 month ago

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 134

upvoted a collection about 1 month ago

Wan2.1 14B 480p I2V LoRAs

A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 39 items • Updated 18 days ago • 102

upvoted 2 collections about 2 months ago

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated about 1 month ago • 104

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 2 days ago • 218

upvoted a paper 2 months ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14 • 55

upvoted a paper 3 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 92

upvoted a paper 4 months ago

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 84

upvoted a collection 4 months ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 16 days ago • 146

upvoted 2 collections 5 months ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 2 days ago • 90

CogVideo

10 items • Updated 6 days ago • 50

upvoted a paper 5 months ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 76

upvoted a collection 6 months ago

LongVU

7 items • Updated Oct 31, 2024 • 30

upvoted a paper 6 months ago

Framer: Interactive Frame Interpolation

Paper • 2410.18978 • Published Oct 24, 2024 • 38

upvoted a collection 7 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Mar 13 • 302

upvoted 2 papers 7 months ago

LVCD: Reference-based Lineart Video Colorization with Diffusion Models

Paper • 2409.12960 • Published Sep 19, 2024 • 25

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

Paper • 2409.09214 • Published Sep 13, 2024 • 54

upvoted a collection 8 months ago

Jamba 1.5

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Mar 6 • 87