-
GenEx: Generating an Explorable World
Paper • 2412.09624 • Published • 84 -
IamCreateAI/Ruyi-Mini-7B
Image-to-Video • Updated • 12.8k • 458 -
Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation
Paper • 2412.06016 • Published • 20 -
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper • 2412.09871 • Published • 74
Doğuş Can Korkmaz
doguscank
AI & ML interests
Vision, LLMs, vLLMs, semantic segmentation, forecasting
Recent Activity
upvoted
a
paper
2 days ago
Taming Multimodal Joint Training for High-Quality Video-to-Audio
Synthesis
updated
a collection
2 days ago
to read
updated
a collection
2 days ago
to read
Organizations
None yet
Collections
9
models
1
datasets
None public yet