15 533 233

Taufiq Dwi Purnomo

taufiqdp

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

liked a model 2 days ago

NovaSky-AI/Sky-T1-32B-Preview

upvoted a paper 3 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

upvoted a paper 3 days ago

An Empirical Study of Autoregressive Pre-training from Videos

View all activity

Organizations

taufiqdp's activity

liked a model 2 days ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated about 3 hours ago • 652 • 197

upvoted 2 papers 3 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 4 days ago • 64

An Empirical Study of Autoregressive Pre-training from Videos

Paper • 2501.05453 • Published 4 days ago • 30

upvoted 3 papers 4 days ago

LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published 5 days ago • 30

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 5 days ago • 74

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 5 days ago • 197

liked a model 4 days ago

microsoft/phi-4

Text Generation • Updated 5 days ago • 49.9k • 1.1k

liked a Space 5 days ago

Running

136

🔥

Attention Visualization

Vision Transformer Attention Visualization

upvoted 2 papers 5 days ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published 6 days ago • 56

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 9 days ago • 72

upvoted a collection 6 days ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated 2 days ago • 213

upvoted a paper 9 days ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published 13 days ago • 15

upvoted a paper 10 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 12 days ago • 92

upvoted 2 papers 13 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 19 days ago • 89

1.58-bit FLUX

Paper • 2412.18653 • Published 20 days ago • 69

liked a model 17 days ago

deepseek-ai/DeepSeek-V3

Updated 14 days ago • 115k • 1.74k

upvoted a paper 19 days ago

OpenAI o1 System Card

Paper • 2412.16720 • Published 23 days ago • 31

liked a model 20 days ago

Qwen/QVQ-72B-Preview

Image-Text-to-Text • Updated 1 day ago • 123k • 489

liked a Space 21 days ago

Running

🏢

Hub Recap

upvoted a paper 21 days ago

MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design

Paper • 2412.14590 • Published 25 days ago • 13