Dokyoon

leeloolee

Eruly

AI & ML interests

Recent Activity

upvoted an article 4 days ago

Context Parallelism

liked a model 5 days ago

zer0int/CLIP-SAE-ViT-L-14

liked a model 5 days ago

PRIME-RL/Eurus-2-7B-PRIME

View all activity

Organizations

leeloolee's activity

upvoted an article 4 days ago

Article

Context Parallelism

•

Aug 13, 2024

• 13

upvoted a collection 5 days ago

🔍 Daily Picks in Interpretability & Analysis of LMs

Collection

Outstanding research in interpretability and evaluation of language models, summarized • 93 items • Updated 5 days ago • 96

upvoted a paper 5 days ago

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Paper • 2501.03124 • Published 6 days ago • 12

upvoted a paper 19 days ago

GUI Agents: A Survey

Paper • 2412.13501 • Published 26 days ago • 23

upvoted a paper 25 days ago

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published Dec 11, 2024 • 43

upvoted a paper 27 days ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 16

upvoted a collection 27 days ago

Multimodal-SAE

Collection

The collection of the sae that hooked on llava • 4 items • Updated 7 days ago • 5

upvoted a collection 28 days ago

GUI agents

Collection

A collection of papers on GUI agents • 3 items • Updated 30 days ago • 5

upvoted 2 papers about 1 month ago

Granite Guardian

Paper • 2412.07724 • Published Dec 10, 2024 • 18

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 122

upvoted an article about 1 month ago

Article

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

•

Nov 19, 2024

• 11

upvoted a paper about 2 months ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 58

upvoted an article about 2 months ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

•

Nov 21, 2024

• 35

upvoted 3 papers 2 months ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published Nov 9, 2024 • 45

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published Nov 12, 2024 • 22

Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

Paper • 2411.05005 • Published Nov 7, 2024 • 13

upvoted 4 papers 3 months ago

GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation

Paper • 2410.20474 • Published Oct 27, 2024 • 14

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21, 2024 • 19

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Paper • 2410.17247 • Published Oct 22, 2024 • 45

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 30