Wu Chengyue's picture

Wu Chengyue

WuChengyue

·

AI & ML interests

None yet

Organizations

WuChengyue's activity

upvoted a paper 3 months ago

BrushEdit: All-In-One Image Inpainting and Editing

Paper • 2412.10316 • Published Dec 13, 2024 • 33

upvoted a paper 5 months ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17, 2024 • 34

upvoted 2 papers 10 months ago

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 102

Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots

Paper • 2405.07990 • Published May 13, 2024 • 20

upvoted a paper 11 months ago

Adapting LLaMA Decoder to Vision Transformer

Paper • 2404.06773 • Published Apr 10, 2024 • 18

upvoted a paper 12 months ago

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Paper • 2403.09029 • Published Mar 14, 2024 • 55

upvoted a collection about 1 year ago

AnyLLM-Pro

6 items • Updated Feb 27, 2024 • 4

upvoted 2 papers about 1 year ago

FiT: Flexible Vision Transformer for Diffusion Model

Paper • 2402.12376 • Published Feb 19, 2024 • 48

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 48