HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation Paper • 2303.15994 • Published Mar 28, 2023 • 2
Text Promptable Surgical Instrument Segmentation with Vision-Language Models Paper • 2306.09244 • Published Jun 15, 2023 • 2
VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation Paper • 2311.16492 • Published Nov 27, 2023 • 2
Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning Paper • 2403.06728 • Published Mar 11 • 2
OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models Paper • 2407.11213 • Published Jul 15 • 3
MarDini: Masked Autoregressive Diffusion for Video Generation at Scale Paper • 2410.20280 • Published Oct 26 • 23
Learning Flow Fields in Attention for Controllable Person Image Generation Paper • 2412.08486 • Published 15 days ago • 32