GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks Paper • 2409.13832 • Published Sep 20, 2024
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control Paper • 2409.15977 • Published Sep 24, 2024
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis Paper • 2312.10741 • Published Dec 17, 2023
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models Paper • 2406.12671 • Published Jun 18, 2024
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation Paper • 2410.02369 • Published Oct 3, 2024 • 1
Improving Neural Indoor Surface Reconstruction with Mask-Guided Adaptive Consistency Constraints Paper • 2309.09739 • Published Sep 18, 2023
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation Paper • 2405.15619 • Published May 24, 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction Paper • 2409.18124 • Published Sep 26, 2024 • 32
Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection Paper • 2308.14286 • Published Aug 28, 2023
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model Paper • 2406.19905 • Published Jun 28, 2024
ClassPruning: Speed Up Image Restoration Networks by Dynamic N:M Pruning Paper • 2211.05488 • Published Nov 10, 2022 • 1
Multi-Curve Translator for High-Resolution Photorealistic Image Translation Paper • 2203.07756 • Published Mar 15, 2022 • 1
Modular Degradation Simulation and Restoration for Under-Display Camera Paper • 2209.11455 • Published Sep 23, 2022 • 1
StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement Paper • 2107.12898 • Published Jul 27, 2021 • 2
Rethinking Performance Gains in Image Dehazing Networks Paper • 2209.11448 • Published Sep 23, 2022 • 1
FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models Paper • 2308.05733 • Published Aug 10, 2023
Diffusion Models Trained with Large Data Are Transferable Visual Models Paper • 2403.06090 • Published Mar 10, 2024
Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth Paper • 2202.01470 • Published Feb 3, 2022
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Paper • 2403.16627 • Published Mar 25, 2024 • 20