Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models Paper • 2410.07176 • Published Oct 9 • 1
Data Advisor Collection [EMNLP 2024] Dynamic and Constitutional Data Curation for LLMs • 3 items • Updated Oct 13 • 1
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models Paper • 2410.05269 • Published Oct 7 • 3
SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe Paper • 2410.05248 • Published Oct 7 • 8
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs Paper • 2410.05295 • Published Oct 3 • 12
Unraveling Cross-Modality Knowledge Conflict in Large Vision-Language Models Paper • 2410.03659 • Published Oct 4 • 6
WPO Collection Models and datasets in paper "WPO: Enhancing RLHF with Weighted Preference Optimization". • 11 items • Updated Aug 22 • 5
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Paper • 2406.09411 • Published Jun 13 • 18
Rethinking Tabular Data Understanding with Large Language Models Paper • 2312.16702 • Published Dec 27, 2023 • 4
mDPO: Conditional Preference Optimization for Multimodal Large Language Models Paper • 2406.11839 • Published Jun 17 • 37