Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper β’ 2411.14405 β’ Published Nov 21 β’ 58 β’ 4
Zero-shot Model-based Reinforcement Learning using Large Language Models Paper β’ 2410.11711 β’ Published Oct 15 β’ 8 β’ 4
Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media Paper β’ 2410.12791 β’ Published Oct 16 β’ 4 β’ 3
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis Paper β’ 2410.02749 β’ Published Oct 3 β’ 12 β’ 3
LLaVA-Critic: Learning to Evaluate Multimodal Models Paper β’ 2410.02712 β’ Published Oct 3 β’ 35 β’ 3
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Paper β’ 2409.12568 β’ Published Sep 19 β’ 47 β’ 4
Insights from Benchmarking Frontier Language Models on Web App Code Generation Paper β’ 2409.05177 β’ Published Sep 8 β’ 5 β’ 3
Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak Paper β’ 2409.04269 β’ Published Sep 6 β’ 9 β’ 3
Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak Paper β’ 2409.04269 β’ Published Sep 6 β’ 9 β’ 3
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance Paper β’ 2409.04593 β’ Published Sep 6 β’ 23 β’ 2
Benchmarking Chinese Knowledge Rectification in Large Language Models Paper β’ 2409.05806 β’ Published Sep 9 β’ 13 β’ 3