Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 11 days ago • 51 • 11
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 11 days ago • 51 • 11
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 11 days ago • 51 • 11
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 11 days ago • 51
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 57 • 7
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 57
Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench Paper • 2310.01386 • Published Oct 2, 2023
Exploring Human-Like Translation Strategy with Large Language Models Paper • 2305.04118 • Published May 6, 2023
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models Paper • 2310.20499 • Published Oct 31, 2023 • 8
A Comprehensive Study of GPT-4V's Multimodal Capabilities in Medical Imaging Paper • 2310.20381 • Published Oct 31, 2023 • 2
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate Paper • 2305.19118 • Published May 30, 2023
Context-Aware Cross-Attention for Non-Autoregressive Translation Paper • 2011.00770 • Published Nov 2, 2020
GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation Paper • 2311.16511 • Published Nov 25, 2023 • 1
Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning Paper • 2012.14768 • Published Dec 29, 2020
Understanding and Improving Lexical Choice in Non-Autoregressive Translation Paper • 2012.14583 • Published Dec 29, 2020
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration Paper • 2306.09093 • Published Jun 15, 2023 • 15
Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation Paper • 2106.00903 • Published Jun 2, 2021
Progressive Multi-Granularity Training for Non-Autoregressive Translation Paper • 2106.05546 • Published Jun 10, 2021
On the Copying Behaviors of Pre-Training for Neural Machine Translation Paper • 2107.08212 • Published Jul 17, 2021
Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models Paper • 2401.08350 • Published Jan 16, 2024