Zhaopeng Tu

zptu

http://www.zptu.net

AI & ML interests

None yet

Recent Activity

commented on a paper 5 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

commented on a paper 6 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

commented on a paper 6 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

View all activity

Organizations

None yet

zptu's activity

commented a paper 5 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 11 days ago • 51 •

commented 2 papers 6 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 11 days ago • 51 •

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 11 days ago • 51 •

authored a paper 10 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 11 days ago • 51

commented a paper 2 months ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 57 •

authored 15 papers 2 months ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 57

Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench

Paper • 2310.01386 • Published Oct 2, 2023

Exploring Human-Like Translation Strategy with Large Language Models

Paper • 2305.04118 • Published May 6, 2023

Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models

Paper • 2310.20499 • Published Oct 31, 2023 • 8

A Comprehensive Study of GPT-4V's Multimodal Capabilities in Medical Imaging

Paper • 2310.20381 • Published Oct 31, 2023 • 2

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate

Paper • 2305.19118 • Published May 30, 2023

Context-Aware Cross-Attention for Non-Autoregressive Translation

Paper • 2011.00770 • Published Nov 2, 2020

GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation

Paper • 2311.16511 • Published Nov 25, 2023 • 1

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning

Paper • 2012.14768 • Published Dec 29, 2020

Understanding and Improving Lexical Choice in Non-Autoregressive Translation

Paper • 2012.14583 • Published Dec 29, 2020

Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration

Paper • 2306.09093 • Published Jun 15, 2023 • 15

Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation

Paper • 2106.00903 • Published Jun 2, 2021

Progressive Multi-Granularity Training for Non-Autoregressive Translation

Paper • 2106.05546 • Published Jun 10, 2021

On the Copying Behaviors of Pre-Training for Neural Machine Translation

Paper • 2107.08212 • Published Jul 17, 2021

Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models

Paper • 2401.08350 • Published Jan 16, 2024