OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? Paper • 2501.05510 • Published Jan 9, 2025 • 22
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper • 2501.04003 • Published Jan 7, 2025 • 20
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning Paper • 2501.03226 • Published Jan 6, 2025 • 34
🥇 Open LMM Reasoning Leaderboard Space • A leaderboard that demonstrates LMM reasoning capabilities
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions Paper • 2412.09596 • Published Dec 12, 2024 • 92
Post • Try InternThinker: https://internlm-chat.intern-ai.org.cn/internthinker
MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs Paper • 2411.15296 • Published Nov 22, 2024 • 19
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models Paper • 2407.11691 • Published Jul 16, 2024 • 13
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs Paper • 2410.12405 • Published Oct 16, 2024 • 13