vansin's picture

vansin

vansin

·

vansin

AI & ML interests

None yet

Recent Activity

commented on a paper 4 days ago

Retrieval Models Aren't Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models

commented on a paper 4 days ago

FLAME: A Federated Learning Benchmark for Robotic Manipulation

commented on a paper 4 days ago

Benchmarking Large Language Models for Multi-Language Software Vulnerability Detection

View all activity

Organizations

vansin's activity

commented 5 papers 4 days ago

Retrieval Models Aren't Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models

Paper • 2503.01763 • Published 6 days ago • 4 •

FLAME: A Federated Learning Benchmark for Robotic Manipulation

Paper • 2503.01729 • Published 6 days ago • 4 •

Benchmarking Large Language Models for Multi-Language Software Vulnerability Detection

Paper • 2503.01449 • Published 7 days ago • 4 •

CognitiveDrone: A VLA Model and Evaluation Benchmark for Real-Time Cognitive Task Solving and Reasoning in UAVs

Paper • 2503.01378 • Published 7 days ago • 3 •

SwiLTra-Bench: The Swiss Legal Translation Benchmark

Paper • 2503.01372 • Published 7 days ago • 2 •

commented a paper 17 days ago

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

Paper • 2502.14866 • Published 17 days ago • 12 •

New activity in cais/hle 17 days ago

Application of InternLM3 to Join the leaderboard

#5 opened 17 days ago by

New activity in vectara/leaderboard 17 days ago

Application of InternLM3 to Join the leaderboard

#9 opened 17 days ago by

commented a paper 25 days ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published 28 days ago • 126 •

commented a paper 26 days ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 27 days ago • 60 •

commented a paper 27 days ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 27 days ago • 60 •

New activity in deepseek-ai/DeepSeek-R1 about 1 month ago

How to deploy DeepSeek-R1 witn LMDeploy ?

#48 opened about 1 month ago by

commented a paper 3 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 136 •

New activity in akhaliq/anychat 3 months ago

mindsearch

#20 opened 3 months ago by

add_mindsearch

#19 opened 3 months ago by

commented a paper 3 months ago

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Paper • 2407.20183 • Published Jul 29, 2024 • 42 •

New activity in vansin/MS 3 months ago

# [Need Help] Deploy The MindSearch Gradio to Hugging Face

#1 opened 3 months ago by

New activity in THUDM/cogvlm2-llama3-chat-19B 4 months ago

CogVLM models are supported by LMDeploy for inference

#11 opened 8 months ago by

New activity in meta-llama/Meta-Llama-3-8B 4 months ago

llama3 answer repeat to many times

#220 opened 6 months ago by

New activity in lmarena-ai/chatbot-arena-leaderboard 6 months ago

Please add InternLM2.5-20B-Chat and InternLM2.5-7B-Chat to Leaderboard

#61 opened 6 months ago by