HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs Paper • 2503.02003 • Published 6 days ago • 37
HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs Paper • 2503.02003 • Published 6 days ago • 37
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models Paper • 2502.09696 • Published 24 days ago • 38
VideoGameBunny: Towards vision assistants for video games Paper • 2407.15295 • Published Jul 21, 2024 • 22
Allowing humans to interactively guide machines where to look does not always improve a human-AI team's classification accuracy Paper • 2404.05238 • Published Apr 8, 2024 • 3
GlitchBench: Can large multimodal models detect video game glitches? Paper • 2312.05291 • Published Dec 8, 2023 • 3
GlitchBench: Can large multimodal models detect video game glitches? Paper • 2312.05291 • Published Dec 8, 2023 • 3
Explaining image classifiers by removing input features using generative models Paper • 1910.04256 • Published Oct 9, 2019