MoVA: Adapting Mixture of Vision Experts to Multimodal Context Paper • 2404.13046 • Published Apr 19 • 1
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM Paper • 2412.09618 • Published Dec 12, 2024 • 21
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines Paper • 2409.12959 • Published Sep 19 • 36
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction Paper • 2304.00967 • Published Apr 3, 2023
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models Paper • 2407.07895 • Published Jul 10 • 40
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Paper • 2406.09411 • Published Jun 13 • 18
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching Paper • 2404.03653 • Published Apr 4 • 33
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? Paper • 2403.14624 • Published Mar 21 • 51
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention Paper • 2303.16199 • Published Mar 28, 2023 • 4
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model Paper • 2304.15010 • Published Apr 28, 2023 • 4
JourneyDB: A Benchmark for Generative Image Understanding Paper • 2307.00716 • Published Jul 3, 2023 • 19
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models Paper • 2306.11732 • Published Jun 15, 2023
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement Paper • 2304.01195 • Published Apr 3, 2023
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection Paper • 2203.13310 • Published Mar 24, 2022