Abdoul Majid O. Thiombiano's picture

111 13

Abdoul Majid O. Thiombiano

thiomajid

·

https://thiomajid.github.io/

AI & ML interests

NLP & Reasoning

Organizations

thiomajid's activity

upvoted a paper 9 days ago

SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization

Paper • 2410.21411 • Published 14 days ago • 19

upvoted a paper 11 days ago

A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks

Paper • 2410.22391 • Published 13 days ago • 21

upvoted a paper 13 days ago

GPT-4o System Card

Paper • 2410.21276 • Published 17 days ago • 77

upvoted a paper 17 days ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8 • 80

upvoted a paper 19 days ago

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Paper • 2410.17215 • Published 20 days ago • 12

upvoted a paper 20 days ago

Pre-training Distillation for Large Language Models: A Design Space Exploration

Paper • 2410.16215 • Published 21 days ago • 15

upvoted a paper 24 days ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published 25 days ago • 86

upvoted 7 papers about 1 month ago

Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate

Paper • 2410.07167 • Published Oct 9 • 37

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8 • 107

Cottention: Linear Transformers With Cosine Attention

Paper • 2409.18747 • Published Sep 27 • 15

MIO: A Foundation Model on Multimodal Tokens

Paper • 2409.17692 • Published Sep 26 • 49

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27 • 90

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling

Paper • 2409.19291 • Published Sep 28 • 18

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 46

upvoted 3 papers about 2 months ago

Making Text Embedders Few-Shot Learners

Paper • 2409.15700 • Published Sep 24 • 29

MaskBit: Embedding-free Image Generation via Bit Tokens

Paper • 2409.16211 • Published Sep 24 • 16

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 128

upvoted a collection about 2 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated Oct 3 • 59

upvoted 2 papers 2 months ago

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

Paper • 2409.05591 • Published Sep 9 • 28

Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5 • 87