Edmond Jacoupeau

edmond

AI & ML interests

None yet

Recent Activity

liked a model 14 days ago

NX-AI/xLSTM-7b

liked a Space 18 days ago

akhaliq/anychat

liked a model about 2 months ago

facebook/MobileLLM-1B

edmond's activity

upvoted a collection 2 months ago

Llama3-8B-1.58

A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14 • 12

upvoted an article 3 months ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Article • Sep 18 • 213

upvoted a paper 5 months ago

KAN or MLP: A Fairer Comparison

Paper • 2407.16674 • Published Jul 23 • 42

upvoted 2 collections 6 months ago

Gemma 2 Release

15 items • Updated 12 days ago • 206

Florence

9 items • Updated Jul 11 • 160

upvoted a paper 7 months ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23 • 37

upvoted an article 7 months ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Article • May 14 • 226

upvoted a collection 7 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 12 days ago • 142

upvoted a collection 8 months ago

LLaVA++ (LLaMA-3 and Phi-3-Mini)

Extending Visual Capabilities of LLaVA with LLaMA-3 and Phi-3 • 11 items • Updated Jun 11 • 23

upvoted 3 papers 8 months ago

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10 • 104

Voyager: An Open-Ended Embodied Agent with Large Language Models

Paper • 2305.16291 • Published May 25, 2023 • 9

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Paper • 2206.08853 • Published Jun 17, 2022 • 1

upvoted a collection 8 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 19 days ago • 697

upvoted a paper 8 months ago

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18 • 54

upvoted 2 papers 12 months ago

Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels

Paper • 2312.17090 • Published Dec 28, 2023 • 4

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

Paper • 2312.14385 • Published Dec 22, 2023 • 5

upvoted 4 papers about 1 year ago

Pixel Aligned Language Models

Paper • 2312.09237 • Published Dec 14, 2023 • 14

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 37

Exponentially Faster Language Modelling

Paper • 2311.10770 • Published Nov 15, 2023 • 117

Language Models can be Logical Solvers

Paper • 2311.06158 • Published Nov 10, 2023 • 18