Simon Brandeis

sbrandeis

SBrandeis

AI & ML interests

None yet

Recent Activity

published an article 3 days ago

Cohere on Hugging Face Inference Providers 🔥

liked a Space 10 days ago

jamesliu1217/EasyControl_Ghibli

upvoted a collection 12 days ago

Organizations

sbrandeis's activity

upvoted a collection 12 days ago

Llama 4

Llama 4 release • 10 items • Updated 13 days ago • 436

upvoted an article about 2 months ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

Feb 18 • 96

upvoted an article 2 months ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4 • 143

upvoted an article 11 months ago

Article

Benchmarking Text Generation Inference

May 29, 2024 • 31

upvoted 2 collections about 1 year ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 743

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 91

upvoted 3 papers about 1 year ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 128

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 143

Locally Typical Sampling

Paper • 2202.00666 • Published Feb 1, 2022 • 2

upvoted a paper over 1 year ago

Masked Audio Generation using a Single Non-Autoregressive Transformer

Paper • 2401.04577 • Published Jan 9, 2024 • 44

upvoted a collection over 1 year ago

MAGNeT

Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated Apr 4, 2024 • 40

upvoted 4 papers over 1 year ago

QuIP: 2-Bit Quantization of Large Language Models With Guarantees

Paper • 2307.13304 • Published Jul 25, 2023 • 2

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Paper • 2312.09767 • Published Dec 15, 2023 • 27

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 81

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Paper • 2312.02145 • Published Dec 4, 2023 • 6

upvoted 2 collections over 1 year ago

Notus 7B v1

Notus 7B v1 models (DPO fine-tune of Zephyr SFT) and datasets used. More information at https://github.com/argilla-io/notus • 11 items • Updated Dec 11, 2024 • 18

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 236

upvoted 2 papers over 1 year ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 143

Positional Description Matters for Transformers Arithmetic

Paper • 2311.14737 • Published Nov 22, 2023 • 2