Anthonny Olime's picture

Anthonny Olime

Aviv-anthonnyolime

·

AI & ML interests

None yet

Recent Activity

liked a model about 8 hours ago

deepseek-ai/DeepSeek-V3-Base

liked a model about 8 hours ago

ibm-granite/granite-3.1-8b-base

liked a model 1 day ago

Qwen/QVQ-72B-Preview

View all activity

Organizations

Aviv-anthonnyolime's activity

liked 2 models about 8 hours ago

deepseek-ai/DeepSeek-V3-Base

Updated about 13 hours ago • 327

ibm-granite/granite-3.1-8b-base

Text Generation • Updated 6 days ago • 3.57k • 10

liked 3 models 1 day ago

Qwen/QVQ-72B-Preview

Image-Text-to-Text • Updated 1 day ago • 1.07k • 258

facebook/dinov2-with-registers-giant

Image Feature Extraction • Updated 3 days ago • 35 • 3

facebook/dinov2-with-registers-giant-imagenet1k-1-layer

Image Classification • Updated 3 days ago • 30 • 1

updated a collection 1 day ago

Paper - Multimodal

Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated 1 day ago • 1

upvoted a paper 1 day ago

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 78

updated a collection 1 day ago

Paper - Multimodal

Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated 1 day ago • 1

upvoted 2 papers 1 day ago

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published 2 days ago • 24

Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?

Paper • 2307.14023 • Published Jul 26, 2023 • 1

updated a collection 1 day ago

Paper - Multimodal

Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated 1 day ago • 1

upvoted a paper 1 day ago

MoEUT: Mixture-of-Experts Universal Transformers

Paper • 2405.16039 • Published May 25 • 1

updated a collection 1 day ago

Paper - Multimodal

Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated 1 day ago • 1

upvoted 2 papers 2 days ago

Causal Diffusion Transformers for Generative Modeling

Paper • 2412.12095 • Published 9 days ago • 23

A Touch, Vision, and Language Dataset for Multimodal Alignment

Paper • 2402.13232 • Published Feb 20 • 14

updated a collection 2 days ago

Paper - Multimodal

Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated 1 day ago • 1

upvoted a paper 2 days ago

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

Paper • 2402.00340 • Published Feb 1 • 1

updated a collection 2 days ago

Paper - Multimodal

Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated 1 day ago • 1

upvoted a paper 2 days ago

Optimizing Byte-level Representation for End-to-end ASR

Paper • 2406.09676 • Published Jun 14 • 1