𝒕𝒂𝒏𝒗𝒊𝒓's picture

𝒕𝒂𝒏𝒗𝒊𝒓

Tanvir1337

·

https://linktr.ee/tanvir1337x

AI & ML interests

Deep Learning, Generative Adversarial Networks, Transformer, Diffusion, SOTA Foundation Models

Recent Activity

updated a collection about 7 hours ago

liked a Space about 7 hours ago

mvaloatto/TCTF

upvoted an article about 7 hours ago

Mastering Tensor Dimensions in Transformers

View all activity

Organizations

Tanvir1337's activity

upvoted an article about 7 hours ago

Article

Mastering Tensor Dimensions in Transformers

By

•

about 21 hours ago

• 11

upvoted a paper 3 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 5 days ago • 68

upvoted a collection about 1 month ago

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated Dec 10, 2024 • 87

upvoted a collection about 2 months ago

CogVideo

10 items • Updated Nov 27, 2024 • 46

upvoted 5 collections 2 months ago

GPT-Generated Unified Format (GGUF)

ease of reading • 44 items • Updated 4 days ago • 10

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 79

Flux LoRA Collections

Flux THE LoRA • 130 items • Updated about 16 hours ago • 32

Llama3-8B-1.58

A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14, 2024 • 12

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 22 days ago • 198

upvoted a paper 2 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 127

upvoted a collection 2 months ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101

upvoted a paper 3 months ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 53

upvoted 2 collections 3 months ago

D_AU - Source files for GGUF, EXL2, AWQ, GPTQ, HQQ etc etc

Safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card on each. • 76 items • Updated 14 days ago • 7

GGUF Image Model Quants

List of GGUF quants for text to image base models. • 12 items • Updated 5 days ago • 19

upvoted a paper 3 months ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8, 2024 • 83

upvoted 4 articles 3 months ago

Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

By

•

Oct 20, 2024

• 35

Article

Mamba Out

By

•

Oct 18, 2024

• 8

Article

AI is turning nuclear: a review

By

•

Oct 20, 2024

• 11

Article

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

By

•

Oct 21, 2024

• 32

upvoted a collection 3 months ago

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 26 days ago • 96