Daniel Bourke

mrdbourke

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Organizations

None yet

mrdbourke's activity

liked a Space 18 days ago
replied to rwightman's post 21 days ago

MARS results look great! Just started a training run with cmars, will report back.

liked a Space 22 days ago
reacted to cfahlgren1's post with 🚀 22 days ago
We just dropped an LLM inside the SQL Console 🤯

The amazing, new Qwen/Qwen2.5-Coder-32B-Instruct model can now write SQL for any Hugging Face dataset ✨

It's 2025, you shouldn't be hand-writing SQL! This is a big step toward letting anyone do in-depth analysis on a dataset. Let us know what you think 🤗
replied to rwightman's post 22 days ago

Woah, looks like a good boost across most results. Been using torch.optim.AdamW for months. Will try out a training run today with timm.optim's cadamw.

reacted to rwightman's post with 🔥 22 days ago
There's a new timm release, v1.0.12, with a focus on optimizers. The optimizer factory has been refactored: there's now a timm.optim.list_optimizers() and a new way to register optimizers and their attributes. As always, you can use a timm optimizer like a torch one; just replace torch.optim with timm.optim.

New optimizers include:
* AdafactorBigVision - adafactorbv
* ADOPT - adopt / adoptw (decoupled decay)
* MARS - mars
* LaProp - laprop
* Cautious Optimizers - a modification applicable to all of the above; prefix the name with c, e.g. cadamw, cnadamw, csgdw, clamb, crmsproptf

I shared some caution comparisons in this model repo: rwightman/timm-optim-caution

For details, references, see the code: https://github.com/huggingface/pytorch-image-models/tree/main/timm/optim
