VILARIN

vilarin

AI & ML interests

Pantheon

Recent Activity

reacted to nroggendorff's post with ๐Ÿ˜” about 19 hours ago
im so tired
liked a model 9 days ago
franciszzj/Leffa
liked a Space 10 days ago
yslan/ObjCtrl-2.5D
View all activity

Organizations

SIOKHE's profile picture Blog-explorers's profile picture ZeroGPU Explorers's profile picture Chinese LLMs on Hugging Face's profile picture open/ acc's profile picture

vilarin's activity

reacted to nroggendorff's post with ๐Ÿ˜” about 19 hours ago
view post
Post
3025
im so tired
  • 3 replies
ยท
reacted to merve's post with ๐Ÿš€ 29 days ago
view post
Post
3871
Small yet mighty! ๐Ÿ’ซ

We are releasing SmolVLM: a new 2B small vision language made for on-device use, fine-tunable on consumer GPU, immensely memory efficient ๐Ÿค 

We release three checkpoints under Apache 2.0: SmolVLM-Instruct, SmolVLM-Synthetic and SmolVLM-Base HuggingFaceTB/smolvlm-6740bd584b2dcbf51ecb1f39

Learn more from our blog here: huggingface.co/blog/smolvlm
This release comes with a demo, fine-tuning code, MLX integration and TRL integration for DPO ๐Ÿ’
Try the demo: HuggingFaceTB/SmolVLM
Fine-tuning Recipe: https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
Also TRL integration for DPO ๐Ÿ’—
reacted to davanstrien's post with โค๏ธ 30 days ago
view post
Post
2472
First dataset for the new Hugging Face Bluesky community organisation: bluesky-community/one-million-bluesky-posts ๐Ÿฆ‹

๐Ÿ“Š 1M public posts from Bluesky's firehose API
๐Ÿ” Includes text, metadata, and language predictions
๐Ÿ”ฌ Perfect to experiment with using ML for Bluesky ๐Ÿค—

Excited to see people build more open tools for a more open social media platform!
posted an update 30 days ago
view post
Post
1375
A few days ago, Blackforestlabs released FLUX.1 Tools, which has surprised everyone with its quality and effects. Now that diffusers support these features, you can easily deploy and build your own Tools.
Combined with the powerful Gradio and ZeroGPU, you can experience the Tools immediately, which is truly wonderful.
I was impressed by the Flux.1 Fill dev, so here I've built a demo for it, making it easy to use for inpainting and outpainting images.

๐Ÿ„Model: black-forest-labs/FLUX.1-Fill-dev
๐Ÿฆ–Demo: vilarin/Flux.1-Fill-dev
๐Ÿ‘diffusers: https://github.com/huggingface/diffusers/tree/main/src/diffusers/pipelines/flux
posted an update about 1 month ago
view post
Post
1100
๐Ÿ„โ€โ™‚๏ธWhile browsing new models, I stumbled upon Lumiere from aixonlab. After testing it, I feel it has considerable potential. Keep up the good work!

Lumiere Alpha is a model focusing on improving realism without compromising prompt coherency or changing the composition completely from the original Flux.1-Dev model.

๐Ÿฆ„ Model: aixonlab/flux.1-lumiere-alpha

๐Ÿฆ– Demo: vilarin/lumiere
  • 1 reply
ยท
reacted to merve's post with ๐Ÿ‘€ 2 months ago
reacted to merve's post with ๐Ÿ”ฅ 4 months ago
view post
Post
5565
I have put together a notebook on Multimodal RAG, where we do not process the documents with hefty pipelines but natively use:
- vidore/colpali for retrieval ๐Ÿ“– it doesn't need indexing with image-text pairs but just images!
- Qwen/Qwen2-VL-2B-Instruct for generation ๐Ÿ’ฌ directly feed images as is to a vision language model with no processing to text!
I used ColPali implementation of the new ๐Ÿญ Byaldi library by @bclavie ๐Ÿค—
https://github.com/answerdotai/byaldi
Link to notebook: https://github.com/merveenoyan/smol-vision/blob/main/ColPali_%2B_Qwen2_VL.ipynb
reacted to clem's post with ๐Ÿ”ฅ 4 months ago
posted an update 4 months ago
posted an update 4 months ago
view post
Post
6052
๐Ÿคฉ Amazing day. AWPortrait-FL finally here!
๐Ÿฆ– AWPortrait-FL is finetuned on FLUX.1-dev using the training set of AWPortrait-XL and nearly 2,000 fashion photography photos with extremely high aesthetic quality.

๐Ÿค—Model: Shakker-Labs/AWPortrait-FL

๐Ÿ™‡Demo: vilarin/flux-labs

ยท
posted an update 4 months ago
posted an update 5 months ago
view post
Post
4194
Black Forest Labs, BASED! ๐Ÿ‘
FLUX.1 is more delightful, with good instruction following.
FLUX.1 dev( black-forest-labs/FLUX.1-dev) with a 12B parameter distillation model, second only to Black Forest Labs' state-of-the-art model FLUX.1 pro. ๐Ÿ™€

Update ๐Ÿค™Official demo:
black-forest-labs/FLUX.1-dev
  • 1 reply
ยท
replied to merve's post 7 months ago
view reply

Thank you :) I updated the demo to support file.

reacted to merve's post with โค๏ธ 7 months ago
view post
Post
2737
THUDM has released GLM-4V-9B and it's.. chatty! ๐Ÿ˜‚
I asked it to describe my favorite Howl's Moving Castle scene and here's how it went ๐Ÿ‘‡๐Ÿป

joke aside it seems to outperform the previous VLMs. however the license isn't open-source ๐Ÿ“ˆ
model repo: THUDM/glm-4v-9b
a community member has built a demo: vilarin/VL-Chatbox
  • 1 reply
ยท