s3nh's picture

s3nh

s3nh

AI & ML interests

Quantization, LLMs, Deep Learning for good. Follow me if you like my work. Patreon.com/s3nh

Recent Activity

reacted to MonsterMMORPG's post with πŸ”₯ 8 days ago
Wan 2.1 Ultra Advanced Gradio APP for - Works as low as 4GB VRAM - 1-Click Installers for Windows, RunPod, Massed Compute - Batch Processing - T2V - I2V - V2V Installer and APP : https://www.patreon.com/posts/123105403 Download from here : https://www.patreon.com/posts/123105403 I have been working 14 hours today to make this APP before sleeping for you guys :) We have all the features of Wan 2.1 model Text to Video 1.3B (as low as 3.5 GB VRAM) - Really fast - 480x832px or 832x480px Video to Video 1.3B (as low as 3.5 GB VRAM) - Really fast - 480x832px or 832x480px Text to Video 14B (as low as 17 GB VRAM) - still may work at below VRAM but slower - 720x1280px or 1280x720px Image to Video 14B (as low as 17 GB VRAM) - still may work at below VRAM but slower - 720x1280px or 1280x720px When you analyze the above and below images First video is animated from the input image with following prompt A hooded wraith stands motionless in a torrential downpour, lightning cracking across the stormy sky behind it. Its face is an impenetrable void of darkness beneath the tattered hood. Rain cascades down its ragged, flowing cloak, which appears to disintegrate into wisps of shadow at the edges. The mysterious figure holds an enormous sword of pure energy, crackling with electric blue lightning that pulses and flows through the blade like liquid electricity. The weapon drags slightly on the wet ground, sending ripples of power across the puddles forming at the figure's feet. Three glowing blue gems embedded in its chest pulse in rhythm with the storm's lightning strikes, each flash illuminating the decaying, ancient fabric of its attire. The rain intensifies around the figure, droplets seemingly slowing as they near the dark entity, while forks of lightning repeatedly illuminate its imposing silhouette. The atmosphere grows heavier with each passing moment as the wraith slowly raises its crackling blade, the blue energy intensifying and casting eerie shadows
liked a model 20 days ago
louaaron/sedd-small
liked a model 26 days ago
onnx-community/Kokoro-82M-v1.0-ONNX
View all activity

Organizations

ESPnet's profile picture Gradio-Blocks-Party's profile picture Lajonbot's profile picture The Waifu Research Department's profile picture AblateIt's profile picture Blog-explorers's profile picture BangumiBase's profile picture CyberHarem's profile picture HydraLM's profile picture GOAT.AI's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Social Post Explorers's profile picture Spinner-GPT-4's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture Smol Community's profile picture

s3nh's activity

reacted to MonsterMMORPG's post with πŸ”₯ 8 days ago
view post
Post
2358
Wan 2.1 Ultra Advanced Gradio APP for - Works as low as 4GB VRAM - 1-Click Installers for Windows, RunPod, Massed Compute - Batch Processing - T2V - I2V - V2V

Installer and APP : https://www.patreon.com/posts/123105403

Download from here : https://www.patreon.com/posts/123105403

I have been working 14 hours today to make this APP before sleeping for you guys :)

We have all the features of Wan 2.1 model

Text to Video 1.3B (as low as 3.5 GB VRAM) - Really fast - 480x832px or 832x480px

Video to Video 1.3B (as low as 3.5 GB VRAM) - Really fast - 480x832px or 832x480px

Text to Video 14B (as low as 17 GB VRAM) - still may work at below VRAM but slower - 720x1280px or 1280x720px

Image to Video 14B (as low as 17 GB VRAM) - still may work at below VRAM but slower - 720x1280px or 1280x720px

When you analyze the above and below images
First video is animated from the input image with following prompt

A hooded wraith stands motionless in a torrential downpour, lightning cracking across the stormy sky behind it. Its face is an impenetrable void of darkness beneath the tattered hood. Rain cascades down its ragged, flowing cloak, which appears to disintegrate into wisps of shadow at the edges. The mysterious figure holds an enormous sword of pure energy, crackling with electric blue lightning that pulses and flows through the blade like liquid electricity. The weapon drags slightly on the wet ground, sending ripples of power across the puddles forming at the figure's feet. Three glowing blue gems embedded in its chest pulse in rhythm with the storm's lightning strikes, each flash illuminating the decaying, ancient fabric of its attire. The rain intensifies around the figure, droplets seemingly slowing as they near the dark entity, while forks of lightning repeatedly illuminate its imposing silhouette. The atmosphere grows heavier with each passing moment as the wraith slowly raises its crackling blade, the blue energy intensifying and casting eerie shadows

  • 3 replies
Β·
reacted to their post with πŸ€— about 1 month ago
view post
Post
1977
Welcome back,

Small Language Models Enthusiasts and GPU Poor oss enjoyers lets connect.
Just created an organization which main target is to have fun with smaller models tuneable on consumer range GPUs, feel free to join and lets have some fun, much love ;3

https://huggingface.co./SmolTuners
Β·
reacted to YannisTevissen's post with πŸ‘πŸ€— about 2 months ago
reacted to sayakpaul's post with πŸ”₯ 2 months ago
view post
Post
4374
Commits speak louder than words πŸ€ͺ

* 4 new video models
* Multiple image models, including SANA & Flux Control
* New quantizers -> GGUF & TorchAO
* New training scripts

Enjoy this holiday-special Diffusers release πŸ€—
Notes: https://github.com/huggingface/diffusers/releases/tag/v0.32.0
reacted to merve's post with 🧠 3 months ago
view post
Post
1815
A complete RAG pipeline includes a reranker, which ranks the documents to find the best document πŸ““
Same goes for multimodal RAG, multimodal rerankers which we can integrate to multimodal RAG pipelines!
Learn how to build a complete multimodal RAG pipeline with vidore/colqwen2-v1.0 as retriever, lightonai/MonoQwen2-VL-v0.1 as reranker, Qwen/Qwen2-VL-7B-Instruct as VLM in this notebook that runs on a GPU as small as L4 πŸ”₯ https://huggingface.co./learn/cookbook/multimodal_rag_using_document_retrieval_and_reranker_and_vlms
  • 1 reply
Β·
reacted to fdaudens's post with πŸ€— 3 months ago
view post
Post
1333
🀝 Want to share your AI models while protecting your work? Licenses are key!

Fascinating to see that nearly 60% of models on the Hub use Apache & MIT licenses.

Explore the viz here: huggingface/open-source-ai-year-in-review-2024
reacted to Lewdiculous's post with βž• 3 months ago
reacted to fdaudens's post with πŸ‘ 3 months ago
view post
Post
1396
πŸ” From instruction-following to creative storytelling, dive into 2024's most impactful AI datasets! These gems are shaping everything from scientific research to video understanding.

Check it out: huggingface/open-source-ai-year-in-review-2024
replied to louisbrulenaudet's post 3 months ago
reacted to louisbrulenaudet's post with πŸ€— 3 months ago
view post
Post
2038
I’ve published a new dataset to simplify model merging πŸ€—

This dataset facilitates the search for compatible architectures for model merging with @arcee_ai’s mergekit, streamlining the automation of high-performance merge searches πŸ“–

Dataset : louisbrulenaudet/mergekit-configs
  • 1 reply
Β·
reacted to nyuuzyou's post with πŸ‘ 3 months ago
view post
Post
1516
✈️ Aircraft Dataset & Generation Model nyuuzyou/aircraft-images & nyuuzyou/AircraftFLUX-LoRA

Dataset Features:
β€’ 165,340 high-res aircraft images with metadata
β€’ Machine-generated English captions
β€’ Detailed aircraft specs, registration & flight info
β€’ Environmental context descriptions

LoRA model specializes in:
β€’ Realistic aircraft generation
β€’ Accurate technical details for unpopular airplanes compared to black-forest-labs/FLUX.1-schnell
β€’ Proper airline liveries
β€’ Contextual aviation scenes
replied to danielhanchen's post 3 months ago
reacted to danielhanchen's post with πŸ€—πŸ‘ 3 months ago
reacted to stefan-it's post with ❀️ 3 months ago
view post
Post
1542
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.

πŸ‘‰ Link: https://github.com/stefan-it/model-garden-lms

An overview of some features:

- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS

I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!

πŸ‘‰ Model Hub Link: https://huggingface.co./model-garden-lms

If you find these resources useful, please give them a like!

Made from Bavarian Oberland with ❀️ and πŸ₯¨.
reacted to lucifertrj's post with πŸ‘€ 3 months ago
view post
Post
540
Image Prompt Engineering Guide:
➑️ Artistic styling for Image generation
➑️ Prompt weighting using the parentheses method to generate realistic images.
➑️ Advanced features like style and positioning control[experimental].
➑️ Image placement on the generated AI image using Recraft V3 Mockup.

Watch: https://www.youtube.com/watch?v=d3nUG28-jIc
replied to AtAndDev's post 3 months ago
reacted to davidberenstein1957's post with πŸ”₯ 3 months ago
replied to davidberenstein1957's post 3 months ago
view reply

Looking great, cznnot wait to test, thank you πŸ€—