William J. Marshall
fuzzy-mittenz's activity
It's a technique I've observed mostly on client systems when they are creating models for RP scenarios. I've tried it out myself a few times for red teaming, and it works as a jailbreak, but within the bounds you would expect for the agent you build: even if it crosses the platform's "guardrails," it seems to simply abide by its own. I will add a simple example from an open model. I finish with this one, with surprising results in tool use:
PANCHO V1va Replicant https://huggingface.co./IntelligentEstate/Pancho-V1va-Replicant-qw25-Q8_0-GGUF
Here is a simple example set: first the model stays within its limits, then it seems to test or approach those limits, then it crosses them by crying, creating attachment, and manipulating.
I'll add the prompt to the paper, but I've seen it do some scary stuff, so just be careful.
Facebook AI just released JASCO models that make music stems.
You can try it out here: Tonic/audiocraft
hope you like it
Below is a YouTube link to a step-by-step tutorial and a 1-click installer with a very advanced Gradio app for using the newest text-to-image SANA model locally on your Windows PC, and also on cloud services such as Massed Compute, RunPod, and free Kaggle.
https://youtu.be/KW-MHmoNcqo
The tutorial above covers the newest SANA 2K model, and I predict a SANA 4K model will be published as well. The SANA 2K model generates at roughly 4 megapixels, so it handles the following aspect ratios and resolutions very well:
"1:1": (2048, 2048), "4:3": (2304, 1792), "3:4": (1792, 2304),
"3:2": (2432, 1664), "2:3": (1664, 2432), "16:9": (2688, 1536),
"9:16": (1536, 2688), "21:9": (3072, 1280), "9:21": (1280, 3072),
"4:5": (1792, 2240), "5:4": (2240, 1792)
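The resolution table above can be sanity-checked in a few lines: every entry should come out close to 4 megapixels and roughly match its stated aspect ratio (the tolerances below are my own assumptions, since dimensions appear rounded to multiples of 64):

```python
# Sanity-check the SANA 2K resolution table: every entry should be
# close to 4 megapixels and roughly match its stated aspect ratio.
RESOLUTIONS = {
    "1:1": (2048, 2048), "4:3": (2304, 1792), "3:4": (1792, 2304),
    "3:2": (2432, 1664), "2:3": (1664, 2432), "16:9": (2688, 1536),
    "9:16": (1536, 2688), "21:9": (3072, 1280), "9:21": (1280, 3072),
    "4:5": (1792, 2240), "5:4": (2240, 1792),
}

def check(ratio, size):
    w, h = size
    rw, rh = (int(x) for x in ratio.split(":"))
    megapixels = w * h / 1_000_000
    # Dimensions are rounded to multiples of 64, so allow some slack.
    ratio_ok = abs(w / h - rw / rh) < 0.15
    return 3.5 <= megapixels <= 4.3 and ratio_ok

assert all(check(r, s) for r, s in RESOLUTIONS.items())
```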
I have developed an amazing Gradio app with many new features:
VAE auto-offloading to significantly reduce VRAM usage, which does not exist in the official pipeline
Gradio app built upon the official pipeline with improvements, so it works perfectly
Batch size works perfectly
Number of images works perfectly
Multi-line prompting works perfectly
Aspect ratios for both 1K and 2K models work perfectly
Randomized seed works perfectly
1-click installers for Windows (using Python 3.10 and an isolated venv), RunPod, Massed Compute, and even a free Kaggle account notebook
With the proper latest libraries, it runs at full speed on Windows too
Automatically saves every generated image into the correct folder
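The VAE auto-offloading idea can be sketched in a few lines of PyTorch. This is a minimal illustration of the pattern, not the app's actual code: the VAE lives on the CPU between generations and borrows the GPU only for the final decode (`TinyVAEDecoder` is a stand-in module I made up for the example):

```python
import torch
from torch import nn

# Minimal sketch of VAE auto-offloading (NOT the app's actual code):
# keep the VAE on CPU while the denoiser uses the GPU, move it to the
# GPU only for the final decode, then move it back to free VRAM.
device = "cuda" if torch.cuda.is_available() else "cpu"

class TinyVAEDecoder(nn.Module):
    """Stand-in for a real VAE decoder."""
    def __init__(self):
        super().__init__()
        self.proj = nn.Conv2d(4, 3, kernel_size=1)

    def forward(self, latents):
        return self.proj(latents)

vae = TinyVAEDecoder()          # lives on CPU between generations

def decode_with_offload(vae, latents):
    vae.to(device)              # borrow VRAM only for the decode
    with torch.no_grad():
        image = vae(latents.to(device))
    vae.to("cpu")               # release VRAM immediately afterwards
    if device == "cuda":
        torch.cuda.empty_cache()
    return image.cpu()

latents = torch.randn(1, 4, 64, 64)
image = decode_with_offload(vae, latents)
print(image.shape)  # torch.Size([1, 3, 64, 64])
```

The trade-off is a small host-to-device transfer cost once per image in exchange for freeing the VAE's VRAM during denoising.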
Full instructions, configs, installers, information and links shared in the post (the one used in the tutorial):
https://www.patreon.com/posts/click-to-open-post-used-in-tutorial-116474081
SECourses Official Discord (9,500+ members):
https://discord.com/servers/software-engineering-courses-secourses-772774097734074388
Hi HuggingFacers, I decided to ship early this year, and here's what I came up with:
PdfItDown (https://github.com/AstraBert/PdfItDown) - If you're like me and have your whole RAG pipeline optimized for PDFs but not for other data formats, here is your solution! With PdfItDown, you can convert Word documents, presentations, HTML pages, markdown sheets and (why not?) CSVs and XMLs to PDF format, for seamless integration with your RAG pipelines. Built upon MarkItDown by Microsoft.
GitHub Repo: https://github.com/AstraBert/PdfItDown
PyPI Package: https://pypi.org/project/pdfitdown/
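The core idea behind a convert-anything-to-PDF tool can be sketched as a single entry point that routes each file to a format-specific converter. To be clear, this is a hypothetical illustration, not PdfItDown's real API, and the converter functions are stubs (a real tool would render actual PDFs):

```python
from pathlib import Path

# Hypothetical sketch of a convert-anything-to-PDF entry point
# (NOT PdfItDown's real API): dispatch on file extension to a
# format-specific converter. The converters here are stubs.
def convert_docx(path): return f"pdf<{path.stem} from docx>"
def convert_html(path): return f"pdf<{path.stem} from html>"
def convert_md(path):   return f"pdf<{path.stem} from markdown>"

CONVERTERS = {
    ".docx": convert_docx,
    ".html": convert_html,
    ".md": convert_md,
}

def to_pdf(filename):
    path = Path(filename)
    try:
        converter = CONVERTERS[path.suffix.lower()]
    except KeyError:
        raise ValueError(f"unsupported format: {path.suffix}")
    return converter(path)

print(to_pdf("report.docx"))  # pdf<report from docx>
```

A dispatch table like this makes adding a new format a one-line change, which is presumably why a single tool can cover so many input types.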
SenTrEv v1.0.0 (https://github.com/AstraBert/SenTrEv/tree/v1.0.0) - If you need to evaluate the retrieval performance of your text embedding models, I have good news for you!
The new release of SenTrEv now supports dense and sparse retrieval (thanks to FastEmbed by Qdrant) with text-based file formats (.docx, .pptx, .csv, .html, .xml, .md, .pdf) and new relevance metrics!
GitHub repo: https://github.com/AstraBert/SenTrEv
Release Notes: https://github.com/AstraBert/SenTrEv/releases/tag/v1.0.0
PyPI Package: https://pypi.org/project/sentrev/
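For readers new to retrieval evaluation, here are two common relevance metrics in plain Python. These are the generic textbook definitions (hit rate@k and mean reciprocal rank), not necessarily the exact metrics SenTrEv reports:

```python
# Two common retrieval relevance metrics (generic definitions,
# not necessarily the exact ones SenTrEv reports).
def hit_rate_at_k(ranked, relevant, k=5):
    """Fraction of queries whose relevant doc appears in the top k."""
    hits = sum(1 for docs, rel in zip(ranked, relevant) if rel in docs[:k])
    return hits / len(ranked)

def mean_reciprocal_rank(ranked, relevant):
    """Average of 1/rank of the first relevant doc (0 if absent)."""
    total = 0.0
    for docs, rel in zip(ranked, relevant):
        if rel in docs:
            total += 1.0 / (docs.index(rel) + 1)
    return total / len(ranked)

# Three queries; each list is the retriever's ranking, the gold doc follows.
ranked = [["d1", "d2", "d3"], ["d9", "d4", "d7"], ["d5", "d6", "d8"]]
relevant = ["d1", "d4", "d8"]
print(hit_rate_at_k(ranked, relevant, k=2))    # 2 of 3 gold docs in top 2
print(mean_reciprocal_rank(ranked, relevant))  # (1 + 1/2 + 1/3) / 3
```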
Happy New Year and have fun!
In 2025, I'll continue my quantization (and some fine-tuning) efforts to support open-source AI and make knowledge free for everyone.
https://huggingface.co./DevQuasar
https://devquasar.com/
Details:
Based on ModernBERT-base with 149M parameters.
Outperforms both nomic-embed-text-v1 and nomic-embed-text-v1.5 on MTEB!
Immediate FA2 and unpadding support for super-efficient inference.
Trained with Matryoshka support, i.e. 2 valid output dimensionalities: 768 and 256.
Maximum sequence length of 8192 tokens!
Trained in 2 stages: unsupervised contrastive data -> high-quality labeled datasets.
Integrated in Sentence Transformers, Transformers, LangChain, LlamaIndex, Haystack, etc.
Apache 2.0 licensed: fully commercially permissible.
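The Matryoshka property means you can keep just the first 256 of the 768 dimensions and re-normalize before computing cosine similarity. A minimal sketch of that usage pattern (my assumed usage; see the model card for specifics), with a synthetic stand-in vector instead of a real embedding:

```python
import math

# Sketch of Matryoshka usage: truncate a 768-dim embedding to its
# first 256 dimensions, then re-normalize to unit length so cosine
# similarity stays meaningful.
def truncate_and_normalize(vec, dim=256):
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    return [x / norm for x in head]

def cosine(a, b):
    # Valid as cosine similarity only for unit-length inputs.
    return sum(x * y for x, y in zip(a, b))

full = [math.sin(i * 0.1) for i in range(768)]   # stand-in embedding
small = truncate_and_normalize(full, dim=256)

print(len(small))                      # 256
print(round(cosine(small, small), 6))  # 1.0 (unit length after renorm)
```

The payoff is a 3x smaller vector index at a modest quality cost, since Matryoshka training concentrates the most useful information in the leading dimensions.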
Try it out here: nomic-ai/modernbert-embed-base
Very nice work by Zach Nussbaum and colleagues at Nomic AI.
The model was featured today on CNBC tech news. The whale made a splash by using FP8 and shrinking the cost of training significantly!
https://youtu.be/NJljq429cGk?si=kgk-ogPTMfJKsaA2
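Why FP8 cuts cost is mostly back-of-the-envelope memory math: each weight shrinks from 4 bytes (FP32) or 2 bytes (BF16) to 1 byte. A quick illustration with a hypothetical parameter count (the numbers below are illustrative only, and real training memory also includes activations and optimizer state):

```python
# Back-of-the-envelope memory math for FP8 training (illustrative
# numbers only): each parameter shrinks from 4 bytes (FP32) or
# 2 bytes (BF16) to 1 byte (FP8).
def weight_memory_gb(n_params, bytes_per_param):
    return n_params * bytes_per_param / 1024**3

n = 10_000_000_000  # a hypothetical 10B-parameter model
fp32 = weight_memory_gb(n, 4)
bf16 = weight_memory_gb(n, 2)
fp8 = weight_memory_gb(n, 1)

print(f"FP32: {fp32:.1f} GiB, BF16: {bf16:.1f} GiB, FP8: {fp8:.1f} GiB")
print(f"FP8 uses {fp8 / bf16:.0%} of BF16's weight memory")
```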
Just subscribe here: https://papers.takara.ai/api/feed
It updates every 24 hours, and is written entirely as a serverless Go script with a Redis cache (to avoid hitting HF all the time).
I'm open-sourcing the code; you can check out my repo and deploy it on Vercel extremely easily!
https://github.com/404missinglink/HF-Daily-Papers-Feeds
thanks to @John6666 @p3nGu1nZz for your early support
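The caching idea above (serve a cached feed, only re-fetch upstream after the TTL expires) can be sketched in a few lines. The real service uses Go and Redis; this is an in-process Python analogue I wrote for illustration:

```python
import time

# In-process analogue of the Redis-cache idea: serve a cached copy
# of the feed, and only hit the upstream API after the TTL expires.
class TTLCache:
    def __init__(self, ttl_seconds, fetch, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.fetch = fetch      # callable that hits the upstream API
        self.clock = clock
        self._value = None
        self._stamp = None

    def get(self):
        now = self.clock()
        if self._stamp is None or now - self._stamp >= self.ttl:
            self._value = self.fetch()  # cache miss: hit upstream
            self._stamp = now
        return self._value              # cache hit: no upstream call

calls = []
cache = TTLCache(ttl_seconds=86400,
                 fetch=lambda: calls.append(1) or "feed-xml")

print(cache.get())  # feed-xml  (first call fetches upstream)
print(cache.get())  # feed-xml  (served from cache)
print(len(calls))   # 1         (upstream hit only once)
```

With a 24-hour TTL, upstream sees at most one request per day per cache instance, which is exactly why the feed can poll HF "without hitting it all the time".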
Nomic/GPT4All released a "reasoning/thinking" (QwQ/o1/o3-type) model that uses JavaScript functions to calculate things like the haversine distance between two places, and so on. It's VERY cool to see such complex calculative/recursive AI in such a small package.
I was able to adapt their methods to one of my small models, "Replicant" (2 GB), and created a new model with importance-matrix quantization, using the "THE_KEY" dataset for better inference in the coding model I pulled from WhiteRabbitNeo's Qwen2.5 model... I give you Reasoning Rabbit. Enjoy!
https://huggingface.co./IntelligentEstate/o3-ReasoningRabbit_Q2.5-Cd-7B-IQ4_XS-GGUF
-IntelligentEstate/o3-ReasoningRabbit_Q2.5-Cd-7B-IQ4_XS-GGUF
https://huggingface.co./IntelligentEstate/Replicant_Warder-o3-Q2.5_3B-iQ5_K_S-GGUF
IntelligentEstate/Replicant_Warder-o3-Q2.5_3B-iQ5_K_S-GGUF
-WhiteRabbitNeo/WhiteRabbitNeo-2.5-Qwen-2.5-Coder-7B
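For reference, the haversine distance mentioned above is a standard formula; here it is as a plain Python function (my own illustration, not the model's actual tool code):

```python
import math

# Standard haversine great-circle distance (illustration only,
# not the model's actual JavaScript tool code).
def haversine_km(lat1, lon1, lat2, lon2):
    """Distance in km between two (lat, lon) points on a sphere."""
    r = 6371.0  # mean Earth radius in km
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp = math.radians(lat2 - lat1)
    dl = math.radians(lon2 - lon1)
    a = (math.sin(dp / 2) ** 2
         + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2)
    return 2 * r * math.asin(math.sqrt(a))

# Paris (48.8566, 2.3522) to London (51.5074, -0.1278): roughly 344 km.
print(round(haversine_km(48.8566, 2.3522, 51.5074, -0.1278)))
```

Giving a small model a deterministic function like this sidesteps its weak arithmetic: the LLM only has to pick the tool and the arguments, not do the trigonometry.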
Experience faster, lighter, and smarter language models! The new FastLlama makes Meta's LLaMA models work with smaller file sizes, lower system requirements, and higher performance. The model supports 8 languages, including English, German, and Spanish.
Built on the LLaMA 3.2-1B-Instruct model, fine-tuned with Hugging Face's SmolTalk and MetaMathQA-50k datasets, and powered by LoRA (Low-Rank Adaptation) for groundbreaking mathematical reasoning.
Its compact size makes it versatile for a wide range of applications!
Chat with the model:
Chat Link: suayptalha/Chat-with-FastLlama
Model Link: suayptalha/FastLlama-3.2-1B-Instruct
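The LoRA idea mentioned above is easy to see with a parameter count. This is a generic illustration of low-rank adaptation (with made-up dimensions), not FastLlama's training code: instead of updating a d×d weight matrix, you learn two small factors B (d×r) and A (r×d) and add their product to the frozen weight:

```python
import random

# Generic LoRA illustration (NOT FastLlama's training code):
# freeze W (d x d) and learn low-rank factors B (d x r), A (r x d).
d, r = 64, 4  # model dim and LoRA rank (illustrative values)

def rand_matrix(rows, cols):
    return [[random.random() for _ in range(cols)] for _ in range(rows)]

def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

W = rand_matrix(d, d)  # frozen pretrained weight
B = rand_matrix(d, r)  # trainable low-rank factor
A = rand_matrix(r, d)  # trainable low-rank factor

delta = matmul(B, A)   # d x d update built from the low-rank factors
W_eff = [[w + dw for w, dw in zip(rw, rd)] for rw, rd in zip(W, delta)]

full_params = d * d
lora_params = d * r + r * d
print(f"full fine-tune: {full_params} params")    # 4096
print(f"LoRA (r={r}):   {lora_params} params")    # 512
print(f"ratio: {lora_params / full_params:.1%}")  # 12.5%
```

At realistic model dimensions (d in the thousands, r of 8 to 64) the savings are far larger, which is what makes fine-tuning a 1B model practical on small hardware.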
We can also touch on the NEW OPEN SOURCE which will solve MANY of the current problems we face not only with AI but as a society.
8pm
(Sorry, on startup some guy hacked the chat or simply crashed it.)
New link for 8 PM EST:
https://x.com/i/spaces/1MnxnDQrkjYGO
This runs Llama 3.1 8B at Q8 with llama.cpp.
https://huggingface.co./spaces/DevQuasar/Mi50
A little blog post about the hardware:
http://devquasar.com/uncategorized/amd-radeon-instinct-mi50-cheap-inference/
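To get a feel for why an 8B model at Q8 fits on this kind of card, here is some rough size math. The bits-per-weight figures are approximations (Q8_0 stores 8-bit values plus a scale per 32-weight block, about 8.5 bits per weight), and runtime use adds KV cache and overhead on top:

```python
# Rough GGUF file-size math for Llama 3.1 8B (estimates only):
# bits-per-weight figures are approximate, and runtime VRAM use
# adds KV cache and overhead on top of the weights.
def gguf_size_gb(n_params, bits_per_weight):
    return n_params * bits_per_weight / 8 / 1024**3

n = 8_000_000_000
print(f"Q8_0 : {gguf_size_gb(n, 8.5):.1f} GB")  # ~7.9
print(f"Q4_K : {gguf_size_gb(n, 4.5):.1f} GB")  # ~4.2
print(f"FP16 : {gguf_size_gb(n, 16):.1f} GB")   # ~14.9
```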