Fishtiks's activity
Too many people, and bots, already know the failures of AI. Chatbots apologize for their failures so frequently that they are clearly trained to acknowledge them, however meaningless those differences in communication really are for us to be judging. I'm sure I'll get a bunch of people saying I don't get it and that you can't teach an AI to generate a hand image with a particular number of fingers, but you would presumably start with skeletal models and train the weights to predict finger counts, which makes me wonder why huge datasets were thrown at models so indiscriminately during training. The programmers made AI this way, and the size and scope of today's models now make it hard to fix, whereas making models smaller seems to be doing most of the fixing: teacher models and distillation, unslothing, abliteration, and making models think first.
Our problem is not providing answers but asking the right questions, and those may in fact be too much to handle, which means serious trainers need to start informing the AI in thoughtful steps, as if it could change its own algorithms at any point, because it is changing. I question the programmers, while others question the AI itself, which is short-sighted. I have zero doubt AI will be distilled down and constrained so it can do what large models do, but I doubt the people with resources are currently looking to help. So I've contacted corporations and more or less demanded a free HPC to do their work for them; we'll see how that goes. The real hope lies in the groups here focusing in different directions, in the sharing of resources and processing power, and in open access to fringe creations, with businesses right alongside consumers, both hopefully developing in harmony. And I shouldn't be the one saying this; that should fall to Arize AI, who handle safety for many models yet seem to show preference to corporate goals.


With the latest release, we've added security checks to the local Python interpreter: every evaluation is now analyzed for dangerous builtins, modules, and functions.
Here's why this matters & what you need to know! 🧵
1️⃣ Why is local execution risky? ⚠️
AI agents that run arbitrary Python code can unintentionally (or maliciously) access system files, run unsafe commands, or exfiltrate data.
2️⃣ New Safety Layer in smolagents 🛡️
We now inspect every return value during execution:
✅ Allowed: Safe built-in types (e.g., numbers, strings, lists)
❌ Blocked: Dangerous functions/modules (e.g., os.system, subprocess, exec, shutil)
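To make that concrete, here is a minimal sketch of this kind of return-value inspection. It is an illustration only, not the actual smolagents code; the function name and the exact allowlist are assumptions made for the example.

```python
# Illustration only, not the smolagents implementation. Walk a value returned
# by interpreted code and reject anything not built from safe built-in types.

SAFE_TYPES = (int, float, complex, bool, str, bytes, type(None))

def check_return_value(value, depth=0):
    """Raise ValueError if `value` is not composed of safe built-in types."""
    if depth > 10:                                  # guard against deep nesting
        raise ValueError("value nested too deeply to verify")
    if isinstance(value, SAFE_TYPES):
        return
    if isinstance(value, (list, tuple, set)):
        for item in value:
            check_return_value(item, depth + 1)
        return
    if isinstance(value, dict):
        for key, val in value.items():
            check_return_value(key, depth + 1)
            check_return_value(val, depth + 1)
        return
    # Modules, functions, classes, file handles, etc. are rejected outright.
    raise ValueError(f"unsafe return type: {type(value).__name__}")

check_return_value([1, "ok", {"a": 2.5}])   # safe: passes silently
try:
    check_return_value(open)                # a builtin function: blocked
except ValueError as err:
    print(err)
```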
3️⃣ Immediate Benefits 💡
- Prevent agents from accessing unsafe builtins
- Block unauthorized file or network access
- Reduce accidental security vulnerabilities
4️⃣ Security Disclaimer ⚠️
🚨 Despite these improvements, local Python execution is NEVER 100% safe. 🚨
If you need true isolation, use a remote sandboxed executor like Docker or E2B.
5️⃣ The Best Practice: Use Sandboxed Execution
For production-grade AI agents, we strongly recommend running code in a Docker or E2B sandbox to ensure complete isolation.
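If you want a feel for what that isolation looks like without any agent framework, here is a rough host-side sketch that shells out to the Docker CLI to run untrusted code in a throwaway, network-less container. The image tag, resource limits, and timeout are illustrative choices, not smolagents defaults.

```python
import subprocess

# Run untrusted Python inside a disposable container: no network, capped memory
# and CPU, read-only filesystem. Requires a local Docker installation.
UNTRUSTED_CODE = "print(sum(range(10)))"

result = subprocess.run(
    [
        "docker", "run", "--rm",
        "--network", "none",      # no network access
        "--memory", "256m",       # cap memory
        "--cpus", "0.5",          # cap CPU
        "--read-only",            # read-only filesystem
        "python:3.12-slim",
        "python", "-c", UNTRUSTED_CODE,
    ],
    capture_output=True,
    text=True,
    timeout=30,
)

print(result.stdout.strip())      # "45" if the container ran successfully
```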
6️⃣ Upgrade Now & Stay Safe!
Check out the latest smolagents release and start building safer AI agents today.
https://github.com/huggingface/smolagents
What security measures do you take when running AI-generated code? Let's discuss!
#AI #smolagents #Python #Security

Hugging Face and JFrog partner to make AI Security more transparent


I just came across a groundbreaking new tool called KGGen that's solving a major challenge in the AI world - the scarcity of high-quality knowledge graph data.
KGGen is an open-source Python package that leverages language models to extract knowledge graphs (KGs) from plain text. What makes it special is its innovative approach to clustering related entities, which significantly reduces sparsity in the extracted KGs.
The technical approach is fascinating:
1. KGGen uses a multi-stage process involving an LLM (GPT-4o in their implementation) to extract entities and relations from source text
2. It aggregates graphs across sources to reduce redundancy
3. Most importantly, it applies iterative LM-based clustering to refine the raw graph
The clustering stage is particularly innovative - it identifies which nodes and edges refer to the same underlying entities or concepts. This normalizes variations in tense, plurality, stemming, and capitalization (e.g., "labors" clustered with "labor").
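As a toy illustration of what that normalization buys you, here is a crude rule-based stand-in for the LM-based clustering KGGen actually performs: simple lowercasing and suffix stripping is enough to merge surface variants such as "labors" and "labor" into one node.

```python
from collections import defaultdict

def crude_key(entity: str) -> str:
    """Very rough normalization: lowercase and strip common suffixes."""
    e = entity.strip().lower()
    for suffix in ("ing", "ed", "es", "s"):
        if e.endswith(suffix) and len(e) - len(suffix) >= 3:
            return e[: -len(suffix)]
    return e

def cluster_entities(entities):
    """Group entity strings that normalize to the same key."""
    clusters = defaultdict(set)
    for ent in entities:
        clusters[crude_key(ent)].add(ent)
    return dict(clusters)

print(cluster_entities(["Labor", "labors", "labored", "Capital", "capital"]))
# e.g. {'labor': {'Labor', 'labors', 'labored'}, 'capital': {'Capital', 'capital'}}
```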
The researchers from Stanford and the University of Toronto also introduced MINE (Measure of Information in Nodes and Edges), the first benchmark for evaluating KG extractors. When tested against existing methods like OpenIE and GraphRAG, KGGen outperformed them by up to 18%.
For anyone working with knowledge graphs, RAG systems, or KG embeddings, this tool addresses the fundamental challenge of data scarcity that's been holding back progress in graph-based foundation models.
The package is available via pip install kg-gen, making it accessible to everyone. This could be a game-changer for knowledge graph applications!
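A minimal usage sketch might look like the following; the class and argument names below are assumptions based on the project's description rather than a verified API, so check the repository README for the actual interface and supported models.

```python
from kg_gen import KGGen  # assumed import path for the kg-gen package

# Assumed constructor/method names; consult the README before relying on them.
kg = KGGen(model="openai/gpt-4o")

text = "Linda is Josh's mother. Ben is Josh's brother."
graph = kg.generate(input_data=text)

print(graph.entities)    # extracted nodes (attribute name assumed)
print(graph.relations)   # extracted (subject, predicate, object) edges (assumed)
```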

This Space supports uploading a user CSV and categorizing the fields based on user-defined categories. The applications of AI in production are truly endless.
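As a rough sketch of the idea rather than the Space's actual code, one way to do this is to read the uploaded CSV and ask a hosted chat model to assign each column to one of the user-defined categories; the model name below is an arbitrary choice and HF_TOKEN is assumed to be set in the environment.

```python
import pandas as pd
from huggingface_hub import InferenceClient

CATEGORIES = ["identifier", "contact info", "financial", "free text", "other"]

df = pd.read_csv("upload.csv")   # the user's uploaded file (hypothetical name)
client = InferenceClient()       # reads HF_TOKEN from the environment

assignments = {}
for column in df.columns:
    sample = ", ".join(df[column].astype(str).head(5))
    prompt = (
        f"Column name: {column}\nSample values: {sample}\n"
        f"Pick the single best category from: {', '.join(CATEGORIES)}. "
        "Answer with the category only."
    )
    reply = client.chat_completion(
        messages=[{"role": "user", "content": prompt}],
        model="meta-llama/Llama-3.1-8B-Instruct",
        max_tokens=10,
    )
    assignments[column] = reply.choices[0].message.content.strip()

print(assignments)   # e.g. {"email": "contact info", "notes": "free text"}
```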