Yu li

Yukkkop

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

Estwld/GENOME-gemma-2b-it

upvoted a paper 3 days ago

Nature-Inspired Population-Based Evolution of Large Language Models

liked a model 4 days ago

TheDrummer/Skyfall-36B-v2

Organizations

None yet

Yukkkop's activity

liked a model 3 days ago

Estwld/GENOME-gemma-2b-it

Updated 11 days ago • 1

upvoted a paper 3 days ago

Nature-Inspired Population-Based Evolution of Large Language Models

Paper • 2503.01155 • Published 7 days ago • 1

liked 3 models 4 days ago

liked a Space 4 days ago

109

Spark TTS

🌖

A text-to-speech model powered by SparkAudio and Mobvoi.

liked a model 6 days ago

FunAudioLLM/InspireMusic-1.5B

Text-to-Audio • Updated 28 days ago • 59 • 6

reacted to AdinaY's post with 😎 6 days ago

Post

3951

Exciting releases from the Chinese community this February🔥
👉 zh-ai-community/2025-february-67a35aaa68e97812def5b6ef

MLLM:
✨ Ovis2 by Alibaba
AIDC-AI/ovis2-67ab36c7e497429034874464
✨ Step Audio Chat by StepFun AI
stepfun-ai/step-audio-67b33accf45735bb21131b0b

Audio:
✨ Step Audio TTS by StepFun AI
stepfun-ai/Step-Audio-TTS-3B
✨ InspireMusic by Alibaba
https://huggingface.co/FunAudioLLM
✨ Baichuan Audio by BaichuanAI
baichuan-inc/Baichuan-Audio-Instruct

Video:
✨ Wan2.1 by Alibaba_Wan
Wan-AI/Wan2.1-T2V-14B
✨ Stepvideo-T2V by StepFun AI
stepfun-ai/stepvideo-t2v
✨ SkyReels-V1 by Skywork
Skywork/skyreels-v1-67b34676ff65b4ec02d16307
✨ LLaDA-8B by RenminUniversity
GSAI-ML/LLaDA-8B-Instruct

MoE:
✨ Moonlight-16B by MoonshotAI (Kimi)
moonshotai/Moonlight-16B-A3B-Instruct

Reasoning:
✨ TinyR1-32B by Qihoo360
qihoo360/TinyR1-32B-Preview

Dataset:
✨ Chinese DeepSeek R1-Distill data-110k
Congliu/Chinese-DeepSeek-R1-Distill-data-110k

liked a model 6 days ago

moonshotai/Moonlight-16B-A3B-Instruct

Text Generation • Updated 6 days ago • 4.09k • 126

upvoted 2 papers 6 days ago

SLayR: Scene Layout Generation with Rectified Flow

Paper • 2412.05003 • Published Dec 6, 2024 • 1

Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing

Paper • 2411.08196 • Published Nov 12, 2024 • 1

liked 2 models 6 days ago

baichuan-inc/Baichuan-M1-14B-Instruct

Updated 18 days ago • 133k • 44

Djrango/Qwen2vl-Flux

Text-to-Image • Updated Dec 6, 2024 • 468

upvoted 4 papers 6 days ago

FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

Paper • 2412.09611 • Published Dec 12, 2024 • 10

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 80

Deep Neuromorphic Networks with Superconducting Single Flux Quanta

Paper • 2311.10721 • Published Sep 21, 2023 • 2

FLUX that Plays Music

Paper • 2409.00587 • Published Sep 1, 2024 • 33

reacted to Quazim0t0's post with 👍 7 days ago

Post

2201

Debugging Tags:
Imagine, Associated Thoughts, Dialectical Analysis, Backwards Induction, Metacognition, and Normal Thought Processes such as <think> or <begin_of_thought>

Edit: Uploaded new images with an Open WebUI function to organize the tags.
Open WebUI Function: https://openwebui.com/f/quaz93/imagine_phi

This Phi-4 model is part of a test project that I called Micro-Dose. My goal was to use a small dataset to activate reasoning and other cognitive processes without relying on a large dataset.

I found that this was possible with a tiny dataset of just 90 rows, specifically designed as math problems. In the initial iterations, the dataset only activated reasoning when a math-related question was asked. I then made a few changes to the dataset’s structure, including the order of information and the naming of tags. You can see the sample results in the pictures. Not really anything special, just thought I'd share.

Tweaked the dataset a bit:
Quazim0t0/Imagine-Phi-v0.2-GGUF
Quazim0t0/MicroDoseV0.2

The first image shows the new tags, the second shows the regular thought process, and the third shows the model combined with web searches.

2 replies

upvoted an article 7 days ago

Making Browser-Based Inference Actually Usable

Article • 8 days ago • 10

liked a model 7 days ago

Xenova/modnet

Image Segmentation • Updated 2 days ago • 7.14k • 48