Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20 • 86
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 87
KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2 • 16
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 67
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published Nov 12 • 62
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published 30 days ago • 40
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Paper • 2403.09629 • Published Mar 14 • 75
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning Paper • 2406.12050 • Published Jun 17 • 19
Top LLM Collection A collection of top open-source LLMs, sorted best-first • 6 items • Updated Jul 26 • 13
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published Nov 5 • 63
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published Oct 31 • 59
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations Paper • 2410.02707 • Published Oct 3 • 47
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper • 2406.08464 • Published Jun 12 • 65
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published Oct 3 • 52