Xi's picture

Xi

xi0v

·

AI & ML interests

Reinforcement learning, Diffusion Model Merging, LLM Merging, Model Editing and Vision/Multimodal Model Fine-tuning.

Recent Activity

liked a model about 4 hours ago

John6666/spring-il-v10-sdxl

liked a model about 4 hours ago

John6666/nooblima-big-version-sdxl

liked a model about 4 hours ago

John6666/noob-and-ntr-mix-v2-sdxl

View all activity

Organizations

xi0v's activity

upvoted a paper about 12 hours ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 3 days ago • 61

upvoted a paper about 13 hours ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published 3 days ago • 45

upvoted a collection 3 days ago

cool datasets

154 items • Updated about 5 hours ago • 15

upvoted an article 5 days ago

Article

G2P Shrinks Speech Models

By

•

Feb 5

• 35

upvoted a collection 5 days ago

granite-abliterated

3 items • Updated 4 days ago • 3

upvoted an article 6 days ago

Article

Custom architectures with HuggingFace 🤗

By

•

Apr 22, 2024

• 26

upvoted a paper 8 days ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published 12 days ago • 25

upvoted a paper 9 days ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published 11 days ago • 26

upvoted an article 9 days ago

Article

Common AI Model Formats

By

•

10 days ago

• 27

upvoted 2 papers 10 days ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published 13 days ago • 67

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 12 days ago • 67

upvoted a paper 11 days ago

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 18 days ago • 66

upvoted 2 papers 12 days ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published 13 days ago • 26

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 17 days ago • 160

upvoted 2 papers 13 days ago

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published 18 days ago • 33

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 17 days ago • 83

upvoted 3 papers 15 days ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 17 days ago • 59

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published 17 days ago • 24

S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published 19 days ago • 28

upvoted a paper 16 days ago

Thinking Preference Optimization

Paper • 2502.13173 • Published 20 days ago • 17