Nicky

NickyNicky

AI & ML interests

None yet

Recent Activity

liked a model about 15 hours ago

google/gemma-3-4b-it

liked a model about 15 hours ago

google/timesfm-2.0-500m-pytorch

liked a model about 15 hours ago

google/gemma-3-12b-it

NickyNicky's activity

upvoted an article 2 days ago

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

and 3 others • 3 days ago • 113

upvoted a paper 22 days ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published 22 days ago • 56

upvoted 2 articles about 1 month ago

Article

Open R1: Update #2

and 6 others • about 1 month ago • 201

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Sep 27, 2024 • 40

upvoted a paper about 1 month ago

RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts

Paper • 2305.17679 • Published May 28, 2023 • 2

upvoted 7 articles about 1 month ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4 • 1.16k

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28 • 427

Article

The AI tools for Art Newsletter - Issue 1

Jan 31 • 70

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023 • 42

Article

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Jan 24 • 16

Article

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

Jan 23 • 3

Article

We now support VLMs in smolagents!

Jan 24 • 92

upvoted a collection about 2 months ago

ProLIP

Collection

Official ProLIP weights • 7 items • Updated about 21 hours ago • 6

upvoted 6 articles about 2 months ago

Article

How to Expand Your AI Music Generations of 30 Seconds to Several Minutes

Dec 13, 2024 • 3

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

Jan 19 • 15

Article

Fine-tune ModernBERT for RAG with Synthetic Data

and 2 others • Jan 20 • 37

Article

Yay! Organizations can now publish blog Articles

and 3 others • Jan 20 • 36

Article

Mastering Long Contexts in LLMs with KVPress

and 1 other • Jan 23 • 64

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

Jan 20 • 63

upvoted a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 345