35 5 337

Nikita Davidchuk

Ar4ikov

https://github.com/Ar4ikov

Ar4ikov

AI & ML interests

nlp, web, opensource, transformers, asr, ser, tts, cv

Recent Activity

liked a Space 8 days ago

fffiloni/MEMO

liked a model 15 days ago

yandex/YandexGPT-5-Lite-8B-pretrain

liked a Space 17 days ago

nanotron/ultrascale-playbook

View all activity

Organizations

Ar4ikov's activity

liked a Space 8 days ago

MEMO

👁

Memory-Guided Diffusion for Expressive Talking Video Gen

liked a model 15 days ago

yandex/YandexGPT-5-Lite-8B-pretrain

Updated 15 days ago • 8.43k • 165

liked 3 Spaces 17 days ago

2.21k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

383

OmniParser V2

🏢

OmniParser, turn your LLM into GUI agent

971

LuminaBrush

📈

Execute commands from environment

liked a Space 21 days ago

Workflow Canvas

🖼

FLUX Hand-written STYLE Genereator

liked a Space about 1 month ago

287

Kokoro Text-to-Speech (WebGPU)

🗣

High-quality speech synthesis powered by Kokoro TTS

liked 2 models about 1 month ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 16 days ago • 2.99M • • 11.2k

Systran/faster-whisper-large-v3

Automatic Speech Recognition • Updated Nov 23, 2023 • 811k • 353

liked a Space about 1 month ago

2.24k

Kokoro TTS

❤

Upgraded to v1.0!

liked 4 models about 1 month ago

liked a model about 2 months ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 8 days ago • 1.59M • 3.64k

liked a Space about 2 months ago

RuadaptQwen2.5

💬

Generate text responses in a chat format

liked a model 2 months ago

distil-whisper/distil-large-v3

Automatic Speech Recognition • Updated 6 days ago • 713k • • 304

liked a Space 2 months ago

4.14k

TRELLIS

🏢

Scalable and Versatile 3D Generation from images

reacted to suayptalha's post with ❤️ 2 months ago

Post

2149

🚀 Introducing 𝐅𝐢𝐫𝐬𝐭 𝐇𝐮𝐠𝐠𝐢𝐧𝐠 𝐅𝐚𝐜𝐞 𝐈𝐧𝐭𝐞𝐠𝐫𝐚𝐭𝐢𝐨𝐧 𝐨𝐟 𝐦𝐢𝐧𝐆𝐑𝐔 𝐌𝐨𝐝𝐞𝐥𝐬 from the paper 𝐖𝐞𝐫𝐞 𝐑𝐍𝐍𝐬 𝐀𝐥𝐥 𝐖𝐞 𝐍𝐞𝐞𝐝𝐞𝐝?

🖥 I have integrated 𝐧𝐞𝐱𝐭-𝐠𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐨𝐧 𝐑𝐍𝐍𝐬, specifically minGRU, which offer faster performance compared to Transformer architectures, into HuggingFace. This allows users to leverage the lighter and more efficient minGRU models with the "𝐭𝐫𝐚𝐧𝐬𝐟𝐨𝐫𝐦𝐞𝐫𝐬" 𝐥𝐢𝐛𝐫𝐚𝐫𝐲 for both usage and training.

💻 I integrated two main tasks: 𝐌𝐢𝐧𝐆𝐑𝐔𝐅𝐨𝐫𝐒𝐞𝐪𝐮𝐞𝐧𝐜𝐞𝐂𝐥𝐚𝐬𝐬𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧 and 𝐌𝐢𝐧𝐆𝐑𝐔𝐅𝐨𝐫𝐂𝐚𝐮𝐬𝐚𝐥𝐋𝐌.

𝐌𝐢𝐧𝐆𝐑𝐔𝐅𝐨𝐫𝐒𝐞𝐪𝐮𝐞𝐧𝐜𝐞𝐂𝐥𝐚𝐬𝐬𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧:
You can use this class for 𝐒𝐞𝐪𝐮𝐞𝐧𝐜𝐞 𝐂𝐥𝐚𝐬𝐬𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧 tasks. I also trained a Sentiment Analysis model with stanfordnlp/imdb dataset.

𝐌𝐢𝐧𝐆𝐑𝐔𝐅𝐨𝐫𝐂𝐚𝐮𝐬𝐚𝐥𝐋𝐌:
You can use this class for 𝐂𝐚𝐮𝐬𝐚𝐥 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐌𝐨𝐝𝐞𝐥 tasks such as GPT, Llama. I also trained an example model with roneneldan/TinyStories dataset. You can fine-tune and use it!

🔗 𝐋𝐢𝐧𝐤𝐬:
Models: suayptalha/mingru-676fe8d90760d01b7955d7ab
GitHub: https://github.com/suayptalha/minGRU-hf
LinkedIn Post: https://www.linkedin.com/posts/suayp-talha-kocabay_mingru-a-suayptalha-collection-activity-7278755484172439552-wNY1

📰 𝐂𝐫𝐞𝐝𝐢𝐭𝐬:
Paper Link: https://arxiv.org/abs/2410.01201

I am thankful to Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio and Hossein Hajimirsadeghi for their papers.

liked a model 2 months ago

answerdotai/ModernBERT-base

Fill-Mask • Updated Jan 15 • 4.11M • 788