Sunyoung Hwang's picture

Sunyoung Hwang PRO

sosoai

·

https://sosohajalab.com

AI & ML interests

llm, vision, transformers, megabytes

Recent Activity

liked a model 4 days ago

tencent/HunyuanVideo-I2V

updated a model 4 days ago

sosoai/hansoldeco-phi-4-grpo-v2-mlx

published a model 4 days ago

sosoai/hansoldeco-phi-4-grpo-v2-mlx

View all activity

Organizations

sosoai's activity

upvoted 2 collections 5 days ago

Sky-T1-7B

A series of 7B models trained with different recipes and the corresponding training data. • 8 items • Updated 24 days ago • 6

Light-R1

Surpassing R1-Distill from Scratch* with 70k Math Data through Curriculum SFT & DPO • 3 items • Updated 6 days ago • 9

upvoted an article about 1 month ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 417

upvoted a paper 3 months ago

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published Dec 6, 2024 • 51

upvoted an article 3 months ago

Article

Use Models from the Hugging Face Hub in LM Studio

By

•

Nov 28, 2024

• 138

upvoted a collection 3 months ago

LLaMA-O1-1129 Datasets, Models, Codes and Papers

8 items • Updated Dec 3, 2024 • 18

upvoted an article 7 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31, 2024

• 58

upvoted 2 collections 8 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 651

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 18 days ago • 217

upvoted an article 8 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 329

upvoted a paper 8 months ago

GAVEL: Generating Games Via Evolution and Language Models

Paper • 2407.09388 • Published Jul 12, 2024 • 17

upvoted 2 articles 8 months ago

Article

How to run Gemini Nano locally in your browser

By

•

Jul 11, 2024

• 44

Article

Announcing New Dataset Search Features

Jul 8, 2024

• 22

upvoted a collection 9 months ago

SPPO

Self-Play Preference Optimization • 10 items • Updated Jun 29, 2024 • 13

upvoted an article 9 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024

• 189

upvoted a paper 9 months ago

OpenVLA: An Open-Source Vision-Language-Action Model

Paper • 2406.09246 • Published Jun 13, 2024 • 37

upvoted 3 papers 10 months ago

AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct

Paper • 2405.14906 • Published May 23, 2024 • 27

Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning

Paper • 2303.15647 • Published Mar 28, 2023 • 4

A Multimodal Automated Interpretability Agent

Paper • 2404.14394 • Published Apr 22, 2024 • 21

upvoted an article 10 months ago

Article

Expanding Model Context and Creating Chat Models with a Single Click

By

•

Apr 28, 2024

• 37