Gökdeniz Gülmez's picture

Gökdeniz Gülmez

Goekdeniz-Guelmez

·

AI & ML interests

Transformers / NLP / Multimodal / Realtime M2M / J.O.S.I.E.v4o

Recent Activity

liked a model about 12 hours ago

mistralai/Mistral-Small-24B-Instruct-2501

upvoted an article about 13 hours ago

Proximal Policy Optimization (PPO)

updated a dataset about 15 hours ago

Goekdeniz-Guelmez/GRPO-MLX-Dataset

View all activity

Organizations

Goekdeniz-Guelmez's activity

upvoted an article about 13 hours ago

Article

Proximal Policy Optimization (PPO)

Aug 5, 2022

• 14

upvoted a collection 5 days ago

DolphinLabeled Datasets

Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated 29 days ago • 11

upvoted 2 collections 6 days ago

Qwen2.5-VL

14 items • Updated 6 days ago • 4

Qwen2.5-1M

10 items • Updated 9 days ago • 4

upvoted a collection 8 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 9 days ago • 313

upvoted a collection 9 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 9 days ago • 97

upvoted a collection 17 days ago

Helium-1

Kyutai's Helium-1 2B Model, outperforming other state of the art small models. • 4 items • Updated 17 days ago • 1

upvoted a collection 24 days ago

J.O.S.I.E. v6.0

Trained on opensourced and private custom DPO/ORPO datasets • 8 items • Updated 24 days ago • 2

upvoted a collection 25 days ago

KaLM-embedding

7 items • Updated 13 days ago • 22

upvoted 2 collections 29 days ago

QVQ-72B-Preview

5 items • Updated Dec 24, 2024 • 7

Dolphin 3.0

Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 7 items • Updated about 1 month ago • 60

upvoted an article about 1 month ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 233

upvoted a collection about 1 month ago

Falcon3 Mamba

3 items • Updated Dec 22, 2024 • 1

upvoted 2 collections about 2 months ago

Josiefied

The best uncensored models • 16 items • Updated Dec 20, 2024 • 1

Llama 3.2

Meta goes small with Llama3.2, both text only 1B and 3B, and the 11B Vision models. • 15 items • Updated Dec 17, 2024 • 11

upvoted a collection 2 months ago

Molmo

4 items • Updated Nov 23, 2024 • 2

upvoted 2 papers 3 months ago

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Paper • 2411.03823 • Published Nov 6, 2024 • 45

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 65

upvoted a collection 3 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated Dec 22, 2024 • 212

upvoted a paper 3 months ago

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25, 2024 • 77