Derry Pratama
ibndias
AI & ML interests
None yet
Recent Activity
published
a dataset
1 day ago
ibndias/htb-v2
published
a dataset
1 day ago
ibndias/cipher-context-dataset
liked
a Space
3 days ago
huggingface-projects/repo_duplicator
Organizations
Collections
2
Papers
2
models
15

ibndias/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
•
Updated
•
15

ibndias/Qwen-2.5-7B-Simple-RL
Updated

ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
•
96

ibndias/Qwen-2.5-7B_Base_Math_smalllr
Updated

ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
•
Updated
•
13

ibndias/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
•
11

ibndias/taxi-v3
Reinforcement Learning
•
Updated

ibndias/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated

ibndias/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
8

ibndias/Nous-Hermes-2-MoE-2x34B
Text Generation
•
Updated
•
1.74k