Bringing my ideas to life
Gagan Bhatia
gagan3012
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
authored
a paper
5 days ago
DateLogicQA: Benchmarking Temporal Biases in Large Language Models
commented
a paper
5 days ago
DateLogicQA: Benchmarking Temporal Biases in Large Language Models
Organizations
Collections
1
spaces
15
models
91
gagan3012/index_wikipedia_arabic
Updated
gagan3012/Qwen2-VL-2B-Instruct-LoRA-AR
Updated
•
51
gagan3012/Florence-2-FT-ArabicOCR
Text Generation
•
Updated
•
78
•
2
gagan3012/Mistral_arabic_dpo_agec_final_combined
Text Generation
•
Updated
•
10
gagan3012/ArMistral-GEC
Text Generation
•
Updated
•
10
gagan3012/tinyllama-20480
Text Generation
•
Updated
•
13
gagan3012/dpo-test
Text Generation
•
Updated
•
14
gagan3012/Multilingual-mistral-asian
Text Generation
•
Updated
•
17
gagan3012/Multilingual-mistral
Text Generation
•
Updated
•
1.06k
•
2
gagan3012/MegaArabic
Text Generation
•
Updated
•
17
datasets
97
gagan3012/DateLogicQA
Viewer
•
Updated
•
190
•
27
gagan3012/TimeBench-event
Preview
•
Updated
•
10
gagan3012/TimeLLAMA-Eval
Viewer
•
Updated
•
1k
•
16
gagan3012/temporal_qa
Viewer
•
Updated
•
11k
•
11
gagan3012/skyworks_reward_model_prefs_v2
Viewer
•
Updated
•
77k
•
44
gagan3012/skyworks_reward_model_prefs
Viewer
•
Updated
•
100
•
34
gagan3012/helpsteer2-preference-v2
Viewer
•
Updated
•
9.13k
•
40
gagan3012/helpsteer2-preference
Viewer
•
Updated
•
9.13k
•
74
gagan3012/dpo-fix
Viewer
•
Updated
•
3.4k
•
54
gagan3012/multi-reward-bench
Viewer
•
Updated
•
2.99k
•
45