Sanjiban Choudhury's picture

1 1

Sanjiban Choudhury PRO

sc2582

·

https://www.sanjibanchoudhury.com/

AI & ML interests

None yet

Recent Activity

updated a model 25 days ago

rl-llm-agent/Llama-3.2-3B-Instruct-sft-alfworld-leap-iter1

published a model 25 days ago

rl-llm-agent/Llama-3.2-3B-Instruct-sft-alfworld-leap-iter1

liked a dataset about 1 month ago

allenai/UNcommonsense

View all activity

Organizations

sc2582's activity

updated a model 25 days ago

rl-llm-agent/Llama-3.2-3B-Instruct-sft-alfworld-leap-iter1

Text Generation • Updated 25 days ago • 13

published a model 25 days ago

rl-llm-agent/Llama-3.2-3B-Instruct-sft-alfworld-leap-iter1

Text Generation • Updated 25 days ago • 13

liked a dataset about 1 month ago

allenai/UNcommonsense

Viewer • Updated Jan 19, 2024 • 18.3k • 246 • 10

updated a model about 2 months ago

rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-iqlearn-iter1

Updated Jan 20 • 12

published 9 models about 2 months ago

rl-llm-agent/Llama-3.2-3B-Instruct-online-dpo-alfworld-iter0

Updated Jan 8 • 7

rl-llm-agent/Llama-3.2-3B-Instruct-online-dpo-alfworld-iter2

Updated Jan 11 • 11

rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-iqlearn-iter0

Updated Jan 13 • 8

rl-llm-agent/Llama-3.2-3B-Instruct-online-dpo-alfworld-iqlearn-iter0

Updated Jan 13 • 14

rl-llm-agent/Llama-3.2-3B-Instruct-value-alfworld-8b-sft

Updated Jan 13 • 8

rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-shaped-iter0

Updated Jan 14 • 6

rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-iqlearn-iter1

Updated Jan 20 • 12

rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-iter2-70k

Updated Jan 16 • 7

rl-llm-agent/Llama-3.2-3B-Instruct-online-dpo-exploration-aflworld-iter0-checkpoint-50

Updated Jan 16 • 8

updated 7 models about 2 months ago

rl-llm-agent/Llama-3.2-3B-Instruct-online-dpo-exploration-aflworld-iter0-checkpoint-50

Updated Jan 16 • 8

rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-iter2-70k

Updated Jan 16 • 7

rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-shaped-iter0

Updated Jan 14 • 6

rl-llm-agent/Llama-3.2-3B-Instruct-value-alfworld-8b-sft

Updated Jan 13 • 8

rl-llm-agent/Llama-3.2-3B-Instruct-online-dpo-alfworld-iqlearn-iter0

Updated Jan 13 • 14

rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-iqlearn-iter0

Updated Jan 13 • 8

rl-llm-agent/Llama-3.2-3B-Instruct-online-dpo-alfworld-iter2

Updated Jan 11 • 11