Michiel de Jong
msdejong
AI & ML interests
None yet
Recent Activity
new activity
about 1 month ago
Qwen/Qwen2.5-Coder-32B-Instruct: What is the correct norm eps for 14B and 32B base models?
Organizations
None yet
msdejong's activity
What is the correct norm eps for 14B and 32B base models?
#1 opened about 1 month ago by msdejong