Tried my hand at simplifying the derivations of Direct Preference Optimization. I cover how one can reformulate RLHF into DPO. The idea of implicit reward modeling is chef's kiss. Blog: https://huggingface.co./blog/ariG23498/rlhf-to-dpo
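For readers who want the implicit reward idea in concrete form, here is a minimal sketch of the standard DPO objective for a single preference pair. It is not taken from the blog post: the function names, the argument names, and the default `beta=0.1` are illustrative assumptions, and the inputs are assumed to be summed log-probabilities of each response under the policy and under a frozen reference model.

```python
import math


def implicit_reward(logp_policy, logp_ref, beta):
    # Implicit reward (up to a prompt-only constant):
    # beta * log(pi_theta(y|x) / pi_ref(y|x)), computed from log-probabilities.
    return beta * (logp_policy - logp_ref)


def dpo_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected, beta=0.1):
    # Hypothetical helper: DPO loss for one (chosen, rejected) pair.
    margin = (
        implicit_reward(logp_chosen, ref_logp_chosen, beta)
        - implicit_reward(logp_rejected, ref_logp_rejected, beta)
    )
    # -log(sigmoid(margin)) written as a numerically stable softplus(-margin).
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy matches the reference on both responses the margin is zero and the loss is log 2; raising the policy's log-probability of the chosen response relative to the reference pushes the loss below that.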
Timm ❤️ Transformers. With the latest version of transformers you can now use any timm model with the familiar transformers API. Blog Post: https://huggingface.co./blog/timm-transformers Repository with examples: https://github.com/ariG23498/timm-wrapper-examples Collection: ariG23498/timmwrapper-6777b85f1e8d085d3f1374a1
We are blessed with another iteration of PaliGemma. Google launches PaliGemma 2. google/paligemma-2-release-67500e1e1dbfdd4dee27ba48 merve/paligemma2-vqav2
pyimagesearch/construction-safety-object-detection-paligemma • Updated Dec 5, 2024
Qwen/qwen25-66e81a666513e518adb90d9e Qwen/Qwen2.5-Coder-Artifacts Qwen/Qwen2.5-Coder-demo
Cohere drops two new multilingual models! CohereForAI/aya-expanse-8b CohereForAI/aya-expanse-32b Try them out here: CohereForAI/aya_expanse