- mistralai/Mistral-7B-Instruct-v0.2
  Text Generation • 3.88M • 2.68k
- mistralai/Mixtral-8x7B-Instruct-v0.1
  Text Generation • 557k • 4.35k
- mistralai/Mixtral-8x7B-v0.1
  Text Generation • 35k • 1.69k
- PERL: Parameter Efficient Reinforcement Learning from Human Feedback
  Paper • 2403.10704 • 58
Molone Laveh PRO
molonelaveh
AI & ML interests
convergence, multi-modality, multi-agent, LLM, research
Recent Activity
liked a Space 1 day ago: nanotron/ultrascale-playbook
liked a model 21 days ago: perplexity-ai/r1-1776
liked a model about 2 months ago: UsefulSensors/moonshine-base
Organizations
Collections: 2
Models: None public yet
Datasets: None public yet