DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Paper
•
2310.03714
•
Published
•
31
Note I'm not a fan of the implementation, but I think the ideas behind DSPy are interesting.
Note The paper that introduced the concept of multi-agents!
Note GAIA benchmark is the most challenging benchmark for generalist agents, requiring a good web browser, multimodal capabilities, and complex multi-step task solving.
Note This paper is the basis for the Thought -> Action -> Observation cycle used in most agent frameworks nowadays.
Need to analyze data? Let a Llama-3.1 agent do it for you!
Note This one shows much more impressive scores than ShowUI : but the VLMs used are also much larger (7B and 72B vs 2B) and based on the better Qwen2.5 instead of Qwen2.