The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces Paper • 2410.13194 • Published Oct 17 • 1
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment Paper • 2408.06266 • Published Aug 12 • 9
Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 129
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models Paper • 2402.10986 • Published Feb 16 • 77
SPAR: Personalized Content-Based Recommendation via Long Engagement Attention Paper • 2402.10555 • Published Feb 16 • 33
DPO vs KTO vs IPO Collection A collection of datasets and models used for the "Aligning LLMs with Direct Preference Optimization Methods" blog post • 2 items • Updated Jan 16 • 12