Michael Barry

MichaelBarryUK

AI & ML interests

None yet

Organizations

None yet

MichaelBarryUK's activity

upvoted 12 papers 18 days ago

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published 25 days ago • 18

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Paper • 2502.10458 • Published 25 days ago • 30

Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published 23 days ago • 16

Large Language Diffusion Models

Paper • 2502.09992 • Published 23 days ago • 98

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 21 days ago • 141

Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options

Paper • 2502.12929 • Published 19 days ago • 7

Atom of Thoughts for Markov LLM Test-Time Scaling

Paper • 2502.12018 • Published 20 days ago • 15

HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

Paper • 2502.12574 • Published 19 days ago • 11

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published 19 days ago • 65

upvoted 7 papers 19 days ago

One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

Paper • 2502.10454 • Published 26 days ago • 7

Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems

Paper • 2502.11098 • Published 21 days ago • 13

System Message Generation for User Preferences using Open-Source Models

Paper • 2502.11330 • Published 21 days ago • 15

Building A Proof-Oriented Programmer That Is 64% Better Than GPT-4o Under Data Scarsity

Paper • 2502.11901 • Published 20 days ago • 6

CRANE: Reasoning with constrained LLM generation

Paper • 2502.09061 • Published 24 days ago • 18

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published 21 days ago • 22

SAFE-SQL: Self-Augmented In-Context Learning with Fine-grained Example Selection for Text-to-SQL

Paper • 2502.11438 • Published 20 days ago • 7

commented on a paper about 1 month ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 66