Paulson

Pnaomi

AI & ML interests

Yes

Pnaomi's activity

upvoted 20 papers 3 days ago

Understanding and Predicting Derailment in Toxic Conversations on GitHub

Paper • 2503.02191 • Published 6 days ago • 3

Lost in Literalism: How Supervised Training Shapes Translationese in LLMs

Paper • 2503.04369 • Published 4 days ago • 4

Dedicated Feedback and Edit Models Empower Inference-Time Scaling for Open-Ended General-Domain Tasks

Paper • 2503.04378 • Published 4 days ago • 6

L^2M: Mutual Information Scaling Law for Long-Context Language Modeling

Paper • 2503.04725 • Published 3 days ago • 18

Identifying Sensitive Weights via Post-quantization Integral

Paper • 2503.01901 • Published 10 days ago • 7

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published 3 days ago • 19

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published 4 days ago • 18

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 4 days ago • 66

FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion

Paper • 2503.04222 • Published 4 days ago • 12

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published 4 days ago • 31

LLM as a Broken Telephone: Iterative Generation Distorts Information

Paper • 2502.20258 • Published 10 days ago • 18

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 3 days ago • 76

SwiLTra-Bench: The Swiss Legal Translation Benchmark

Paper • 2503.01372 • Published 7 days ago • 2

Benchmarking Large Language Models for Multi-Language Software Vulnerability Detection

Paper • 2503.01449 • Published 7 days ago • 3

FLAME: A Federated Learning Benchmark for Robotic Manipulation

Paper • 2503.01729 • Published 6 days ago • 4

Retrieval Models Aren't Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models

Paper • 2503.01763 • Published 6 days ago • 4

Exploring Rewriting Approaches for Different Conversational Tasks

Paper • 2502.18860 • Published 12 days ago • 4

Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective

Paper • 2503.01933 • Published 7 days ago • 10

Mixture of Structural-and-Textual Retrieval over Text-rich Graph Knowledge Bases

Paper • 2502.20317 • Published 10 days ago • 6

CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom

Paper • 2503.01836 • Published 6 days ago • 10