samusenps

AI & ML interests

Foundational Architectures, Multi-Modality, Interpretability, Benchmarking w/ simulations, Robotics, Integration with Non envasive Open Source stack RISC-V BCI. Extremely high quality training data. Fully Open Source ML/AI.

Recent Activity

upvoted a paper 25 days ago

SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation

upvoted a paper 25 days ago

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

upvoted a paper 25 days ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

View all activity

Organizations

samusenps's activity

upvoted 20 papers 25 days ago

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Paper • 2412.16849 • Published Dec 22, 2024 • 9

NILE: Internal Consistency Alignment in Large Language Models

Paper • 2412.16686 • Published Dec 21, 2024 • 8

Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding

Paper • 2412.17295 • Published Dec 23, 2024 • 9

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published Dec 19, 2024 • 12

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Paper • 2412.17589 • Published Dec 23, 2024 • 12

ResearchTown: Simulator of Human Research Community

Paper • 2412.17767 • Published Dec 23, 2024 • 14

Outcome-Refining Process Supervision for Code Generation

Paper • 2412.15118 • Published Dec 19, 2024 • 19

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published Dec 21, 2024 • 22

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published Dec 23, 2024 • 21

Revisiting In-Context Learning with Long Context Language Models

Paper • 2412.16926 • Published Dec 22, 2024 • 29

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published Dec 23, 2024 • 29

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Paper • 2412.17153 • Published Dec 22, 2024 • 34

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 42

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 45

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published Dec 19, 2024 • 85

MotiF: Making Text Count in Image Animation with Motion Focal Loss

Paper • 2412.16153 • Published Dec 20, 2024 • 6