Pending Classification

admarcosai 's Collections

Architectures

LLM x Finance

HCI

Position Papers

Coding

Reasoning | Planning

Alignment: FineTuning-Preference

Data Efficiency

Survey

Efficient Inference

LLM x GRAPHS

AI x GAMES

Benchmarks

Libraries and Framworks

Agentics

Preference Dataset

QA Dataset

Coding Dataset

LLM Evaluation

Function Calling Dataset

Conversation

Alignment

Model Architectures

LLM x RL

Serving

Datasets

LLM x RAG

LMMM

LLM Pretraining

Models

Self-Learning AI

LLM-Security

XAI

MultiLingual

Efficient-Continuous Training

ParadigmShift-Inquiry

Sparsity

Math Datasets

AI UX

Parallellism

InContext Learning

Efficient Training

LLM x Symbolics

Long Context

Tool Use | Function Calling

Quantization | Compression

Regulation

LLM | Writing

Math

LLM x Animation

3D Generation

Memory

Modality: Video

3D - AI

Mambas and LLM-AltArch

World Models

updated 3 days ago

Upvote

Video Creation by Demonstration

Paper • 2412.09551 • Published 13 days ago • 8
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 15 days ago • 45
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published 17 days ago • 71
APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published 19 days ago • 38
Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19 • 19
Agent Workflow Memory

Paper • 2409.07429 • Published Sep 11 • 28
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Paper • 2408.03615 • Published Aug 7 • 30
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17 • 48
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

Paper • 2407.04363 • Published Jul 5 • 27
VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Paper • 2403.11481 • Published Mar 18 • 12
Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27 • 18
ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18 • 34
Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement

Paper • 2401.14215 • Published Jan 25 • 2
Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries

Paper • 2402.13043 • Published Feb 20 • 2
Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published 19 days ago • 47
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics

Paper • 2412.07774 • Published 15 days ago • 25
Granite Guardian

Paper • 2412.07724 • Published 15 days ago • 18
Fully Open Source Moxin-7B Technical Report

Paper • 2412.06845 • Published 18 days ago • 10
Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 16 days ago • 62
ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 17 days ago • 68
Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published 16 days ago • 25
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 19 days ago • 121
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published 20 days ago • 48
Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published 21 days ago • 43
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Paper • 2412.04455 • Published 20 days ago • 35
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules

Paper • 2310.08992 • Published Oct 13, 2023 • 10
Densing Law of LLMs

Paper • 2412.04315 • Published 20 days ago • 16
Discriminative Fine-tuning of LVLMs

Paper • 2412.04378 • Published 20 days ago • 10
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation

Paper • 2412.04448 • Published 20 days ago • 9
PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published 21 days ago • 118
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

Paper • 2412.02980 • Published 22 days ago • 12
Balancing Speed and Stability: The Trade-offs of FP8 vs. BF16 Training in LLMs

Paper • 2411.08719 • Published Nov 10
Little Giants: Synthesizing High-Quality Embedding Data at Scale

Paper • 2410.18634 • Published Oct 24
A Survey on Data Synthesis and Augmentation for Large Language Models

Paper • 2410.12896 • Published Oct 16
Self-Improvement in Language Models: The Sharpening Mechanism

Paper • 2412.01951 • Published 23 days ago
Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12 • 62
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published 26 days ago • 55
MALT: Improving Reasoning with Multi-Agent LLM Training

Paper • 2412.01928 • Published 23 days ago • 39
Multi-Agent Large Language Models for Conversational Task-Solving

Paper • 2410.22932 • Published Oct 30
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning

Paper • 2412.03248 • Published 22 days ago • 25
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Paper • 2412.02592 • Published 22 days ago • 20
Scaling Image Tokenizers with Grouped Spherical Quantization

Paper • 2412.02632 • Published 22 days ago • 10
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

Paper • 2412.01824 • Published 23 days ago • 65
o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published 27 days ago • 40
Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published 28 days ago • 32
The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning

Paper • 2412.00568 • Published 25 days ago • 14
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Paper • 2412.01800 • Published 23 days ago • 6
A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models

Paper • 2411.19477 • Published 27 days ago • 5
Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting

Paper • 2412.00869 • Published 24 days ago • 4
World-consistent Video Diffusion with Explicit 3D Modeling

Paper • 2412.01821 • Published 23 days ago • 4
Yi-Lightning Technical Report

Paper • 2412.01253 • Published 24 days ago • 25
Reverse Thinking Makes LLMs Stronger Reasoners

Paper • 2411.19865 • Published 26 days ago • 19
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset

Paper • 2411.15640 • Published Nov 23 • 4
Large Language Model-Brained GUI Agents: A Survey

Paper • 2411.18279 • Published 29 days ago • 27
Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published 29 days ago • 9
Learning 3D Representations from Procedural 3D Programs

Paper • 2411.17467 • Published about 1 month ago • 8
Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published 30 days ago • 47
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published about 1 month ago • 40
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published about 1 month ago • 36
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

Paper • 2411.15221 • Published Nov 20 • 25
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Paper • 2411.16508 • Published about 1 month ago • 8
Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

Paper • 2411.15671 • Published Nov 23 • 7
LLMs Do Not Think Step-by-step In Implicit Reasoning

Paper • 2411.15862 • Published Nov 24 • 8
Predicting Emergent Capabilities by Finetuning

Paper • 2411.16035 • Published Nov 25 • 6
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22 • 56
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

Paper • 2411.12946 • Published Nov 20 • 20
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Paper • 2411.13543 • Published Nov 20 • 18
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21 • 58
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Paper • 2411.14199 • Published Nov 21 • 29
Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20 • 39
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published Nov 21 • 9
Patience Is The Key to Large Language Model Reasoning

Paper • 2411.13082 • Published Nov 20 • 7
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20 • 30
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17 • 50
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20 • 15
Continuous Speculative Decoding for Autoregressive Image Generation

Paper • 2411.11925 • Published Nov 18 • 15
Building Trust: Foundations of Security, Safety and Transparency in AI

Paper • 2411.12275 • Published Nov 19 • 10
Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages

Paper • 2411.12240 • Published Nov 19 • 6
Generative World Explorer

Paper • 2411.11844 • Published Nov 18 • 75
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published Nov 16 • 44
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18 • 19
Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published Nov 18 • 17
Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering

Paper • 2411.09213 • Published Nov 14 • 6
Evaluating the role of `Constitutions' for learning from AI feedback

Paper • 2411.10168 • Published Nov 15 • 5
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Paper • 2411.10323 • Published Nov 15 • 31
LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15 • 111
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14 • 71
Hardware and Software Platform Inference

Paper • 2411.05197 • Published Nov 7 • 3
Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11 • 34
Scaling Properties of Diffusion Models for Perceptual Tasks

Paper • 2411.08034 • Published Nov 12 • 13
GRS-QA -- Graph Reasoning-Structured Question Answering Dataset

Paper • 2411.00369 • Published Nov 1 • 6
GPT or BERT: why not both?

Paper • 2410.24159 • Published Oct 31 • 14
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models

Paper • 2410.13080 • Published Oct 16
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published Oct 31 • 59
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Paper • 2410.21465 • Published Oct 28 • 11
RARe: Retrieval Augmented Retrieval with In-Context Examples

Paper • 2410.20088 • Published Oct 26 • 5
Autoregressive Models in Vision: A Survey

Paper • 2411.05902 • Published Nov 8 • 16
Game-theoretic LLM: Agent Workflow for Negotiation Games

Paper • 2411.05990 • Published Nov 8 • 7
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6 • 30
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7 • 111
BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published Nov 7 • 63
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7 • 49
Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model

Paper • 2411.04496 • Published Nov 7 • 22
Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6 • 16
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5 • 63
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4 • 46
How Far is Video Generation from World Model: A Physical Law Perspective

Paper • 2411.02385 • Published Nov 4 • 33
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Paper • 2411.02265 • Published Nov 4 • 24
Adaptive Caching for Faster Video Generation with Diffusion Transformers

Paper • 2411.02397 • Published Nov 4 • 23
Constrained Diffusion Implicit Models

Paper • 2411.00359 • Published Nov 1 • 6
Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks

Paper • 2411.01192 • Published Nov 2 • 3
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30 • 46
Personalization of Large Language Models: A Survey

Paper • 2411.00027 • Published Oct 29 • 31
Survey of User Interface Design and Interaction Techniques in Generative AI Applications

Paper • 2410.22370 • Published Oct 28 • 11
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Paper • 2410.23918 • Published Oct 31 • 18
SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31 • 21
AAAR-1.0: Assessing AI's Potential to Assist Research

Paper • 2410.22394 • Published Oct 29 • 14
On Memorization of Large Language Models in Logical Reasoning

Paper • 2410.23123 • Published Oct 30 • 18
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

Paper • 2410.20424 • Published Oct 27 • 39
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25 • 17
A Survey of Small Language Models

Paper • 2410.20011 • Published Oct 25 • 40
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24 • 32
Counting Ability of Large Language Models and Impact of Tokenization

Paper • 2410.19730 • Published Oct 25 • 10
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22 • 89
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch

Paper • 2410.18693 • Published Oct 24 • 40
Unbounded: A Generative Infinite Game of Character Life Simulation

Paper • 2410.18975 • Published Oct 24 • 35
Multi-Draft Speculative Sampling: Canonical Architectures and Theoretical Limits

Paper • 2410.18234 • Published Oct 23 • 3
WorldSimBench: Towards Video Generation Models as World Simulators

Paper • 2410.18072 • Published Oct 23 • 18
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Paper • 2311.05556 • Published Nov 9, 2023 • 82
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Paper • 2310.04378 • Published Oct 6, 2023 • 19
Conditional Diffusion Distillation

Paper • 2310.01407 • Published Oct 2, 2023 • 20
Aligning Text-to-Image Diffusion Models with Reward Backpropagation

Paper • 2310.03739 • Published Oct 5, 2023 • 21
Large Concept Models: Language Modeling in a Sentence Representation Space

Paper • 2412.08821 • Published 14 days ago • 7
The Role of Summarization in Generative Agents: A Preliminary Perspective

Paper • 2305.01253 • Published May 2, 2023
Generative Agents: Interactive Simulacra of Human Behavior

Paper • 2304.03442 • Published Apr 7, 2023 • 12
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents

Paper • 2403.08715 • Published Mar 13 • 20
Generative Agent Simulations of 1,000 People

Paper • 2411.10109 • Published Nov 15 • 2
Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 145
DRLC: Reinforcement Learning with Dense Rewards from LLM Critic

Paper • 2401.07382 • Published Jan 14 • 2
Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11 • 26
Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs

Paper • 2403.05020 • Published Mar 8 • 2
Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

Paper • 2403.07708 • Published Mar 12
Large Language Model-based Human-Agent Collaboration for Complex Task Solving

Paper • 2402.12914 • Published Feb 20
Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions

Paper • 2408.15787 • Published Aug 28
Building Cooperative Embodied Agents Modularly with Large Language Models

Paper • 2307.02485 • Published Jul 5, 2023 • 11
Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21 • 26
Challenges in Human-Agent Communication

Paper • 2412.10380 • Published 28 days ago
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents

Paper • 2412.03563 • Published 21 days ago
AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios

Paper • 2410.19346 • Published Oct 25
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents

Paper • 2411.00927 • Published Nov 1
Simulating User Agents for Embodied Conversational-AI

Paper • 2410.23535 • Published Oct 31
Positive Experience Reflection for Agents in Interactive Text Environments

Paper • 2411.02223 • Published Nov 4
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons

Paper • 2412.08442 • Published 15 days ago
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23 • 68
Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1 • 42
Scaling Instructable Agents Across Many Simulated Worlds

Paper • 2404.10179 • Published Mar 13 • 27
CodeNav: Beyond tool-use to using real-world codebases with LLM agents

Paper • 2406.12276 • Published Jun 18
Code Agents are State of the Art Software Testers

Paper • 2406.12952 • Published Jun 18 • 1
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale

Paper • 2409.16299 • Published Sep 9 • 10
Reinforcement Learning: An Overview

Paper • 2412.05265 • Published 19 days ago • 4
Automated Reinforcement Learning: An Overview

Paper • 2201.05000 • Published Jan 13, 2022

Upvote

Collection guide
Browse collections