admarcosai's Collections
Pending Classification
Video Creation by Demonstration
Paper
•
2412.09551
•
Published
•
8
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for
Customized Manga Generation
Paper
•
2412.07589
•
Published
•
45
Unraveling the Complexity of Memory in RL Agents: an Approach for
Classification and Evaluation
Paper
•
2412.06531
•
Published
•
71
APOLLO: SGD-like Memory, AdamW-level Performance
Paper
•
2412.05270
•
Published
•
38
Ultra-Sparse Memory Network
Paper
•
2411.12364
•
Published
•
19
Paper
•
2409.07429
•
Published
•
28
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in
Long-Horizon Tasks
Paper
•
2408.03615
•
Published
•
30
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge
Bases
Paper
•
2407.12784
•
Published
•
48
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for
LLM Agents
Paper
•
2407.04363
•
Published
•
27
VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding
Paper
•
2403.11481
•
Published
•
12
Evaluating Very Long-Term Conversational Memory of LLM Agents
Paper
•
2402.17753
•
Published
•
18
ChatQA: Building GPT-4 Level Conversational QA Models
Paper
•
2401.10225
•
Published
•
34
Commonsense-augmented Memory Construction and Management in Long-term
Conversations via Context-aware Persona Refinement
Paper
•
2401.14215
•
Published
•
2
Effective and Efficient Conversation Retrieval for Dialogue State
Tracking with Implicit Text Summaries
Paper
•
2402.13043
•
Published
•
2
Evaluating and Aligning CodeLLMs on Human Preference
Paper
•
2412.05210
•
Published
•
47
UniReal: Universal Image Generation and Editing via Learning Real-world
Dynamics
Paper
•
2412.07774
•
Published
•
25
Paper
•
2412.07724
•
Published
•
18
Fully Open Source Moxin-7B Technical Report
Paper
•
2412.06845
•
Published
•
10
Training Large Language Models to Reason in a Continuous Latent Space
Paper
•
2412.06769
•
Published
•
62
ProcessBench: Identifying Process Errors in Mathematical Reasoning
Paper
•
2412.06559
•
Published
•
68
Maya: An Instruction Finetuned Multilingual Multimodal Model
Paper
•
2412.07112
•
Published
•
25
Expanding Performance Boundaries of Open-Source Multimodal Models with
Model, Data, and Test-Time Scaling
Paper
•
2412.05271
•
Published
•
121
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases
Paper
•
2412.04862
•
Published
•
48
Evaluating Language Models as Synthetic Data Generators
Paper
•
2412.03679
•
Published
•
43
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and
Proactive Robotic Failure Detection
Paper
•
2412.04455
•
Published
•
35
CodeChain: Towards Modular Code Generation Through Chain of
Self-revisions with Representative Sub-modules
Paper
•
2310.08992
•
Published
•
10
Paper
•
2412.04315
•
Published
•
16
Discriminative Fine-tuning of LVLMs
Paper
•
2412.04378
•
Published
•
10
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Paper
•
2412.04448
•
Published
•
9
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper
•
2412.03555
•
Published
•
118
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic
Data From Large Language Models
Paper
•
2412.02980
•
Published
•
12
Balancing Speed and Stability: The Trade-offs of FP8 vs. BF16 Training
in LLMs
Paper
•
2411.08719
•
Published
Little Giants: Synthesizing High-Quality Embedding Data at Scale
Paper
•
2410.18634
•
Published
A Survey on Data Synthesis and Augmentation for Large Language Models
Paper
•
2410.12896
•
Published
Self-Improvement in Language Models: The Sharpening Mechanism
Paper
•
2412.01951
•
Published
Large Language Models Can Self-Improve in Long-context Reasoning
Paper
•
2411.08147
•
Published
•
62
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's
Reasoning Capability
Paper
•
2411.19943
•
Published
•
55
MALT: Improving Reasoning with Multi-Agent LLM Training
Paper
•
2412.01928
•
Published
•
39
Multi-Agent Large Language Models for Conversational Task-Solving
Paper
•
2410.22932
•
Published
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and
Pruning
Paper
•
2412.03248
•
Published
•
25
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on
Retrieval-Augmented Generation
Paper
•
2412.02592
•
Published
•
20
Scaling Image Tokenizers with Grouped Spherical Quantization
Paper
•
2412.02632
•
Published
•
10
X-Prompt: Towards Universal In-Context Image Generation in
Auto-Regressive Vision Language Foundation Models
Paper
•
2412.01824
•
Published
•
65
o1-Coder: an o1 Replication for Coding
Paper
•
2412.00154
•
Published
•
40
Open-Sora Plan: Open-Source Large Video Generation Model
Paper
•
2412.00131
•
Published
•
32
The Well: a Large-Scale Collection of Diverse Physics Simulations for
Machine Learning
Paper
•
2412.00568
•
Published
•
14
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
Paper
•
2412.01800
•
Published
•
6
A Simple and Provable Scaling Law for the Test-Time Compute of Large
Language Models
Paper
•
2411.19477
•
Published
•
5
Exploring the Abilities of Large Language Models to Solve Proportional
Analogies via Knowledge-Enhanced Prompting
Paper
•
2412.00869
•
Published
•
4
World-consistent Video Diffusion with Explicit 3D Modeling
Paper
•
2412.01821
•
Published
•
4
Yi-Lightning Technical Report
Paper
•
2412.01253
•
Published
•
25
Reverse Thinking Makes LLMs Stronger Reasoners
Paper
•
2411.19865
•
Published
•
19
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering
Benchmark Dataset
Paper
•
2411.15640
•
Published
•
4
Large Language Model-Brained GUI Agents: A Survey
Paper
•
2411.18279
•
Published
•
27
Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for
Quantized LLMs with 100T Training Tokens
Paper
•
2411.17691
•
Published
•
9
Learning 3D Representations from Procedural 3D Programs
Paper
•
2411.17467
•
Published
•
8
Star Attention: Efficient LLM Inference over Long Sequences
Paper
•
2411.17116
•
Published
•
47
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple
Distillation, Big Progress or Bitter Lesson?
Paper
•
2411.16489
•
Published
•
40
From Generation to Judgment: Opportunities and Challenges of
LLM-as-a-judge
Paper
•
2411.16594
•
Published
•
36
Reflections from the 2024 Large Language Model (LLM) Hackathon for
Applications in Materials Science and Chemistry
Paper
•
2411.15221
•
Published
•
25
All Languages Matter: Evaluating LMMs on Culturally Diverse 100
Languages
Paper
•
2411.16508
•
Published
•
8
Best of Both Worlds: Advantages of Hybrid Graph Sequence Models
Paper
•
2411.15671
•
Published
•
7
LLMs Do Not Think Step-by-step In Implicit Reasoning
Paper
•
2411.15862
•
Published
•
8
Predicting Emergent Capabilities by Finetuning
Paper
•
2411.16035
•
Published
•
6
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper
•
2411.15124
•
Published
•
56
A Flexible Large Language Models Guardrail Development Methodology
Applied to Off-Topic Prompt Detection
Paper
•
2411.12946
•
Published
•
20
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Paper
•
2411.13543
•
Published
•
18
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper
•
2411.14405
•
Published
•
58
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented
LMs
Paper
•
2411.14199
•
Published
•
29
Hymba: A Hybrid-head Architecture for Small Language Models
Paper
•
2411.13676
•
Published
•
39
Do I Know This Entity? Knowledge Awareness and Hallucinations in
Language Models
Paper
•
2411.14257
•
Published
•
9
Patience Is The Key to Large Language Model Reasoning
Paper
•
2411.13082
•
Published
•
7
VBench++: Comprehensive and Versatile Benchmark Suite for Video
Generative Models
Paper
•
2411.13503
•
Published
•
30
SageAttention2 Technical Report: Accurate 4 Bit Attention for
Plug-and-play Inference Acceleration
Paper
•
2411.10958
•
Published
•
50
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context
Training
Paper
•
2411.13476
•
Published
•
15
Continuous Speculative Decoding for Autoregressive Image Generation
Paper
•
2411.11925
•
Published
•
15
Building Trust: Foundations of Security, Safety and Transparency in AI
Paper
•
2411.12275
•
Published
•
10
Evaluating Tokenizer Performance of Large Language Models Across
Official Indian Languages
Paper
•
2411.12240
•
Published
•
6
Generative World Explorer
Paper
•
2411.11844
•
Published
•
75
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large
Language Models on Mobile Devices
Paper
•
2411.10640
•
Published
•
44
Search, Verify and Feedback: Towards Next Generation Post-training
Paradigm of Foundation Models via Verifier Engineering
Paper
•
2411.11504
•
Published
•
19
Drowning in Documents: Consequences of Scaling Reranker Inference
Paper
•
2411.11767
•
Published
•
17
Comprehensive and Practical Evaluation of Retrieval-Augmented Generation
Systems for Medical Question Answering
Paper
•
2411.09213
•
Published
•
6
Evaluating the role of `Constitutions' for learning from AI feedback
Paper
•
2411.10168
•
Published
•
5
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer
Use
Paper
•
2411.10323
•
Published
•
31
LLaVA-o1: Let Vision Language Models Reason Step-by-Step
Paper
•
2411.10440
•
Published
•
111
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Paper
•
2411.09595
•
Published
•
71
Hardware and Software Platform Inference
Paper
•
2411.05197
•
Published
•
3
Stronger Models are NOT Stronger Teachers for Instruction Tuning
Paper
•
2411.07133
•
Published
•
34
Scaling Properties of Diffusion Models for Perceptual Tasks
Paper
•
2411.08034
•
Published
•
13
GRS-QA -- Graph Reasoning-Structured Question Answering Dataset
Paper
•
2411.00369
•
Published
•
6
GPT or BERT: why not both?
Paper
•
2410.24159
•
Published
•
14
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with
Large Language Models
Paper
•
2410.13080
•
Published
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A
Gradient Perspective
Paper
•
2410.23743
•
Published
•
59
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM
Inference
Paper
•
2410.21465
•
Published
•
11
RARe: Retrieval Augmented Retrieval with In-Context Examples
Paper
•
2410.20088
•
Published
•
5
Autoregressive Models in Vision: A Survey
Paper
•
2411.05902
•
Published
•
16
Game-theoretic LLM: Agent Workflow for Negotiation Games
Paper
•
2411.05990
•
Published
•
7
Language Models are Hidden Reasoners: Unlocking Latent Reasoning
Capabilities via Self-Rewarding
Paper
•
2411.04282
•
Published
•
30
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Paper
•
2411.04905
•
Published
•
111
BitNet a4.8: 4-bit Activations for 1-bit LLMs
Paper
•
2411.04965
•
Published
•
63
Mixture-of-Transformers: A Sparse and Scalable Architecture for
Multi-Modal Foundation Models
Paper
•
2411.04996
•
Published
•
49
Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large
Language Model
Paper
•
2411.04496
•
Published
•
22
Self-Consistency Preference Optimization
Paper
•
2411.04109
•
Published
•
16
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle
Grandmaster Level
Paper
•
2411.03562
•
Published
•
63
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM
Quantization
Paper
•
2411.02355
•
Published
•
46
How Far is Video Generation from World Model: A Physical Law Perspective
Paper
•
2411.02385
•
Published
•
33
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated
Parameters by Tencent
Paper
•
2411.02265
•
Published
•
24
Adaptive Caching for Faster Video Generation with Diffusion Transformers
Paper
•
2411.02397
•
Published
•
23
Constrained Diffusion Implicit Models
Paper
•
2411.00359
•
Published
•
6
Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and
Cross-Cultural Embedding Models and Benchmarks
Paper
•
2411.01192
•
Published
•
3
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Paper
•
2410.23218
•
Published
•
46
Personalization of Large Language Models: A Survey
Paper
•
2411.00027
•
Published
•
31
Survey of User Interface Design and Interaction Techniques in Generative
AI Applications
Paper
•
2410.22370
•
Published
•
11
BitStack: Fine-Grained Size Control for Compressed Large Language Models
in Variable Memory Environments
Paper
•
2410.23918
•
Published
•
18
SelfCodeAlign: Self-Alignment for Code Generation
Paper
•
2410.24198
•
Published
•
21
AAAR-1.0: Assessing AI's Potential to Assist Research
Paper
•
2410.22394
•
Published
•
14
On Memorization of Large Language Models in Logical Reasoning
Paper
•
2410.23123
•
Published
•
18
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science
Competitions
Paper
•
2410.20424
•
Published
•
39
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World
Exploration, Feedback and Optimization
Paper
•
2410.19609
•
Published
•
17
A Survey of Small Language Models
Paper
•
2410.20011
•
Published
•
40
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized
Generalist Computer Assistant
Paper
•
2410.18603
•
Published
•
32
Counting Ability of Large Language Models and Impact of Tokenization
Paper
•
2410.19730
•
Published
•
10
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for
Contrastive Loss
Paper
•
2410.17243
•
Published
•
89
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis
from Scratch
Paper
•
2410.18693
•
Published
•
40
Unbounded: A Generative Infinite Game of Character Life Simulation
Paper
•
2410.18975
•
Published
•
35
Multi-Draft Speculative Sampling: Canonical Architectures and
Theoretical Limits
Paper
•
2410.18234
•
Published
•
3
WorldSimBench: Towards Video Generation Models as World Simulators
Paper
•
2410.18072
•
Published
•
18
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Paper
•
2311.05556
•
Published
•
82
Latent Consistency Models: Synthesizing High-Resolution Images with
Few-Step Inference
Paper
•
2310.04378
•
Published
•
19
Conditional Diffusion Distillation
Paper
•
2310.01407
•
Published
•
20
Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Paper
•
2310.03739
•
Published
•
21
Large Concept Models: Language Modeling in a Sentence Representation
Space
Paper
•
2412.08821
•
Published
•
7
The Role of Summarization in Generative Agents: A Preliminary
Perspective
Paper
•
2305.01253
•
Published
Generative Agents: Interactive Simulacra of Human Behavior
Paper
•
2304.03442
•
Published
•
12
SOTOPIA-π: Interactive Learning of Socially Intelligent Language
Agents
Paper
•
2403.08715
•
Published
•
20
Generative Agent Simulations of 1,000 People
Paper
•
2411.10109
•
Published
•
2
Self-Rewarding Language Models
Paper
•
2401.10020
•
Published
•
145
DRLC: Reinforcement Learning with Dense Rewards from LLM Critic
Paper
•
2401.07382
•
Published
•
2
Secrets of RLHF in Large Language Models Part II: Reward Modeling
Paper
•
2401.06080
•
Published
•
26
Is this the real life? Is this just fantasy? The Misleading Success of
Simulating Social Interactions With LLMs
Paper
•
2403.05020
•
Published
•
2
Improving Reinforcement Learning from Human Feedback Using Contrastive
Rewards
Paper
•
2403.07708
•
Published
Large Language Model-based Human-Agent Collaboration for Complex Task
Solving
Paper
•
2402.12914
•
Published
Interactive Agents: Simulating Counselor-Client Psychological Counseling
via Role-Playing LLM-to-LLM Interactions
Paper
•
2408.15787
•
Published
Building Cooperative Embodied Agents Modularly with Large Language
Models
Paper
•
2307.02485
•
Published
•
11
Natural Language Reinforcement Learning
Paper
•
2411.14251
•
Published
•
26
Challenges in Human-Agent Communication
Paper
•
2412.10380
•
Published
From Individual to Society: A Survey on Social Simulation Driven by
Large Language Model-based Agents
Paper
•
2412.03563
•
Published
AgentSense: Benchmarking Social Intelligence of Language Agents through
Interactive Scenarios
Paper
•
2410.19346
•
Published
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building
Large Language Model-Based Conversational AI Agents
Paper
•
2411.00927
•
Published
Simulating User Agents for Embodied Conversational-AI
Paper
•
2410.23535
•
Published
Positive Experience Reflection for Agents in Interactive Text
Environments
Paper
•
2411.02223
•
Published
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons
Paper
•
2412.08442
•
Published
OpenDevin: An Open Platform for AI Software Developers as Generalist
Agents
Paper
•
2407.16741
•
Published
•
68
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
•
2407.01489
•
Published
•
42
Scaling Instructable Agents Across Many Simulated Worlds
Paper
•
2404.10179
•
Published
•
27
CodeNav: Beyond tool-use to using real-world codebases with LLM agents
Paper
•
2406.12276
•
Published
Code Agents are State of the Art Software Testers
Paper
•
2406.12952
•
Published
•
1
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks
at Scale
Paper
•
2409.16299
•
Published
•
10
Reinforcement Learning: An Overview
Paper
•
2412.05265
•
Published
•
4
Automated Reinforcement Learning: An Overview
Paper
•
2201.05000
•
Published