Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published about 1 month ago • 122
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 93