Ruisi Cai's picture

1 3

Ruisi Cai

CCCCRS

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

upvoted a paper 9 days ago

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

View all activity

Organizations

CCCCRS's activity

authored a paper 5 days ago

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

Paper • 2501.00712 • Published 11 days ago • 5

upvoted a paper 9 days ago

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

Paper • 2501.00658 • Published 12 days ago • 7

authored 5 papers 2 months ago

Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?

Paper • 2302.12480 • Published Feb 24, 2023

H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Paper • 2306.14048 • Published Jun 24, 2023 • 12

Robust Mixture-of-Expert Training for Convolutional Neural Networks

Paper • 2308.10110 • Published Aug 19, 2023 • 2

Flextron: Many-in-One Flexible Large Language Model

Paper • 2406.10260 • Published Jun 11, 2024 • 2

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design

Paper • 2410.19123 • Published Oct 24, 2024 • 15

upvoted a paper 3 months ago

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design

Paper • 2410.19123 • Published Oct 24, 2024 • 15

commented a paper 3 months ago

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design

Paper • 2410.19123 • Published Oct 24, 2024 • 15 •

upvoted a paper 6 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39