SAMBIT CHAKRABORTY
sambitchakhf03
AI & ML interests
None yet
Recent Activity
upvoted a paper 13 days ago
Phi-4 Technical Report
upvoted a paper 14 days ago
Training Large Language Models to Reason in a Continuous Latent Space
updated a collection about 1 month ago
dataset/text
Organizations
Collections
5
- Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU
  Paper • 2403.06504 • Published • 53
- Stealing Part of a Production Language Model
  Paper • 2403.06634 • Published • 90
- MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression
  Paper • 2406.14909 • Published • 14
models
2
datasets
None public yet