- Jamba: A Hybrid Transformer-Mamba Language Model
  Paper • 2403.19887 • Published • 107
- sDPO: Don't Use Your Data All at Once
  Paper • 2403.19270 • Published • 41
- ViTAR: Vision Transformer with Any Resolution
  Paper • 2403.18361 • Published • 54
- Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
  Paper • 2403.18814 • Published • 47
Phuong Pham
mp1704
AI & ML interests
None yet
Recent Activity
- Liked a model 2 days ago: qnguyen3/r1-res-stream
- Liked a model 3 months ago: 5CD-AI/Vintern-3B-beta
- Liked a model 8 months ago: Qwen/CodeQwen1.5-7B
Organizations
Collections: 1
Models: 15
- mp1704/tora_7b_sft_ckpt_200 • Text Generation • Updated • 3
- mp1704/tora_7b_pt • Text Generation • Updated • 5
- mp1704/gpt-neo-sft-v2.1 • Text Generation • Updated • 104
- mp1704/gpt-neo-sft-v2 • Text Generation • Updated • 105
- mp1704/gpt-neo-sft • Text Generation • Updated • 105
- mp1704/gpt-neo-pt • Text Generation • Updated • 106
- mp1704/gemma_2b_sft • Text Generation • Updated • 2
- mp1704/gemma_2b_pt • Text Generation • Updated • 6
- mp1704/qwen_1.8b_sft_full_3 • Text Generation • Updated • 111
- mp1704/qwen_1.8b_sft_full_2 • Feature Extraction • Updated • 101