Deep Bidirectional Language-Knowledge Graph Pretraining • Paper • 2210.09338 • Published Oct 17, 2022
Switch-Transformers release • Collection • This release includes various MoE (Mixture-of-Experts) models based on the T5 architecture. The base models use from 8 to 256 experts. • 9 items • Updated Jul 31
Mixtral HQQ Quantized Models • Collection • 4-bit and 2-bit Mixtral models quantized using HQQ (https://github.com/mobiusml/hqq) • 9 items • Updated Mar 29