-
Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time
Paper • 2408.13233 • Published • 24 -
Heterogeneous Multi-task Learning with Expert Diversity
Paper • 2106.10595 • Published • 1 -
Residual Mixture of Experts
Paper • 2204.09636 • Published • 1 -
Language-Routing Mixture of Experts for Multilingual and Code-Switching Speech Recognition
Paper • 2307.05956 • Published • 1
Hazem Essam
hazemessam
AI & ML interests
Protein Language Modeling, Natural Language Processing, Generative Adverserial Networks.
Recent Activity
liked
a model
14 days ago
hexgrad/Kokoro-82M
liked
a dataset
18 days ago
arcinstitute/opengenome2
new activity
about 1 month ago
ElnaggarLab/ankh-base:Missing sentencepiece model?
Organizations
Collections
1
Papers
1
datasets
9
hazemessam/saprot_dataset
Viewer
•
Updated
•
41.1M
•
75
•
1
hazemessam/uniref50
Viewer
•
Updated
•
68.4M
•
89
hazemessam/ddg_megadataset
Viewer
•
Updated
•
754k
•
76
hazemessam/ddg
Preview
•
Updated
•
172
hazemessam/abyssal_db
Preview
•
Updated
•
60
hazemessam/prostata
Viewer
•
Updated
•
10.5k
•
71
hazemessam/fireprot_db
Viewer
•
Updated
•
53.4k
•
68
hazemessam/uniprot_sprot
Viewer
•
Updated
•
569k
•
98
hazemessam/squad_v2
Viewer
•
Updated
•
2
•
125
•
1