keval shah
keval-sha
AI & ML interests
Multi-modal ML
Organizations
keval-sha's activity
git status stuck: fetching git changes
3
#12 opened 6 months ago
by
keval-sha
size of hidden layers and sliding window attention - dimension is the same, 4096. Is that for a reason?
2
#153 opened 7 months ago
by
keval-sha
size of hidden layers and sliding window attention - dimension is the same, 4096. Is that for a reason?
2
#153 opened 7 months ago
by
keval-sha