Also wonder, given the additional compute and time we now have, would ensemble with another static embedding model give additional quality improvements. But then, what might be the best way to ensemble, maybe domain dataset specific would help, different loss trained model?
Nitin Surya
nitinsurya
AI & ML interests
Vision, NLP and multimodal learning.
Recent Activity
commented on
an
article
about 1 month ago
Train 400x faster Static Embedding Models with Sentence Transformers
commented on
an
article
about 1 month ago
Train 400x faster Static Embedding Models with Sentence Transformers
updated
a Space
about 1 year ago
nitinsurya/clip-embedding
Organizations
None yet
nitinsurya's activity
commented on
Train 400x faster Static Embedding Models with Sentence Transformers
about 1 month ago
commented on
Train 400x faster Static Embedding Models with Sentence Transformers
about 1 month ago
With some research directions looking into tokenizer free, wondering how far one can stretch the character level similarly trained model.