ijohn07 committed on
Commit
2d55e78
1 Parent(s): 50ed833

Update README.md

Files changed (1)
  1. README.md +9 -8
README.md CHANGED
@@ -1,7 +1,8 @@
-
  ---
  pipeline_tag: sentence-similarity
- language: fr
+ language:
+ - fr
+ - en
  license: apache-2.0
  tags:
  - passage-retrieval
@@ -14,11 +15,11 @@ base_model_relation: quantized
  # 🇫🇷 french-gte-multilingual-base

  This model is a 51.4% smaller version of [Alibaba-NLP/gte-multilingual-base](https://huggingface.co/Alibaba-NLP/gte-multilingual-base)
- for the French language, created using the [mtem-pruner](https://huggingface.co/spaces/antoinelouis/mtem-pruner) space.
+ for the French and English language, created using the [mtem-pruner](https://huggingface.co/spaces/antoinelouis/mtem-pruner) space.

- This pruned model should perform similarly to the original model for French language tasks with a much smaller
+ This pruned model should perform similarly to the original model for French and English language tasks with a much smaller
  memory footprint. However, it may not perform well for other languages present in the original multilingual model as tokens not
- commonly used in French were removed from the original multilingual model's vocabulary.
+ commonly used in French and English were removed from the original multilingual model's vocabulary.

  ## Usage

@@ -27,7 +28,7 @@ You can use this model with the Transformers library:
  ```python
  from transformers import AutoModel, AutoTokenizer

- model_name = "ijohn07/french-gte-multilingual-base"
+ model_name = "ijohn07/french-english-gte-base"
  model = AutoModel.from_pretrained(model_name, trust_remote_code=True)
  tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True, use_fast=True)
  ```
@@ -37,7 +38,7 @@ Or with the sentence-transformers library:
  ```python
  from sentence_transformers import SentenceTransformer

- model = SentenceTransformer("ijohn07/french-english-gte-base")
+ model = SentenceTransformer("ijohn07/french-english-gte-base")
  ```

- **Credits**: cc [@antoinelouis](https://huggingface.co/antoinelouis)
+ **Credits**: cc [@antoinelouis](https://huggingface.co/antoinelouis)
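
Note on the pruning described in the README: the card says tokens not commonly used in French and English were removed from the multilingual vocabulary. The sketch below illustrates that general idea on toy data. It is not the mtem-pruner implementation, which this commit does not show. The function `prune_vocabulary`, its parameters (`min_count`, `always_keep`), and the toy vocabulary are all hypothetical names chosen for illustration.

```python
from collections import Counter


def prune_vocabulary(vocab, embeddings, corpus_token_ids, min_count=1, always_keep=()):
    """Keep only tokens seen at least `min_count` times in a target-language corpus.

    Hypothetical helper for illustration; not the mtem-pruner code.
    vocab: dict mapping token string -> old token id
    embeddings: list of embedding rows, indexed by old token id
    corpus_token_ids: iterable of old token ids observed in the target corpus
    Returns (new_vocab, new_embeddings, old_to_new).
    """
    counts = Counter(corpus_token_ids)
    # Old ids that survive: special tokens plus tokens frequent enough in the corpus.
    keep_ids = sorted(
        old_id
        for token, old_id in vocab.items()
        if token in always_keep or counts[old_id] >= min_count
    )
    # Renumber surviving ids contiguously so they index the smaller embedding table.
    old_to_new = {old_id: new_id for new_id, old_id in enumerate(keep_ids)}
    id_to_token = {old_id: token for token, old_id in vocab.items()}
    new_vocab = {id_to_token[old_id]: new_id for old_id, new_id in old_to_new.items()}
    new_embeddings = [embeddings[old_id] for old_id in keep_ids]
    return new_vocab, new_embeddings, old_to_new


# Toy multilingual vocabulary and a 2-dimensional embedding table (made-up values).
vocab = {"<s>": 0, "你好": 1, "bonjour": 2, "привет": 3, "hello": 4}
embeddings = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1], [3.0, 3.1], [4.0, 4.1]]
# Token ids observed in a tiny French/English corpus.
corpus_token_ids = [2, 4, 4, 2]

new_vocab, new_embeddings, old_to_new = prune_vocabulary(
    vocab, embeddings, corpus_token_ids, always_keep={"<s>"}
)
reduction = 1 - len(new_vocab) / len(vocab)
print(new_vocab)
print(f"vocabulary reduced by {reduction:.1%}")
```

Because the embedding table is indexed by token id, dropping vocabulary entries also drops the matching embedding rows, which is where a smaller memory footprint would come from in a scheme like this one. Text in other languages would then tokenize to unknown or fragmented tokens, consistent with the README's caveat about other languages.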