Update README.md
Browse files
README.md
CHANGED
@@ -947,9 +947,9 @@ Click here to try the online demo of [MiniCPM-o 2.6](https://minicpm-omni-webdem
|
|
947 |
Inference using Huggingface transformers on NVIDIA GPUs. Please ensure that `transformers==4.44.2` is installed, as other versions may have compatibility issues. We are investigating this issue. Requirements tested on python 3.10:
|
948 |
```
|
949 |
Pillow==10.1.0
|
950 |
-
torch==2.
|
951 |
-
torchaudio==2.
|
952 |
-
torchvision==0.
|
953 |
transformers==4.44.2
|
954 |
librosa==0.9.0
|
955 |
soundfile==0.12.1
|
@@ -986,8 +986,13 @@ tokenizer = AutoTokenizer.from_pretrained('openbmb/MiniCPM-o-2_6', trust_remote_
|
|
986 |
|
987 |
# In addition to vision-only mode, tts processor and vocos also needs to be initialized
|
988 |
model.init_tts()
|
|
|
|
|
|
|
|
|
989 |
model.tts.float()
|
990 |
```
|
|
|
991 |
### Omni mode
|
992 |
we provide two inference modes: chat and streaming
|
993 |
|
|
|
947 |
Inference using Huggingface transformers on NVIDIA GPUs. Please ensure that `transformers==4.44.2` is installed, as other versions may have compatibility issues. We are investigating this issue. Requirements tested on python 3.10:
|
948 |
```
|
949 |
Pillow==10.1.0
|
950 |
+
torch==2.3.1
|
951 |
+
torchaudio==2.3.1
|
952 |
+
torchvision==0.18.1
|
953 |
transformers==4.44.2
|
954 |
librosa==0.9.0
|
955 |
soundfile==0.12.1
|
|
|
986 |
|
987 |
# In addition to vision-only mode, tts processor and vocos also needs to be initialized
|
988 |
model.init_tts()
|
989 |
+
```
|
990 |
+
|
991 |
+
If you are using an older version of PyTorch, you might encounter this issue `"weight_norm_fwd_first_dim_kernel" not implemented for 'BFloat16'`, Please convert the TTS to float32 type.
|
992 |
+
```python
|
993 |
model.tts.float()
|
994 |
```
|
995 |
+
|
996 |
### Omni mode
|
997 |
we provide two inference modes: chat and streaming
|
998 |
|