Improving reliability, using prompt parameter
#111 opened 8 months ago
by
dcd12345678
suppress the number in whisper model
2
#109 opened 8 months ago
by
saichand09
Upload Jens Wörle Onyo.m4a
#106 opened 8 months ago
by
lennartuh
Meaningless or repetitive sentences continue to be generated.
#105 opened 8 months ago
by
kbuwel
Audio input consists of only 3000. Short-form transcription is activated.no_speech_threshold is set to 0.5, but will be ignored.
#104 opened 8 months ago
by
rizwanishaq
Transcribing Chinese audio with this model, the output text lacks punctuation
2
#103 opened 9 months ago
by
TaoGoblin
Issues with transcription of Burmese audio
#102 opened 9 months ago
by
hanshupe
How can I return both word level and segment together when using Hugging Face Transformers?
#101 opened 9 months ago
by
lilijy
Inconsistencies in timestamps between chunked and sequential form
4
#100 opened 9 months ago
by
vprosadm
How can we use this model to achieve a real-time trans?
4
#99 opened 9 months ago
by
Von-violet
How to get accuracy of transcription from the model?
5
#98 opened 9 months ago
by
Atulad
Module not found error. No module named GTTS.
#97 opened 9 months ago
by
BeyondStreamlit
Enhancing Pipeline with Speech Probability for Reduced Hallucination
#96 opened 9 months ago
by
rizwanishaq
Rename README.md to persian
#95 opened 9 months ago
by
Zabs
Does OpenAI Whisper support the Urdu (Pakistan) language?
2
#94 opened 9 months ago
by
AamirFarooq
How to mention language parameter on Inference API
1
#93 opened 10 months ago
by
aitransync
Download and Load model on local system.
1
#92 opened 10 months ago
by
RebelloAlbina
How to save the loss value for each step during the training process?
2
#91 opened 10 months ago
by
zhouwen999
Auto speech/languages detection in Real Time streaming?
1
#89 opened 10 months ago
by
sabys
Not transcribing to English
#88 opened 10 months ago
by
aitransync
Does it support Korean translation and speech? :)
#87 opened 10 months ago
by
JinPark
Transcribe a Spanish audio
4
#86 opened 10 months ago
by
Andrews99
Is it possible to access the transcript result in batches after each chunk has finished?
3
#85 opened 10 months ago
by
hanifanggawi
How to download the model, load it, and use it
1
#84 opened 10 months ago
by
r5avindra
[AUTOMATED] Model Memory Requirements
#83 opened 10 months ago
by
model-sizer-bot
Output Discrepancy
1
#82 opened 10 months ago
by
eBisw
Transcription and Translation In the same call
1
#81 opened 10 months ago
by
saalnlp
model in closed network
3
#78 opened 11 months ago
by
iamwhoiamm
Troubleshooting local inference on Whisper
2
#77 opened 11 months ago
by
RESOLVER101757
The new Longform transcription method
5
#76 opened 11 months ago
by
deep-intel
Upload vocab.json
#74 opened 11 months ago
by
smerchi
How to adapt for low resource language?
9
#73 opened 11 months ago
by
Imran1
Problems with AutoProcessor
#72 opened 11 months ago
by
Kwang442
Suddenly all my transcriptions are in English
10
#71 opened 11 months ago
by
WWCF
Transcription in different languages for Punjabi audio
#67 opened 11 months ago
by
jssaluja
I want to use Whisper on a piece of hardware
#66 opened 11 months ago
by
Avinier
How to fix "TypeError: expected str, bytes or os.PathLike object, not NoneType" when specifying the local whisper model
2
#65 opened 12 months ago
by
BenjaminChu
Speaker Embedding
2
#64 opened 12 months ago
by
bertrand-fournel
whisper jax diarization Icelandic
#62 opened 12 months ago
by
Dondada79
Translating English Audio Into Spanish Text
4
#61 opened 12 months ago
by
stvnchnsn
Error with word level timestamps - ValueError: set return_segments=True
5
#60 opened 12 months ago
by
dkincaid
Passing parameters to the model deployed on HF Inference Endpoints
3
#59 opened about 1 year ago
by
dkincaid
Does whisper-large-v3 work on Sagemaker?
3
#58 opened about 1 year ago
by
dkincaid
Which File Shall I Download from The Files and Versions
2
#57 opened about 1 year ago
by
BenjaminChu
Transcribing multiple languages in single audio file
3
#56 opened about 1 year ago
by
supercharge19
Is there any way we can get no_speech_probability from the pipeline?
1
#55 opened about 1 year ago
by
rizwanishaq
How to handle input audio files with either white noise or general noise and no speech
2
#54 opened about 1 year ago
by
unk1911
changed use_flash_attention_2=True to attn_implementation="flash_attention_2"
1
#53 opened about 1 year ago
by
macadeliccc
Whisper parameters?
1
#52 opened about 1 year ago
by
Megatron17