openai/whisper-large-v3

#110 opened 8 months ago by

Pytagora

supress the number in whisper model

#109 opened 8 months ago by

saichand09

Upload Jens Wörle Onyo.m4a

#106 opened 8 months ago by

lennartuh

Meaningless or repetitive sentences continue to be generated.

#105 opened 8 months ago by

kbuwel

Audio input consists of only 3000. Short-form transcription is activated.no_speech_threshold is set to 0.5, but will be ignored.

#104 opened 8 months ago by

rizwanishaq

transcribing chinese audio with the this model, the output text lacks punctuation

#103 opened 9 months ago by

TaoGoblin

Issues with transcription of Burmese audio

#102 opened 9 months ago by

hanshupe

How can i return both word level and segment together when using hugging face transformer?

#101 opened 9 months ago by

lilijy

Incoherences in timestamps between chunked and sequential form

#100 opened 9 months ago by

vprosadm

How we can use this model to achieve a real-time trans？

#99 opened 9 months ago by

Von-violet

How to get accuracy of transcription from the model?

5

#98 opened 9 months ago by

Atulad

Module not found error. No module named GTTS.

#97 opened 9 months ago by

BeyondStreamlit

Enhancing Pipeline with Speech Probability for Reduced Hallucination

#96 opened 9 months ago by

rizwanishaq

Rename README.md to persian

#95 opened 9 months ago by

Zabs

OpenAI Whisper Support Urdu (Pakistan) language?

#94 opened 9 months ago by

AamirFarooq

How to mention lanuage parameter on inference api

#93 opened 10 months ago by

aitransync

Download and Load model on local system.

#92 opened 10 months ago by

RebelloAlbina

How to save the loss value for each step during the training process?

#91 opened 10 months ago by

zhouwen999

Auto speech/languages detection in Real Time streaming?

#89 opened 10 months ago by

sabys

not transcribing to english

#88 opened 10 months ago by

aitransync

Does it support Korean translation and speech? :)

#87 opened 10 months ago by

JinPark

Transcript an Spanish audio

#86 opened 10 months ago by

Andrews99

is it possible to access the transcript result in batches after each chunk has finished?

#85 opened 10 months ago by

hanifanggawi

how to download model and load model and use it

#84 opened 10 months ago by

r5avindra

[AUTOMATED] Model Memory Requirements

#83 opened 10 months ago by

model-sizer-bot

Output Discrepancy

#82 opened 10 months ago by

eBisw

Transcription and Translation In the same call

#81 opened 10 months ago by

saalnlp

model in closed network

#78 opened 11 months ago by

iamwhoiamm

Trouble shooting local inference on whisper

#77 opened 11 months ago by

RESOLVER101757

The new Longform transcription method

5

#76 opened 11 months ago by

deep-intel

Upload vocab.json

#74 opened 11 months ago by

smerchi

How to adapt for low resource language?

9

#73 opened 11 months ago by

Imran1

Problems with AutoProcessor

#72 opened 11 months ago by

Kwang442

Suddenly all my transcriptions are in English

10

#71 opened 11 months ago by

WWCF

Transcription in different languages for Punjabi audio

#67 opened 11 months ago by

jssaluja

I want to use Whisper on a piece of hardware

#66 opened 11 months ago by

Avinier

How to fix "TypeError: expected str, bytes or os.PathLike object, not NoneType" when specifying the local whisper model

#65 opened 12 months ago by

BenjaminChu

Speaker Embedding

#64 opened 12 months ago by

bertrand-fournel

whisper jax diarization Icelandic

#62 opened 12 months ago by

Dondada79

Translating English Audio Into Spanish Text

#61 opened 12 months ago by

stvnchnsn

Error with word level timestamps - ValueError: set return_segments=True

5

#60 opened 12 months ago by

dkincaid

Passing parameters to the model deployed on HF Inference Endpoints

#59 opened about 1 year ago by

dkincaid

Does whisper-large-v3 work on Sagemaker?

#58 opened about 1 year ago by

dkincaid

Which File Shall I Download from The Files and Versions

#57 opened about 1 year ago by

BenjaminChu

Transcribing multiple languages in single audio file

#56 opened about 1 year ago by

supercharge19

Is there any way we can get no_speech_probability from the pipeline?

#55 opened about 1 year ago by

rizwanishaq

how to handle input audio files with either white noise or general noise and no speech

#54 opened about 1 year ago by

unk1911

changed use_flash_attention_2=True to attn_implementation="flash_attention_2"

#53 opened about 1 year ago by

macadeliccc

Whisper parameters?