Problem using the Mistral AI model API from Hugging Face
Hi, I am using the Mistral-7B-Instruct-v0.1 model through Hugging Face's API for question answering over a PDF. It works, but the response is short: it gets cut off halfway through, after one sentence. Please help.
I have the exact same problem: I can only get around 20-30 tokens in the response. I wonder if it's an internal limitation of the model.
Please let me know if you find a solution.
Can you attach an image of the code you are using for generation? It's working fine for me.
Can you also try with model.generate(**inputs, max_new_tokens=350)? The default is 20, which might explain what's happening.
You can make a POST request with your inputs (and your query) and add a "parameters" field. Increase the max_new_tokens value to get more text:
```
{
  "inputs": inputs,
  "parameters": {
    "max_new_tokens": 100,
    "temperature": 0.5,
    "top_k": 40,
    "top_p": 0.95,
    "repetition_penalty": 1.1
  }
}
```
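For reference, here is a minimal sketch of that POST request in Python using the `requests` library. The endpoint URL follows the usual Inference API pattern for this model, and the bearer token is a placeholder you'd replace with your own; treat the generation parameters as starting points, not definitive values.

```python
import requests

# Standard Inference API endpoint pattern for this model (assumption: the
# hosted inference endpoint is what you are calling).
API_URL = "https://api-inference.huggingface.co/models/mistralai/Mistral-7B-Instruct-v0.1"
HEADERS = {"Authorization": "Bearer hf_..."}  # placeholder: use your own token


def build_payload(prompt: str, max_new_tokens: int = 350) -> dict:
    """Build the request body, raising max_new_tokens above the default of 20."""
    return {
        "inputs": prompt,
        "parameters": {
            "max_new_tokens": max_new_tokens,
            "temperature": 0.5,
            "top_k": 40,
            "top_p": 0.95,
            "repetition_penalty": 1.1,
        },
    }


def query(prompt: str) -> str:
    """Send the prompt to the Inference API and return the generated text."""
    response = requests.post(API_URL, headers=HEADERS, json=build_payload(prompt))
    response.raise_for_status()
    return response.json()[0]["generated_text"]
```

With a larger `max_new_tokens`, the answer should no longer stop after one sentence; responses may still end early if the model emits an end-of-sequence token on its own.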