mistralai/Mixtral-8x7B-Instruct-v0.1

How to utilize all GPUs when device="balanced_low_0" in GPU setting

2

#185 opened 11 months ago by

kmukeshreddy

Update README.md

#184 opened 11 months ago by

alamati

Is function calling (tools) supported?

1

#183 opened 11 months ago by

TomerRobusta

Getting cut-off responses with Mixtral 8x7B-Instruct-v0.1 mostly in Date of Birth years

3

#182 opened 11 months ago by

keskival

How can I run it on multiple GPUs?

11

#181 opened 11 months ago by

barbery

Where is the mixtral-8x7b's tokenizer encoder? Is there a specific repository or node module?

1

#180 opened 11 months ago by

RamanSB

What is the max token limit on this model?

2

#179 opened 11 months ago by

RamanSB

Finetuning Mixtral 8x7B Instruct-v0.1 using Transformers

2

#178 opened 11 months ago by

Ateeqq

Update chat template to resemble the prompt as stated in the model card.

7

#176 opened 11 months ago by

nilsec

max_sequence_length

1

#175 opened 11 months ago by

Ravnoor1

Awesome. I Got Very Good Responses, However...

#174 opened 11 months ago by deleted

🚩 Report

#173 opened 11 months ago by

SwatiM

How to run the full model?

2

#171 opened 11 months ago by

dounykim

Is there a working/quantized/exl2 (etc) version that will fit on a single 24GB video card (4090)

2

#170 opened 11 months ago by

cleverest

403 error

1

#169 opened 11 months ago by

minhphan-qbe

Adding Evaluation Results

#168 opened 11 months ago by

leaderboard-pr-bot

Rename README.md to RegulusOne

#167 opened 11 months ago by

Theguy666

Help: CUDA Out of Memory. Hardware requirements.

2

#147 opened 12 months ago by

zebfreeman

Update README.md

#146 opened 12 months ago by

frank76rm

Experimental use

#144 opened 12 months ago by

yassineelkhadiri14

TemplateError: Conversation roles must alternate user/assistant/user/assistant/...

4

#143 opened 12 months ago by

quamer23

Is instruction format necessary

2

#142 opened 12 months ago by

supercharge19

[AUTOMATED] Model Memory Requirements

3

#141 opened 12 months ago by

model-sizer-bot

Update README.md

#140 opened 12 months ago by

woodyk

CUDA out-of-memory issue when deploying mistralai/Mixtral-8x7B-Instruct-v0.1 on AWS "ml.g5.48xlarge"

1

#139 opened 12 months ago by

sonalisbapte

slow response

1

#138 opened 12 months ago by

bhavanam2809

Sparsity in mixtral

#137 opened 12 months ago by

dpk17

Request: DOI

#136 opened 12 months ago by

Sonny03

HELP!

2

#135 opened 12 months ago by

Dommos

Running on multiple GPUs

5

#134 opened 12 months ago by

kmukeshreddy

Update README.md

#133 opened 12 months ago by

gmverbas

How to format custom dataset to finetune Mixtral with TRL SFT script?

#132 opened 12 months ago by

icpro

How to run the code on Colab Free Tier or macOS?

16

#131 opened 12 months ago by

dounykim

Different answer after each request

2

#130 opened 12 months ago by

amin2557

How to finetune the model?

2

#129 opened 12 months ago by

akasranjan

How many resources are needed to run Mixtral?

1

#128 opened 12 months ago by

rkhapre

Update README.md

#126 opened about 1 year ago by

mariakatosvich

The Inference API endpoint gives a wrongly formatted answer based on the given context, but works well in the example Spaces. How can we fix this?

9

#125 opened about 1 year ago by

rkhapre

Request: DOI

#124 opened about 1 year ago by

jsr2

Update README.md

#123 opened about 1 year ago by

Pawamami

What is the max input token limit of this model?

1

#122 opened about 1 year ago by

vaidehirao

addd

1

#121 opened about 1 year ago by

seedeera

Request: SDFSDFSD

1

#120 opened about 1 year ago by

seedeera

Consistency check failed - model-00019-of-00019.safetensors

#118 opened about 1 year ago by

br1-pist

Difference in EOS token between Mistral/Mixtral and LLAMA.

1

#117 opened about 1 year ago by

xkszltl

Model Output is Changed

9

#116 opened about 1 year ago by

AnzaniAI

The chat template doesn't support a system prompt

6

#114 opened about 1 year ago by

sam-kap

How to get 'output_router_logits'

1

#113 opened about 1 year ago by

cts13

Run inference on 2 GPUs

1

#112 opened about 1 year ago by

bweinstein123

Running a 4-bit Quantized 7B Model on a PC: Feasibility and Insights

4

#109 opened about 1 year ago by

edw-hug-face