RoPE scaling and max_position_embeddings

#12 · opened by ag0

Hello,

In config.json, a linear rope_scaling with a factor of 8 is defined, and max_position_embeddings has been increased to 32768.

However, the Hugging Face Llama 2 documentation specifies that max_position_embeddings should not be updated when a RoPE scaling strategy is used:
https://huggingface.co./docs/transformers/main/model_doc/llama2#transformers.LlamaConfig.rope_scaling

Wouldn't the existing config result in RoPE scaling being applied twice (especially when loading with trust_remote_code=False)?

This should be fixed.
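
For reference, here's a quick way to check both fields as shipped (the repo id below is just a placeholder for this model):

```python
# Minimal check of the two fields in question; "<this-model-repo>" is a
# placeholder for this model's actual repo id.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("<this-model-repo>")
print(config.max_position_embeddings)  # 32768 in the current config.json
print(config.rope_scaling)             # {"type": "linear", "factor": 8.0}
```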

Together org

Hi @ag0, thanks for bringing this up! I think this only affects NTK scaling, not the linear scaling adopted here: https://github.com/huggingface/transformers/blob/fdd81aea12f06e24ab5cf5ba3c7316df3ab1a779/src/transformers/models/llama/modeling_llama.py#L135-L144
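
To make the difference concrete, here is a rough sketch (paraphrasing the linked code, with Llama-2-style values assumed for the head dim, rope base, and original context length): linear scaling only divides the position indices by the factor, so max_position_embeddings just sizes the cached cos/sin table, while the dynamic NTK variant folds max_position_embeddings into its rescaled base.

```python
import torch

dim, base, factor = 128, 10000.0, 8.0   # assumed Llama-2-style head dim and rope base
max_position_embeddings = 4096          # original Llama-2 context length
seq_len = 32768                         # extended context we want to cover

inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))

# Linear scaling (what this repo uses): positions are divided by the factor.
# max_position_embeddings never enters the math; it only determines how large
# a cos/sin cache gets pre-built.
t_linear = torch.arange(seq_len).float() / factor
angles_linear = torch.outer(t_linear, inv_freq)

# Dynamic NTK scaling (for contrast): the rotary base is rescaled using
# max_position_embeddings, so bumping that field *would* change the result.
ntk_base = base * (
    (factor * seq_len / max_position_embeddings) - (factor - 1)
) ** (dim / (dim - 2))
inv_freq_ntk = 1.0 / (ntk_base ** (torch.arange(0, dim, 2).float() / dim))
angles_ntk = torch.outer(torch.arange(seq_len).float(), inv_freq_ntk)
```

So with the linear strategy, the enlarged max_position_embeddings shouldn't cause a second round of scaling; it only affects the cache size.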

Let us know what you think! :)
