wchai committed on
Commit 243adcd · verified · 1 Parent(s): 01aea46

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +11 -0
README.md CHANGED
@@ -125,4 +125,15 @@ AuroraCap is a multimodal large language model for image and video captioning.
 
  ## Quick Start
  See [Docs](https://github.com/rese1f/aurora/blob/main/docs/auroracap/README.md).
+
+ ## FAQ
+
+ Q: Can token merging be used only during inference?
+
+ A: No. Our experiments show that token merging can also accelerate training while maintaining similar performance. Besides AuroraCap, token merging can also be applied to other LLaVA-like models.
+
+ Q: Why do we provide both official LLaVA-format and Xtuner-format weights for AuroraCap?
+
+ A: Xtuner can save checkpoints in multiple formats, but it currently supports continued training only from the Xtuner format. We therefore provide the model in the Xtuner format for both continued training and inference. In the future, we will also provide the model in the official LLaVA format for both training and inference, enabling quicker SGLang deployment and integration with Transformers.
+
  ## Citation
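
For readers unfamiliar with the token merging mentioned in the FAQ, here is a minimal pure-Python sketch of one common scheme: bipartite matching on cosine similarity followed by averaging. It is an illustration only and is not taken from the AuroraCap code base; the function names `cosine` and `merge_tokens` and the parameter `r` (how many tokens to merge away) are hypothetical, and real implementations operate on batched tensors.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def merge_tokens(tokens, r):
    """Reduce the token count by up to r (hypothetical helper, illustrative only)."""
    # Split the tokens into two alternating sets, A and B.
    a, b = tokens[0::2], tokens[1::2]
    if not b or r <= 0:
        return [list(t) for t in tokens]

    # For each token in A, find its most similar token in B.
    best = []
    for i, ta in enumerate(a):
        scores = [cosine(ta, tb) for tb in b]
        j = max(range(len(b)), key=scores.__getitem__)
        best.append((scores[j], i, j))

    # Merge only the r most similar (A, B) pairs.
    best.sort(key=lambda e: -e[0])
    merged = {i: j for _, i, j in best[:r]}

    # Average each B token with the A tokens merged into it.
    sums = [list(tb) for tb in b]
    counts = [1] * len(b)
    for i, j in merged.items():
        sums[j] = [s + x for s, x in zip(sums[j], a[i])]
        counts[j] += 1

    # Output: unmerged A tokens first, then the (possibly averaged) B tokens.
    kept_a = [list(ta) for i, ta in enumerate(a) if i not in merged]
    merged_b = [[s / c for s in vec] for vec, c in zip(sums, counts)]
    return kept_a + merged_b
```

Because merging only shortens the token sequence passed to later layers, the same reduction can in principle be applied in a training forward pass as well as at inference, which is the point the first FAQ answer makes. Note that this sketch does not preserve the original token order.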