π 3 text-encoders: 2 CLIPs, one T5-XXL; plug-and-play: removing the larger one maintains competitiveness
ποΈ Dataset was deduplicated with SSCD which helped with memorization (no more details about the dataset tho)
Variants π A DPO fine-tuned model showed great improvement in prompt understanding and aesthetics βοΈ An Instruct Edit 2B model was trained, and learned how to do text-replacement
Results β State of the art in automated evals for composition and prompt understanding β Best win rate in human preference evaluation for prompt understanding, aesthetics and typography (missing some details on how many participants and the design of the experiment)
MusicLang is a controllable model for music generation:
> π¦ Discover the LLAMA2 architecture, trained from scratch for symbolic music generation, ensuring exceptional quality; > π¨βπ¨ Unleash your creativity by extending an existing music, or create new ones from scratch; > π€ Integrate MusicLang into your applications, with an inference optimized for CPUs written in C, other integrations and optimizations coming soon.
In the space, youβll find :
1οΈβ£ MusicLang foundation model: our fondation model for creating and generating original midi soundtracks musiclang/musiclang-v2;
3οΈβ£ MusicLang Language:a new language for tonal music. This language allows composers to load, write, transform and predict symbolic music in a simple, condensed and high level manner https://github.com/MusicLang/musiclang;
I am thrilled to announce Gemma, new 2B and 7B models from Google, based on the same research and technology used to train the Gemini models! These models achieve state-of-the-art performance for their size, and are launched across Transformers, Google Cloud, and many other surfaces worldwide starting today.
These launches are the product of an outstanding collaboration between the Google DeepMind and Hugging Face teams over the last few months -- very proud of the work both teams have done, from integration with Vertex AI to optimization across the stack. Read more about the partnership in the main launch by @philschmid@osanseviero@pcuenq on the launch blog: https://huggingface.co./blog/gemma
More information below if you are curious about training details, eval results, and safety characteristics!