Very excited to share the first two official Gemma variants from Google! Today at Google Cloud Next, we announced cutting-edge models for code and research!
First, CodeGemma, a new variant specialized for code. Second, RecurrentGemma (google/recurrentgemma-release-66152cbdd2d6619cb1665b7a), which is based on the outstanding Google DeepMind research behind Griffin: https://arxiv.org/abs/2402.19427. RecurrentGemma is a research variant that enables higher throughput and a much smaller memory footprint. We are excited about new architectures, especially in the lightweight Gemma sizes, where innovations like RecurrentGemma can scale modern AI to many more use cases.
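If you want to kick the tires, here is a minimal sketch of running RecurrentGemma with transformers. The google/recurrentgemma-2b checkpoint name and the minimum transformers version are assumptions on my part, not part of this announcement:

```python
# Minimal sketch: generate text with RecurrentGemma via transformers.
# Assumes the google/recurrentgemma-2b checkpoint and a transformers
# release that includes the RecurrentGemma architecture (>= 4.40).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/recurrentgemma-2b"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

inputs = tokenizer("The Griffin architecture mixes", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```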
For details on the launches of these models, check out our launch blog -- and please do not hesitate to send us feedback. We are excited to see what you build with CodeGemma and RecurrentGemma!
Huge thanks to the Hugging Face team for helping ensure that these models work flawlessly in the Hugging Face ecosystem at launch!
New base pretrained models on the Open LLM Leaderboard!
Two new OSS models from Google, which is getting back in the game! The 7B ranks 2nd on the leaderboard and beats Mistral, notably on GSM8K (math).
I am thrilled to announce Gemma, new 2B and 7B models from Google, based on the same research and technology used to train the Gemini models! These models achieve state-of-the-art performance for their size and are available starting today across Transformers, Google Cloud, and many other surfaces worldwide.
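As a quick, hedged sketch of what the Transformers launch looks like in practice (the google/gemma-7b checkpoint name and version requirement are my assumptions; gated checkpoints also require accepting the license and logging in with huggingface-cli):

```python
# Minimal sketch: run Gemma 7B through the transformers pipeline.
# Assumes the google/gemma-7b checkpoint and a transformers release
# with Gemma support (>= 4.38).
import torch
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="google/gemma-7b",  # assumed checkpoint name
    torch_dtype=torch.bfloat16,
    device_map="auto",  # requires the accelerate package
)
print(generator("Gemma is a family of", max_new_tokens=32)[0]["generated_text"])
```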
These launches are the product of an outstanding collaboration between the Google DeepMind and Hugging Face teams over the last few months -- very proud of the work both teams have done, from Vertex AI integration to optimization across the stack. Read more about the partnership from @philschmid, @osanseviero, and @pcuenq on the launch blog: https://huggingface.co./blog/gemma
More information below if you are curious about training details, eval results, and safety characteristics!