GEM benchmark
AI & ML interests
We develop infrastructure for the evaluation of generated text.
Recent Activity
GEM's activity
prithivMLmods
posted
an
update
4 days ago
fladhak
authored
a
paper
6 days ago
prithivMLmods
posted
an
update
7 days ago
Post
2082
Qwen2VL Models: Vision and Language Processing 🍉
📍FT; [ Latex OCR, Math Parsing, Text Analogy OCRTest ]
❄️Demo : prithivMLmods/Qwen2-VL-2B . The demo includes the Qwen2VL 2B Base Model.
🎯The space handles documenting content from the input image along with standardized plain text. It includes adjustment tools with over 30 font styles, file formatting support for PDF and DOCX, textual alignments, font size adjustments, and line spacing modifications.
📄PDFs are rendered using the ReportLab software library toolkit.
🧵Models :
+ prithivMLmods/Qwen2-VL-OCR-2B-Instruct
+ prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct
+ prithivMLmods/Qwen2-VL-Math-Prase-2B-Instruct
🚀Sample Document :
+ https://drive.google.com/file/d/1Hfqqzq4Xc-3eTjbz-jcQY84V5E1YM71E/view?usp=sharing
📦Collection :
+ prithivMLmods/vision-language-models-67639f790e806e1f9799979f
.
.
.
@prithivMLmods 🤗
📍FT; [ Latex OCR, Math Parsing, Text Analogy OCRTest ]
❄️Demo : prithivMLmods/Qwen2-VL-2B . The demo includes the Qwen2VL 2B Base Model.
🎯The space handles documenting content from the input image along with standardized plain text. It includes adjustment tools with over 30 font styles, file formatting support for PDF and DOCX, textual alignments, font size adjustments, and line spacing modifications.
📄PDFs are rendered using the ReportLab software library toolkit.
🧵Models :
+ prithivMLmods/Qwen2-VL-OCR-2B-Instruct
+ prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct
+ prithivMLmods/Qwen2-VL-Math-Prase-2B-Instruct
🚀Sample Document :
+ https://drive.google.com/file/d/1Hfqqzq4Xc-3eTjbz-jcQY84V5E1YM71E/view?usp=sharing
📦Collection :
+ prithivMLmods/vision-language-models-67639f790e806e1f9799979f
.
.
.
@prithivMLmods 🤗
prithivMLmods
posted
an
update
8 days ago
Post
3170
🎄 Here Before - Xmas🎅✨
🧑🏻🎄Models
+ [ Xmas 2D Illustration ] : strangerzonehf/Flux-Xmas-Illustration-LoRA
+ [ Xmas 3D Art ] : strangerzonehf/Flux-Xmas-3D-LoRA
+ [ Xmas Chocolate ] : strangerzonehf/Flux-Xmas-Chocolate-LoRA
+ [ Xmas Isometric Kit ] : strangerzonehf/Flux-Xmas-Isometric-Kit-LoRA
+ [ Xmas Realpix ] : strangerzonehf/Flux-Xmas-Realpix-LoRA
+ [ Xmas Anime ] : strangerzonehf/Flux-Anime-Xmas-LoRA
❄️Collections
+ [ Xmas Art ] : strangerzonehf/christmas-pack-6758b199487adafaddb68f82
+ [ Stranger Zone Collection ] : prithivMLmods/stranger-zone-collections-org-6737118adcf2cb40d66d0c7e
🥶Page
+ [ Stranger Zone ] : https://huggingface.co./strangerzonehf
.
.
.
@prithivMLmods 🤗
🧑🏻🎄Models
+ [ Xmas 2D Illustration ] : strangerzonehf/Flux-Xmas-Illustration-LoRA
+ [ Xmas 3D Art ] : strangerzonehf/Flux-Xmas-3D-LoRA
+ [ Xmas Chocolate ] : strangerzonehf/Flux-Xmas-Chocolate-LoRA
+ [ Xmas Isometric Kit ] : strangerzonehf/Flux-Xmas-Isometric-Kit-LoRA
+ [ Xmas Realpix ] : strangerzonehf/Flux-Xmas-Realpix-LoRA
+ [ Xmas Anime ] : strangerzonehf/Flux-Anime-Xmas-LoRA
❄️Collections
+ [ Xmas Art ] : strangerzonehf/christmas-pack-6758b199487adafaddb68f82
+ [ Stranger Zone Collection ] : prithivMLmods/stranger-zone-collections-org-6737118adcf2cb40d66d0c7e
🥶Page
+ [ Stranger Zone ] : https://huggingface.co./strangerzonehf
.
.
.
@prithivMLmods 🤗
Post
6441
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥
How? By combining step-wise reward models with tree search algorithms :)
We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"
We're open sourcing the full recipe and sharing a detailed blog post.
In our blog post we cover:
📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.
🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.
🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM
Here's the links:
- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute
- Code: https://github.com/huggingface/search-and-learn
Enjoy!
How? By combining step-wise reward models with tree search algorithms :)
We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"
We're open sourcing the full recipe and sharing a detailed blog post.
In our blog post we cover:
📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.
🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.
🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM
Here's the links:
- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute
- Code: https://github.com/huggingface/search-and-learn
Enjoy!
gentaiscool
authored
15
papers
10 days ago
Attention-Based LSTM for Psychological Stress Detection from Spoken Language Using Distant Supervision
Paper
•
1805.12307
•
Published
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Paper
•
2309.10661
•
Published
•
1
XPersona: Evaluating Multilingual Personalized Chatbot
Paper
•
2003.07568
•
Published
Multilingual Large Language Models Are Not (Yet) Code-Switchers
Paper
•
2305.14235
•
Published
Multi-lingual and Multi-cultural Figurative Language Understanding
Paper
•
2305.16171
•
Published
IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding
Paper
•
2009.05387
•
Published
IndoRobusta: Towards Robustness Against Diverse Code-Mixed Indonesian Local Languages
Paper
•
2311.12405
•
Published
Are Multilingual Models Effective in Code-Switching?
Paper
•
2103.13309
•
Published
IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation
Paper
•
2104.08200
•
Published
LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization
Paper
•
2401.06034
•
Published
Greenformer: Factorization Toolkit for Efficient Deep Neural Networks
Paper
•
2109.06762
•
Published
•
1
Few-Shot Bot: Prompt-Based Learning for Dialogue Systems
Paper
•
2110.08118
•
Published
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Paper
•
2112.02721
•
Published
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation
Paper
•
2112.06223
•
Published
SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages
Paper
•
2402.08638
•
Published
•
1