abdullah (Abdullah Abdelrhim)

upvoted a paper 1 day ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published 3 days ago • 88

upvoted a paper 4 days ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published 6 days ago • 28

upvoted a paper 5 days ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published 6 days ago • 28

upvoted a collection 5 days ago

MagpieLM

Collection

Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated about 2 hours ago • 10

upvoted a paper 23 days ago

Generative Verifiers: Reward Modeling as Next-Token Prediction

Paper • 2408.15240 • Published 26 days ago • 12

upvoted 2 papers 26 days ago

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

Paper • 2408.13359 • Published about 1 month ago • 21

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13 • 19

upvoted a collection 27 days ago

ArabianLLM Series | Native Arabic Large Language Models

Collection

This collection is related to native Arabic Large Language Models. It represents different sizes of GPT trained Model for Test Generative • 8 items • Updated 27 days ago • 2

upvoted an article about 1 month ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22

• 76

upvoted 3 papers about 1 month ago

upvoted a paper about 2 months ago

MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

Paper • 2407.18961 • Published Jul 18 • 38

upvoted an article about 2 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29

• 196

upvoted an article 2 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 243

upvoted a collection 3 months ago

InternLM2.5

Collection

14 items • Updated 8 days ago • 67

upvoted a collection 4 months ago

MatMulfree LM

Collection

Pre-trained models for Matmulfree LM. • 4 items • Updated Jun 10 • 24

upvoted a paper 4 months ago

Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Paper • 2405.18952 • Published May 29 • 10

upvoted a collection 4 months ago

sentence-transformers-from-synthetic-data

Collection

Example of using distilabel to generate synthetic triplets data for fine-tuning a Sentence Transformer model • 4 items • Updated Jun 21 • 21

upvoted a paper 4 months ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20 • 33

upvoted a collection 4 months ago

Wikimedia Datasets

Collection

Wikimedia datasets, across languages and modalities, from different Wikimedia projects, on the hub. Not all tested. • 19 items • Updated May 16 • 9

upvoted an article 4 months ago

Article

Introducing the Open Arabic LLM Leaderboard

May 14

• 62

upvoted a paper 4 months ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7 • 13

upvoted 3 papers 5 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30 • 46

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30 • 73

upvoted an article 5 months ago

Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

Apr 29

• 28

upvoted a paper 5 months ago

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21 • 28

upvoted a collection 5 months ago

Text-to-text Generation Models (LLMs, Llama, GPT, ...)

Collection

5130 items • Updated about 1 month ago • 12

upvoted an article 5 months ago

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Jun 4

• 68

upvoted 2 papers 5 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 250

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18 • 52

upvoted an article 5 months ago

Article

Fine-tune Llama 3 with ORPO

Apr 22

• 221

upvoted a collection 6 months ago

Multilingual LLMs Chat Spaces

Collection

Here you can find chat Spaces to interact with and test multilingual models; the goal here is to test on Arabic • 3 items • Updated May 24 • 1

upvoted 5 papers 6 months ago

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Paper • 2404.04167 • Published Apr 5 • 12

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4 • 23

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 103

Leveraging Corpus Metadata to Detect Template-based Translation: An Exploratory Case Study of the Egyptian Arabic Wikipedia Edition

Paper • 2404.00565 • Published Mar 31 • 6

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2 • 55

upvoted a collection 6 months ago

A little guide to building Large Language Models in 2024

Collection

Resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757 • 19 items • Updated Apr 1 • 14

upvoted 2 papers 6 months ago

Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Paper • 2403.20041 • Published Mar 29 • 34

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26 • 28

upvoted 2 collections 6 months ago

🔮 Mixture of Experts

Collection

MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y • 13 items • Updated Aug 16 • 22

Preference Datasets for KTO

Collection

This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Jul 30 • 14

upvoted 4 papers 6 months ago

QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 45

Algorithmic progress in language models

Paper • 2403.05812 • Published Mar 9 • 18

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 59

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11 • 53

upvoted a collection 6 months ago

Awesome Document AI

Collection

A collection of open-source document AI 📄 📝 📈 • 27 items • Updated Mar 11 • 65

upvoted 11 papers 7 months ago

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 61

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 63

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 182

EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

Paper • 2403.02775 • Published Mar 5 • 11

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5 • 93

PALO: A Polyglot Large Multimodal Model for 5B People

Paper • 2402.14818 • Published Feb 22 • 23

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21 • 110

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 76

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19 • 15

GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements

Paper • 2402.10963 • Published Feb 13 • 9

Speculative Streaming: Fast LLM Inference without Auxiliary Models

Paper • 2402.11131 • Published Feb 16 • 41
