Akshat Patil

akkky02

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

View all activity

Organizations

akkky02's activity

upvoted a paper about 1 month ago

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7 • 49

upvoted a paper 3 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

liked a Space 7 months ago

Running

846

🚀

Can You Run It? LLM version

upvoted 2 papers 8 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 119

upvoted a collection 8 months ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 20 days ago • 697

upvoted 4 papers 8 months ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 126

updated a model 8 months ago

akkky02/DPO-llama3-8B

Text Generation • Updated Apr 23 • 18

upvoted an article 8 months ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

•

Apr 24

• 59

updated a model 8 months ago

akkky02/OrpoLlama-3-8B

Text Generation • Updated Apr 22 • 12

upvoted an article 8 months ago

Article

Fine-tune Llama 3 with ORPO

•

Apr 22

• 228

updated 2 models 9 months ago

MAdAiLab/Llama2_Instruction_Finetuning_Experiments

Updated Apr 11

MAdAiLab/SLM_vs_LLM_experiments

Updated Apr 11

updated 3 datasets 9 months ago

MAdAiLab/patent_classification

Viewer • Updated Apr 7 • 35k • 43

MAdAiLab/lex_glue_ledgar

Viewer • Updated Apr 7 • 80k • 32

MAdAiLab/lex_glue_scotus

Viewer • Updated Apr 7 • 7.8k • 38

upvoted a paper 9 months ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 125