1 3 1

Lukas

sirluk

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks

authored a paper 3 months ago

One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation

upvoted a paper 3 months ago

Retrieval-Augmented Decision Transformer: External Memory for In-context RL

View all activity

Articles

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Oct 7, 2024

• 11

Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA

Jan 22, 2024

• 14

Organizations

sirluk's activity

upvoted a paper 2 months ago

A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks

Paper • 2410.22391 • Published Oct 29, 2024 • 22

authored a paper 3 months ago

One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation

Paper • 2410.07170 • Published Oct 9, 2024 • 15

upvoted 2 papers 3 months ago

Retrieval-Augmented Decision Transformer: External Memory for In-context RL

Paper • 2410.07071 • Published Oct 9, 2024 • 6

One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation

Paper • 2410.07170 • Published Oct 9, 2024 • 15

published an article 3 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 11

New activity in meta-llama/Llama-3.1-8B 4 months ago

apply_chat_template method not working correctly for llama 3 tokenizer

#35 opened 5 months ago by

sirluk

liked a dataset 7 months ago

bigcode/the-stack

Viewer • Updated Apr 13, 2023 • 546M • 3.92k • 756

published an article 12 months ago

Article

Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA

•

Jan 22, 2024

• 14