Qian Liu's picture

Qian Liu

SivilTaram

·

http://siviltaram.github.io/

AI & ML interests

Cooking cool things

Recent Activity

updated a model 2 days ago

SivilTaram/tongyao_models

updated a model 2 days ago

SivilTaram/tongyao_models

updated a model 2 days ago

SivilTaram/tongyao_models

View all activity

Organizations

SivilTaram's activity

New activity in akhaliq/anychat 3 months ago

Update app.py

#23 opened 3 months ago by

Create app_sailor.py

#22 opened 3 months ago by

New activity in OpenCoder-LLM/opc-sft-stage1 4 months ago

License

#5 opened 4 months ago by

New activity in OpenCoder-LLM/opc-annealing-corpus 4 months ago

License

#3 opened 4 months ago by

New activity in OpenCoder-LLM/opc-fineweb-code-corpus 4 months ago

Code elements inside web page are badly processed for FineWeb

#2 opened 4 months ago by

commented a paper 5 months ago

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Paper • 2410.07137 • Published Oct 9, 2024 • 7 •

New activity in SivilTaram/starcoder2-documentation 5 months ago

release plan for the rest of the-stack-v2-train-extras

#2 opened 5 months ago by

New activity in microsoft/tapex-large-finetuned-wtq 6 months ago

is it possible to support multiple languages, like Chinese?

#5 opened 8 months ago by

New activity in bigcode/the-stack-v2 7 months ago

"Documentation" data?

#8 opened 12 months ago by

Where is the-stack-v2-train-extras?

#17 opened 12 months ago by

question about starcoder 2 jupyter notebook conversion

#29 opened 7 months ago by

commented a paper 7 months ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 55 •

commented 2 papers 8 months ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 55 •

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 55 •

New activity in sail/regmix-data 8 months ago

[bot] Conversion to Parquet

#1 opened 8 months ago by

parquet-converter

New activity in sail/regmix-data-sample 8 months ago

[bot] Conversion to Parquet

#1 opened 8 months ago by

parquet-converter

commented 2 papers 8 months ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 37 •

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 37 •

commented a paper 9 months ago

Bootstrapping Language Models with DPO Implicit Rewards

Paper • 2406.09760 • Published Jun 14, 2024 • 39 •

New activity in sail/Sailor-14B-Chat 10 months ago

Adding `safetensors` variant of this model

#1 opened 10 months ago by