Luca Soldaini
soldni
AI & ML interests
question answering, information retrieval, scientific document processing
Recent Activity
liked
a model
11 days ago
allenai/olmOCR-7B-0225-preview
updated
a dataset
13 days ago
allenai/olmOCR-mix-0225
published
a dataset
13 days ago
allenai/olmOCR-mix-0225
Organizations
soldni's activity
Fix loading and data viewer due to nested dirs
1
#3 opened 3 months ago
by
orionweller

Failed to load dataset
9
#3 opened 6 months ago
by
joelb

latest update?
2
#8 opened 11 months ago
by
fkov
update chat template
#1 opened 4 months ago
by
soldni

Seeing Arxiv content in the Algebraic Stack subset
3
#2 opened 6 months ago
by
dangerzone

Add `transformers` as library_name
#2 opened 5 months ago
by
Wauplin

How to run it on a mobile device?
3
#1 opened 6 months ago
by
KoiSikhaDo
Add proper library name
#3 opened 6 months ago
by
osanseviero

accidentally released?
1
#1 opened 6 months ago
by
Fizzarolli

What is the total # tokens after sampling proportion? 1.7T or 1.65T
3
#36 opened 10 months ago
by
ivanzhouyq

v1_7 update
#28 opened 11 months ago
by
kylel

Does allenai/c4 and the subset C4 in allenai/dolma is the same dataset?
4
#10 opened 12 months ago
by
speiqin
Can't download two files
1
#19 opened about 1 year ago
by
mrgorjan
Prompting to OLMo
2
#8 opened about 1 year ago
by
herambpatil2004
Update README.md
#10 opened over 1 year ago
by
Muennighoff

Add download instructions
#8 opened over 1 year ago
by
Muennighoff

Fix size
#9 opened over 1 year ago
by
Muennighoff

sample for analysis?
1
#1 opened over 1 year ago
by
KnutJaegersberg

Semantic Scholar API metadata for this dataset?
2
#1 opened over 1 year ago
by
MicPie

How to generate one token after the other with Scibert?
1
#4 opened almost 2 years ago
by
junoriosity