Pseudo Lab

non-profit

https://pseudo-lab.com

Pseudo-Lab

Activity Feed Request to join this org

AI & ML interests

Non-profit ML community

Recent Activity

peaceAsh authored a paper 8 days ago

Maya: An Instruction Finetuned Multilingual Multimodal Model

scottsuk0306 authored a paper 19 days ago

Evaluating Language Models as Synthetic Data Generators

kinam0252 authored a paper 23 days ago

Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling

View all activity

pseudolab's activity

peaceAsh

authored a paper 8 days ago

Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published 16 days ago • 25

scottsuk0306

authored a paper 19 days ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published 21 days ago • 43

lunarflu

posted an update 20 days ago

Post

1492

great blogpost! 🔥@wolfram
https://huggingface.co./blog/wolfram/llm-comparison-test-2024-12-04

kinam0252

authored a paper 23 days ago

Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling

Paper • 2411.18664 • Published 28 days ago • 23

Tonic

posted an update about 2 months ago

Post

3391

🙋🏻‍♂️hey there folks,

periodic reminder : if you are experiencing ⚠️500 errors ⚠️ or ⚠️ abnormal spaces behavior on load or launch ⚠️

we have a thread 👉🏻 https://discord.com/channels/879548962464493619/1295847667515129877

if you can record the problem and share it there , or on the forums in your own post , please dont be shy because i'm not sure but i do think it helps 🤗🤗🤗

2 replies

Tonic

posted an update about 2 months ago

Post

1088

boomers still pick zenodo.org instead of huggingface ??? absolutely clownish nonsense , my random datasets have 30x more downloads and views than front page zenodos ... gonna write a comparison blog , but yeah... cringe.

1 reply

Tonic

posted an update 2 months ago

Post

817

🙋🏻‍♂️ hey there folks ,

really enjoying sharing cool genomics and protein datasets on the hub these days , check out our cool new org : https://huggingface.co./seq-to-pheno

scroll down for the datasets, still figuring out how to optimize for discoverability , i do think on that part it will be better than zenodo[dot}org , it would be nice to write a tutorial about that and compare : we already have more downloads than most zenodo datasets from famous researchers !

Tonic

posted an update 2 months ago

Post

1446

hey there folks,

twitter is aweful isnt it ? just getting into the habbit of using hf/posts for shares 🦙🦙

Tonic/on-device-granite-3.0-1b-a400m-instruct

new granite on device instruct model demo , hope you like it 🚀🚀

Tonic

posted an update 2 months ago

Post

984

if you're encountering 500 errors on spaces that seem to work otherwise , kindly consider screenshotting and sharing the link here : https://discord.com/channels/879548962464493619/1295847667515129877

7 replies

Tonic

posted an update 3 months ago

Post

2736

🙋🏻‍♂️hey there folks ,

did you know that https://huggingface.co./lmms-lab released a new version of 🌋🌋Llava on thursday ? Now it has 🎥video understanding !
check it out 👇🏻

collection : lmms-lab/llava-video-661e86f5e8dabc3ff793c944
demo : Tonic/Llava-Video

Tonic

posted an update 3 months ago

Post

1853

🙋🏻‍♂️ Hey there folks ,

🦎Salamandra release by @mvillegas and team
@BSC_CNS https://huggingface.co./BSC-LT is absolutely impressive so far !

perhaps the largest single training dataset of high quality text to date of 7.8 trillion tokens in 35 European languages and code.

the best part : the data was correctly licenced so it's actually future-proof!

the completions model is really creative and instruct fine tuned version is very good also.

now you can use such models for multi-lingual enterprise applications with further finetunes , long response generation, structured outputs (coding) also works.

check out 👇🏻
the collection : BSC-LT/salamandra-66fc171485944df79469043a
the repo : https://github.com/langtech-bsc/salamandra
7B-Instruct demo : Tonic/Salamandra-7B

Tonic

posted an update 3 months ago

Post

1719

@mlabonne hey there 🙋🏻‍♂️ I kinda got obsessed with your great model , and i found the endpoint for it in lambda labs, but basically i got rate limited / banned for trying to make my DPO dataset project, i was wondering if you all had an open ai compatible solution for me to make a great "thinking" sft + dpo dataset with all the splits 🙏🏻🙏🏻 kinda desparate , it's true , but was looking forward to a nice write ups 🚀🚀🚀

1 reply

Tonic

posted an update 3 months ago

Post

2305

Big Congrats on the BIG RELEASE by @mlabonne and team at https://huggingface.co./liquidai ...

testing it out now to make a dataset , i cant hardly wait... but one question 👇🏻 why / wen ? 😅🚀🚀

check out the blog post : https://www.liquid.ai/liquid-foundation-models

1 reply

Tonic

posted an update 3 months ago

Post

1240

🙋🏻‍♂️ Hey there folks,

stepfun-ai/GOT-OCR2_0 is in top trending and spaces of the week for the second week straight !!

This is madness 😱

🚀🚀check out my demo here : Tonic/GOT-OCR

Tonic

posted an update 3 months ago

Post

1088

🙋🏻‍♂️Hey there folks,

Nvidia just released a small 4B Nemotron-mini model , and it works surprisingly well !

you can check it out here :

base : nvidia/Minitron-4B-Base
instruct : nvidia/Nemotron-Mini-4B-Instruct
demo : https://huggingface.co./spaces/Tonic/Nemotron-Mini-4B

hoep you like it 🤗🤗

Tonic

posted an update 3 months ago

Post

2727

🙋🏻‍♂️Hey there folks ,

@ucaslcl released a new OCR model , that's👏🏻👏🏻 fantastic : https://huggingface.co./ucaslcl/GOT-OCR2_0

GPU : Tonic/GOT-OCR
Gradio Demo (Image Edit) : Tonic1/ImageEdit-GOT-OCR

Model : https://huggingface.co./ucaslcl/GOT-OCR2_0
Official demo : https://huggingface.co./spaces/ucaslcl/GOT_online
github : https://github.com/Ucas-HaoranWei/GOT-OCR2.0

4 replies

Tonic

posted an update 4 months ago

Post

1105

🙋🏻‍♂️ hey there folks ,

made an image similarity demo to test out the mistral-community/pixtral-12b-240910 model .

If anyone knows how to generate captions with it , please do let me know x 🚀

here's the demo : Tonic/Pixtral

hope you like it 🤗

Tonic

posted an update 4 months ago

Post

2660

So awesome , now i can deploy a jupyterlab on huggingface and deploy gradio from the jupyterlab

Tonic

posted an update 4 months ago

Post

1089

🙋🏻‍♂️Hey there folks,

Did you see the new coding model from @01-ai ?

collection : 01-ai/yi-coder-66bdb00f5bdd611f9a008f30
demo : Tonic/Yi-Coder-9B

achieves SOTA on benchmarks , 125K context window , 55 languages including Docker, Js and many more 🚀

1 reply

Tonic

posted an update 4 months ago

Post

2525

🙋🏻‍♂️hey there folks ,

✒️InkubaLM has been trained from scratch using 1.9 billion tokens of data for five African languages, along with English and French data, totaling 2.4 billion tokens of data. It is capable of understanding and generating content in five African languages: Swahili, Yoruba, Hausa, isiZulu, and isiXhosa, as well as English and French.

model lelapa/InkubaLM-0.4B
demo Tonic/Inkuba-0.4B

AI & ML interests

Recent Activity

Team members 57

pseudolab's activity