Dr. Joao Paulo Schwarz Schuler

schuler

AI & ML interests

artificial intelligence

Recent Activity

posted an update 4 days ago
🔮 GPT-3 implemented in pure Free Pascal! https://github.com/joaopauloschuler/gpt-3-for-pascal
updated a model 5 days ago
schuler/experimental-JP47D56
updated a model 5 days ago
schuler/experimental-JP47D56B

Organizations

None yet

Posts (2)

🔮 GPT-3 implemented in pure Free Pascal!
https://github.com/joaopauloschuler/gpt-3-for-pascal

This implementation follows the GPT-3 Small architecture from the landmark paper "Language Models are Few-Shot Learners":
┌───────────────────────┐
│      Input Layer      │
├───────────────────────┤
│  Token & Positional   │
│      Embedding        │
├───────────────────────┤
│   12x Transformer     │
│       Blocks          │
│   - 12 heads          │
│   - 768 hidden dims   │
│   - 3072 intermediate │
├───────────────────────┤
│     Output Layer      │
└───────────────────────┘

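As a sanity check on these dimensions, here is a minimal Free Pascal sketch of the back-of-envelope parameter count. The vocabulary size (50257) and context length (2048) are the values from the GPT-3 paper, not read from this repository, and bias and layer-norm parameters are omitted for brevity:

program GPT3SmallParams;
{$mode objfpc}
const
  Layers  = 12;
  Hidden  = 768;        // model width (12 heads x 64 dims per head)
  FFN     = 4 * Hidden; // 3072 intermediate dimensions
  Vocab   = 50257;      // GPT-3 BPE vocabulary size (assumed)
  Context = 2048;       // GPT-3 context length (assumed)
var
  PerBlock, Total: int64;
begin
  PerBlock := 4 * Hidden * Hidden   // Q, K, V and output projections
            + 2 * Hidden * FFN;     // feed-forward up and down projections
  Total := Layers * PerBlock        // transformer blocks (weights only)
         + Vocab * Hidden           // token embedding
         + Context * Hidden;        // positional embedding
  WriteLn('approx. parameters: ', Total); // ~125 million
end.

Running it prints roughly 125 million parameters, matching the published size of GPT-3 Small.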
Clean Pascal Implementation
// Stack 12 identical transformer blocks (GPT-3 Small depth).
for CntLayer := 1 to {Layers=}12 do
begin
  Result.AddTransformerBlockCAI(
    {Heads=}12,
    {intermediate dimensions=}4*768,
    {NoForward=}true,
    {HasNorm=}true,
    false
  );
end;
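The {Heads=} style comments are a common Free Pascal idiom for labelling positional arguments in the absence of named parameters, and writing the intermediate size as 4*768 rather than 3072 keeps the standard 4x feed-forward expansion visible at the call site.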

📢 New Research Alert: Making Language Models Smaller & Smarter!

Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance.

The secret? Grouped pointwise convolutions. Yes, we brought a method from computer vision to the transformer arena.

🔑 Key Findings:
• 77% parameter reduction
• Maintained model capabilities
• Improved generalization

Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm
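For intuition on where the savings come from, here is a minimal sketch of the per-layer arithmetic, assuming illustrative channel counts and group count rather than the exact shapes from the report: a dense pointwise (1x1) convolution needs InChannels x OutChannels weights, while splitting the channels into groups divides that count by the number of groups. The report's 77% figure comes from applying the idea across the whole model, so the percentage below differs:

program GroupedPointwise;
{$mode objfpc}
const
  InChannels  = 768; // illustrative, not the report's exact shape
  OutChannels = 768;
  Groups      = 4;   // illustrative group count
var
  Dense, Grouped: int64;
begin
  // Dense pointwise conv: every output channel sees every input channel.
  Dense := InChannels * OutChannels;
  // Grouped: each group maps InChannels/Groups inputs to
  // OutChannels/Groups outputs, so the weight count divides by Groups.
  Grouped := Groups * (InChannels div Groups) * (OutChannels div Groups);
  WriteLn('dense weights:   ', Dense);   // 589824
  WriteLn('grouped weights: ', Grouped); // 147456
  WriteLn('reduction:       ', 100 - (100 * Grouped) div Dense, '%'); // 75%
end.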