Martial Terran
MartialTerran
AI & ML interests
I, Martial Terran am leading a Group to build solar-powered TimeCapsuleTeacher(TM} GPT-powered laptop computers, to provide Language, Math and Science Education to Non-English-Speaking people of the future in a Post-Apophis World.
Recent Activity
updated
a model
10 days ago
MartialTerran/coherent_text_from_1_megabyte_GPT2_model
liked
a model
10 days ago
MartialTerran/coherent_text_from_1_megabyte_GPT2_model
liked
a model
10 days ago
MartialTerran/Toy_GPTs_LLMs_for_CPU_Educational
Organizations
MartialTerran's activity
Where is the model.py that will run these Parameters on Windows Python in CMD console?
1
#1 opened about 1 month ago
by
MartialTerran
Where is SmolLM2_model.py???
2
#1 opened 2 months ago
by
MartialTerran
Size Mismatch in safetensors file
6
#3 opened 2 months ago
by
MartialTerran
DragonAI-Python-SmolLM2_model.py???
3
#1 opened 2 months ago
by
MartialTerran
Under-100M Parameter for detecting 20 Marathi numbers?
3
#1 opened 2 months ago
by
MartialTerran
Error. Crash. "The attention mask is not set and cannot be inferred from input
1
#8 opened 2 months ago
by
MartialTerran
Qwen2 sample model.py does not work.
7
#7 opened 2 months ago
by
MartialTerran
B/c Size Mismatch, Cant use from transformers import LlamaForCausalLM as workaround.
1
#5 opened 2 months ago
by
MartialTerran
GPT2_model.py
#1 opened 2 months ago
by
MartialTerran
Where is SmolLM2_model.py????
#1 opened 2 months ago
by
MartialTerran
Safetensors size mismatch.
5
#4 opened 2 months ago
by
MartialTerran
Sample Model Script for bfloat16 downloads safetensors parameters files then declares mismatch in their dimensions.
1
#3 opened 2 months ago
by
MartialTerran
Need Help to build a SmolLM2_360M_model.py
1
#2 opened 2 months ago
by
MartialTerran
Distinguishing between speech and non speech
3
#74 opened almost 2 years ago
by
CarelessWhisperer
Phoneme recognition
6
#86 opened almost 2 years ago
by
dg96
Whisper Finetuning - Validation loss is increasing but WER is Decreasing
2
#107 opened about 1 year ago
by
anahar
Storing Spelling information in LLMs
6
#2 opened 3 months ago
by
MartialTerran
Pad Token not uniquely defined?
#3 opened 2 months ago
by
MartialTerran
Optimizing Qwen Coder Models (1.5B & 3B) for Python and Edge Deployment
#6 opened 3 months ago
by
MartialTerran