Martial Terran
MartialTerran
AI & ML interests
I, Martial Terran, am leading a group to build solar-powered TimeCapsuleTeacher(TM) GPT-powered laptop computers to provide language, math, and science education to non-English-speaking people of the future in a post-Apophis world.
Recent Activity
new activity
6 days ago
vonjack/SmolLM2-1.7B-Merged: Where is SmolLM2_model.py???
upvoted a paper
10 days ago
Transformers Can Do Arithmetic with the Right Embeddings
liked a model
16 days ago
MartialTerran/Toy_GPTs_LLMs_for_CPU_Educational
MartialTerran's activity
Where is SmolLM2_model.py???
2
#1 opened 23 days ago by MartialTerran
Size Mismatch in safetensors file
6
#3 opened 24 days ago by MartialTerran
DragonAI-Python-SmolLM2_model.py???
3
#1 opened 23 days ago by MartialTerran
Under-100M Parameter for detecting 20 Marathi numbers?
3
#1 opened 25 days ago by MartialTerran
Error. Crash. "The attention mask is not set and cannot be inferred from input
1
#8 opened 23 days ago by MartialTerran
Qwen2 sample model.py does not work.
7
#7 opened 23 days ago by MartialTerran
B/c Size Mismatch, Can't use from transformers import LlamaForCausalLM as workaround.
1
#5 opened 23 days ago by MartialTerran
GPT2_model.py
#1 opened 23 days ago by MartialTerran
Where is SmolLM2_model.py????
#1 opened 23 days ago by MartialTerran
Safetensors size mismatch.
5
#4 opened 24 days ago by MartialTerran
Sample Model Script for bfloat16 downloads safetensors parameters files then declares mismatch in their dimensions.
1
#3 opened 24 days ago by MartialTerran
Need Help to build a SmolLM2_360M_model.py
1
#2 opened 24 days ago by MartialTerran
Distinguishing between speech and non speech
3
#74 opened almost 2 years ago by CarelessWhisperer
Phoneme recognition
5
#86 opened over 1 year ago by dg96
Whisper Finetuning - Validation loss is increasing but WER is Decreasing
2
#107 opened 12 months ago by anahar
Storing Spelling information in LLMs
6
#2 opened about 2 months ago by MartialTerran
Pad Token not uniquely defined?
#3 opened about 1 month ago by MartialTerran
Optimizing Qwen Coder Models (1.5B & 3B) for Python and Edge Deployment
#6 opened about 1 month ago by MartialTerran
Duplicates in Train set
1
#12 opened about 1 year ago by Qilex