Luigi's picture

Luigi PRO

luigi12345

AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago
Autocoder
updated a collection 1 day ago
Autocoder
updated a collection 1 day ago
Autocoder
View all activity

Articles

Organizations

None yet

luigi12345's activity

replied to their post 3 days ago
view reply
Write 100 tests concisely that if passed will make every requirements and conditions and every  related point mentioned by me  throughout this complete conversation  be fully addressed and adjust the code accordingly so it passes all tests.
posted an update 3 days ago
view post
Post
2510
PERFECT FINAL PROMPT for Coding and Debugging.
Step 1: Generate the prompt that if sent to you will make you adjust the script so it meets each and every of the criteria it needs to meet to be 100% bug free and perfect.

Step 2: adjust the script following the steps and instructions in the prompt created in Step 1.

  • 1 reply
ยท
posted an update 7 days ago
view post
Post
450
NEW LAUNCH! Apollo is a new family of open-source video language models by Meta, where 3B model outperforms most 7B models and 7B outperforms most 30B models ๐Ÿงถ

โœจ the models come in 1.5B https://huggingface.co./Apollo-LMMs/Apollo-1_5B-t32, 3B https://huggingface.co./Apollo-LMMs/Apollo-3B-t32 and 7B https://huggingface.co./Apollo-LMMs/Apollo-7B-t32 with A2.0 license, based on Qwen1.5 & Qwen2
โœจ the authors also release a benchmark dataset https://huggingface.co./spaces/Apollo-LMMs/ApolloBench

The paper has a lot of experiments (they trained 84 models!) about what makes the video LMs work โฏ๏ธ

Try the demo for best setup here https://huggingface.co./spaces/Apollo-LMMs/Apollo-3B
they evaluate sampling strategies, scaling laws for models and datasets, video representation and more!
> The authors find out that whatever design decision was applied to small models also scale properly when the model and dataset are scaled ๐Ÿ“ˆ scaling dataset has diminishing returns for smaller models
> They evaluate frame sampling strategies, and find that FPS sampling is better than uniform sampling, and they find 8-32 tokens per frame optimal
> They also compare image encoders, they try a variation of models from shape optimized SigLIP to DINOv2
they find
google/siglip-so400m-patch14-384
to be most powerful ๐Ÿ”ฅ
> they also compare freezing different parts of models, training all stages with some frozen parts give the best yield

They eventually release three models, where Apollo-3B outperforms most 7B models and Apollo 7B outperforms 30B models ๐Ÿ”ฅhttps://huggingface.co./HappyAIUser/Apollo-LMMs-Apollo-3B
  • 2 replies
ยท
posted an update 10 days ago
view post
Post
713
CHATGPT.com o1-MINI FOR FREE? Is this a bug?? Wow, I just converted gpt-4o-mini to o1-mini for free! In ChatGPT.com ! Is this a bug? I used this prompt

use CoT logic extensively to output the longest and richest and most beautiful possible verison of this app, call it MelindaAI Autoimage and make it be able to create 7 up to images with different prompts *the promtp of the user with differnt word order except for the first words that are fixed

  <!DOCTYPE html> <html lang="en"> <head>   <meta charset="UTF-8">   <meta name="viewport" content="width=device-width, initial-scale=1.0" ...

Really got it fully working and behaving in the UI with the complete Logic Section of Thoughts. I mean no surprises as it was quite obvious it was just the same model with backend automated reprompting, but it is quite astonoshing to see it behaving just the same as if I had choosen o1-mini which is limit rated while this one is free and UNLIMITED! Thoughts?
posted an update 13 days ago
view post
Post
1228
๐Ÿ’ฅ ๐—š๐—ผ๐—ผ๐—ด๐—น๐—ฒ ๐—ฟ๐—ฒ๐—น๐—ฒ๐—ฎ๐˜€๐—ฒ๐˜€ ๐—š๐—ฒ๐—บ๐—ถ๐—ป๐—ถ ๐Ÿฎ.๐Ÿฌ, ๐˜€๐˜๐—ฎ๐—ฟ๐˜๐—ถ๐—ป๐—ด ๐˜„๐—ถ๐˜๐—ต ๐—ฎ ๐—™๐—น๐—ฎ๐˜€๐—ต ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น ๐˜๐—ต๐—ฎ๐˜ ๐˜€๐˜๐—ฒ๐—ฎ๐—บ๐—ฟ๐—ผ๐—น๐—น๐˜€ ๐—š๐—ฃ๐—ง-๐Ÿฐ๐—ผ ๐—ฎ๐—ป๐—ฑ ๐—–๐—น๐—ฎ๐˜‚๐—ฑ๐—ฒ-๐Ÿฏ.๐Ÿฒ ๐—ฆ๐—ผ๐—ป๐—ป๐—ฒ๐˜! And they start a huge effort on agentic capabilities.

๐Ÿš€ The performance improvements are crazy for such a fast model:
โ€ฃ Gemini 2.0 Flash outperforms the previous 1.5 Pro model at twice the speed
โ€ฃ Now supports both input AND output of images, video, audio and text
โ€ฃ Can natively use tools like Google Search and execute code

โžก๏ธ If the price is on par with previous Flash iteration ($0.30 / M tokens, to compare with GPT-4o's $1.25) the competition will have a big problem with this 4x cheaper model that gets better benchmarks ๐Ÿคฏ

๐Ÿค– What about the agentic capabilities?

โ€ฃ Project Astra: A universal AI assistant that can use Google Search, Lens and Maps
โ€ฃ Project Mariner: A Chrome extension that can complete complex web tasks (83.5% success rate on WebVoyager benchmark, this is really impressive!)
โ€ฃ Jules: An AI coding agent that integrates with GitHub workflows

I'll be eagerly awaiting further news from Google!

Read their blogpost here ๐Ÿ‘‰ https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/

ยท
posted an update 16 days ago
view post
Post
1591
#Perfect finalm debug prompt:
Step 1: geneate the optimal promtp that if sent to you will amke you aoutptu a cokmpelte fullyw orkign รจrfect UX UI priduciton ready verion fo the scitpt
Step 2: follow th winsturcitones yoriusfl and otuptut eh finals cript
posted an update 26 days ago
view post
Post
440
๐Ÿ”ฅ1๏ธโƒฃminute 9๏ธโƒฃseconds of Chain of Thoughts!!

Actually In my Prompt Engineering lessons, one of the self evaluation criteria I always tell my students to use when they must check the effectivity on prompt guidance is the time length of the โ€œLogic Sectionโ€ of o1. (Of course server speed changes but for comparing different prompts is valid, specially considering that we are fighting with the -obvious- resource saving priorities of the model when run on OpenAI servers )


If anyone wants to share their own attempt, open https://chatgpt.com and give it a try feel free to post it in the comments section!๐ŸŽ

Insert your code here

posted an update 29 days ago
view post
Post
281
Top 20 GitHub Repositories for Autonomous AI Agents in Software Development

Best AI Software Engineer Agents and AI Frameworks and Tools.
Discover the top 20 GitHub repositories for autonomous AI agents in software development. These tools offer features like automated testing, debugging, and codebase management, complete with user-friendly interfaces. Enhance your development workflow with these cutting-edge resources. Read more: https://huggingface.co./blog/luigi12345/ai-autonomous-agents

posted an update about 1 month ago
view post
Post
3711
MinimalScrap
Only Free Dependencies. Save it.It is quite useful uh.


!pip install googlesearch-python requests
from googlesearch import search
import requests
query = "Glaucoma"
for url in search(f"{query} site:nih.gov filetype:pdf", 20):
    if url.endswith(".pdf"):
        with open(url.split("/")[-1], "wb") as f: f.write(requests.get(url).content)
        print("โœ…" + url.split("/")[-1])
print("Done!")

reacted to their post with ๐Ÿ‘ about 1 month ago
view post
Post
2312
Best Debug Prompt

You are a frustrated user who has tested this application extensively. Your job is to list EVERY possible way this app could completely break or become unusable.

For each potential failure:

1. What would make you say "This app is totally broken!"?
2. What exact steps did you take when it broke?
3. What did you see on your screen when it broke?
4. How angry would this make a typical user (1-10)?
5. What would you expect the app to do instead?

Think about:
- What happens if you click buttons really fast?
- What if your internet is slow/disconnected?
- What if you upload weird files/images?
- What if you try to break the app on purpose?
- What if multiple people use it at once?
- What if you use it on mobile/tablet?
- What if you refresh/navigate while it's working?
- What if you paste invalid inputs?
- What if you upload HUGE files?
- What if you leave it running overnight?

Don't worry about being technical - just describe what you saw break as a user.

Format each issue like:

ISSUE #1: [Brief angry user description]
- STEPS TO BREAK IT: [Exactly what you did]
- WHAT HAPPENED: [What you saw]
- ANGER LEVEL: [1-10]
- EXPECTED: [What should happen]

Keep going until you've found every possible way to break this app from a user's perspective!

After outpuiting the list, accoring to the list optmiced Composer edit block to fix the ones severe that make sense to adjust accoirng to gradio limitations and current usage target )dont suppose we need unecessary funcitons)
posted an update about 1 month ago
view post
Post
2312
Best Debug Prompt

You are a frustrated user who has tested this application extensively. Your job is to list EVERY possible way this app could completely break or become unusable.

For each potential failure:

1. What would make you say "This app is totally broken!"?
2. What exact steps did you take when it broke?
3. What did you see on your screen when it broke?
4. How angry would this make a typical user (1-10)?
5. What would you expect the app to do instead?

Think about:
- What happens if you click buttons really fast?
- What if your internet is slow/disconnected?
- What if you upload weird files/images?
- What if you try to break the app on purpose?
- What if multiple people use it at once?
- What if you use it on mobile/tablet?
- What if you refresh/navigate while it's working?
- What if you paste invalid inputs?
- What if you upload HUGE files?
- What if you leave it running overnight?

Don't worry about being technical - just describe what you saw break as a user.

Format each issue like:

ISSUE #1: [Brief angry user description]
- STEPS TO BREAK IT: [Exactly what you did]
- WHAT HAPPENED: [What you saw]
- ANGER LEVEL: [1-10]
- EXPECTED: [What should happen]

Keep going until you've found every possible way to break this app from a user's perspective!

After outpuiting the list, accoring to the list optmiced Composer edit block to fix the ones severe that make sense to adjust accoirng to gradio limitations and current usage target )dont suppose we need unecessary funcitons)
reacted to automatedstockminingorg's post with ๐Ÿ‘€ about 2 months ago
view post
Post
1767
hi everyone, i have just uploaded my first fine tuned model, but serverless inference client is'nt available, its built with transformer architecture and is just a fine tuned llama 8b instruct. does anyone know how to make serverless inference available on a model?
ยท