George Duan PRO

cctuan
ยท

AI & ML interests

None yet

Recent Activity

liked a Space 27 days ago
multimodalart/flux-lora-lab
updated a model 27 days ago
cctuan/gys1217
View all activity

Organizations

Ai Cloud's profile picture George Space's profile picture Yoly Lab's profile picture

cctuan's activity

reacted to m-ric's post with โค๏ธ about 1 month ago
view post
Post
2381
Single most important thing to do today: ๐—ด๐—ผ ๐˜๐—ฟ๐˜† ๐—ค๐˜„๐—ค ๐—ผ๐—ป ๐—›๐˜‚๐—ด๐—ด๐—ถ๐—ป๐—ด ๐—–๐—ต๐—ฎ๐˜!

๐Ÿ‘‰ https://huggingface.co./chat/models/Qwen/QwQ-32B-Preview
  • 2 replies
ยท
reacted to davanstrien's post with โค๏ธ about 1 month ago
view post
Post
2485
First dataset for the new Hugging Face Bluesky community organisation: bluesky-community/one-million-bluesky-posts ๐Ÿฆ‹

๐Ÿ“Š 1M public posts from Bluesky's firehose API
๐Ÿ” Includes text, metadata, and language predictions
๐Ÿ”ฌ Perfect to experiment with using ML for Bluesky ๐Ÿค—

Excited to see people build more open tools for a more open social media platform!
reacted to maxiw's post with ๐Ÿ‘ about 2 months ago
view post
Post
2071
You can now try out computer use models from the hub to automate your local machine with https://github.com/askui/vision-agent. ๐Ÿ’ป

import time
from askui import VisionAgent

with VisionAgent() as agent:
    agent.tools.webbrowser.open_new("http://www.google.com")
    time.sleep(0.5)
    agent.click("search field in the center of the screen", model_name="Qwen/Qwen2-VL-7B-Instruct")
    agent.type("cats")
    agent.keyboard("enter")
    time.sleep(0.5)
    agent.click("text 'Images'", model_name="AskUI/PTA-1")
    time.sleep(0.5)
    agent.click("second cat image", model_name="OS-Copilot/OS-Atlas-Base-7B")


Currently these models are integrated with Gradio Spaces API. Also planning to add local inference soon!

Currently supported:
- Qwen/Qwen2-VL-7B-Instruct
- Qwen/Qwen2-VL-2B-Instruct
- AskUI/PTA-1
- OS-Copilot/OS-Atlas-Base-7B
ยท
reacted to singhsidhukuldeep's post with ๐Ÿ‘€ 3 months ago
view post
Post
2163
While Google's Transformer might have introduced "Attention is all you need," Microsoft and Tsinghua University are here with the DIFF Transformer, stating, "Sparse-Attention is all you need."

The DIFF Transformer outperforms traditional Transformers in scaling properties, requiring only about 65% of the model size or training tokens to achieve comparable performance.

The secret sauce? A differential attention mechanism that amplifies focus on relevant context while canceling out noise, leading to sparser and more effective attention patterns.

How?
- It uses two separate softmax attention maps and subtracts them.
- It employs a learnable scalar ฮป for balancing the attention maps.
- It implements GroupNorm for each attention head independently.
- It is compatible with FlashAttention for efficient computation.

What do you get?
- Superior long-context modeling (up to 64K tokens).
- Enhanced key information retrieval.
- Reduced hallucination in question-answering and summarization tasks.
- More robust in-context learning, less affected by prompt order.
- Mitigation of activation outliers, opening doors for efficient quantization.

Extensive experiments show DIFF Transformer's advantages across various tasks and model sizes, from 830M to 13.1B parameters.

This innovative architecture could be a game-changer for the next generation of LLMs. What are your thoughts on DIFF Transformer's potential impact?
  • 1 reply
ยท
reacted to KingNish's post with โค๏ธ 4 months ago
reacted to MonsterMMORPG's post with ๐Ÿ”ฅ 6 months ago
view post
Post
6448
Kling AI Video is FINALLY Public (All Countries), Free to Use and MIND BLOWING - Full Tutorial > https://youtu.be/zcpqAxYV1_w

You probably seen those mind blowing AI made videos. And the day has arrived. The famous Kling AI is now worldwide available for free. In this tutorial video I will show you how to register for free with just email to Kling AI and use its mind blowing text to video animation, image to video animation and text to image, and image to image capabilities. This video will show you non-cherry pick results so you will know the actual quality and capability of the model unlike those extremely cherry pick example demos. Still, #KlingAI is the only #AI model that competes with OpenAI's #SORA and it is real to use.

๐Ÿ”— Kling AI Official Website โคต๏ธ
โ–ถ๏ธ https://www.klingai.com/

๐Ÿ”— SECourses Discord Channel to Get Full Support โคต๏ธ
โ–ถ๏ธ https://discord.com/servers/software-engineering-courses-secourses-772774097734074388

๐Ÿ”— Our GitHub Repository โคต๏ธ
โ–ถ๏ธ https://github.com/FurkanGozukara/Stable-Diffusion

๐Ÿ”— Our Reddit โคต๏ธ
โ–ถ๏ธ https://www.reddit.com/r/SECourses/
ยท