Rajdeep Borgohain's picture

Rajdeep Borgohain

rbgo

AI & ML interests

Solving language barriers.

Recent Activity

upvoted a paper 5 days ago
Phi-4 Technical Report
reacted to benjamin-paine's post with šŸ‘ 11 days ago
Hello HuggingFace šŸ¤—, and happy new year! šŸŽ† I'm thrilled to be releasing the first iteration of a project I've been working on for quite awhile now. It's called Taproot, and it's a seamlessly scalable open-source AI/ML inference engine designed for letting developers build real-time experiences clustered across a small-to-mid-sized cluster, without the burden of hyperscale infrastructure. Along with the server and task framework is a client library for node and the browser. And what good is a server and client without an app to go alongside it? To that end, I'm also releasing Anachrovox, a fun, real-time hands-free voice assistant that can run on mid-level devices in <12GB VRAM, with web search, weather, and other tools. It uses my real-time browser wake-word library to detect utterances of the phrase 'Hey Vox', 'Hi Vox', 'Okay Vox', 'Anachrovox' or just 'Vox' (alongside some others.) Releasing this many things at once will definitely result in bugs, so please report them when sighted! Thank you all! Taproot: https://github.com/painebenjamin/taproot Taproot JS Client: https://github.com/painebenjamin/taproot.js Anachrovox: https://github.com/painebenjamin/anachrovox The Anachrovox Spaces are networked together, balancing load across them to keep all front-ends responsive. You only have to choose what color you like the most! https://huggingface.co./spaces/benjamin-paine/anachrovox https://huggingface.co./spaces/benjamin-paine/anachrovox-amber
upvoted a collection 20 days ago
PaliGemma 2 Release
View all activity

Organizations

Inferless's profile picture Social Post Explorers's profile picture

rbgo's activity

reacted to benjamin-paine's post with šŸ‘ 11 days ago
view post
Post
2373
Hello HuggingFace šŸ¤—, and happy new year! šŸŽ†

I'm thrilled to be releasing the first iteration of a project I've been working on for quite awhile now. It's called Taproot, and it's a seamlessly scalable open-source AI/ML inference engine designed for letting developers build real-time experiences clustered across a small-to-mid-sized cluster, without the burden of hyperscale infrastructure.

Along with the server and task framework is a client library for node and the browser. And what good is a server and client without an app to go alongside it? To that end, I'm also releasing Anachrovox, a fun, real-time hands-free voice assistant that can run on mid-level devices in <12GB VRAM, with web search, weather, and other tools. It uses my real-time browser wake-word library to detect utterances of the phrase 'Hey Vox', 'Hi Vox', 'Okay Vox', 'Anachrovox' or just 'Vox' (alongside some others.)

Releasing this many things at once will definitely result in bugs, so please report them when sighted! Thank you all!

Taproot: https://github.com/painebenjamin/taproot
Taproot JS Client: https://github.com/painebenjamin/taproot.js
Anachrovox: https://github.com/painebenjamin/anachrovox

The Anachrovox Spaces are networked together, balancing load across them to keep all front-ends responsive. You only have to choose what color you like the most!

https://huggingface.co./spaces/benjamin-paine/anachrovox
https://huggingface.co./spaces/benjamin-paine/anachrovox-amber
  • 12 replies
Ā·
liked a Space about 1 month ago
reacted to m-ric's post with šŸ‘ about 2 months ago
view post
Post
789
šŸ” Meta teams use a fine-tuned Llama model to fix production issues in seconds

One of Meta's engineering teams shared how they use a fine-tuned small Llama (Llama-2-7B, so not even a very recent model) to identify the root cause of production issues with 42% accuracy.

šŸ¤” 42%, is that not too low?
āž”ļø Usually, whenever there's an issue in production, engineers dive into recent code changes to find the offending commit. At Meta's scale (thousands of daily changes), this is like finding a needle in a haystack.
šŸ’” So when the LLM-based suggestion is right, it cuts incident resolution time from hours to seconds!

How did they do it?

šŸ”„ Two-step approach:
ā€£ Heuristics (code ownership, directory structure, runtime graphs) reduce thousands of potential changes to a manageable set
ā€£ Fine-tuned Llama 2 7B ranks the most likely culprits

šŸŽ“ Training pipeline:
ā€£ Continued pre-training on Meta's internal docs and wikis
ā€£ Supervised fine-tuning on past incident investigations
ā€£ Training data mimicked real-world constraints (2-20 potential changes per incident)

šŸ”® Now future developments await:
ā€£ Language models could handle more of the incident response workflow (runbooks, mitigation, post-mortems)
ā€£ Improvements in model reasoning should boost accuracy further

Read it in full šŸ‘‰ https://www.tryparity.com/blog/how-meta-uses-llms-to-improve-incident-response
reacted to abhishek's post with šŸ‘ 3 months ago