3 87 87

Shyam Sunder Kumar

theainerd

AI & ML interests

Natural Language Processing

Recent Activity

reacted to clem's post with 🔥 about 8 hours ago

I was chatting with @peakji , one of the cofounders of Manu AI, who told me he was on Hugging Face (very cool!). He shared an interesting insight which is that agentic capabilities might be more of an alignment problem rather than a foundational capability issue. Similar to the difference between GPT-3 and InstructGPT, some open-source foundation models are simply trained to 'answer everything in one response regardless of the complexity of the question' - after all, that's the user preference in chatbot use cases. Just a bit of post-training on agentic trajectories can make an immediate and dramatic difference. As a thank you to the community, he shared 100 invite code first-come first serve, just use “HUGGINGFACE” to get access!

upvoted a paper 2 days ago

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

reacted to Kseniase's post with 🔥 6 days ago

9 types of "Chain-of-..." approaches: Chain-of-Thought (CoT) prompting enhances reasoning in AI models by breaking down complex problems into step-by-step logical sequences. It continues proving its effectiveness, especially in top-performing reasoning models. However, there are other similar methods, that expand CoT and can be used for different purposes. Here are 9 of them: 1. Chain-of-Action-Thought (COAT) -> https://huggingface.co./papers/2502.02508 Helps model decide when to keep thinking, double-check their work, or try a different approach, using special guiding tokens. 2. Chain of Draft (CoD) -> https://huggingface.co./papers/2502.18600 It helps model generate short but meaningful reasoning steps, cutting costs and making processing faster 3. Chain-of-Agents -> https://huggingface.co./papers/2406.02818 Uses multi-agent collaboration: Worker agents process text parts in a structured chain, and manager agent summarizes the results 4. Chain-of-RAG ->https://huggingface.co./papers/2501.14342 Creates retrieval chains, instead of retrieving all info at once. It can dynamically adjust its search process and its parameters like step number 5. Chain-of-Shot Prompting (CoS) -> https://huggingface.co./papers/2502.06428 Helps models pick frames crucial for understanding a video, using a binary video summary and video co-reasoning module. 6. Chain of Hindsight (CoH) -> https://huggingface.co./papers/2302.02676 Converts all feedback into sequences to fine-tune the model and refine outputs 7. Chain-of-Note (CoN) -> https://huggingface.co./papers/2311.09210 Generates sequential reading notes for each retrieved document to assess relevance before integrating info into the final answer 8. Chain of Diagnosis (CoD) -> https://huggingface.co./papers/2407.13301 Transforms the diagnostic process into a diagnostic chain 9. Chain(s)-of-Knowledge -> https://www.turingpost.com/p/cok Enhance LLMs by dynamically pulling in external knowledge to improve accuracy and reduce errors

View all activity

Organizations

Collections 4

models 2

theainerd/Wav2Vec2-large-xlsr-hindi

Automatic Speech Recognition • Updated May 31, 2023 • 803k • • 5

theainerd/wav2vec2-large-xlsr-53-odia

Automatic Speech Recognition • Updated Mar 24, 2021 • 1.82k • 3

datasets

None public yet

Shyam Sunder Kumar

AI & ML interests

Recent Activity

Organizations

Collections 4

Agent Laboratory: Using LLM Agents as Research Assistants

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Training Large Language Models to Reason in a Continuous Latent Space

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Evolving Deeper LLM Thinking

Kimi k1.5: Scaling Reinforcement Learning with LLMs

models 2

theainerd/Wav2Vec2-large-xlsr-hindi

theainerd/wav2vec2-large-xlsr-53-odia

datasets

Shyam Sunder Kumar

AI & ML interests

Recent Activity

Organizations

Collections 4

models 2 Sort: Recently updated

datasets

models 2