-
CodePlan: Repository-level Coding using LLMs and Planning
Paper โข 2309.12499 โข Published โข 74 -
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Paper โข 2310.08588 โข Published โข 35 -
SALMON: Self-Alignment with Principle-Following Reward Models
Paper โข 2310.05910 โข Published โข 2 -
Lemur: Harmonizing Natural Language and Code for Language Agents
Paper โข 2310.06830 โข Published โข 32
Anthony W Figueroa
THEFIG
ยท
AI & ML interests
None yet
Recent Activity
liked
a model
5 days ago
google/timesfm-2.0-500m-pytorch
reacted
to
singhsidhukuldeep's
post
with ๐
5 days ago
Exciting breakthrough in Retrieval-Augmented Generation (RAG): Introducing MiniRAG - a revolutionary approach that makes RAG systems accessible for edge devices and resource-constrained environments.
Key innovations that set MiniRAG apart:
Semantic-aware Heterogeneous Graph Indexing
- Combines text chunks and named entities in a unified structure
- Reduces reliance on complex semantic understanding
- Creates rich semantic networks for precise information retrieval
Lightweight Topology-Enhanced Retrieval
- Leverages graph structures for efficient knowledge discovery
- Uses pattern matching and localized text processing
- Implements query-guided reasoning path discovery
Impressive Performance Metrics
- Achieves comparable results to LLM-based methods while using Small Language Models (SLMs)
- Requires only 25% of storage space compared to existing solutions
- Maintains robust performance with accuracy reduction ranging from just 0.8% to 20%
The researchers from Hong Kong University have also contributed a comprehensive benchmark dataset specifically designed for evaluating lightweight RAG systems under realistic on-device scenarios.
This breakthrough opens new possibilities for:
- Edge device AI applications
- Privacy-sensitive implementations
- Real-time processing systems
- Resource-constrained environments
The full implementation and datasets are available on GitHub: HKUDS/MiniRAG
liked
a Space
14 days ago
fabiochiu/text-to-kb
Organizations
None yet
Collections
4
spaces
3
models
1
datasets
None public yet