Update README.md
Browse files
README.md
CHANGED
@@ -16,27 +16,6 @@ pinned: false
|
|
16 |
| [**Reference docs**](https://easydel.readthedocs.io/en/latest/)
|
17 |
| [**License**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#license-)
|
18 |
|
19 |
-
EasyDeL is an open-source framework designed to enhance and streamline the training process of machine learning models, with a primary focus on
|
20 |
-
|
21 |
-
## Key Features
|
22 |
-
|
23 |
-
- **Diverse Architecture Support**: Seamlessly work with various model architectures including Transformers, Mamba, RWKV, and more.
|
24 |
-
- **Custom Kernels**: EasyDeL supports custom kernels and operation for both GPU (via mosaic and triton) and TPU (via pallas).
|
25 |
-
- **Diverse Model Support**: Implements a wide range of models in JAX, including Falcon, Qwen2, Phi2, Mixtral, Qwen2Moe, Cohere, Dbrx, Phi3, and MPT.
|
26 |
-
- **Advanced Trainers**: Offers specialized trainers like DPOTrainer, ORPOTrainer, SFTTrainer, and VideoCLM Trainer.
|
27 |
-
- **Serving and API Engines**: Provides engines for efficiently serving large language models (LLMs) in JAX.
|
28 |
-
- **Quantization and Bit Operations**: Supports various quantization methods and 8, 6, and 4-bit operations for optimized inference and training.
|
29 |
-
- **Performance Optimization**: Integrates FlashAttention, RingAttention, and other performance-enhancing features.
|
30 |
-
- **Model Conversion**: Supports automatic conversion between JAX-EasyDeL and PyTorch-HF models.
|
31 |
-
|
32 |
-
### Fully Customizable and Hackable 🛠️
|
33 |
-
|
34 |
-
EasyDeL stands out by providing unparalleled flexibility and transparency:
|
35 |
-
|
36 |
-
- **Open Architecture**: Every single component of EasyDeL is open for inspection, modification, and customization. There are no black boxes here.
|
37 |
-
- **Hackability at Its Core**: We believe in giving you full control. Whether you want to tweak a small function or completely overhaul a training loop, EasyDeL lets you do it.
|
38 |
-
- **Custom Code Access**: All custom implementations are readily available and well-documented, allowing you to understand, learn from, and modify the internals as needed.
|
39 |
-
- **Encourage Experimentation**: We actively encourage users to experiment, extend, and improve upon the existing codebase. Your innovations could become the next big feature!
|
40 |
-
- **Community-Driven Development**: Share your custom implementations and improvements with the community, fostering a collaborative environment for advancing ML research and development.
|
41 |
|
42 |
With EasyDeL, you're not constrained by rigid frameworks. Instead, you have a flexible, powerful toolkit that adapts to your needs, no matter how unique or specialized they may be. Whether you're conducting cutting-edge research or building production-ready ML systems, EasyDeL provides the freedom to innovate without limitations.
|
|
|
16 |
| [**Reference docs**](https://easydel.readthedocs.io/en/latest/)
|
17 |
| [**License**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#license-)
|
18 |
|
19 |
+
EasyDeL is an open-source framework designed to enhance and streamline the training process of machine learning models, with a primary focus on JAX. It provides convenient and effective solutions for training and serving JAX models on TPU/GPU at scale.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
20 |
|
21 |
With EasyDeL, you're not constrained by rigid frameworks. Instead, you have a flexible, powerful toolkit that adapts to your needs, no matter how unique or specialized they may be. Whether you're conducting cutting-edge research or building production-ready ML systems, EasyDeL provides the freedom to innovate without limitations.
|