view article Article Drag GAN - Interactive Point-based Manipulation on the Generative Image Manifold By hwaseem04 • Dec 17, 2023 • 2
Adding Conditional Control to Text-to-Image Diffusion Models Paper • 2302.05543 • Published Feb 10, 2023 • 49
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition Paper • 2407.13559 • Published Jul 18, 2024 • 17
Arabic Handwritten Text for Person Biometric Identification: A Deep Learning Approach Paper • 2406.00409 • Published Jun 1, 2024 • 1
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24, 2024 • 189
view article Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • Jun 23, 2024 • 74
Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition Paper • 2406.09630 • Published Jun 13, 2024 • 2