Turn video uploads into real-time narration and questions
Engage in multi-modal conversations with images and videos