Multimodal RAG Pejman
Extract and answer questions from PDFs using images
Generate images from text prompts
Process video to detect specified objects
Interact with a chatbot and analyze images with text input
Analyze images to caption, detect objects, and extract text
Generate answers to questions about images