I've built and deployed Panorama FLUX, a Gradio app for creating ultra-wide panoramic images from three different text prompts using the FLUX.1-schnell model.
It uses a custom "Mixture of Diffusers" pipeline to generate and seamlessly blend each section of the image.
Key Features:

- Multi-Prompt Input: Control the left, center, and right of the scene with unique prompts.
- Seamless Blending: Choose between Cosine and Gaussian blending methods to eliminate seams between tiles.
- Optimized for FLUX.1-schnell: Designed for fast, 4-step generation with embedded guidance.
- Multi-Language Support: On-the-fly translation for prompts written in Korean and Chinese.
- Memory Efficient: Supports both custom (mmgp) and standard diffusers offloading for use on consumer GPUs or in Spaces.
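To give a feel for what Cosine and Gaussian blending mean for overlapping tiles, here is a minimal sketch in plain Python. It is not the app's actual code: it works on a single row of scalar values rather than on FLUX latents, and the names `cosine_weights`, `gaussian_weights`, `blend_row`, and the `sigma_frac` parameter are hypothetical, chosen only for illustration. The idea is that each tile gets a weight profile that is high in its center and low at its edges, and overlapping regions are combined as a weighted average.

```python
import math

def cosine_weights(width):
    # Raised-cosine window: close to 0 at the tile edges, close to 1 in the middle.
    # The +0.5 offset keeps every weight strictly positive.
    return [0.5 * (1.0 - math.cos(2.0 * math.pi * (i + 0.5) / width))
            for i in range(width)]

def gaussian_weights(width, sigma_frac=0.25):
    # Gaussian bump centered on the tile; sigma_frac is an illustrative knob,
    # not a parameter taken from the app.
    mid = (width - 1) / 2.0
    sigma = sigma_frac * width
    return [math.exp(-((i - mid) ** 2) / (2.0 * sigma ** 2))
            for i in range(width)]

def blend_row(tiles, offsets, total_width, weights):
    # Weighted average of overlapping tiles along one row.
    acc = [0.0] * total_width   # sum of weight * value per output position
    norm = [0.0] * total_width  # sum of weights per output position
    for tile, off in zip(tiles, offsets):
        for i, value in enumerate(tile):
            acc[off + i] += weights[i] * value
            norm[off + i] += weights[i]
    return [a / n for a, n in zip(acc, norm)]

# Three tiles (left, center, right), each 8 wide, overlapping by 4.
tile_width, total_width = 8, 16
offsets = [0, 4, 8]
tiles = [[0.0] * tile_width, [1.0] * tile_width, [2.0] * tile_width]

row_cos = blend_row(tiles, offsets, total_width, cosine_weights(tile_width))
row_gauss = blend_row(tiles, offsets, total_width, gaussian_weights(tile_width))
```

In this toy setup both rows ramp smoothly from the left tile's value to the right tile's value instead of jumping at the tile boundaries. The two windows differ in how quickly a tile's influence falls off toward its edges.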
This was a fun project that involved deep-diving into the FLUX architecture to get the tiling, guidance, and positional embeddings right.