ESRGAN 4x Models
Overview
This repository contains two powerful 4x super-resolution models based on ESRGAN (Enhanced Super-Resolution Generative Adversarial Network). These models are specifically designed to upscale images to four times their original resolution while maintaining or enhancing visual fidelity.
Model Descriptions
Nickelback_70000G.safetensors
- Architecture: ESRGAN
- Strengths : Photorealistic upscaling of real-world images.
- Use Cases : Best suited for scenarios requiring photorealistic output, such as enhancing rofessional photography or increasing resolution in images where realistic content is crucial.
foolhardy_Remacri.safetensors
- Architecture: ESRGAN
- Strengths : Enhances the quality of comics and anime images by improving detailed texture and edge sharpness.
- Use Cases : Ideal for users who need high-quality outputs in anime and comic book styles, where fine details are important for narrative or artistic value.
Usage Instructions
- Installation: Ensure you have
PyTorchinstalled along with any other necessary libraries in your environment. - Loading Models: Load the models using a script that supports ESRGAN model inference (such as
Real-ESRGANor custom scripts). - Inference: Run your images through one of these models to upscale them by 4x.
Evaluation
These models were part of an evaluation process similar to "Hires.fix":
- The image was first generated artificially using Z-Image for 8 steps at a resolution of 1088x1600 px.
- The image was then downsampled by a factor of 2 using bilinear interpolation.
- One of the ESRGAN models (x4) was applied to upscale the image.
- The image was downsampled again by a factor of 2 using bilinear interpolation.
- Finally, 2 steps of Z-Image were applied.
The resulting images had the same resolution as the original but with improved quality. Among these checkpoints, Nickelback_70000G.safetensors and foolhardy_Remacri.safetensors produced the best image quality for photographic and illustrative/comic/anime content respectively, according to my evaluation.
Limitations
Both checkpoints are based on ESRGAN and therefore inherit its limitations:
- They should not be used in applications where accuracy and consistency are critical, such as medical imaging or legal document processing.