MonsterMMORPG
962
followers
·
11 following
AI & ML interests
Check out my youtube page SECourses for Stable Diffusion tutorials. They will help you tremendously in every topic
Recent Activity
reacted
to
their
post
with 🤯
about 19 hours ago
Whisper-WebUI Premium - Ultra Fast and High Accuracy Speech to Text Transcripton App for All Languages - Windows, RunPod, Massed Compute 1-Click Installers - Supporting RTX 1000 to 5000 series
Latest installer zip file : https://www.patreon.com/posts/145395299
New Features
Password protected version, password is just 1 : WhisperWeb_UI_v1_password_is_1.zip
It has better interface, more features, default settings set for maximum accuracy
It will show transcription realtime both on Gradio interface and also on CMD
It will show better status and output at the cmd like starting time, starting file, etc
It will save every generated transcription properly with same name as input file name with proper name sanitization
After deep scan of the entire pipeline, default parameters are set for maximum accuracy and quality
1-Click installers for Windows local PC, RunPod (Linux-Cloud) and Massed Compute (Linux-Cloud)
The app the installers are made for RTX 1000 series to RTX 5000 series with pre-compiled libraries
We install with Torch 2.8, CUDA 12.9, latest Flash Attention, Sage Attention, xFormers - all precompiled
As low as 6 GB VRAM GPUs can use
OpenAI Whisper Supported Models:
tiny.en, tiny, base.en, base, small.en, small, medium.en, medium, large-v1, large-v2, large-v3, large, large-v3-turbo, turbo
Distil-Whisper Supported Models (Faster-Whisper & Insanely-Fast-Whisper):
distil-large-v2, distil-large-v3, distil-medium.en, distil-small.en
100 languages are supported
View all activity
Organizations
MonsterMMORPG 's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
view article
Z-Image Turbo LoRA training with AI Toolkit and Z-Image ControlNet Full Tutorial for Highest Quality
view article
Qwen Image Models Realism is Now Next Level & Tutorial for Object Removal, Inpainting & Outpainting
published
an
article
about 1 month ago
view article
Qwen Image Base Model Training vs FLUX SRPO Training 20 images comparison (top ones Qwen bottom ones FLUX) - Same Dataset (28 imgs) - I can't return back to FLUX such as massive difference - Qwen destroys the FLUX at complex prompts and emotions
published
an
article
about 2 months ago
view article
How to Install and Use ComfyUI and SwarmUI on Massed Compute and RunPod Private Cloud GPU Services
published
an
article
about 2 months ago
view article
The Secret to FREE, Local AI Image Generation is Finally Here
view article
Ovi - Generate Videos With Audio Like VEO 3 or SORA 2 - Run Locally - Open Source for Free
view article
SUPIR is Still Unchallanged Image Upscaler — Supports GPUs starting from RTX 1000 series to RTX 5000 series including Cloud GPUs like H100, A100, B200, L40S, RTX 6000 Pro and such
view article
GenTube: Make Stunning AI Art in 2 seconds - New Free Image Generation Platform Review & Tutorial
view article
Qwen Image LoRA trainings Stage 1 results and pre-made configs published - As low as training with 6 GB GPUs - Stage 2 research will hopefully improve quality even more - Images generated with 8-steps lightning LoRA + SECourses Musubi Tuner trained LoRA in 8 steps + 2x Latent Upscale
view article
Nano Banana (Gemini 2.5 Flash Image) Full Tutorial - 27 Unique Cases vs Qwen Image Edit - Free 2 Use
view article
Qwen Image Edit Full Tutorial: 26 Different Demo Cases, Prompts & Images, Pwns FLUX Kontext Dev
view article
Wan 2.2, FLUX, FLUX Krea & Qwen Image Just got Upgraded: Ultimate Tutorial for Open Source SOTA Image & Video Gen Models
view article
Decoding the Shift and Diffusion Models Training Like Qwen Image, FLUX, SDXL, and More
view article
New Text-to-Image Model King is Qwen Image — FLUX DEV vs FLUX Krea vs Qwen Image Realism vs Qwen Image Max Quality
view article
🚀 Wan 2.2 & FLUX Krea Full Tutorial — Automated Install & Perfect Presets
view article
How To Prompt Wan Models Full Tutorial and Guide
view article
MultiTalk Levelled Up - Way Better Animation Compared to Before with New Workflows - Image to Video
view article
FLUX Kontext Dev Detailed Local Windows How To Tutorial — Better Than ChatGPT & Gemini Image Editing
view article
Beginner’s Guide — Generate Videos With SwarmUI
view article
Ultimate ComfyUI & SwarmUI on RunPod Tutorial with Addition RTX 5000 Series GPUs & 1-Click to Setup