Overview
ComfyUI is a node-based editor for building and running generative AI pipelines on your own machine. Instead of writing code, you connect nodes on a graph (flowchart) to define each step of a workflow, then queue it to produce images, video, audio, or 3D output.
It targets people who want fine control over every model and parameter rather than a single one-click button. That includes artists, technical hobbyists, and engineers who need to wire model loading, sampling, conditioning, and post-processing into a repeatable graph and reuse or share it.
Within the image-generation space, ComfyUI sits at the flexible end. It natively supports many open models such as the Stable Diffusion family, SDXL, SD3, and FLUX, and extends to video, audio, and 3D, so one tool covers a range of multimodal workflows.
What it does
- Node graph interface for building complex generation workflows with no coding required.
- Runs locally on Windows, Linux, and macOS, with support for NVIDIA, AMD, Intel, Apple Silicon, and Ascend hardware.
- Native support for many open models: SD1.x/2.x, SDXL, SD3/3.5, FLUX, plus video, audio, and 3D models.
- Smart memory management with offloading can run large models on GPUs with as little as 1GB VRAM, or on CPU with --cpu (slow).
- Only re-executes the parts of a workflow that changed between runs, with an asynchronous queue for batching jobs.
- Loads full workflows (including seeds) embedded in generated PNG, WebP, and FLAC files, and saves/loads workflows as JSON.
Getting started
The desktop application is the easiest way to start on Windows and macOS, but you can also install ComfyUI manually from source on any OS. The steps below cover the manual NVIDIA/CUDA install from the README.
Clone the repository
Get the source code from GitHub and move into the project directory.
git clone https://github.com/comfyanonymous/ComfyUI.git
cd ComfyUIInstall PyTorch (NVIDIA/CUDA)
Install the PyTorch stack with CUDA support. AMD, Intel, and Apple Silicon users should follow the matching section in the README instead.
pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu130Install dependencies
Install the remaining Python requirements.
pip install -r requirements.txtStart ComfyUI
Launch the server, then open the printed local URL in your browser to load the node editor. Add --cpu if you have no supported GPU.
python main.pyCommands and code are distilled from the project's own documentation — always check the official repo for the latest.
When to use it
- Build a custom Stable Diffusion or FLUX image pipeline with explicit control over sampler, conditioning, and upscaling steps.
- Generate short video clips locally with models like LTX-Video, Hunyuan Video, or Wan.
- Create a reusable workflow graph, save it as JSON, and share it with collaborators who can reproduce the exact result.
- Run generation on modest hardware by relying on smart offloading to fit large models into limited VRAM.
How ComfyUI compares
ComfyUI alongside other open-source image generation tools AI/TLDR tracks, ranked by GitHub stars.
| Tool | Stars | What it does |
|---|---|---|
| Stable Diffusion web UI (AUTOMATIC1111) | ★ 164k | A browser interface for running Stable Diffusion image generation locally with extensions and fine-grained controls. |
| ComfyUI | ★ 118k | Node-based visual editor for local image and video generation pipelines |
| Fooocus | ★ 50.4k | A simplified image generation app built on Stable Diffusion that hides technical settings for easy prompting. |
| InvokeAI | ★ 27.5k | A self-hosted creative tool and canvas for generating and editing images with open diffusion models. |
| Stability-AI generative-models | ★ 27.2k | Stability AI's official code for its Stable Diffusion family of image and video generation models. |
| FLUX | ★ 25.6k | Black Forest Labs' open-weight diffusion models and inference code for generating and editing images from text prompts. |
| Z-Image | ★ 11.6k | Alibaba Tongyi's 6B-parameter open image model that produces photorealistic images quickly on a single GPU. |
| DALLE2-pytorch | ★ 11.3k | An open implementation of DALL-E 2 in PyTorch, with the CLIP encoder, diffusion prior, and cascading decoder you train to generate images from text. |