AI/TLDR

ComfyUI

Node-based visual editor for local image and video generation pipelines

Overview

ComfyUI is a node-based editor for building and running generative AI pipelines on your own machine. Instead of writing code, you connect nodes on a graph (flowchart) to define each step of a workflow, then queue it to produce images, video, audio, or 3D output.

It targets people who want fine control over every model and parameter rather than a single one-click button. That includes artists, technical hobbyists, and engineers who need to wire model loading, sampling, conditioning, and post-processing into a repeatable graph and reuse or share it.

Within the image-generation space, ComfyUI sits at the flexible end. It natively supports many open models such as the Stable Diffusion family, SDXL, SD3, and FLUX, and extends to video, audio, and 3D, so one tool covers a range of multimodal workflows.

What it does

  • Node graph interface for building complex generation workflows with no coding required.
  • Runs locally on Windows, Linux, and macOS, with support for NVIDIA, AMD, Intel, Apple Silicon, and Ascend hardware.
  • Native support for many open models: SD1.x/2.x, SDXL, SD3/3.5, FLUX, plus video, audio, and 3D models.
  • Smart memory management with offloading can run large models on GPUs with as little as 1GB VRAM, or on CPU with --cpu (slow).
  • Only re-executes the parts of a workflow that changed between runs, with an asynchronous queue for batching jobs.
  • Loads full workflows (including seeds) embedded in generated PNG, WebP, and FLAC files, and saves/loads workflows as JSON.

Getting started

The desktop application is the easiest way to start on Windows and macOS, but you can also install ComfyUI manually from source on any OS. The steps below cover the manual NVIDIA/CUDA install from the README.

Clone the repository

Get the source code from GitHub and move into the project directory.

bashbash
git clone https://github.com/comfyanonymous/ComfyUI.git
cd ComfyUI

Install PyTorch (NVIDIA/CUDA)

Install the PyTorch stack with CUDA support. AMD, Intel, and Apple Silicon users should follow the matching section in the README instead.

bashbash
pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu130

Install dependencies

Install the remaining Python requirements.

bashbash
pip install -r requirements.txt

Start ComfyUI

Launch the server, then open the printed local URL in your browser to load the node editor. Add --cpu if you have no supported GPU.

bashbash
python main.py

Commands and code are distilled from the project's own documentation — always check the official repo for the latest.

When to use it

  • Build a custom Stable Diffusion or FLUX image pipeline with explicit control over sampler, conditioning, and upscaling steps.
  • Generate short video clips locally with models like LTX-Video, Hunyuan Video, or Wan.
  • Create a reusable workflow graph, save it as JSON, and share it with collaborators who can reproduce the exact result.
  • Run generation on modest hardware by relying on smart offloading to fit large models into limited VRAM.

How ComfyUI compares

ComfyUI alongside other open-source image generation tools AI/TLDR tracks, ranked by GitHub stars.

ToolStarsWhat it does
Stable Diffusion web UI (AUTOMATIC1111)★ 164kA browser interface for running Stable Diffusion image generation locally with extensions and fine-grained controls.
ComfyUI★ 118kNode-based visual editor for local image and video generation pipelines
Fooocus★ 50.4kA simplified image generation app built on Stable Diffusion that hides technical settings for easy prompting.
InvokeAI★ 27.5kA self-hosted creative tool and canvas for generating and editing images with open diffusion models.
Stability-AI generative-models★ 27.2kStability AI's official code for its Stable Diffusion family of image and video generation models.
FLUX★ 25.6kBlack Forest Labs' open-weight diffusion models and inference code for generating and editing images from text prompts.
Z-Image★ 11.6kAlibaba Tongyi's 6B-parameter open image model that produces photorealistic images quickly on a single GPU.
DALLE2-pytorch★ 11.3kAn open implementation of DALL-E 2 in PyTorch, with the CLIP encoder, diffusion prior, and cascading decoder you train to generate images from text.