Fooocus

A user-friendly image generation platform based on Stable Diffusion XL with Midjourney-like simplicity

Tags: beginner, open-source, self-hosted, stable-diffusion-xl, text-to-image

Alternative To

• Midjourney
• DALL-E
• Stable Diffusion WebUI

Difficulty Level

Beginner

Suitable for users with basic technical knowledge. Easy to set up and use.

Overview

Fooocus is open-source image generation software that brings Midjourney-like simplicity to Stable Diffusion XL. Created by the developer of ControlNet, it hides most of the tuning parameters of image generation so users can focus on prompts and images. Fooocus includes built-in GPT-2-based prompt expansion, a range of artistic styles, and advanced features such as image prompts and inpainting, all behind a simple interface.

System Requirements

CPU: Modern multi-core processor (Intel/AMD)
RAM: 16GB+ recommended
GPU: NVIDIA GPU with 4GB+ VRAM (8GB+ recommended for optimal performance)
Storage: 15GB+ for application and models
OS: Windows, Linux, or macOS (including Apple Silicon via MPS)
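
Before installing, it can help to confirm the basics on your machine. The following is a minimal sketch using only the Python standard library: it reports the operating system, the Python version, and whether the current drive has the 15GB of free space listed above. It does not check RAM or GPU memory, and the function names are illustrative, not part of Fooocus.

```python
import platform
import shutil
import sys

MIN_FREE_GB = 15  # storage figure taken from the requirements list above


def free_disk_gb(path="."):
    """Return the free space on the drive holding `path`, in gigabytes."""
    return shutil.disk_usage(path).free / 1024**3


def check_environment(path=".", min_free_gb=MIN_FREE_GB):
    """Collect basic facts about this machine and flag low disk space."""
    free_gb = free_disk_gb(path)
    return {
        "os": platform.system(),
        "python": f"{sys.version_info.major}.{sys.version_info.minor}",
        "free_disk_gb": round(free_gb, 1),
        "enough_disk": free_gb >= min_free_gb,
    }


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```

Run it from the folder where you plan to install Fooocus so that the disk check looks at the right drive.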

Installation Guide

Option 1: Windows Installation (Easiest)

Download the latest Windows package from the GitHub releases page
Extract the downloaded archive using 7-Zip (if Windows has blocked the file, first right-click it → Properties → Unblock)
Run one of the launcher batch files:
- run.bat - Standard edition
- run_anime.bat - Anime-optimized edition
- run_realistic.bat - Realistic-optimized edition
The interface will open in your default web browser automatically (usually at http://127.0.0.1:7865)
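
If the browser does not open on its own, you can check whether the local server is up yet. This is a small sketch that polls the address mentioned above (127.0.0.1, port 7865) with a plain TCP connection. The helper names and the retry counts are assumptions chosen for illustration; adjust the port if your console output shows a different one.

```python
import socket
import time


def is_port_open(host="127.0.0.1", port=7865, timeout=1.0):
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def wait_for_ui(host="127.0.0.1", port=7865, attempts=30, delay=2.0):
    """Poll the port up to `attempts` times; return True once it answers."""
    for _ in range(attempts):
        if is_port_open(host, port):
            return True
        time.sleep(delay)
    return False


# Example usage (commented out so importing this file does not block):
# if wait_for_ui():
#     print("Fooocus is reachable at http://127.0.0.1:7865")
```

The first launch can take a while because models are downloaded, so a generous number of attempts is reasonable.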

Option 2: Manual Installation (Windows/Linux/Mac)

For Windows

Clone the repository:

```
git clone https://github.com/lllyasviel/Fooocus.git
```

Navigate to the project directory:
```
cd Fooocus
```
Run the update script to download models and dependencies:
```
python entry_with_update.py
```

For Linux

Ensure Python 3.10 is installed

Clone the repository:

```
git clone https://github.com/lllyasviel/Fooocus.git
cd Fooocus
```

Set up a Python virtual environment:

```
python3 -m venv fooocus_env
source fooocus_env/bin/activate
pip install -r requirements_versions.txt
```

Run the application:
```
python entry_with_update.py
```

For macOS (including Apple Silicon)

Install Conda and PyTorch nightly

Clone the repository:

```
git clone https://github.com/lllyasviel/Fooocus.git
cd Fooocus
```

Create and configure the Conda environment:

```
conda env create -f environment.yaml
conda activate fooocus
pip install pygit2==1.12.2
```

Run the application:
```
python entry_with_update.py
```
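
All three manual routes end with the same launch command. If you want to start Fooocus from a script, the sketch below assembles that command as an argument list. It assumes the `--preset` flag (with values such as `anime` or `realistic`) described in the project README, plus the `--share` flag used in the Colab option; `build_launch_command` is a hypothetical helper, not part of Fooocus.

```python
import sys


def build_launch_command(preset=None, share=False, python_exe=sys.executable):
    """Return the argument list for launching Fooocus from the repo root."""
    command = [python_exe, "entry_with_update.py"]
    if preset:  # assumed values: "anime" or "realistic"
        command += ["--preset", preset]
    if share:  # asks Gradio for a public link, as in the Colab option
        command.append("--share")
    return command


if __name__ == "__main__":
    # Print the command; pass the list to subprocess.run(...) to execute it.
    print(" ".join(build_launch_command(preset="anime")))
```

Building the command as a list rather than a single string avoids quoting problems when the Python path contains spaces.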

Option 3: Google Colab (No Installation)

If you don’t have a suitable GPU, you can run Fooocus on Google Colab:

Visit colab.research.google.com
Open a new notebook

Run the following commands:

```
!pip install -q https://github.com/camenduru/stable-diffusion-webui-colab/releases/download/0.0.20/xformers-0.0.20+cu118.pt.whl
!git clone https://github.com/lllyasviel/Fooocus
%cd Fooocus
!pip install -r requirements_versions.txt
!python entry_with_update.py --share --always-high-vram
```

Click the public link that Gradio prints in the cell output (a “gradio.live” URL) to access the interface

Note: The first run will download required models (approximately 7GB).

Practical Exercise: Getting Started with Fooocus

Let’s walk through a simple exercise to help you get familiar with Fooocus’s capabilities.

Step 1: Basic Text-to-Image Generation

Launch Fooocus and wait for the interface to load in your browser
In the main text field, enter a simple prompt like: “forest elf, fantasy character, detailed, colorful clothing”
Click “Generate” to create your first image
Notice how Fooocus automatically enhances your prompt in the background (you may see the expanded prompt in the console)
The generated image will appear in the gallery below

Step 2: Using Styles and Presets

Scroll down to the “Style” section
Try selecting different style options like “Cinematic,” “Photographic,” or “Fooocus V2” (the default)
You can select multiple styles simultaneously - try combining “Fooocus V2” with “Cinematic”
Generate a new image with the same prompt to see how styles affect the output
Try switching to a different preset using the dropdown menu at the top (Default/Anime/Realistic)
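
One way to picture what a style does is as a text template that wraps your prompt with extra keywords. The sketch below illustrates that idea only. The template texts are invented for this example, the `{prompt}` placeholder convention is an assumption, and the code does not reproduce how Fooocus merges several selected styles internally.

```python
# Hypothetical style templates; the style texts shipped with Fooocus differ.
STYLES = {
    "Cinematic": "cinematic still of {prompt}, shallow depth of field, film grain",
    "Photographic": "photograph of {prompt}, 50mm lens, natural lighting",
}


def apply_styles(prompt, style_names, styles=STYLES):
    """Return one styled prompt per selected style, in selection order."""
    return [styles[name].format(prompt=prompt) for name in style_names]


if __name__ == "__main__":
    for styled in apply_styles("forest elf, fantasy character", ["Cinematic", "Photographic"]):
        print(styled)
```

Seen this way, selecting a different style changes the words around your prompt rather than the prompt itself, which is why the same prompt can produce very different images.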

Step 3: Advanced Features

Using Image Prompts

Check “Input Image” below the prompt box, then click the “Image Prompt” tab
Upload a reference image by clicking “Upload”
Enter a text prompt that describes additional elements or styles
Adjust the weight slider to control how much influence the image has
Click “Generate” to create an image based on both text and image prompts

Inpainting

With “Input Image” checked, click the “Inpaint” tab
Upload an image you want to modify
Use the brush tool to paint over areas you want to change
Enter a text prompt describing what should appear in the masked area
Click “Generate” to create the inpainted result

Step 4: Fine-Tuning Parameters

Click “Advanced” to expand the advanced settings
Experiment with these parameters:
- Sampling Sharpness: Affects texture detail (try 2.0 - 7.0)
- Guidance Scale: Controls adherence to prompt (try 4.0 - 9.0)
- Quality/Speed: Adjusts generation steps (higher is better quality but slower)
- Seed: Set a specific seed for reproducible results
Generate images with different parameter combinations to understand their effects
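
To compare settings systematically, it helps to plan the combinations first and keep the seed fixed so that only the parameters change between runs. The sketch below lists every pairing of a few sharpness and guidance values taken from the ranges above. The specific values, the seed, and the dictionary keys are arbitrary choices for illustration; you still enter each combination in the interface by hand.

```python
from itertools import product

SHARPNESS_VALUES = [2.0, 4.5, 7.0]  # within the 2.0 - 7.0 range suggested above
GUIDANCE_VALUES = [4.0, 6.5, 9.0]   # within the 4.0 - 9.0 range suggested above
FIXED_SEED = 12345                  # arbitrary; any fixed value works


def parameter_grid(sharpness_values, guidance_values, seed):
    """Return one settings dict per sharpness/guidance combination."""
    return [
        {"sharpness": s, "guidance_scale": g, "seed": seed}
        for s, g in product(sharpness_values, guidance_values)
    ]


if __name__ == "__main__":
    grid = parameter_grid(SHARPNESS_VALUES, GUIDANCE_VALUES, FIXED_SEED)
    for run_number, settings in enumerate(grid, start=1):
        print(run_number, settings)
```

With three values for each parameter this gives nine runs, which is usually enough to see the trend of each setting.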

When you create an image you like, right-click it and select “Save Image”
If metadata saving is enabled in your Fooocus settings, the saved PNG file contains embedded metadata with the generation parameters
You can then load this PNG back into Fooocus later to restore the same settings
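
To see whether a saved PNG actually carries embedded parameters, you can list its text chunks. The sketch below parses a PNG using only the standard library and returns every uncompressed `tEXt` entry. It makes no assumption about which keyword Fooocus uses, it ignores compressed (`zTXt`) and international (`iTXt`) chunks, and it does not verify checksums, so treat it as a quick inspection aid.

```python
import struct
import sys

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data):
    """Return {keyword: text} for every uncompressed tEXt chunk in PNG bytes."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    offset = len(PNG_SIGNATURE)
    while offset + 8 <= len(data):
        # Each chunk: 4-byte length, 4-byte type, body, 4-byte CRC.
        length, chunk_type = struct.unpack(">I4s", data[offset:offset + 8])
        body = data[offset + 8:offset + 8 + length]
        if chunk_type == b"tEXt":
            keyword, _, text = body.partition(b"\x00")
            chunks[keyword.decode("latin-1")] = text.decode("latin-1")
        if chunk_type == b"IEND":
            break
        offset += 12 + length
    return chunks


if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1], "rb") as png_file:
        for keyword, text in read_png_text_chunks(png_file.read()).items():
            print(f"{keyword}: {text}")
```

Pass the path of a saved image as the only argument. If nothing is printed, the file has no plain text chunks, which may simply mean metadata saving was not enabled when the image was generated.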

Resources

Official Documentation

GitHub Repository README - Main documentation and installation guides
Advanced Features Guide - Detailed explanation of all features
Community Generated Examples - Non-cherry-picked examples showing Fooocus capabilities

Community Support

GitHub Discussions - Community support and showcase
GitHub Issues - Bug reports and feature requests
Matrix Space: #comfyui_space:matrix.org - Related community chat for broader Stable Diffusion discussion