Alice-AI-ART-dev / README.md

SpiridonSunRotator

Update README.md

11e735e verified 5 days ago

preview code

raw

history blame contribute delete

1.47 kB

metadata

license: apache-2.0
pipeline_tag: text-to-image
tags:
  - text-to-image
  - image-generation
  - yandex

Alice AI ART dev

by Yandex

Alice AI ART dev is 4.8B parameter diffusion UNet model capable of generating images from text prompts.

Key features

Relevance A considerable amount of work was done to improve text-to-image alignment. According to the Side-by-Side evaluation, our model is competitive with Qwen-Image, despite being significantly smaller (4.8B parameters vs 20B parameters).
Aesthetics Our model is capable of generating high-quality images with a wide range of styles and themes.
Accessibility Alice AI ART dev is runnable on consumer-grade[^1] GPUs (for instance, NVIDIA RTX 3090) making it accessible to a wider audience.

[^1] with weight offloading

Usage

The image generation pipeline can be loaded a follows

pipe = YandexArtOSPipeline.from_pretrained(
    "yandex_art_os",
    cpu_offload=True
)

For memory-constrained GPUs we recommend to turn on cpu_offload flag:

By default we use following sampling parameters:

{
    "num_inference_steps": 32,
    "cond_scale": 2.75,
    "unet_switch_timestep": 8,
    "karras_rho": 6.0,
    "method_name": "dpm-multistep",
    "sampler_kwargs": {
        "num_train_timesteps": 1000,
        "beta_start": 0.00001013,
        "beta_end": 0.019771934,
        "use_karras_sigmas": True,
        "algorithm_type": "sde-dpmsolver++"
    }
}