|
|
--- |
|
|
license: apache-2.0 |
|
|
pipeline_tag: text-to-image |
|
|
tags: |
|
|
- text-to-image |
|
|
- image-generation |
|
|
- yandex |
|
|
--- |
|
|
Alice AI ART dev |
|
|
--- |
|
|
by Yandex |
|
|
|
|
|
 |
|
|
|
|
|
Alice AI ART dev is 4.8B parameter diffusion UNet model capable of generating images from text prompts. |
|
|
|
|
|
Key features |
|
|
--- |
|
|
|
|
|
* **Relevance** A considerable amount of work was done to improve text-to-image alignment. According to the Side-by-Side evaluation, our model is competitive with Qwen-Image, despite being significantly smaller (4.8B parameters vs 20B parameters). |
|
|
* **Aesthetics** Our model is capable of generating high-quality images with a wide range of styles and themes. |
|
|
* **Accessibility** Alice AI ART dev is runnable on consumer-grade[^1] GPUs (for instance, NVIDIA RTX 3090) making it accessible to a wider audience. |
|
|
|
|
|
[^1] with weight offloading |
|
|
|
|
|
Usage |
|
|
--- |
|
|
The image generation pipeline can be loaded a follows |
|
|
```python |
|
|
pipe = YandexArtOSPipeline.from_pretrained( |
|
|
"yandex_art_os", |
|
|
cpu_offload=True |
|
|
) |
|
|
``` |
|
|
For memory-constrained GPUs we recommend to turn on `cpu_offload` flag: |
|
|
|
|
|
By default we use following sampling parameters: |
|
|
```python |
|
|
{ |
|
|
"num_inference_steps": 32, |
|
|
"cond_scale": 2.75, |
|
|
"unet_switch_timestep": 8, |
|
|
"karras_rho": 6.0, |
|
|
"method_name": "dpm-multistep", |
|
|
"sampler_kwargs": { |
|
|
"num_train_timesteps": 1000, |
|
|
"beta_start": 0.00001013, |
|
|
"beta_end": 0.019771934, |
|
|
"use_karras_sigmas": True, |
|
|
"algorithm_type": "sde-dpmsolver++" |
|
|
} |
|
|
} |
|
|
``` |
|
|
|
|
|
|