Cute Kimonos with Hello World SDXL

hello-world-sdxl:v3.2

evevalentine2017

Jan 26

361


Input

prompt

Specify things to see in the output

best quality, masterpiece, ultra high res, (photorealistic:1.3), 8K, raw photo, 1 girl, wearing kimono, outdoors, kimono Japanese house, old Japanese town, from below, sitting on wood, holding katana, katana, natural skin texture, skin pores, natural skin texture, dynamic pose, film grain, <lora:SDXL Detail:1>
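
The prompt above mixes plain tags with two bracketed forms: a weighted term, `(photorealistic:1.3)`, and a LoRA tag, `<lora:SDXL Detail:1>`. The following is a minimal sketch of how those two forms could be pulled out of a prompt string for inspection. The regular expressions and the name `split_prompt` are illustrative assumptions, not the parser this model uses, and nested or escaped parentheses are not handled.

```python
import re

# Hypothetical patterns for the two bracketed forms seen in the prompt above.
WEIGHTED = re.compile(r"\(([^():]+):(\d+(?:\.\d+)?)\)")
LORA = re.compile(r"<lora:([^:>]+):(\d+(?:\.\d+)?)>")

def split_prompt(prompt: str):
    """Return (weighted_terms, lora_tags) as lists of (name, magnitude)."""
    weighted = [(name, float(w)) for name, w in WEIGHTED.findall(prompt)]
    loras = [(name, float(w)) for name, w in LORA.findall(prompt)]
    return weighted, loras

prompt = "best quality, (photorealistic:1.3), 8K, film grain, <lora:SDXL Detail:1>"
weighted, loras = split_prompt(prompt)
print(weighted)
print(loras)
```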

negative_prompt

Specify things to not see in the output

(worst quality, low resolution, bad hands, open mouth), distorted, twisted, watermark,

num_outputs

Number of output images

width

Output image width

1024

height

Output image height

1024

enhance_face_with_adetailer

Enhance face with adetailer

true

enhance_hands_with_adetailer

Enhance hands with adetailer

true

adetailer_denoising_strength

1: completely redraw face or hands / 0: no effect on output images

0.45
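
As a rough check on what 0.45 means in practice: the run log further down shows 35 steps for the base image (`35/35`) and 16 steps for each adetailer pass (`16/16`). The sketch below only shows that 0.45 of 35 steps is close to 16. That the step count is derived this way is an assumption based on those two numbers, not something the page states.

```python
import math

base_steps = 35            # from the "35/35" progress line in the log
denoising_strength = 0.45  # adetailer_denoising_strength shown above

# Assumption: a redraw pass runs roughly strength * base_steps denoising steps.
estimated = base_steps * denoising_strength
print(f"{estimated:.2f} steps, rounded up to {math.ceil(estimated)}")
```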

detail

Enhance/diminish detail while keeping the overall style/character

brightness

Adjust brightness

contrast

Adjust contrast

seed

The same seed with the same prompt generates the same image. Set to -1 to randomize the output.

2992725976

input_image

Base image from which the output is generated. This is useful when you want to add detail to input_image. For example, if the prompt is "sunglasses" and input_image shows a man, the output shows that man wearing sunglasses.

input_image_redrawing_strength

How much the output differs from input_image. Used only when input_image is given.

0.55

reference_image

Image with which the output should share identity (e.g. face of a person or type of a dog)

reference_image_strength

Strength of applying reference_image. Used only when reference_image is given.

reference_pose_image

Image with a reference pose

reference_pose_strength

Strength of applying reference_pose_image. Used only when reference_pose_image is given.

reference_depth_image

Image with a reference depth

reference_depth_strength

Strength of applying reference_depth_image. Used only when reference_depth_image is given.

sampler

Sampler type

Restart

samping_steps

Number of denoising steps

cfg_scale

Scale for classifier-free guidance

9.5

clip_skip

Number of final layers of the CLIP network to skip

vae

Select VAE

sdxl_vae.safetensors

lora_1

LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>

SDXL Detail.safetensors
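
The LoRA description above asks for the file name without its extension plus a magnitude. A minimal sketch of building that tag from a file name follows; the helper name `lora_tag` is hypothetical and is not part of the model's interface.

```python
from pathlib import Path

def lora_tag(file_name: str, magnitude: float = 1.0) -> str:
    """Build a <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE> prompt tag."""
    stem = Path(file_name).stem          # "SDXL Detail.safetensors" -> "SDXL Detail"
    return f"<lora:{stem}:{magnitude:g}>"  # ":g" prints 1.0 as "1" and 0.8 as "0.8"

tag = lora_tag("SDXL Detail.safetensors", 1)
print(tag)
```

For the file selected in lora_1 this yields the same tag that closes the prompt of this run.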

lora_2

LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>

lora_3

LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>

embedding_1

Embedding file (textual inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)

embedding_2

Embedding file (textual inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)

embedding_3

Embedding file (textual inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)

disable_prompt_modification

Disable the automatic addition of suggested prompt modifications. Built-in LoRAs and trigger words will remain.

false
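
To keep a record of this run, the values shown in the form can be collected into a plain dictionary. The sketch below includes only fields for which the page displays a value, and uses the field names exactly as they appear above (including `samping_steps`-style spellings where relevant). How such a dictionary would be submitted to the service is not shown on this page, so no client call is assumed here.

```python
import json

# Only the fields that show a value on this page; all others are left out.
run_inputs = {
    "prompt": (
        "best quality, masterpiece, ultra high res, (photorealistic:1.3), 8K, "
        "raw photo, 1 girl, wearing kimono, outdoors, kimono Japanese house, "
        "old Japanese town, from below, sitting on wood, holding katana, katana, "
        "natural skin texture, skin pores, natural skin texture, dynamic pose, "
        "film grain, <lora:SDXL Detail:1>"
    ),
    "negative_prompt": "(worst quality, low resolution, bad hands, open mouth), distorted, twisted, watermark,",
    "width": 1024,
    "height": 1024,
    "enhance_face_with_adetailer": True,
    "enhance_hands_with_adetailer": True,
    "adetailer_denoising_strength": 0.45,
    "seed": 2992725976,
    "input_image_redrawing_strength": 0.55,
    "sampler": "Restart",
    "cfg_scale": 9.5,
    "vae": "sdxl_vae.safetensors",
    "lora_1": "SDXL Detail.safetensors",
    "disable_prompt_modification": False,
}

# Serialize for storage alongside the output images.
serialized = json.dumps(run_inputs, indent=2)
print(serialized)
```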

Output

https://files.tungsten.run/uploads/7d02a6f4bf37473ead1735ff0aa363fd/00000-2992725976.webp

https://files.tungsten.run/uploads/947fa09d76fc4c37b1e73da6936c257d/00001-2992725977.webp

https://files.tungsten.run/uploads/a281bc9007ad48378c43ec28e8b25848/00002-2992725978.webp
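
The three file names above end in 2992725976, 2992725977 and 2992725978, which suggests that each additional output uses the input seed plus its index. The sketch below reproduces the file names under that assumption. The naming pattern is inferred from this one run, and the helper names are hypothetical.

```python
def per_image_seeds(seed: int, num_outputs: int) -> list:
    """Assumed rule: image i uses seed + i."""
    return [seed + i for i in range(num_outputs)]

def output_name(index: int, seed: int) -> str:
    """File name pattern as seen in the output URLs: 5-digit index, dash, seed."""
    return f"{index:05d}-{seed}.webp"

seeds = per_image_seeds(2992725976, 3)
names = [output_name(i, s) for i, s in enumerate(seeds)]
print(names)
```

If the assumption holds, rerunning with seed 2992725977 and a single output would be a way to regenerate only the second image.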

Finished in 234.7 seconds

Setting up the model...
Preparing inputs...
Processing...
Loading VAE weight: models/VAE/sdxl_vae.safetensors
Full prompt: best quality, masterpiece, ultra high res, (photorealistic:1.3), 8K, raw photo, 1 girl, wearing kimono, outdoors, kimono Japanese house, old Japanese town, from below, sitting on wood, holding katana, katana, natural skin texture, skin pores, natural skin texture, dynamic pose, film grain, <lora:SDXL Detail:1>
Full negative prompt: (worst quality, low resolution, bad hands, open mouth), distorted, twisted, watermark,
100%|██████████| 35/35 [01:30<00:00, 2.58s/it]
Decoding latents in cuda:0... done in 2.35s
Move latents to cpu... done in 0.03s
0: 640x640 1 face, 7.9ms
Speed: 3.5ms preprocess, 7.9ms inference, 31.4ms postprocess per image at shape (1, 3, 640, 640)
100%|██████████| 16/16 [00:12<00:00, 1.29it/s]
Decoding latents in cuda:0... done in 0.78s
Move latents to cpu... done in 0.0s
0: 640x640 3 hands, 7.4ms
Speed: 3.1ms preprocess, 7.4ms inference, 1.8ms postprocess per image at shape (1, 3, 640, 640)
100%|██████████| 16/16 [00:12<00:00, 1.30it/s]
Decoding latents in cuda:0... done in 0.77s
Move latents to cpu... done in 0.0s
100%|██████████| 16/16 [00:13<00:00, 1.21it/s]
Decoding latents in cuda:0... done in 0.77s
Move latents to cpu... done in 0.01s
100%|██████████| 16/16 [00:12<00:00, 1.28it/s]
Decoding latents in cuda:0... done in 0.79s
Move latents to cpu... done in 0.0s
0: 640x640 1 face, 8.1ms
Speed: 3.2ms preprocess, 8.1ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 640)
100%|██████████| 16/16 [00:12<00:00, 1.29it/s]
Decoding latents in cuda:0... done in 0.78s
Move latents to cpu... done in 0.0s
0: 640x640 2 hands, 7.4ms
Speed: 3.4ms preprocess, 7.4ms inference, 1.6ms postprocess per image at shape (1, 3, 640, 640)
100%|██████████| 16/16 [00:13<00:00, 1.15it/s]
Decoding latents in cuda:0... done in 0.79s
Move latents to cpu... done in 0.0s
100%|██████████| 16/16 [00:12<00:00, 1.29it/s]
Decoding latents in cuda:0... done in 0.79s
Move latents to cpu... done in 0.0s
0: 640x640 1 face, 7.8ms
Speed: 3.2ms preprocess, 7.8ms inference, 1.5ms postprocess per image at shape (1, 3, 640, 640)
100%|██████████| 16/16 [00:12<00:00, 1.29it/s]
Decoding latents in cuda:0... done in 0.79s
Move latents to cpu... done in 0.0s
0: 640x640 1 hand, 7.4ms
Speed: 3.1ms preprocess, 7.4ms inference, 1.3ms postprocess per image at shape (1, 3, 640, 640)
100%|██████████| 16/16 [00:12<00:00, 1.29it/s]
Decoding latents in cuda:0... done in 0.79s
Move latents to cpu... done in 0.0s
Uploading outputs...
Finished.
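
The log accounts for most of the 234.7 seconds reported for the run. The base pass finishes at 01:30, and there are nine 16-step adetailer passes (one per detected face or hand: 1 face and 3 hands, then 1 face and 2 hands, then 1 face and 1 hand), each finishing at 12 or 13 seconds. A short tally using only those elapsed times, as a rough sketch:

```python
# Elapsed times read from the final progress line of each loop in the log.
base_seconds = 90  # 35/35 [01:30<00:00]
adetailer_seconds = [12, 12, 13, 12, 12, 13, 12, 12, 12]  # nine 16/16 passes
total_run_seconds = 234.7  # "Finished in 234.7 seconds"

sampling_seconds = base_seconds + sum(adetailer_seconds)
other_seconds = total_run_seconds - sampling_seconds

print(f"sampling: {sampling_seconds} s")
print(f"everything else: {other_seconds:.1f} s")
```

The remainder covers whatever is not inside a progress bar: model setup, latent decoding, face and hand detection, and uploading. The log does not break that part down further.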
