Cute Kimonos with Hello World SDXL
Input
prompt
Specify things to see in the output
best quality, masterpiece, ultra high res, (photorealistic:1.3), 8K, raw photo, 1 girl, wearing kimono, outdoors, kimono Japanese house, old Japanese town, from below, sitting on wood, holding katana, katana, natural skin texture, skin pores, natural skin texture, dynamic pose, film grain, <lora:SDXL Detail:1>
negative_prompt
Specify things to not see in the output
(worst quality, low resolution, bad hands, open mouth), distorted, twisted, watermark,
num_outputs
Number of output images
3
width
Output image width
1024
height
Output image height
1024
enhance_face_with_adetailer
Enhance face with adetailer
true
enhance_hands_with_adetailer
Enhance hands with adetailer
true
adetailer_denoising_strength
1: completely redraw face or hands / 0: no effect on output images
0.45
detail
Enhance/diminish detail while keeping the overall style/character
0
brightness
Adjust brightness
0
contrast
Adjust contrast
0
seed
Same seed with the same prompt generates the same image. Set as -1 to randomize output.
2992725976
input_image
Base image that the output should be generated from. This is useful when you want to add some detail to input_image. For example, if prompt is "sunglasses" and input_image has a man, there is the man wearing sunglasses in the output.
input_image_redrawing_strength
How differ the output is from input_image. Used only when input_image is given.
0.55
reference_image
Image with which the output should share identity (e.g. face of a person or type of a dog)
reference_image_strength
Strength of applying reference_image. Used only when reference_image is given.
1
reference_pose_image
Image with a reference pose
reference_pose_strength
Strength of applying reference_pose_image. Used only when reference_pose_image is given.
1
reference_depth_image
Image with a reference depth
reference_depth_strength
Strength of applying reference_depth_image. Used only when reference_depth_image is given.
1
sampler
Sampler type
Restart
samping_steps
Number of denoising steps
35
cfg_scale
Scale for classifier-free guidance
9.5
clip_skip
The number of last layers of CLIP network to skip
2
vae
Select VAE
sdxl_vae.safetensors
lora_1
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
lora_2
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
lora_3
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
embedding_1
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
embedding_2
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
embedding_3
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
disable_prompt_modification
Disable automatically adding suggested prompt modification. Built-in LoRAs and trigger words will remain.
false
Output
https://files.tungsten.run/uploads/7d02a6f4bf37473ead1735ff0aa363fd/00000-2992725976.webp
https://files.tungsten.run/uploads/947fa09d76fc4c37b1e73da6936c257d/00001-2992725977.webp
https://files.tungsten.run/uploads/a281bc9007ad48378c43ec28e8b25848/00002-2992725978.webp
Finished in 234.7 seconds
Setting up the model... Preparing inputs... Processing... Loading VAE weight: models/VAE/sdxl_vae.safetensors Full prompt: best quality, masterpiece, ultra high res, (photorealistic:1.3), 8K, raw photo, 1 girl, wearing kimono, outdoors, kimono Japanese house, old Japanese town, from below, sitting on wood, holding katana, katana, natural skin texture, skin pores, natural skin texture, dynamic pose, film grain, <lora:SDXL Detail:1> Full negative prompt: (worst quality, low resolution, bad hands, open mouth), distorted, twisted, watermark, 0%| | 0/35 [00:00<?, ?it/s] 3%|▎ | 1/35 [00:03<02:07, 3.76s/it] 6%|▌ | 2/35 [00:06<01:39, 3.02s/it] 9%|▊ | 3/35 [00:08<01:29, 2.79s/it] 11%|█▏ | 4/35 [00:11<01:23, 2.69s/it] 14%|█▍ | 5/35 [00:13<01:19, 2.64s/it] 17%|█▋ | 6/35 [00:16<01:15, 2.62s/it] 20%|██ | 7/35 [00:19<01:12, 2.60s/it] 23%|██▎ | 8/35 [00:21<01:09, 2.59s/it] 26%|██▌ | 9/35 [00:24<01:07, 2.58s/it] 29%|██▊ | 10/35 [00:26<01:04, 2.58s/it] 31%|███▏ | 11/35 [00:29<01:01, 2.58s/it] 34%|███▍ | 12/35 [00:31<00:59, 2.58s/it] 37%|███▋ | 13/35 [00:34<00:56, 2.58s/it] 40%|████ | 14/35 [00:37<00:54, 2.58s/it] 43%|████▎ | 15/35 [00:39<00:51, 2.59s/it] 46%|████▌ | 16/35 [00:42<00:49, 2.59s/it] 49%|████▊ | 17/35 [00:44<00:46, 2.59s/it] 51%|█████▏ | 18/35 [00:47<00:44, 2.60s/it] 54%|█████▍ | 19/35 [00:50<00:41, 2.60s/it] 57%|█████▋ | 20/35 [00:52<00:38, 2.60s/it] 60%|██████ | 21/35 [00:55<00:36, 2.60s/it] 63%|██████▎ | 22/35 [00:57<00:33, 2.60s/it] 66%|██████▌ | 23/35 [01:00<00:31, 2.60s/it] 69%|██████▊ | 24/35 [01:03<00:28, 2.60s/it] 71%|███████▏ | 25/35 [01:05<00:26, 2.60s/it] 74%|███████▍ | 26/35 [01:08<00:23, 2.60s/it] 77%|███████▋ | 27/35 [01:10<00:20, 2.60s/it] 80%|████████ | 28/35 [01:13<00:18, 2.60s/it] 83%|████████▎ | 29/35 [01:16<00:15, 2.60s/it] 86%|████████▌ | 30/35 [01:18<00:12, 2.60s/it] 89%|████████▊ | 31/35 [01:21<00:10, 2.60s/it] 91%|█████████▏| 32/35 [01:23<00:07, 2.60s/it] 94%|█████████▍| 33/35 [01:26<00:05, 2.60s/it] 97%|█████████▋| 34/35 [01:29<00:02, 2.60s/it] 100%|██████████| 35/35 [01:30<00:00, 2.21s/it] 100%|██████████| 35/35 [01:30<00:00, 2.58s/it] Decoding latents in cuda:0... done in 2.35s Move latents to cpu... done in 0.03s 0: 640x640 1 face, 7.9ms Speed: 3.5ms preprocess, 7.9ms inference, 31.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:12, 1.19it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.23it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.24it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:04<00:08, 1.26it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.26it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.26it/s] 50%|█████ | 8/16 [00:06<00:06, 1.26it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.26it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.26it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.25it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.25it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.26it/s] 94%|█████████▍| 15/16 [00:11<00:00, 1.26it/s] 100%|██████████| 16/16 [00:12<00:00, 1.48it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.78s Move latents to cpu... done in 0.0s 0: 640x640 3 hands, 7.4ms Speed: 3.1ms preprocess, 7.4ms inference, 1.8ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.29it/s] 12%|█▎ | 2/16 [00:01<00:10, 1.27it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.27it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.27it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.27it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.27it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.27it/s] 50%|█████ | 8/16 [00:06<00:06, 1.26it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.26it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.26it/s] 69%|██████▉ | 11/16 [00:08<00:03, 1.26it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.26it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.26it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.26it/s] 94%|█████████▍| 15/16 [00:11<00:00, 1.24it/s] 100%|██████████| 16/16 [00:12<00:00, 1.45it/s] 100%|██████████| 16/16 [00:12<00:00, 1.30it/s] Decoding latents in cuda:0... done in 0.77s Move latents to cpu... done in 0.0s 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.26it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.26it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.26it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.26it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.26it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.26it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.26it/s] 50%|█████ | 8/16 [00:06<00:06, 1.26it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.26it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.26it/s] 69%|██████▉ | 11/16 [00:08<00:03, 1.26it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.26it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.26it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.26it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.03it/s] 100%|██████████| 16/16 [00:13<00:00, 1.12it/s] 100%|██████████| 16/16 [00:13<00:00, 1.21it/s] Decoding latents in cuda:0... done in 0.77s Move latents to cpu... done in 0.01s 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:12, 1.17it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.22it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.23it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.23it/s] 31%|███▏ | 5/16 [00:04<00:08, 1.24it/s] 38%|███▊ | 6/16 [00:04<00:08, 1.24it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.25it/s] 50%|█████ | 8/16 [00:06<00:06, 1.25it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.25it/s] 62%|██████▎ | 10/16 [00:08<00:04, 1.24it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.24it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.24it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.23it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.24it/s] 100%|██████████| 16/16 [00:12<00:00, 1.46it/s] 100%|██████████| 16/16 [00:12<00:00, 1.28it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 1 face, 8.1ms Speed: 3.2ms preprocess, 8.1ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.28it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.27it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.26it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.26it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.26it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.25it/s] 50%|█████ | 8/16 [00:06<00:06, 1.25it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.25it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.25it/s] 69%|██████▉ | 11/16 [00:08<00:03, 1.25it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.25it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.25it/s] 94%|█████████▍| 15/16 [00:11<00:00, 1.25it/s] 100%|██████████| 16/16 [00:12<00:00, 1.47it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.78s Move latents to cpu... done in 0.0s 0: 640x640 2 hands, 7.4ms Speed: 3.4ms preprocess, 7.4ms inference, 1.6ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.29it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.27it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.26it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.26it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.26it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.26it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.26it/s] 50%|█████ | 8/16 [00:06<00:06, 1.26it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.26it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.26it/s] 69%|██████▉ | 11/16 [00:08<00:03, 1.26it/s] 75%|███████▌ | 12/16 [00:10<00:03, 1.01it/s] 81%|████████▏ | 13/16 [00:11<00:03, 1.12s/it] 88%|████████▊ | 14/16 [00:12<00:02, 1.11s/it] 94%|█████████▍| 15/16 [00:13<00:01, 1.02s/it] 100%|██████████| 16/16 [00:13<00:00, 1.20it/s] 100%|██████████| 16/16 [00:13<00:00, 1.15it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.26it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.25it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.25it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:04<00:08, 1.25it/s] 38%|███▊ | 6/16 [00:04<00:08, 1.25it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.25it/s] 50%|█████ | 8/16 [00:06<00:06, 1.25it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.25it/s] 62%|██████▎ | 10/16 [00:08<00:04, 1.25it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.24it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.24it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.24it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.24it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.24it/s] 100%|██████████| 16/16 [00:12<00:00, 1.46it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 1 face, 7.8ms Speed: 3.2ms preprocess, 7.8ms inference, 1.5ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.28it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.25it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.25it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:04<00:08, 1.25it/s] 38%|███▊ | 6/16 [00:04<00:08, 1.25it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.24it/s] 50%|█████ | 8/16 [00:06<00:06, 1.24it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.24it/s] 62%|██████▎ | 10/16 [00:08<00:04, 1.24it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.24it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.24it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.25it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.25it/s] 100%|██████████| 16/16 [00:12<00:00, 1.47it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 1 hand, 7.4ms Speed: 3.1ms preprocess, 7.4ms inference, 1.3ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.28it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.25it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.25it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.25it/s] 38%|███▊ | 6/16 [00:04<00:08, 1.24it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.24it/s] 50%|█████ | 8/16 [00:06<00:06, 1.24it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.25it/s] 62%|██████▎ | 10/16 [00:08<00:04, 1.25it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.25it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.25it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.25it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.25it/s] 100%|██████████| 16/16 [00:12<00:00, 1.47it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s Uploading outputs... Finished.
prompt
Specify things to see in the output
best quality, masterpiece, ultra high res, (photorealistic:1.3), 8K, raw photo, 1 girl, wearing kimono, outdoors, kimono Japanese house, old Japanese town, from below, sitting on wood, holding katana, katana, natural skin texture, skin pores, natural skin texture, dynamic pose, film grain, <lora:SDXL Detail:1>
negative_prompt
Specify things to not see in the output
(worst quality, low resolution, bad hands, open mouth), distorted, twisted, watermark,
num_outputs
Number of output images
3
width
Output image width
1024
height
Output image height
1024
enhance_face_with_adetailer
Enhance face with adetailer
true
enhance_hands_with_adetailer
Enhance hands with adetailer
true
adetailer_denoising_strength
1: completely redraw face or hands / 0: no effect on output images
0.45
detail
Enhance/diminish detail while keeping the overall style/character
0
brightness
Adjust brightness
0
contrast
Adjust contrast
0
seed
Same seed with the same prompt generates the same image. Set as -1 to randomize output.
2992725976
input_image
Base image that the output should be generated from. This is useful when you want to add some detail to input_image. For example, if prompt is "sunglasses" and input_image has a man, there is the man wearing sunglasses in the output.
input_image_redrawing_strength
How differ the output is from input_image. Used only when input_image is given.
0.55
reference_image
Image with which the output should share identity (e.g. face of a person or type of a dog)
reference_image_strength
Strength of applying reference_image. Used only when reference_image is given.
1
reference_pose_image
Image with a reference pose
reference_pose_strength
Strength of applying reference_pose_image. Used only when reference_pose_image is given.
1
reference_depth_image
Image with a reference depth
reference_depth_strength
Strength of applying reference_depth_image. Used only when reference_depth_image is given.
1
sampler
Sampler type
Restart
samping_steps
Number of denoising steps
35
cfg_scale
Scale for classifier-free guidance
9.5
clip_skip
The number of last layers of CLIP network to skip
2
vae
Select VAE
sdxl_vae.safetensors
lora_1
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
lora_2
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
lora_3
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
embedding_1
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
embedding_2
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
embedding_3
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
disable_prompt_modification
Disable automatically adding suggested prompt modification. Built-in LoRAs and trigger words will remain.
false
https://files.tungsten.run/uploads/7d02a6f4bf37473ead1735ff0aa363fd/00000-2992725976.webp
https://files.tungsten.run/uploads/947fa09d76fc4c37b1e73da6936c257d/00001-2992725977.webp
https://files.tungsten.run/uploads/a281bc9007ad48378c43ec28e8b25848/00002-2992725978.webp
Finished in 234.7 seconds
Setting up the model... Preparing inputs... Processing... Loading VAE weight: models/VAE/sdxl_vae.safetensors Full prompt: best quality, masterpiece, ultra high res, (photorealistic:1.3), 8K, raw photo, 1 girl, wearing kimono, outdoors, kimono Japanese house, old Japanese town, from below, sitting on wood, holding katana, katana, natural skin texture, skin pores, natural skin texture, dynamic pose, film grain, <lora:SDXL Detail:1> Full negative prompt: (worst quality, low resolution, bad hands, open mouth), distorted, twisted, watermark, 0%| | 0/35 [00:00<?, ?it/s] 3%|▎ | 1/35 [00:03<02:07, 3.76s/it] 6%|▌ | 2/35 [00:06<01:39, 3.02s/it] 9%|▊ | 3/35 [00:08<01:29, 2.79s/it] 11%|█▏ | 4/35 [00:11<01:23, 2.69s/it] 14%|█▍ | 5/35 [00:13<01:19, 2.64s/it] 17%|█▋ | 6/35 [00:16<01:15, 2.62s/it] 20%|██ | 7/35 [00:19<01:12, 2.60s/it] 23%|██▎ | 8/35 [00:21<01:09, 2.59s/it] 26%|██▌ | 9/35 [00:24<01:07, 2.58s/it] 29%|██▊ | 10/35 [00:26<01:04, 2.58s/it] 31%|███▏ | 11/35 [00:29<01:01, 2.58s/it] 34%|███▍ | 12/35 [00:31<00:59, 2.58s/it] 37%|███▋ | 13/35 [00:34<00:56, 2.58s/it] 40%|████ | 14/35 [00:37<00:54, 2.58s/it] 43%|████▎ | 15/35 [00:39<00:51, 2.59s/it] 46%|████▌ | 16/35 [00:42<00:49, 2.59s/it] 49%|████▊ | 17/35 [00:44<00:46, 2.59s/it] 51%|█████▏ | 18/35 [00:47<00:44, 2.60s/it] 54%|█████▍ | 19/35 [00:50<00:41, 2.60s/it] 57%|█████▋ | 20/35 [00:52<00:38, 2.60s/it] 60%|██████ | 21/35 [00:55<00:36, 2.60s/it] 63%|██████▎ | 22/35 [00:57<00:33, 2.60s/it] 66%|██████▌ | 23/35 [01:00<00:31, 2.60s/it] 69%|██████▊ | 24/35 [01:03<00:28, 2.60s/it] 71%|███████▏ | 25/35 [01:05<00:26, 2.60s/it] 74%|███████▍ | 26/35 [01:08<00:23, 2.60s/it] 77%|███████▋ | 27/35 [01:10<00:20, 2.60s/it] 80%|████████ | 28/35 [01:13<00:18, 2.60s/it] 83%|████████▎ | 29/35 [01:16<00:15, 2.60s/it] 86%|████████▌ | 30/35 [01:18<00:12, 2.60s/it] 89%|████████▊ | 31/35 [01:21<00:10, 2.60s/it] 91%|█████████▏| 32/35 [01:23<00:07, 2.60s/it] 94%|█████████▍| 33/35 [01:26<00:05, 2.60s/it] 97%|█████████▋| 34/35 [01:29<00:02, 2.60s/it] 100%|██████████| 35/35 [01:30<00:00, 2.21s/it] 100%|██████████| 35/35 [01:30<00:00, 2.58s/it] Decoding latents in cuda:0... done in 2.35s Move latents to cpu... done in 0.03s 0: 640x640 1 face, 7.9ms Speed: 3.5ms preprocess, 7.9ms inference, 31.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:12, 1.19it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.23it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.24it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:04<00:08, 1.26it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.26it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.26it/s] 50%|█████ | 8/16 [00:06<00:06, 1.26it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.26it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.26it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.25it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.25it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.26it/s] 94%|█████████▍| 15/16 [00:11<00:00, 1.26it/s] 100%|██████████| 16/16 [00:12<00:00, 1.48it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.78s Move latents to cpu... done in 0.0s 0: 640x640 3 hands, 7.4ms Speed: 3.1ms preprocess, 7.4ms inference, 1.8ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.29it/s] 12%|█▎ | 2/16 [00:01<00:10, 1.27it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.27it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.27it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.27it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.27it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.27it/s] 50%|█████ | 8/16 [00:06<00:06, 1.26it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.26it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.26it/s] 69%|██████▉ | 11/16 [00:08<00:03, 1.26it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.26it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.26it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.26it/s] 94%|█████████▍| 15/16 [00:11<00:00, 1.24it/s] 100%|██████████| 16/16 [00:12<00:00, 1.45it/s] 100%|██████████| 16/16 [00:12<00:00, 1.30it/s] Decoding latents in cuda:0... done in 0.77s Move latents to cpu... done in 0.0s 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.26it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.26it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.26it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.26it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.26it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.26it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.26it/s] 50%|█████ | 8/16 [00:06<00:06, 1.26it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.26it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.26it/s] 69%|██████▉ | 11/16 [00:08<00:03, 1.26it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.26it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.26it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.26it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.03it/s] 100%|██████████| 16/16 [00:13<00:00, 1.12it/s] 100%|██████████| 16/16 [00:13<00:00, 1.21it/s] Decoding latents in cuda:0... done in 0.77s Move latents to cpu... done in 0.01s 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:12, 1.17it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.22it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.23it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.23it/s] 31%|███▏ | 5/16 [00:04<00:08, 1.24it/s] 38%|███▊ | 6/16 [00:04<00:08, 1.24it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.25it/s] 50%|█████ | 8/16 [00:06<00:06, 1.25it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.25it/s] 62%|██████▎ | 10/16 [00:08<00:04, 1.24it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.24it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.24it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.23it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.24it/s] 100%|██████████| 16/16 [00:12<00:00, 1.46it/s] 100%|██████████| 16/16 [00:12<00:00, 1.28it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 1 face, 8.1ms Speed: 3.2ms preprocess, 8.1ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.28it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.27it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.26it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.26it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.26it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.25it/s] 50%|█████ | 8/16 [00:06<00:06, 1.25it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.25it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.25it/s] 69%|██████▉ | 11/16 [00:08<00:03, 1.25it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.25it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.25it/s] 94%|█████████▍| 15/16 [00:11<00:00, 1.25it/s] 100%|██████████| 16/16 [00:12<00:00, 1.47it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.78s Move latents to cpu... done in 0.0s 0: 640x640 2 hands, 7.4ms Speed: 3.4ms preprocess, 7.4ms inference, 1.6ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.29it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.27it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.26it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.26it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.26it/s] 38%|███▊ | 6/16 [00:04<00:07, 1.26it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.26it/s] 50%|█████ | 8/16 [00:06<00:06, 1.26it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.26it/s] 62%|██████▎ | 10/16 [00:07<00:04, 1.26it/s] 69%|██████▉ | 11/16 [00:08<00:03, 1.26it/s] 75%|███████▌ | 12/16 [00:10<00:03, 1.01it/s] 81%|████████▏ | 13/16 [00:11<00:03, 1.12s/it] 88%|████████▊ | 14/16 [00:12<00:02, 1.11s/it] 94%|█████████▍| 15/16 [00:13<00:01, 1.02s/it] 100%|██████████| 16/16 [00:13<00:00, 1.20it/s] 100%|██████████| 16/16 [00:13<00:00, 1.15it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.26it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.25it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.25it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:04<00:08, 1.25it/s] 38%|███▊ | 6/16 [00:04<00:08, 1.25it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.25it/s] 50%|█████ | 8/16 [00:06<00:06, 1.25it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.25it/s] 62%|██████▎ | 10/16 [00:08<00:04, 1.25it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.24it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.24it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.24it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.24it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.24it/s] 100%|██████████| 16/16 [00:12<00:00, 1.46it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 1 face, 7.8ms Speed: 3.2ms preprocess, 7.8ms inference, 1.5ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.28it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.25it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.25it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:04<00:08, 1.25it/s] 38%|███▊ | 6/16 [00:04<00:08, 1.25it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.24it/s] 50%|█████ | 8/16 [00:06<00:06, 1.24it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.24it/s] 62%|██████▎ | 10/16 [00:08<00:04, 1.24it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.24it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.24it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.25it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.25it/s] 100%|██████████| 16/16 [00:12<00:00, 1.47it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 1 hand, 7.4ms Speed: 3.1ms preprocess, 7.4ms inference, 1.3ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/16 [00:00<?, ?it/s] 6%|▋ | 1/16 [00:00<00:11, 1.28it/s] 12%|█▎ | 2/16 [00:01<00:11, 1.25it/s] 19%|█▉ | 3/16 [00:02<00:10, 1.25it/s] 25%|██▌ | 4/16 [00:03<00:09, 1.25it/s] 31%|███▏ | 5/16 [00:03<00:08, 1.25it/s] 38%|███▊ | 6/16 [00:04<00:08, 1.24it/s] 44%|████▍ | 7/16 [00:05<00:07, 1.24it/s] 50%|█████ | 8/16 [00:06<00:06, 1.24it/s] 56%|█████▋ | 9/16 [00:07<00:05, 1.25it/s] 62%|██████▎ | 10/16 [00:08<00:04, 1.25it/s] 69%|██████▉ | 11/16 [00:08<00:04, 1.25it/s] 75%|███████▌ | 12/16 [00:09<00:03, 1.25it/s] 81%|████████▏ | 13/16 [00:10<00:02, 1.25it/s] 88%|████████▊ | 14/16 [00:11<00:01, 1.25it/s] 94%|█████████▍| 15/16 [00:12<00:00, 1.25it/s] 100%|██████████| 16/16 [00:12<00:00, 1.47it/s] 100%|██████████| 16/16 [00:12<00:00, 1.29it/s] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s Uploading outputs... Finished.