Fat Captain America with Tertium SDXL Turbo
Input
prompt
Specify things to see in the output
Cinematic shot, 5k resolution, very (fat:1.4) captain america, solo, in his costume eating burgers
negative_prompt
Specify things to not see in the output
(Portrait), (focus on face), (asymmetric face), Compression artifacts, bad art, worst quality, low quality, plastic, fake, bad limbs, conjoined, featureless, bad features, incorrect objects, watermark, logo
num_outputs
Number of output images
3
width
Output image width
1024
height
Output image height
1024
enhance_face_with_adetailer
Enhance face with adetailer
true
enhance_hands_with_adetailer
Enhance hands with adetailer
true
adetailer_denoising_strength
1: completely redraw face or hands / 0: no effect on output images
0.45
detail
Enhance/diminish detail while keeping the overall style/character
0
brightness
Adjust brightness
0
contrast
Adjust contrast
0
seed
Same seed with the same prompt generates the same image. Set as -1 to randomize output.
2176968099
input_image
Base image that the output should be generated from. This is useful when you want to add some detail to input_image. For example, if prompt is "sunglasses" and input_image has a man, there is the man wearing sunglasses in the output.
input_image_redrawing_strength
How differ the output is from input_image. Used only when input_image is given.
0.55
reference_image
Image with which the output should share identity (e.g. face of a person or type of a dog)
reference_image_strength
Strength of applying reference_image. Used only when reference_image is given.
1
reference_pose_image
Image with a reference pose
reference_pose_strength
Strength of applying reference_pose_image. Used only when reference_pose_image is given.
1
reference_depth_image
Image with a reference depth
reference_depth_strength
Strength of applying reference_depth_image. Used only when reference_depth_image is given.
1
sampler
Sampler type
DPM++ SDE Karras
samping_steps
Number of denoising steps
5
cfg_scale
Scale for classifier-free guidance
2.2
clip_skip
The number of last layers of CLIP network to skip
2
vae
Select VAE
sdxl_vae.safetensors
lora_1
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
lora_2
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
lora_3
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
embedding_1
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
embedding_2
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
embedding_3
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
disable_prompt_modification
Disable automatically adding suggested prompt modification. Built-in LoRAs and trigger words will remain.
false
Output
https://files.tungsten.run/uploads/5728aa9752564fe5a7ec742a1e36a686/00000-2176968099.webp
https://files.tungsten.run/uploads/0b36a4acd2cb4fc684ff1413ca522028/00001-2176968100.webp
https://files.tungsten.run/uploads/aa9e29a296ff4fcab442f3da84d7309c/00002-2176968101.webp
Finished in 88.9 seconds
Setting up the model... Preparing inputs... Processing... Loading VAE weight: models/VAE/sdxl_vae.safetensors Full prompt: Cinematic shot, 5k resolution, very (fat:1.4) captain america, solo, in his costume eating burgers Full negative prompt: (Portrait), (focus on face), (asymmetric face), Compression artifacts, bad art, worst quality, low quality, plastic, fake, bad limbs, conjoined, featureless, bad features, incorrect objects, watermark, logo 0%| | 0/5 [00:00<?, ?it/s] 20%|██ | 1/5 [00:04<00:16, 4.24s/it] 40%|████ | 2/5 [00:10<00:15, 5.20s/it] 60%|██████ | 3/5 [00:15<00:10, 5.43s/it] 80%|████████ | 4/5 [00:19<00:04, 4.83s/it] 100%|██████████| 5/5 [00:21<00:00, 3.56s/it] 100%|██████████| 5/5 [00:21<00:00, 4.21s/it] Decoding latents in cuda:0... done in 2.39s Move latents to cpu... done in 0.03s 0: 640x640 1 face, 8.4ms Speed: 6.7ms preprocess, 8.4ms inference, 24.1ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.89s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.50s/it] 100%|██████████| 3/3 [00:03<00:00, 1.00it/s] 100%|██████████| 3/3 [00:03<00:00, 1.17s/it] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 2 hands, 7.5ms Speed: 3.3ms preprocess, 7.5ms inference, 2.0ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.81s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.46s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.14s/it] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.81s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.46s/it] 100%|██████████| 3/3 [00:03<00:00, 1.02it/s] 100%|██████████| 3/3 [00:03<00:00, 1.14s/it] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 3 faces, 8.5ms Speed: 3.1ms preprocess, 8.5ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.83s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.46s/it] 100%|██████████| 3/3 [00:03<00:00, 1.02it/s] 100%|██████████| 3/3 [00:03<00:00, 1.15s/it] Decoding latents in cuda:0... done in 0.8s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.83s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.46s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.14s/it] Decoding latents in cuda:0... done in 0.8s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.81s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.45s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.14s/it] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 2 hands, 7.1ms Speed: 3.3ms preprocess, 7.1ms inference, 1.5ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.79s/it] 67%|██████▋ | 2/3 [00:02<00:01, 1.45s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.13s/it] Decoding latents in cuda:0... done in 0.8s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.79s/it] 67%|██████▋ | 2/3 [00:02<00:01, 1.45s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.13s/it] Decoding latents in cuda:0... done in 0.81s Move latents to cpu... done in 0.01s 0: 640x640 1 face, 7.7ms Speed: 3.1ms preprocess, 7.7ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.79s/it] 67%|██████▋ | 2/3 [00:02<00:01, 1.44s/it] 100%|██████████| 3/3 [00:03<00:00, 1.04it/s] 100%|██████████| 3/3 [00:03<00:00, 1.13s/it] Decoding latents in cuda:0... done in 0.8s Move latents to cpu... done in 0.0s 0: 640x640 2 hands, 7.4ms Speed: 3.0ms preprocess, 7.4ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.79s/it] 67%|██████▋ | 2/3 [00:02<00:01, 1.44s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.13s/it] Decoding latents in cuda:0... done in 0.81s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:02<00:04, 2.13s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.60s/it] 100%|██████████| 3/3 [00:03<00:00, 1.05s/it] 100%|██████████| 3/3 [00:03<00:00, 1.25s/it] Decoding latents in cuda:0... done in 0.81s Move latents to cpu... done in 0.0s Uploading outputs... Finished.
prompt
Specify things to see in the output
Cinematic shot, 5k resolution, very (fat:1.4) captain america, solo, in his costume eating burgers
negative_prompt
Specify things to not see in the output
(Portrait), (focus on face), (asymmetric face), Compression artifacts, bad art, worst quality, low quality, plastic, fake, bad limbs, conjoined, featureless, bad features, incorrect objects, watermark, logo
num_outputs
Number of output images
3
width
Output image width
1024
height
Output image height
1024
enhance_face_with_adetailer
Enhance face with adetailer
true
enhance_hands_with_adetailer
Enhance hands with adetailer
true
adetailer_denoising_strength
1: completely redraw face or hands / 0: no effect on output images
0.45
detail
Enhance/diminish detail while keeping the overall style/character
0
brightness
Adjust brightness
0
contrast
Adjust contrast
0
seed
Same seed with the same prompt generates the same image. Set as -1 to randomize output.
2176968099
input_image
Base image that the output should be generated from. This is useful when you want to add some detail to input_image. For example, if prompt is "sunglasses" and input_image has a man, there is the man wearing sunglasses in the output.
input_image_redrawing_strength
How differ the output is from input_image. Used only when input_image is given.
0.55
reference_image
Image with which the output should share identity (e.g. face of a person or type of a dog)
reference_image_strength
Strength of applying reference_image. Used only when reference_image is given.
1
reference_pose_image
Image with a reference pose
reference_pose_strength
Strength of applying reference_pose_image. Used only when reference_pose_image is given.
1
reference_depth_image
Image with a reference depth
reference_depth_strength
Strength of applying reference_depth_image. Used only when reference_depth_image is given.
1
sampler
Sampler type
DPM++ SDE Karras
samping_steps
Number of denoising steps
5
cfg_scale
Scale for classifier-free guidance
2.2
clip_skip
The number of last layers of CLIP network to skip
2
vae
Select VAE
sdxl_vae.safetensors
lora_1
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
lora_2
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
lora_3
LoRA file. Apply by writing the following in prompt: <lora:FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE>
embedding_1
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
embedding_2
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
embedding_3
Embedding file (textural inversion). Apply by writing the following in prompt or negative prompt: (FILE_NAME_WITHOUT_EXTENSION:MAGNITUDE)
disable_prompt_modification
Disable automatically adding suggested prompt modification. Built-in LoRAs and trigger words will remain.
false
https://files.tungsten.run/uploads/5728aa9752564fe5a7ec742a1e36a686/00000-2176968099.webp
https://files.tungsten.run/uploads/0b36a4acd2cb4fc684ff1413ca522028/00001-2176968100.webp
https://files.tungsten.run/uploads/aa9e29a296ff4fcab442f3da84d7309c/00002-2176968101.webp
Finished in 88.9 seconds
Setting up the model... Preparing inputs... Processing... Loading VAE weight: models/VAE/sdxl_vae.safetensors Full prompt: Cinematic shot, 5k resolution, very (fat:1.4) captain america, solo, in his costume eating burgers Full negative prompt: (Portrait), (focus on face), (asymmetric face), Compression artifacts, bad art, worst quality, low quality, plastic, fake, bad limbs, conjoined, featureless, bad features, incorrect objects, watermark, logo 0%| | 0/5 [00:00<?, ?it/s] 20%|██ | 1/5 [00:04<00:16, 4.24s/it] 40%|████ | 2/5 [00:10<00:15, 5.20s/it] 60%|██████ | 3/5 [00:15<00:10, 5.43s/it] 80%|████████ | 4/5 [00:19<00:04, 4.83s/it] 100%|██████████| 5/5 [00:21<00:00, 3.56s/it] 100%|██████████| 5/5 [00:21<00:00, 4.21s/it] Decoding latents in cuda:0... done in 2.39s Move latents to cpu... done in 0.03s 0: 640x640 1 face, 8.4ms Speed: 6.7ms preprocess, 8.4ms inference, 24.1ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.89s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.50s/it] 100%|██████████| 3/3 [00:03<00:00, 1.00it/s] 100%|██████████| 3/3 [00:03<00:00, 1.17s/it] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 2 hands, 7.5ms Speed: 3.3ms preprocess, 7.5ms inference, 2.0ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.81s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.46s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.14s/it] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.81s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.46s/it] 100%|██████████| 3/3 [00:03<00:00, 1.02it/s] 100%|██████████| 3/3 [00:03<00:00, 1.14s/it] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 3 faces, 8.5ms Speed: 3.1ms preprocess, 8.5ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.83s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.46s/it] 100%|██████████| 3/3 [00:03<00:00, 1.02it/s] 100%|██████████| 3/3 [00:03<00:00, 1.15s/it] Decoding latents in cuda:0... done in 0.8s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.83s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.46s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.14s/it] Decoding latents in cuda:0... done in 0.8s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.81s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.45s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.14s/it] Decoding latents in cuda:0... done in 0.79s Move latents to cpu... done in 0.0s 0: 640x640 2 hands, 7.1ms Speed: 3.3ms preprocess, 7.1ms inference, 1.5ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.79s/it] 67%|██████▋ | 2/3 [00:02<00:01, 1.45s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.13s/it] Decoding latents in cuda:0... done in 0.8s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.79s/it] 67%|██████▋ | 2/3 [00:02<00:01, 1.45s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.13s/it] Decoding latents in cuda:0... done in 0.81s Move latents to cpu... done in 0.01s 0: 640x640 1 face, 7.7ms Speed: 3.1ms preprocess, 7.7ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.79s/it] 67%|██████▋ | 2/3 [00:02<00:01, 1.44s/it] 100%|██████████| 3/3 [00:03<00:00, 1.04it/s] 100%|██████████| 3/3 [00:03<00:00, 1.13s/it] Decoding latents in cuda:0... done in 0.8s Move latents to cpu... done in 0.0s 0: 640x640 2 hands, 7.4ms Speed: 3.0ms preprocess, 7.4ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 640) 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:01<00:03, 1.79s/it] 67%|██████▋ | 2/3 [00:02<00:01, 1.44s/it] 100%|██████████| 3/3 [00:03<00:00, 1.03it/s] 100%|██████████| 3/3 [00:03<00:00, 1.13s/it] Decoding latents in cuda:0... done in 0.81s Move latents to cpu... done in 0.0s 0%| | 0/3 [00:00<?, ?it/s] 33%|███▎ | 1/3 [00:02<00:04, 2.13s/it] 67%|██████▋ | 2/3 [00:03<00:01, 1.60s/it] 100%|██████████| 3/3 [00:03<00:00, 1.05s/it] 100%|██████████| 3/3 [00:03<00:00, 1.25s/it] Decoding latents in cuda:0... done in 0.81s Move latents to cpu... done in 0.0s Uploading outputs... Finished.