Part 2

Going Beyond the Basics

In Part 1 of our Prompt-Crafting Guide, we covered the absolute essentials – the building blocks of a prompt, how to structure one effectively, and the basics to kickstart your creative journey.

Now, in Part 2, we're diving into a slightly more advanced technique that's crucial for any prompter: adding emphasis or attention to specific tokens in your prompt. This skill lets us highlight or de-emphasize certain elements, giving us control over the final image that AI generates.

Mastering the art of emphasis can significantly enhance the precision and impact of our artworks.

Boosting or Diminishing Attention

The way we emphasize tokens can vary depending on the platform or interface you're using for your art creation. However, when working with Automatic1111's Web UI or Civitai.com's On-Site Generator (among others), the syntax follows a standard format.

Emphasis

The table below explains precisely how much each set of parentheses or square brackets increases or decreases attention:

https://lh7-us.googleusercontent.com/T7cD2FKWVnjV8Z1zuceuLnAQk1zIIDRBDYcq4PbdmMiOEgMLxaVwlfJyY2CuOQL1S-J1YDkwruv6IpgqAc3dkrvtMFIxBL6ilUF-s51uKpBL11llVQtEw0e93Up7SNbJxMuXMiPvxjLn_ITxyTd6g7s

Demonstrating the Power of Emphasis

In this extreme example, we showcase the profound impact of emphasis on an image. We take the word "stars" as our token and demonstrate how we can decrease its significance, reducing the prevalence of stars in the first image. Conversely, we dramatically increase the number and prominence of stars, both in the background and foreground, in the second image.

https://lh7-us.googleusercontent.com/f8VPF6XkUfScUo2VTT1AMkTzSLIgc3TGUADTrE1UsZUlsMM9230rXEZFL9QdewejXvyC6oEQCcbyuFEs6qTR7MfqR-vX-yUlIAL1G0bPR9qY_hwHQmipT_G5k0rutFd0GhiqCg7XOb1MhjTRc8c_kvs

Token Weighting

Token Weighting works on exactly the same principle as the parenthesis emphasis syntax, but is a later addition to Stable Diffusion interfaces, and is far more common to see in prompts, having become the de-facto standard.

A token is weighted with the syntax (token:n.n), n being the desired factor of attention. Generally, when weighting tokens, we stay within the (token:0.5) to (token:1.6) range, as higher and lower values may produce unexpected results in our images!