Smart detection of contradictions
Firefly image and video doesn't communicate how it is dealing with contradictions in what is specified.
Example 1:
In Firefly video in the 'shot size' setting, I choose 'Extreme long shot' but in the prompt text I include 'Extreme close up'. The result after multiple attempts was that the prompt 'Extreme close up' took priority in the output.
Example 2:
In Firefly video, my prompt had "red and orange koala, grayscale" (abbreviated version). Results with multiple attempts included simply a grayscale koala, a mostly grayscale koala with some hint of red/orange in its face and ears, a face that starts as mostly grayscale that animates into a more orange face, a face that is orange but ears, eyes, nose, body are grayscale.
The above are clear contradictions. There may be more subtle ones that a user does not realise.
A way Firefly could deal with things like the above, both for images and video),is to show the user the actual prompt that was used - so a new Firefly version of the prompt used for a specific image which could identify what it ignored, what it interepreted, what it added beyond what was stated. Maybe Firefly can give feedback on the prompt. Maybe in the future, you can chat with Firefly about the image and the prompt.

