Firefly is struggling with winged insect representations. Here's a typical response to a prompt calling for a Chinese-style black ink of a dragonfly. Eight wings is the most I've seen, but I've only seen a four-winged version once out of more than a dozen tries. Most have six wings, one had seven.
In terms of improving the model, there are some areas where Firefly continues to be reliably unreliable and could stand improvement:
Any prompt that calls for multiple animals (say an elephant, a crocodile, and a snake -- nothing rare or unusual) will result in chimeras: a snake with the texture of an elephant's trunk and the head of something that might be a crocodile but it's hard to tell, for example, or a couple of tiny elephants with snakes instead of trunks.
More than one person in a scene results in very distorted bodies, faces, body parts.
Many human figures have grotesque distortions: extra ears, a nose in the middle of a forehead, extra or fused limbs, fantastical shapes where a face should be, bizarre hands or feet.
Firely is beginning to learn that "four fingers plus a thumb" is the normal complement for a primate hand, but hands and feet are rendered correctly less than half the time and it is very rare for any human figure to have two sensible-looking hands and two sensible-looking feet.
Introduce unusual color to a prompt ("profile of young man with blond hair and green beard") and Firefly cannot make the beard green or the hair blond. The beard is mid-brown and the hair green. The subjects are all youngish white guys.
Firefly has real trouble with vehicles if there are people in them. A car almost never has a steering wheel, and if there is one it is usually at an impossible angle. People protrude from the car where the windshield should be (not a coupe; a sedan).
These seem to be less frequent with generative fill in Photoshop.
I've interacted with Firefly over the past week and have to agree. So far the AI does not perform well on any tasks without much user interaction to rate and train. I have not gotten a good image from Firefly yet but it appears that I am doing the training. Not beta testing at all.
ICC programmer and developer, Photographer, artist and color management expert, Print standards and process expert.
More specific might help. Stacked Bar chart is almost great. It could at least be a good starting point for post work. Columnar charts is a little better. Histogram chart is better. Bar graph. Sunburst chart might be interesting for the C, not so much for the YA. Stacked Area Chart created some pretty moments. Multi-level Pie Chart is interesting. And this was pretty good too: "multi-dimensional Columnar Graph [avoid=sphere and circle], heat map coloring"
1) prompt: `wombat gets in the way of a tourist couple's selfie. pov is the selfie camera`
- model doesn't always draw a wombat (often kangaroo or koala)
- model sometimes replaces the human head with that of the animal
2) model doesn't handle numbers or amounts well, you have to really overemphasise the amounts to force it to draw more than a handful of items eg `infinite number of monkeys sitting on a house, crammed, too many` vs `lots of monkeys sitting on a house` or `50 monkeys...`
When I typed "shih tzu dog snowboarding on solitude ski resorts utah". I get the beutiful images almost matching what I asked except the location. images match with ski resort. But don't show anywhere with the name or landmark of solitude ski resort. At least name board of "Solitude ski resorts' in backgound would satisfy the customer.
Thank you for the feedback. Our model does have some difficulty with locations and cities. Please use the thumbs up/thumbs down buttons to provide feedback on the image.
Graphs(Line, bar, Dual Axis Chart, Scatter Plot Chart, Bubble Chart, Waterfall Chart, Funnel Chart) Train it on Social media logos, Website or mobile app mockups, Analytics dashboards, Email newsletters and Digital ad banners. Things like this. Many people using this software are agencies and these are things agencies will be attracted to.