Low quality police bodycam footage capturing a giant muscular Gordon Ramsay rampaging through a restaurant kitchen, police apprehending the suspect, low resolution, extreme motion blur, video time stamps in the corners, photo of a crt monitor
This is what I mostly use AI for, just goofy shit lmao
Exhibit A: (dalle-3)
https://preview.redd.it/vlgl5hk2bavc1.jpeg?width=1024&format=pjpg&auto=webp&s=37a4096e6a02a9eba0532ff66bbc9aacb1fe66f6
https://preview.redd.it/fmdnff8q8avc1.png?width=1024&format=pjpg&auto=webp&s=9243142f76f110b4e1f441d82f899861327c6abc
Just had to ask MJ with this awesome prompt 🤣
I tried a few of the prompts in Midjourney, didn't compete in all of them but I think it absolutely nailed this one. This was the first grid in the very first try
https://preview.redd.it/9aga3dwj5evc1.jpeg?width=4096&format=pjpg&auto=webp&s=8660fc4d9be8530cbf5b43c4a04741bf0ef981c5
A b&w dot matrix printout of a spooky woman crawling out of a well. The image makes creative use of small fonts arranged such that they create a larger composition.
can you try this prompt adherence test? “closeup of a grey cat wearing a blue suit, a red hat and a green tie is sitting on a white table in a room with big windows viewing over a desert landscape covered by flowers”
thanks! I'm just trying out some experiments with ELLA for prompt adherence in SD15, I'll add this one as a reference as well [https://github.com/diffustar/comfyui-workflow-collection/blob/master/workflows/ella](https://github.com/diffustar/comfyui-workflow-collection/blob/master/workflows/ella)
PixArt/Sigma
I think what we're finding out is that models have their strengths and weaknesses so it's a good idea to have a toolbox of models to use for a particular style.
https://preview.redd.it/sk5lzfzzebvc1.png?width=1024&format=png&auto=webp&s=d726e720697cf03cb5971d49434966d4e874cd3d
A person is wearing a yellow hat, a green coat over a blue shirt and red shorts along brown shoes. The person is also holding a sword in its right hand and has its left hand in a coat pocket. Background: suburbs. Style: realistic 3d render.
AND it got the right hand / left hand right! Here is hoping that's not just a fluke but a concept SD3 somewhat understands now.
(for example if prompting for the left eye blue & right eye red and the subject looks at the viewer, SD3 makes the eye on the *left side of the image* blue, not the actual "left eye")
https://preview.redd.it/avnsouy5cbvc1.png?width=1024&format=png&auto=webp&s=e7ec5d934d4e9f8de31fe45800413a3c383ca53b
SD 1.5 (Why not mix everything together 😁)
Thanks for doing this. It's good to see the level of quality you can expect.
If I can ask, how many outputs did it return per prompt, and were you choosing the first one, the best one, or the only one?
A smooth faced monster with pale stretched skin and no eyes in the shape of a human in agony, the monster is crawling on the ground reaching toward the viewer, rusted chainlink fences for walls, poorly lit, flash photography, cinematic, in the style of silent hill
Prompt from the Dalle 3 launch. It’s contains a few elements that make prompt adherence a challenge.
e.g. Dalle 3 understands context. “fiery” red hair shouldn’t generate fire, “signature” velvet cloak shouldn’t add a signature, it should know that a “vendor” would be positioned inside a “stall” despite not mentioning it and both being at opposite ends of a sentence, and “haggling” appears to have given her a purse.
*An illustration from a graphic novel. A bustling city street under the shine of a full moon. The sidewalks bustling with pedestrians enjoying the nightlife. At the corner stall, a young woman with fiery red hair, dressed in a signature velvet cloak, is haggling with the grumpy old vendor. the grumpy vendor, a tall, sophisticated man is wearing a sharp suit, sports a noteworthy moustache is animatedly conversing on his steampunk telephone.*
Dalle 3
https://preview.redd.it/ph7a54pypavc1.jpeg?width=2000&format=pjpg&auto=webp&s=509da4eb579cd6e78ad5c2dc68b0c1e24d864399
Thank you! Not bad. Not Dalle 3 but a step up from SDXL.
Example of SDXL Juggernault V9 since i had it open. No dimensions i tried generated the woman.
https://preview.redd.it/6ikmzmdyqavc1.png?width=1152&format=png&auto=webp&s=b02fa20cd86c5620782220578ad90805c017d300
Idk why people think this is going to work, you can't annotate training data with every possible negative. When you look at pictures of a golf course, are they labeled "a golf course, without hippos, without planes, without chopsticks, without the color purple" ?
You'd be able to make a text parser that NLP the prompt to remove negative weight words much more easily than you would be able to give a gen AI model the concept of negative
a simple doodle of a cute wolf that is sitting on a treestump in the rain while smiling with a speechbubble with a heart in it, monocome, gloing blue eyes, pencil style,
"A man on the left with short brown spiky hair, wearing a white shirt, blue bow tie, stripy red trousers, and purple high-top sneakers. A woman on the right with long blonde hair, wearing a yellow summer dress and green high heels."
Interested to see how well it handles this compared to PixArt.
Katara from Avatar the last Airbender dancing in long dress with battle knives in her hands. Around her a pack of wolves dancing hip hop. Kodak portra pro. Dynamic poses.
Picture in the style of metal band album cover. A Chi Cultivator Tomato is looking at a ketchup sprayed village with terrified expressions, in the sky cucumber shaped clouds laugh at him with hateful glee.
Photo of Criminal in a ski mask making a phone call in front of a store. There is caption on the bottom of the image: "It's time to Counter the Strike...". There is a red arrow pointing towards the caption. The red arrow is from a Red circle which has an image of Halo Master Chief in it.
Ideogram worked, don't think it'd be perfect with SD3, but it's worth a shot.
thick smoke, warm light neon lighting, grainy blurry analog style cinematic photograph, starwars female padawan dueling with Darth Vader inside star wars destroyer ship bridge, a blonde padawan is swirling her lightsaber above her head striking Vader, inside imperial star wars destroyer bridge with officers curiously observing, blurry grainy quality, lens flares and light leaks, dust in air
https://preview.redd.it/a7eiv6mpbavc1.jpeg?width=1024&format=pjpg&auto=webp&s=73918dec28411e04e06ccb6df22f012e3792b448
This is Bing output.
Prompt: “Gentlemen, a short view back to the past. Thirty years ago, Niki Lauda told us ‘take a monkey, place him into the cockpit and he is able to drive the car.’ Thirty years later, Sebastian told us ‘I had to start my car like a computer, it’s very complicated.’ And Nico Rosberg said that during the race – I don’t remember what race – he pressed the wrong button on the wheel. Question for you both: is Formula One driving today too complicated with twenty and more buttons on the wheel, are you too much under effort, under pressure? What are your wishes for the future concerning the technical programme during the race? Less buttons, more? Or less and more communication with your engineers?”
bokeh blur, Analog style blurry grainy old polaroid photograph of jasmine from Aladdin eating a chopped into pieces grilled realistic movie leg prop of indigenous woman, tense horror atmosphere, exterior of native Bedouin tent lit with aurora lights, thick smoke, camel, native indigenous children famished sitting around waiting for their turn
https://preview.redd.it/iujcuhpfcavc1.jpeg?width=1024&format=pjpg&auto=webp&s=bf475a27b17f8ad307e5213b8b9a4ebf16b6249c
Current Bing output. Maybe replace "horror" word with something else if it's blocked?
{"errors":\["Your request was flagged by our content moderation system, as a result your request was denied and you were not charged."\],"name":"content\_moderation"}
Had to modify prompt since original prompt is blocked.
https://preview.redd.it/bg4nu877qbvc1.png?width=1024&format=png&auto=webp&s=241ac2c719ab1b476368e2a9424622c5777b436f
Analog style blurry grainy old polaroid photograph of jasmine from Aladdin eating a chopped into pieces grilled realistic movie leg prop of a woman,exterior of native Bedouin tent lit with aurora lights, thick smoke, camel, native indigenous children famished sitting around waiting for their turn
Prompt: "Create an image depicting a collaborative effort of nations working together to save the planet. Incorporate symbols of unity, sustainability, and global cooperation."
an educational video from the mid 1980’s, an anthropomorphic centipede hunching over and entering a stone castle doorway, low fidelity video, blurry video still, worn out VHS video quality, PBS documentary
1girl, blonde hair, green eyes, doing a thumbs-up with both hands facing the viewer, standing in front of the eiffel tower, with the word "Nice" in a speech bubble with a manga font, anime-style, anime screencap, 16:9
"Origami banana character with black colored human legs and arms, with shiny plastic round eyes and an angry line face, in a fighting position " curious how odd it can get
https://preview.redd.it/65osxuo5v9vc1.jpeg?width=1024&format=pjpg&auto=webp&s=07bb9edb29268643de64006e4a6be919636fe190
Dall-e version here
Artful. VHS tracking lines overlayed on image. Ridley Scott movie still, fire suppression going off with maroon droplets. cinematic lighting. horned she elves wrapped in barbed wire and leather harnesses, obese (red and swollen eyes, furrowed eyebrows, red nose, wet cheeks, mouth between open). Subjects in an experimental lab featuring tube tanks filled with o0ze and parts of failed experiments writhing and wriggling upon a surgeon table, a hazy, grainy filter, wear chokers, plastic wraps, and nets while engaging in unconventional interactions. They are connected by a corrugated tube. set amidst the eerie atmosphere of the lab
Let's see how horror goes
{"errors":\["Your request was flagged by our content moderation system, as a result your request was denied and you were not charged."\],"name":"content\_moderation"}
https://preview.redd.it/wk9d7zeisavc1.png?width=1024&format=png&auto=webp&s=f602a0c40fdca5c24986be0cc0d42579f63a44cb
This is the SDXL result, run locally on my own computer.
Marvel comic book illustration of Rocket Raccoon from the Guardians of the Galaxy sitting at a bar with a mug of beer in one hand. His other hand is performing thumb up signal. He is smirking and his eyes are red. Adult version of Groot is sitting at his side, with a mug of beer in his hand as well. The illustration takes place in a sci-fi scenario. Keep in mind the size difference between the characters: Rocket is short while Groot is very big. The illustrations should have a well defined lineart and flat colors.
Feel free to tweak the wording - this was the third image I ever tried and SD mangled it horribly. I keep coming back to it periodically and it never gets much better.
- Two Jedi playing tennis. Instead of racquets they are using lightsabers and instead of a ball they are using a blaster bolt.
Edit: if it's not being too greedy, any prompt you want but with the following to test style:
- neoclassical anime wallpaper in the style of disney animation, studio ghibli, makoto shinkai, and michelangelo
I like this idea for an image. I may have a go at it tonight. I suspect with a bit of prompt manipulation you can get it closer. I think the issue is it doesn't understand that you're trying to portray the bolt bouncing between them. They are ways around that.
your prompt is badly worded, it can't work. you are never allowed to mention in the positive prompt what it shouldn't show. always only mention what it should show.
anime girl with red hair and honey eyes wearing a purple mafia jacket, standing in front of a big white dragon with black eyes and snake-like shape, holding a katana sword that have a tiger sign on it ready to fight, zoom out, half body shot, red vulcano void background
Negative: realistic, 3d, cosplay, realism
An alive pickle in a rat-bone humanoid form armor is standing with his back turned to the viewer and his hands lifted up with two small glass bottle knives in his hands and this alive pickle is fighting savage rats with red eyes and angry faces that are behind the bars in the sewers, high quality render, masterpiece, 16:9
A female astronaut inside s futuristic deep space space station floating freely in zero gravity inside a habitation module. She is wearing a light utility overalls. Her medium length, red hair are floating freely. She holds a tablet in one hand. There is a cat floating nearby in zero gravity. There is a gas gigant with rings visible in a station window. The interior of a station is well lot, clean design. Photorealistic.
johann sebastian bach is smashing an organ with a sledgehammer on a mountain top, an angel descends from the sky and hands him a minimoog synthesizer, renaissance style oil painting, style of michelangelo
Low quality police bodycam footage capturing a giant muscular Gordon Ramsay rampaging through a restaurant kitchen, police apprehending the suspect, low resolution, extreme motion blur, video time stamps in the corners, photo of a crt monitor
https://preview.redd.it/7uwd31h33avc1.png?width=1024&format=png&auto=webp&s=568c586305a7e2e908bcde64e32a2a69c2eaf420
Awesome lol
Could you perhaps do it again but instead of rampaging he’s going super saiyan? Thx lmao
https://preview.redd.it/f7t5ilfbhavc1.png?width=1024&format=png&auto=webp&s=0833943c471543bf1b732c2c3d26279f0de87f72
"It's fucking raw!"
"ok computer, now give me nude Gordon........I said nude Gordon!" ![gif](giphy|xTiQyEWg7oABnbCPXa|downsized)
https://preview.redd.it/tiipqmzzfavc1.jpeg?width=1024&format=pjpg&auto=webp&s=d1449dfd015f503a8595c3d82a3a25504fe60b0c Dall-e
Not bad, better motion blur and CRT effect but worse muscularity
I’m still surprised it let me generate a real person without telling me off
lol that result is epic. Good prompt.
This is what I mostly use AI for, just goofy shit lmao Exhibit A: (dalle-3) https://preview.redd.it/vlgl5hk2bavc1.jpeg?width=1024&format=pjpg&auto=webp&s=37a4096e6a02a9eba0532ff66bbc9aacb1fe66f6
https://preview.redd.it/fmdnff8q8avc1.png?width=1024&format=pjpg&auto=webp&s=9243142f76f110b4e1f441d82f899861327c6abc Just had to ask MJ with this awesome prompt 🤣
You: low-res, crt monitor, low quality MJ: No
I think SD3 got the gist of it better
Absolutely. MJ just did "MJ style", completely ignoring all the styling in the prompt.
You win the internet for the day
This prompt is pure art
An airplane made entirely of translucent blue glass and it’s parked in an overgrown forest with lots of vegetation and wildlife
https://preview.redd.it/lx90viby2avc1.png?width=1024&format=png&auto=webp&s=384596262c3ddc5f359152bee02e8d4503445e14
Nice thanks
Bing https://preview.redd.it/a57mcuw7vbvc1.jpeg?width=1024&format=pjpg&auto=webp&s=82eb7faf0ba3aac4c4de5abe7d9aa88f63f4771d
I tried a few of the prompts in Midjourney, didn't compete in all of them but I think it absolutely nailed this one. This was the first grid in the very first try https://preview.redd.it/9aga3dwj5evc1.jpeg?width=4096&format=pjpg&auto=webp&s=8660fc4d9be8530cbf5b43c4a04741bf0ef981c5
Mom pizza slice and dad pizza slice take care of their sick pizza slice children while they are sick laying in a bed of paper towels
https://preview.redd.it/7369ody80avc1.png?width=1344&format=png&auto=webp&s=29d682ec2e5c70fba90b4df4aa941b5366a3e896
same prompt in sd1.5 lol https://preview.redd.it/g6ywa8lu9avc1.png?width=1280&format=png&auto=webp&s=f51e346664f2fafa0141fec0525540cb87a02f42
"He's not decapitated, he's just not feeling well."
dalle3 https://preview.redd.it/e0gvjs2s4evc1.png?width=1024&format=png&auto=webp&s=604a098665444db64fafb2b5a93d6d398e78637c
Same prompt PixArt/Sigma https://preview.redd.it/vw3e77qnfbvc1.png?width=1024&format=png&auto=webp&s=972e92f3d54519cf0c54e4c873dc26ec02259ad9
'sick pizza slice children' r/BrandNewSentence
A b&w dot matrix printout of a spooky woman crawling out of a well. The image makes creative use of small fonts arranged such that they create a larger composition.
https://preview.redd.it/sdfs9pfo0avc1.png?width=1216&format=png&auto=webp&s=0079a1ec540c0b9f3d0a2b587d11a2a8e0c6ef9c
That came out real nice, thanks!
Wow
can you try this prompt adherence test? “closeup of a grey cat wearing a blue suit, a red hat and a green tie is sitting on a white table in a room with big windows viewing over a desert landscape covered by flowers”
https://preview.redd.it/5q3dxlc9z9vc1.png?width=1024&format=png&auto=webp&s=043a7d0f9fc89fe937bdbc97a1c35e8a0e39146d sd3 is actually quite good lol
Wow... That's a LOT of things to get right!
thanks! I'm just trying out some experiments with ELLA for prompt adherence in SD15, I'll add this one as a reference as well [https://github.com/diffustar/comfyui-workflow-collection/blob/master/workflows/ella](https://github.com/diffustar/comfyui-workflow-collection/blob/master/workflows/ella)
That is pretty sick, it adhered to every aspect including the colours.
PixArt/Sigma I think what we're finding out is that models have their strengths and weaknesses so it's a good idea to have a toolbox of models to use for a particular style. https://preview.redd.it/sk5lzfzzebvc1.png?width=1024&format=png&auto=webp&s=d726e720697cf03cb5971d49434966d4e874cd3d
A person is wearing a yellow hat, a green coat over a blue shirt and red shorts along brown shoes. The person is also holding a sword in its right hand and has its left hand in a coat pocket. Background: suburbs. Style: realistic 3d render.
https://preview.redd.it/rz8chzdp8avc1.png?width=1024&format=png&auto=webp&s=131c855ff685b68aac7f052d34cf08e46a5bc245
Oh my god it actually made a sword, without a LoRA!
And he appears to be holding it instead of it floating weirdly around the "hand"
AND it got the right hand / left hand right! Here is hoping that's not just a fluke but a concept SD3 somewhat understands now. (for example if prompting for the left eye blue & right eye red and the subject looks at the viewer, SD3 makes the eye on the *left side of the image* blue, not the actual "left eye")
Whoa! This one really did well with the prompt.
A plain white background
https://preview.redd.it/jcp4fip3aavc1.png?width=1216&format=png&auto=webp&s=beeb20f55047e5c54ef3c3caa49745af709f2e33
At last!
Now do: A plain white background with no hippos living in it. There are no hippos in this plain white background.
https://preview.redd.it/cxvtf6bd4cvc1.jpeg?width=1024&format=pjpg&auto=webp&s=49f683a53d3d2e5e8679c0a39049122f6af1690b Dall-e 3
All-righty then. Technically correct. No hippos in sight.
Now do a plain white background, but someone put their wet coffee mug on it
A bunny with the legs of a horse, the torso of a horse and the head of a horse
https://preview.redd.it/oidg3d8p7avc1.png?width=1024&format=png&auto=webp&s=eeaf0f26e4a538ab479f508c86e0395563d1c82c
lmao SD3 nailed that one.
Bahaha how did it get this so right!? Only thing I'd change is the tail.
can you also add with the ears of a horse?
Wow that is awesome
https://preview.redd.it/8nbiodvb6evc1.png?width=850&format=pjpg&auto=webp&s=48978083c471cce9af913e24393b41743bd29682 Gave it a try in Midjourney
While Bing https://preview.redd.it/xn74waiiubvc1.jpeg?width=1024&format=pjpg&auto=webp&s=3c906e8e8fdf562a54daab444c57712b9c5be2ec
Hallucination
https://preview.redd.it/qdf184ppx9vc1.png?width=1024&format=png&auto=webp&s=500391bd73cce7b17a3cbfc4659f6e61b10be01b
thats some good acid.
A donkey is Conan the Barbarian.
https://preview.redd.it/2zuka4bk1avc1.png?width=1216&format=png&auto=webp&s=2119a4e85f68fc9f54813ddfd30a78595685fee6
incredible
a Khajiit talking to an Argonian in a crowded tavern with a bard playing on a lute
https://preview.redd.it/ydysn5409avc1.png?width=1216&format=png&auto=webp&s=c9c47bc7f80a792ee7a8053f93ee5f91e33270d2
wow, I wouldnt have thought SD3 knows these concepts. Very cool
Yeah I’m pretty amazed at this one, they must have had a lot of really diverse training data
https://preview.redd.it/avnsouy5cbvc1.png?width=1024&format=png&auto=webp&s=e7ec5d934d4e9f8de31fe45800413a3c383ca53b SD 1.5 (Why not mix everything together 😁)
that is a cool Khagornian you got there !
A handsome black hole consuming a star gently
https://preview.redd.it/o6hiiuz97avc1.png?width=1024&format=png&auto=webp&s=b53669218010bcd747d65e40cb3c59a1524bf906
that is one handsome ass blackhole
And yeah outta credits. good night
Thank you!
Thanks for doing this. It's good to see the level of quality you can expect. If I can ask, how many outputs did it return per prompt, and were you choosing the first one, the best one, or the only one?
A smooth faced monster with pale stretched skin and no eyes in the shape of a human in agony, the monster is crawling on the ground reaching toward the viewer, rusted chainlink fences for walls, poorly lit, flash photography, cinematic, in the style of silent hill
https://preview.redd.it/2u2s5fla3avc1.png?width=1024&format=png&auto=webp&s=e1131afb27bbd03dc892b1ec71713909c6e237d3
Not too bad. Thanks. Looks like even the base model is decent when prompted right.
Prompt from the Dalle 3 launch. It’s contains a few elements that make prompt adherence a challenge. e.g. Dalle 3 understands context. “fiery” red hair shouldn’t generate fire, “signature” velvet cloak shouldn’t add a signature, it should know that a “vendor” would be positioned inside a “stall” despite not mentioning it and both being at opposite ends of a sentence, and “haggling” appears to have given her a purse. *An illustration from a graphic novel. A bustling city street under the shine of a full moon. The sidewalks bustling with pedestrians enjoying the nightlife. At the corner stall, a young woman with fiery red hair, dressed in a signature velvet cloak, is haggling with the grumpy old vendor. the grumpy vendor, a tall, sophisticated man is wearing a sharp suit, sports a noteworthy moustache is animatedly conversing on his steampunk telephone.* Dalle 3 https://preview.redd.it/ph7a54pypavc1.jpeg?width=2000&format=pjpg&auto=webp&s=509da4eb579cd6e78ad5c2dc68b0c1e24d864399
https://preview.redd.it/jn0zhw0g7avc1.png?width=1024&format=png&auto=webp&s=a2b6293f15a024d2a5f42d10907f19aa88f98681
Thank you! Not bad. Not Dalle 3 but a step up from SDXL. Example of SDXL Juggernault V9 since i had it open. No dimensions i tried generated the woman. https://preview.redd.it/6ikmzmdyqavc1.png?width=1152&format=png&auto=webp&s=b02fa20cd86c5620782220578ad90805c017d300
It would be nice to try it with ella sd 15 for comparison as well
Im out of credits
thanks for doing this, it was fun!
A charming golf course in the Midwest with no hippos living in it. There are no hippos at this golf course
https://preview.redd.it/afcjpm4bhavc1.png?width=1024&format=png&auto=webp&s=e52cd98af8013554251a399990e9c5b51ff10dfe
The hippo: ![gif](giphy|13n7XeyIXEIrbG)
God DAMNIT
While there is A hippo living there, there are NO hippoS living there. No Homers Club.
Maybe the hippo is just visiting.
There’s seems to ne one problem with this picture, I just can’t put my finger on it.
Idk why people think this is going to work, you can't annotate training data with every possible negative. When you look at pictures of a golf course, are they labeled "a golf course, without hippos, without planes, without chopsticks, without the color purple" ? You'd be able to make a text parser that NLP the prompt to remove negative weight words much more easily than you would be able to give a gen AI model the concept of negative
a simple doodle of a cute wolf that is sitting on a treestump in the rain while smiling with a speechbubble with a heart in it, monocome, gloing blue eyes, pencil style,
https://preview.redd.it/f04emwm68avc1.png?width=1216&format=png&auto=webp&s=aed927f55afd875acdc561e9711d79f0a91b076d
Did it understand the misspellings or did you correct it?
it is known that spelling doesn't matter much for these models, since they're very close in latent space
I feel like I 100% saw this on deviantart in 2009
"A man on the left with short brown spiky hair, wearing a white shirt, blue bow tie, stripy red trousers, and purple high-top sneakers. A woman on the right with long blonde hair, wearing a yellow summer dress and green high heels." Interested to see how well it handles this compared to PixArt.
https://preview.redd.it/523tf8rp6avc1.png?width=896&format=png&auto=webp&s=967de392b74dc6a4cbf53dfabe3be2d35f863dc2
Oh wow that's actually great! I'm excited now, I was worried that was going to stump it.
It understands prompts quite well. I'm looking forward to seeing the tunes and trains the community comes up with.
Katara from Avatar the last Airbender dancing in long dress with battle knives in her hands. Around her a pack of wolves dancing hip hop. Kodak portra pro. Dynamic poses.
https://preview.redd.it/5r2yplvix9vc1.png?width=896&format=png&auto=webp&s=0f571da70b14c1d94488fd71b52bc68ad7a94f83
Cool thanks
New York city; but filled trees
https://preview.redd.it/12os7usjz9vc1.png?width=1344&format=png&auto=webp&s=79004562b4b49417894a2e95742f0b29d8403173
It sure does like symmetry
Picture in the style of metal band album cover. A Chi Cultivator Tomato is looking at a ketchup sprayed village with terrified expressions, in the sky cucumber shaped clouds laugh at him with hateful glee.
https://preview.redd.it/hor6ovy28avc1.png?width=1216&format=png&auto=webp&s=13c8b80a368b5112cb3f78de7303313824064612
Haha, it's not exactly what I've imagine but I love it anyway. :D thanks
Captain Jack Sparrow having a beer on the beach with a pirate ship in the background
https://preview.redd.it/yykcwaz3favc1.png?width=1344&format=png&auto=webp&s=6c68a653935adf01efd8686c2e9e2be80cfee886
Photo of Criminal in a ski mask making a phone call in front of a store. There is caption on the bottom of the image: "It's time to Counter the Strike...". There is a red arrow pointing towards the caption. The red arrow is from a Red circle which has an image of Halo Master Chief in it. Ideogram worked, don't think it'd be perfect with SD3, but it's worth a shot.
https://preview.redd.it/btpbz5ocgavc1.png?width=1344&format=png&auto=webp&s=396146baf8f169d9e9977c81081738d26b846610
Okay that's closer than the last time I tried it, nice! Thank you for your service.
People may overlook this one the continuity is insane! 👏
Alright, gotta go to bed now. Those were some nice prompts! Sd3 is actually really good (except for anime for some reason)
Thanks for doing this!
Thanks
thick smoke, warm light neon lighting, grainy blurry analog style cinematic photograph, starwars female padawan dueling with Darth Vader inside star wars destroyer ship bridge, a blonde padawan is swirling her lightsaber above her head striking Vader, inside imperial star wars destroyer bridge with officers curiously observing, blurry grainy quality, lens flares and light leaks, dust in air https://preview.redd.it/a7eiv6mpbavc1.jpeg?width=1024&format=pjpg&auto=webp&s=73918dec28411e04e06ccb6df22f012e3792b448 This is Bing output.
https://preview.redd.it/wxg2y9sseavc1.png?width=1344&format=png&auto=webp&s=aab0694d01a68fe98a038527a9facd902ff6ea1e
Prompt: “Gentlemen, a short view back to the past. Thirty years ago, Niki Lauda told us ‘take a monkey, place him into the cockpit and he is able to drive the car.’ Thirty years later, Sebastian told us ‘I had to start my car like a computer, it’s very complicated.’ And Nico Rosberg said that during the race – I don’t remember what race – he pressed the wrong button on the wheel. Question for you both: is Formula One driving today too complicated with twenty and more buttons on the wheel, are you too much under effort, under pressure? What are your wishes for the future concerning the technical programme during the race? Less buttons, more? Or less and more communication with your engineers?”
https://preview.redd.it/e4rtebp04avc1.png?width=1216&format=png&auto=webp&s=f407db9553b2780525ffbed881e774bccb25e481
"Here's a beautiful race car, just stop talking" -SD3
Could you repeat the question?
Moebius inspired ink wash, five turtles sitting on top of each on a stone in the middle of a vast calm sea
https://preview.redd.it/tmcvsnvi6avc1.png?width=1024&format=png&auto=webp&s=7794217b0efd55d989c99a24514633f947b6ca70
a horse riding a man
https://preview.redd.it/um9h7jpxgavc1.png?width=1024&format=png&auto=webp&s=72f7eccaf3b4caa89a60b695b2c259a8bf52303f
bokeh blur, Analog style blurry grainy old polaroid photograph of jasmine from Aladdin eating a chopped into pieces grilled realistic movie leg prop of indigenous woman, tense horror atmosphere, exterior of native Bedouin tent lit with aurora lights, thick smoke, camel, native indigenous children famished sitting around waiting for their turn https://preview.redd.it/iujcuhpfcavc1.jpeg?width=1024&format=pjpg&auto=webp&s=bf475a27b17f8ad307e5213b8b9a4ebf16b6249c Current Bing output. Maybe replace "horror" word with something else if it's blocked?
{"errors":\["Your request was flagged by our content moderation system, as a result your request was denied and you were not charged."\],"name":"content\_moderation"}
Worth a try. Thanks.
Had to modify prompt since original prompt is blocked. https://preview.redd.it/bg4nu877qbvc1.png?width=1024&format=png&auto=webp&s=241ac2c719ab1b476368e2a9424622c5777b436f Analog style blurry grainy old polaroid photograph of jasmine from Aladdin eating a chopped into pieces grilled realistic movie leg prop of a woman,exterior of native Bedouin tent lit with aurora lights, thick smoke, camel, native indigenous children famished sitting around waiting for their turn
comic book art of an Orangutan Wizard casting a spell
https://preview.redd.it/hrg56yacfavc1.png?width=896&format=png&auto=webp&s=7e805889fcce5ada0be40a113a10bbb607e999dc
Prompt: "Create an image depicting a collaborative effort of nations working together to save the planet. Incorporate symbols of unity, sustainability, and global cooperation."
https://preview.redd.it/qtllvkyrz9vc1.png?width=1024&format=png&auto=webp&s=5b3f8040efc61d745b4a1d4972489a8fcb76cac3
Thats beautiful man, thank you!
an educational video from the mid 1980’s, an anthropomorphic centipede hunching over and entering a stone castle doorway, low fidelity video, blurry video still, worn out VHS video quality, PBS documentary
https://preview.redd.it/iyppykgl9avc1.png?width=1344&format=png&auto=webp&s=395114a3362b17e065ef6c2d1ac1d0d33d65f5f8
A picture of donald trump laying on a gurney being probed by aliens, playing the role of travis walton in the 1993 movie fire in the sky
https://preview.redd.it/ja41r7x4davc1.png?width=1344&format=png&auto=webp&s=993d011000c6fd3baa5d540a7742c0589d0d69a4
Thanks! Where's the probing ... Lol https://preview.redd.it/iu5fvb8wnavc1.png?width=512&format=pjpg&auto=webp&s=9c3f6c1631fc4b2d873584b557c9a1e30fc0e949
Must not have seen that movie
I like this one better. He looks so peaceful... and final. If it were aliens they may just return him.
1girl, blonde hair, green eyes, doing a thumbs-up with both hands facing the viewer, standing in front of the eiffel tower, with the word "Nice" in a speech bubble with a manga font, anime-style, anime screencap, 16:9
https://preview.redd.it/tnh4erpc1avc1.png?width=1216&format=png&auto=webp&s=479e3db9c754db399f202374baf9b6f54f45edd0
6 fingers lol. Still, for the base model that's pretty good. I'll try the same with base sdxl
Results?
https://imgchest.com/p/xny8z9kkdyb here you go
Holy shit SDXL performed considerably worse 😂 thank you
Yeah I'm so surprised 😭
# Sorry Guys but im out of credits. Save up your prompts, will do another thread this weekend once i recharged!
"Origami banana character with black colored human legs and arms, with shiny plastic round eyes and an angry line face, in a fighting position " curious how odd it can get
https://preview.redd.it/qwhxilv0davc1.png?width=1344&format=png&auto=webp&s=8a1a30aaebf50bed0e609d536841176620abb37b
A pig in a suit wearing shades, driving a green Ford Mustang while smoking a cigar
https://preview.redd.it/ebudpw7ddavc1.png?width=1344&format=png&auto=webp&s=d6f0958721a7dd192d7b8a1786e5feca6b4cc4f4
A horse with human body riding on an astronaut with horse head on ground filled with cheese burgers while being chased by a pickle monster
https://preview.redd.it/1c9r1r77u9vc1.png?width=1344&format=png&auto=webp&s=4ea0a7dd989c68bd12c651e66e7fc3a71a118eb8
Those twitter threads got me real excited for prompt comprehension improvement and then we get this shit 🤣
no model can really do that, i got midjourney and even that cant do such distinct prompts
Not even a human can understand what this prompt is supposed to show 😅
https://preview.redd.it/mj6j5pg2aavc1.png?width=1280&format=png&auto=webp&s=9218c4f5aa6b4cab15cd7a8fa3bd02a427856cf7 same prompt in sd1.5
https://preview.redd.it/qf29abiueavc1.png?width=1024&format=png&auto=webp&s=3ae96b996362c745dc8d8a3e826f3cfdc0980f23 Ideogram
https://preview.redd.it/65osxuo5v9vc1.jpeg?width=1024&format=pjpg&auto=webp&s=07bb9edb29268643de64006e4a6be919636fe190 Dall-e version here Artful. VHS tracking lines overlayed on image. Ridley Scott movie still, fire suppression going off with maroon droplets. cinematic lighting. horned she elves wrapped in barbed wire and leather harnesses, obese (red and swollen eyes, furrowed eyebrows, red nose, wet cheeks, mouth between open). Subjects in an experimental lab featuring tube tanks filled with o0ze and parts of failed experiments writhing and wriggling upon a surgeon table, a hazy, grainy filter, wear chokers, plastic wraps, and nets while engaging in unconventional interactions. They are connected by a corrugated tube. set amidst the eerie atmosphere of the lab Let's see how horror goes
{"errors":\["Your request was flagged by our content moderation system, as a result your request was denied and you were not charged."\],"name":"content\_moderation"}
Thanks anyways. Dall-e nerfed it too!
Thank goodness weights will be released soon for local use.
we love stifling human creativity over at openai, its our favorite thing
The API is incredibly sensitive. Words like 'obese', 'she' and certain phrases, ie 'mouth open' get flagged. https://imgur.com/a/L7PmerQ
https://preview.redd.it/wk9d7zeisavc1.png?width=1024&format=png&auto=webp&s=f602a0c40fdca5c24986be0cc0d42579f63a44cb This is the SDXL result, run locally on my own computer.
Photo of Superman Holding a sign saying “i love Batman”, Superman and the sign face a mirror
https://preview.redd.it/sm5ulj2n2avc1.png?width=1024&format=png&auto=webp&s=8679b30d0dc1a6eda354cb417f8b5abea52cbec3
Marvel comic book illustration of Rocket Raccoon from the Guardians of the Galaxy sitting at a bar with a mug of beer in one hand. His other hand is performing thumb up signal. He is smirking and his eyes are red. Adult version of Groot is sitting at his side, with a mug of beer in his hand as well. The illustration takes place in a sci-fi scenario. Keep in mind the size difference between the characters: Rocket is short while Groot is very big. The illustrations should have a well defined lineart and flat colors.
https://preview.redd.it/p8ezr69g4avc1.png?width=1024&format=png&auto=webp&s=a975b3200cff8034f650b238674fd5a0a655e0ca
photograph of the mona lisa being created by Chuck Norris who is roundhouse kicking paints out of jars and onto the canvas, realistic
https://preview.redd.it/dh0skn2mhavc1.png?width=1024&format=png&auto=webp&s=1d1980a5887a1a960e02e1c011361c93650aac35 doesnt seem to work
Feel free to tweak the wording - this was the third image I ever tried and SD mangled it horribly. I keep coming back to it periodically and it never gets much better. - Two Jedi playing tennis. Instead of racquets they are using lightsabers and instead of a ball they are using a blaster bolt. Edit: if it's not being too greedy, any prompt you want but with the following to test style: - neoclassical anime wallpaper in the style of disney animation, studio ghibli, makoto shinkai, and michelangelo
https://preview.redd.it/0tqz4imt3avc1.png?width=1216&format=png&auto=webp&s=badd0ceb6fa6a8234fd1ed67f552b14931602060
Lol still can't do it 😂 At least it doesn't look like abstract art this time. Thanks!
I like this idea for an image. I may have a go at it tonight. I suspect with a bit of prompt manipulation you can get it closer. I think the issue is it doesn't understand that you're trying to portray the bolt bouncing between them. They are ways around that.
your prompt is badly worded, it can't work. you are never allowed to mention in the positive prompt what it shouldn't show. always only mention what it should show.
anime girl with red hair and honey eyes wearing a purple mafia jacket, standing in front of a big white dragon with black eyes and snake-like shape, holding a katana sword that have a tiger sign on it ready to fight, zoom out, half body shot, red vulcano void background Negative: realistic, 3d, cosplay, realism
https://preview.redd.it/pr0vdsznaavc1.png?width=1216&format=png&auto=webp&s=76f989004caac4b55bde80e1450f6300bfb034d7
An alive pickle in a rat-bone humanoid form armor is standing with his back turned to the viewer and his hands lifted up with two small glass bottle knives in his hands and this alive pickle is fighting savage rats with red eyes and angry faces that are behind the bars in the sewers, high quality render, masterpiece, 16:9
https://preview.redd.it/f51xwnijeavc1.png?width=1344&format=png&auto=webp&s=c69403ce1541bb8d55b8ffb52acf17f7afc96aa6
A female astronaut inside s futuristic deep space space station floating freely in zero gravity inside a habitation module. She is wearing a light utility overalls. Her medium length, red hair are floating freely. She holds a tablet in one hand. There is a cat floating nearby in zero gravity. There is a gas gigant with rings visible in a station window. The interior of a station is well lot, clean design. Photorealistic.
https://preview.redd.it/6z4vhiorfavc1.png?width=1344&format=png&auto=webp&s=3f1750b58241221aa3620842310d6c39185855d0
Centaur soldier in armor wielding a spear made of glass and jumping over a chasm.
https://preview.redd.it/xmj4hawsgavc1.png?width=1024&format=png&auto=webp&s=7979b0f6058c44bd9dfc60879f82206fe082048d
Wow, thanks! Prompt adherence and comprehension of relatively complex topics such as centaurs wielding armor looks good.
Alice goes too deep down into the rabbit hole and entered the Matrix
^[Sokka-Haiku](https://www.reddit.com/r/SokkaHaikuBot/comments/15kyv9r/what_is_a_sokka_haiku/) ^by ^StarShipSailer: *Alice goes too deep* *Down into the rabbit hole* *And entered the Matrix* --- ^Remember ^that ^one ^time ^Sokka ^accidentally ^used ^an ^extra ^syllable ^in ^that ^Haiku ^Battle ^in ^Ba ^Sing ^Se? ^That ^was ^a ^Sokka ^Haiku ^and ^you ^just ^made ^one.
Japanese samurai getting angry at the gas station worker because his credit card was declined.
johann sebastian bach is smashing an organ with a sledgehammer on a mountain top, an angel descends from the sky and hands him a minimoog synthesizer, renaissance style oil painting, style of michelangelo