Support NeoGAF

Tumle · Sep 2, 2022

Unknown Soldier said:
RTX specifically is not required. Any Nvidia card which can run CUDA, which is basically all of them for the past decade, can run this.

However your limit is VRAM. You need a lot. My RTX 3090 with 24 GB of VRAM throws errors trying to render 1024x1024, though it looks like for a lot of people the sweet spot is 704x512.

i can run 512x512 on my RTX 3070... the increase in VRAM needed must be very steep.
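A rough back-of-the-envelope sketch of why the memory curve is so steep. It assumes the Stable Diffusion v1 layout (a latent 1/8 the pixel size, 8 attention heads, fp16 values); the function name and defaults are illustrative only, and real usage depends heavily on the implementation and any memory optimizations it applies.

```python
def attention_scores_bytes(width, height, heads=8, bytes_per_value=2, downsample=8):
    """Estimate the size of one self-attention score matrix at latent resolution.

    Assumptions (not from the thread): the latent is 1/8 of the pixel size,
    there are 8 attention heads, and each value takes 2 bytes (fp16).
    """
    tokens = (width // downsample) * (height // downsample)
    # Every latent position attends to every other one, hence tokens * tokens.
    return heads * tokens * tokens * bytes_per_value


for w, h in [(512, 512), (704, 512), (1024, 1024)]:
    gib = attention_scores_bytes(w, h) / 2**30
    print(f"{w}x{h}: {gib:.2f} GiB for one attention score matrix")
```

Under these assumptions, doubling both sides quadruples the pixel count but multiplies this one term by 16 (0.25 GiB at 512x512 versus 4 GiB at 1024x1024), which would be consistent with 512x512 fitting on a 3070 while 1024x1024 errors out on a 3090.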

Tumle · Sep 2, 2022

The_hunter said:
This is interesting and scary, as I thought art would never be a field that would be automated by machines. In my opinion the best application for this seems to be landscapes for concept art. I'm curious to know how this works, if it's just "remixing" artwork or creating new pieces.

Not sure exactly how it works... but it's not just remixing pictures. i'll see if i can find the video i saw about it on YouTube.

Mikado · Sep 2, 2022

It just refuses to put an entire character in the frame

(That's probably merciful, since it doesn't do a great job with faces in this hand-drawn style anyway)

Honestly? I'm not sure the stable diffusion method will really lead to consistently presentable AI-generated art for things like character design.
Not that such a thing won't be achieved, but I think it will take a different approach. SD just feels like a bit of a local maximum at the moment.

But it's great for breaking through a creative block and getting a bunch of ideas to riff on manually: "I like the helmet from this one, and the boots from this one."

Edit: Actually the biggest breakthrough is being able to run it locally with no cost-per-generation or other limits.

Like, I can just leave it running on a spare machine for hours. It generates a plausible concept every 12 seconds. In the time it would take to work up one design to a semi-polished state, hundreds or even thousands more sketches would be available. This isn't true for the web versions.
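A quick sketch of the arithmetic behind "hundreds or even thousands", using only the 12-seconds-per-image figure from the post. The function name and the overnight run length are illustrative assumptions.

```python
def images_per_hour(seconds_per_image):
    """Number of images an unattended machine produces in one hour."""
    return 3600 / seconds_per_image


rate = images_per_hour(12)
for hours in (1, 4, 8):  # hypothetical run lengths
    print(f"{hours} h unattended: about {rate * hours:.0f} images")
```

At one image every 12 seconds that is 300 per hour, so an 8-hour run lands in the low thousands.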

Mikado · Sep 3, 2022

Tumle said:
Also see people use exclamation marks and parentheses.. do they have any effect on the outcome?

I tried this a bit. It seems like it sort of changes the weights on the tokens - and produces different images for that same seed (with otherwise the same prompt). But I couldn't correlate those changes to any markup I made around specific words (like if I said something like `face!!!!!!!!!!` it's still not any more likely to have the face in the frame, much less emphasized. Instead, like, a boot will be a different pattern).

Shadowplay1979 · Sep 3, 2022

Try Midjourney, it blows all these programs away and is terrifying... here's some of what I've done on it.

I see Adobe integrating this into Photoshop so artists can quickly iterate their own ideas into things.

Mikado · Sep 3, 2022

Shadowplay1979 said:
Try Midjourney, it blows all these programs away and is terrifying... here's some of what I've done on it.

I see Adobe integrating this into Photoshop so artists can quickly iterate their own ideas into things.

(Personal rant time)

Midjourney and all the "Other People's Computers" services definitely make some great results, but I have several problems with them from my own perspective (which is why I've been sleeping on this whole field so far):

- I hate "cloud" services in general, but especially for art tools.
- Pay-per-iteration is an incredibly stupid concept, especially for a stochastic process like this. (I think you can pay $600 for some sort of subscription to MJ that might get you more tokens or something? Not sure, because screw that. Also, the UI frontend is a Discord channel?)
- As a principle, I very much don't like the "Tools Dictate Their Usage" policy. Someone else gets to decide what you are allowed to create. Today maybe it's tits. Tomorrow maybe it's the wrong politics.

I have no expectation that Adobe's (inevitable) implementation will be any different since they already don't stop harassing people about making sure they save everything online instead of locally.

Being able to run your own tools, on your own machine, with a model of your own choosing (yes, someone else probably built the model but you can choose which one you integrate), is a Big Deal to me.

Some people's Hill To Die On is, like, paying full-price for Day One Cosmetic DLC. For me, it's non-local art tools. It is what it is.

Shadowplay1979 · Sep 3, 2022

Mikado said:
(Personal rant time)

Midjourney and all the "Other People's Computers" services definitely make some great results, but I have several problems with them from my own perspective (which is why I've been sleeping on this whole field so far):

- I hate "cloud" services in general, but especially for art tools.
- Pay-per-iteration is an incredibly stupid concept, especially for a stochastic process like this. (I think you can pay $600 for some sort of subscription to MJ that might get you more tokens or something? Not sure, because screw that. Also, the UI frontend is a Discord channel?)
- As a principle, I very much don't like the "Tools Dictate Their Usage" policy. Someone else gets to decide what you are allowed to create. Today maybe it's tits. Tomorrow maybe it's the wrong politics.

I have no expectation that Adobe's (inevitable) implementation will be any different since they already don't stop harassing people about making sure they save everything online instead of locally.

Being able to run your own tools, on your own machine, with a model of your own choosing (yes, someone else probably built the model but you can choose which one you integrate), is a Big Deal to me.

Some people's Hill To Die On is, like, paying full-price for Day One Cosmetic DLC. For me, it's non-local art tools. It is what it is.

Oh dude, I'm right there with you. I don't like the idea of paying for this and would rather it be on my own PC. Right now, though, the tools I tried here suck compared to MJ. I'm hoping that changes. Fortunately MJ is actually cheap... the $600 is only for large AAA-type studios. It's $10/month for 200 images or $30 for unlimited.

As an industry artist... yeah, this kinda sucks for my concept artist friends. I'm a 3D artist though, and I welcome lots of AI help with the more tedious things (Substance tools, for instance), but yeah, eventually AI will be coming for me also. Though it will take a little longer.

There will always be a need for both types of artists in some capacity though. Remember, AI only pulls from the available data and input it's given... it's mimicking a happy woman, not knowing what one is. You can't paint in the style of Michelangelo if there was no Michelangelo to pull from.

That being said, it's basically the early stages of the holodeck... and I recall people still learning to play music, paint and perform plays even with a holodeck.

01011001 · Sep 3, 2022

Tumle said:
and it's not just for porn, so please share

I have the feeling we will see a bunch of AI generated hentai very soon tho

Mikado · Sep 3, 2022

01011001 said:
I have the feeling we will see a bunch of AI generated hentai very soon tho

"I dunno man, I just can't get it up anymore if the chick has less than 3 arms."

Northeastmonk · Sep 3, 2022

Lol whatever I did gave me nightmares. Thanks everyone

EviLore · Sep 3, 2022

portrait of nick offerman, d & d, wet, shiny, fantasy, intricate, elegant, highly detailed, digital painting, artstation, concept art, smooth, sharp focus, illustration, art by artgerm and greg rutkowski and alphonse mucha

lexica.art

Mikado · Sep 3, 2022

Shadowplay1979 said:
As an industry artist... yeah, this kinda sucks for my concept artist friends. I'm a 3D artist though, and I welcome lots of AI help with the more tedious things (Substance tools, for instance), but yeah, eventually AI will be coming for me also. Though it will take a little longer.

There will always be a need for both types of artists in some capacity though. Remember, AI only pulls from the available data and input it's given... it's mimicking a happy woman, not knowing what one is. You can't paint in the style of Michelangelo if there was no Michelangelo to pull from.

Right now (for professionals), I think it would mostly be useful as a modern Deck of Oblique Strategies. Speaking for myself, there is a tendency to fall back on familiar pattern-language (3 Holes in a Triangular Formation! 45 Degree Panel Lines! That X-Shaped Indentation!). Maybe there's some value to be had in "Just Letting it Happen" and possibly coming up with something that one wouldn't have thought of on their own under the same time constraints, then providing a layer of "intelligence" by repainting or re-combining items from the collection of "found objects".

Of course I'm finding that this process has a pattern-language of its own and certainly, as you mentioned, it's mostly just regurgitating elements learned from existing visual treatments. It's effectively collage and not creation. Will be interesting to see where it goes.

EviLore · Sep 3, 2022

hyperrealistic portrait of high detail amber heard as a bee queen in ornate black robe yellow swan feathers as the mistress in fear being chased horror. by jeremy mann, fantasy art, photo realistic, dynamic lighting, artstation, poster, volumetric lighting, 4 k, award winning

lexica.art

Elsa from frozen portrait of Scarlett Johansson, au naturel, hyper detailed, digital art, trending in artstation, cinematic lighting, studio quality, smooth render, unreal engine 5 rendered, octane rendered, art style by klimt and nixeu and ian sprigger and wlop and krenz cushart

lexica.art

scarlett johansson as delirium from sandman, ( hallucinating colorful soap bubbles ), by jeremy mann, by sandra chevrier, by jamie hewlett and richard avedon, punk rock, tank girl, high detailed, 8 k

lexica.art

Grildon Tundy · Sep 3, 2022

My prompt: "Super Mario as the Hunter from Bloodborne, in the style of el greco, late renaissance, hyper realistic, hyper detailed, trending on artstation"

This is wild. My RTX 3080 can generate one picture per 17 seconds. So much fun and the possibilities seem limitless.

"Super Mario jumping on a realistic turtle." Turtle is not pleased.

"Dynamic concept art of Max Payne wearing a leather jacket, neo noir, rainy streets, cyberpunk, neotokyo, stunning lighting, highly detailed, realistic, 4k"

Edit: sidenote: using myself as a case study, there is still artistry involved when using this program. I can put mash-up/remix prompts in, but I'm not getting the kind of outputs that would win an art contest. In that way, I think it might change WHAT an artist is, but won't necessarily change WHO the artists are, if that makes sense. But it's very fun to play with as a layperson!

01011001 · Sep 3, 2022

Mikado said:
"I dunno man, I just can't get it up anymore if the chick has less than 3 arms."

3 arms and a wolf tail... and I bet that's someone's fetish xD

Mikado · Sep 3, 2022

I'd have watched whatever OVA these were designs for, back in the 90's.

I wish we could get a list of the elements this thing pulled in to make the images so we could check out the original works. Some of these are way too clean to not be literal rips of something that exists.

Mikado · Sep 3, 2022

Also, for hilarity - if you leave blank lines in the GUI's prompt box it will just generate..... something... with no input. Some of the results are outstanding:

Wildebeest · Sep 3, 2022

Mikado said:
It just refuses to put an entire character in the frame

I tried some magic words to get characters in frame, and used a subject where it didn't matter that the face looked like a dog's breakfast. Seemed to work.

"Greenskin Half Orc warrior brooding in an attractive way, by Akim Kaliberda, pathfinder character art digital art trending on artstation HQ"

macfoshizzle · Sep 3, 2022

Can someone TLDR this for me. So this is all generated by AI in the form of text and scripting?

Mikado · Sep 3, 2022

macfoshizzle said:
Can someone TLDR this for me. So this is all generated by AI in the form of text and scripting?

Paraphrasing and simplifying here, but this approach sort of works as follows:

It starts with a field of noise. Then, through a series of iterative steps, it tries to reverse the noise back into an image by selecting from a group of input functions (themselves created by processing a large collection of input images with associated keywords, mostly scraped off the internet), weighted by the user's prompt keywords. If you don't run enough iterations it just produces chaos.
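A toy sketch of the "start from noise and refine over many steps" idea. This is not the actual Stable Diffusion sampler: the `target` array stands in for whatever the trained network would steer toward for a given prompt, and the fixed `rate` is an arbitrary assumption. It only illustrates why too few iterations leave mostly noise.

```python
import numpy as np


def toy_denoise(target, steps, seed=0, rate=0.2):
    """Start from Gaussian noise and move a fixed fraction toward `target` each step."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(target.shape)  # the initial field of noise
    for _ in range(steps):
        # A real sampler asks a neural network which noise to remove;
        # this toy version simply nudges x toward a known answer.
        x = x + rate * (target - x)
    return x


# Hypothetical 8x8 "clean image" used purely for illustration.
target = np.linspace(-1.0, 1.0, 64).reshape(8, 8)
for steps in (1, 2, 4, 5, 10, 50):
    err = np.abs(toy_denoise(target, steps) - target).mean()
    print(f"{steps:>2} iterations: mean error {err:.4f}")
```

In this toy the remaining error shrinks by the same factor every step, so one or two iterations are still close to pure noise while fifty are effectively converged.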

Prompt:
close up face shot of a beautiful young woman, cute face, tacticool, sci-fi, cables, digital illustration, line art by yoji shinkawa and masamune shirow

1 Iteration:

2 Iterations:

4 Iterations

5 Iterations:

10 Iterations:

50 Iterations:

The entire approach is strongly influenced by the breadth and quality of the source data used to train the "model".

Edit: You can see some of the locality at work by slightly changing the prompt and keeping the same seed. I just added the word sunglasses to the above prompt and got this:

You can see that while the overall shape of the image (determined largely by the first few iterations) is the same, the addition of the sunglasses element clearly brought in a different input for a lot of nearby face details, so the mouth and chin structure ended up completely different despite having no different guidance there. Also the tops of the frames are.... weird, probably where it's slightly indecisive w.r.t. the sunglasses being a better match for the original dark eyes.
Screwing with the weighting of the sunglasses keyword by calling it sunglasses!!!!!!!!! seems to influence the weighting, but that doesn't mean it makes better sunglasses (because computers don't know anything about the elements they're bringing in). It apparently just increases the weighting of image functions that contain the sunglasses keyword (which itself doesn't mean anything, because the keyword->function association is only as good as the original tagging for the source images).

Disturbing results ensue:

Overall, it's a neat trick, but it's not magic.
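A small sketch of why keeping the seed fixed keeps the overall composition: the seed only determines the starting noise, so the same seed gives the same starting point no matter what the prompt says. The 4x64x64 shape assumes the Stable Diffusion v1 latent for a 512x512 image; `starting_noise` is a hypothetical helper, and NumPy stands in for whatever random generator the GUI actually uses.

```python
import numpy as np


def starting_noise(seed, shape=(4, 64, 64)):
    """Return the initial noise field for a given seed (shape is an assumption)."""
    return np.random.default_rng(seed).standard_normal(shape)


a = starting_noise(1234)
b = starting_noise(1234)  # same seed, e.g. the prompt with "sunglasses" added
c = starting_noise(1235)  # different seed

print("same seed, identical noise:", np.array_equal(a, b))
print("different seed, identical noise:", np.array_equal(a, c))
```

The prompt never enters this function, which is why editing one keyword changes the details but not the broad layout.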

Haint · Sep 3, 2022

So what exactly uses so much VRAM with this?

Droxcy · Sep 3, 2022

The_hunter said:
This is interesting and scary, as I thought art would never be a field that would be automated by machines. In my opinion the best application for this seems to be landscapes for concept art. I'm curious to know how this works, if it's just "remixing" artwork or creating new pieces.

It's a good starting point for level design & character design. I'll generate some prompts for an art direction or feel I want and then go from there. But replacing real workers? It won't happen.

macfoshizzle · Sep 3, 2022

Mikado said:
Paraphrasing and simplifying here, but this approach sort of works as follows:

It starts with a field of noise. Then, through a series of iterative steps, it tries to reverse the noise back into an image by selecting from a group of input functions (themselves created by processing a large collection of input images with associated keywords, mostly scraped off the internet), weighted by the user's prompt keywords. If you don't run enough iterations it just produces chaos.

Prompt:
close up face shot of a beautiful young woman, cute face, tacticool, sci-fi, cables, digital illustration, line art by yoji shinkawa and masamune shirow

1 Iteration:

2 Iterations:

4 Iterations

5 Iterations:

10 Iterations:

50 Iterations:

The entire approach is strongly influenced by the breadth and quality of the source data used to train the "model".

Edit: You can see some of the locality at work by slightly changing the prompt and keeping the same seed. I just added the word sunglasses to the above prompt and got this:

You can see that while the overall shape of the image (determined largely by the first few iterations) is the same, the addition of the sunglasses element clearly brought in a different input for a lot of nearby face details, so the mouth and chin structure ended up completely different despite having no different guidance there. Also the tops of the frames are.... weird, probably where it's slightly indecisive w.r.t. the sunglasses being a better match for the original dark eyes.
Screwing with the weighting of the sunglasses keyword by calling it sunglasses!!!!!!!!! seems to influence the weighting, but that doesn't mean it makes better sunglasses (because computers don't know anything about the elements they're bringing in). It apparently just increases the weighting of image functions that contain the sunglasses keyword (which itself doesn't mean anything, because the keyword->function association is only as good as the original tagging for the source images).

Disturbing results ensue:

Overall, it's a neat trick, but it's not magic.

Pretty fucking cool

Grildon Tundy · Sep 4, 2022

Mikado said:
Paraphrasing and simplifying here, but this approach sort of works as follows:

It starts with a field of noise. Then, through a series of iterative steps, it tries to reverse the noise back into an image by selecting from a group of input functions (themselves created by processing a large collection of input images with associated keywords, mostly scraped off the internet), weighted by the user's prompt keywords. If you don't run enough iterations it just produces chaos.

Prompt:
close up face shot of a beautiful young woman, cute face, tacticool, sci-fi, cables, digital illustration, line art by yoji shinkawa and masamune shirow

1 Iteration:

2 Iterations:

4 Iterations

5 Iterations:

10 Iterations:

50 Iterations:

The entire approach is strongly influenced by the breadth and quality of the source data used to train the "model".

Edit: You can see some of the locality at work by slightly changing the prompt and keeping the same seed. I just added the word sunglasses to the above prompt and got this:

You can see that while the overall shape of the image (determined largely by the first few iterations) is the same, the addition of the sunglasses element clearly brought in a different input for a lot of nearby face details, so the mouth and chin structure ended up completely different despite having no different guidance there. Also the tops of the frames are.... weird, probably where it's slightly indecisive w.r.t. the sunglasses being a better match for the original dark eyes.
Screwing with the weighting of the sunglasses keyword by calling it sunglasses!!!!!!!!! seems to influence the weighting, but that doesn't mean it makes better sunglasses (because computers don't know anything about the elements they're bringing in). It apparently just increases the weighting of image functions that contain the sunglasses keyword (which itself doesn't mean anything, because the keyword->function association is only as good as the original tagging for the source images).

Disturbing results ensue:

Overall, it's a neat trick, but it's not magic.

That's really helpful to understand how it's working. Thank you.

Tumle · Sep 4, 2022

Shadowplay1979 said:
Try Midjourney, it blows all these programs away and is terrifying... here's some of what I've done on it.

I see Adobe integrating this into Photoshop so artists can quickly iterate their own ideas into things.

Midjourney is implementing Stable Diffusion in a beta right now

Also not a fan of having to use Discord to generate..
they all have their strengths and weaknesses

Pegasus Actual · Sep 4, 2022

Boaty McBoatface 2: The Boatening:

Tempted to keep generating these till the heat death of the universe.

Tumle · Sep 4, 2022

Pegasus Actual said:
Boaty McBoatface 2: The Boatening:

Tempted to keep generating these till the heat death of the universe.

Poooooooo!!

Makoto-Yuki · Sep 4, 2022

i just tried to install this but i can't get it working.

i downloaded "anaconda" and extracted the github repo into it. i had to download something else but i can't figure out where the fuck to download it. i signed up to the site but i don't see any download link. how hard is it for websites these days to give you a simple download link?

anyone able to help me out?

edit: nevermind. i never read the OP properly lol. i was trying to do it all from the github page.

EviLore · Sep 4, 2022

EviLore · Sep 4, 2022

It's all amazing.

Tumle · Sep 4, 2022

nightmare-slain said:
i just tried to install this but i can't get it working.

i downloaded "anaconda" and extracted the github repo into it. i had to download something else but i can't figure out where the fuck to download it. i signed up to the site but i don't see any download link. how hard is it for websites these days to give you a simple download link?

anyone able to help me out?

edit: nevermind. i never read the OP properly lol. i was trying to do it all from the github page.

Yea you shouldn’t have to do all the GitHub stuff, just download and execute

Makoto-Yuki · Sep 4, 2022

Tumle said:
Yea you shouldn’t have to do all the GitHub stuff, just download and execute

i've got it working now

wait until you see my artwork!

Makoto-Yuki · Sep 4, 2022

EviLore said:
It's all amazing.

are you using the GUI ? what are your settings?

i don't know if i should change steps, v scale, or sample prompts

EviLore · Sep 4, 2022

nightmare-slain said:
are you using the GUI ? what are your settings?

i don't know if i should change steps, v scale, or prompts

I only have access to my MBP at the moment, so I'm just looking through the entries on Lexica and sharing interesting results for now.

Makoto-Yuki · Sep 4, 2022

EviLore said:
I only have access to my MBP at the moment, so I'm just looking through the entries on Lexica and sharing interesting results for now.

ok thanks. i'm using that site for inspiration for now.

Pakoe · Sep 4, 2022

Amazing results.

Makoto-Yuki · Sep 4, 2022

been playing about with it. this is the best i got

Makoto-Yuki · Sep 4, 2022

Pakoe said:
Amazing results.

how are you getting such good results? what are your prompts?

Makoto-Yuki · Sep 4, 2022

these are cool

Pakoe · Sep 4, 2022

nightmare-slain said:
how are you getting such good results? what are your prompts?

old city! on fire on a rainy battlefield. by daniel f. gerhartz and matt stewart, fantasy, photorealistic, octane render, unreal engine, dynamic lighting, beautiful, perfect factions, trending on artstation, poster, volumetric lighting, very detailed faces, 4 k, award winning

Bragr · Sep 4, 2022

Not a good time to be a 3D designer at a game company.

Makoto-Yuki · Sep 4, 2022

Pakoe said:
old city! on fire on a rainy battlefield. by daniel f. gerhartz and matt stewart, fantasy, photorealistic, octane render, unreal engine, dynamic lighting, beautiful, perfect factions, trending on artstation, poster, volumetric lighting, very detailed faces, 4 k, award winning

what's with the exclamation marks? what do they do?

and what's the "by *artists names*"? does this connect online to find them?

Pakoe · Sep 4, 2022

nightmare-slain said:
what's with the exclamation marks? what do they do?

and what's the "by *artists names*"? does this connect online to find them?

I'm no expert, let that be clear lol.
I've read that the marks put an emphasis on the word, but I haven't completely tested it yet.
As for the artist, I think it tries to emulate their style.

Tumle · Sep 4, 2022

nightmare-slain said:
what's with the exclamation marks? what do they do?

and what's the "by *artists names*"? does this connect online to find them?

No, it doesn’t connect online at all. It’s all stored in the trained model. i tested diffusing a prompt without being connected to the internet and it still worked

Mikado · Sep 4, 2022

Pegasus Actual said:
Boaty McBoatface 2: The Boatening:

Tempted to keep generating these till the heat death of the universe.

I'm not convinced that this algorithm is going to have that much effect on professional art creation.
But career meme-crafters are going to have to start seriously looking for new jobs.

Tumle · Sep 5, 2022

Great video showing how Stable Diffusion changes an image when you vary the prompt and settings on the same seed.

Ironbunny · Sep 5, 2022

Crazy to think how far this will go. Can't wait for a gif/webm version of this with added animation.

Makoto-Yuki · Sep 5, 2022

not mine but found on lexica. these are amazing. i don't know how people are getting such good results!

edit: this is the best i've got so far lol i think it's kinda cool. the program isn't working anymore for me (something about cuda memory?) so that's my fun over.

Grildon Tundy · Sep 5, 2022

nightmare-slain said:
the program isn't working anymore for me (something about cuda memory?) so that's my fun over.

Might be a dumb question, but did you start getting an error after changing some of your settings? I recall that upping the resolution or steps can cause memory issues.

Edit:

Tumle · Sep 5, 2022

nightmare-slain said:
not mine but found on lexica. these are amazing. i don't know how people are getting such good results!

edit: this is the best i've got so far lol i think it's kinda cool. the program isn't working anymore for me (something about cuda memory?) so that's my fun over.

Sounds like it doesn’t clear the memory on your graphics card after your prompts..
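If the memory really isn't being released, a minimal sketch of freeing it by hand might look like this. It assumes the install is PyTorch-based and that you are running the Python scripts yourself rather than a packaged GUI (neither is confirmed in the thread); `free_cuda_memory` is a hypothetical helper name.

```python
import gc


def free_cuda_memory():
    """Release cached GPU memory.

    Returns the bytes still allocated by live tensors, or None when
    PyTorch or a CUDA device is not available.
    """
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    gc.collect()              # drop unreachable Python objects that still hold tensors
    torch.cuda.empty_cache()  # hand PyTorch's cached, unused blocks back to the driver
    return torch.cuda.memory_allocated()


print(free_cuda_memory())
```

This only returns memory nobody is using any more. If a single image at the chosen resolution doesn't fit, lowering the resolution or simply restarting the program is the more likely fix.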

Oh and try copying their prompts about details and settings, but insert your scene prompts.
Also try the seed number they used and any other settings they have changed

Get Stable diffusion locally on your PC (RTX card needed) no restrictions and no censorship.
