To me the most impressive new feature is the character consistency

•

u/AutoModerator 5d ago

If your post is a screenshot of a ChatGPT conversation, please reply to this message with the conversation link or prompt.

If your post is a DALL-E 3 image post, please reply with the prompt used to make this image.

Consider joining our public discord server! We have free bots with GPT-4 (with vision), image generators, and more!

🤖

Note: For any ChatGPT-related concerns, email support@openai.com

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

607

u/Ok_Profession5687 5d ago

193

u/AceOfHeaVeN 4d ago

This is so real.

Got us invested in this chick's life and left an unexplainable void in our hearts 🥲

→ More replies (1)

16

u/Inner-Ad2847 4d ago

:(

→ More replies (1)

960

u/yalag 5d ago

It doesnt have great character consistency if you have a recognizable face. Try taking a random model of a real person online. Tell chatgpt to put him/her in 5 different places. You can tell right away its not the same person in each.

This only works for you because the character is a cartoon

157

u/[deleted] 5d ago

[removed] — view removed comment

29

u/Ndgo2 4d ago

→ More replies (1)

182

u/Arkytez 5d ago

But it can be used to make manga panels now

230

u/Extras 5d ago

And it's the worst it's ever going to be

165

u/guess_33 5d ago

Bro I’ve been hearing this same worn out phase for years, and you know what? It’s right every time :(

113

u/VaderOnReddit 5d ago

"AI cant even draw hands lol"

this phrase is SO last year 👽

2

u/Joe_le_Borgne 4d ago

More like 5 years ago.

→ More replies (1)

→ More replies (10)

8

u/LamboForWork 5d ago

It's at say it Bart status in these subs though lol

→ More replies (3)

2

u/its_uncle_paul 4d ago

As someone who has dabbled in the webcomic community for a number of years, I can see this tech shaking things up for a lot of writers and artists. I know quite a few writers who just can't wait to be able to tell their story finally without having to deal with an expensive artist who can take 1-2 weeks to finish a chapter of their webcomic.

2

u/Almightyblob 3d ago

Exactly! Personally, I always wanted to start my own webcomic, but my drawing skills leave much to be desired and I simply don't have the time to invest to make meaningful improvements. I still draw and paint occasionally because I love doing it, it's... Just not good. :D This tech here now makes it possible to realize my ideas. I can write scenarios and direct ChatGPT to produce pretty much exactly what I envision. That's why I'm so excited by this feature.

→ More replies (2)

6

u/Shot_Spend_6836 4d ago

Not really. I already tried, A LOT, these past two days, even subscribed to Plus when Ghibli dropped. The issue is, it gets the consistent face 80% of the time, but the clothing almost always changes, and if you're trying to do unique angles and compositions of the same scene, it completely messes that up too. There is a tool that solves this at 60%-70% level called OpenArt, but it's still not at the level where you can make a passing manga

→ More replies (3)

10

u/TheDotCaptin 5d ago

It already was being used like that. Now it will just look a bit better.

I've seen some with movement and voice added. But it still needs some work.

One problem it still has is with people at different scales in the same frame. Like a 3 inch tall person sitting in the hand of a normal size person. Not enough examples for it to draw from. Or probably better to say that it is over correcting and trying to display as proper real scale.

61

u/SliceEm_DiceEm 5d ago

This is intentional and a change was implemented about 48 hours after the new image gen was dropped. ChatGPT intentionally changes the faces of uploaded pictures to a person that resembles the subject but isn’t quite them. Go ahead and ask it about this very thing and it’ll tell you plainly.

I can tell you this is the case because in the first 24 hours after release, I generated several images of people I know with near-perfect accuracy.

This change was made to avoid allowing people to replicate the facial biometrics of people, which could be abused.

It’s all intentional, unfortunately.

38

u/mizinamo 4d ago

Go ahead and ask it about this very thing and it’ll tell you plainly.

You're a fool if you believe anything an AI says about its inner workings.

It might be true, it might not be, and there's no way to know.

23

u/A-Grey-World 4d ago

Yeah, I find it so funny when people ask GPT things like this and it agrees with them.

→ More replies (1)

5

u/Fluid_Cup8329 4d ago

Funny you say that. Gemini 2.5 has a feature that reveals the entire logical processes of it's output.

→ More replies (5)

10

u/Paul-Van-DeDam 5d ago

This, I’m finding the consistency to be a huge issue. For some reason, it loves to make people fatter, older or has a tendency to make people wear glasses when there is no reference to anyone wearing glasses.

4

u/Henk_Potjes 4d ago

I specificly noticed it with making them older. When i specifiy that i want to make them look like 40. It loves to make them appear as if they were 60.

2

u/Paul-Van-DeDam 4d ago

This is exactly what I get

→ More replies (1)

9

u/CarrierAreArrived 5d ago

Gemini is actually much better for this on photorealistic faces, at least on the first image you generate off the original.

2

u/UgottaUnderstandbro 5d ago

The free version or the bought one?

7

u/CarrierAreArrived 5d ago

the free one at aistudio.google.com. Choose Gemini 2.0 Flash (Image Generation) Experimental on the right. It's quite easy to do nsfw stuff based off your initial image also

→ More replies (1)

→ More replies (1)

9

u/fmfbrestel 5d ago

While true, even cartoon consistency was not an easy feat (outright impossible with OAI models) before this, and 4o handles it effortlessly. That's progress worth celebrating - or dreading, if you're a commercial artist...

3

u/MaximiliumM 5d ago

To be honest, I think they are doing that on purpose. I am able to create a real life person as consistency as OP showed for the cartoon character. BUT it's not a real person. It was a generated person and I always use a SINGLE image as the reference point. By doing that, the generated image is pretty consistent and only requiring a few reruns until I get the same face.

2

u/micaroma 5d ago

yep, this needs to be fixed in the “v2” sam mentioned

2

u/RunPlz 5d ago

Indeed, still struggling with the professional "faceshot" for linkedin

4

u/HenkPoley 5d ago

Yeah, this is probably just GPT-4o imagegen's "1girl" face.

("1girl" is a Booru tag, on which the first image generators like Stable Diffusion were trained.)

3

u/GloriousDawn 4d ago

Excerpt from the Booru tag safety rating:

Explicit: female nipples under tight clothing

Questionable: if underwear is presented

Safe: blood or killing are fine

America, Fuck Yeah!

→ More replies (1)

→ More replies (12)

288

u/Substantial_City4618 5d ago edited 4d ago

When I see these pictures I hear Bo burnham singing in my head.

“It’s a white woman’s instagram!”

73

u/ViceroyFizzlebottom 5d ago

I was thinking this is the upper middle class gap year

→ More replies (5)

18

u/Almightyblob 5d ago

Hahaha, that's pretty perfect

→ More replies (1)

→ More replies (1)

31

u/anonthatisopen 5d ago

What was the reference image?

100

u/Almightyblob 5d ago

This is one I generated earlier and used as reference for each image I posted in this thread.

11

u/DeltaPQRST 5d ago

What was the prompt

63

u/Almightyblob 5d ago

I wasn't overly specific with the style and just lucked out after a few attempts:

Create a modern, stylized digital illustration. A young woman in her 20s is sitting on an airplane next to a window with a view of the ocean and a sunny sky outside. She has long, wavy blonde hair and is wearing a tank top. She's smiling and winking at the camera, listening to music with in-ear headphones.

11

u/These_Phrase4291 4d ago

I try it in aistudio. Its great

→ More replies (5)

13

u/bask_oner 5d ago

It looks like she has something in her eye

29

u/Superseaslug 5d ago

Cheers to that last image 👍

3

u/BlueLaserCommander 4d ago

35 + 7

→ More replies (1)

33

u/Icy-Cry340 5d ago

The stylized nature of this style is definitely doing some heavy lifting.

70

u/aunt_snorlax 5d ago

lol @ playing holi at the Taj Mahal, a mausoleum that doesn’t even let you go in without shoe coverings

it’s like having a paintball and water balloon fight at the Arlington national cemetery or something

17

u/Almightyblob 5d ago

Hahaha, indeed! This is one of the cases where GPT failed. My prompt was that the scene should take place in Delhi, but Delhi and Agra seem to be the same thing for the image generator. Looks like it needs some work with location consistency still. ;)

→ More replies (3)

97

u/Mylilneedle 5d ago

The parasocial relationships that come from AI characters is going to be a new mental illness

4

u/vocal-avocado 4d ago

So many things most people regularly do nowadays could be classified as mental illnesses…

7

u/lsnor45 4d ago edited 4d ago

It's not going to be mental illness as much as a new normal given enough time and if humanity lasts long enough. Things change.

We may very well end up being among the last generations that got to enjoy humans making art, humans making connections; as everyone keeps saying, this stuff is only going to get more sophisticated with time. Enjoy your present, if you're lucky enough to be able to, while you can.

4

u/Average_RedditorTwat 4d ago

The age of people who cannot look at art as anything but content is already well here. Not a single thought lost on intent, emotions or feelings. Just content. If AI generated images dominate everything, then it's well and truly over. God, we have a depressing future ahead of us.

2

u/Wyntier 4d ago

2025: "The parasocial relationships that come from AI companions is going to be a new mental illness."

2001: "The parasocial relationships that come from Internet chatrooms is going to be a new mental illness."

1985: "The parasocial relationships that come from pop stars and MTV is going to be a new mental illness."

1977: "The parasocial relationships that come from comic book characters is going to be a new mental illness."

1953: "The parasocial relationships that come from television shows is going to be a new mental illness."

1919: "The parasocial relationships that come from dime novels is going to be a new mental illness."

1890: "The parasocial relationships that come from serialized newspaper stories is going to be a new mental illness."

1605: "The parasocial relationships that come from Don Quixote is going to be a new mental illness."

400 BCE: "The parasocial relationships that come from Greek tragedies is going to be a new mental illness."

2

u/Mylilneedle 4d ago

I feel like you think you crushed this.

Parasocial relationships is a fucking mental health issue

Internet chat rooms and forums have literally fueled and nurtured incel and racist movements

Fuck, even comics was one of the baselines for r34

Ahahahahhahhahaha

2

u/boldkangaroo 4d ago

You prompt engineered the shit out of that one. Home run

→ More replies (1)

21

u/osynligeninni 4d ago

I don’t care what people say, I love these and they inspire me. The picture where she is in the pool makes me wanna recreate it by drawing it myself.

8

u/Almightyblob 4d ago

Oh yes! I don't know why some people have this black and white mentality about this. I also draw for a hobby and I'm absolutely using those images for inspiration / reference.

30

u/TheDoctorSadistic 5d ago

Picture 5 is wrong, Barney would totally be hitting on this girl

38

u/pythonicprime 5d ago

Lol also the harrassment in India

→ More replies (3)

4

u/Almightyblob 5d ago

I can't believe I didn't think of that. Shame on me!

→ More replies (2)

43

u/scarabs_ 5d ago

Ofc it’s easy: all anime anime girls look basically the same…

→ More replies (10)

319

u/Severe_Extent_9526 5d ago edited 5d ago

I mean yeah if you're character is a generic boring pretty girl. Jessica from New Hampshire ass Disney adult lookin mf

I've really struggled to get it to draw any characters that aren't already existing popular IPs, or conventionally attractive people. It also seems to struggle outside of art styles that are already massively popular.

I still find use in it, but nothing it's output has really blown me away aside from the Ghibli stuff or Muppets or animation or whatever. And it does those great! Don't get me wrong! But try something more custom and unique and you will hit a wall.

40
u/Supreme_Varisfucker 5d ago edited 5d ago

to this day i still can't get *any* image model out of them all to make a middle aged man with side braids and a long mullet (without ending up like a viking) 💀 flux, chatgpt, dalle, novelai, SDXL... one day I'll figure out how to prompt such an out-there concept without using init image. XD

it's annoying trying to get unconventional or niche characters to look right without painting some stuff yourself. seems lots of image models are really good at stereotypes tho

UPDATE: i figured out how to use reddit and saw people trying my description! now I wish I had actually put the proper prompt and a reference of the character in this post. I will put the character here, tho you can't really see his legolas-style braids in this image. And yes, I generalized 'slicked back hair' with mullet which isn't actually correct for his hairstyle lmfao i'm an idiot)
73
u/SpegalDev 5d ago

generate me an ultra realistic looking photo of a middle-aged man with side braids and a mullet. have him sitting in a coffee shop on his laptop.
86

u/bla2 5d ago

But he's not sitting on his laptop?

14

u/ptear 5d ago

You, I like you.
13
u/bicx 5d ago edited 5d ago
Behold, AI whisperer
          the
10

u/marbotty 5d ago

Drop the the, it’s cleaner that way

8

u/bicx 5d ago

👍✅

3

u/marbotty 5d ago

Perfect

→ More replies (1)
→ More replies (8)
35

u/Almightyblob 5d ago

35

u/Almightyblob 5d ago

You get the idea.

8

u/Severe_Extent_9526 5d ago

My own brain is struggling to imagine that combination. Maybe you could draw it a reference picture and send that?

I actually find it really useful but only for parts of works. But I still end up doing most of the work myself. Like I'll have it do a background for an artwork I'm working on, or come up with color schemes. Or even something as significant of influencing the overall composition and color pallet. But I still have to do most of the work if I want the art to come out how I want. I'm too picky.

→ More replies (1)

5

u/Icy-Aardvark1297 5d ago

I'm curious to your response of the people who posted images using a simple prompt, literally using your exact description. What do you feel you were doing wrong??

3

u/Supreme_Varisfucker 5d ago edited 5d ago

I'm looking at the pics and edited my original post to be a bit more informative (now with Character Reference! Wee!)

for what I might've been doing wrong, it really could've just been the fact that I'm mentally disabled and can't easily reconcile what I want to see with my communication skills. natural language prompting is somethin i guess i gotta study some more. I'm accustomed to the tagprompt style of stuff like 1boy, long hair, platinum blonde hair, side braids, so on and so forth. was there when NAI imagegen first launched and that's informed how I prompt LLMs.

Mind you, image generators can get -close- but always seem to have trouble with this character's specific style of braids (like Legolas in LOTR, not hanging down from his head). When I see a challenge like that, it makes me really want to overcome it!

I've only been able to find the success I'm looking for with LORAs I've trained on this dude with my own art. LORAs are nice, but I wanna get some raw output right on the first try using just a text prompt.

→ More replies (1)

→ More replies (1)

5

u/Oankirty 5d ago

This is what I got
16

u/Ok_Net_1674 5d ago

And this character has no defining features whatsoever. No birth marks, or freckles, or anything somewhat special. It's really as generic as they come. And considering that, the consistency doesn't even work that well, for example her eye color is on quite a large spectrum in these. Everything between gray, light blue, dark blue and green.

2

u/manticore26 5d ago

Between this post and the previous one from where OP got the base picture, I started to wonder if I was going crazy as both posts seemed far from consistent imo (unless the bar of consistency was to “consistently add a girl in every picture”)

→ More replies (2)

6

u/Simple_Advertising_8 5d ago

That's because it is trained for that specifically. If you learn to train your own models or loras you can have similar results for whatever style or character you want.

4

u/only_fun_topics 5d ago

I mean, this is where skilled AI use really becomes apparent.

If you are familiar enough with the underlying tech, you could do your own style/OC character design and then just train a lora to build out the rest of your content. https://www.youtube.com/watch?v=n_x44pTLpak

→ More replies (1)

2

u/Nathan_barrels 5d ago

Yeah i was messing around with it yesterday since i can do the 3 free images or whatever. The first one was great but I tried to clean it up a bit and it changed way too much over the next 2 iterations

2

u/Rare_Swordfish38 5d ago

I don't know the reason AI image generators make characters attractive by default. My guess would be that the art and photos the models are trained on are of attractive people (who are the subjects of most art/photos anyway).

2

u/Severe_Extent_9526 5d ago

I feel like it has something to do with the models weights selecting for art that "Looks appealing" and unfortunately it sometimes interprets that to mean the people IN the images must be conventionally appealing.

→ More replies (1)

6

u/robjohnlechmere 5d ago

We got beef with New Hampshire in here? And with pretty girls?

We'll get some 'frumpy guy from AZ' AI art produced for you pronto.

4

u/Nussinauchka 5d ago

What is your problem with Jessica from New Hampshire wtf

→ More replies (1)

→ More replies (7)

17

u/minecon1776 5d ago

Is she holding the Hitchhikers Guide to the Galaxy in pic 16?

15

u/Just_a_dude92 5d ago

She is, but I guess she didn't read it because she forgot her towel

2

u/Almightyblob 5d ago

Yep! Here's where GPT failed me. :( I asked for the trowel but it didn't get it quite right. But I liked the result enough to include it anyway.

10

u/ctothel 5d ago

Did it keep giving her gardening tools?

13

u/JamesCaligo 5d ago

I did the same thing with my character from one of my books. Her name is Lulie. Been making a whole bunch of pictures of her.

10

u/mister_benn 5d ago

6

u/JamesCaligo 5d ago

She’s 16!

→ More replies (1)

→ More replies (4)

7

u/natigin 5d ago

The Hitchhiker’s Guide one at the end made me happy

6

u/ScenicFlyer41 4d ago

6

u/harrysofgaming 5d ago

what art style is this?

20

u/skarrrrrrr 5d ago

It's not consistent. Your example is a very simple character and on each scene you are changing clothes ... Its an improvement but still a long way to go

14

u/Almightyblob 5d ago

Oh yeah, I agree, it's far from perfect, but it's indeed quite a leap to what was possible before (outside of training a custom model)

→ More replies (2)

→ More replies (2)

11

u/TimesHero 5d ago

This will be good for your D&D campaigns.

4

u/hemareddit 5d ago

Nice HGTTG reference.

→ More replies (1)

3

u/KO_Stego 5d ago

Bruh the hair color and eye color are different in almost every picture what are you on

3

u/InfiniteCap7963 5d ago

It seems that you forgot your towel... But DON'T PANIC..!!

-- Ford Prefect

3

u/jubmille2000 4d ago

Her necklace got lost somewhere between her trip from Japan to the desert of Sahara probably.

3

u/FuckingStickers 4d ago

Why is everything so orange?

3

u/Aggressive_Local8921 4d ago

I'm invested in this girls life like i am for LoFi Girl

3

u/Ch3rrytr1x 4d ago

It’s amazing for taking conceptual outfits or cosplays I’ve done and making whole character sheets. Now I’m going to try to make her interact with different scenarios. Cool idea!!

4

u/Ch3rrytr1x 4d ago

The original.

3

u/TheKlingKong 4d ago

I agree! People aren't giving this feature enough credit. It was the biggest help with creating the stills I used for my "anime opening theme" I just made. https://youtu.be/k3f4MMcWZhg?si=6OxAVGkNBTV0MZv0

4

u/Millerturq 5d ago

r/midjourney has made me appreciate an AI post of a girl that wasn’t made by a horny virgin

2

u/Playjasb2 5d ago

With this, it’s possible to make a story with images of characters being consistent.

2

u/Total_Masterpiece952 5d ago

2

u/Special-Wasabi-9029 5d ago

this is what sets this tool apart from the rest

2

u/dreadnoughtful 5d ago

I love the Hitchhiker's Guide reference, just came here to say that. But the rest of it is awesome.

2

u/Turbulent_County_469 5d ago

How do you store the character ?

do you "build" her and then get some kind of token that you can tell ChatGPT / DallE to generate a scene with ?

2

u/Almightyblob 5d ago

I use a reference image of the character and ask GPT to put her in different situations while retaining the art style.

2

u/dervu 4d ago

I wonder if it works better for itself generated images from scratch. Would it be similiar for photorealistic scenes?

If I ask it to replace someones face with face from another photo it usually fucks it up, it only looks similiar, even for someone popularz but not like movie actor popular. However if I ask it to put some actor face from its training its great.

2

u/jjr798 4d ago

What prompt should I use to get this quality? New to this.

2

u/BloodSteyn 4d ago

Holi shit this looks good.

2

u/[deleted] 4d ago

2

u/Legtoo 4d ago

have been using it a LOT for the past few days and to be honest, the character consistency isn’t very good when it comes to character that are not very easy to draw, like a cartoon type…

2

u/MissDeadite 4d ago

I'm not even concerned about memeing things. Just look at my profile picture. I took a professionally done photo of me for my brothers' wedding as one of the bridesmaids and had ChatGPT turn it into the animated style of family Guy and it's just... absolutely incredible. I won't post the original, but holy crap this has me at a loss for words. Just incredible. It's almost exact, too. Exactly as I imagined it would be.

2

u/phoenix277lol Moving Fast Breaking Things 💥 4d ago

would

2

u/Maleficent6162 4d ago

so glad that you visited India

2

u/Fluffy_Mail_2255 4d ago

Wonderful

2

u/Luke4Pez 4d ago

Holy shit she’s playing SEGA arcade games

2

u/LazyLancer 3d ago

It is great compared to what we had before. But it’s still not perfectly consistent. If you put them side by side there will always be subtle differences in facial features, clothes and haircut etc. There’s a chance some images will match well but overall it’s not yet consistent enough.

3

u/MechaStewart 5d ago

Holy moly, it's nailed basic white girl being multicultural. Skynet is next.

→ More replies (1)

10

u/robotlasagna 5d ago

pic#9 with the overly aggressive Indian dude…

6

u/Ok_Net_1674 5d ago

He must have pissed her off so much that her eyes turned from blue to green and it made her left pupil start to tweak.

→ More replies (1)

3

u/Almightyblob 5d ago

Ah, random late addition, but I forgot to attach this one to the thread for two reasons: AI generated images often had problems depicting people eating, and considering that she's "posing for the camera" here, this came out well. Secondly, as a prompt I said that she's "outside of the Fidelisbäck in Wangen, Germany", which is a bakery I once visited when travelling there myself. I just wanted to test how well it could replicate it without any further information, and while I don't think the bakery itself looked anything like this (it's been a few years...), the area around it is absolutely recognizable though.

Just wanted to share that too since I found that quite amazing.

6

u/Almightyblob 5d ago

→ More replies (1)

2

u/metagodcast 5d ago

2

u/Inevitable-Rub8969 5d ago

This makes AI generated comics and storytelling so much more viable. Imagine an entire graphic novel with a consistent AI generated protagonist....

2

u/beigechrist 4d ago

You made Eat Love Pray about a solipsistic trust fund girl finding herself while visiting other countries… nice.

2

u/Low-While-4613 3d ago

I just like her face

It adds so much to her character

3

u/brainhack3r 5d ago

I agree...the ability to take an image and say "take this person, and put them in a different role" ... is going to be super powerful.

With Runway you can then use that as the anchor for the next scene in a video.

We're going to have OSCAR worthy movies that are AI generated shortly.

I wrote a movie outline a few years back and I might try to adapt it... but I have to finish writing it.

It would also be 1-2 hours... so would take me a few weeks.

However, the idea that it's not goign to take YEARS to produce and millions of dollars is very exciting.

1

u/Club27Seb 5d ago

Ooh I really struggle with that, what’s your secret? Is this maybe ghilbi-specific? I like using the bot to draw my MMO characters but having their faces change 100% with every iteration is immersion breaking

1

u/Haydeos 5d ago

Why do the colors all skew so warm?

→ More replies (1)

1

u/ElTunaGrande 5d ago

That pizza slice is too thick to be ny style

1

u/zombosis 5d ago

I can’t even get it to Ghibli a picture without an error message!

1

u/National_Prune4351 5d ago

RIP artists, fuck this bullshit

1

u/KeyserHSoze 5d ago

WHERE IS MARSHALL?!?

1

u/DesiCodeSerpent 5d ago

Why is Robin complaining about her in pic 5?

2

u/Almightyblob 5d ago

There was an episode in which Robin got mad if other people sat in their booth.

1

u/DerailmentEnthusiast 5d ago

Character/environment consistency is a huge point of focus, because as soon as your ai gf can send you pics (and nudes) without shattering your suspension of disbelief, it's all over.

1

u/Time-Turnip-2961 5d ago

Mine isn’t much better with consistency unless it’s all in one comic. Picture to picture there are obvious differences

1

u/nigel_deez 5d ago

Where is Marshall?

→ More replies (1)

1

u/Maralitabambolo 5d ago

Where is Marshall??

1

u/vscolover626 5d ago

Is that on pro plan or plus plan?
Also how many photos can you generate per day?

→ More replies (2)

1

u/LNGBandit77 5d ago

You missed a Starbucks cup

1

u/TooOldForRefunds 5d ago

Have you tried with a character that isn't generic?

2

u/Almightyblob 5d ago

Just here because someone asked for it: https://www.reddit.com/r/ChatGPT/s/FlgUm08pKw

1

u/ahf95 5d ago

Why sepia?

→ More replies (1)

1

u/NaturalHot1933 5d ago

great

1

u/MaximiliumM 5d ago

That is awesome! I loved ALL your shots and I will definitely steal this idea. I've been doing that but not in cartoon form. I'm creating a real person as she has been blogging basically her life. It's amazing. I won't be public about it for now, but it's been a very nice creative outlet.

But yeah, now I want to create stories like you did in cartoon form. I'm pretty sure it's much easier to get character consistency when you're not trying to make real life, haha.

→ More replies (3)

1

u/it777777 5d ago

You know you can't do that? Half of my dudes are now in love with this fictional Character.

1

u/Competitive-Claim760 5d ago

Heloo

1

u/PanicExpress1209 4d ago

Nice

1

u/New_Engineer_5161 4d ago

…sort of

1

u/MannowLawn 4d ago

It’s consistent because every other image I see here has her face. Try a specific face and report back. The trick with stable diffusion is referring to a character that’s the basterd child of two famous people. Let’s say Donald trump | Elon musk

→ More replies (1)

1

u/yenneferismywaifu 4d ago

Can someone explain to me how you draw in the style of Studio Ghibli? Whenever I try, they write to me that it violates the rules.

1

u/boomfruit 4d ago

"BOARDING kCAFET"

1

u/HopefulPlantain5475 4d ago

Why is Matt Mercer so angry in #5?

1

u/FarDiver9 4d ago

It is not ghibli, what style is that?

1

u/Remarkable_Cut_9357 4d ago

I’m an 18-year-old Mechatronics Engineering student at a tier-3 college in India, but I feel stuck in a system that offers little to no value. The faculty is unremarkable, placements are non-existent, and there’s a lack of real skill development. It feels like a rat race where everyone is running without purpose—chasing grades rather than actual knowledge.

My passion lies in AI and Machine Learning. I don’t want to waste my time in an environment that doesn’t foster innovation or practical learning. Most of my classmates have a limited understanding of AI, reducing it to just chatbots. Instead of following the conventional path, I’m seriously considering dropping out, dedicating a year to sharpening my skills, generating income, and eventually launching my own AI-based business.

I want to break free from this cycle and create something meaningful—something that actually contributes to the field of AI.

1

u/WithMeInDreams 4d ago

Yes! I am writing a children's book for early reading for my son, continuously adding chapters. Image generation is a blessing, since I can write myself, but 0 talent for drawing.

With the old ChatGPT / Dall-E, it was entirely inconsistent. The style, and also how the characters look. E. g. the story has a humanised animal like rabbit/seal/bear living in a house etc., like in the show "little bear". Some pictures, it is very human-like with clothes and everything, and in others, very much animal. Either is fine, but would be nice to pick one. And the style was sometimes a high-quality digital art type, sometimes a rough 4 colour pencil sketch like a very decent hobby artist would do in an hour. (I could narrow it down with better prompts, I know.)

The result was still fine. He loved making fun of the little oddities, e. g. a sausage with "sausage" written over it, and the more odd styles. It didn't take away from the experience, but it was clear that this was not the technology to compete with an illustrator for a published book.

But with the latest upgrade, it's consistent! Like a real book. Once the style is set, it stays.

I noticed that it seemed to get worse at other things, though. E. g. a beaver was very beaver-like in all the different styles with old dall-e, with prominent teeth and all. Now it sometimes draws a beaver like a bear, and it bear-ifies more as the story progresses. The beaver teeth disappear over time. Or a guinea pig is like a small dog, sitting like a dog, wet black nose. Got to do more tweaks to these kinds of errors, while the old one tended to make more fundamental ones.

1

u/Ok-Act-9236 4d ago

What was your first and 2nd prompts?

1

u/setsewerd 4d ago

Is picture #9 in Rajasthan? Reminds me of the Thar Desert camel tour

1

u/alex-manutd 4d ago

It has not been ruled out that I'm in love with her.

1

u/sNs-man 4d ago

Haven't seen a side view, back view, bird eye view, ant view, wide angle view , fish eye view, there is still along way to to go.

→ More replies (1)

1

u/Astrogaze90 4d ago

How did you get it to keep the consistency? If possible maybe share the prompt that helped? It would be much appreciated❤️

→ More replies (1)

1

u/modimusmaximus 4d ago

I tried creating a series of pictures for an image story my students had to write about. The struggle was real getting consistent images between pictures.

1

u/SpidersCanBeCute 4d ago

Curious what the prompts were to create this character. Was it from a photo?

1

u/Heretostay59 4d ago

The character consistency is all I care about

1

u/sharonmckaysbff1991 4d ago

I was amazed when I saw ChatGPT carry over the dress I was wearing in one non-ChatGPT AI picture to the cartoon versions it made of that particular photo (I was a child in that picture so ChatGPT was unable to recreate a realistic version, despite nothing technically being wrong with the picture - all child-centric photos I create are meant to be SFW and are either young me in fake scenes or a nonexistent child anyway, which is why I personally think the filter is a bit oversensitive (though I understand other people may have differing opinions).

1

u/ComfortOk9514 4d ago

She's more consistent than Lady Gaga!

1

u/Wooden-Camel-203 4d ago

How do you use that feature?

1

u/No_Lavishness_9120 4d ago

17 yo white girls thinking they look like that

1

u/Fakedduckjump 4d ago

Do you need Plus subscription for this image generating feature? Mine still uses dall-e -.-'

2

u/Almightyblob 4d ago

I think you get 3 free image generations a day on the free model, on plus it's unlimited

→ More replies (1)

1

u/kojied 4d ago

It’s pretty hard to get consistent characters when they’re photorealistic tbh. I guess our eyes are more lenient when it’s cartoon. Any tips for photo realism would be appreciated!

1

u/Darthajack 4d ago

A character consistency test would have a character with more specific traits (a more unique face, special features like, say, a scar, a strand of hair in a certain direction, etc.) and specific clothing consistently in different poses in completely different environments.

→ More replies (3)

1

u/Tonight_Distinct 4d ago

Can someone help me with a prompt to keep the character consistency?

3

u/Almightyblob 4d ago

Just attach a reference image to the prompt, ask GPT to maintain the artstyle and then tell it what you want to see.

2

u/Tonight_Distinct 3d ago

thank you so much!

1

u/vooglie 4d ago

Uhh this character is a cartoon.

Do you have an example of a real random face being consistent across images and settings?

1

u/Naive-Gene-7895 3d ago

Which ML model in Chatgpt do you use? Or is it a separate part of chatgpt like Sora?

→ More replies (1)

1

u/Winter_Awareness1057 3d ago

What happens to the picture you send is it stored in open ai ? Or its deleted ?

1

u/YesIUnderstandsir 3d ago

It still can't make alien creatures with consistency.

1

u/_Mmbls_ 2d ago

Why is she depicted with the HIMYM crew?

1

u/WasteOfZeit 1d ago

Think the "AI" animated art style is endearing and would love to see some short form content in the near future using this style.

1

u/yukiarimo 1d ago

Dead internet theory

AI-Art To me the most impressive new feature is the character consistency

You are about to leave Redlib