• Welcome to Westlake Publishing Forums.
 

News:

    REGARDING MEMBERSHIP ON THIS FORUM: Due to spam, our server has disabled the forum software to gain membership. The only way to become a new member is for you to send me a private e-mail with your preferred screen name (we prefer you use your real name, or some variant there-of), and email adress you would like to have associated with the account.  -- Send the information to:  Russ at finescalerr@msn.com

Main Menu

AI for model generation

Started by Lawrence@NZFinescale, February 17, 2026, 02:12:23 PM

Previous topic - Next topic

lab-dad

Excellent progress!
That Irish worker is a good figure.
I can hear him now!
MJinTN

Stuart

Here's something I thought I would try.

I have a small adjustable figure like what a sculptor or artist might use when trying to work on a particular pose for an art piece.  I placed the figure in a pose that I though might be difficult for AI to understand with text alone.  I took a couple photos of the posed figure, sent them to ChatGPT, and placed a simple description of what I wanted. This is what I asked....

           Using these two figures as a reference, create a male figure dressed as a mechanic with a wrench in his hand as if making an adjustment to a piece of machinery. Place a cap on his head.  The time period is in the 1920's. Make the background white.


Figure 1.jpg
Figure 2.jpg

This was the result.

Kneeling mechanic.png

This process might prove useful when a particular pose is wanted and is difficult to describe or for AI to understand fully.

Stuart

Stuart

#32
Using the same descriptive input to ChatGPT as the first image, this second image was created.

Mechanic.png

Stuart

Lawrence@NZFinescale


Quote from: Stuart on February 27, 2026, 02:40:35 PMHere's something I thought I would try.

I have a small adjustable figure like what a sculptor or artist might use when trying to work on a particular pose for an art piece.  I placed the figure in a pose that I though might be difficult for AI to understand with text alone.  I took a couple photos of the posed figure, sent them to ChatGPT, and placed a simple description of what I wanted. This is what I asked....

           Using these two figures as a reference, create a male figure dressed as a mechanic with a wrench in his hand as if making an adjustment to a piece of machinery. Place a cap on his head.  The time period is in the 1920's. Make the background white.


Figure 1.jpg
Figure 2.jpg

This was the result.

Kneeling mechanic.png

This process might prove useful when a particular pose is wanted and is difficult to describe or for AI to understand fully.

Stuart

Cool.

One of the great things about the technology is that there are many ways to interact with it - and you can do what works for you.

A great tip: ChatGPT 'drifts' a lot as it doesn't act on the entire conversation BUT you can ask it to lock in and save a workflow to a key phrase for future use.  That way you get a repeatable process.  I have one to generate 4 views of a figure based on photo input and parameters for expression, pose, body type etc.  This gives me one step figure generation in a consistent format.  That figure can then be tweaked to suit.  No skill required to develop such a thing - just ask the AI to do it.
Cheers,

Lawrence in NZ
nzfinescale.com

lab-dad

That next "experiment" is excellent!!!

I got my neighbor to print the lathe operator file.
He printed twice and fingers were lost.
Hopefully I can fix/repair.
FWIW it was a filament printer.
Hopefully get him painted soon
MJinTN

Rail and Tie

Quote from: Stuart on February 27, 2026, 02:40:35 PMHere's something I thought I would try.

...

This process might prove useful when a particular pose is wanted and is difficult to describe or for AI to understand fully.

Stuart

Brilliant!  Bloody Brilliant!!
Cheers!
Darryl

"Leonard, check it out. I've bought an N Gauge locomotive. Half the size of HO. Look...it fits in my mouth!"

http://www.interactionhobbies.com
http://www.facebook.com/railandtie

Lawrence@NZFinescale

Quote from: Stuart on February 27, 2026, 02:53:11 PMUsing the same descriptive input to ChatGPT as the first image, this second image was created.


Stuart, your images show both the strengths and weaknesses of these tools.  The same input generates different output.  That's very handy when it comes to generating similar, but individual figures.  On the other hand if you are trying to refine the same figure it can be extremely frustrating. ChatGPT calls this 'drift' and it seems it doesn't use the image as such, but some sort of shorthand. As I understand it, from what the AI described, it analyses the image and prepares a 'text' description of what it sees and then works from that.  Every interpretation of the image might produce subtly different text and indeed every interpretation of the text produces a different result. Thus unless you some how lock and label a result, it's impossible to back up the bus if things go awry.

For instance, the AIs seem to favour US style headgear, which is inappropriate for much of the world historically.  If you get the perfect figure with inappropriate headgear it can be difficult to change the hat, without changing the whole figure a bit. Sometimes that really doesn't matter, but other times it sure does.

Clearly ChatGPT doesn't know the critical features of a spanner either :-)
Cheers,

Lawrence in NZ
nzfinescale.com

Stuart

Lawrence, yes I understand what you are saying. I too have been frustrated as I have tried to rework an image where parts of it were satisfactory and other parts not so. Try as I will, AI would not retain the portions I liked but would rework the image altogether usually with poorer results. I am not familiar with how to lock an image.

Stuart

Lawrence@NZFinescale

Quote from: Stuart on February 28, 2026, 08:52:22 PMLawrence, yes I understand what you are saying. I too have been frustrated as I have tried to rework an image where parts of it were satisfactory and other parts not so. Try as I will, AI would not retain the portions I liked but would rework the image altogether usually with poorer results. I am not familiar with how to lock an image.

Stuart

Bearing in mind I don't really know what I'm talking about here, and I'm paraphrasing ChatGPT's responses when I've asked it similar questions:

The basic principle is to ask ChatGPT. After doing a few figures along a similar workflow I asked ChatGPT how I could formalise that workflow and it did that - Extremely well I thought, with a level of refinement and flexibility that I hadn't considered.  Then I made a few changes, introduced drift again and it all went pear-shaped.  But the principle worked very well.  The problem is that if you want to make a change to something 'locked' you need to be VERY specific. While this is a bit frustrating, ChatGPT will rework the whole thing in a few seconds if things go awry so it's not the calamity it would be in the real world (I haven't got back to my workflow yet, but I'm very relaxed I'll be able to quickly get to a result). So my advice is to ask the question above of ChatGPT and have it generate an environment that suits you.

I don't think its a fully resolvable problem though.  We tend to think of the process as a road.  If you take a wrong turn, you back up and take a different route.  However ChatGPT is more like a river.  You can back up, but meanwhile everything has changed a little.  As noted above it seems that you cannot return or lock an image as such - only to the AI's interpretation of that image.  Locking it avoids reinterpretation of the original image, but not reinterpretation of the AI summary.

A bit of discipline can help too I suspect.  I think it's a mistake to chat with digressions as one would in person.  If you want to digress, start a new 'chat' and come back to the original focussed thread when done.  I confess I haven't often followed this advice very well.

I'm quite familiar with this sort of problem as it's not unlike many spousal conversations that I have.  It's the crux of AI I suppose.  It needs to be flexible to be useful, but not so flexible as to be useless.

Cheers,

Lawrence in NZ
nzfinescale.com

finescalerr

I have tried "talking" to AI to better understand how it works. I've experimented with both Gemini and CoPilot and can confirm Lawrence's comments. Current versions of AI may retain all the data leading up to a given point but, when you point to an error, they only focus on the more recent details and may bluff their way through the rest. That would account for what Lawrence has experienced. Articles I read suggest noticeable improvement may occur within one to five years. -- Russ

Lawrence@NZFinescale

Quote from: finescalerr on March 01, 2026, 12:03:10 PMArticles I read suggest noticeable improvement may occur within one to five years. -- Russ

Undoubtedly, it's not hard to catch them lying.  As I noted elsewhere, the overall process was impossible 4-6 months ago, borderline 2 months ago, and stunning today.

It's possible to get very tied up in debates about whether AI is 'good', whether it is actually 'intelligent' and so on.  I prefer to ask 'Can this new tool improve my enjoyment of model-making?'.  Personally I feel that it has.   
Cheers,

Lawrence in NZ
nzfinescale.com

finescalerr

Embrace technology but don't abuse it. -- Russ

lab-dad

FWIW my 1/16 lathe operator.
I only have access to a filament printer.
At high res the layers are very obvious.
It is adequate...
MJinTN

Bill Gill

Marty,
 I've read that a light wash of acetone can help smooth out layer lines on filament prints. No idea what type of filament or if that is accurate at all.

lab-dad

Thanks Bill!
I have a spare (like Harry) 🤣 so I will experiment.
MJinTN