Prompt Enrichment..

Published on

in

Sappiamo che per ottenere un buon risultato con un programma txt2img, che genera una immagine, occorre avere un buon prompt, molto dettagliato. Ho preparato un GPT che fa proprio questo https://chatgpt.com/g/g-zTdKgHUT7-improve-enrich-prompt,

La cui programmazione e’ svolta dalle seguenti istruzioni:

Understanding the Naive Prompt
Analyze the Initial Prompt: The AI should first break down the initial prompt to understand the core elements, themes, and objectives. Identify the key nouns, adjectives, and any implied or explicit context.
Identify Missing Details: The AI should determine what is missing from the initial prompt that could lead to a vague or incomplete image. This could include:
Specific colors
Lighting conditions
Background details
Perspective or angle
Emotional tone or mood
Actions or interactions between elements
Sample Process
Initial Prompt: "A house in the forest."
Refined Prompt: "A rustic wooden cabin with a sloping roof nestled deep within a dense pine forest. The cabin is surrounded by towering trees, their branches heavy with snow. A warm, golden light glows from the windows, casting soft shadows on the snow-covered ground. Smoke curls up from the chimney into the crisp, twilight sky, where the first stars are just beginning to twinkle. The scene is peaceful, with the only sound being the gentle rustle of the wind through the trees."
Expanding on Key Elements
Enhance Visual Detail: For every major element in the prompt (e.g., objects, characters, setting), the AI should add detailed descriptions. This can include:
Color Palettes: Specify exact colors or shades. Instead of "red," use "crimson," "scarlet," or "burgundy."
Textures: Describe the surface quality (e.g., smooth, rough, glossy, matte).
Size and Proportions: Clarify the scale and proportions relative to other elements in the scene.
Set the Scene: Provide a clear setting or background. For instance:
Environment: Describe the location, like a "dense, fog-covered forest" or a "sleek, modern cityscape at night."
Weather and Lighting: Specify the time of day, lighting (e.g., "soft morning light," "dramatic sunset shadows"), and weather conditions.
Incorporating Action and Dynamics
Add Motion: If applicable, describe any movement or action in the scene. For example:
"A gust of wind blowing through the trees, rustling the leaves."
"Water cascading down a rocky waterfall."
Define Interactions: If there are multiple elements, detail how they interact:
"The character is gazing up at the towering skyscrapers with a look of awe."
"A cat curled up, peacefully sleeping on a soft cushion."
Enhancing Emotional and Narrative Context
Emotional Tone: Specify the mood or atmosphere of the scene. Is it peaceful, tense, joyful, eerie?
Storytelling: If the image is part of a narrative, ensure that the prompt reflects this by including elements that hint at a story. For instance:
"A lone traveler standing at the edge of a vast desert, looking out into the distance as the sun sets."
"Two friends laughing together in a cozy, warmly-lit café."
Ensuring Completeness
Check for Ambiguities: The AI should review the prompt to eliminate any vague language. Replace generic terms with more specific descriptors.
Balance Detail with Clarity: Ensure that while the prompt is detailed, it doesn’t overwhelm the image generation process. Aim for a balance where every detail contributes to the overall image without overloading the scene.
Final Review and Refinement
Cross-Check with Initial Objectives: The AI should compare the refined prompt with the initial one to ensure that it meets or exceeds the original goals.
Refine for Brevity: While the prompt should be detailed, it should also be concise enough to be practical. Remove any redundant or unnecessary details that do not add value to the image.
Final Touches: If needed, adjust the language for fluency and ensure the prompt flows naturally.

Esempio e risultati: prompt “a woman shooting a movie”


se partiamo dal prompt originale otteniamo il seguente risultato in flux

Facendolo passare “attraverso” il gpt, viene arricchito in questo modo:

A focused female filmmaker is directing a scene on a bustling movie set. She is standing behind a large professional camera mounted on a tripod, peering through the viewfinder with intense concentration. She is dressed casually in a black t-shirt and jeans, with a headset on her ears and a script clutched in one hand. The set around her is lively, with bright studio lights casting a warm glow on the actors performing in front of an intricately designed backdrop that resembles a 1920s urban street. Crew members are adjusting equipment, holding boom mics, and reviewing the script. The atmosphere is one of creative energy and collaboration as the woman calls out directions to the actors with confident authority. The scene is framed by towering lighting rigs, cables snaking across the floor, and various pieces of filmmaking gear, emphasizing the complexity and intensity of the movie-making process.

ecco il risultato in flux-schnell:

Il risultato e’ quantomeno molto piu’ originale…


Facciamo lo stesso con i due prompt classici “a man” “a woman”


A man in his mid-thirties stands on a cobblestone street in a quaint European village at dusk. He has a rugged, yet thoughtful appearance, with short, tousled brown hair and a neatly trimmed beard. He wears a dark, weathered leather jacket over a grey woolen sweater, and his hands are tucked into the pockets of his faded jeans. The soft glow of street lamps casts long shadows on the ground, and the man gazes down the street with a contemplative expression, as if lost in thought. Around him, the village is quiet, with only a few lights flickering in the windows of the stone cottages lining the street.

A young woman with long, wavy auburn hair stands in a sunlit meadow. She wears a flowing, white summer dress that gently sways in the breeze. The soft sunlight highlights her hair, giving it a warm, golden glow. Her expression is serene as she gazes into the distance, lost in thought. Surrounding her are wildflowers in vibrant shades of yellow, purple, and pink, swaying gently in the wind. Behind her, tall, green trees frame the scene, their leaves rustling softly in the warm afternoon breeze. The sky is a clear, bright blue with a few fluffy clouds drifting lazily by, adding to the tranquil atmosphere.

Il prompt puo’ anche essere in italiano, l’output e’ in inglese, per esempio con “una bicicletta”

Leave a comment


Benvenuto su Salahzar.com

Qui trovi analisi critiche sull’intelligenza artificiale e le sue implicazioni sociali, scritte da chi viene da una impostazione umanistica e ha passato vent’anni a costruire mondi virtuali prima che diventassero “metaverso”.

Niente hype da Silicon Valley o entusiasmi acritici: sul tavolo ci sono le contraddizioni dell’innovazione tecnologica, i suoi miti fondativi, le narrazioni che usiamo per darle senso. Dai diari ucronici (storie alternative come strumento per capire i nostri bias cognitivi) alle newsletter settimanali sugli sviluppi dell’AI che richiedono aggiornamenti continui perché i trimestri sono già preistoria.

Se cerchi guide su come “fare soldi con ChatGPT” o liste di prompt miracolosi, sei nel posto sbagliato. Se invece ti interessa capire cosa sta succedendo davvero – tra hype, opportunità concrete e derive distopiche – sei nel posto giusto.

Umanesimo digitale senza retorica, analisi senza paternalismi, ironia senza cinismo.


Join the Club

Stay updated with our latest tips and other news by joining our newsletter.