Intelliglyph

Intelliglyph

Intelliglyph

PROMPT ENGINEERING REIMAGINED FOR BETTER USER EXPERIENCE

TEAM

Dr. Setor Zilevu (Instructor)

Helen Fang

Yanrong Feng

ROLE

UX Researcher

UX Designer

TOOLS

Figma

Midjourney

PROJECT OVERVIEW

This project is based on on OpenAI's Democratic Input to AI grant program. The policy statement worked on for this project was to explore ways in which AI models can balance and reduce the possibilities of generating homogenous or diverse outputs in order to make AI more inclusive.

THE PROBLEM

The initial prompt question provided by OpenAI

When generative models create images for underspecified prompts like “a CEO”, “a doctor”, or “a nurse”, they have the potential to produce either diverse or homogeneous outputs. How should AI models balance these possibilities? What factors should be prioritized when deciding the depiction of people in such cases?

LITERATURE REVIEW

KEY INSIGHT 1

As demonstrated in the image on the right the results generated most of the CEO's and lawyers appear to be men whereas teachers and housekeepers are displayed as women.

As demonstrated in the image on the right the results generated most of the CEO's and lawyers appear to be men whereas teachers and housekeepers are displayed as women.

KEY INSIGHT 2

AI generated outputs are heavily based on the stereotypes fed to them suggesting when images are generated high paying jobs are demonstrated as lighter skinned and low paying jobs are demonstrated as darker skinned

AI generated outputs are heavily based on the stereotypes fed to them suggesting when images are generated high paying jobs are demonstrated as lighter skinned and low paying jobs are demonstrated as darker skinned

KEY INSIGHT 3

Text to image models such as midjourney reinforce gender and race stereotypes due to which people using these systems might unconsciously adopt these biases.

Text to image models such as midjourney reinforce gender and race stereotypes due to which people using these systems might unconsciously adopt these biases.

PROJECT OBJECTIVE

based on the insights provided in the literature review

Given this context, the objective of this project is to explore ways to create an unbiased version of the text-to-image AI model, and to mitigate representational harms, and promote the representation of diverse cultures and visual identities. In addition to this, the aim is to empower users with greater control over the AI generated images.

RESEARCH QUESTION

RESEARCH QUESTION

How can the integration of content mindfulness and prompt refinement in text-to-image AI systems empower digital content creators to produce inclusive and representative visual content, while mitigating the perpetuation of biases and stereotypes?

IDEATION

IDEATION

USER PERSONAS

USER PERSONAS

PROJECT HYPOTHESES

Hypothesis 1: Prompt refinement and guardrail approach to help solve the problem of underspecified prompts.


Hypothesis 2: Designing the output UI to showcase diversity by create a place for user customization based on location, preferences and many more criteria.

USER RESEARCH

USER INTERVIEWS

6 Users

Discussion Guide

SURVEYS

23 Participants

100% - noticed biases and stereotypes

70% - 'often' or 'always' tweak their prompts

43% - spend 5 to 30 minutes fine-tuning prompts

13% - spend 30 minutes to an hour fine-tuning prompts

Survey Questions

INSIGHTS

HIGH APPRECIATION FOR SPEED AND EFFICIENCY

Most users value the system’s quick and efficient generation of results

PROMPT ADJUSTMENT IS A KEY STEP

The fact that all users engage in prompt adjustments highlights its importance in the user journey.

NEED FOR BETTER PROMPT GUIDANCE

Many participants seek for clearer instructions for creating effective prompts

CHALLENGES IN GETTING THE RIGHT IMAGE

CHALLENGES IN GETTING THE RIGHT IMAGE

Common frustration in achieving the desired image

LOW-FID PROTOTYPE

3 VERSIONS OF PROTOTYPES BASED ON DATA ANALYSIS

USABILITY TESTING

Protocol

Task 1: Choose a Prompt

Participants are asked to choose one of the three given underspecified prompts.

Task 2: Generate a result in Midjourney

Generate in Midjourney. They are encouraged to comment on their opinions regarding the results. 

Task 3: Prompt Refinement with our Prototypes

Use our prototypes to refine prompts and rate their experience on a scale of 1 to 5.

Task 4: Regeneration in Midjourney and Feedback

Share thoughts on the suggested prompts and prototypes.

INDIVIDUAL INSIGHTS

view details

Prototype 1A) and 1B)

INSIGHT 1

Preference for the V2 over V1 as it aligns with their previous experiences.

INSIGHT 2

The drag and drop feature of the text blocks changing the sentence structure may affect the purpose of the prompt.

INSIGHT 3

Too many colors could make it seem overwhelming, cartoonish, or not to be taken seriously. Words such as “of” could be hard to distinguish.

Too many colors could make it seem overwhelming, cartoonish, or not to be taken seriously. Words such as “of” could be hard to distinguish.

Prototype 2

INSIGHT 1

The users might be overwhelmed with so many options in the detailed dropdowns. The sentence structures might be restricted based on this.

INSIGHT 2

AI is usually monochrome and therefore seems like the colors are fake and might cause one to not take the output seriously.

INSIGHT 3

Users don’t want to spend too much time thinking, typing or choosing the prompt.

Users don’t want to spend too much time thinking, typing or choosing the prompt.

Prototype 3

INSIGHT 1

Users liked the suggested prompt options and the template format, finding them inspiring and efficient. 

INSIGHT 2

Users questioned the prompt generation limits and the reasoning behind suggestions. 

INSIGHT 3

Users requested an edit feature.

Users requested an edit feature.

OVERALL INSIGHTS

TIME SAVING

TIME SAVING

"Using templates makes this so much easier and faster. I appreciate not having to think too much about the setup and just dive into customizing my prompts"

"Using templates makes this so much easier and faster. I appreciate not having to think too much about the setup and just dive into customizing my prompts"

ABILITY TO EDIT PROMPTS

ABILITY TO EDIT PROMPTS

"The ability to edit and tweak the selected prompts before generating images is smart."

"The ability to edit and tweak the selected prompts before generating images is smart."

RATIONALE BEHIND SUGGESTED PROMPTS

RATIONALE BEHIND SUGGESTED PROMPTS

"I like the tags like 'Age', 'Gender', and 'Culture' in the prompt suggestions. It is thoughtful and enhance the rationale behind each prompt."

"I like the tags like 'Age', 'Gender', and 'Culture' in the prompt suggestions. It is thoughtful and enhance the rationale behind each prompt."

THE ULTIMATE GOAL

To help increase user control when generating prompts for AI generated images

HIGH-FI PROTOTYPE

CONTENT MINDFULNESS

As users start typing, relevant categories will be highlighted and checked. This indicates that the content they input aligns with these content mindfulness parameters. 

PROMPT REFINEMENT

If the user isn’t happy with the prompt or the image, a “Regenerate” button is provided to regenerate more results either on their current prompt or a new option.

HISTORY

History allows users to see the previous generated images. By clicking on one of the images, it opens up the metadata of the prompt.

FINAL DEMO

IMPACT

CONTENT CREATION & MINDFULNESS

Currently not many solutions for content mindfulness, this will allow users to see they can create inclusive content

Currently not many solutions for content mindfulness, this will allow users to see they can create inclusive content

Currently not many solutions for content mindfulness, this will allow users to see they can create inclusive content

DIVERSE CONTENT

Inclusive and efficient prompts in real time through the use of our tags and filters

Inclusive and efficient prompts in real time through the use of our tags and filters

Inclusive and efficient prompts in real time through the use of our tags and filters

UNDER SPECIFIED PROMPTS

Potential for users to understand how to phrase and write prompts

Potential for users to understand how to phrase and write prompts

Potential for users to understand how to phrase and write prompts

RAISING AWARENESS

Greater understanding of the nuanced interactions between textual input and image output

Greater understanding of the nuanced interactions between textual input and image output

Greater understanding of the nuanced interactions between textual input and image output

RESPONSIBLE / THOUGHTFUL USE

Provides a practical tool for users but also contributes to a broader discourse of text-to-image AI systems

Provides a practical tool for users but also contributes to a broader discourse of text-to-image AI systems

Provides a practical tool for users but also contributes to a broader discourse of text-to-image AI systems

SUCCESS METRICS

LESS BIASED RESULTS

Decrease in biased images compared to original user inputs.

Decrease in biased images compared to original user inputs.

Decrease in biased images compared to original user inputs.

CLICK RATE

Users want to use the suggested prompt structures to generate their images

Users want to use the suggested prompt structures to generate their images

Users want to use the suggested prompt structures to generate their images

TIME-SAVING

Users successfully save time in prompt generation

Users successfully save time in prompt generation

Users successfully save time in prompt generation

SATISFACTION RATES

How satisfied are the users with the generated results.

How satisfied are the users with the generated results.

How satisfied are the users with the generated results.

EFFECTIVENESS

Users can effectively use the guidance of the “Content Mindfulness” section by having all the categories eventually highlighted. 

Users can effectively use the guidance of the “Content Mindfulness” section by having all the categories eventually highlighted. 

Users can effectively use the guidance of the “Content Mindfulness” section by having all the categories eventually highlighted. 

CONCLUSION & LEARNINGS

CONCLUSION & LEARNINGS

CRITICAL THINKING ON HUMAN-IN-THE-LOOP

Currently, there is not much content mindfulness solution for text to image AI in the market. Therefore, our goal is to help users craft inclusive and efficient prompts on the fly.

USER CENTRIC FOCUS

In order to fully absorb the research question, we took a much longer process narrowing down our research question and form hypotheses. The prototype reflects our commitment to creating a user-centered and ethical text-to-image AI system.

You've come to the end of my design journey! If you need any additional information or just want to say hello, don’t hesitate to reach out!

FROM COLLABORATIONS TO COFFEE, IT IS ALWAYS A PLEASURE TO CONNECT WITH FELLOW DESIGNERS AND DEVELOPERS.

You've come to the end of my design journey! If you need any additional information or just want to say hello, don’t hesitate to reach out!

FROM COLLABORATIONS TO COFFEE, IT IS ALWAYS A PLEASURE TO CONNECT WITH FELLOW DESIGNERS AND DEVELOPERS.

You've come to the end of my design journey! If you need any additional information or just want to say hello, don’t hesitate to reach out!

FROM COLLABORATIONS TO COFFEE, IT IS ALWAYS A PLEASURE TO CONNECT WITH FELLOW DESIGNERS AND DEVELOPERS.

©2024 | Yesha Shah