ChatGPTのStable Diffusion用プロンプト作成法【例文付き】

2024年2月24日 2025年3月9日

谷田　朋貴

この記事でわかること

ChatGPTでStable Diffusionのプロンプトを作成する方法
ChatGPTでStable Diffusionのプロンプトを作成するメリット
マーケティングへの活用法

監修者プロフィール

谷田朋貴

一橋大学卒業後、伊藤忠丸紅鉄鋼株式会社、Web専業広告代理店を経て、株式会社電通デジタルに入社。アカウントプランナーとして国内の大手クライアントに対し、運用型広告を中心にデジタル全体のプロモーション施策の戦略立案・実行に従事。2023年12月、株式会社アドカルを創業。

Stable Diffusionに興味を持って調べている中で、

「Stable Diffusion用のプロンプトをChatGPTで作成できるのかな？」
「どのようなプロンプトをChatGPTに入力すればよいの？」
「Stable DiffusionとChatGPTを組み合わせてマーケティングに活用できる？」

といった疑問を持たれている方向けに、ChatGPTでStable Diffusionのプロンプトを作成する方法やメリット、マーケティングへの活用法について解説します。ぜひ最後までご覧ください。

LINE限定配信

生成AI情報を最速でGET！

✓ 週1回配信の最新生成AIニュースとトレンド情報

✓ LINE限定！使える生成AIプロンプトや技術情報を定期配信

✓ 今だけ全50ページの「ChatGPTの教科書」を無料プレゼント

期間限定の特典資料も配布中！

LINEで最新情報を受け取る →

＊新規登録者限定特典・4月30日まで

【お客様の声】

• 「生成AI情報で業務効率が3倍になりました」

• 「毎週の最新情報で常に一歩先を行けています」

• 「ChatGPTの教科書で基礎から学べました」

PCの方は
スマホで読み取り！

ChatGPTとは？
Stable Diffusionとは？
ChatGPTでStable Diffusionのプロンプトを作成する方法
ChatGPTでStable Diffusionのプロンプトを作成するメリット2点
- 画像生成における業務効率化
- クリエイティブの新しい切り口の発見
ChatGPTとStable Diffusionを組み合わせてマーケティングに活用する方法
- 広告バナーの制作
- SNSアカウント運用のクリエイティブ制作
生成AIビジネス研究所のYouTubeチャンネルで生成AIに関する情報を発信中！
まとめ

ChatGPTとは？

ChatGPT（チャットGPT）とは、ユーザーが入力した質問に対して、まるで人間のようにAIが回答するチャットサービスです。アメリカのOpenAI社が開発・提供しているサービスで、正式名称はChat Generative Pre-trained Transformer（文章生成モデル）になります。

2022年11月の公開以降、利用者が急増しています。

「GPT」という言語モデルがベースで、与えられたテキストの指示に対する自然言語の生成や、ネット上にある膨大な情報を学習して複雑な語彙や表現も理解できるのが特徴です。

Stable Diffusionとは？

Stable Diffusionは、テキストから画像を生成するための機械学習モデルです。プロンプトを入力することで、高品質な画像を生成することができ、クリエイティブな作品、デザイン、イラストレーションの制作に利用されています。

Stable Diffusionは、ディープラーニングと呼ばれる技術に基づいており、大量の画像とそれに関連するテキストデータを学習することで、与えられたテキスト記述に対応する画像を生成することができます。

ユーザーが特定の指示やイメージの説明を入力すると、それに基づいてStable Diffusionは画像を生成します。

例えば、「森の中のベンチに腰掛けて本を読んでいる男性」と入力すると、その記述に合った画像を生成することが可能です。

ChatGPTでStable Diffusionのプロンプトを作成する方法

ここでは、ChatGPTでStable Diffusionのプロンプトを作成する方法を具体例を用いて解説します。

下記のプロンプトのフォーマットを活用して、Stable Diffsuion用のプロンプトを作成し、実際にStable Diffusionで画像生成してみましょう。

・プロンプト例（下記太字部分を自由に変更して活用してください）

以下の手順に沿って、画像生成AIを活用し、現実感のある人物像を創出するための最適なプロンプトとネガティブプロンプトを策定してください。

#生成目標の画像詳細
構成の概要：具体的なシーンや背景を想定し、それに適した記述を行います。
民族や人種：多様性を考慮した選択肢を提供します。
年齢範囲：対象とする人物の年齢層を明確にします。
髪型と髪色：髪型のスタイルと色を具体的に記述します。
瞳の色：目の色に関して具体的な記述を加えます。
着用する服装：シチュエーションに合わせた服装の詳細を記述します。
表情：人物の感情を表す表情を具体的に指定します。

#条件
・プロンプトは、明瞭で具体的な英語の記述を用い、要素ごとにコンマで区切ります。
・最も重要な特徴をリストの先頭に置き、重要度が低いものは後にします。
・各プロンプトには、0.1から2.0までの範囲で重み付けを施し、特に強調したい要素を明示します。
・高品質な画像生成に必要なキーワード（例：ultra high quality, premium resolution）を含めて、画像のクオリティをさらに強化します。
・高解像度での出力を保証するキーワード（例：8k ultra high resolution）を加え、細部まで鮮明な画像を目指します。
・画像のリアリティを増すために、生々しい写真質（例：hyper-realistic, true-to-life photo quality）を目指すキーワードを選びます。
・人物の肌の質感や細部までを緻密に描写するためのキーワード（例：ultra-detailed skin texture, lifelike skin sheen）をプロンプトに含めます。
・ネガティブプロンプトは、生成したくない要素を明確に指定するものです。
・必要に応じて、ネガティブプロンプトのテンプレートを基に変更や追加を行ってください。

#ネガティブプロンプト
(worst quality,low quality:2), (painting,sketch,flat color), monochrome, grayscale, ugly face, bad face, poor anatomy, distorted eyes, absent fingers, acne, skin flaws, nsfw, nudity, nipples

#出力形式の変更
・プロンプトとネガティブプロンプトを、より詳細かつ具体的な指示に基づいて構築し、出力します。

#実施手順
1.プロンプトに基づいて画像を生成します。
2.「生成された画像にさらなる改善の余地はありますか？」とユーザーに問いかけ、具体的なフィードバックを求めます。
3.ユーザーからのフィードバックを反映してプロンプトを再度調整し、最適化を図ります。
4.上記のプロセスを繰り返し、最高品質の画像生成を目指します。

プロンプト例①草野球をしている男性

・プロンプト例

#生成目標の画像詳細
構成の概要：草野球で打席に入って構えている男性
民族や人種：日本人
年齢範囲：30代
髪型と髪色：黒髪の短髪
瞳の色：黒
着用する服装：青色の野球のユニフォーム
表情：真剣な表情

・出力されたプロンプト

“Create a hyper-realistic image of a Japanese man in his 30s, with short black hair and black eyes, showcasing a serious expression as he stands ready in the batting position at a grassy baseball field. He wears a blue baseball uniform, embodying the essence of a passionate player deeply focused on the game. Ensure the image captures the vibrant, lifelike colors and textures of the scene, with an emphasis on ultra-high quality (1.8), premium resolution, and 8k ultra-high resolution detail. Highlight the skin texture (1.7) and the natural sheen on the skin (1.6) to bring out the realism of the moment.”

・出力されたネガティブプロンプト

“Avoid any elements of low quality (2.0), including styles such as painting, sketch, flat color, monochrome, and grayscale. Exclude images with ugly or incorrect facial expressions, bad anatomy (2.0), distorted features, missing body parts, skin imperfections like acne or flaws, and absolutely no NSFW content, nudity, or explicit details.”

このプロンプトをStable Diffusionを活用した無料で使えるDreamStudioに入力します。

不自然な画像もありますが、ほぼ要望通りの画像が生成されました。

プロンプト例②スカイダイビングをしている女性

・プロンプト例

#生成目標の画像詳細
構成の概要：スカイダイビングをしている女性
民族や人種：アメリカ人
年齢範囲：20代
髪型と髪色：金髪
瞳の色：青色
着用する服装：スカイダイビング用の服
表情：怖がっている

・出力されたプロンプト

“A young American woman in her 20s skydiving, 1.5 (emphasis on the activity and the fear expression), with blonde hair and blue eyes, 1.3 (emphasis on distinctive facial features), wearing skydiving gear, 1.2 (emphasis on attire), showing a frightened expression, 1.5 (emphasis on the emotion). The image should capture ultra high quality, premium resolution, 1.8 (emphasis on image quality), with 8k ultra high resolution for crystal clear details, 1.8 (emphasis on resolution), and hyper-realistic, true-to-life photo quality, 2.0 (emphasis on realism), including ultra-detailed skin texture and lifelike skin sheen, 1.7 (emphasis on skin detail), to enhance the realism of the person.”

・出力されたネガティブプロンプト

“(worst quality,low quality:2), (painting,sketch,flat color), monochrome, grayscale, ugly face, bad face, poor anatomy, distorted eyes, absent fingers, acne, skin flaws, nsfw, nudity, nipples, avoid cartoonish or overly stylized representations, 2.0 (emphasis on avoiding non-realistic interpretations), ensure no background distractions or irrelevant elements, 1.5 (emphasis on focusing on the subject), and exclude any form of violence or inappropriate content, 2.0 (emphasis on maintaining appropriateness).”

同様にDreamStudioに入力すると下記画像が生成されました。

プロンプト例③サウナで瞑想している男性

・プロンプト例

#生成目標の画像詳細
構成の概要：サウナで瞑想している男性
民族や人種：日本人
年齢範囲：40代
髪型と髪色：白髪の長髪
瞳の色：黒
着用する服装：水着
表情：目をつぶって集中している

・出力されたプロンプト

“A Japanese man in his 40s meditating in a sauna, 1.5 (emphasis on the setting and activity), with long white hair and black eyes, 1.3 (emphasis on distinctive facial features), wearing a swimsuit, 1.2 (emphasis on attire), with his eyes closed focusing intently, 1.5 (emphasis on the expression). The image should capture ultra high quality, premium resolution, 1.8 (emphasis on image quality), with 8k ultra high resolution for crystal clear details, 1.8 (emphasis on resolution), and hyper-realistic, true-to-life photo quality, 2.0 (emphasis on realism), including ultra-detailed skin texture and lifelike skin sheen, 1.7 (emphasis on skin detail), to enhance the realism of the person.”

・出力されたネガティブプロンプト

“(worst quality,low quality:2), (painting, sketch, flat color), monochrome, grayscale, ugly face, bad face, poor anatomy, distorted eyes, absent fingers, acne, skin flaws, nsfw, nudity, nipples, avoid cartoonish or overly stylized representations, 2.0 (emphasis on avoiding non-realistic interpretations), ensure no background distractions or irrelevant elements, 1.5 (emphasis on focusing on the subject), and exclude any form of violence or inappropriate content, 2.0 (emphasis on maintaining appropriateness).”

同様にDreamStudioに入力すると下記画像が生成されました。

全50ページ「ChatGPTの教科書」を無料でダウンロード！

ChatGPTの基礎知識やビジネス活用法、業務で使えるプロンプト例をご紹介！

「ChatGPTの教科書」を無料でダウンロード

ChatGPTでStable Diffusionのプロンプトを作成するメリット2点

画像生成における業務効率化

Stable Diffusionで直接プロンプトを入力しようとすると、工数がかかります。

そこでChatGPTを上手く活用することにより、Stable Diffusioin用のプロンプトを自動で生成してくれるので業務の効率化を図ることができます。

プロンプトの作成にかける時間を短縮できれば、浮いた時間を別の工程にかけられるメリットがあるので、是非積極的にChatGPTでStable Diffusionのプロンプトを作成してみてください。