Tutorial · Published June 4, 2026 · 10 min read

Grok Imagine 1.5 Image-to-Video Prompt Guide

A practical guide to writing controllable video prompts that protect the still image as the first frame, using camera language and copyable examples.

By Vogue AI Team · Updated June 4, 2026

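Grok Imagine 1.5 image-to-video prompts are more stable when you protect the first frame first and add motion afterwards. The source image is not just a reference; it is the starting point of the video.

TL;DR: Protect the frame, then add motion

  • Start from a strong source image.
  • Write in layers: first-frame lock, identity, camera move, motion beats, timing, negative constraints.
  • Replace vague words such as "dynamic" with specific camera terms: slow push-in, dolly, orbit, rack focus, parallax.
  • State which parts may move and which parts must not change.
  • Add final text in editing, not in generation.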

Prompt structure

In the xAI workflow, the still image becomes the starting point of the video. The prompt therefore needs to specify first-frame preservation before it describes motion.

Layer | Instruction | Reason
First-frame lock | Begin exactly from the attached image. | Keeps the source frame from becoming loose inspiration.
Identity rules | Preserve face, product label, UI screen, hands, logo, and layout. | Prevents identity drift.
Camera language | Use push-in, dolly, orbit, pan, rack focus, parallax. | Creates controllable movement.
Motion beats | Add 1-3 timed movements. | Short clips need readable beats.
Negative constraints | No new people, no scene cut, no generated captions. | Reduces common artifacts.
Review check | Inspect identity, camera path, and text-safe space first. | Makes iteration specific.
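
The layers in the table above can be assembled mechanically. The following is a minimal Python sketch, not an official xAI tool: the `MotionPrompt` class, its field names, and the sentence templates are hypothetical and only illustrate the ordering (first-frame lock, then identity, camera, beats, negatives).

```python
from dataclasses import dataclass, field


@dataclass
class MotionPrompt:
    """Hypothetical container for the prompt layers; not part of any xAI API."""
    protected: list[str]                                 # identity rules: details that must not change
    camera_move: str                                     # one clear camera move
    beats: list[str] = field(default_factory=list)       # 1-3 timed movements
    negatives: list[str] = field(default_factory=list)   # e.g. "no scene cut"

    def render(self) -> str:
        # The table asks for 1-3 motion beats; reject anything else.
        if not 1 <= len(self.beats) <= 3:
            raise ValueError("use 1-3 timed motion beats")
        parts = [
            "Begin exactly from the attached image.",        # first-frame lock comes first
            "Preserve " + ", ".join(self.protected) + ".",   # identity rules
            "Add " + self.camera_move + ".",                 # camera language
            "Motion beats: " + "; ".join(self.beats) + ".",  # motion beats
        ]
        if self.negatives:
            joined = ", ".join(self.negatives)
            parts.append(joined[0].upper() + joined[1:] + ".")  # negative constraints last
        return " ".join(parts)


prompt = MotionPrompt(
    protected=["bottle shape", "cap color", "label position"],
    camera_move="a slow 15-degree camera orbit",
    beats=["soft rim-light sweep", "subtle reflection movement on the bottle"],
    negatives=["no new text", "no logo distortion", "no scene cut"],
)
print(prompt.render())
```

Keeping the lock sentence fixed and first means every generated prompt states preservation before motion, whatever the other layers contain.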

Scenario matrix

Goal | Source image | Prompt focus | Check first
Product reveal | Clean product still. | Orbit, reflection, label lock. | Logo distortion.
Portrait teaser | Stable face crop. | Identity, breathing, push-in. | Face morphing.
Social clip | Vertical layout with empty headline area. | Handheld drift, light sweep. | Generated captions.
Cinematic scene | Depth between foreground and background. | Dolly, parallax, stable horizon. | Scene jump.
UI showcase | Clear screen hierarchy. | Locked screen and reflection control. | Fake UI changes.
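
If you review many clips, the matrix can be kept as a small lookup so each goal carries its own first check. This is an illustrative Python sketch: the row values are copied from the matrix, while the `SCENARIOS` dictionary layout and the `review_note` helper are assumptions made for the example.

```python
# Rows copied from the scenario matrix; the structure is a hypothetical convenience.
SCENARIOS = {
    "product reveal": {
        "source": "clean product still",
        "focus": ["orbit", "reflection", "label lock"],
        "check_first": "logo distortion",
    },
    "portrait teaser": {
        "source": "stable face crop",
        "focus": ["identity", "breathing", "push-in"],
        "check_first": "face morphing",
    },
    "social clip": {
        "source": "vertical layout with empty headline area",
        "focus": ["handheld drift", "light sweep"],
        "check_first": "generated captions",
    },
    "cinematic scene": {
        "source": "depth between foreground and background",
        "focus": ["dolly", "parallax", "stable horizon"],
        "check_first": "scene jump",
    },
    "ui showcase": {
        "source": "clear screen hierarchy",
        "focus": ["locked screen", "reflection control"],
        "check_first": "fake UI changes",
    },
}


def review_note(goal: str) -> str:
    """Return a one-line reminder of what to write and what to inspect first."""
    row = SCENARIOS[goal.lower()]  # raises KeyError for goals not in the matrix
    return f"{goal}: focus on {', '.join(row['focus'])}; check {row['check_first']} first."


print(review_note("Portrait teaser"))
```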

Copyable Grok Imagine 1.5 image-to-video prompts

Prompt blocks stay English-only so they can be copied directly into the video workflow.

Image: Cinematic still for camera motion. This first-party prompt-library image matches the dolly, parallax, light sweep, and focus-pull examples.
  • Product reveal: Animate the attached product image as a 6-second premium launch shot. Keep the product silhouette, label position, and material unchanged. Start with a locked first frame, then add a slow 20-degree orbit, soft rim-light movement, subtle background parallax, realistic reflections, no new text, no logo distortion.
  • Portrait motion: Animate the attached portrait as a calm editorial video. Preserve face identity, hairstyle, wardrobe color, and camera crop. Add a gentle push-in, natural breathing, soft fabric movement, eye contact held for the first 2 seconds, shallow depth of field, no extra hands, no face morphing.
  • Social teaser: Turn the attached campaign still into a vertical 8-second teaser. Keep the subject placement and empty headline area unchanged. Add slow handheld drift, background light sweep, small foreground particle motion, one clean reveal beat at second 4, no generated captions, no watermark.
  • Cinematic scene: Animate the attached environment still with controlled camera language. Begin exactly on the source image, then use a slow dolly forward, mild parallax between foreground and background, wind motion only on cloth and hair, stable horizon, no new characters, no sudden scene cut.

Case 1: Camera language for a cinematic still

Keep the composition intact and add one camera path plus a few environmental beats.

  • Prompt: Animate this still as a cinematic tutorial opener. Preserve the full composition and subject scale. Add a slow dolly forward, gentle parallax in the background, slight light movement across the main subject, and one subtle focus pull near the end. No new objects, no scene cut, no text.

Case 2: First-frame planning from a reference image

Image: Lifestyle still with camera LCD framing. This first-party image makes first-frame preservation easy to explain because the people, couch, LCD screen, and room layout all need stable roles.

For lifestyle clips, name what the source image controls before adding small believable motion.

  • Prompt: Animate this reference-style lifestyle image as a nostalgic 6-second shot. Keep the couple, couch, camera LCD framing, and room layout stable. Add tiny handheld camera drift, soft ambient light flicker, natural blinking, and a slow rack focus from the LCD screen to the people. No identity drift, no extra people, no subtitles.

Example: From still image to video prompt

Brief

A skincare bottle still needs a 6-second launch teaser. Bottle shape, label, cap color, and top headline space must stay stable.

Prompt version 1

  • Animate the attached skincare bottle image as a 6-second premium launch teaser. Begin exactly from the source frame. Preserve bottle shape, cap color, label position, shadow, and empty top-third headline space. Add a slow 15-degree camera orbit, soft rim-light sweep, subtle reflection movement on the bottle, and mild background parallax. No new text, no logo distortion, no extra objects, no scene cut.

First revision

If identity drifts, strengthen preservation and reduce motion. If the clip is flat, keep the frame lock and add one timed beat.
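
The two revision rules above can be sketched as a small function. This is a hedged Python illustration: the `revise` name, the problem labels, and the choice to "reduce motion" by keeping only the first beat are assumptions for the example, not a prescribed procedure.

```python
def revise(protected, beats, problem, extra_detail=None, timed_beat=None):
    """Apply one revision layer and return new (protected, beats) lists.

    problem is "identity_drift" or "flat" (hypothetical labels).
    """
    protected, beats = list(protected), list(beats)  # never mutate the caller's lists
    if problem == "identity_drift":
        # Strengthen preservation: name one more protected detail if given.
        if extra_detail:
            protected.append(extra_detail)
        # Reduce motion: here interpreted as keeping only the first beat.
        beats = beats[:1]
    elif problem == "flat":
        # Keep the frame lock and protected details as they are; add one timed beat.
        if timed_beat is None:
            raise ValueError("a flat clip needs one timed beat")
        beats.append(timed_beat)
    else:
        raise ValueError(f"unknown problem: {problem}")
    return protected, beats


print(revise(["bottle shape", "label position"],
             ["rim-light sweep", "reflection movement"],
             "identity_drift", extra_detail="cap color"))
```

Changing only one layer per regeneration makes it easier to tell which change fixed the clip.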

Failures and fixes

Failure mode | Fix first | Avoid
Identity drift | Add first-frame lock and protected details. | More motion.
Random camera | Use one clear camera verb. | Stacking every move.
New objects | Add negative constraints. | New story beats.
Broken text | Reserve clean space for editing. | Final captions in generation.
Static clip | Add one timed beat. | Rewriting source role.
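
Some of these failure modes can be caught before generating, by checking the prompt text itself. The Python sketch below is a rough heuristic under stated assumptions: the `lint_prompt` function and its keyword checks are hypothetical, they only look for wording used in this guide, and they cannot judge the resulting video.

```python
import re

# Camera terms taken from the prompt-structure table in this guide.
CAMERA_TERMS = ["push-in", "dolly", "orbit", "pan", "rack focus", "parallax"]


def lint_prompt(prompt: str) -> list[str]:
    """Return warnings for layers that appear to be missing from a motion prompt."""
    text = prompt.lower()
    warnings = []
    if "begin exactly" not in text and "first frame" not in text:
        warnings.append("identity drift risk: no first-frame lock")
    if not re.search(r"\b(preserve|keep)\b", text):
        warnings.append("identity drift risk: no protected details")
    # Word boundaries stop "pan" from matching words such as "expand".
    if not any(re.search(r"\b" + re.escape(t) + r"\b", text) for t in CAMERA_TERMS):
        warnings.append("random camera risk: no named camera move")
    if not re.search(r"\bno \w+", text):
        warnings.append("new objects risk: no negative constraints")
    if re.search(r"\bdynamic\b", text):
        warnings.append("vague motion word: dynamic")
    return warnings


for warning in lint_prompt("Make the bottle dynamic and cool."):
    print(warning)
```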

Using this in Vogue AI

Use Vogue AI to build the still frame first, then pair that still with a concise motion prompt.

  • GPT Image 2 helps with instruction-heavy still cleanup.
  • Nano Banana helps with quick image-to-image variations.
  • Midjourney helps with cinematic mood and fashion framing.
  • Keep the video prompt shorter than the still-image prompt.
  • Save the source still and motion prompt together.
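
One simple way to keep the source still and its motion prompt together is a JSON sidecar file next to the image. This is a minimal Python sketch: the `save_motion_prompt` helper, the `.motion.json` naming, and the record keys are assumptions for illustration, not a Vogue AI feature.

```python
import json
from pathlib import Path


def save_motion_prompt(still_path, prompt: str) -> Path:
    """Write <still name>.motion.json beside the still and return its path."""
    still = Path(still_path)
    sidecar = still.with_name(still.stem + ".motion.json")  # e.g. bottle.png -> bottle.motion.json
    record = {
        "source_still": still.name,   # which image the prompt was written for
        "motion_prompt": prompt,      # the exact text used for generation
    }
    sidecar.write_text(json.dumps(record, indent=2), encoding="utf-8")
    return sidecar
```

Calling `save_motion_prompt("stills/bottle.png", prompt_text)` would write `stills/bottle.motion.json`, provided the `stills` directory already exists.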

FAQ

Why does the first frame matter?

Because the source image is the starting point of the video rather than a reference. Locking the first frame before describing motion keeps the clip from treating the still as loose inspiration.

Should I describe the source image again?

Not in full. Name what the source image controls and what must stay unchanged (for example face, product label, logo, and layout), then spend the rest of the prompt on motion.

How long should the motion prompt be?

Keep it concise: shorter than the still-image prompt, with one clear camera move and 1-3 timed motion beats.

Why do faces or products change?

This is identity drift. It tends to appear when the prompt has no first-frame lock or protected details, or asks for too much motion. Add the lock and the protected details first instead of adding more motion.

How do I revise after a bad result?

Change one specific layer before regenerating. If identity drifts, strengthen preservation and reduce motion; if the clip is flat, keep the frame lock and add one timed beat.