Grok Imagine 1.5 の image-to-video prompt は、まず第一フレームを守り、そのあと motion を足すと安定します。source image は単なる参考ではなく動画の開始点です。
TL;DR:フレームを守ってから動きを足す
- 強い source image から始めます。
- first-frame lock、identity、camera move、motion beats、timing、negative constraints の層で書きます。
- dynamic ではなく slow push-in、dolly、orbit、rack focus、parallax を使います。
- 動いてよい部分と変えてはいけない部分を明記します。
- 最終テキストは生成ではなく編集で追加します。
Prompt の構造
xAI workflow では still image が video の starting point になります。そのため prompt は motion より先に first-frame preservation を指定する必要があります。
| Layer | Instruction | Reason |
|---|---|---|
| First-frame lock | Begin exactly from the attached image. | Keeps the source frame from becoming loose inspiration. |
| Identity rules | Preserve face, product label, UI screen, hands, logo, and layout. | Prevents identity drift. |
| Camera language | Use push-in, dolly, orbit, pan, rack focus, parallax. | Creates controllable movement. |
| Motion beats | Add 1-3 timed movements. | Short clips need readable beats. |
| Negative constraints | No new people, no scene cut, no generated captions. | Reduces common artifacts. |
| Review check | Inspect identity, camera path, and text-safe space first. | Makes iteration specific. |
シナリオマトリクス
| Goal | Source image | Prompt focus | Check first |
|---|---|---|---|
| Product reveal | Clean product still. | Orbit, reflection, label lock. | Logo distortion. |
| Portrait teaser | Stable face crop. | Identity, breathing, push-in. | Face morphing. |
| Social clip | Vertical layout with empty headline area. | Handheld drift, light sweep. | Generated captions. |
| Cinematic scene | Depth between foreground and background. | Dolly, parallax, stable horizon. | Scene jump. |
| UI showcase | Clear screen hierarchy. | Locked screen and reflection control. | Fake UI changes. |
コピーできる Grok Imagine 1.5 image-to-video prompts
Prompt blocks stay English-only so they can be copied directly into the video workflow.

- Product reveal: Animate the attached product image as a 6-second premium launch shot. Keep the product silhouette, label position, and material unchanged. Start with a locked first frame, then add a slow 20-degree orbit, soft rim-light movement, subtle background parallax, realistic reflections, no new text, no logo distortion.
- Portrait motion: Animate the attached portrait as a calm editorial video. Preserve face identity, hairstyle, wardrobe color, and camera crop. Add a gentle push-in, natural breathing, soft fabric movement, eye contact held for the first 2 seconds, shallow depth of field, no extra hands, no face morphing.
- Social teaser: Turn the attached campaign still into a vertical 8-second teaser. Keep the subject placement and empty headline area unchanged. Add slow handheld drift, background light sweep, small foreground particle motion, one clean reveal beat at second 4, no generated captions, no watermark.
- Cinematic scene: Animate the attached environment still with controlled camera language. Begin exactly on the source image, then use a slow dolly forward, mild parallax between foreground and background, wind motion only on cloth and hair, stable horizon, no new characters, no sudden scene cut.
ケース 1:cinematic still の camera language
Keep the composition intact and add one camera path plus a few environmental beats.
- Prompt: Animate this still as a cinematic tutorial opener. Preserve the full composition and subject scale. Add a slow dolly forward, gentle parallax in the background, slight light movement across the main subject, and one subtle focus pull near the end. No new objects, no scene cut, no text.
ケース 2:reference image から first-frame planning

For lifestyle clips, name what the source image controls before adding small believable motion.
- Prompt: Animate this reference-style lifestyle image as a nostalgic 6-second shot. Keep the couple, couch, camera LCD framing, and room layout stable. Add tiny handheld camera drift, soft ambient light flicker, natural blinking, and a slow rack focus from the LCD screen to the people. No identity drift, no extra people, no subtitles.
例:still image から video prompt へ
Brief
A skincare bottle still needs a 6-second launch teaser. Bottle shape, label, cap color, and top headline space must stay stable.
Prompt version 1
- Animate the attached skincare bottle image as a 6-second premium launch teaser. Begin exactly from the source frame. Preserve bottle shape, cap color, label position, shadow, and empty top-third headline space. Add a slow 15-degree camera orbit, soft rim-light sweep, subtle reflection movement on the bottle, and mild background parallax. No new text, no logo distortion, no extra objects, no scene cut.
First revision
If identity drifts, strengthen preservation and reduce motion. If the clip is flat, keep the frame lock and add one timed beat.
失敗と修正
| Failure mode | Fix first | Avoid |
|---|---|---|
| Identity drift | Add first-frame lock and protected details. | More motion. |
| Random camera | Use one clear camera verb. | Stacking every move. |
| New objects | Add negative constraints. | New story beats. |
| Broken text | Reserve clean space for editing. | Final captions in generation. |
| Static clip | Add one timed beat. | Rewriting source role. |
Vogue AI での使い方
Use Vogue AI to build the still frame first, then pair that still with a concise motion prompt.
- GPT Image 2 helps with instruction-heavy still cleanup.
- Nano Banana helps quick image-to-image variations.
- Midjourney helps cinematic mood and fashion framing.
- Keep the video prompt shorter than the still-image prompt.
- Save the source still and motion prompt together.
FAQ
なぜ第一フレームが重要ですか?
Focus on first-frame preservation, protected identity details, one camera move, and one specific revision layer before regenerating.
source image をもう一度説明すべきですか?
Focus on first-frame preservation, protected identity details, one camera move, and one specific revision layer before regenerating.
motion prompt の長さは?
Focus on first-frame preservation, protected identity details, one camera move, and one specific revision layer before regenerating.
字幕や logo を生成できますか?
Focus on first-frame preservation, protected identity details, one camera move, and one specific revision layer before regenerating.
顔や商品が変わる理由は?
Focus on first-frame preservation, protected identity details, one camera move, and one specific revision layer before regenerating.
悪い結果の後どう修正しますか?
Focus on first-frame preservation, protected identity details, one camera move, and one specific revision layer before regenerating.