V5 Story Maker Workflow

V5 Story Maker allows you to clone and recreate videos from YouTube sources with your own voiceover. This guide covers the complete workflow.


Overview

What it does: Takes a YouTube video as a visual source, combines it with your script and voiceover, and creates a new video that follows the original's visual structure.

Best for:

  • Content repurposing
  • Video localization
  • Creating derivative content
  • Educational recreations

Time required: 10-30 minutes


What You Need

Before starting, prepare:

Item          | Format           | Notes
YouTube URL   | Valid public URL | Source video to clone
Transcription | SRT file         | Your new script
Voiceover     | MP3 or WAV       | Your audio narration

How It Works

V5 Story Maker:

  1. Analyzes the source YouTube video's visual content
  2. Matches your script segments to source video scenes
  3. Extracts relevant clips from the source
  4. Composes a new video with your voiceover
  5. Delivers a complete video following the source's structure

Step 1: Access V5 Story Maker

  1. Click "V5 Story Maker" in the sidebar
  2. You'll see the main interface with:
    • YouTube URL input
    • File upload zones
    • Configuration options

Step 2: Enter YouTube URL

Adding the Source

  1. Paste the YouTube URL in the input field
    https://www.youtube.com/watch?v=XXXXXXXXXXX
    
  2. Click "Load Video" or press Enter
  3. Video metadata is fetched and displayed

URL Requirements

  • ✅ Public YouTube videos
  • ✅ Standard YouTube URLs
  • ✅ Short URLs (youtu.be)
  • ❌ Private/unlisted videos
  • ❌ Age-restricted content
  • ❌ Live streams
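
Before pasting a link, it can help to check that it has one of the accepted shapes. The sketch below is not part of V5 Story Maker; `extract_video_id` is a hypothetical helper, and the 11-character ID pattern is an assumption based on how YouTube IDs currently look. It only checks the URL shape, so it cannot tell whether a video is private, unlisted, age-restricted, or a live stream.

```python
import re
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Assumption: video IDs are 11 characters from this alphabet.
_VIDEO_ID = re.compile(r"[A-Za-z0-9_-]{11}")


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID for a standard or short URL, else None."""
    parsed = urlparse(url.strip())
    if parsed.scheme not in ("http", "https"):
        return None
    host = (parsed.hostname or "").lower()
    if host in ("www.youtube.com", "youtube.com", "m.youtube.com"):
        # Standard form: https://www.youtube.com/watch?v=<id>
        if parsed.path != "/watch":
            return None
        candidate = parse_qs(parsed.query).get("v", [""])[0]
    elif host == "youtu.be":
        # Short form: https://youtu.be/<id>
        candidate = parsed.path.lstrip("/")
    else:
        return None
    return candidate if _VIDEO_ID.fullmatch(candidate) else None
```

A `None` result corresponds to the cases where the tool would likely report "YouTube URL Invalid"; a non-`None` result only means the format is plausible.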

Video Preview

After loading:

  • Thumbnail displayed
  • Video title shown
  • Duration indicated
  • Source validated


Step 3: Upload Your Script (SRT)

  1. Click "Upload Transcription"
  2. Select your SRT file
  3. System validates and parses the file

SRT Format Example:

1
00:00:00,000 --> 00:00:04,000
Welcome to our exploration of ancient civilizations.

2
00:00:04,000 --> 00:00:08,500
Today we journey through the pyramids of Egypt.

3
00:00:08,500 --> 00:00:12,000
These magnificent structures have stood for millennia.

💡 Tip: Your script doesn't need to match the source video's content exactly—the AI will find the best visual matches.
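
To catch formatting mistakes before uploading, the SRT layout shown above can be parsed locally. This is a minimal sketch, not the parser V5 Story Maker uses; `Cue` and `parse_srt` are hypothetical names, and the code assumes the plain layout from the example (index line, timing line, then one or more text lines).

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Cue:
    index: int
    start: float  # seconds
    end: float    # seconds
    text: str


_TIMING = re.compile(
    r"(\d{2}):(\d{2}):(\d{2}),(\d{3})\s*-->\s*(\d{2}):(\d{2}):(\d{2}),(\d{3})"
)


def _seconds(h: str, m: str, s: str, ms: str) -> float:
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000


def parse_srt(content: str) -> List[Cue]:
    """Parse SRT text into cues; raise ValueError on a malformed block."""
    cues = []
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", content.strip()):
        lines = [line.strip() for line in block.splitlines()]
        if len(lines) < 3:
            raise ValueError(f"incomplete cue: {block!r}")
        match = _TIMING.fullmatch(lines[1])
        if match is None:
            raise ValueError(f"bad timing line: {lines[1]!r}")
        parts = match.groups()
        cues.append(
            Cue(
                index=int(lines[0]),
                start=_seconds(*parts[:4]),
                end=_seconds(*parts[4:]),
                text="\n".join(lines[2:]),
            )
        )
    return cues
```

For the three-cue example above, `parse_srt` returns three `Cue` objects, the last of which ends at 12.0 seconds.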


Step 4: Upload Voiceover

  1. Click "Upload Voiceover"
  2. Select your MP3 or WAV file
  3. Audio duration is detected

Audio Guidelines

  • Match the SRT timing exactly (narration should start and end where the SRT cues do)
  • Clear narration quality
  • Consistent audio levels
  • Minimal background noise
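
One way to check the first guideline is to compare the voiceover length with the end time of the last SRT cue. The sketch below uses Python's standard `wave` module, so it covers WAV files only (MP3 would need a third-party library). The helper names and the 0.5-second tolerance are assumptions for illustration, not values documented by V5 Story Maker.

```python
import wave


def wav_duration_seconds(path: str) -> float:
    """Return the duration of a WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_aligned(audio_seconds: float, last_cue_end: float,
               tolerance: float = 0.5) -> bool:
    """True if the audio length is within `tolerance` seconds of the last cue end.

    The default tolerance is an arbitrary illustrative value.
    """
    return abs(audio_seconds - last_cue_end) <= tolerance
```

If `is_aligned` returns `False`, either the recording or the SRT timestamps probably need adjusting before upload.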

Step 5: Configure Settings

Source Priority

Adjust how much the AI relies on the source video:

Setting | Behavior
50%     | More freedom to find alternative visuals
70%     | Balanced approach (recommended)
90%     | Strongly prefer source video clips

💡 Tip: Higher priority means closer to the original video's look.

AI Budget

Control AI processing intensity:

Level  | Description
Low    | Faster, less thorough matching
Medium | Balanced (recommended)
High   | More precise, slower processing

Step 6: Start Processing

  1. Review all inputs and settings
  2. Click "Create Story"
  3. Processing begins

Step 7: Monitor Progress

Processing Stages

Stage       | Description                  | Time
Downloading | Fetching source video        | 1-3 min
Analyzing   | Understanding source content | 2-4 min
Matching    | Pairing script to scenes     | 2-5 min
Composing   | Creating final video         | 5-10 min
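
Adding up the per-stage estimates gives a rough idea of total processing time. The short calculation below copies the figures from the stage table; the dictionary and function names are illustrative only.

```python
# Per-stage estimates in minutes (low, high), copied from the stage table.
STAGE_MINUTES = {
    "Downloading": (1, 3),
    "Analyzing": (2, 4),
    "Matching": (2, 5),
    "Composing": (5, 10),
}


def total_range(stages: dict) -> tuple:
    """Sum the low and high estimates across all stages."""
    low = sum(lo for lo, _ in stages.values())
    high = sum(hi for _, hi in stages.values())
    return low, high


print(total_range(STAGE_MINUTES))  # (10, 22)
```

The result, 10 to 22 minutes, sits inside the 10-30 minute range quoted in the Overview.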

Progress Display

  • Overall percentage
  • Current stage name
  • Stage-specific details
  • Time estimates

Step 8: Review and Download

Preview

Once complete:

  1. Video player shows result
  2. Watch full video
  3. Verify synchronization

Quality Check

  • ✅ Visuals match narration intent
  • ✅ Audio properly synchronized
  • ✅ Transitions are smooth
  • ✅ No jarring cuts

Download

  1. Click "Download"
  2. MP4 saves to device
  3. Optional: Download thumbnail

Understanding the Process

Scene Matching Logic

  1. Source Analysis - AI catalogs all scenes in source video
  2. Script Parsing - Your script is broken into segments
  3. Semantic Matching - Each segment matched to best scene
  4. Gap Filling - Missing matches use AI alternatives
  5. Composition - Final video assembled
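
The matching and gap-filling steps can be illustrated with a toy example. The actual matcher is not documented here, so this sketch swaps in a simple word-overlap (Jaccard) score as a stand-in for semantic matching. The scene descriptions, function names, and the 0.1 threshold are all assumptions made for illustration.

```python
import re
from typing import Dict, List, Optional


def _words(text: str) -> set:
    return set(re.findall(r"[a-z]+", text.lower()))


def jaccard(a: str, b: str) -> float:
    """Word-overlap score between 0 and 1 (a crude stand-in for semantic similarity)."""
    wa, wb = _words(a), _words(b)
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / len(wa | wb)


def match_segments(segments: List[str], scenes: Dict[str, str],
                   threshold: float = 0.1) -> List[Optional[str]]:
    """Pick the best scene ID per segment; None marks a gap to fill another way."""
    matches: List[Optional[str]] = []
    for segment in segments:
        best_id, best_score = None, 0.0
        for scene_id, description in scenes.items():
            score = jaccard(segment, description)
            if score > best_score:
                best_id, best_score = scene_id, score
        matches.append(best_id if best_score >= threshold else None)
    return matches
```

A segment that maps to `None` corresponds to the "Gap Filling" step, where an alternative visual would be needed instead of a source clip.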

What Gets Cloned

  • Visual structure and pacing
  • Scene transitions
  • Overall aesthetic

What's New

  • Your voiceover audio
  • Your script content
  • Timing based on your SRT

Best Practices

Choosing Source Videos

Good Sources:

  • ✅ Documentary footage
  • ✅ Stock video compilations
  • ✅ Educational content
  • ✅ Varied scene content

Avoid:

  • ❌ Heavily branded content
  • ❌ Music videos (visual-audio coupling)
  • ❌ Single-scene videos
  • ❌ Very short clips (< 1 min)

Script Alignment

For best results:

  • Write about similar topics to source
  • Match approximate length
  • Use clear, visual language

Source Priority Tuning

Scenario             | Recommended Priority
Same topic as source | 80-90%
Related topic        | 60-80%
Different topic      | 50-60%
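
The tuning table can be expressed as a small lookup, which is handy when preparing many videos. The ranges come from the table; the keys, function names, and the idea of starting at the midpoint of each range are assumptions for illustration.

```python
# Recommended Source Priority ranges (percent), copied from the tuning table.
RECOMMENDED_PRIORITY = {
    "same": (80, 90),
    "related": (60, 80),
    "different": (50, 60),
}


def recommended_priority(relation: str) -> tuple:
    """Return the (low, high) percent range for a script-to-source relation."""
    try:
        return RECOMMENDED_PRIORITY[relation.lower()]
    except KeyError:
        raise ValueError(f"unknown relation: {relation!r}") from None


def starting_value(relation: str) -> int:
    """Midpoint of the recommended range, as a first value to try."""
    low, high = recommended_priority(relation)
    return (low + high) // 2
```

For a related topic the midpoint is 70, which matches the balanced setting recommended in Step 5.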

Use Cases

Content Localization

  1. Find English documentary
  2. Write translated script
  3. Record voiceover in new language
  4. Generate localized version

Educational Repurposing

  1. Find educational video
  2. Write simplified/advanced script
  3. Record new narration
  4. Create new educational level

Topic Pivoting

  1. Find visually relevant source
  2. Write script on related topic
  3. Record new voiceover
  4. Generate content for new angle

Troubleshooting

"YouTube URL Invalid"

  • Check URL format
  • Ensure video is public
  • Try standard URL format

"Source Download Failed"

  • Video may be restricted
  • Try different source video
  • Check internet connection

"Poor Scene Matches"

  • Lower source priority
  • Increase AI budget
  • Choose better source video

"Audio Out of Sync"

  • Verify SRT timestamps
  • Check voiceover matches SRT
  • Regenerate if needed
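
Verifying SRT timestamps by eye is error-prone, so a small check can help. The sketch below flags cues that end before they start or that overlap the previous cue. Treating overlaps as a problem is an assumption suited to single-voice narration (the SRT format itself permits them), and `find_timing_problems` is a hypothetical helper.

```python
from typing import List, Tuple


def find_timing_problems(cues: List[Tuple[float, float]]) -> List[str]:
    """Check (start, end) pairs in seconds, given in file order."""
    problems = []
    previous_end = 0.0
    for number, (start, end) in enumerate(cues, start=1):
        if end <= start:
            problems.append(f"cue {number}: end is not after start")
        if start < previous_end:
            problems.append(f"cue {number}: overlaps the previous cue")
        previous_end = max(previous_end, end)
    return problems
```

An empty list means the timestamps are ordered and non-overlapping; it does not prove the voiceover itself matches them.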

Comparison with Other Workflows

Feature V5 Story VidStitch AI B-roll Clips
Visual Source YouTube Web search Your footage
Script Required ❌ (transcript)
Voiceover Required
Control Level Medium Low High
Best For Cloning Creating Enhancing

Next Steps