Merge Audio

Combine multiple audio clips into one file

The Merge Audio block takes an array of MP3 URLs and combines them into a single audio file. It is categorized under Video Editing and is designed for workflows that need to stitch together multiple audio segments (narration clips, music tracks, or generated speech) into one continuous output. The URL of the resulting file is saved to the variable you name in the destinationVar field.

The block's primary input is a list of remote MP3 URLs. Three parameters are optional: fileMetadata (key-value pairs in the ffmpeg MP3 metadata format), albumArtUrl (the URL of a remote image file to embed in the output), and the intermediateAsset flag. When intermediateAsset is set to true, the merged file is created but hidden from the gallery, which is useful when the file is an intermediate step rather than a final deliverable.
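As a rough illustration of how these inputs fit together, the sketch below models them as a Python dataclass. The field names (mp3Urls, destinationVar, fileMetadata, albumArtUrl, intermediateAsset) come from this page; the MergeAudioInput class, the to_config helper, the example URLs, the dictionary shape of fileMetadata, and the default of False for intermediateAsset are assumptions made for the example, not the block's documented schema.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class MergeAudioInput:
    """Hypothetical container for the Merge Audio block's fields."""

    mp3Urls: List[str]                      # remote MP3 URLs, in playback order
    destinationVar: str                     # variable that receives the output URL
    fileMetadata: Dict[str, str] = field(default_factory=dict)  # assumed dict shape
    albumArtUrl: Optional[str] = None       # remote image to embed as album art
    intermediateAsset: bool = False         # assumed default; True hides the file

    def to_config(self) -> Dict[str, Any]:
        """Return a plain dict, leaving out optional fields that were not set."""
        config: Dict[str, Any] = {
            "mp3Urls": list(self.mp3Urls),
            "destinationVar": self.destinationVar,
        }
        if self.fileMetadata:
            config["fileMetadata"] = dict(self.fileMetadata)
        if self.albumArtUrl:
            config["albumArtUrl"] = self.albumArtUrl
        if self.intermediateAsset:
            config["intermediateAsset"] = True
        return config


# Placeholder URLs and names, used only for illustration.
example = MergeAudioInput(
    mp3Urls=["https://example.com/intro.mp3", "https://example.com/main.mp3"],
    destinationVar="mergedAudioUrl",
    fileMetadata={"title": "Episode 1", "artist": "Example Host"},
    intermediateAsset=True,
)
config = example.to_config()
```

Keeping the optional fields out of the dictionary unless they are set makes it easy to see which inputs a given workflow step actually relies on.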

This block fits naturally into audio production pipelines, podcast assembly workflows, text-to-speech applications where multiple generated segments need to be joined, and any workflow that produces audio in stages before delivering a finished file. The output URL stored in the destination variable can be passed to downstream blocks for further processing, delivery, or storage.

What you can build

Real-world workflows powered by the Merge Audio block.

Podcast Episode Assembly

Combine individually recorded or generated segments — intro, main content, and outro — into a single podcast episode file.

Text-to-Speech Stitching

Join multiple AI-generated speech clips produced from long-form text into one continuous audio file for playback or download.

Audiobook Chapter Creation

Merge narrated chapter segments into a complete audiobook chapter, optionally embedding cover art and metadata like title and author.

Music Track Compilation

Concatenate multiple music clips or stems into a single MP3 file as part of an automated music production workflow.

Video Voiceover Preparation

Assemble separate voiceover takes into one audio track before syncing it with video in a downstream editing step.

Intermediate Audio Pipeline Step

Merge audio clips as a hidden intermediate asset within a larger workflow, keeping the gallery clean while passing the output URL to subsequent blocks.

Common questions about Merge Audio

What are the required parameters for the Merge Audio block?

The required parameters are mp3Urls, which is an array of remote MP3 file URLs to merge, and destinationVar, which is the variable name where the output file URL will be saved.
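A quick pre-flight check can catch a missing required parameter before the block runs. The sketch below is one possible way to do that in Python; the find_problems function is hypothetical, and the rule that every URL must start with http:// or https:// is an assumption drawn from the word "remote", not a documented constraint.

```python
from typing import Any, List, Mapping


def find_problems(config: Mapping[str, Any]) -> List[str]:
    """Return human-readable problems with the two required fields."""
    problems: List[str] = []

    urls = config.get("mp3Urls")
    if not isinstance(urls, list) or not urls:
        problems.append("mp3Urls must be a non-empty list of URLs")
    elif not all(
        isinstance(u, str) and u.startswith(("http://", "https://")) for u in urls
    ):
        # Assumed rule: "remote" is taken to mean an http(s) URL.
        problems.append("every mp3Urls entry must be an http(s) URL string")

    dest = config.get("destinationVar")
    if not isinstance(dest, str) or not dest.strip():
        problems.append("destinationVar must be a non-empty variable name")

    return problems


ok = find_problems(
    {"mp3Urls": ["https://example.com/a.mp3"], "destinationVar": "mergedAudioUrl"}
)
missing = find_problems({})
```

An empty list means both required fields are present and well formed under these assumed rules.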

What does the block return?

The block saves the URL of the merged audio file to the variable specified in destinationVar. That URL can then be referenced by downstream blocks in the workflow.

What is the intermediateAsset option used for?

When intermediateAsset is set to true, the merged audio file is created and accessible via its URL but is hidden from the gallery. This is useful when the merged file is a step in a larger pipeline rather than a final output you want users to see.

Can I embed metadata or album art in the merged file?

Yes. The fileMetadata field accepts key-value pairs in ffmpeg MP3 metadata format, allowing you to set tags like title, artist, or album. The albumArtUrl field accepts a URL pointing to a remote image file that will be embedded in the output as album art.
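For readers unfamiliar with ffmpeg's metadata conventions, the sketch below shows how tag pairs such as title, artist, and album are typically passed to the ffmpeg command line, as repeated -metadata key=value arguments. This illustrates the format only; the metadata_args helper and the sample tag values are hypothetical, and the page does not say how the block applies the tags internally.

```python
from typing import Dict, List


def metadata_args(tags: Dict[str, str]) -> List[str]:
    """Turn tag pairs into ffmpeg-style '-metadata key=value' arguments."""
    args: List[str] = []
    for key, value in tags.items():
        args.extend(["-metadata", f"{key}={value}"])
    return args


# Sample values for illustration; title, artist, and album are the tags named above.
tags = {"title": "Chapter 1", "artist": "Example Author", "album": "Example Book"}
args = metadata_args(tags)
```

Each pair becomes two list items, so the result can be appended to an argument list without any shell quoting.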

What kinds of workflows commonly use this block?

This block is commonly used in text-to-speech pipelines, podcast or audiobook assembly workflows, voiceover preparation for video editing, and any multi-step audio production workflow where audio is generated or recorded in segments and needs to be joined into a single file.
