Question 1

Which languages does ACE-Step 1.5 support for vocals?

Accepted Answer

ACE-Step 1.5 supports English, Mandarin Chinese, Japanese, Korean, Spanish, French, German, Portuguese, Italian, and several other languages with high phonetic accuracy and natural prosody.

Question 2

Why is this marked as vocals-only?

Accepted Answer

ACE-Step 1.5 is optimized specifically for vocal music generation and does not support pure instrumental generation. For instrumental tracks, consider Mureka V8 or MiniMax 2.6.

Question 3

How do I specify the language for vocal generation?

Accepted Answer

Simply write your lyrics or style prompt in the target language, or include a language specification in your prompt (e.g., 'Japanese J-pop vocals with English bridge'). The model detects and applies the appropriate vocal style.

Question 4

Can I generate tracks longer than 5 minutes?

Accepted Answer

Yes. ACE-Step 1.5 supports up to 600 seconds (10 minutes) per generation, the longest limit on Tunee. This makes it perfect for extended compositions, suites, and long DJ-style tracks.

Output Format	MP3, WAV
Max Duration	600 seconds
Credits	9 per track
Language Support	10+ languages

ACE-Step 1.5

Start Creating with ACE-Step 1.5