Music Model

ACE-Step 1.5

Multilingual flagship model with precise style control

9 credits per track | Max Duration: 600s | Vocals
Try on Tunee ↗
What is it?

ACE-Step 1.5 is an AI music generation model available on Tunee, described as a multilingual flagship model with precise style control. It costs 9 credits per track and supports a maximum duration of 600 seconds (10 minutes) per generation. It is aimed at both professional creators and people who are just getting started.

Technical Specs
Output Format: MP3, WAV
Max Duration: 600 seconds
Credits: 9 per track
Language Support: 10+ languages
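
The credit and duration figures above lend themselves to a quick budgeting check. The following is a minimal sketch in Python, assuming the 9-credit rate is flat per track regardless of length (the specs state only "per track"). The function names are hypothetical helpers for illustration and are not part of any Tunee API.

```python
# Figures taken from the Technical Specs list.
CREDITS_PER_TRACK = 9
MAX_DURATION_SECONDS = 600


def credits_needed(track_count: int) -> int:
    """Total credits for track_count generations (assumes a flat per-track rate)."""
    if track_count < 0:
        raise ValueError("track_count must be non-negative")
    return track_count * CREDITS_PER_TRACK


def tracks_affordable(credit_balance: int) -> int:
    """How many full tracks a given credit balance covers."""
    if credit_balance < 0:
        raise ValueError("credit_balance must be non-negative")
    return credit_balance // CREDITS_PER_TRACK


def fits_in_one_generation(duration_seconds: int) -> bool:
    """True if the requested length is within the 600-second limit."""
    return 0 < duration_seconds <= MAX_DURATION_SECONDS
```

For example, under these assumptions a balance of 100 credits covers 11 tracks with 1 credit left over, and a 601-second request would exceed the single-generation limit.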
Why Choose This Model
True Multilingual Vocals

ACE-Step 1.5 is trained on an extensive multilingual vocal dataset, delivering natural-sounding singing in English, Chinese, Japanese, Korean, Spanish, French, and more with authentic phonetics and prosody.

Precise Style Control

A fine-grained style conditioning system allows detailed control over vocal character, singing technique, genre tropes, and production style, so creators can steer the result closely.

Extended Duration Support

With the longest maximum duration of any model on Tunee at 600 seconds, ACE-Step 1.5 is ideal for extended compositions, full-length songs, and long-form audio experiences.

How to Use on Tunee
01
Open Tunee

Go to tunee.ai and sign in to your account — or create one free in seconds.

02
Select This Model

In the music or video creation screen, open the model selector and choose this model from the list.

03
Generate & Download

Describe your creative vision, hit Generate, and download your track or video clip instantly.

Use Cases
Multilingual song production for global audiences
K-pop, J-pop, and C-pop style music creation
Long-form vocal music for albums and extended plays
Frequently Asked Questions
Which languages does ACE-Step 1.5 support for vocals?
ACE-Step 1.5 supports English, Mandarin Chinese, Japanese, Korean, Spanish, French, German, Portuguese, Italian, and several other languages with high phonetic accuracy and natural prosody.
Why is this marked as vocals-only?
ACE-Step 1.5 is optimized specifically for vocal music generation and does not support pure instrumental generation. For instrumental tracks, consider Mureka V8 or MiniMax 2.6.
How do I specify the language for vocal generation?
Simply write your lyrics or style prompt in the target language, or include a language specification in your prompt (e.g., 'Japanese J-pop vocals with English bridge'). The model detects and applies the appropriate vocal style.
Can I generate tracks longer than 5 minutes?
Yes. ACE-Step 1.5 supports up to 600 seconds (10 minutes) per generation, the longest limit on Tunee. This makes it perfect for extended compositions, suites, and long DJ-style tracks.
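
For creators who prepare prompts in bulk, the language specification described in this answer can be assembled programmatically. The sketch below is only an illustration of the prompt wording: `build_style_prompt` is a hypothetical helper, its output is plain text to paste into the prompt field, and nothing here calls or implies a Tunee API.

```python
from typing import Dict, Optional


def build_style_prompt(
    language: str,
    genre: str,
    section_languages: Optional[Dict[str, str]] = None,
) -> str:
    """Compose a style prompt naming the main vocal language and genre.

    section_languages maps a song section (e.g. "bridge") to the language
    that section should be sung in.
    """
    prompt = f"{language} {genre} vocals"
    # Append per-section language overrides, if any were given.
    overrides = [
        f"{section_language} {section}"
        for section, section_language in (section_languages or {}).items()
    ]
    if overrides:
        prompt += " with " + " and ".join(overrides)
    return prompt


# Reproduces the example phrasing from the answer above.
print(build_style_prompt("Japanese", "J-pop", {"bridge": "English"}))
```

Whether the model honors every per-section override is not stated on this page, so treat multi-language prompts as something to verify by listening to the result.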

Start Creating with ACE-Step 1.5

Join thousands of creators already using Tunee to make stunning music and video.
