What is SoundID VoiceAI?

SoundID VoiceAI DAW plugin explained.

 


 

What is SoundID VoiceAI?

SoundID VoiceAI is an AI-powered voice and instrument transformation plugin for DAWs. It changes a recorded singing voice into the voice of another person or into an instrument:

  • Voice model library: transform your vocal track into a realistic singing voice from a studio-grade AI library of 23 voice models
  • Instrument model library: transform your melodic humming or beatboxing to sound like drums, guitar, violin, or other instruments from a studio-grade AI library of 21 instrument models

 

With SoundID VoiceAI you can transform singing voice tracks, generate backing vocals from a single voice track, transform speaking voice tracks, and turn vocal input such as humming or beatboxing into realistic instruments or drums. This makes it quick to transfer melodic ideas into your DAW or to generate creative sounds.

 

 

How does it work?

VoiceAI extracts audio information from the source voice track, passes it on to the VoiceAI model selected in the DAW plugin, and applies the target voice (similar to a virtual instrument) to produce a target voice track. The resulting voice track keeps most of the key melodic properties of the input voice but replaces all other details with sounds generated by the target voice model you have selected.

 

Learn more here: Setting up with SoundID VoiceAI plugin

 

Does the input track need to be high quality?

The SoundID VoiceAI plugin can handle a relatively wide range of recording quality for the input track. Regular phone microphone recordings made in an ordinary space with some reverb are perfectly okay to use - after processing, the output will have the properties of studio-quality audio captured with a great microphone.

 

This applies only to a certain degree. There are some limits to take into consideration:

  • Repeated AI processing on the same audio capture will not produce identical results. Due to the creative nature of the AI models in SoundID VoiceAI, results will be slightly different each time.
  • Excessive reverb on the input audio can lead to melodic artifacts in the output.
  • It is possible to Reprocess the results for free (limited to 10 times per hour) to minimize excessive artifacts.
  • When applied to non-English singing, some amount of English accent might bleed over into the processed voice depending on the preset applied.
  • The AI models can sometimes introduce artifacts, such as clipped "s" sounds, into the processing results. This is typically resolved by re-processing or by adjusting the Transpose setting to a value closer to the input track pitch.
  • The AI models work great for normal spoken voice tracks too. However, when applied to extreme emotional states of speech such as whispering or shouting, artifacts are possible.
  • The intonation of the input voice audio is a key aspect of the AI models. Raspiness in the voice (rough, raspy, strained, or breathy properties) can lead to artifacts in the processing results.

 

How much does it cost?

Since the audio processing is cloud-based and server costs are calculated on a per-minute basis, SoundID VoiceAI uses a pay-as-you-go model: you pay only for the token packs needed for audio processing. There are no subscription fees or other hidden charges involved, and the SoundID VoiceAI plugin itself is free to download and install. Here is what you need to know:

  • Processing cost: 600 tokens = 1 minute of audio processing.
  • A minimum charge of 70 tokens (7 seconds) applies to each processing instance, followed by increments of 10 tokens (1 second).
  • Transpose adjustments to an already processed audio capture will require re-processing.
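
The pricing rules above can be turned into a quick estimate. The following is a minimal sketch, not an official calculator: the function name `estimate_tokens` is hypothetical, and it assumes that partial seconds are rounded up to the next whole second, which the article does not state explicitly.

```python
import math

TOKENS_PER_SECOND = 10   # 600 tokens = 1 minute of audio processing
MINIMUM_TOKENS = 70      # minimum charge per processing instance (7 seconds)


def estimate_tokens(duration_seconds: float) -> int:
    """Estimate the token cost of processing one audio capture.

    Assumption: partial seconds are billed as a full second.
    """
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    billed_seconds = math.ceil(duration_seconds)
    # Apply the 70-token minimum, then charge 10 tokens per billed second.
    return max(MINIMUM_TOKENS, billed_seconds * TOKENS_PER_SECOND)


if __name__ == "__main__":
    for seconds in (3, 7.2, 60, 180):
        print(f"{seconds} s -> {estimate_tokens(seconds)} tokens")
```

Under these assumptions, a 3-second clip is charged the 70-token minimum, and a 3-minute vocal take costs 1,800 tokens.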

SoundID VoiceAI token packs can be purchased from your Sonarworks Account:

  • Small token pack: 72,000 tokens (120 minutes of audio processing) - 19.99 EUR/USD
  • Medium token pack: 180,000 tokens (300 minutes of audio processing) - 39.99 EUR/USD
  • Large token pack: 360,000 tokens (600 minutes of audio processing) - 69.99 EUR/USD
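
To compare the packs, it can help to work out the price per minute of processing. The short sketch below uses only the figures listed above; the names `TOKEN_PACKS` and `pack_summary` are illustrative and not part of any Sonarworks tool.

```python
TOKENS_PER_MINUTE = 600  # 600 tokens = 1 minute of audio processing

# Pack name -> (tokens, price in EUR/USD), as listed in this article.
TOKEN_PACKS = {
    "Small": (72_000, 19.99),
    "Medium": (180_000, 39.99),
    "Large": (360_000, 69.99),
}


def pack_summary(tokens: int, price: float) -> tuple[float, float]:
    """Return (minutes of processing, price per minute) for a token pack."""
    minutes = tokens / TOKENS_PER_MINUTE
    return minutes, price / minutes


if __name__ == "__main__":
    for name, (tokens, price) in TOKEN_PACKS.items():
        minutes, per_minute = pack_summary(tokens, price)
        print(f"{name}: {minutes:.0f} minutes, about {per_minute:.2f} per minute")
```

With the listed prices, this works out to roughly 0.17, 0.13, and 0.12 EUR/USD per minute for the Small, Medium, and Large packs respectively.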

 

A 7-day trial with 9,000 free tokens is available in your Sonarworks Account. If you haven't created a Sonarworks Account in the past, sign up here.

 

Note: The trial tokens will expire once the 7-day trial runs out, or once a token purchase is made. 

 

