Adobe VoCo is an unreleasedaudio editing andgenerating prototype software byAdobe that enables novel editing and generation of audio. Dubbed "Photoshop-for-voice",[1] it was first previewed at theAdobe MAX event in November 2016. The technology shown at Adobe MAX was a preview that could potentially be incorporated intoAdobe Creative Cloud. It was later revealed that Voco was never meant to be released and was meant to be a research prototype.[2][3]
In 2023, Adobe introduced the ability to edit video by editing an AI-generated transcript of the video inPremiere Pro, demonstrating similar functionality to Voco.[4]
As the demo showed, the software takes approximately 20 minutes of the desired target's speech and generates a sound-alike voice includingphonemes that were not present in the target example material. Adobe stated Voco would lower the cost of audio production.[1][3]
Ethical and security concerns were raised over the ability to alter an audio recording to include words and phrases the original speaker never spoke, and the potential risk to voiceprintbiometrics.[1]
Concerns also rose that it may be used in conjunction with:
Adobe's lack of publicized progress opened opportunities for other projects to build alternative products to VOCO, such as Resemble AI and15.ai, a real-time text-to-speech tool using artificial intelligence.
WaveNet is a similar butopen-source research project at London-based artificial intelligence firmDeepMind, developed independently around the same time as Adobe Voco.
Thissimulation software article is astub. You can help Wikipedia byadding missing information. |