selig wrote:I'm not aware of any devices that can alter the sound of your voice, ala turning Tiny Tim into Frank Sinatra.
There are already devices that can "analyze the timbre dynamics and provide a GUI that can tweak relevant parameters. " and it's called an EQ.
I would fully expect synthesized speech to accomplish this goal much sooner than voice modification.
jappe wrote:
I agree that it could be simpler to do with synthesized speech, where for example dialect can be added rather than having to transform a possibly bad imitation into Elvis true voice.
(like it's sometimes easier to make an entirely new program than to transform a complex program into another.)
But if the synthesized speech can be done, then we'd only need to have phonem/speech detection of a voice in real time to feed that singing synthesizer with text, and we could have that dream device.
We already have speech to text, and we already have text to speech. We're just waiting for the quality to improve, right?
But here's something else to consider. It's the unique phrasing and intensity changes that can't be easily applied by a real time device. For someone wanting to use this effect, they would STILL have to put in a lot of work in learning to phrase like the singer they are emulating. Otherwise it wouldn't be worth the trouble cause if you don't have good phrasing there's little a voice emulator can do for you. That is to say, there are many more qualities that make a great vocal track beyond tone and pitch, but folks seem to assume that if you can just tune me and correct my tone, I'd be a fantastic singer - this would be true if you lack just only those two qualities, and nail the rest.
jappe wrote:I was vague when I mentioned timbre dynamics: I'm actually thinking about not static timbre, but instead catching spectral patterns of how the timbre changes over time or is dependent on other parameters like pitch or volume or tone duration. A device that works in the frequency domain, like Parsec...hmm..."Voicec"
To gather all possible intelligence from a singing voice, and make a smart interface to tweak interesting parameters without too much effort.
So when I want to tweak the timbre dynamics, I wan't the RE to make an analysis of clusters of frequencies that are related to each other (like if frequency A Increases Y times, then frequency B decreases Y x 2 time).
And after analysis, I want to have knobs to change relevant parameters for the identified change patterns, like for example Increasing/decreasing Y in the example above.
That and tons of other possible modifications.
Hmm...unsure if that made anything more clear
Yes, but it seems you are asking for something that would require almost as much training to pull off as learning to sing better in the first place IMO! At that level of complexity, you would also have to make it timeline based, which means it can't be an RE. You would have to introduce a new set of controls, concepts, and parameters that could be quite complex to someone who has never manipulated speech on this level before. With these comments I'm totally ignoring the limits on current technology that would likely prohibit such a device today, certainly as a real time effect.
And finally, I'm not sure I'd even care to listen to someone who doesn't have any vocal personality of their own - I'd probably rather listen to synthesized vocals if those were my only choices!
For me, the biggest thing I coach singers on in the studio is "believability". I don't want to hear someone READ the lyrics even if they're in tune and have great tone: I want to FEEL the lyrics. We can fix the rest!
There's very likely no device in our lifetime that will impart a believable feel on a lifeless vocal.
Some day, but not today (for which I'm thankful!).