by Y-my-R » Wed Feb 07, 2024 2:42 am
It's entirely possible, that the algorithms used in in Melodyne or Autotune have their "math" be better suited for one sample rate over another. I do use Melodyne Assistant, occasionally, but haven't noticed added noise.
How do you move the audio from the DAW (or recorder) to Melodyne? Were those signals ever converted between 44.1 and 48 kHz in the process? If so, that conversion "could" be causing the added noise.
When I use Melodyne, it's usually via direct ARA integration between Studio One and Melodyne. Again, I haven't noticed any noise being added, but I have also never compared this to how it is when using 48 kHz, because I don't usually use that sample rate.
If there indeed IS an audible difference and Melodyne/Autotune etc. DO sound better when working with 48 kHz, then that IS of course a very good reason to use 48 kHz. I just never heard about that, and never noticed any added noise when working in 44.1 kHz.
Incidentally, I am at the exact point in the project I'm working on right now, where the next step is to fine-tune the vocals with Melodyne. I recorded all that in 44.1 kHz/24-bit though... I'll pay very close attention while doing the pitch edits, if there's any noise added in the background... and if I do notice that, will do another test recording at 48 kHz and see if that's better (it's my own vocals I need to tune a little, so "that singer" is always available, hahaha).
But just to further comment on why I said what I said... let's say you have 48 "samples" for easier comparison (instead of 48,000), that are represented by the following "dots":
................................................
but you need to convert this to 44(-ish) samples, but the time-line (aka the length of the line above) needs to stay exactly the same. I can't do this in this text editor, but for comparison sake, here are 44 dots in a "bold" font, in the hope that this will come out longer (sorry... can't do the .1 dot in this example):
............................................
Hopefully it's visible that these won't align if the 44 dots need to fill out the same length/distance. Here the same thing again, below one another, with the 44 dots on the bottom in bold, so they hopefully use up more space:
................................................
............................................
[Hmm... I just submitted and took a look... unfortunately, the 44 dots in bold don't make it much longer... just try to imagine how the spaces between the dots in the lower line, would have to get larger to make it just as long as the upper line, and think about how those dots will NOT align above/below each other].
If you try to "press" the 48 dots, into the 44 dot shape, but at the over-all same length (i.e. "time" in the real world) a lot of the dots will overlap or fall in-between dots, etc.
...and that's what's happening when you resample 48 kHz to 44.1 kHz - but every time there's a partial overlap, every such overlapping "new" dot position, adds a little bit of noise, that is usually called "quantization noise."
Or in other words, if you overlay that nice square wave that makes up the word clock and sample frequency at 48 kHz with a signal at 44.1 kHz, it doesn't match up at all. And to put them in the right spots, you need to "hack" the square-wave apart, because you can't just shift the timing of each of those samples around and drop a few.
All that "in-between" placement or "chopping up" of the previously nice and even word clock chain of pulses, now gets some noise added, because the math is not even between 48000 and 44100, and that creates noise.
I mean... I hope Melodyne or other "tuning" apps like that don't resample internally. That would explain the added noise. But as mentioned before, I could totally imagine that the Melodyne algorythms work better, "math-wise" with one sample rate over another. But I guess only the Melodyne guys would know (... not sure if they'd admit that, though, if so).
But again, if it is as you describe (less noise added at 48 kHz when using "tuning" apps like this), then that is absolutely a valid reason to work in 48 kHz. Especially if you stack a lot of tracks, where the noise would get stacked as well, and potentially become a lot more audible, than when doing a single sample rate conversion from the stereo-master, when the mix is done.
Anyway... I don't want to come across like an annoying know-it-all, and totally stand corrected if there's reasons like that, to record in 48 kHz. But the resampling from 48 kHz to 44.1 kHz for "for CD" productions, would otherwise diminish the signal more, than the theoretical quality gain at the "higher" sample rate of 48 kHz.
It's just a half-tone or so of difference, if you play a 44.1 kHz sample at 48 kHz (without conversion) or vice versa. So, that difference is not going to capture the sort of "extra harmonic overtones" or make the waveform representation smoother to a point where this "could" matter, such as when using much higher frequencies such as 88.1, 96 or even 192 kHz.
From that point of view, the difference between 44.1 and 48 kHz is meaningless.
...if there are more artifacts at 44.1 when doing certain types of edits, that is, of course, totally valid, though. I haven't noticed myself, yet, but will try to pay attention when I do the upcoming vocal edits/tuning.