Jump to content

Phase vocoder

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Mooncow (talk | contribs) at 15:24, 22 March 2011 (See also). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

A phase vocoder is a type of vocoder which can scale both the frequency and time domains of audio signals by using phase information. The computer algorithm allows frequency-domain modifications to a digital sound file (typically time expansion/compression and pitch shifting).

At the heart of the phase vocoder is the short-time Fourier transform (STFT), typically coded using fast Fourier transforms. The STFT converts a time domain representation of sound into a time-frequency representation (the "analysis" phase), allowing modifications to the amplitudes or phases of specific frequency components of the sound, before resynthesis of the frequency domain representation into the time domain by the inverse STFT. The time scale of the resynthesis does not have to be the same as the time scale of the analysis, allowing for high-quality time-scale modification of the original sound file.

Since the boundaries between each window need to be in phase, this method results in an audible "phase smearing" effect, especially on transients. (Similar to phase dispersion.)[citation needed] Many other methods have been developed that mix aspects of the phase vocoder with other methods, to produce similar effects while maintaining phase coherence at transients, such as Ircam's SuperVP.[1]

Use in music

British composer Trevor Wishart used phase vocoder analyses and transformations of a human voice as the basis for his composition VOX 5 (part of his larger VOX Cycle).[2] Transfigured Wind by American composer Roger Reynolds uses the phase vocoder to perform time-stretching of flute sounds.[3]

The proprietary Auto-Tune pitch-correcting software, widely used in commercial music production, is based on the phase vocoder principle.[citation needed]

References

  1. ^ http://anasynth.ircam.fr/home/english/software/supervp
  2. ^ Wishart, T. ‘The Composition of Vox 5’. Computer Music Journal 12/4, 1988
  3. ^ Serra, X. 'A System for Sound Analysis/Transformation/Synthesis based on Deterministic plus Stochastic Decomposition', p.12 (PhD Thesis 1989)

See also