Jump to content

Additive synthesis

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 71.169.180.195 (talk) at 23:39, 16 January 2012 (Definitions: A little more rigorous. Not saying that this is done.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Additive synthesis is a sound synthesis technique that creates timbre by adding sine waves together.[1] [2]

The timbre of musical instruments can be considered in the light of Fourier theory to consist of multiple harmonic or inharmonic partials or overtones. Each partial is a sine wave of different frequency and amplitude that swells and decays over time.

Additive synthesis generates its sound by adding the output of multiple sine wave generators. It may also be implemented using pre-computed wavetables or inverse Fast Fourier transforms.

Definitions

Harmonic additive synthesis is closely related to the concept of a Fourier series which is a way of expressing a periodic function as a sum of sinusoidal functions with frequencies equal to integer multiples of a common fundamental frequency. These sinusoids are called harmonics, sometimes overtones, and most generally, partialss. In general, a Fourier series contains an infinite number of sinusoidal components, with no upper limit to the frequency of the sinusoidal functions and includes a DC component (one with frequency of 0 Hz). Frequencies outside of the human audible range can be omitted in additive synthesis. As a result only a finite number of sinusoidal terms with frequencies that lie within the audible range are modeled in additive synthesis.

A waveform or function is said to be periodic if

for all and for some period .

The Fourier series of a periodic function is mathematically expressed as:

where

is the fundamental frequency of the waveform and equal the reciprocal of the period,
is the index over the summed harmonics, ranging from to
is the amplitude of the th harmonic,
is the phase offset of the th harmonic. atan2(&nbsp) is the four-quadrant arctangent function,

Being inaudible, the DC component, , and all components with frequencies higher than some finite limit, , shall be omitted in the expressions of additive synthesis.

Harmonic form

The simplest harmonic additive synthesis can be mathematically expressed as:

where is the synthesis output, , , and are the amplitude, frequency, and the phase offset of the th harmonic partial of a total of harmonic partials, and is the fundamental frequency of the waveform and the frequency of the musical note.

Time-dependent amplitudes

More generally, the amplitude of each harmonic can be prescribed as a function of time, , in which case the synthesis output is

Inharmonic form

Additive synthesis can also produce inharmonic sounds (which are non-periodic waveforms, within the time-frame given by the fundamental frequency) if the individual harmonics do not all have a frequency that is an integer multiple of the fundamental frequency.[3][4] Inharmonic additive synthesis can be described as

,

where is the constant frequency of th partial.

Time-dependent frequencies

In the general case, the instantaneous frequency of a sinusoid is the derivative (with respect to time) of the argument of the sine or cosine function. If this frequency is represented in Hz, rather than in angular frequency form, then this derivative is divided by . This is the case whether the partial is harmonic or inharmonic and whether its frequency is constant or time-varying.

In the most general form, the frequency of each non-harmonic partial is a function of time, , yielding

Broader definitions

Additive synthesis has been used as an umbrella term for the class of sound synthesis techniques that sum simple elements to create more complex timbres, even when the elements are not sine waves.[5][6] For example, F. Richard Moore listed additive synthesis as one of the "four basic categories" of sound synthesis along side subtractive synthesis, nonlinear synthesis, and physical modelling.[6] In this general sense, the pipe organ and the Hammond organ may be considered as additive synthesizers. Summation of principal components and Walsh functions have also been classified as additive synthesis.[7]

Implementation methods

Modern-day implementations of additive synthesis are mainly digital. (See section Discrete-time equations for the underlying discrete-time theory)

Oscillator bank synthesis

Additive synthesis can be implemented using a bank of sinusoidal oscillators, one for each partial.[1]

Wavetable synthesis

In the case of harmonic, quasi-periodic musical tones, wavetable synthesis can be as general as time-varying additive synthesis, but requires less computation during synthesis.[8] As a result, an efficient implementation of time-varying additive synthesis of harmonic tones can be accomplished by use of wavetable synthesis.

Group additive synthesis[9][10][11] is a method to group partials into harmonic groups (of differing fundamental frequencies) and synthesize each group separately with wavetable synthesis before mixing the results.

Inverse FFT synthesis

An inverse Fast Fourier Transform can be used to efficiently synthesize frequencies that evenly divide the transform period. By careful consideration of the DFT frequency domain representation it is also possible to efficiently synthesize time varying sinusoids of arbitrary frequency using a series of overlapped inverse Fast Fourier Transforms.[12]

Additive analysis/resynthesis

It is possible to analyze the frequency components of a recorded sound giving a "sum of sinusoids" representation. This representation can be re-synthesized using additive synthesis. One method of decomposing a sound into time varying sinusoidal partials is Fourier Transform-based McAulay-Quatieri Analysis.[13][14]

By modifying the sum of sinusoids representation, timbral alterations can be made prior to resynthesis. For example, a harmonic sound could be restructured to sound inharmonic, and vice versa. Sound hybridisation or "morphing" has been implemented by additive resynthesis.[15]

Additive analysis/resynthesis has been employed in a number of techniques including Sinusoidal Modelling,[16] Spectral Modelling Synthesis (SMS),[15] and the Reassigned Bandwidth-Enhanced Additive Sound Model.[17] Software that implements additive analysis/resynthesis includes: SPEAR,[18] LEMUR, LORIS,[19] and SMSTools.[20]

Relation to speech synthesis

During the 1970s additive synthesis was investigated in the context of speech synthesis research.[21]

Sinewave synthesis, a technique for synthesizing speech by replacing the formants (main bands of energy) with pure tone whistles may be considered a 'non-harmonic additive re-synthesis for speech'.[21][22]

Linear predictive coding (LPC) is an analysis and audio codec method for speech.[23][24][25] In the decoder of an LPC audio codec, subtractive synthesis using filters, or sinewave synthesis using oscillators, are utilized to re-synthesize speech.

Timeline of additive synthesizers

In the following timeline, dates in parentheses are the release dates of commercial synthesizers, or in case of unique experimental machines, completion dates.

The drawbars of the Hammond organ, used to control the amplitude of each harmonic
The drawbars on a Hammond organ, used to control the amplitude of each harmonic.

Historical precursors

Pipe organ (around 1450): Pipes of pipe organs can be combined in the manner of additive synthesis. The pipes that can be combined are mostly flute pipes, which create nearly sinusoidal or triangle waves. Pipes that generate other types of waveforms, for example square wave generating clarinet stops, are not suited to this purpose: they are used more as solo stops.

Telharmonium (1897): The Telharmonium, also known as the Dynamophone, was the first electronic musical instrument, predating the dynamic loudspeaker. The Telharmonium resembled and organ and allowed adjustment of the amplitudes of harmonics generated by tonewheels. Electronic amplification was not available at the time of its realization, which led to a bulky design using alternators. The first model weighed 7 tons.

Hammond organ (invented in 1934[26]): The Hammond organ is an electronic organ that uses nine drawbars to mix several harmonics, which are generated by a set of tonewheels.

pre-1970s

ANS Synthesizer (1958): The development of the ANS synthesizer was led by its inventor, Evgeny Murzin. The machine ran a bank of 576 sine wave oscillators over a span of 8 octaves (at 1/6 semitone intervals).[27] A second model (1964) extended the frequency range to 10 octaves [27] and is still functional.[28] The 720 oscillator frequencies are generated by 5 optical discs, photocells and band-pass amplifiers. Compositions are written in the form of spectrogram "scores" drawn on glass plates covered by black mastic, by scratching the mastic. Light that passes the scratches determines the oscillator amplitudes.[29]

NED Synclavier PSMT + V/PK

1970s

Alles Machine: The Bell Labs Digital Synthesizer, commonly known as the Alles Machine, was a real-time digital sound synthesis system,[30] developed at Bell Labs in 1970s following earlier non-real-time software experiments. It has been called the first true digital additive synthesizer.[31] Based on this technology, several commercial synthesizers were developed, including Crumar/DKI GDS (1980), a reduced version DK Synergy (1982), and the Atari AMY sound chip.

EMS Digital Oscillator Bank (DOB) and Analysing Filter Bank: According to the Inside Story of Electronic Music Studios (London), Ltd. (EMS),[32] in the early 1970s, David Cockerell and Peter Eastty developed a minicomputer controllable fully digital vocoder implementing real-time additive synthesis, following the previous analog version. It consisted of 64 digital oscillators with independent frequency and amplitude controls, 1024 point waveforms, and bank of 128 digital analysis filters to provide coefficients for the oscillators. (⇒See also #Additive analysis/resynthesis)

RMI Harmonic Synthesizer: Developed in 1974–76, the RMI Harmonic Synthesizer was an early digital additive / analog subtractive hybrid synthesizer marketed to consumers. The additive section consisted of 16 partials that could be mixed in an edit mode into a pre-calculated single-cycle stereo waveform. In play mode, the digital waveform was processed by analog modules such as a voltage-controlled filter controlled by an envelope generator, implementing subtractive synthesis.[33] RMI Harmonic Synthesizer was used by synth pioneer Jean Michel Jarre, for his albums Oxygène and Equinoxe.[34] A similar design (non-time-varying digital additive/analog subtractive hybrid synthesis) was also utilized in Oxford Synthesizer Company's OSCar (1983, 24 harmonics), Korg DSS-1 (1986, 128 harmonics), Kawai K3 (1986, 32 harmonics), Casio FZ-1 (1987, 48 harmonics).

New England Digital Synclavier: Synclavier was a programmable harmonic definable FM/additive synthesizer and sampler.[35][36] Initially, it was not a real additive synth[citation needed]: one can construct a patch defining 16 partials per voice[36] (as in the Casio FZ-1, Korg DSS-1, and Kawai K3) and apply dynamic enveloping, and FM operator with envelope, only with the partial timbre.[citation needed][clarification needed] Later, with the Synclavier software upgrade, one can specify several harmonic spectrums and crossfade between them in time.[citation needed]
Note that Synclavier's FM re-synthesis feature with fine grained time frame is sometimes considered[according to whom?][37] as equivalent to Wavetable synthesis, and Wavetable synthesis under some conditions is equivalent to time-varying additive synthesis. (⇒ See section Wavetable synthesis)

Fairlight Quasar (1975-1977) and Fairlight CMI (1979-1985): Built by Fairlight, Quasar and CMI implemented harmonic additive synthesis and FFT-based additive resynthesis.

1980s

Crumar/DKI General Development System (GDS, 1980) and DK Synergy (1982): Descendants of Alles Machine, GDS (marketed in the US by Digital Keyboards, Inc.) and Synergy are user-definable phase modulation semi-algorithmic synthesizers with additive capabilities, operating 32 digital oscillators.[38][39] This allows, for example, two-voice polyphony with 16 partials per voice.

Seiko Digital Sound System: Seiko introduced its Digital Sound System line of keyboard instruments in 1984, utilizing a 16-operator additive synthesis engine. Although the keyboards (the DS-101, DS-202, and DS-250) were not directly programmable, a separate programming device was available that allowed the user to create new presets.[40]

Kurzweil K150 (1986) [41]: The Kurzweil Music Systems K150 is an additive engine that trades off quantity of oscillators vs. polyphony and where one can program each partial individually with envelopes. Full programming (known as Fourier Synthesis option) is only possible using an old Apple II computer, and cannot be done from the front panel.

Kawai K5 (1987): While in Kurzweil K150 harmonics are controlled individually, in K5 manufactured by Kawai they are controlled in four groups. [42] The more recent Kawai K5000 is also an additive synth, but combined with samples.

Kawai K5000S

1990s

Kawai K5000: More contemporary popular implementations of additive synthesis includes the Kawai K5000 series of synthesizers in the 1990s.

2000–

2000s (decade) saw the advent of software synthesizers such as discoDSP Vertigo, Camel Audio Alchemy and Cameleon 5000, Image-Line Morphine, Harmless and Harmour[43] the VirSyn Cube, White Noise Audio Soft WNAdditive, and ConcreteFX Adder.

Discrete-time equations

This section reviews the theory underlying digital implementation of additive synthesis, appropriately described by discrete-time equations.

Harmonic form

In the discrete-time form the equation for harmonic additive synthesis (2) can be written as

where

is the output sample at discrete time ,
is the fundamental frequency of the waveform or the note frequency,
is the sampling frequency,
is an index over the summed harmonics, ranging from to
is the amplitude envelope of the th harmonic at discrete time ,
is a constant phase offset.

In discrete-time additive synthesis, the frequency of the highest harmonic must be less than the Nyquist frequency to prevent aliasing. This implies that .


If is constant, as in (1), the discrete-time equations can be written in a form suitable for generation with an inverse Fast Fourier transform,

,

where

and
.

Inharmonic form

The time-dependent frequency of a sinusoid at the time of sample can be defined in an implementation-friendly way as an angle increment

where is the angle increment at the time of sample . It is also the case that the angle is

.

By replacing the th harmonic frequency, , with a time-varying and general (not necessarily harmonic) frequency, (the time-dependent frequency of the th partial at the time of sample ), and generalising to a time-dependent phase offset , the equation for the sample output is

.

If with constant and is constant, all partials are harmonic and these more general equations reduce to the harmonic case above.

The time-dependent phase offset term of each partial can be absorbed into the time-dependent frequency term, by the substitution

.

If that substitution is made, all of the phase terms can be set to zero with no loss of generality (retaining the initial phase value at time ) and the expressions of inharmonic additive synthesis can be simplified to the discrete-time form of (3),

.

If this constant phase term (at time ) is expressed as , the general expression of additive synthesis can be further simplified:

,

where for all , and .

See also

References

  1. ^ a b Julius O. Smith III. "Additive Synthesis (Early Sinusoidal Modeling)". Retrieved 2012-01-14. The term "additive synthesis" refers to sound being formed by adding together many sinusoidal components Cite error: The named reference "JOS_Additive" was defined multiple times with different content (see the help page).
  2. ^ Gordon Reid. "Synth Secrets, Part 14: An Introduction To Additive Synthesis". Sound On Sound. Retrieved 2012-01-14.
  3. ^ Smith III, Julius O.; Serra, Xavier (2005), "Additive Synthesis", PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation, CCRMA, Department of Music, Stanford University, retrieved 2012-01-09 {{citation}}: Cite has empty unknown parameter: |1= (help); External link in |chapterurl= (help); Unknown parameter |chapterurl= ignored (|chapter-url= suggested) (help) (online reprint)
  4. ^ Smith III, Julius O. (2011), "Additive Synthesis (Early Sinusoidal Modeling)", Spectral Audio Signal Processing, CCRMA, Department of Music, Stanford University, ISBN 978-0-9745607-3-1, retrieved 2012-01-09 {{citation}}: External link in |chapterurl= (help); Unknown parameter |chapterurl= ignored (|chapter-url= suggested) (help)
  5. ^ Roads, Curtis (1995). The Computer Music Tutorial. MIT Press. p. 134. ISBN 0-262-68082-3.
  6. ^ a b Moore, F. Richard (1995). Foundations of Computer Music. Prentice Hall. p. 16. ISBN 0-262-68082-3.
  7. ^ Roads, Curtis (1995). The Computer Music Tutorial. MIT Press. pp. 150–153. ISBN 0-262-68082-3.
  8. ^ Robert Bristow-Johnson (November 1996). "Wavetable Synthesis 101, A Fundamental Perspective" (PDF).
  9. ^ Julius O. Smith III. "Group Additive Synthesis". CCRMA, Stanford University. Retrieved 2011-05-12.
  10. ^ P. Kleczkowski (1989). "Group additive synthesis". Computer Music Journal. 13 (1): 12–20.
  11. ^ B. Eaglestone and S. Oates (1990). "Proceedings of the 1990 International Computer Music Conference, Glasgow". Computer Music Association. {{cite journal}}: |chapter= ignored (help); Cite journal requires |journal= (help)
  12. ^ Rodet, X. (1992). "Spectral Envelopes and Inverse FFT Synthesis". Proceedings of the 93rd Audio Engineering Society Convention. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  13. ^ R. J. McAulay and T. F. Quatieri (Aug 1986), "Speech analysis/synthesis based on a sinusoidal representation", IEEE Transactions on Acoustics, Speech, Signal Processing ASSP-34: 744-754
  14. ^ McAulay-Quatieri Method
  15. ^ a b Serra, Xavier (1989). A System for Sound Analysis/Transformation/Synthesis based on a Deterministic plus Stochastic Decomposition (Ph.D. thesis). Stanford University. Retrieved 2012-01-13.
  16. ^ Julius O. Smith III, Xavier Serra. "PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation". Retrieved 2012-01-09.
  17. ^ Fitz, Kelly (1999). The Reassigned Bandwidth-Enhanced Method of Additive Synthesis (Ph.D. thesis). Dept. of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign. Retrieved 2012-01-13.
  18. ^ SPEAR Sinusoidal Partial Editing Analysis and Resynthesis for MacOS X, MacOS 9 and Windows
  19. ^ Loris Software for Sound Modeling, Morphing, and Manipulation
  20. ^ SMSTools application for Windows
  21. ^ a b Remez, R.E. (1981). "Speech perception without traditional speech cues". Science (212): 947–950. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  22. ^ Rubin, P.E. (1980). "Sinewave Synthesis Instruction Manual (VAX)" (PDF). Internal memorandum. Haskins Laboratories, New Haven, CT.
  23. ^ Kondoz, Ahmet M. (2004), Digital speech: coding for low bit rate communication systems, John Wiley and Sons, p. 65, ISBN 9780470870082
  24. ^ B. Atal and M. Schroeder (1970), "Adaptive predictive coding of speech signals", Bell Sys. Technical Journal (October 1970): 1973–87
  25. ^ J.Makhoul (1975), "Proceeding of IEEE", Linear prediction: A tutorial review', vol. 63, pp. 561–80
  26. ^ Laurens Hammond (Filing in Jan 19, 1934, accepted in Apr 24, 1934). "U.S. Pat. No. 1956350 – Electrical Musical Instrument". {{cite web}}: Check date values in: |date= (help)
  27. ^ a b "Евгений Мурзин и синтезатор АНС". 13 September 2009. Retrieved 13 January 2012.
  28. ^ "octopus" (21 October 2009). "АНС. коллективные действия". Retrieved 13 January 2012.
  29. ^ Derek Holzer (22 February 2010). "A brief history of optical synthesis". Retrieved 13 January 2012.
  30. ^ Alles, H.G. (1985). "A Portable Digital Sound-Synthesis System". In Roads, Curtis; Strawn, John (eds.). Foundations of Computer Music. MIT Press. pp. 244–249.
  31. ^ Joel Chadabe (1997). Electric Sound. Prentice Hall. p. 178. ISBN 978-0-13-303231-4.
  32. ^ Hinton, Graham (2002). "EMS: The Inside Story". Electronic Music Studios (Cornwall). {{cite web}}: Invalid |ref=harv (help) — story about the beginnings of EMS, studios, its non-commercial equipments, peoples and users.
  33. ^ "RMI Harmonic Synthesizer". Synthmuseum.com. Retrieved 2011-05-12.
  34. ^ "RMI Harmonic Synthesizer". Jarrography – The ultimate Jean Michel Jarre discography. Retrieved 2011-05-12.
  35. ^ "Synclavier Early History". Synclavier European Services. Retrieved 2011-05-12.
  36. ^ a b Risberg, Jeffrey S. (1980). "Digital Additive Synthesis for Computer Music". Audio Engineering Society Convention 66. {{cite book}}: External link in |chapterurl= (help); Unknown parameter |chapterurl= ignored (|chapter-url= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help)
  37. ^ Klaus Michael Indlekofer (Last updated 2007-08-18). "Multimedia - Synthesizers and Samplers - Frequency- and phase-modulation synthesis". {{cite web}}: Check date values in: |date= (help)
    — Also similar discussions can be found on Harmony Central Forum, Fixunix Forum, TableHooters, etc.
  38. ^ "Crumar/DKI  GDS System & Synergy". Synthony Music's Synth & Midi Museum. Retrieved 2011-05-12.
  39. ^ Aaron Lanterman (ed.). "Digital Keyboards Synergy Preservation Page".
  40. ^ "Seiko Digital Sound System catalog 1983" (in Japanese). Seiko. 1983. Retrieved 2011-05-12.
  41. ^ "Kurzweil K150". Vintage Synth Explorer. Retrieved 2011-05-12.
  42. ^ "Kawai K5". Vintage Synth Explorer. Retrieved 2011-05-12.
  43. ^ "Image-Line Morphine". Image Line Software BVBA. Retrieved 2011-05-12.