Additive synthesis
This article needs additional citations for verification. (March 2011) |
Additive synthesis is a sound synthesis technique that creates timbre by adding sine waves together.[1] [2]
The timbre of musical instruments can be considered in the light of Fourier theory to consist of multiple harmonic or inharmonic partials or overtones. Each partial is a sine wave of different frequency and amplitude that swells and decays over time.
Additive synthesis generates its sound by adding the output of multiple sine wave generators. It may also be implemented using pre-computed wavetables or inverse Fast Fourier transforms.
Theory
Harmonic additive synthesis is closely related to the concept of a Fourier series. A Fourier series is a way of expressing a periodic function as a sum of sinusoidal waves with frequencies equal to integer multiples of a common fundamental frequency; these sinusoidal waves are called harmonics. Harmonic additive synthesis can be mathematically expressed as:
- ,
where is the synthesis output, , , and are the amplitude, frequency, and the phase offset of the th harmonic partial of a total of harmonic partials, and is the fundamental frequency of the tone.
More generally, the amplitude of each harmonic can be prescribed as a function of time, , in which case the synthesis output is
- .
Additive synthesis can also produce inharmonic sounds (which are non-periodic waveforms, within the time-frame given by the fundamental frequency) if the individual harmonics do not all have a frequency that is an integer multiple of the fundamental frequency.[3][4] Inharmonic additive synthesis can be described as
- ,
where is the constant frequency of th partial.
In the general case, the instantaneous frequency of a sinusoid is the derivative (with respect to time) of the argument of the sine or cosine function. If this frequency is represented in Hz, rather than in angular frequency form, then this derivative is divided by . This is the case whether the partial is harmonic or inharmonic and whether its frequency is constant or time-varying.
In the most general form, the frequency of each non-harmonic partial is a function of time, , yielding
- .
Modern-day implementations of additive synthesis are mainly digital, and therefore more appropriately described by discrete-time equations.
Harmonic partials
In the discrete-time form the equation for harmonic additive synthesis can be written as
or, equivalently,
- ,
where
- is the output sample at discrete time ,
- is the fundamental frequency of the waveform or the note frequency,
- is the sampling frequency,
- is an index over the summed harmonics, ranging from to
- is the amplitude envelope of the th harmonic at discrete time ,
- is a time-dependent phase offset. atan2( ) is the four-quadrant arctan function.
While the frequency term is constant, a non-constant phase offset will detune the frequency of the partial slightly away from the exact harmonic value. The above equations can be re-written with a time-dependent frequency term and a constant phase term, as will be shown below in the context of non-harmonic partials. In discrete-time additive synthesis, the frequency of the highest harmonic must be less than the Nyquist frequency to prevent aliasing. If the phase terms are constant, this implies that . The DC term is often taken to be zero.
Inharmonic partials
The time-dependent frequency of a sinusoid at the time of sample can be defined in an implementation-friendly way as an angle increment
where is the angle increment at the time of sample . It is also the case that the angle is
- .
By replacing the th harmonic frequency, , with a time-varying and general (not necessarily harmonic) frequency, (the time-dependent frequency of the th partial at the time of sample ), the synthesized output (omitting the DC term) is given by
or
- .
If with constant , all partials are harmonic and these more general equations reduce to the harmonic case above.
The phase offset term of each partial can be absorbed into the time-dependent frequency term, by the substitution
- .
If that substitution is made, all of the phase terms can be set to zero with no loss of generality (retaining the initial phase value at time ) and the expressions of inharmonic additive synthesis can be simplified to
- .
If this constant phase term (at time ) is expressed as , the general expression of additive synthesis can be further simplified:
- ,
where for all , and .
Implementation methods
Oscillator bank synthesis
Additive synthesis can be implemented using a bank of sinusoidal oscillators, one for each partial.[1]
Wavetable synthesis
In the case of harmonic, quasi-periodic musical tones, wavetable synthesis can be as general as time-varying additive synthesis, but requires less computation during synthesis.[5] As a result, an efficient implementation of time-varying additive synthesis of harmonic tones can be accomplished by use of wavetable synthesis.
Group additive synthesis[6][7][8] is a method to group partials into harmonic groups (of differing fundamental frequencies) and synthesize each group separately with wavetable synthesis before mixing the results.
Inverse FFT synthesis
An inverse Fast Fourier Transform can be used to efficiently synthesize frequencies that evenly divide the transform period. By careful consideration of the DFT frequency domain representation it is also possible to efficiently synthesize time varying sinusoids of arbitrary frequency using a series of overlapped inverse Fast Fourier Transforms.[9]
Additive analysis/resynthesis
It is possible to analyze the frequency components of a recorded sound giving a "sum of sinusoids" representation. This representation can be re-synthesized using additive synthesis. One method of decomposing a sound into time varying sinusoidal partials is Fourier Transform-based McAulay-Quatieri Analysis.[10][11]
By modifying the sum of sinusoids representation, timbral alterations can be made prior to resynthesis. For example, a harmonic sound could be restructured to sound inharmonic, and vice versa. Sound hybridisation or "morphing" has been implemented by additive resynthesis.[12]
Additive analysis/resynthesis has been employed in a number of techniques including Sinusoidal Modelling,[13] Spectral Modelling Synthesis (SMS),[12] and the Reassigned Bandwidth-Enhanced Additive Sound Model.[14] Software that implements additive analysis/resynthesis includes: SPEAR,[15] LEMUR, LORIS,[16] and SMSTools.[17]
Relation to speech synthesis
During the 1970s additive synthesis was investigated in the context of speech synthesis research.[18]
Sinewave synthesis, a technique for synthesizing speech by replacing the formants (main bands of energy) with pure tone whistles may be considered a 'non-harmonic additive re-synthesis for speech'.[18][19]
Linear predictive coding (LPC) is an analysis and audio codec method for speech. [20][21][22] In the decoder of an LPC audio codec, subtractive synthesis using filters, or sinewave synthesis using oscillators, are utilized to re-synthesize speech.
Broader definitions of additive synthesis
Additive synthesis has been used as an umbrella term for the class of sound synthesis techniques that sum simple elements to create more complex timbres, even when the elements are not sine waves.[23][24] For example, F. Richard Moore listed additive synthesis as one of the "four basic categories" of sound synthesis along side subtractive synthesis, nonlinear synthesis, and physical modelling.[24] In this general sense, the pipe organ and the Hammond organ may be considered as additive synthesizers. Summation of principle components and Walsh functions have also been classified as additive synthesis.[25]
Timeline of additive synthesizers
![]() |
In the following timeline, dates in parentheses indicate when a commercial synthesizer was released, or in case of an experimental machine, completed.
![]() |
Historical precursors
Pipe organs (around 1450) allow the combination of pipes in the manner of additive synthesis. The pipes that can be combined this way are mostly flute pipes, which create nearly sinusoidal or triangle waves. Pipes that generate other types of waveforms, for example square wave generating clarinet stops, are not suited to this purpose: they are used more as solo stops.
Telharmonium (1897) was an organ-like electronic musical instrument that allowed adjusting the strength of harmonics, generated by tonewheels. Electronic amplification was not available at the time of its realization, which led to a bulky design using alternators. The first model weighed 7 tons.
Hammond organ, invented in 1934[26], is an electronic organ that uses nine drawbars to mix several harmonics, which are generated by a set of tonewheels.
pre-1970s
ANS synthesizer (1958) was invented by Evgeny Murzin. The synthesizer ran a bank of 576 sine wave oscillators over a span of 8 octaves (at 1/6 semitone intervals).[27] A second model (1964) extended the frequency range to 10 octaves [27] and is still functional.[28] The 720 oscillator frequencies are generated by 5 optical discs, photocells and band-pass amplifiers. Compositions are written in the form of spectrogram "scores" drawn on glass plates covered by black mastic, by scratching the mastic. Light that passes the scratches determines the oscillator amplitudes.[29]
![]() |
1970s
Bell Labs Digital Synthesizer (Alles Machine) was a real-time digital sound synthesis system,[30] developed at Bell Labs in 1970s following earlier non-real-time software experiments. It has been called the first true digital additive synthesizer.[31] Based on this technology, several commercial synthesizers were developed, including Crumar/DKI GDS (1980), a reduced version DK Synergy (1982), and the Atari AMY sound chip.
EMS Digital Oscillator Bank (DOB) and Analysing Filter Bank: According to the Inside Story of Electronic Music Studios (London), Ltd. (EMS),[32] in the early 1970s, David Cockerell and Peter Eastty developed a minicomputer controllable fully digital vocoder implementing real-time additive synthesis, following the previous analog version. It consisted of 64 digital oscillators with independent frequency and amplitude controls, 1024 point waveforms, and bank of 128 digital analysis filters to provide coefficients for the oscillators. (⇒See also #Additive analysis/resynthesis)
RMI Harmonic Synthesizer (1974-76) was an early digital additive / analog subtractive hybrid synthesizer marketed to consumers. The additive section consisted of 16 partials that could be mixed in an edit mode into a pre-calculated single-cycle stereo waveform. In play mode, the digital waveform was processed by analog modules such as a voltage-controlled filter controlled by an envelope generator, implementing subtractive synthesis.[33] RMI Harmonic Synthesizer was used by synth pioneer Jean Michel Jarre, for his albums Oxygène and Equinoxe.[34] A similar design (non-time-varying digital additive/analog subtractive hybrid synthesis) was also utilized in Oxford Synthesizer Company's OSCar (1983, 24 harmonics), Korg DSS-1 (1986, 128 harmonics), Kawai K3 (1986, 32 harmonics), Casio FZ-1 (1987, 48 harmonics).
New England Digital Synclavier was a programmable harmonic definable FM/additive synthesizer and sampler.[35][36] Initially, it was not a real additive synth[citation needed]: one can construct a patch defining 16 partials per voice[36] (as in the Casio FZ-1, Korg DSS-1, and Kawai K3) and apply dynamic enveloping, and FM operator with envelope, only with the partial timbre.[citation needed][clarification needed] Later, with the Synclavier software upgrade, one can specify several harmonic spectrums and crossfade between them in time.[citation needed]
Note that Synclavier's FM re-synthesis feature with fine grained time frame is sometimes considered[according to whom?][37] as equivalent to Wavetable synthesis, and Wavetable synthesis under some conditions is equivalent to time-varying additive synthesis. (⇒ See section Wavetable synthesis)
Fairlight Quasar (1975-1977) and Fairlight CMI (1979-1985) implemented harmonic additive synthesis and FFT-based additive resynthesis.
1980s
Crumar/DKI General Development System (GDS, 1980) and DK Synergy (1982), a descendant of Alles Machine, are user definable PM (as FM) semi algorithmic with additive capabilities, 32 digital oscillators synthesizers.[38][39] This allows, for example, two voice polyphony with 16 partials per voice.
Seiko introduced its Digital Sound System line of keyboard instruments in 1984, utilizing a 16-operator additive synthesis engine. Although the keyboards (the DS-101, DS-202, and DS-250) were not directly programmable, a separate programming device was available that allowed the user to create new presets.[40]
Kurzweil K150 (1986) [41] is an additive engine that trades off quantity of oscillators vs. polyphony and where one can program each partial individually with envelopes. Full programming (known as Fourier Synthesis option) is only possible using an old Apple II computer, and cannot be done from the front panel.
Kawai K5 (1987) does the same but is a simplification, being able to control only 4 groups of harmonics and not each one.[42] The more recent Kawai K5000 is also an additive synth, but combined with samples.
![]() |
1990s
More contemporary popular implementations of additive synthesis include the Kawai K5000 series of synthesizers in the 1990s.
2000–
2000s (decade) saw the advent of software synthesizers such as discoDSP Vertigo, Camel Audio Alchemy and Cameleon 5000, Image-Line Morphine, Harmless and Harmour[43] the VirSyn Cube, White Noise Audio Soft WNAdditive, and ConcreteFX Adder.
See also
References
- ^ a b
Julius O. Smith III. "Additive Synthesis (Early Sinusoidal Modeling)". Retrieved 2012-01-14.
The term "additive synthesis" refers to sound being formed by adding together many sinusoidal components
Cite error: The named reference "JOS_Additive" was defined multiple times with different content (see the help page). - ^ Gordon Reid. "Synth Secrets, Part 14: An Introduction To Additive Synthesis". Sound On Sound. Retrieved 2012-01-14.
- ^
Smith III, Julius O.; Serra, Xavier (2005), "Additive Synthesis", PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation, CCRMA, Department of Music, Stanford University, retrieved 2012-01-09
{{citation}}
: Cite has empty unknown parameter:|1=
(help); External link in
(help); Unknown parameter|chapterurl=
|chapterurl=
ignored (|chapter-url=
suggested) (help) (online reprint) - ^
Smith III, Julius O. (2011), "Additive Synthesis (Early Sinusoidal Modeling)", Spectral Audio Signal Processing, CCRMA, Department of Music, Stanford University, ISBN 978-0-9745607-3-1, retrieved 2012-01-09
{{citation}}
: External link in
(help); Unknown parameter|chapterurl=
|chapterurl=
ignored (|chapter-url=
suggested) (help) - ^ Robert Bristow-Johnson (November 1996). "Wavetable Synthesis 101, A Fundamental Perspective" (PDF).
- ^ Julius O. Smith III. "Group Additive Synthesis". CCRMA, Stanford University. Retrieved 2011-05-12.
- ^ P. Kleczkowski (1989). "Group additive synthesis". Computer Music Journal. 13 (1): 12–20.
- ^ B. Eaglestone and S. Oates (1990). "Proceedings of the 1990 International Computer Music Conference, Glasgow". Computer Music Association.
{{cite journal}}
:|chapter=
ignored (help); Cite journal requires|journal=
(help) - ^
Rodet, X. (1992). "Spectral Envelopes and Inverse FFT Synthesis". Proceedings of the 93rd Audio Engineering Society Convention.
{{cite journal}}
: Unknown parameter|coauthors=
ignored (|author=
suggested) (help) - ^ R. J. McAulay and T. F. Quatieri (Aug 1986), "Speech analysis/synthesis based on a sinusoidal representation", IEEE Transactions on Acoustics, Speech, Signal Processing ASSP-34: 744-754
- ^ McAulay-Quatieri Method
- ^ a b Serra, Xavier (1989). A System for Sound Analysis/Transformation/Synthesis based on a Deterministic plus Stochastic Decomposition (Ph.D. thesis). Stanford University. Retrieved 2012-01-13.
- ^ Julius O. Smith III, Xavier Serra. "PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation". Retrieved 2012-01-09.
- ^ Fitz, Kelly (1999). The Reassigned Bandwidth-Enhanced Method of Additive Synthesis (Ph.D. thesis). Dept. of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign. Retrieved 2012-01-13.
- ^ SPEAR Sinusoidal Partial Editing Analysis and Resynthesis for MacOS X, MacOS 9 and Windows
- ^ Loris Software for Sound Modeling, Morphing, and Manipulation
- ^ SMSTools application for Windows
- ^ a b
Remez, R.E. (1981). "Speech perception without traditional speech cues". Science (212): 947–950.
{{cite journal}}
: Unknown parameter|coauthors=
ignored (|author=
suggested) (help) - ^ Rubin, P.E. (1980). "Sinewave Synthesis Instruction Manual (VAX)" (PDF). Internal memorandum. Haskins Laboratories, New Haven, CT.
- ^ Kondoz, Ahmet M. (2004), Digital speech: coding for low bit rate communication systems, John Wiley and Sons, p. 65, ISBN 9780470870082
- ^ B. Atal and M. Schroeder (1970), "Adaptive predictive coding of speech signals", Bell Sys. Technical Journal (October 1970): 1973–87
- ^ J.Makhoul (1975), "Proceeding of IEEE", Linear prediction: A tutorial review', vol. 63, pp. 561–80
- ^ Roads, Curtis (1995). The Computer Music Tutorial. MIT Press. p. 134. ISBN 0-262-68082-3.
- ^ a b Moore, F. Richard (1995). Foundations of Computer Music. Prentice Hall. p. 16. ISBN 0-262-68082-3.
- ^ Roads, Curtis (1995). The Computer Music Tutorial. MIT Press. pp. 150–153. ISBN 0-262-68082-3.
- ^
Laurens Hammond (Filing in Jan 19, 1934, accepted in Apr 24, 1934). "U.S. Pat. No. 1956350 – Electrical Musical Instrument".
{{cite web}}
: Check date values in:|date=
(help) - ^ a b "Евгений Мурзин и синтезатор АНС". 13 September 2009. Retrieved 13 January 2012.
- ^ "octopus" (21 October 2009). "АНС. коллективные действия". Retrieved 13 January 2012.
- ^ Derek Holzer (22 February 2010). "A brief history of optical synthesis". Retrieved 13 January 2012.
- ^ Alles, H.G. (1985). "A Portable Digital Sound-Synthesis System". In Roads, Curtis; Strawn, John (eds.). Foundations of Computer Music. MIT Press. pp. 244–249.
- ^ Joel Chadabe (1997). Electric Sound. Prentice Hall. p. 178. ISBN 978-0-13-303231-4.
- ^
Hinton, Graham (2002). "EMS: The Inside Story". Electronic Music Studios (Cornwall).
{{cite web}}
: Invalid|ref=harv
(help) — story about the beginnings of EMS, studios, its non-commercial equipments, peoples and users. - ^ "RMI Harmonic Synthesizer". Synthmuseum.com. Retrieved 2011-05-12.
- ^ "RMI Harmonic Synthesizer". Jarrography – The ultimate Jean Michel Jarre discography. Retrieved 2011-05-12.
- ^ "Synclavier Early History". Synclavier European Services. Retrieved 2011-05-12.
- ^ a b
Risberg, Jeffrey S. (1980). "Digital Additive Synthesis for Computer Music". Audio Engineering Society Convention 66.
{{cite book}}
: External link in
(help); Unknown parameter|chapterurl=
|chapterurl=
ignored (|chapter-url=
suggested) (help); Unknown parameter|coauthors=
ignored (|author=
suggested) (help) - ^
Klaus Michael Indlekofer (Last updated 2007-08-18). "Multimedia - Synthesizers and Samplers - Frequency- and phase-modulation synthesis".
{{cite web}}
: Check date values in:|date=
(help)
— Also similar discussions can be found on Harmony Central Forum, Fixunix Forum, TableHooters, etc. - ^ "Crumar/DKI GDS System & Synergy". Synthony Music's Synth & Midi Museum. Retrieved 2011-05-12.
- ^ Aaron Lanterman (ed.). "Digital Keyboards Synergy Preservation Page".
- ^ "Seiko Digital Sound System catalog 1983" (in Japanese). Seiko. 1983. Retrieved 2011-05-12.
- ^ "Kurzweil K150". Vintage Synth Explorer. Retrieved 2011-05-12.
- ^ "Kawai K5". Vintage Synth Explorer. Retrieved 2011-05-12.
- ^ "Image-Line Morphine". Image Line Software BVBA. Retrieved 2011-05-12.