Jump to content

Physical modelling synthesis

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Weeb Dingle (talk | contribs) at 19:26, 6 July 2017 (General methodology: basic editing). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Physical modelling synthesis refers to sound synthesis methods in which the waveform of the sound to be generated is computed using a mathematical model, a set of equations and algorithms to simulate a physical source of sound, usually a musical instrument.

General methodology

Modelling attempts to replicate laws of physics that govern sound production, and will typically have several parameters, some of which are constants that describe the physical materials and dimensions of the instrument, while others are time-dependent functions describing the player's interaction with the instrument, such as plucking a string, or covering toneholes.

For example, to model the sound of a drum, there would be a mathematical model of how striking the drumhead injects energy into a two-dimensional membrane. Incorporating this, a larger model would simulate the properties of the membrane (mass density, stiffness, etc.), its coupling with the resonance of the cylindrical body of the drum, and the conditions at its boundaries (a rigid termination to the drum's body), describing its movement over time and thus its generation of sound.

Similar stages to be modelled can be found in instruments such as a violin, though the energy excitation in this case is provided by the slip-stick behavior of the bow against the string, the width of the bow, the resonance and damping behavior of the strings, the transfer of string vibrations through the bridge, and finally, the resonance of the soundboard in response to those vibrations.

In addition, the same concept has been applied to simulate voice and speech sounds.[1] In this case, the synthesizer includes mathematical models of the vocal fold oscillation and associated laryngeal airflow, and the consequent acoustic wave propagation along the vocal tract. Further, it may also contain an articulatory model to control the vocal tract shape in terms of the position of the lips, tongue and other organs.

Although physical modelling was not a new concept in acoustics and synthesis, having been implemented using finite difference approximations of the wave equation by Hiller and Ruiz in 1971, it was not until the development of the Karplus-Strong algorithm, the subsequent refinement and generalization of the algorithm into the extremely efficient digital waveguide synthesis by Julius O. Smith III and others, and the increase in DSP power in the late 1980s[2] that commercial implementations became feasible.

Yamaha contracted with Stanford University in 1989[3] to jointly develop digital waveguide synthesis; subsequently, most patents related to the technology are owned by Stanford or Yamaha.

The first commercially available physical modelling synthesizer made using waveguide synthesis was the Yamaha VL1 in 1994.[4]

While the efficiency of digital waveguide synthesis made physical modelling feasible on common DSP hardware and native processors, the convincing emulation of physical instruments often requires the introduction of non-linear elements, scattering junctions, etc. In these cases, digital waveguides are often combined with FDTD,[5] finite element or wave digital filter methods, increasing the computational demands of the model.[6]

Technologies associated with physical modelling

Examples of physical modelling synthesis:

Hardware synthesizers

  • Korg OASYS and Korg Kronos – STR-1 Plucked string
  • Korg OASYS PCI
  • Korg Prophecy
  • Korg SOLO-TRI (an expansion board for the Trinity with the synth engine of the Prophecy)
  • Korg Z1
  • Korg MOSS-TRI (a expansion board for the Trinity with the synth engine of the Z1) and EXB-MOSS (a multi timbral expansion board for the Triton and the KARMA workstation with the synth engine of the Z1)
  • Yamaha VL1, VP1 and VL7
  • Yamaha VL70m, PLG-100VL and 150VL (VL70m in the form of a plug-in card that can be installed into any of several Yamaha keyboards, tone modules, and the SW1000XG high-end PC midi sound card)
  • Yamaha EX5, EX5R
  • Technics WSA1/WSA1R
  • Clavia Nord Modular G2
  • Alesis Fusion
  • Roland V-Piano
  • Pianoid
  • Physis Unico
  • Physis Piano (made in Italy, with a full touch controlled user interface)
  • Hartmann Neuron and Neuron VS

While not purely a hardware synth, the DS-XG sound cards based on the Yamaha YMF-7#4 family of audio chipsets (including 724, 744, 754, and 764), including the Yamaha WaveForce 192 (SW192XG) as well as many from other manufacturers and even some PC motherboards with such an audio chipset, included hardware-assisted software VL physical modelling (like a VL70m or PLG-VL, and compatible with same) along with the Yamaha XG, wave audio, and 3D gaming sound capabilities of the chipset. Unfortunately, only the VxD (Virtual Device Drivers) drivers for pre-NT kernel versions of Windows (3.x, 9#, and ME) support the physical modelling feature. Neither the WDM (Windows Device Model) drivers for Windows 98, 98SE, nor ME, nor any driver for any NT-kernel version of Windows (NT, 2000, XP, Vista, Windows 2003 Server, Windows 7, Windows 2008 Server, nor likely any future OSes) support this, nor can they due to OS limitations. Those OSes do support the other features of the card, though.

In their prime, the DS-XG sound cards were easily the most affordable way of obtaining genuine VL technology for anyone who already had a Windows 3.x, 9#, or ME PC. Such cards could be had brand new for as low as $12 USD (YMF-724 versions). But since they were not fully compatible with the AC-97 and later AC-98 standards, these chipsets faded from the market and have not been manufactured by Yamaha in nearly a decade.

Technics WSA1 and its rackmounted counterpart WSA1R was Technics' first and only try at high-end synthesizers. It featured 64 voices of polyphony with a combination of sample playback (for initial transients) and DSP acoustic modelling. Technics WSA1 was launched in 1995, but the musical community did not have enough confidence in Technics to buy a $5000 hardware synth. Only about 600 keyboards and 300 rack models were ever made, and most were sold at highly discounted prices.

Various Roland synthesizer models (V-Synth, V-Combo, XV-5080, Fantom, etc.), use a technology called COSM ("Composite Object Sound Modeling") which uses physical modeling techniques to more accurately replicate guitars, brass and other instruments. COSM has been superseded by "SuperNatural", which is also based on physical modeling techniques. Introduced first in 2008 as part of the ARX expansion boards for Fantom hardware synthesizers, "SuperNatural" modeling is used in Roland's V-Drums (TD-30, TD-15, TD-11), V-Accordions (FR-7, FR-8) and various synth models (Jupiter 80, Integra 7, FA-08, JD-Xi, etc.) Later this has been expanded to ACB ("Analogue Circuit Behaviour"), using similar physical modeling techniques as before, which were incorporated into Roland's latest line of hardware synthesizer products called "AIRA" (TB-3, System-1, System-1m, and most currently System-8 hardware synthesizers), as well as their 'Boutique' line of hardware modules (JP08, JX03 & JU06). While the Roland "ESC2" chip inside Roland's TD-30 and Integra-7 sound modules were marketed under "SuperNatural" modelling, the same "ESC2" chip inside latest Roland "AIRA" Products (System-1, System-1m, System-8) is currently marketed under "ACB" modelling technology. Hence, both "ACB" and "SuperNatural" physical modelling techniques are based on the very same Roland "ESC2" chip.

References

  • Hiller, L.; Ruiz, P. (1971). "Synthesizing Musical Sounds by Solving the Wave Equation for Vibrating Objects". Journal of the Audio Engineering Society.
  • Karplus, K.; Strong, A. (1983). "Digital synthesis of plucked string and drum timbres". Computer Music Journal. 7 (2). Computer Music Journal, Vol. 7, No. 2: 43–55. doi:10.2307/3680062. JSTOR 3680062.
  • Cadoz, C.; Luciani A; Florens JL (1993). "CORDIS-ANIMA : a Modeling and Simulation System for Sound and Image Synthesis: The General Formalism". Computer Music Journal. 17/1 (1). Computer Music Journal, MIT Press 1993, Vol. 17, No. 1.

Footnotes

  1. ^ Englert, Marina; Madazio, Glaucya; Gielow, Ingrid; Lucero, Jorge; Behlau, Mara. "Perceptual Error Analysis of Human and Synthesized Voices". Journal of Voice. doi:10.1016/j.jvoice.2016.12.015.
  2. ^ Vicinanza , D: Astra Project. http://www.astraproject.org/project.html, 2007.
  3. ^ Johnstone, B: Wave of the Future. http://www.harmony-central.com/Computer/synth-history.html, 1993.
  4. ^ Wood, S G: Objective Test Methods for Waveguide Audio Synthesis. Masters Thesis - Brigham Young University, http://contentdm.lib.byu.edu/cdm4/item_viewer.php?CISOROOT=/ETD&CISOPTR=976&CISOBOX=1&REC=19, 2007.
  5. ^ The NESS project http://www.ness.music.ed.ac.uk
  6. ^ C. Webb and S. Bilbao, "On the limits of real-time physical modelling synthesis with a modular environment" http://www.physicalaudio.co.uk