In music, timbre (/ˈtæmbər/ TAM-bər, also known as tone color or tone quality from psychoacoustics) is the perceived sound quality of a musical note, sound, or tone that distinguishes different types of sound production, such as choir voices and musical instruments, such as string instruments, wind instruments, and percussion instruments, and which enables listeners to hear even different instruments from the same category as different (e.g. a viola and a violin).
- ASA definition
- In music history
- Psychoacoustic evidence
- Tristimulus timbre model
The physical characteristics of sound that determine the perception of timbre include spectrum and envelope. Singers and instrumental musicians can change the timbre of the music they are singing/playing by using different singing or playing techniques. For example, a violinist can use different bowing styles or play on different parts of the string to obtain different timbres (e.g., playing sul tasto produces a light, airy timbre, whereas playing sul ponticello produces a harsh, even an aggressive tone). On electric guitar and electric piano, performers can change the timbre using effects units and graphic equalizers.
In simple terms, timbre is what makes a particular musical sound have a different sound from another, even when they have the same pitch and loudness. For instance, it is the difference in sound between a guitar and a piano playing the same note at the same volume. Both instruments can sound equally tuned in relation to each other as they play the same note, and while playing at the same amplitude level each instrument will still sound distinctively with its own unique tone color. Experienced musicians are able to distinguish between different instruments of the same type based on their varied timbres, even if those instruments are playing notes at the same pitch and loudness.
Tone quality and tone color are synonyms for timbre, as well as the "texture attributed to a single instrument". However, the word texture can also refer to the type of music, such as multiple, interweaving melody lines versus a singable melody accompanied by subordinate chords. Hermann von Helmholtz used the German Klangfarbe (tone color), and John Tyndall proposed an English translation, clangtint, but both terms were disapproved of by Alexander Ellis, who also discredits register and color for their pre-existing English meanings (Erickson 1975, 7). The sound of a musical instrument may be described with words such as bright, dark, warm, harsh, and other terms. There are also colors of noise, such as pink and white. In visual representations of sound, timbre corresponds to the shape of the image (Abbado 1988, 3).
The Acoustical Society of America (ASA) Acoustical Terminology definition 12.09 of timbre describes it as "that attribute of auditory sensation which enables a listener to judge that two nonidentical sounds, similarly presented and having the same loudness and pitch, are dissimilar", adding, "Timbre depends primarily upon the frequency spectrum, although it also depends upon the sound pressure and the temporal characteristics of the sound".
Timbre has been called, "...the psychoacoustician's multidimensional waste-basket category for everything that cannot be labeled pitch or loudness." (McAdams and Bregman 1979, 34; cf. Dixon Ward 1965, 55 and Tobias 1970, 409).
Many commentators have attempted to decompose timbre into component attributes. For example, J. F. Schouten (1968, 42) describes the, "elusive attributes of timbre", as "determined by at least five major acoustic parameters", which Robert Erickson (1975, 5) finds, "scaled to the concerns of much contemporary music":
- The range between tonal and noiselike character
- The spectral envelope
- The time envelope in terms of rise, duration, and decay (ADSR—attack, decay, sustain, release)
- The changes both of spectral envelope (formant-glide) and fundamental frequency (micro-intonation)
- The prefix, or onset of a sound, quite dissimilar to the ensuing lasting vibration
Erickson (1975, 6) gives a table of subjective experiences and related physical phenomena based on Schouten's five attributes:
See also "Psychoacoustic evidence" below.
The richness of a sound or note a musical instrument produces is sometimes described in terms of a sum of a number of distinct frequencies. The lowest frequency is called the fundamental frequency, and the pitch it produces is used to name the note, but the fundamental frequency is not always the dominant frequency. The dominant frequency is the frequency that is most heard, and it is always a multiple of the fundamental frequency. For example, the dominant frequency for the transverse flute is double the fundamental frequency. Other significant frequencies are called overtones of the fundamental frequency, which may include harmonics and partials. Harmonics are whole number multiples of the fundamental frequency, such as ×2, ×3, ×4, etc. Partials are other overtones. There are also sometimes subharmonics at whole number divisions of the fundamental frequency. Most instruments produce harmonic sounds, but many instruments produce partials and inharmonic tones, such as cymbals and other indefinite-pitched instruments.
When the tuning note in an orchestra or concert band is played, the sound is a combination of 440 Hz, 880 Hz, 1320 Hz, 1760 Hz and so on. Each instrument in the orchestra or concert band produces a different combination of these frequencies, as well as harmonics and overtones. The sound waves of the different frequencies overlap and combine, and the balance of these amplitudes is a major factor in the characteristic sound of each instrument.
William Sethares wrote that just intonation and the western equal tempered scale are related to the harmonic spectra/timbre of many western instruments in an analogous way that the inharmonic timbre of the Thai renat (a xylophone-like instrument) is related to the seven-tone near-equal tempered pelog scale in which they are tuned. Similarly, the inharmonic spectra of Balinese metallophones combined with harmonic instruments such as the stringed rebab or the voice, are related to the five-note near-equal tempered slendro scale commonly found in Indonesian gamelan music (Sethares 1998, 6, 211, 318).
The timbre of a sound is also greatly affected by the following aspects of its envelope: attack time and characteristics, decay, sustain, release (ADSR envelope) and transients. Thus these are all common controls on synthesizers. For instance, if one takes away the attack from the sound of a piano or trumpet, it becomes more difficult to identify the sound correctly, since the sound of the hammer hitting the strings or the first blast of the player's lips are highly characteristic of those instruments. The envelope is the overall amplitude structure of a sound, so called because the sound just "fits" inside its envelope: what this means should be clear from a time-domain display of almost any interesting sound, zoomed out enough that the entire waveform is visible.
In music history
The music of Debussy, composed during the last decades of the nineteenth and the first decades of the twentieth centuries, has been credited with elevating the role of timbre in music: "To a marked degree the music of Debussy elevates timbre to an unprecedented structural status; already in Prélude à l'après-midi d'un faune the color of flute and harp functions referentially" (Samson 1977, p. 195).
In rock music from the late 1960s to the 2000s, the timbre of specific sounds is important to a song. For example, in heavy metal music, the sonic impact of the heavily-amplified, heavily-distorted power chord played on electric guitar through very loud guitar amplifiers and rows of speaker cabinets is an essential part of the style's musical identity. You could not take the heavily-amplified electric guitar part and substitute it with the same notes played on a grand piano or pipe organ.
Often, listeners can identify an instrument, even at different pitches and loudness, in different environments, and with different players. In the case of the clarinet, acoustic analysis shows waveforms irregular enough to suggest three instruments rather than one. David Luce (1963, 16) suggests that this implies that, "Certain strong regularities in the acoustic waveform of the above instruments must exist which are invariant with respect to the above variables." However, Robert Erickson argues that there are few regularities and they do not explain our "...powers of recognition and identification." He suggests borrowing the concept of subjective constancy from studies of vision and visual perception (Erickson 1975, 11).
Psychoacoustic experiments from the 1960s onwards tried to elucidate the nature of timbre. One method involves playing pairs of sounds to listeners, then using a multidimensional scaling algorithm to aggregate their dissimilarity judgments into a timbre space. The most consistent outcomes from such experiments are that brightness or spectral energy distribution (Grey 1977), and the bite, or rate and synchronicity (Wessel 1979) and rise time (Lakatos 2000), of the attack are important factors.
Tristimulus timbre model
The concept of tristimulus originates in the world of color, describing the way three primary colors can be mixed together to create a given color. By analogy, the musical tristimulus measures the mixture of harmonics in a given sound, grouped into three sections. The first tristimulus measures the relative weight of the first harmonic; the second tristimulus measures the relative weight of the second, third, and fourth harmonics taken together; and the third tristimulus measures the relative weight of all the remaining harmonics (Peeters 2003; Pollard and Jansson 1982,):