Trisha Shetty (Editor)

44,100 Hz

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
44,100 Hz

In digital audio, 44,100 Hz (alternately represented as 44.1 kHz) is a common sampling frequency. Analog audio is recorded by sampling it 44,100 times per second, and then these samples are used to reconstruct the audio signal when playing it back.

Contents

44.1 kHz audio is widely used, due to this being the sampling rate used in Compact Discs, dating back to its use by Sony from 1979.

History

The 44.1 kHz sampling rate originated in the late 1970s with PCM adaptors, which recorded digital audio on video cassettes, notably the Sony PCM-1600 (1979) and subsequent models in this series. This then became the basis for Compact Disc digital audio (CD-DA), defined in the Red Book standard (1980). Its use has continued as an option in 1990s standards such as the DVD, and in 2000s, standards such as HDMI. This sampling frequency is commonly used for MP3 and other consumer audio file formats which were originally created from material ripped from Compact Discs.

Why 44.1 kHz?

The rate was chosen following debate between manufacturers, notably Sony and Philips, and its implementation by Sony, yielding a de facto standard. The technical reasoning behind the rate being chosen is as follows.

Human hearing and signal processing

Nyquist–Shannon sampling theorem says the sampling frequency must be greater than twice the maximum frequency one wishes to reproduce. Since human hearing range is roughly 20 Hz to 20,000 Hz, the sampling rate had to be greater than 40 kHz.

In addition, signals must be low-pass filtered before sampling to avoid aliasing. While an ideal low-pass filter would perfectly pass frequencies below 20 kHz (without attenuating them) and perfectly cut off frequencies above 20 kHz, in practice a transition band is necessary, where frequencies are partly attenuated. The wider this transition band is, the easier and more economical it is to make an anti-aliasing filter. The 44.1 kHz sampling frequency allows for a 2.05 kHz transition band.

In addition, 44,100 is the product of the squares of the first four prime numbers (2^2 * 3^2 * 5^2 * 7^2) and hence has many useful small factors.

Recording on video equipment

Early digital audio was recorded to existing analog video cassette tapes, as VCRs were the only available transports with sufficient capacity to store meaningful lengths of audio. To enable reuse with minimal modification of the video equipment, these ran at the same speed as video, and used much of the same circuitry. 44.1 kHz was deemed the highest usable rate meeting the following criteria

  • Compatible with both PAL and NTSC video
  • Requires encoding no more than 3 samples per video line per audio channel
  • Conclusion

    The actual choice of rate was the point of some debate, with other alternatives including 44,100/1.001 ≈ 44.056 kHz (corresponding to the NTSC color field rate of 60/1.001 = 59.94 Hz) or approximately 44 kHz, proposed by Philips. Ultimately Sony prevailed on both sample rate (44.1 kHz) and bit depth (16 bits per sample, rather than 14 bits per sample).

    The sample rate is composed as follows:

    NTSC:

    245 × 60 × 3 = 44,100 245 active lines/field × 60 fields/second × 3 samples/line = 44,100 samples/second (490 active lines per frame, out of 525 lines total)

    PAL:

    294 × 50 × 3 = 44,100 294 active lines/field × 50 fields/second × 3 samples/line = 44,100 samples/second (588 active lines per frame, out of 625 lines total)

    In actual practice, different machines used different video standards – for example, the Sony PCM-1610 only used 525/60 monochrome video (NTSC, US), not 625/50 (PAL, Europe) or NTSC color.

    Alternative rates

    Several other sampling rates were also used in early digital audio, most significantly 48 kHz, discussed below in status.

    Earlier rates included a 50 kHz sample rate, used by Soundstream (by Thomas Stockham) in the 1970s, following a 37 kHz prototype.

    In the early 1980s, a 32 kHz sampling rate was used in broadcast (esp. in UK and Japan), because this was sufficient for FM stereo broadcasts, which had 15 kHz bandwidth.

    Some digital audio was provided for domestic use in two incompatible EIAJ formats, corresponding to 525/59.94 (44,056 Hz sampling) and 625/50 (44.1 kHz sampling).

    Lastly, in what appears to be a coincidence, the 44.1 kHz sampling rate is exactly 4 times the line frequency of the old 441 lines German TV standard, which had a frequency of 441 × 50 ÷ 2 = 11,025 Hz (441 lines per frame, 50 fields per second, 2 fields per frame).

    Various multiples of 44.1 kHz are used – the lower rates 11.025 kHz and 22.05 kHz are found in WAV files, and are suitable for low-bandwidth applications, while the higher rates of 88.2 kHz and 176.4 kHz are used in mastering and in DVD-Audio – the higher rates are useful both for the usual reason of providing additional resolution (hence less sensitive to distortions introduced by editing), and also making the low-pass filtering easier, since a much larger transition band (between human-audible at 20 kHz and the sampling rate) is possible. The 88.2 kHz and 176.4 kHz rates are primarily used when the ultimate target is a CD.

    Consequences

    Subsequently, the DAT format was released in 1987 with 48 kHz sampling. This sample rate, which is a rounder number and also allows a larger transition band in low-pass filtering, has also become common. Converting between these rates was initially difficult, due to the relatively high numbers in the ratio between these rates: 44,100:48,000 = 147:160, but is today easy. This difference was initially exploited to make it difficult to copy 44.1 kHz CDs using 48 kHz DAT equipment.

    Status

    Due to the popularity of CDs, a great deal of 44.1 kHz equipment exists, as does a great deal of audio recorded in 44.1 kHz (or multiples thereof). However, some more recent standards use 48 kHz in addition to or instead of 44.1 kHz. In video, 48 kHz is now the standard, but for audio targeted at CDs, 44.1 kHz (and multiples) are still used.

    The HDMI TV standard (2003) allows both 44.1 kHz and 48 kHz (and multiples thereof). This provides compatibility with DVD players playing CD, VCD and SVCD content. The DVD and Blu-ray Disc standards use 48 kHz only.

    Most audio processors/sound cards contain DAC for both 44.1 kHz and 48 kHz, being able to natively output either, though some older processors include only 44.1 kHz output, and some cheaper newer processors only include 48 kHz output, requiring digital sample rate conversion to output other sample rates. Similarly, processors may be able to record natively at only certain sample rates.

    References

    44,100 Hz Wikipedia


    Similar Topics