Why is 44100 Hz the standard sampling rate for compact discs?

In summary, the extra 4100 Hz are there to account for the fact that human hearing can only process frequencies up to about 20 kHz.
  • #1
Bassalisk
947
2
I know that from the sampling theorem you have to go at least twice the frequency you want to sample.


Human hearing is around 20 kHz. So 40 kHz would be enough. What are those extra 4100 Hz there for. Now don't get me wrong, I tried google but all results were vague and only scratched the surface of the question.


Some of them said, reduce in entropy, some of the mentioned pass bands etc. Can anybody give me full, straight answer? Don't be afraid to go technical on me, I am very curios about this...
 
Engineering news on Phys.org
  • #2
A pre-sampling (Nyquist) filter needs to cut off unwanted high frequencies to avoid aliasing. But to have a fairly flat response up to where you can hear, a practical filter will need to have a band in which to 'roll off' and this requires an extra gap between your maximum programme frequency and half sample frequency. That takes you to, say 44kHz

I believe the actual choice of 44100Hz was to do with the fact that early recording of digital signals had to be done on existing Video recording equipment. A colour TV signal is a very complex thing and analogue recording of video 'only just works' on VHS (many people would say that it doesn't really work and I could agree). In order to use TV Analogue recording, the digital signal had to fit in with the existing standards so 44100Hz was high enough to suit the digital sound system and worked at acceptable bit rate for the , very complex, video, circuitry to work.
 
  • #3
This frequency is used when preparing an audio signal to be digitized onto a CD.

This is a quote from Wikipedia giving the historical reasons for its choice:

The exact sampling rate of 44.1 kHz was inherited from a method of converting digital audio into an analog video signal for storage on U-matic video tape, which was the most affordable way to transfer data from the recording studio to the CD manufacturer at the time the CD specification was being developed. The device that converts an analog audio signal into PCM audio, which in turn is changed into an analog video signal is called a PCM adaptor. This technology could store six samples (three samples per stereo channel) in a single horizontal line. A standard NTSC video signal has 245 usable lines per field, and 59.94 fields/s, which works out to be 44,056 samples/s/stereo channel. Similarly, PAL has 294 lines and 50 fields, which gives 44,100 samples/s/stereo channel. This system could store 14-bit samples with some error correction, or 16-bit samples with almost no error correction.
 
  • #4
One part of the answer is aliasing (again!). You can't design a practical filter that acts like a "brick wall" to block all frequencies above a particular value, without some nasty side effects like unwanted phase shifts that vary with frequency. It's much easier to design a filter with a steep slope to roll off the amplitude over a range like 20kHz - 22.05 KHz.

There are two frequency "standards" in practical use, because DAT (digital audio tape) was based on 48kHz sampling, and professional digital audio products (as compared with comsumer products) still use 48kHz or higher multiples of it (even as high as 384 kHz). For example DVD-audio disks use a 96kHz sample rate.

There is a story that the specification for audio CDs using 44.1 KHz was based on the requirement by Philips to issue their longest "popular" classical recording - Beethoven's 9th symphony conducted by Otto Klemperer, who was well known for slow tempos - on a single CD. That set the playing time at 74 minutes, and the 44.1 sampling rate then followed from the available technology to manufacture and play the disks. That may be apocryphal, but Klemperer's recording of the 9th does play for just a few seconds short of 74 minutes.

The advantage of the higher sampling rates is better capture of transient sounds with ultrasonic components (e.g. percussion instruments like cymbals etc) and better signal-to-noise ratio. If you can spread the "truncation noise" from rounding the results of digital processing to the nearest integer over the whole frequency range from 0 to 384 kHz, and tilt the frequency spectrum of the noise so most of it is at high frequences, you can then throw most of the noise away when you resample at 48 kHz.
 
  • #5
http://www.snopes.com/music/media/cdlength.asp

story gets better - a Sony executive's wife was fond of the 9th...

the adaptation of video recording equipment makes a lot of sense. it was used briefly for computer mass storage...
 
  • #6
Bassalisk said:
I know that from the sampling theorem you have to go at least twice the frequency you want to sample.


Human hearing is around 20 kHz. So 40 kHz would be enough. What are those extra 4100 Hz there for. Now don't get me wrong, I tried google but all results were vague and only scratched the surface of the question.


Some of them said, reduce in entropy, some of the mentioned pass bands etc. Can anybody give me full, straight answer? Don't be afraid to go technical on me, I am very curios about this...

The upper frequency cutoff of human hearing is not fixed from person to person, and it varies (decreases) with age in the same person. Young people often can hear ultrasounds above 20 kHz. I think even some musical instruments have spectral components up to 22 kHz. According to the sampling theorem, that would be 44 kHz. The 0.1 kHz is added to avoid any aliasing at the boundary,
 
  • #7
Dickfore said:
The upper frequency cutoff of human hearing is not fixed from person to person, and it varies (decreases) with age in the same person. Young people often can hear ultrasounds above 20 kHz. I think even some musical instruments have spectral components up to 22 kHz. According to the sampling theorem, that would be 44 kHz. The 0.1 kHz is added to avoid any aliasing at the boundary,
that's a bad explanation because its 'up to about 22kHz' not 'up to exactly 22 kHz' and adding 100hz to 'about 44khz' changes pretty much nothing about aliasing.

The reason for such odd frequency, as others have pointed out, is because of storing digital audio on the video tapes, fitting N samples into a single horizontal scan of the video signal.
 
Last edited:
  • #8
I would guess that the factorization [itex]44100 = 2^2 \times 3^2 \times 5^2 \times 7^2 [/itex] is more than "just a coincidence". But since there isn't any particular reason to chop audio signals into pieces exactly 1 second long, it's not entirely obvious why that is a nice property for a sampling rate. FWIW [itex]48000 = 2^7 \times 3 \times 5^3[/itex] also has lots of small prime factors.
 
  • #9
Dickfore said:
I think even some musical instruments have spectral components up to 22 kHz.

There are many instruments that have spectral components way above 22 kHz. A reasonable cut-off point for recording is about 100 kHz, which is why professional digital audio uses sample rates of 192 KHz or even 384 KHz. http://www.cco.caltech.edu/~boyk/spectra/spectra.htm
 
  • #10
Well I did expect an explanation technical in nature. So in a nutshell Philips was the "guilty" one for the standards?

Nice to learn something new today. Still, thank you all, you gave me a lot to work with. I will research this out even more.
 
  • #11
"I would guess that the factorization 44100=22×32×52×72 is more than "just a coincidence".

perhaps not a harmonic of TV vertical or horizontal sweep frequency?

like NTSC color carrier, 3.579xxxmhz not a multiple of 60..
 
  • #12
Bassalisk said:
Well I did expect an explanation technical in nature. So in a nutshell Philips was the "guilty" one for the standards?

Nice to learn something new today. Still, thank you all, you gave me a lot to work with. I will research this out even more.

Read the article in Wikipedia about compact disks.
http://en.wikipedia.org/wiki/Compact_disks
About the 6th topic on the page is the quote I gave earlier explaining exactly why 44.1 KHz is used.
 

Related to Why is 44100 Hz the standard sampling rate for compact discs?

1. Why is the sampling rate of 44100 Hz commonly used in audio recording?

The sampling rate of 44100 Hz is commonly used in audio recording because it is the minimum sampling rate required to accurately capture the range of human hearing, which is typically between 20 Hz and 20,000 Hz. This sampling rate allows for high fidelity recordings that accurately reproduce the original sound.

2. How does the sampling rate affect the quality of audio recordings?

The sampling rate directly affects the quality of audio recordings by determining the frequency range that can be accurately captured. A higher sampling rate allows for a wider frequency range to be captured, resulting in higher quality recordings with more detail and clarity.

3. Is 44100 Hz the only sampling rate used in audio recording?

No, 44100 Hz is not the only sampling rate used in audio recording. Other common sampling rates include 48000 Hz, 96000 Hz, and 192000 Hz. The choice of sampling rate depends on the specific application and desired audio quality.

4. How does the sampling rate affect the file size of audio recordings?

The sampling rate directly affects the file size of audio recordings. A higher sampling rate means more data is captured per second, resulting in larger file sizes. This is why higher quality recordings with wider frequency ranges tend to have larger file sizes.

5. Can a lower sampling rate be used for audio recording?

Yes, a lower sampling rate can be used for audio recording. However, using a lower sampling rate can result in a loss of audio quality, particularly in the higher frequency ranges. It is generally recommended to use a sampling rate of at least 44100 Hz for high quality audio recordings.

Similar threads

Replies
9
Views
1K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
6
Views
4K
  • Classical Physics
Replies
3
Views
1K
Replies
10
Views
2K
  • Astronomy and Astrophysics
Replies
10
Views
5K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
1
Views
2K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
1
Views
2K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
7
Views
2K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
1
Views
2K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
7
Views
3K
Back
Top