Audacity

What is sound?

Sounds are pressure waves travelling through the air. We hear sounds because our ears are sensitive to these waves. One of the easiest types of sound wave to understand is a short, sudden sound like a hand clap. When hands clap together, they create a pressurised wave of air which moves at about 340 meters per second ("the speed of sound"). When this wave reaches one's ear, it pushes on the eardrum slightly, causing it to vibrate, and the individual hears the clap.
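As a rough illustration of that speed, the following sketch estimates how long a clap takes to reach a listener. It assumes the approximate figure of 340 meters per second quoted above; the function name and the distances are only examples.

```python
SPEED_OF_SOUND_M_PER_S = 340.0  # approximate figure quoted in the text


def travel_time_seconds(distance_m: float) -> float:
    """Return the time a sound wave needs to cover distance_m meters."""
    return distance_m / SPEED_OF_SOUND_M_PER_S


if __name__ == "__main__":
    # Print the delay, in milliseconds, for a few example distances.
    for distance in (1.0, 34.0, 340.0):
        delay_ms = travel_time_seconds(distance) * 1000
        print(f"{distance:6.1f} m -> {delay_ms:.1f} ms")
```

At this speed a clap heard from 340 meters away arrives one second after it was made.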

waveform_clap.jpg

A hand clap is a short event that causes a single wave of air that quickly dies out. The image above shows the shape of the wave ("waveform") for a typical hand clap. 

Other sound waves are longer events. A ringing bell is a good example. When a bell rings, after the initial strike, the sound comes from the ongoing "ringing" of the bell. While the bell rings, it vibrates at a particular rate ("frequency") and this causes the nearby air to vibrate at the same rate. This causes waves of air to travel from the bell, again at the speed of sound. Pressure waves from continuous vibration look more like this:

waveform_sine.jpg

Both of these types of waves are called sound waves or acoustic waves.

Digital recording and playback

A microphone contains a small membrane which vibrates when it meets these acoustic waves. The microphone translates movements of the membrane into electrical signals. In short, a microphone converts acoustic waves into electrical waves.

If you examine the shape of an electrical wave from a microphone, you will notice that it looks very similar to the shape of the original sound wave. The following is the sound wave we saw in the above explanation:

waveform_sine.jpg

The following is the electrical wave created by the microphone (notice the measurement for "Current"):

One could say that the shape of the electrical wave is analogous to ("similar to") the shape of the original sound wave. This is why these electrical waves that represent sound waves are called analog waves.

The main device used in digital recording is an Analog-to-Digital Converter (ADC). The ADC measures the voltage of an electrical wave thousands of times per second. It then uses these measurements to create a map of an electrical wave:

waveform_sampled.jpg

Each dot in the figure above represents one audio sample. The more samples per second, the more accurate the mapping of the electrical wave.
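The idea of measuring a wave many times per second can be sketched in a few lines of Python. This is only an illustration: a pure sine wave stands in for the electrical wave, and the function name, frequency and sample rates are assumptions chosen for the example, not part of any real ADC.

```python
import math
from typing import List


def sample_sine(frequency_hz: float, sample_rate_hz: int,
                duration_s: float) -> List[float]:
    """Measure a sine wave sample_rate_hz times per second.

    Each returned value plays the role of one dot in the figure above.
    """
    count = round(sample_rate_hz * duration_s)
    return [
        math.sin(2 * math.pi * frequency_hz * n / sample_rate_hz)
        for n in range(count)
    ]


if __name__ == "__main__":
    # One cycle of a 100 Hz wave lasts 0.01 seconds.
    coarse = sample_sine(100.0, 800, 0.01)    # few samples per cycle
    fine = sample_sine(100.0, 44100, 0.01)    # many samples per cycle
    print(len(coarse), "samples versus", len(fine), "samples")
```

The higher sample rate places far more dots along the same cycle of the wave, which is why it traces the shape more closely.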

Playback of digital audio uses a Digital-to-Analog Converter (DAC). This takes the samples and converts them back into an electrical wave. In a computer, this electrical wave is sent to a sound card's headphone or speaker sockets, and the speakers recreate the original sound wave by vibrating their diaphragms.

A computer's sound card comes with an Analog-to-Digital Converter (ADC) for recording, and a Digital-to-Analog Converter (DAC) for playing audio. Your operating system (Windows, Mac OS X, Linux, etc.) talks to the sound card to actually handle the recording and playback, and audio applications talk to your operating system so that you can play sound files, capture sounds to a file, edit them, and mix multiple tracks while playing, etc.

Quality

Two factors determine the quality of a digital recording:

  • Sample rate: this is the rate at which an ADC records samples or a DAC plays them back. Sample rate is measured in hertz (Hz), or samples per second. An audio CD has a sample rate of 44,100 Hz (often written as "44.1 kHz" for short).

  • Sample format or sample size: essentially, this is the number of binary digits (bits) in the digital representation of each sample. Think of the sample rate as the horizontal precision of the digital waveform, and the sample format as the vertical precision. An audio CD has a precision of 16 bits.

Higher sampling rates allow a digital recording to accurately record higher frequencies: a recording can capture frequencies up to about half its sample rate.

Higher sample sizes allow for more dynamic range, that is, better reproduction of both loud and soft sounds.
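The effect of both factors can be estimated with a little arithmetic. The sketch below relies on two standard rules of thumb: the highest frequency a recording can capture is half its sample rate, and the dynamic range of an n-bit sample is roughly 20 × log10(2^n) decibels, or about 6 dB per bit. The function names are made up for this example, and the dynamic range figure is a theoretical approximation rather than a measurement of any particular sound card.

```python
import math


def highest_recordable_frequency_hz(sample_rate_hz: float) -> float:
    """Half the sample rate: the upper frequency limit of a recording."""
    return sample_rate_hz / 2


def quantization_levels(bits: int) -> int:
    """Number of distinct values a sample of the given size can take."""
    return 2 ** bits


def dynamic_range_db(bits: int) -> float:
    """Approximate theoretical dynamic range, about 6 dB per bit."""
    return 20 * math.log10(quantization_levels(bits))


if __name__ == "__main__":
    # Audio CD settings from the text: 44,100 Hz and 16 bits.
    print(highest_recordable_frequency_hz(44100), "Hz")
    print(quantization_levels(16), "levels")
    print(round(dynamic_range_db(16), 1), "dB")
```

For the audio CD settings this gives an upper limit of 22,050 Hz, 65,536 possible sample values, and a dynamic range of roughly 96 dB.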

Audio file formats

There are two main types of audio files on a computer:

  • PCM stands for Pulse Code Modulation. This is just a fancy name for the technique where each number in the digital audio file represents exactly one sample in the waveform. Common examples of PCM files are WAV files, AIFF files, and Sound Designer II files.

  • Compressed audio files are the other type. Modern compressed audio files use sophisticated psychoacoustic algorithms to represent the essential frequencies of the audio signal in far less space than PCM files. Examples include MP3 (MPEG-1 Audio Layer III), Ogg Vorbis, and WMA (Windows Media Audio). Creating one of these files sacrifices some quality in order to use less disk space.
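To make the PCM idea concrete, here is a minimal sketch that writes a WAV file using the `wave` and `struct` modules from Python's standard library. Each sample of a sine tone is stored as exactly one 16-bit number. The function name, the 440 Hz tone and the default settings are assumptions chosen for the example; this is not how Audacity itself writes files.

```python
import io
import math
import struct
import wave


def write_sine_wav(target, frequency_hz=440.0, duration_s=1.0,
                   sample_rate_hz=44100, amplitude=0.5):
    """Write a mono, 16-bit PCM WAV containing a sine tone.

    target may be a file name or a writable binary file object.
    Returns the number of samples written.
    """
    frame_count = round(sample_rate_hz * duration_s)
    frames = bytearray()
    for n in range(frame_count):
        value = amplitude * math.sin(
            2 * math.pi * frequency_hz * n / sample_rate_hz)
        # Scale to the signed 16-bit range and pack as little-endian.
        frames += struct.pack("<h", round(value * 32767))
    with wave.open(target, "wb") as wav_file:
        wav_file.setnchannels(1)              # mono
        wav_file.setsampwidth(2)              # 2 bytes = 16 bits per sample
        wav_file.setframerate(sample_rate_hz)
        wav_file.writeframes(bytes(frames))
    return frame_count


if __name__ == "__main__":
    buffer = io.BytesIO()                     # in-memory stand-in for a file
    written = write_sine_wav(buffer)
    print(written, "samples,", len(buffer.getvalue()), "bytes in total")
```

One second of mono audio at CD settings needs 44,100 × 2 = 88,200 bytes of sample data plus a small header. This is the space that compressed formats aim to reduce.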