Recent from talks
PCM adaptor
Knowledge base stats:
Talk channels stats:
Members stats:
PCM adaptor
A PCM adaptor is a device that encodes digital audio as video for recording on a videocassette recorder. The adapter also has the ability to decode a video signal back to digital audio for playback. This digital audio system was used for mastering early compact discs.
High-quality pulse-code modulation (PCM) audio requires a significantly larger bandwidth than a regular analog audio signal. For example, a 16-bit PCM signal requires an analog bandwidth of about 1-1.5 MHz compared to about 15-20 kHz of analog bandwidth required for an analog audio signal. A standard analog audio recorder cannot meet this requirement. One solution arrived at in the early 1980s was to use a videotape recorder, which is capable of recording signals with higher bandwidths.
A means of converting digital audio into a video format was necessary. Such an audio recording system includes two devices: the PCM adaptor, which converts audio into pseudo-video, and the videocassette recorder. A PCM adaptor performs an analog-to-digital conversion producing series of binary digits, which, in turn, is coded and modulated into a black and white video signal, appearing as a vibrating checkerboard pattern, which can then be recorded as a video signal.
Most video-based PCM adaptors record audio at 14 or 16 bits per sample, with a sampling frequency of 44.1 kHz for PAL or monochrome NTSC, or 44.056 kHz for color NTSC. Some of the earlier models, such as the Sony PCM-100, recorded 16 bits per sample, but used only 14 of the bits for the audio, with the remaining 2 bits used for error correction for the case of dropouts or other anomalies being present on the videotape.
The use of video for the PCM adapter helps to explain the choice of sampling frequency for the CD, because the number of video lines, frame rate and bits per line end up dictating the sampling frequency one can achieve. A sampling frequency of 44.1 kHz was thus adopted for the compact disc, as at the time, there was no other practical way of storing digital audio than by using a PCM adaptor and videocassette recorder combination.
It is simplest if the same number of lines are used in each field, and, crucially, it was decided to adopt a sample rate that could be used on both PAL and monochrome NTSC equipment. Since monochrome NTSC has a field rate of 60 Hz, and PAL has a field rate of 50 Hz, their least common multiple is 300 Hz, and with 3 samples per line, this yields a sample rate that is a multiple of 900 Hz. For monochrome NTSC the sample rate is 5m × 60 × 3, where 5m is the number of active lines per field, which must be a multiple of 5 (the rest used for synchronization), and for PAL the sample rate is 6n × 50 × 3, where 6n is the number of active lines per field, which must be a multiple of 6. The sampling rates that satisfy these requirements – at least 40 kHz (to encode up to 20 kHz sounds), no more than 46.875 kHz (requiring no more than 3 samples per line in PAL), and a multiple of 900 Hz (to allow encoding in both NTSC and PAL), are thus 40.5, 41.4, 42.3, 43.2, 44.1, 45, 45.9, and 46.8 kHz. The lower ones are eliminated due to low-pass filters requiring a transition band, while the higher ones are eliminated due to some lines being required for vertical blanking interval; 44.1 kHz was the higher usable rate, and was eventually chosen.
The sampling frequencies of 44.1 and 44.056 kHz were thus the result of a need for compatibility with the 25-frame (PAL countries) and 30-frame black and white (NTSC countries) video formats used for audio storage at the time.
Audio samples are recorded as if they were on the lines of a raster scan of video, as follows: analog video standards represent video at a field rate of 60 Hz (NTSC, North America – or 60/1.001 Hz ≈ 59.94 Hz for color NTSC) or 50 Hz (PAL, Europe), which corresponds to a frame rate of 30 frames per second (frame/s) or 25 frame/s – each field is half the lines of an interlaced image (alternating the odd lines and the even lines). Each of these fields is in turn composed of lines – a frame of 625 lines for PAL and 525 lines for NTSC, though some of the lines are actually for synchronizing the signal, and a field comprises half the visible lines in one vertical scan. Digital audio samples are then encoded along each line, thus allowing reuse of the existing synchronization circuitry – as video, the resulting images look like lines of binary black and white (rather, gray) dots along each scan line. The line frequency (lines per second) was 15,625 Hz for PAL (625 × 50/2), 15,750 Hz for 60 Hz (monochrome) NTSC (525 × 60/2), and 15,750/1.001 Hz (approximately 15,734.26 Hz) for 59.94 (color) NTSC, and thus to record audio at the required over 40 kHz required encoding multiple samples per line, with 3 samples per line being sufficient, yielding up to 15,625 × 3 = 46,875 for PAL and 15,750 × 3 = 47,250 for NTSC. It is desirable to minimize the number of samples per line, so that each sample can have more space devoted to it, thus making it easier to have a higher bit depth (16 bits, rather than 14 or 12 bits, say) and better error tolerance, and in practice, the signal was stereo, requiring 3 × 2 = 6 samples per line. However, some of these lines are devoted to (vertical) synchronization: specifically, the lines during the vertical blanking interval (VBI) could not be used, so a maximum of 490 lines per frame (245 lines per field) could be used in NTSC, and about 588 lines per frame (294 lines per field) on PAL (Note that, in video, PAL has (up to) 575 visible lines while NTSC has up to 485).
Hub AI
PCM adaptor AI simulator
(@PCM adaptor_simulator)
PCM adaptor
A PCM adaptor is a device that encodes digital audio as video for recording on a videocassette recorder. The adapter also has the ability to decode a video signal back to digital audio for playback. This digital audio system was used for mastering early compact discs.
High-quality pulse-code modulation (PCM) audio requires a significantly larger bandwidth than a regular analog audio signal. For example, a 16-bit PCM signal requires an analog bandwidth of about 1-1.5 MHz compared to about 15-20 kHz of analog bandwidth required for an analog audio signal. A standard analog audio recorder cannot meet this requirement. One solution arrived at in the early 1980s was to use a videotape recorder, which is capable of recording signals with higher bandwidths.
A means of converting digital audio into a video format was necessary. Such an audio recording system includes two devices: the PCM adaptor, which converts audio into pseudo-video, and the videocassette recorder. A PCM adaptor performs an analog-to-digital conversion producing series of binary digits, which, in turn, is coded and modulated into a black and white video signal, appearing as a vibrating checkerboard pattern, which can then be recorded as a video signal.
Most video-based PCM adaptors record audio at 14 or 16 bits per sample, with a sampling frequency of 44.1 kHz for PAL or monochrome NTSC, or 44.056 kHz for color NTSC. Some of the earlier models, such as the Sony PCM-100, recorded 16 bits per sample, but used only 14 of the bits for the audio, with the remaining 2 bits used for error correction for the case of dropouts or other anomalies being present on the videotape.
The use of video for the PCM adapter helps to explain the choice of sampling frequency for the CD, because the number of video lines, frame rate and bits per line end up dictating the sampling frequency one can achieve. A sampling frequency of 44.1 kHz was thus adopted for the compact disc, as at the time, there was no other practical way of storing digital audio than by using a PCM adaptor and videocassette recorder combination.
It is simplest if the same number of lines are used in each field, and, crucially, it was decided to adopt a sample rate that could be used on both PAL and monochrome NTSC equipment. Since monochrome NTSC has a field rate of 60 Hz, and PAL has a field rate of 50 Hz, their least common multiple is 300 Hz, and with 3 samples per line, this yields a sample rate that is a multiple of 900 Hz. For monochrome NTSC the sample rate is 5m × 60 × 3, where 5m is the number of active lines per field, which must be a multiple of 5 (the rest used for synchronization), and for PAL the sample rate is 6n × 50 × 3, where 6n is the number of active lines per field, which must be a multiple of 6. The sampling rates that satisfy these requirements – at least 40 kHz (to encode up to 20 kHz sounds), no more than 46.875 kHz (requiring no more than 3 samples per line in PAL), and a multiple of 900 Hz (to allow encoding in both NTSC and PAL), are thus 40.5, 41.4, 42.3, 43.2, 44.1, 45, 45.9, and 46.8 kHz. The lower ones are eliminated due to low-pass filters requiring a transition band, while the higher ones are eliminated due to some lines being required for vertical blanking interval; 44.1 kHz was the higher usable rate, and was eventually chosen.
The sampling frequencies of 44.1 and 44.056 kHz were thus the result of a need for compatibility with the 25-frame (PAL countries) and 30-frame black and white (NTSC countries) video formats used for audio storage at the time.
Audio samples are recorded as if they were on the lines of a raster scan of video, as follows: analog video standards represent video at a field rate of 60 Hz (NTSC, North America – or 60/1.001 Hz ≈ 59.94 Hz for color NTSC) or 50 Hz (PAL, Europe), which corresponds to a frame rate of 30 frames per second (frame/s) or 25 frame/s – each field is half the lines of an interlaced image (alternating the odd lines and the even lines). Each of these fields is in turn composed of lines – a frame of 625 lines for PAL and 525 lines for NTSC, though some of the lines are actually for synchronizing the signal, and a field comprises half the visible lines in one vertical scan. Digital audio samples are then encoded along each line, thus allowing reuse of the existing synchronization circuitry – as video, the resulting images look like lines of binary black and white (rather, gray) dots along each scan line. The line frequency (lines per second) was 15,625 Hz for PAL (625 × 50/2), 15,750 Hz for 60 Hz (monochrome) NTSC (525 × 60/2), and 15,750/1.001 Hz (approximately 15,734.26 Hz) for 59.94 (color) NTSC, and thus to record audio at the required over 40 kHz required encoding multiple samples per line, with 3 samples per line being sufficient, yielding up to 15,625 × 3 = 46,875 for PAL and 15,750 × 3 = 47,250 for NTSC. It is desirable to minimize the number of samples per line, so that each sample can have more space devoted to it, thus making it easier to have a higher bit depth (16 bits, rather than 14 or 12 bits, say) and better error tolerance, and in practice, the signal was stereo, requiring 3 × 2 = 6 samples per line. However, some of these lines are devoted to (vertical) synchronization: specifically, the lines during the vertical blanking interval (VBI) could not be used, so a maximum of 490 lines per frame (245 lines per field) could be used in NTSC, and about 588 lines per frame (294 lines per field) on PAL (Note that, in video, PAL has (up to) 575 visible lines while NTSC has up to 485).