Hubbry Logo
Three-two pull downThree-two pull downMain
Open search
Three-two pull down
Community hub
Three-two pull down
logo
8 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Three-two pull down
Three-two pull down
from Wikipedia
An illustration of the process

Three-two pull down (3:2 pull down) is a term used in filmmaking and television production for the post-production process of transferring film to video.

It converts 24 frames per second into 29.97 frames per second, converting approximately every four frames into five frames plus a slight slow down in speed.[1] Film runs at a standard rate of 24 frames per second, whereas NTSC video has a signal frame rate of 29.97 frames per second. Every interlaced video frame has two fields for each frame. The three-two pull down is where the telecine adds a third video field (a half frame) to every second video frame, but the untrained eye cannot see the addition of this extra video field. In the figure, the film frames A–D are the true or original images since they have been photographed as a complete frame. The A, B, and D frames on the right in the NTSC footage are original frames. The third and fourth frames have been created by blending fields from different frames.

Video

[edit]

2:3

[edit]
2:3 pulldown diagram (A-B-C-D to A1-A2-B1-B2-B1-C2-C1-D2-D1-D2)

In the United States and other countries where television uses the 59.94 Hz vertical scanning frequency, video is broadcast at 29.97 frame/s. For the film's motion to be accurately rendered on the video signal, a telecine must use a technique called the 2:3 pull down (or a variant called 3:2 pull down) to convert from 24 to 29.97 frame/s.[2]

The term "pulldown" comes from the mechanical process of "pulling" (physically moving) the film downward within the film portion of the transport mechanism to advance it from one frame to the next at a repetitive rate (nominally 24 frames/s). This is accomplished in two steps.

The first step is to slow down the film motion by 1/1000 to 23.976 frames/s (or 24 frames every 1.001 seconds). This difference in speed is imperceptible to the viewer. For a two-hour film, play time is extended by 7.2 seconds.

The second step is distributing cinema frames into video fields. At 23.976 frame/s, there are four frames of film for every five frames of 29.97 Hz video:

These four frames needs to be "stretched" into five frames by exploiting the interlaced nature of video. Since an interlaced video frame is made up of two incomplete fields (one for the odd-numbered lines of the image, and one for the even-numbered lines), conceptually four frames need to be used in ten fields (to produce five frames).

The term "2:3" comes from the pattern for producing fields in the new video frames. The pattern of 2-3 is an abbreviation of the actual pattern of 2-3-2-3, which indicates that the first film frame is used in 2 fields, the second film frame is used in 3 fields, the third film frame is used in 2 fields, and the fourth film frame is used in 3 fields, producing a total of 10 fields, or 5 video frames.[3] If the four film frames are called A, B, C and D, the five video frames produced are A1-A2, B1-B2, B1-C2, C1-D2 and D1-D2. That is, frame A is used 2 times (in both fields of the first video frame); frame B is used 3 times (in both fields of the second video frame and in the odd field of the third video frame); frame C is used 2 times (in the even field of the third video frame, and in the odd field of the fourth video frame); and frame D is used 3 times (in the even field of the fourth video frame, and in both fields of the fifth video frame). The 2-3-2-3 cycle repeats itself completely after four film frames have been exposed.

3:2

[edit]
3:2 pulldown diagram (A-B-C-D to A1-A2-A1-B2-B1-C2-C1-C2-D1-D2)

The alternative pattern of "3:2" is an abbreviation of the actual pattern of 3-2-3-2, which indicates that the first film frame is used in 3 fields, the second film frame is used in 2 fields, the third film frame is used in 3 fields, and the fourth film frame is used in 2 fields, producing a total of 10 fields, or 5 video frames.[2]

If the four film frames are called A, B, C and D, the five video frames produced are A1-A2, A1-B2, B1-C2, C1-C2 and D1-D2. That is, frame A is used 3 times (in both fields of the first video frame and in the odd field of the second video frame); frame B is used 2 times (in the even field of the second video frame, and in the odd field of the third video frame); frame C is used 3 times (in the even field of the third video frame and in both fields of the fourth video frame); and frame D is used 2 times (in both fields of the fifth video frame). The 3-2-3-2 cycle repeats itself completely after four film frames have been exposed.

In practice, there is no difference between the 2-3 and 3-2 patterns. A cycle that starts with film frame B yields a 3:2 pattern: B1-B2-B1-C2-C1-D2-D1-D2-A1-A2 (or 3-2-3-2). In fact, the "3-2" notation is misleading because according to SMPTE standards for every four-frame film sequence the first frame is scanned twice, not three times.[4]

Modern alternatives

[edit]
3:3:2:2 pulldown (A-B-C-D to A1-A2-A1-B2-B1-B2-C1-C2-D1-D2)

The above method is a "classic" 2:3 (a 2-3-2-3 cycle), which was used before frame buffers allowed for holding more than one frame. It has the disadvantage of creating two dirty frames (a mix from two different film frames, ex.: B1-C2 and C1-D2) in every five video frames.

The preferred method for doing a 2:3 is a 3-3-2-2 pattern (i.e. 3:3:2:2, 2:3:3:2 or 2:2:3:3) that creates only one dirty frame in every five. It produces A1-A2, A1-B2, B1-B2, C1-C2 and D1-D2, where only the second frame (A1-B2) is dirty.

While this method has a slight bit more judder, it allows for easier upconversion (the dirty frame can be dropped without losing information) and a better overall compression when encoding. Note that just fields are displayed—no frames hence no dirty frames—in interlaced displays such as on a CRT. Dirty frames may appear in other methods of displaying the interlaced video.[5]

Audio

[edit]

The rate of NTSC video (initially monochrome, only, but soon thereafter monochrome and color) is 29.97 frames per second, or one-thousandth slower than 30 frame/s, due to the NTSC color encoding process which mandated that the line rate be a sub-multiple of the 3.579545 MHz color "burst" frequency, or 15734.2637 Hz (29.9700 Hz, frame rate), rather than the (60 Hz) ac "line locked" line rate of 15750 Hz (exactly 30 Hz, frame rate). This was done to maintain compatibility with black and white televisions.

Because of this 0.1% speed difference, when converting film to video, or vice versa, the sync will drift and the audio will end up out of sync by 3.6 seconds per hour. In order to correct this error, the audio can be either pulled up or pulled down. A pull up will speed up the sound by 0.1%, used for transferring video to film. A pull down will slow the audio speed down by 0.1%, necessary for transferring film to video.

See also

[edit]

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
Three-two pulldown, also known as 3:2 pulldown, is a technique used in and to convert motion picture captured at 24 frames per second (fps) to formats like at 29.97 fps. This process addresses the frame rate mismatch by selectively repeating fields from film frames in a repeating 3:2 pattern—three fields from one frame followed by two from the next—resulting in 60 fields per second while maintaining near-original playback speed through a slight of the film to 23.976 fps. The method operates by projecting or scanning each film frame across video fields: the first frame contributes three fields, with the initial two forming a full video frame and the third pairing with a field from the second frame to create a split video frame; the second frame contributes the remaining two fields, completing the split frame and starting the next video frame in the pattern. This cadence repeats every five video fields, producing an average of 2.5 fields per film frame to align with the video standard's requirements. Although effective for broadcast compatibility, the uneven distribution can cause subtle motion artifacts, such as judder or flicker, particularly noticeable during fast panning shots where split fields combine elements from adjacent film frames. Originating in the 1950s as part of the television standard finalized in , 3:2 pulldown was developed to enable seamless airing of 24 fps theatrical films on 30 fps (later precisely 29.97 fps) interlaced systems without accelerating the content by 25%. It became a cornerstone of analog transfers and persists in digital workflows for previewing effects or archiving footage. In modern production, inverse (IVTC) algorithms reverse the process to reconstruct progressive 24 fps sequences for high-definition displays, while alternatives like native progressive at 23.976 fps or 60 fps eliminate the need for pulldown altogether.

Overview

Definition and Purpose

Three-two pulldown, also known as 3:2 pulldown, is a post-production technique used in filmmaking and television to transfer content originally captured at 24 frames per second (fps) on film to the 29.97 fps interlaced video standard of NTSC broadcast systems by selectively repeating video fields. This method forms a core part of the telecine process, where motion picture film is scanned and converted into an electronic video format suitable for television airing or further editing. The primary purpose of three-two pulldown is to reconcile the inherent frame rate mismatch between traditional cinematic , which operates at 24 fps to achieve smooth motion while conserving , and video standards, which use an interlaced format at approximately 30 fps (precisely 29.97 fps to accommodate the color subcarrier and prevent interference). By inserting additional fields without significantly altering playback speed—typically slowing the film by about 0.1% to align rates—this technique enables movies to be broadcast on television with minimal distortion, preserving the original timing and artistic intent. In the basic telecine process, each film frame is scanned into two interlaced video fields (one for odd lines, one for even lines), but to match the higher video field rate of 59.94 fields per second, pulldown inserts extra fields through repetition. The "3:2" ratio specifically describes the repeating pattern where, over a sequence of four film frames, ten video fields are generated (three fields from the first frame, two from the second, three from the third, and two from the fourth), ensuring the overall timing remains approximately synchronized. This field repetition pattern, while introducing minor motion artifacts, effectively bridges the rate conversion without resorting to uniform speedup or slowdown that could affect audio pitch or visual fluidity.

Historical Development

The three-two pulldown technique emerged in the amid the rapid expansion of U.S. television broadcasting, addressing the fundamental mismatch between 24 frames per second theatrical and the 30 frames per second video standard. Developed by RCA engineers to convert content for live transmission, it allowed networks to integrate movies into programming schedules without significant speed alterations. Early adoption occurred at major broadcasters like , which utilized RCA's systems to air Hollywood features, marking a pivotal shift from predominantly live content to -based shows. A critical advancement came in 1953 with the FCC's approval of the color television standard, which introduced a subtle frame rate reduction to 29.97 fps to prevent the new color subcarrier from interfering with audio signals and causing visible artifacts. This adjustment necessitated refined implementation of 3:2 pulldown in equipment, standardizing the process for color-compatible broadcasts and ensuring smooth integration of film libraries. RCA's innovations in this era, including advanced film chain systems, played a central role in enabling networks such as and to deliver colorized content reliably to viewers. By the 1970s, three-two pulldown had become embedded in consumer workflows, particularly with the launch of formats in 1976, where transfers routinely applied the technique to encode films for home playback on VCRs. This extension democratized access to cinematic content beyond theaters. The method endured into the digital age for DVD and early streaming, but its prominence waned in the 2000s as high-definition broadcasting prioritized native formats, reducing reliance on pulldown through inverse and progressive scanning. The technique's historical significance lies in facilitating the mass dissemination of Hollywood films via television, transforming evening broadcasts into cultural staples for millions. However, the inherent judder from uneven field repetition created a distinctive motion artifact, often recognized as the "filmed on TV" aesthetic that differentiated broadcast movies from smoother video productions.

Technical Mechanism

Frame Rate Conversion Process

The frame rate conversion process in three-two pulldown begins with advancing the film through a projector at a rate of 23.976 frames per second to align with NTSC video standards. Each film frame is then scanned by the telecine equipment into two interlaced fields: one containing the odd-numbered lines and the other the even-numbered lines, capturing the full image progressively but outputting it in an interlaced format for compatibility with broadcast video. This scanning occurs in real time, with the telecine machine synchronizing the film's mechanical pull-down motion—where the film is held steady for exposure—with the generation of video fields at 59.94 fields per second (equivalent to 29.97 frames per second interlaced). To achieve the target output rate, extra fields must be inserted since 24 film frames produce only 48 fields, while 29.97 fps interlaced video requires 60 fields per second. The timing is calibrated such that 24 film frames at 23.976 fps occupy exactly 1 second, matching the duration of 30 video frames at 29.97 fps, thus preserving the original playback speed without alteration. This 0.1% slowdown from a nominal 24 fps to 23.976 fps compensates for the NTSC video rate being slightly slower than an exact 30 fps, ensuring temporal alignment through field duplication rather than frame blending or speed adjustment. The pulldown process targets individual fields rather than complete frames to reduce motion artifacts like judder, as each video frame comprises two sequentially displayed fields, each lasting 1/59.94 seconds. Telecine machines, such as the Spirit DataCine, facilitate this conversion in real time by integrating high-resolution scanning with digital processing. These devices scan the film at up to and output interlaced SD or HD video signals, maintaining synchronization between the film's advancement and the video field's timing to produce seamless 59.94-field-per-second output. The standard insertion method for the extra fields is the 3:2 pulldown pattern.

Field Repetition Patterns

In the standard 3:2 pulldown process, film frames are converted to interlaced video fields through a repeating pattern of duplications to match the higher frame rate. Specifically, the first film frame (A) contributes three fields: its top field (A1), bottom field (A2), and a repetition of the top field (A1). The subsequent film frame (B) contributes two fields: bottom (B2) then top (B1). This alternation continues with the third frame (C) providing three fields: bottom (C2), top (C1), and a repetition of the bottom (C2); the fourth (D) providing two: top (D1), bottom (D2), resulting in 10 video fields derived from 4 film frames overall. The precise field mapping for one complete cycle follows this sequence, where each film frame's progressive content is separated into odd (top) and even (bottom) fields for interlaced output:
Video FieldSource Field
1 (odd)A top
2 (even)A bottom
3 (odd)A top (repeat)
4 (even)B bottom
5 (odd)B top
6 (even)C bottom
7 (odd)C top
8 (even)C bottom (repeat)
9 (odd)D top
10 (even)D bottom
This mapping ensures that repeated fields come from the same film frame, minimizing motion artifacts in the interlaced video. The cadence cycle of 3:2 pulldown spans every 4 s, which generate 5 video frames (10 fields total), maintaining synchronization over time. The pattern repeats every 10 fields to align precisely with video timing requirements. Mathematically, the 3:2 ratio reflects the field contributions (3 from one frame, 2 from the next), resulting in 10 fields from 4 film frames (8 base fields plus 2 repeats), which ensures that 24 fps film maps accurately to 29.97 fps video without cumulative timing drift.

Variants and Applications

3:2 Pulldown in NTSC

The 3:2 pulldown process is integral to the NTSC television standard, which served as the primary analog broadcast system in the United States and Japan from the mid-20th century until the digital transition. NTSC employs a 525-line interlaced scanning format at 29.97 frames per second (59.94 fields per second), necessitating the conversion of 24 frames per second film to match this rate without significant speed alteration. The standard pattern assigns three fields to one film frame and two to the next in a repeating cadence, typically starting with the odd (upper) field to align with NTSC's interlaced structure. In broadcasting applications, was crucial for adapting theatrical films to from the through the , enabling seamless playback of on network schedules and ensuring compatibility with consumer equipment. It was routinely embedded in master tapes for distribution, such as and early DVD releases in markets, preserving the film's temporal integrity while fitting the broadcast pipeline. During post-production, 3:2 pulldown was applied via telecine transfer, where 35mm film negatives or prints were scanned and converted to video tape in specialized color suites, allowing editors to work in the NTSC domain. This workflow was standard for preparing content for air, as seen in transfers of classic films like the original Star Wars trilogy (1977–1983), which were adapted using 3:2 pulldown for NTSC television broadcasts. A key advantage of 3:2 pulldown in NTSC systems lies in its synchronization of field dominance, typically with the top (odd) field first, which helps minimize color bleeding and related artifacts in analog transmission and playback by aligning the progressive film source with the interlaced video signal's color subcarrier timing. This approach leverages basic field repetition patterns to reduce visible distortions, ensuring cleaner integration within NTSC's 525-line framework.

2:3 Pulldown in PAL and Reverse Conversion

The 2:3 pulldown is a phase-shifted variant of the 3:2 pulldown, also used in systems, where the field repetition pattern begins with two fields from one film frame followed by three fields from the next (e.g., A1, A2, B1, B2, B2). This alternation optimizes motion continuity depending on the content and equipment settings. In PAL systems operating at 25 fps (50 fields per second), the standard conversion from 24 fps involves accelerating the playback speed by approximately 4.17% to 25 fps, avoiding the need for field repetition and minimizing judder, though it raises audio pitch unless compensated. An alternative "soft " method, known as Euro pulldown, inserts extra fields without speedup by using a repeating over 12 frames (typically 11 frames contributing two fields each and one contributing three, e.g., a 2:2...2:3 ), resulting in 50 fields per second while maintaining original speed. This has been used in European transfers since the to preserve audio integrity. Reverse conversion, or inverse telecine (IVTC), removes the added or repeated fields from interlaced video (29.97 fps for NTSC or 25 fps for PAL with pulldown) to reconstruct the original 24 fps progressive film sequence. This process is commonly employed in digital workflows for archival restoration, high-definition remastering, or film-out to recover clean progressive footage. IVTC algorithms detect the pulldown cadence and discard duplicates, applicable to both 3:2/2:3 patterns in NTSC and Euro pulldown in PAL. Compared to 's 3:2/2:3 patterns, speedup or Euro pulldown reduces the need for field dominance shifts but can introduce subtle motion artifacts if not handled properly in 625-line systems.

Audio Handling

Synchronization Challenges

The core challenge in synchronizing audio with three-two pulldown video stems from the frame rate mismatch between original 24 fps audio and the resulting 29.97 fps video output. To accommodate the NTSC standard, the projector in transfers operates at a slowed speed of 23.976 fps—a 0.1% reduction from the native rate—extending the overall duration slightly while inserting repeated fields to fill the extra frames. If the audio track remains at its original 24 fps timing without corresponding adjustment, it progresses faster relative to the video, causing cumulative drift that accumulates to about 3.6 seconds over one hour of material. This speed differential requires slowing the audio by 0.1% to match the video, introducing a subtle pitch downshift if not further corrected for pitch, becoming particularly evident in extended sequences where precise alignment is critical. The drift formula quantifying this error is approximately Time error = 0.001 × duration (in seconds), reflecting the 0.1% relative speedup of unadjusted audio against the slowed video timeline. Field repetitions inherent to the three-two pulldown process further complicate , as the uneven distribution of repeated fields (three fields from one frame, two from the next) can misalign audio cues with visual motion if not precisely calibrated, leading to noticeable lip-sync errors in dialogue-intensive scenes. In historical analog workflows of the , audio was typically transferred separately from the video via magnetic stripes on the film edge or fullcoat magnetic stock, necessitating manual using interlock motor systems to align the separate audio and picture reels during broadcast preparation. These systems achieved frame-accurate timing within 1/1000th of a frame but required ongoing adjustments to counteract mechanical variations, a common practice in era-specific transfers.

Compensation Techniques

To address the audio drift resulting from frame rate mismatch in 3:2 pulldown, several compensation techniques adjust timing and pitch to ensure synchronization between the original 24 fps audio and the resulting 29.97 fps video. The primary method during analog-to-digital transfer involves slowing the audio playback by approximately 0.1%—equivalent to multiplying the speed by 23.976/24 or 1000/1001—to match the extended video duration introduced by the pulldown process. This adjustment preserves the original pitch through varispeed playback, a technique that alters playback rate while maintaining perceptual pitch via real-time resampling, avoiding the unnatural tonal shifts of traditional tape speed changes. The resulting pitch drop from this slowdown is minimal, about -1.7 cents, often imperceptible to the human ear. In workflows where audio and video are processed separately, the audio track is dubbed onto the video tape after pulldown completion, relying on for precise alignment of audio waveforms to video frames. , a standard longitudinal embedding hours, minutes, seconds, and frames, enables frame-accurate synchronization during dubbing, ensuring lip sync and effects timing without manual trimming. Digital audio workstations (DAWs) introduced more flexible corrections, allowing pitch shifts without duration changes to fine-tune . For example, tools in early DAWs like enable a +1.7 cent pitch correction to offset the slowdown's effect, commonly applied in DVD authoring to conform audio to pulldown-altered video while preserving original timing. For higher-quality adjustments, advanced non-linear resampling methods, such as the , stretch or compress audio frames independently of video fields, minimizing artifacts like phasing or reverb tails. The analyzes the signal via , modifies phase and magnitude for time scaling, and resynthesizes the waveform, making it suitable for subtle compensations where linear methods introduce audible distortion.

Visual and Modern Considerations

Artifacts and Effects

Motion judder arises from the uneven field repetition in three-two pulldown, where frames are alternately repeated three and two times to convert 24 fps film to 60 fields per second, resulting in stuttering motion particularly during slow, steady camera pans. This irregularity creates a stutter or "hiccup" effect that repeats every five video frames (approximately one-sixth of a second), arising from the uneven field repetition in the 3:2 pattern and disrupting the smooth temporal flow of the original cinematic content. In high-motion scenes, such as action sequences, judder becomes more pronounced due to larger pixel offsets between repeated fields, leading to visible temporal discontinuities. Combing artifacts manifest during deinterlacing of three-two pulldown material, where mismatched fields from adjacent film frames are combined into a single video frame, producing horizontal jagged lines or "teeth" in areas of motion. These artifacts occur because the pulldown process generates hybrid frames containing fields from two different original frames, which standard algorithms fail to separate properly, resulting in visible comb-like distortions on progressive displays. The effect is especially evident in fast panning shots, where the spatial misalignment between odd and even fields creates a shimmering or serrated appearance. The perceptual impact of three-two pulldown alters the intended smooth 24 fps motion of into a more erratic "video look," reducing overall immersion and perceived quality. Viewer studies indicate that judder and associated stutter can cause and discomfort, with mean opinion scores (MOS) dropping to 30-50 on a 0-100 scale for affected sequences, particularly in prolonged viewing sessions. This uneven motion rendering has been linked to increased , especially in slower scenes where the repetition pattern accentuates subtle irregularities, leading to visual fatigue over time. Detection of three-two pulldown artifacts is often amplified by cadence breaks, such as those introduced by scene edits or post-production cuts that disrupt the consistent 3:2 field repetition pattern. These interruptions cause irregular frame blending, exacerbating judder and combing, which is particularly common in older film-to-video transfers from the 1980s, like action films with frequent cuts. In such cases, the broken cadence results in stuttered playback and heightened visibility of interlacing artifacts during motion, making the flaws more perceptible to viewers.

Digital Alternatives and Inverse Telecine

In digital workflows, native 24p has emerged as a primary alternative to traditional 3:2 pulldown, enabling cameras to record progressive frames directly at 24 fps without requiring frame duplication for broadcast compatibility. Digital cinema cameras, such as those from like the KOMODO 6K, support this native progressive recording to CFast 2.0 media, preserving the original film-like motion and eliminating the need for pulldown insertion during or distribution. Software solutions further facilitate clean frame rate conversions from 24p to 30 fps without relying on pulldown patterns. For instance, allows users to interpret footage at the desired or employ in Adobe Media Encoder, generating intermediate frames based on to maintain smooth playback while avoiding interlacing artifacts. Inverse (IVTC) provides a method to reverse the 3:2 pulldown process, algorithmically reconstructing the original 24 fps progressive video from telecined interlaced sources. This technique identifies the repeating field patterns introduced by pulldown—typically through field matching, where fields from the same original frame are paired based on similarity—and discards duplicates to restore the native , often enhanced by to handle cadence breaks or hybrid content. In tools like , IVTC operates in adaptive modes that analyze input frames to remove repeated fields, supporting both field-based and frame-based reconstruction for accurate recovery. Contemporary implementations leverage for efficient IVTC. Media players such as MPC-HC integrate GPU-accelerated and IVTC via filters like ffdshow or LAV Filters, enabling real-time processing of high-definition content with minimal CPU overhead. This capability is particularly valuable in Blu-ray remastering, where IVTC is applied to convert legacy interlaced masters back to progressive, ensuring faithful reproduction of the original film's timing and motion in high-resolution releases. The advantages of these digital approaches include significant judder reduction in and 4K playback, as restoring progressive eliminates the uneven frame repetition that causes motion stutter on modern displays. Platforms like prioritize such progressive formats for streaming, supporting native 24 fps delivery to minimize judder—especially in regions with mismatched frame rates—and enhance overall viewing smoothness without additional .

References

Add your contribution
Related Hubs
User Avatar
No comments yet.