Hubbry Logo
search
logo

Scratch vocal

logo
Community Hub0 Subscribers
Read side by side
from Wikipedia

A scratch vocal or guide vocal is an audio recording made without the intention of keeping it. The recording is intended for reference only.[citation needed] These vocals can be used for music or animation. In music, a singer may use a scratch recording to rerecord the song later. In animation, scratch vocals are often used to check the story and can be kept instead of auditioning for another vocalist.

In music

[edit]

A scratch vocal is a vocal performance that a singer records to provide a reference track that music producers and audio engineers can use as they craft other pieces of the recorded song.

Most of the time, the singer of a scratch vocal ultimately re-records the vocal performance after production is complete. However, there are a number of exceptions to this rule, such as "The Piña Colada Song" by Rupert Holmes, where the re-recording lacked the desired energy and spontaneity, or "Superstar" by the Carpenters, where the scratch was so well performed that a re-record was deemed unnecessary.

A guide vocal is a vocal performance that a singer records to provide a reference track for another singer or singers who will be performing or recording the song.[1]

In animation

[edit]

Scratch vocals are also often used in the production of feature-length animated films to bring storyboards to life as "animatics," in which storyboard frames are synced to the relevant dialogue, together with a rough soundtrack generated on a synthesizer. Animation directors "hire temporary voices to help find their way through various script revisions, visual renderings and other steps of the process."[2] Scratch vocals may be obtained from professional voice actors (who may or may not be well-established in the voice-over community but are generally unknown to the general public) or from anyone around the studio willing to chip in a line or two (as well as friends or family members).[3]

For lead roles of animated films, scratch vocals are nearly always replaced in the final cut by vocal tracks recorded by bankable stars or experienced character actors. However, in the rush to meet deadlines, if the scratch vocals for a minor role are good enough, the director may skip auditions and simply use the actor who recorded the scratch vocals in the role. This is how many animation studio employees (and their friends and family members) end up with minor credits as cast members on their studio's products.[3]

If scratch vocals for a lead role are exceptionally good, the studio may approve casting of the scratch vocalist in the lead role—as occurred with Pixar's Turning Red (2022).[2]

The current pattern of first building a rough draft of an entire animated film with scratch vocals and animatics was a relatively late development. In 1993, Toy Story (released in 1995) was the first animated film for which scratch vocals were recorded for all reels.[4] Before that point, development of animated films was much more chaotic in terms of when scratch vocals versus production sound were used for any particular reel in the film. For example, in the 1980s, Walt Disney Feature Animation experimented with recording production sound for all reels before starting animation and then using scratch vocals only for changes.[4]

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
A scratch vocal, also known as a guide vocal, is a temporary audio recording of vocals made during the music production process without the intention of retaining it in the final mix.[1] It serves as a reference track to establish the song's melody, lyrics, tempo, and overall structure, enabling producers and musicians to build the instrumental arrangement around it before overdubbing a polished vocal performance.[2] Typically recorded quickly with minimal equipment, the scratch vocal captures the core idea of the song in a raw form, often alongside basic rhythm elements like drums or bass to maintain timing and energy.[1] In professional recording sessions, scratch vocals are essential for guiding live band performances, helping instrumentalists—such as guitarists or drummers—align their parts with the vocal phrasing and emotional delivery.[2] They facilitate a smoother workflow by allowing the full arrangement to be developed iteratively, with the final vocals added later in a controlled environment to achieve optimal tone and precision.[1] Beyond music, scratch vocals play a key role in film and animation production, where they are used in animatics or as placeholders for automated dialogue replacement (ADR), ensuring synchronization with visuals before permanent recordings are made.[3] While usually discarded, an exceptionally compelling scratch vocal may occasionally be retained if it preserves a unique spontaneity that proves difficult to recreate.[2] This practice has become a standard in modern recording studios since the advent of multitrack technology, streamlining the transition from demo to finished product and enhancing creative efficiency across genres.[3]

In music production

Definition and purpose

A scratch vocal, also known as a scratch track or guide vocal, is a temporary recording of a singer's performance created during the early stages of music production to serve as a reference for other elements of the track.[4] This provisional vocal is typically captured quickly and with minimal preparation, often using basic equipment, and is not intended to appear in the final mix unless its quality proves exceptional.[5] The primary purpose of a scratch vocal is to provide a structural and rhythmic foundation that guides the recording of instrumental parts, ensuring that the band or ensemble can align their performances with the song's vocal phrasing, tempo, and key.[6] By laying down vocals early—often as the first track after establishing basic chords and tempo—it allows producers and musicians to experiment with arrangements, test how the music supports the lyrics and melody, and maintain cohesion across the track without committing to a polished vocal take prematurely.[5] This approach facilitates efficient workflow in the studio, as instrumentalists can react to the vocal's energy and intent in real time, reducing the need for extensive revisions later.[5] In practice, scratch vocals enable remote collaboration, where a singer might record to a click track separately and import it into the session, helping the group stay synchronized during basic tracking sessions.[6] While usually replaced by a more refined performance, a compelling scratch vocal can occasionally become the keeper, preserving the raw inspiration of the initial capture.[5] This versatility underscores its role in balancing creative spontaneity with technical precision in modern recording processes.

Recording techniques

Recording scratch vocals in music production typically involves a streamlined process to capture a temporary guide track that establishes the song's timing, phrasing, and overall feel for subsequent instrumental recordings. These tracks are recorded early in the session, often before committing to the full band performance, allowing musicians to play along with a vocal reference that conveys the intended energy and structure. The emphasis is on efficiency rather than sonic perfection, as the scratch vocal is almost always replaced in the final mix.[7] A basic setup suffices for scratch vocal recording, prioritizing accessibility over high-fidelity equipment. Common choices include dynamic microphones such as the Shure SM57 or SM58, which are rugged, cost-effective, and effective in untreated spaces without requiring extensive room treatment. These mics are positioned 6 to 12 inches from the singer's mouth, often without a dedicated pop filter, to quickly capture intelligible vocals while minimizing setup time. The singer monitors a rough instrumental demo, click track, or empty rhythm section via headphones connected to a simple audio interface and digital audio workstation (DAW) like Reaper or Pro Tools.[7][8] Techniques focus on performance over technical precision, encouraging the vocalist—often the songwriter, producer, or a stand-in—to deliver takes with authentic emotion and groove, even if pitch or diction is imperfect. Multiple passes are recorded rapidly, with the best elements comped together if needed, but the goal is a single cohesive guide that inspires tight ensemble playing. In some workflows, a second microphone serves as a room mic to add subtle ambiance, enhancing the guide's musical context without complicating the process. This approach ensures the scratch vocal supports rhythmic cohesion.[8][7]

Replacement in final production

In music production, scratch vocals recorded during basic tracking sessions are systematically replaced in the final production phase to achieve a polished, professional sound. This replacement occurs during overdub sessions, where the lead vocalist re-performs the parts in isolation, often in a dedicated vocal booth to minimize bleed from other instruments and allow for precise control over performance and processing.[9] The process typically involves muting or deleting the original scratch track in the digital audio workstation (DAW), then layering new takes while referencing the scratch for timing, phrasing, and emotional delivery to maintain the song's groove established in the live band performance.[10] The primary reasons for replacement include improving vocal clarity, intonation, and dynamics, which are often compromised in the energetic but uncontrolled environment of basic tracking. Scratch vocals, captured quickly with minimal setup—such as a dynamic microphone like the Shure SM58 in a control room—prioritize feel over fidelity, but they can introduce artifacts like room noise or inconsistent levels that detract from the final mix.[11] Overdubs enable the application of specialized treatments, such as compression (e.g., UREI 1176), delay, and reverb, tailored to the vocal's role in the arrangement, ensuring it integrates seamlessly with the instrumentation. In some cases, multiple takes are comped (composite edited) from several performances to create an ideal vocal track.[10] A notable example is the production of The Go-Go's "Our Lips Are Sealed" from the 1981 album Beauty and the Beat. Belinda Carlisle's initial scratch vocals, recorded live in a booth at Pennylane Studios using a Neumann U47, guided the band's basic tracks for a raw, energetic feel. These were later replaced during overdubs at Soundmixers Studios, where Carlisle re-recorded in a reverberant bathroom space for added depth, contributing to the track's Top 10 success.[9] Similarly, for R.E.M.'s "So. Central Rain (I'm Sorry)" on Murmur (1983), Michael Stipe's live scratch vocals from Reflection Studios were overwritten in subsequent sessions using the same Neumann FET U47 but with added effects like DeltaLab delay, resulting in a more refined and emotive final vocal.[10] While replacement is standard, exceptional scratch takes may be retained if they capture an irreplaceable vibe. In The Killers' Day & Age (2008), Brandon Flowers' scratch vocals—intended as placeholders during demo sessions—were kept on about half the tracks due to their compelling quality, bypassing full overdubs and preserving the song's authentic energy.[11] This selective approach underscores how replacement prioritizes artistic and technical enhancement without discarding the foundational spirit of the scratch performance.

In film and animation

Role in pre-production

In the pre-production phase of animated films and series, scratch vocals serve as temporary audio placeholders to facilitate the creation of animatics, which are preliminary animated storyboards combining visuals, dialogue, and basic sound elements to test narrative flow and timing.[12] These recordings, often performed by directors, writers, or available actors, allow the production team to visualize scene pacing and character interactions early, before final voice casting occurs.[13] By integrating scratch dialogue into the animatic, animators can begin rough blocking of movements and lip synchronization, ensuring that the animation aligns with spoken words without delaying the overall pipeline.[14] The primary purpose of scratch vocals in this stage is to identify potential issues in script delivery, such as unnatural pauses or emotional beats, enabling iterative revisions to the storyboard or dialogue before committing to expensive animation assets.[15] For instance, in Pixar's workflow, storyboards are filmed with scratch voice tracks to produce a complete rough cut, helping directors refine the story's rhythm and structure.[13] This approach minimizes risks in later stages by providing a functional audio guide that simulates final performances, often recorded quickly to maintain momentum in collaborative reviews.[14] In practice, scratch vocals are crucial for resource allocation, as they permit animators and editors to proceed with visual development while professional voice actors are auditioned and selected. A notable example is the use of scratch tracks in Disney-Pixar's Inside Out 2, where storyboard artist Valerie LaPointe provided temporary vocals for the character Joy during animatic assembly, allowing the team to assess scene dynamics before Amy Poehler's final recording.[16] Similarly, in game animation like Double Fine's Psychonauts 2, scratch dialogue enabled early cutscene scripting and animation blocking, front-loading adjustments to avoid bottlenecks.[14] Overall, this technique streamlines pre-production by bridging script and visuals, ensuring cohesive storytelling from the outset.[12]

Synchronization with visuals

In film and animation production, scratch vocals—temporary dialogue recordings—play a crucial role in synchronizing audio with visual elements during the pre-production and early production phases. These tracks, often performed by directors, writers, or stand-in actors, establish the timing, pacing, and emotional tone of dialogue before final voice acting occurs. By integrating scratch vocals into animatics—rough animated storyboards—production teams can align key visual beats, such as character movements and camera angles, precisely with the audio cues, preventing costly revisions later in the pipeline.[17][3] The synchronization process begins after script approval, where scratch dialogue is recorded in a controlled environment to capture natural inflections and rhythm. This audio is then edited into the animatic reel alongside temporary sound effects and music, allowing editors and animators to visualize how dialogue drives the scene's dynamics. For lip synchronization specifically, animators reference the scratch track's phonemes and pauses to block out mouth shapes and facial expressions, ensuring that character animations match the intended delivery even if the final recording varies slightly in timing. This approach is standard in animated features, where audio precedes detailed visuals to maintain narrative flow and realism.[18][3] Once synchronization is achieved in the animatic stage, the scratch vocals guide subsequent previsualization (previz), where 3D roughs or 2D sketches refine the visuals while preserving audio alignment. This iterative syncing minimizes desynchronization issues during final animation, as adjustments to lip sync or action timing are made early based on the placeholder track. In practice, professional scratch recordings prioritize clarity and performance quality to avoid misleading the animation team, as poor timing can lead to inefficient workflows and increased costs.[17][18]

Transition to final voice acting

In film and animation production, the transition from scratch vocals—also known as prelay or temporary dialogue tracks—to final voice acting typically occurs after the initial animation phases, ensuring synchronization with character movements and visuals. Scratch vocals are recorded early in pre-production by voice actors, directors, or staff to establish timing, pacing, and emotional tone in the animatic reel, which serves as a blueprint for animators.[19][3] This preliminary audio guides lip-sync, gestures, and scene rhythm, allowing animators to create rough or final animation that aligns with the dialogue's delivery.[20] Once rough animation or layout is complete, final voice acting sessions are scheduled, often in post-production, where professional actors re-record lines to match the now-fixed visuals. This step, akin to automated dialogue replacement (ADR) in live-action, involves multiple takes to refine performance, adjust for any animation tweaks, and achieve higher audio quality in a controlled studio environment.[3][19] The scratch track is then replaced, though minor adjustments to mouth movements or timing may be made during compositing to ensure seamless integration.[20] In some cases, the scratch vocal performance proves so compelling that it is retained in the final mix, bypassing re-recording to preserve authenticity. For instance, in Disney's Bolt (2008), animator Mark Walton provided the initial scratch voice for the character Rhino, which directors Chris Williams and Byron Howard deemed ideal and kept largely intact, with much of the original track used without alteration.[21][22] This approach highlights how effective scratch vocals can influence casting and production decisions, reducing costs while maintaining creative intent.[23]

Historical development

Origins in analog recording

The practice of recording scratch vocals emerged alongside the development of multi-track analog recording techniques in the mid-20th century, enabling producers to layer audio elements while using temporary vocal guides for synchronization. Guitarist and inventor Les Paul pioneered these methods in the late 1940s by modifying tape recorders to allow sound-on-sound overdubbing, where an initial track served as a reference for adding harmonies, instrumentals, and effects without the need for live ensemble performances.[24] This innovation marked a shift from single-track monaural captures to more complex arrangements, with scratch vocals providing essential timing cues during the analog process. By the 1950s and 1960s, as 4-track and 8-track reel-to-reel tape machines became widely adopted in professional studios—such as Ampex models used by major labels—scratch vocals became a standard element of the basic tracking phase. Producers would record rough vocals simultaneously with rhythm instruments like drums, bass, piano, and rhythm guitar on the initial tape pass, using the record head for synchronization, to establish groove and structure before overdubbing final elements.[25] These temporary tracks, often captured on basic microphones to minimize commitment, helped mitigate issues like tape hiss and limited tracks in analog workflows, allowing for iterative builds without derailing the session's momentum. Examples include early rock and pop productions where the scratch vocal influenced the final mix's emotional contour, though it was invariably erased or muted in favor of polished takes.[26] In film and animation production during the analog era, scratch vocals—also known as temporary or guide dialogue—played a crucial role in pre-visualization and synchronization, dating back to the introduction of synchronized sound in the late 1920s and 1930s. With the shift to optical soundtracks on 35mm film, studios like Disney and Warner Bros. recorded provisional voice tracks to guide animators in matching mouth movements (lip-sync) and timing actions to dialogue, using exposure sheets and gang synchronizers for precision.[27] This method was vital in an era of hand-drawn cel animation, where final voice acting occurred post-animation via automated dialogue replacement (ADR) to avoid on-set noise interference, ensuring seamless integration of audio and visuals in features like early Mickey Mouse shorts.[3] The analog constraints of magnetic film stripes and early tape further emphasized the need for these placeholders, preventing costly re-animations due to timing mismatches.

Evolution in digital and AI eras

The advent of digital audio workstations (DAWs) in the late 1980s and 1990s marked a significant shift in the use of scratch vocals, enabling producers to record, edit, and replace temporary vocal tracks with greater precision and efficiency compared to analog tape methods. Tools like Pro Tools, introduced in 1991, allowed for non-destructive editing, multi-track layering, and easy synchronization, making it simpler to capture rough vocal guides during instrumental tracking without committing to final performances early in production.[28] This flexibility reduced the logistical challenges of analog workflows, where splicing tape for corrections was time-consuming, and facilitated collaborative demos by permitting quick iterations on melody and phrasing.[8] In film and animation, the digital era similarly transformed scratch tracks—temporary dialogue or vocal placeholders—through non-linear editing systems that emerged in the 1990s. Software such as Avid Media Composer enabled editors to sync provisional audio with visuals in real-time, aiding pre-visualization and pacing without the constraints of linear film cutting. For instance, in productions like Waterworld (1995), scratch dialogue recorded on set helped guide automated dialogue replacement (ADR) sessions, capturing essential timing and emotional tone amid challenging location noise, ultimately contributing to the film's Oscar-nominated sound design.[3] The integration of artificial intelligence (AI) in the 2020s has further evolved scratch vocals by automating generation and enhancement, minimizing the need for initial human recordings. In music production, AI platforms like Kits AI allow users to convert rough scratch tracks into polished demos through voice conversion models that adjust tone, clarity, and style, as seen in workflows for EDM and pop where placeholder vocals are rapidly prototyped to inspire full arrangements.[29] Similarly, tools such as IK Multimedia's ReSing (released 2025) employ neural networks to replace scratch vocal imperfections with professional AI-modeled performances, preserving the original phrasing while improving pronunciation and timbre.[30] In film sound design, AI text-to-speech systems have streamlined scratch dialogue creation, offering filmmakers cost-effective, customizable voiceovers for early cuts. ElevenLabs' AI tools, for example, generate natural-sounding dialogue from scripts, enabling rapid iteration on narrative flow and character voices during pre-production without scheduling actors, thus saving time and budget on projects with tight timelines.[31] This approach complements traditional scratch methods by providing scalable options for animation animatics or live-action temp tracks, though human oversight remains essential to maintain emotional authenticity.

References

User Avatar
No comments yet.