Mixed raster content


from Wikipedia

Mixed raster content (MRC) is a method for compressing images that contain both binary-compressible text and continuous-tone components, using image segmentation methods to improve the level of compression and the quality of the rendered image.^[1] By separating the image into components with different compressibility characteristics, the most efficient and accurate compression algorithm for each component can be applied.

MRC-compressed images are typically packaged into a hybrid file format such as DjVu and sometimes PDF.^[2] This allows for multiple images, and the instructions to properly render and reassemble them, to be stored within a single file.

Some image scanners optionally support MRC when scanning to PDF. A typical manual states that without MRC, the image is generated in a single process, with text and graphics not distinguished. With MRC, separate processes are used for text, graphics, and other elements, producing clearer graphics and sharper text, at the price of slightly slower processing. MRC is recommended to optimise the scanning of documents with harder-to-read text or lower-quality graphics.^[3] MRC can also reduce the size of the scanned file,^[4] though higher compression using JBIG2 can sometimes lead to character substitution errors in scanned documents.^[5]

File format

MRC (ISO/IEC/ITU)
Internet media type	image/mrc
Magic number	\xFF\xD8
Extended from	JPEG
Standard	ISO/IEC 16485:2000; ITU-T Recommendation T.44 (01/2005)

A form of MRC is defined by international standards bodies as ISO/IEC 16485, or ITU recommendation T.44 (accessible free of charge). It defines a file format with bilevel masks and two data layers in each "stripe" of the image. The mask can be encoded in ITU T.4, JBIG1, or JBIG2, while the images can be JPEG, JBIG1, or run-length encoded color. The format is loosely based on JPEG, with an APP13 segment registered for this purpose.

It is not known whether this file format is actually used, as formats like DjVu and PDF have their own ways of defining layers and masks.^[2]

References

[1] de Queiroz, Ricardo L.; Buckley, Robert R.; Xu, Ming (28 December 1998). Mixed Raster Content (MRC) Model for Compound Image Compression (PDF). Visual Communications and Image Processing '99. SPIE Proceedings. Vol. 3653. doi:10.1117/12.334618. ISBN 978-0-8194-3124-0. Retrieved 2023-11-23.
[2] "The MRC (Mixed Raster Content) Model and DjVu". PlanetDjVu. 10 June 2003.
[3] "Text Formats". Visioneer OneTouch Scanning Guide (PDF). October 2019. p. 33.
[4] "Image Compression". Kofax. Retrieved 11 March 2021.
[5] "Scanning and Compression White Paper" (PDF). Xerox. 2013. Archived from the original (PDF) on 2022-01-21.

External links

Mixed Raster Content (MRC) - Programme of Work Allocated to ISO/IEC JTC 1/SC 29/WG


from Grokipedia

Mixed raster content (MRC) is a compression technique and imaging model designed for compound documents that combine binary elements like text, line art, and graphics with continuous-tone photographic or pictorial content, enabling efficient encoding by segmenting the image into layers and applying specialized compression algorithms to each.^[1] The MRC model employs a three-layer structure to represent the image: a high-resolution binary mask layer that selects between foreground and background pixels on a per-pixel basis; a foreground layer capturing sharp, colorful text and graphics at full resolution; and a background layer holding smoothed, low-resolution continuous-tone elements to reduce data volume.^[2] Segmentation typically involves classifying image blocks (such as 8×8 pixel regions) into categories like text, pictures, or uniform areas using rate-distortion optimization to balance compression efficiency and visual quality.^[2]

Encoding in MRC applies tailored methods to each layer: the mask uses lossless binary coders like JBIG or CCITT Group 4 for precise edge definition without artifacts; the foreground and background leverage lossy algorithms such as JPEG or wavelet-based compression for color data, often at reduced resolutions for the background to minimize bitrate.^[3] This layered approach allows different resolutions and coding schemes within a single page, supporting flexible quality-compression tradeoffs and achieving ratios up to 150:1 for scanned documents while mitigating issues like jagged text edges through techniques such as adaptive error diffusion.^[2]

First standardized as ITU-T Recommendation T.44 in 1999, with a revision in 2005, MRC facilitates efficient processing, interchange, and archiving of mixed-content images in applications including color facsimile (per RFC 2301), PDF optimization, and document scanning software, where it excels at preserving text legibility alongside compact storage of imagery.^[4] Originally proposed in the late 1990s for emerging standards like JPEG 2000 and Internet Fax, MRC has become integral to hyper-compression workflows in imaging SDKs and remains relevant for bandwidth-sensitive environments.^[1]

Introduction

Definition and Purpose

Mixed raster content (MRC) is a compression technique designed for compound images that combine binary-compressible elements, such as text and line art, with continuous-tone components, like photographs or graphics, by employing image segmentation to separate these distinct parts for targeted encoding. This approach addresses the compound image compression problem, where traditional single-algorithm methods fail to handle mixed content effectively.

Conventional compression standards like JPEG apply lossy techniques optimized for photographic images, which introduce artifacts such as blurring and ringing around sharp edges in text and graphics, degrading readability and visual sharpness. In contrast, binary formats like those for text require lossless compression to preserve exact details, but they inefficiently handle continuous-tone areas, leading to larger file sizes. MRC resolves this by segmenting the image into regions suited to different compression strategies, enabling the use of specialized coders for each type of content.

The primary purpose of MRC is to achieve superior compression ratios while maintaining high-quality preservation across diverse content types, reducing file sizes significantly (often by factors of 10 or more) without introducing noticeable artifacts in critical areas like text. This results in efficient storage and transmission for applications involving mixed media, such as scanned documents featuring text overlays on photographic backgrounds, where the text remains crisp and the imagery retains natural tones.^[5] By leveraging a multi-layered representation, MRC optimizes the trade-off between compression efficiency and fidelity, outperforming unified methods in both metrics for compound documents.

Historical Development

The Mixed Raster Content (MRC) model was initially proposed in 1998 by Ricardo L. de Queiroz, Robert R. Buckley, and Ming Xu in their paper presented at the SPIE Electronic Imaging conference, introducing a layered approach to compress compound images combining binary text and continuous-tone elements. This work laid the foundation for separating image components into distinct layers (a binary mask for text and graphics, a foreground for high-detail areas, and a background for smoother regions) to achieve better compression ratios than single-layer methods.

The development of MRC was motivated by the shortcomings of JPEG compression in handling scanned documents during the late 1990s surge in digital archiving, where traditional JPEG struggled with sharp edges in text and graphics overlaid on photographic content, leading to artifacts and inefficient file sizes. As scanning technology proliferated for preserving books, maps, and forms, the need for a hybrid model that leveraged efficient binary coding for text (like JBIG) alongside lossy compression for images became evident, addressing the growing demands of electronic document storage and transmission.

Key milestones in MRC's adoption included its formal standardization as ISO/IEC 16485 in 2000, which defined the format for representing mixed bi-level and multi-level raster pages using ITU-T recommended coding schemes.^[6] Around the same time, the emerging DjVu format integrated MRC-like layered compression techniques, enabling high-quality scanned document distribution over the internet by separating foreground text from background images, with early implementations appearing by 2000.^[7] The ITU-T Recommendation T.44, which specifies the MRC imaging format for efficient processing and archiving, was first issued in 1999 and revised in 2005, incorporating enhancements for multilayer representations.^[8]

Post-2000, MRC evolved through incorporation into JPEG 2000 extensions, particularly in Part 6 (JPEG 2000 Compound Image File Format), which adopted the multilayer MRC model for compound documents starting around 2001 to support scalable, high-fidelity compression. By the 2010s, MRC techniques were integrated into PDF compression tools, enhancing file size reduction for scanned PDFs while preserving text sharpness, as seen in software libraries and document management systems.^[5] No major updates to the core MRC standards have occurred by 2025, though it continues to be supported in software development kits like LEADTOOLS, which provide APIs for MRC encoding and decoding in modern imaging applications.^[9]

Technical Model

Layer Structure

Mixed Raster Content (MRC) utilizes a three-layer imaging model to efficiently represent compound images that combine sharp-edged elements like text and graphics with smoother continuous-tone regions such as photographs or halftones. This structure separates the image into distinct components, allowing for targeted processing and storage while preserving visual fidelity.^[4]

The foreground layer captures sharp-edged elements like text and graphics, potentially including continuous-tone colors, at full or high resolution to preserve detail and sharpness. The background layer holds continuous-tone elements, such as images or textured paper backgrounds, at reduced resolution to efficiently compress smooth areas while maintaining gradations and colors. The mask layer is a bilevel segmentation map, using 1 bit per pixel to indicate whether the foreground (1) or background (0) contributes to each position in the final image.^[4]

During reconstruction, the layers are composited by overlaying the foreground onto the background according to the mask, effectively "pouring" the foreground content through the mask onto the background plane. Because each output pixel is taken from exactly one of the two layers, sharp elements appear crisp while continuous-tone areas remain smooth, without interference between the two. Diagrams of this stacking typically show the binary mask as a full-resolution stencil through which the foreground is laid over the reduced-resolution background to form the complete image.

To optimize memory usage and computational efficiency, the image is divided into horizontal bands known as stripes, which span the full width of the page and allow processing at varying resolutions within each band. Stripes can be of different types, such as one-layer (for uniform continuous-tone regions), two-layer (background and mask), or three-layer (full model), enabling selective application of the model based on local content.

Key properties of the model include the ability for layers to operate at independent resolutions, with the mask typically maintained at the highest resolution to preserve edge accuracy for text, while foreground and background may be subsampled (e.g., by factors of 2 or 4) in non-critical areas, particularly the background for continuous-tone content. This flexibility reduces data volume without significant perceptual loss.^[2] The layered architecture supports independent compression of each component using algorithms suited to their characteristics: JBIG2 for the binary mask, JPEG or wavelet-based methods for the continuous-tone parts of the foreground and background, and JBIG2 for binary elements in the foreground where applicable.^[2]
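The stripe and layer relationships described above can be sketched as a small data model. This is an illustration only: the class names `Layer` and `Stripe`, the `subsample` field, and the helper `layer_for` are hypothetical and are not taken from T.44 or from any library.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Layer:
    """One raster plane of a stripe (hypothetical model, not a T.44 structure)."""
    width: int
    height: int
    subsample: int = 1  # reduction factor relative to the mask resolution


@dataclass
class Stripe:
    """A horizontal band spanning the full page width."""
    height: int  # number of lines at mask resolution
    background: Optional[Layer] = None
    mask: Optional[Layer] = None
    foreground: Optional[Layer] = None

    def kind(self) -> str:
        layers = (self.background, self.mask, self.foreground)
        present = sum(layer is not None for layer in layers)
        if present == 0:
            raise ValueError("a stripe needs at least one layer")
        return {1: "one-layer", 2: "two-layer", 3: "three-layer"}[present]


def layer_for(page_width: int, stripe_height: int, subsample: int) -> Layer:
    """Build a layer whose subsampled grid still covers the whole stripe."""
    # Ceiling division so that a partial trailing sample is kept.
    width = -(-page_width // subsample)
    height = -(-stripe_height // subsample)
    return Layer(width, height, subsample)


# A three-layer stripe: full-resolution mask, foreground at 1/2, background at 1/4.
text_stripe = Stripe(
    height=256,
    background=layer_for(2550, 256, 4),
    mask=layer_for(2550, 256, 1),
    foreground=layer_for(2550, 256, 2),
)
# A one-layer stripe holding only continuous-tone content.
photo_stripe = Stripe(height=100, background=layer_for(2550, 100, 1))
```

Keeping the subsampling factor on each layer, rather than on the stripe, mirrors the point that layers operate at independent resolutions while the mask stays at the highest one.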

Segmentation Process

The segmentation process in Mixed raster content (MRC) aims to decompose an input compound image, such as a scanned document containing text, graphics, and continuous-tone pictures, into distinct layers by identifying high-contrast regions typically associated with text and graphics for the foreground, while assigning smoother pictorial areas to the background. This separation is guided by a binary mask that indicates pixel ownership for each layer, enabling targeted compression of diverse content types. The original MRC model employs region classification to assign uniform masks to text/graphics regions or transition identification to detect edges for mask generation, ensuring sharp preservation of binary elements without degrading continuous-tone quality.

Key techniques for segmentation include adaptive thresholding to produce bilevel masks, edge detection via morphological operations to refine boundaries, and clustering algorithms like the Expectation-Maximization (EM) method, which fits Gaussian mixtures to pixel data in perceptually uniform color spaces such as L*a*b* for robust classification of foreground and background pixels. Block-based processing is commonly used for efficiency, where the image is partitioned into small overlapping blocks (e.g., 8×8 or 16×16 pixels), and local statistics like variance and mean intensity are computed to apply per-block thresholds that minimize a rate-distortion cost function, balancing compression efficiency and reconstruction fidelity. These methods adapt to varying content by classifying pixels based on criteria such as high local contrast for text regions, often incorporating Markov random fields to enforce spatial consistency across blocks.^[10]^[11]

The process unfolds in sequential steps: initially, the image is analyzed in horizontal stripes or blocks to subsample pixels and estimate parameters like cluster means and covariances; next, pixels are classified (e.g., foreground if exceeding a quadratic decision boundary in color space, background otherwise), yielding a preliminary binary mask; this mask then directs the rendering of foreground and background layers by selecting dominant colors or smoothed values, respectively, with optional post-processing like connected component analysis to isolate and refine objects such as embedded graphics. For large images, multi-resolution approaches process a downsampled version first to initialize thresholds, propagating decisions to full resolution for computational efficiency. In practice, scanned pages exemplify this: high-variance blocks in grayscale or color channels signal text blocks via thresholding akin to OCR preprocessing, generating masks that delineate sharp characters while relegating halftone images to the background, as demonstrated in magazine scans where EM clustering accurately extracts photographic objects amid text.^[10]^[11]^[12]

Challenges in segmentation include handling scan noise, show-through artifacts, and ambiguous mixed regions where text overlays faint images, which can lead to misclassification and artifacts like jagged edges. Solutions involve pre-segmentation smoothing with Gaussian filters to reduce noise, adaptive parameter tuning in clustering (e.g., fallback to 1D luminance thresholding if 3D color fails), and global optimization via cost functions that penalize inconsistencies, such as connected component labeling to discard small erroneous foregrounds. These adaptive strategies minimize layer assignment errors, achieving precise masks that enhance overall document fidelity without excessive computational overhead.^[10]^[12]^[11]
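The block-based thresholding idea can be illustrated with a deliberately simplified sketch. It substitutes a fixed variance threshold and non-overlapping blocks for the rate-distortion optimization, EM clustering, and overlapping blocks described above, so it is not the method of the cited papers. The function name `segment_mask` and the parameter `var_threshold` are assumptions made for this example.

```python
def segment_mask(gray, block=8, var_threshold=400.0):
    """Return a binary mask (1 = foreground) for a grayscale image.

    `gray` is a list of rows of 0-255 intensities. Blocks with low
    variance are treated as smooth background; in high-variance blocks,
    pixels darker than the block mean are marked as foreground.
    """
    height, width = len(gray), len(gray[0])
    mask = [[0] * width for _ in range(height)]
    for by in range(0, height, block):
        for bx in range(0, width, block):
            ys = range(by, min(by + block, height))
            xs = range(bx, min(bx + block, width))
            pixels = [gray[y][x] for y in ys for x in xs]
            mean = sum(pixels) / len(pixels)
            variance = sum((p - mean) ** 2 for p in pixels) / len(pixels)
            if variance <= var_threshold:
                continue  # smooth block: leave every pixel as background
            for y in ys:
                for x in xs:
                    if gray[y][x] < mean:
                        mask[y][x] = 1
    return mask


# Two 8x8 blocks side by side: a flat gray area and a white area
# crossed by a one-pixel-wide dark vertical stroke at x = 10.
image = [[200] * 8 + [0 if x == 10 else 255 for x in range(8, 16)]
         for _ in range(8)]
mask = segment_mask(image)
```

In this toy input only the stroke column ends up in the mask; the flat block never reaches the per-pixel test. A real segmenter would add the noise smoothing and connected-component clean-up mentioned above.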

Standards and Formats

ITU-T T.44 Specification

The ITU-T Recommendation T.44 (01/2005) defines the Mixed Raster Content (MRC) imaging format for telefax applications, enabling efficient processing, interchange, and transmission of compound images that combine pictorial and textual elements in document facsimile systems.^[4] This standard supports layered representation to optimize compression for mixed content, primarily targeting black-and-white and color documents in telecommunications environments.^[4] Originally issued in April 1999, with Amendment 1 in February 2000 adding color support, and consolidated in January 2005, it integrates with the T.30 facsimile protocol for real-time transmission.^[4]^[8]

Key requirements of T.44 include support for resolutions up to 400 × 400 dpi, accommodating high-quality document imaging needs.^[13] It mandates a three-layer model consisting of a binary mask layer for segmentation, a foreground layer for text and graphics, and a background layer for continuous-tone elements like images or halftones.^[14] Binary layers, such as the mask, may be encoded using ITU-T T.4 (including MMR), JBIG1, or JBIG2 methods to ensure lossless compression of sharp-edged content.^[14] Continuous-tone layers leverage compatible ITU-T coding schemes, like JPEG for color or grayscale data, while the base mode requires implementation of one to three layers per stripe for compatibility.^[14]

The scope of T.44 is focused on transmission-oriented applications within facsimile networks, distinguishing it from the ISO/IEC 16485 standard, which adopts the same core model but emphasizes storage and interchange without the telecom-specific transmission protocols.^[4] Although not formally standardized, the .mrc file extension is commonly used for MRC files in practice.^[9]

File Format Details

Mixed Raster Content (MRC) files are structured as a series of JPEG-like segments, beginning with the Start of Image (SOI) marker (0xFFD8) followed immediately by an Application (APP13) marker (0xFFED) to delineate MRC-specific content.^[15] This APP13 segment includes a length field and the MRC magic number identifier, which alerts decoders to the presence of MRC data within a JPEG-compatible framework.^[6] The overall file employs a binary stream format where subsequent segments encode page-level and stripe-level information, ensuring compatibility with JPEG decoders while extending functionality for multi-layer raster content.^[16]

The core structure consists of a header segment providing global layer information, such as the number of layers (typically three: mask, foreground, and background) and page dimensions, followed by compressed data streams for each layer.^[17] Pages are divided into horizontal stripes for progressive processing, with each stripe preceded by a header that specifies parameters including stripe height, layer-specific resolutions, coded image width, and height.^[15] The mask layer, which is bi-level and spans the full stripe dimensions, selects between foreground and background contributions; the foreground stream captures high-detail elements like text at higher resolution, while the background handles continuous-tone areas at lower resolution.^[18] These streams are encapsulated as marker segments, allowing flexible integration of various coders without altering the base format syntax.^[19]

The media type image/mrc is commonly used for MRC files, as listed in some MIME type registries for digital preservation and imaging standards.^[20] This MIME type facilitates internet transmission and storage, aligning with RFC specifications for fax and image interchange. MRC files support encapsulation within container formats like PDF and DjVu for multi-page documents and enhanced metadata handling, enabling seamless integration into workflows requiring lossless or lossy modes based on layer-specific coding choices.^[21] In PDF, MRC segments can be embedded as XObjects, preserving the multi-layer model while allowing selective recompression.^[22]

The exact segment layout, including stripe headers with fields for width, height, and resolution, is defined in ISO/IEC 16485:2000, which harmonizes with ITU-T T.44 requirements for marker segment syntax and layer synchronization.^[6] The T.44 specification supports color handling in foreground and background layers through palette and tag mechanisms, including RGB and CMYK color spaces via compatible encodings such as JPEG.^[15] These allow for multi-channel color encoding while maintaining backward compatibility with grayscale modes.^[23]
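The opening bytes described above (SOI, then an APP13 marker with a length field) can be checked with a short sketch. It relies only on the general JPEG marker-segment convention that the two-byte big-endian length counts itself but not the marker. The function name `read_leading_app13` is hypothetical, and the payload in the example is a placeholder rather than the identifier defined by the standard. APP13 segments are also used by other software (Photoshop image resource blocks, for instance), so the marker alone does not prove that a file is MRC.

```python
import struct

SOI = b"\xff\xd8"    # Start of Image marker
APP13 = b"\xff\xed"  # Application segment 13 marker


def read_leading_app13(data: bytes):
    """Return the payload of an APP13 segment directly after SOI, else None."""
    if len(data) < 6 or data[:2] != SOI or data[2:4] != APP13:
        return None
    (length,) = struct.unpack(">H", data[4:6])
    # The length field covers itself (2 bytes) plus the payload.
    if length < 2 or 4 + length > len(data):
        return None
    return data[6:4 + length]


# Synthetic stream with a placeholder payload (not the real MRC identifier).
payload = b"MRC-demo"
stream = SOI + APP13 + struct.pack(">H", len(payload) + 2) + payload + b"\xff\xd9"
found = read_leading_app13(stream)

# An ordinary JFIF file starts with APP0 (0xFFE0) instead and is rejected.
jfif_like = SOI + b"\xff\xe0" + struct.pack(">H", 4) + b"JF"
rejected = read_leading_app13(jfif_like)
```

A real parser would go on to compare the payload against the identifier and fields laid out in ISO/IEC 16485 and T.44, which are not reproduced here.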

Compression Techniques

Encoding Methods

Mixed Raster Content (MRC) encoding applies compression algorithms to the segmented layers to achieve high efficiency while preserving the distinct characteristics of text and imagery. After segmentation produces the foreground, background, and mask layers, each undergoes targeted preprocessing, such as quantization to reduce color depth in continuous-tone areas or downsampling to lower resolution in non-critical regions, before entropy coding is applied to minimize redundancy. This layered approach allows for optimized compression ratios by matching coding methods to content type, as outlined in the foundational MRC model.

The mask layer, a binary representation of text and line art, employs lossless binary compression methods to ensure perfect reconstruction and readability; JBIG2 is the primary standard for its superior performance on textual patterns through pattern recognition and arithmetic coding, while bi-level compression methods such as Group 3 (T.4: MH, MR) or Group 4 (T.6: MMR) serve simpler cases with run-length encoding. The foreground layer, capturing constant or limited colors for textual elements, is compressed losslessly if binary or lossily using JPEG at reduced resolution (e.g., 1/4 or 1/8 of full size) to balance fidelity and size. The background layer, handling continuous-tone pictures and fills, relies on lossy techniques such as JPEG, which applies a block-based discrete cosine transform (DCT) followed by quantization, or JPEG 2000, which uses wavelet-based coding, enabling aggressive compression without impacting overlaid text sharpness.

Key techniques emphasize lossless treatment for the mask (and for the foreground when it is binary) to maintain text integrity, contrasting with lossy background compression that exploits spatial correlations in imagery; multi-stripe encoding divides the page into horizontal bands (typically 256 lines each), allowing independent layer coding per stripe for progressive transmission and random access during decoding. Optional hyper-compression extends this by further subsampling layers or applying advanced filters for ultra-low bit rates in archival scenarios. Entropy coding, such as Huffman or arithmetic, is universally applied across layers post-transformation to achieve final bitstream efficiency.

MRC encoding is reported to deliver compression ratios roughly an order of magnitude better than plain JPEG for compound documents, with benchmarks on mixed text-image pages showing up to 70:1 ratios using JPEG for backgrounds/foregrounds and MMR for masks, compared to JPEG's 5-10:1 at equivalent perceptual quality. In PDF tools implementing MRC, 300 dpi color scans of documents are routinely reduced by 50-80% in file size relative to standard JPEG compression, preserving crisp text without visible artifacts.
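A rough calculation shows how much of the saving comes from the layer split and downsampling alone, before any entropy coder runs. The page size (US Letter at 300 dpi) and the 1/4 subsampling of both color layers are assumptions chosen for this example; the function names are hypothetical.

```python
import math


def raw_layer_bytes(width_px, height_px, bits_per_pixel, subsample=1):
    """Uncompressed size of one layer after optional subsampling."""
    w = math.ceil(width_px / subsample)
    h = math.ceil(height_px / subsample)
    return math.ceil(w * h * bits_per_pixel / 8)


def stripe_count(height_px, stripe_lines=256):
    """Number of horizontal stripes needed to cover the page."""
    return math.ceil(height_px / stripe_lines)


# Assumed page: 8.5 in x 11 in at 300 dpi -> 2550 x 3300 pixels.
WIDTH, HEIGHT = 2550, 3300

flat_rgb = raw_layer_bytes(WIDTH, HEIGHT, 24)                 # single 24-bit image
mask = raw_layer_bytes(WIDTH, HEIGHT, 1)                      # 1 bit per pixel
foreground = raw_layer_bytes(WIDTH, HEIGHT, 24, subsample=4)  # color at 1/4
background = raw_layer_bytes(WIDTH, HEIGHT, 24, subsample=4)  # color at 1/4

layered_total = mask + foreground + background
pre_coding_gain = flat_rgb / layered_total
stripes = stripe_count(HEIGHT)

print(f"flat RGB:      {flat_rgb:>10,d} bytes")
print(f"three layers:  {layered_total:>10,d} bytes")
print(f"reduction before entropy coding: {pre_coding_gain:.1f}x")
print(f"stripes of 256 lines: {stripes}")
```

Under these assumptions the three layers hold about one sixth of the raw data of the flat image. The ratios quoted above additionally depend on how well the binary and continuous-tone coders compress each layer.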

Decoding Process

The decoding process for Mixed Raster Content (MRC) reconstructs the original image from the compressed file by parsing its structure and synthesizing the layered components according to ITU-T Recommendation T.44. The process begins with parsing the file header, which contains essential metadata such as page dimensions, layer resolutions, stripe configurations, and coding parameters for each layer (foreground, background, and selector mask). This header information enables the decoder to interpret the segmented data streams correctly.^[24]

Following parsing, the selector mask, a binary layer typically at high resolution (e.g., 300–600 dpi), is decoded first using methods like JBIG2 or T.4 bi-level coding to identify regions for foreground or background selection.^[24] Next, the foreground layer, which captures sharp elements like text and line art in color or grayscale at reduced resolution, is decoded using JPEG or similar continuous-tone methods. The background layer, representing smoother areas such as images or fills at even lower resolution, undergoes analogous decoding. If resolutions differ across layers, upsampling (e.g., via interpolation) aligns them to the mask's resolution.^[25]^[24]

The core synthesis step involves pixel-wise compositing: for each pixel, the decoder takes the value from the foreground if the mask bit is 1 or from the background if it is 0, and assembles the selected values into the final image raster. This selective reconstruction preserves high fidelity in textual regions while efficiently handling continuous-tone areas.^[25]^[26]

To manage memory, MRC files are organized into horizontal stripes (bands), each containing independent layer segments that can be decoded sequentially in a band-by-band manner. This striped structure supports progressive rendering, allowing partial image display as data arrives, which is particularly useful for transmission over networks.^[24]

Error resilience is inherent in the format, leveraging JPEG's resynchronization mechanisms (e.g., restart markers) for continuous-tone layers and JBIG2's robust bi-level coding for the mask. T.44 also facilitates recovery from partial transmissions by enabling decoding of complete stripes even if others are lost, aiding facsimile and web applications. The output is a full-color or grayscale raster image at the specified resolution, potentially including embedded metadata like color profiles from the header.^[24]
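The upsampling and per-pixel selection steps can be sketched as follows, operating on layers that have already been decoded into rows of pixel values. Nearest-neighbour replication stands in for whatever interpolation a real decoder would use, and the function names and the tuple layout of a stripe are assumptions made for this example.

```python
def upsample_nearest(layer, factor):
    """Replicate each sample `factor` times horizontally and vertically."""
    out = []
    for row in layer:
        wide = [value for value in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def composite_stripe(mask, foreground, background, fg_factor=1, bg_factor=1):
    """Select foreground where the mask bit is 1, background where it is 0."""
    fg = upsample_nearest(foreground, fg_factor)
    bg = upsample_nearest(background, bg_factor)
    # Indexing by mask coordinates crops any overhang from upsampling.
    return [
        [fg[y][x] if bit else bg[y][x] for x, bit in enumerate(row)]
        for y, row in enumerate(mask)
    ]


def decode_page(stripes):
    """Composite stripes top to bottom; each could be displayed on arrival."""
    page = []
    for mask, foreground, background, fg_factor, bg_factor in stripes:
        page.extend(
            composite_stripe(mask, foreground, background, fg_factor, bg_factor)
        )
    return page


# One 4x2 stripe: mask at full resolution, both color layers at 1/2.
mask = [[0, 1, 1, 0],
        [0, 1, 1, 0]]
foreground = [[10, 10]]     # dark text color
background = [[200, 210]]   # light paper tones
stripe = (mask, foreground, background, 2, 2)
page = decode_page([stripe, stripe])
```

Because each stripe is composited independently, a lost or undecodable stripe leaves the others intact, which is the property the recovery behaviour described above depends on.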

Applications

Document Imaging

Mixed raster content (MRC) plays a key role in document imaging by enabling efficient scanning and processing of physical documents that combine text, graphics, and images. In scanner hardware, MRC support facilitates real-time compression during the creation of PDF files, optimizing output quality without requiring post-scan processing. For instance, Visioneer OneTouch scanners incorporate MRC through their software interface, where users can enable the feature in scan properties to separately handle text for sharpness and images for detail preservation.^[27]

A standard workflow for MRC-based document imaging begins with scanning at 300 dpi resolution to capture sufficient detail for readability. The process then involves automatic segmentation to isolate text, photographic, and background elements, followed by layer-specific compression using MRC techniques to generate compact, searchable PDFs. This approach ensures high-fidelity reproduction of mixed-content pages, such as forms with embedded photos or charts.^[28]^[9]

In archiving applications, MRC preserves text sharpness across large collections of scanned documents, making it suitable for long-term preservation while minimizing storage demands in libraries and digital repositories. By layering content and applying targeted compression, MRC reduces file sizes substantially compared to uniform methods, aiding efficient management of vast archives without compromising legibility.^[29]^[30] Enterprise document management systems leverage MRC for handling mixed-content scans, achieving 50-70% size reductions that streamline storage and retrieval in high-volume environments.^[31] Overall, MRC's compression efficiency enhances document imaging by balancing quality and file size, particularly for searchable outputs in professional settings.^[32]

Integration with PDF and DjVu

Mixed raster content (MRC) has been used as an image compression approach in PDF workflows since the release of Adobe Acrobat 6 in 2003, allowing for efficient encoding of compound documents containing both text and continuous-tone elements.^[33] This integration enables MRC to be applied directly to image streams within PDF files, leveraging segmentation to separate textual foreground from background imagery for optimized compression. Tools such as ORPALIS PDF Reducer utilize this capability for hyper-compression, achieving file size reductions of up to 90% for scanned documents without significant loss in visual quality.^[21]^[34]

In PDF implementation, MRC operates through XObject streams embedded with specific markers that denote the layered structure, facilitating selective decoding of foreground and background layers during rendering.^[35] As of 2022, SDKs like Dynamsoft's imaging libraries have incorporated MRC support for PDF generation in mobile scanning applications, enabling seamless compression of captured documents into portable formats. Recent enhancements as of 2024, such as improved MRC compression in Nitro PDF, continue to advance its use in cloud-based storage and processing workflows for archived scans.^[32]^[36]

MRC has formed a core component of the DjVu format since its standardization around 2001, where it structures documents into layers with bitonal text foregrounds encoded via JB2 and continuous-tone backgrounds encoded with the IW44 wavelet codec.^[37] This layered approach in DjVu optimizes files for web distribution, achieving high compression ratios for color scanned documents while preserving sharp text edges and detailed imagery.^[38] The IW44 codec encodes the continuous-tone layers and supports progressive decoding for efficient online viewing.^[7]

Interoperability between MRC and archival standards is enhanced through export mechanisms that generate PDF/A-compliant files, ensuring long-term preservation of segmented document layers in regulated environments.^[39] Scanners and software workflows can thus produce MRC-encoded PDFs that meet PDF/A requirements, maintaining both compressibility and accessibility for institutional archives.^[40]

Evaluation

Advantages and Benefits

Mixed raster content (MRC) compression achieves superior efficiency for compound images by decomposing them into layers: a binary mask selects, pixel by pixel, between a foreground layer for text, line art, and graphics and a background layer for continuous-tone elements. Each layer can then be encoded with the most suitable algorithm, such as JBIG2 for the binary mask and JPEG for the continuous-tone layers.^[26] This layered approach yields compression ratios 5 to 10 times better than standard JPEG for text-heavy documents, reducing a typical 300 dpi color magazine page from several hundred kilobytes in JPEG to 40-60 KB in MRC.^[38] For example, a 1 MB JPEG scan of a mixed document can often be compressed to around 100 KB using MRC while retaining perceptual quality.^[5]

In terms of quality preservation, MRC maintains sharp text edges without the ringing artifacts common in JPEG compression of compound images, because the high-resolution mask layer reconstructs binary elements precisely.^[26] Photographic regions retain fidelity through dedicated continuous-tone encoding, with studies reporting a 45-60% reduction in mean squared error compared with other MRC implementations at equivalent bit rates.^[26] The benefit extends to optical character recognition (OCR), where MRC's clean separation of text layers supports improved accuracy by enhancing contrast and edge clarity without introducing compression-induced noise.^[32]

The reduced file sizes (often under 10% of uncompressed or JPEG equivalents for text-dominant pages) also save bandwidth, which makes MRC particularly suitable for document transmission and storage, including faster delivery over networks such as the fax systems that use ITU-T T.44.

Additionally, MRC scales well to high-resolution documents: the mask layer can operate at full resolution (e.g., 600 dpi) while the background and foreground layers are downsampled, which avoids a proportional increase in file size and allows large scans to be handled efficiently without loss of text quality.^[26]
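The scalability argument above can be made concrete with a small calculation of raw sample counts, before any entropy coding. The sketch assumes a US Letter page and 150 dpi foreground and background layers; both values are illustrative assumptions rather than figures from the sources cited, and `raw_bytes` is a hypothetical helper.

```python
def raw_bytes(width_in: float, height_in: float, dpi: int, bits_per_pixel: int) -> int:
    """Uncompressed size in bytes of a raster covering the page at `dpi`."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    return (width_px * height_px * bits_per_pixel + 7) // 8


PAGE = (8.5, 11.0)  # assumed US Letter page, in inches

# One 24-bit color image at full resolution.
single_layer = raw_bytes(*PAGE, dpi=600, bits_per_pixel=24)

# MRC-style split: 1-bit mask at full resolution, color layers downsampled
# to an assumed 150 dpi.
mask = raw_bytes(*PAGE, dpi=600, bits_per_pixel=1)
foreground = raw_bytes(*PAGE, dpi=150, bits_per_pixel=24)
background = raw_bytes(*PAGE, dpi=150, bits_per_pixel=24)
layered = mask + foreground + background

print(single_layer)            # 100980000
print(layered)                 # 16830000
print(single_layer / layered)  # 6.0
```

Under these assumptions the layered representation starts with one sixth of the raw data while the text edges carried by the mask stay at 600 dpi; the codecs applied to each layer then reduce the size further.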

Limitations and Comparisons

Mixed raster content (MRC) compression incurs higher computational costs than single-layer methods because of its multi-layer segmentation and encoding process. Segmentation errors are a key limitation: inaccuracies in separating text from background can cause text blurring or artifacts in the decoded image, particularly when non-text elements are misclassified as foreground.^[41]^[42] Early versions of the MRC standard, such as the 1999 ITU-T T.44 specification, offered limited color support, focusing primarily on monochrome or basic grayscale layers, with full color extensions added in later amendments such as Mode 4.^[43]

A notable risk arises from the use of JBIG2 compression in the binary mask layer, which can introduce character substitution errors in which similar-looking symbols are swapped, potentially altering the content of scanned documents.^[44] Such errors have been observed in scanner implementations, with error rates reaching up to 1% in challenging documents featuring small fonts or noise.^[45]

In comparisons, MRC outperforms JPEG for compound documents with text and graphics, achieving file sizes up to 8-10 times smaller while preserving sharp edges, but it underperforms on pure photographic images, where the layered approach introduces unnecessary resolution loss in non-text areas.^[5] Against JPEG 2000, MRC delivers similar compression ratios for mixed content but excels specifically on documents by combining JBIG2 for text masks with JPEG 2000 for continuous-tone layers, avoiding the ringing artifacts common in single-layer wavelet compression of edges.^[5] Compared with JBIG2 alone, which is optimized for binary images, MRC adds the ability to handle photographs through its foreground and background layers, though at the expense of added segmentation overhead.^[26]

Overall, MRC strikes a balance between file size reduction and quality preservation for compound images but is less suitable for non-compound content such as uniform photographs, where simpler codecs suffice without the segmentation trade-offs.^[5] As of 2025, MRC remains relevant for compound document compression in SDKs and scanning software, though neural methods are gaining traction for general images.^[9]^[46]
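The character substitution risk discussed earlier in this section can be illustrated with a toy model of lossy symbol matching. This is a simplified sketch and not the JBIG2 algorithm itself: it assumes tiny 3x5 glyph bitmaps and a plain pixel-difference threshold, and the names `hamming` and `encode` are hypothetical.

```python
from typing import List, Tuple

Glyph = Tuple[str, ...]  # rows of '#' (ink) and '.' (paper), all the same size


def hamming(a: Glyph, b: Glyph) -> int:
    """Count the pixels at which two equally sized glyph bitmaps differ."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def encode(glyphs: List[Glyph], max_diff: int) -> Tuple[List[Glyph], List[int]]:
    """Build a symbol dictionary and one dictionary index per input glyph.

    A glyph reuses the first stored symbol that differs in at most
    `max_diff` pixels; otherwise it is added as a new symbol.
    """
    dictionary: List[Glyph] = []
    indices: List[int] = []
    for glyph in glyphs:
        for i, ref in enumerate(dictionary):
            if hamming(glyph, ref) <= max_diff:
                indices.append(i)  # reuse an existing symbol
                break
        else:
            dictionary.append(glyph)
            indices.append(len(dictionary) - 1)
    return dictionary, indices


SIX = ("###", "#..", "###", "#.#", "###")
EIGHT = ("###", "#.#", "###", "#.#", "###")

# Exact matching keeps the two digits distinct.
print(encode([SIX, EIGHT], max_diff=0)[1])  # [0, 1]

# A one-pixel tolerance merges them: the 8 now decodes as a 6.
dictionary, indices = encode([SIX, EIGHT], max_diff=1)
decoded = [dictionary[i] for i in indices]
print(indices)             # [0, 0]
print(decoded[1] == SIX)   # True
```

The two digits differ by a single pixel, so any tolerance loose enough to absorb scanning noise at this glyph size also merges them. The decoded page looks clean, which is why such substitutions are hard to spot.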
