FITS

FITSMain

Community hub

FITS

7 pages, 0 posts

0 subscribers

Recent from talks

Be the first to start a discussion here.

Recent from talks

Be the first to start a discussion here.

Contribute something

About hubMembersContent overviewUpdatesRules

Main reference articles

FITS

View on Wikipedia

from Wikipedia

Not found

Revisions and contributors Edit on Wikipedia Read on Wikipedia

View on Grokipedia

from Grokipedia

The Flexible Image Transport System (FITS) is a digital file format and standard specifically designed for the storage, transmission, analysis, and archival preservation of scientific data in astronomy, supporting multidimensional arrays such as one-dimensional spectra, two-dimensional images, and higher-dimensional data cubes, along with tabular data and extensible metadata via keyword-value headers.^[1]^[2] Developed in the late 1970s by astronomers including Don Wells and Ron Harten to facilitate data interchange between observatories, FITS was first formally defined in 1981 and endorsed by the International Astronomical Union (IAU) in 1982, establishing it as the de facto standard for astronomical data exchange.^[3] Its structure consists of self-describing Header and Data Units (HDUs), where headers use fixed 80-character ASCII records grouped into 2880-byte blocks to store metadata like calibration information and coordinate systems, followed by binary or ASCII data units for the actual arrays or tables, ensuring portability across diverse hardware and software platforms.^[3]^[2] FITS has evolved through community-driven updates managed by the IAU FITS Working Group (IAUFWG), with key milestones including the addition of ASCII and binary table extensions in the 1980s and 1990s, the formalization of World Coordinate System (WCS) standards in the early 2000s for mapping data to physical coordinates, the release of version 3.0 in 2008 which incorporated 64-bit integer support, variable-length arrays, and consolidated prior extensions, and version 4.0 approved in 2016 and formally released in 2018, which integrated additional conventions such as Hierarchical Data Units (HDUs) and checksums into the core standard, all while maintaining backward compatibility under the principle of "once FITS, always FITS."^[3]^[4] Maintained by NASA's HEASARC FITS Support Office since the 1990s, the format is open and non-proprietary, with comprehensive documentation, validation tools, and libraries like CFITSIO available to ensure long-term sustainability and interoperability.^[1]^[2] Widely adopted by major observatories and missions—including the Hubble Space Telescope, Chandra X-ray Observatory, Sloan Digital Sky Survey, and ground-based facilities like the National Radio Astronomy Observatory (NRAO)—FITS underpins virtual observatories and data archives, enabling seamless sharing of petabytes of astronomical datasets.^[2] Beyond astronomy, it has seen limited application in fields like digitizing historical manuscripts, as adopted by the Vatican Library in 2010, highlighting its versatility for scientific imaging and metadata needs.^[2]

History and Development

Origins in Astronomy

In the late 1970s, radio astronomy faced significant challenges due to the proliferation of incompatible data formats across observatories, hindering the exchange of complex datasets such as visibility measurements from interferometers. This fragmentation was particularly acute with the impending operation of the National Radio Astronomy Observatory's Very Large Array (VLA), which required efficient sharing of multi-element interferometric data among institutions for processing and analysis.^[5] The need for a standardized, portable format on magnetic tapes became evident to facilitate collaboration without custom conversion software.^[6] An early prototype for image interchange was developed in 1976 by Don Wells at Kitt Peak National Observatory (KPNO) and Ron Harten at the Netherlands Foundation for Radio Astronomy (NFRA). To address these issues, astronomers Don Wells, then at the National Optical Astronomy Observatory (NOAO), and Eric Greisen at the NRAO developed an initial proposal for the Flexible Image Transport System (FITS) during a meeting at the VLA site on March 27–28, 1979.^[5] Their work built on earlier efforts for image interchange between observatories, refining a tape-based structure capable of handling astronomical images and extending it to support radio data needs.^[6] This collaboration, involving input from NRAO staff like Barry Clark, marked the foundational step toward a community-wide standard.^[5] The initial FITS design emphasized support for multi-dimensional arrays to represent interferometric visibility data, overcoming limitations of proprietary formats that lacked flexibility for such structures in radio observations.^[7] Successful tests of data interchange between NOAO and NRAO in 1979 validated the approach, demonstrating its utility for transporting processed images and raw observations.^[6] FITS saw its first informal adoption in 1981 through discussions within the IAU Commission 5 Working Group on Astronomical Data, where the format was presented and tested for broader use. This paved the way for formal standardization efforts shortly thereafter.^[6]

Standardization Process

The standardization of the Flexible Image Transport System (FITS) began with collaborative efforts among astronomers to address data interchange needs, particularly in radio astronomy where diverse computing environments required a portable format.^[8] In 1981, key proposals were published defining the core FITS structure, including the seminal paper "FITS: A Flexible Image Transport System" by Wells, Greisen, and Harten, which outlined the use of ASCII-encoded headers preceding binary data units to ensure human readability and cross-system compatibility, avoiding issues with varying binary representations on different computers. A companion paper by Greisen and Harten further specified conventions for image data, emphasizing simplicity and extensibility. This initial definition emerged from consensus-building involving representatives from major institutions, including the National Radio Astronomy Observatory (NRAO), the European Southern Observatory (ESO), Kitt Peak National Observatory (KPNO), and the Netherlands Foundation for Radio Astronomy (NFRA), who tested and refined the format through tape exchanges and community discussions to achieve broad agreement on its specifications.^[8] The process prioritized portability, with ASCII headers allowing metadata to be machine-parsable yet editable by hand, a critical feature for early astronomical computing. At the 1982 International Astronomical Union (IAU) General Assembly in Patras, Greece, FITS received formal endorsement as the recommended format for image data interchange, marking its establishment as an official standard.^[8] Governance of FITS transitioned to a dedicated body with the formation of the IAU FITS Working Group (IAUFWG) in 1988 at the IAU General Assembly in Baltimore, under Commission 5 (Astronomical Data), tasked with maintaining the standard, approving extensions, and ensuring community consensus through public reviews and technical panels.^[9] The IAUFWG continues to oversee evolution, registering new conventions and resolving ambiguities via endorsements from bodies like the American Astronomical Society's Working Group on Astronomical Standards (WGAS).^[10] To support implementation, NASA established the FITS Support Office at Goddard Space Flight Center (GSFC) in 1990 as part of the Science Office of Standards and Technology (NOST), providing documentation, testing resources, and assistance for astronomical missions.^[11]

Evolution of Versions

The FITS standard originated as a means to standardize data exchange in astronomy and has undergone several revisions to accommodate evolving needs, with a core principle of backward compatibility ensuring that files conforming to earlier versions remain fully valid and readable by updated software. This iterative development, overseen by the International Astronomical Union (IAU) FITS Working Group, has allowed FITS to remain the de facto format for astronomical data interchange for over four decades.^[8] The initial definition in 1981 laid the groundwork with a simple structure comprising ASCII header records of fixed-length keyword-value-comment cards followed by binary or ASCII data arrays, primarily supporting 2D images for basic astronomical observations. The random groups structure, defined in 1981, handled complex interferometry datasets, where each group pairs a primary data array with associated parameter arrays of varying lengths. In 1988, the format was extended by incorporating support for multiple Header Data Units (HDUs) and conventions for long strings to exceed the 68-character limit in header values using chained keywords. These additions addressed limitations in representing correlated multi-dimensional data from radio telescopes while preserving compatibility with earlier files.^[8] Version 3.0, approved in 2008, further advanced the format's flexibility by adding support for hierarchical groups, enabling logical associations of multiple header-data units (HDUs) across files for complex datasets; variable-length arrays in binary tables to store lists of differing sizes without fixed allocation; and the TDISPn keyword for specifying display formats in tables, improving readability of numerical columns.^[12] These enhancements facilitated better organization of multifaceted astronomical data, such as multi-wavelength observations, without invalidating prior structures.^[13] Version 4.0, approved by the IAU on July 22, 2016, and finalized with language edits in 2018, incorporated eight new features to bolster robustness and versatility, including mandatory checksums (CHECKSUM and DATASUM keywords) for verifying data integrity against corruption, logical data types for custom column definitions in tables, and provisions for non-astronomical extensions to broaden applicability beyond traditional stellar and galactic data.^[8] This version also formalized conventions for tiled compression and time coordinate systems, ensuring continued backward compatibility while adapting to modern computational demands like large-scale simulations and multi-messenger astronomy.^[14]

File Format Specifications

Header Structure

The header structure in FITS files is designed as a series of ASCII text records, each exactly 80 characters long, to ensure portability and readability across different systems. These records form key-value pairs that describe the associated data, with the keyword occupying positions 1-8, followed by an equals sign and value in positions 10-80, or a comment starting with a slash in position 11.^[8] The header must conclude with an "END" keyword, which has no associated value and is padded with spaces in positions 9-80, marking the logical end of the header block; the entire header, including padding to complete 2880-byte blocks if necessary, precedes the data unit.^[8] This fixed-record format originated from early tape storage constraints but has proven robust for modern file interchange.^[15] Certain keywords are mandatory in the primary header to define the file's conformance and data characteristics. The "SIMPLE" keyword, the first in the primary header, must be set to logical true (T) to indicate adherence to the FITS standard; its absence or false value renders the file non-conformant.^[8] The "BITPIX" keyword specifies the data type and number of bits per pixel or element, with allowed integer values of 8 (8-bit unsigned integer), 16 (16-bit signed integer), 32 (32-bit signed integer), 64 (64-bit signed integer), -32 (32-bit IEEE single-precision floating point), or -64 (64-bit IEEE double-precision floating point).^[8] "NAXIS" gives the number of dimensions in the data array, ranging from 0 (for no data) to 999, while the "NAXISn" keywords (for n=1 to NAXIS) provide the length of each axis as positive integers.^[8] Duplicate occurrences of these mandatory keywords result in an indeterminate value, potentially invalidating the header.^[8] In the overall file composition, each Header Data Unit (HDU) begins with such a header, which describes and is immediately followed by the corresponding data block, allowing FITS files to chain multiple HDUs.^[8] Headers must use only printable ASCII characters (32-126); inclusion of control characters or the delete character (127) invalidates the header.^[8] According to the standard, files with invalid headers are considered non-conformant, and conforming software may reject them or attempt recovery, though no specific recovery mechanism is mandated.^[8] This structure ensures self-describing files while maintaining strict formatting rules for interoperability.^[15]

Primary and Extension Data Units

The Flexible Image Transport System (FITS) organizes data into modular components known as Header Data Units (HDUs), which allow for a structured, extensible file format suitable for astronomical datasets. The primary HDU serves as the initial and mandatory component of any FITS file, consisting of a header followed by an optional primary data array, typically representing the main image or array data along with global metadata describing the file's overall content.^[16] This primary array is often a multidimensional image, such as a 2D pixel array for astronomical observations, and its header must include required keywords like SIMPLE (indicating conformance to the standard), BITPIX (specifying the data type), NAXIS (number of dimensions), and NAXISn (size of each dimension).^[16] Following the primary HDU, FITS files may include one or more extension HDUs to append supplementary data, enabling multi-extension files (MEFs) that support complex datasets beyond a single image. Each extension HDU begins with a header that includes the XTENSION keyword to specify its type, such as IMAGE for additional arrays, TABLE for ASCII tables, or BINTABLE for binary tables, allowing diverse data structures to be associated with the primary content.^[16] These extensions are optional but conform to registered standards, ensuring interoperability across astronomical software tools.^[16] The overall file structure concatenates these HDUs sequentially, with each HDU's header and data array padded to complete 2880-byte logical blocks—a fixed size chosen for historical compatibility with magnetic tape storage and to facilitate byte-stream portability across systems.^[16] This block alignment ensures that HDUs start at block boundaries, preventing parsing errors during file transfer or reading. For navigation within HDUs, particularly those involving grouped or heap-based data, headers include the PCOUNT keyword (indicating the number of bytes following the regular data array, such as heap storage in binary tables) and GCOUNT (specifying the number of groups or repeated units, typically set to 1 for standard single-array extensions).^[16] These keywords enable software to correctly interpret the extent and organization of the data payload.^[16]

Keyword Conventions

FITS header keywords provide metadata about the data units in a file, adhering to strict conventions to promote interoperability across astronomical software and observatories. Each keyword record is an 80-character fixed-length string, consisting of an 8-character keyword name in columns 1-8 (left-justified and padded with spaces if necessary), followed by an equals sign in column 9 (optionally preceded by spaces), the value starting in column 11, and an optional comment beginning with a slash (/) after the value.^[8] The keyword name must begin with a letter (A-Z) and may include uppercase letters, digits (0-9), hyphens (-), or underscores (_), ensuring compatibility with parsing tools.^[8] Keyword values support four primary types: strings enclosed in single quotes (up to 68 characters, with continuation via the CONTINUE keyword for longer strings), logical values 'T' (true) or 'F' (false), integers in signed decimal form, and floating-point numbers in fixed or exponential notation.^[8] These types allow precise description of data properties, such as dimensions or units, while the optional comment provides human-readable context without affecting machine parsing.^[8] Certain keywords are reserved by the FITS standard to describe specific aspects of the data, particularly for coordinate systems. For the World Coordinate System (WCS), keywords like CTYPEn specify the type of coordinate along axis n (e.g., 'RA---TAN' for right ascension in tangent projection), while CRVALn defines the reference value for that axis in the coordinate system.^[8] Other reserved keywords, such as NAXISn for array dimensions, ensure standardized interpretation of image or table structures across extensions.^[8] FITS keyword conventions emphasize flexibility within constraints: names are case-insensitive, conventionally written in uppercase for readability.^[8] The HIERARCH convention extends this by prefixing long or hierarchical names with 'HIERARCH', allowing names beyond 8 characters (e.g., HIERARCH ESO TEL = 'VLT'), which is parsed by treating the remainder as the full keyword.^[17] Special commentary keywords include COMMENT for general notes and HISTORY for chronological records of file modifications; both lack values (using '=' without a following value) and permit duplicates to accumulate information without overwriting prior entries.^[8] Validation rules enforce reliability: each record must exactly 80 characters, with no truncation or padding beyond the specified format.^[8] Duplicate keywords are prohibited for most cases to avoid ambiguity, except for COMMENT and HISTORY, where multiples are explicitly allowed; if duplicates occur with conflicting values, the interpretation is indeterminate, underscoring the need for compliant software.^[8] These conventions, defined in the IAU-approved FITS standard, facilitate seamless data exchange in astronomical workflows.^[8]

Data Types and Extensions

Image and Array Data

In the Flexible Image Transport System (FITS), image and array data are stored as raw binary arrays immediately following the associated header in either the primary Header Data Unit (HDU) or image extension HDUs. These arrays represent multi-dimensional datasets, such as 2D images or higher-dimensional data cubes, with the data encoded as a continuous bit stream.^[18] The data type and precision of the array elements are specified by the required BITPIX keyword in the header, which indicates the number of bits per data value and whether the values are integers or floating-point numbers. Valid BITPIX values include 8 for 8-bit unsigned integers, 16 for 16-bit signed integers, 32 for 32-bit signed integers, 64 for 64-bit signed integers, -32 for 32-bit IEEE single-precision floating-point, and -64 for 64-bit IEEE double-precision floating-point. Writers should select a BITPIX value appropriate to the range, accuracy, and form of the data, with integer types (positive BITPIX) often used for compact storage of quantized values and floating-point types (negative BITPIX) for continuous data requiring higher precision.^[18] To enable representation of physical values beyond the native range of integer data types—such as unsigned integers or scaled quantities—FITS supports linear scaling via the optional BSCALE and BZERO keywords. The physical (or true) value of each array element is computed as:

\text{physical value} = BZERO + BSCALE \times \text{array value}

The defaults are BSCALE = 1.0 and BZERO = 0.0, meaning no scaling is applied unless these keywords are present. For example, to store unsigned 16-bit integers in a signed 16-bit array, BZERO is set to 32,768 and BSCALE to 1.0.^[18] The dimensionality of the array is defined by the NAXIS keyword, which specifies the number of axes (from 0 for a null array to a maximum of 999), with each axis length given by the corresponding NAXISn keywords (where n ranges from 1 to NAXIS). This structure supports N-dimensional arrays, including 1D spectra, 2D images, and 3D or higher data cubes for volumetric or spectral imaging data, though practical limits are imposed by software and storage constraints rather than the standard itself.^[18] All multi-byte numeric values in FITS arrays, including integers and floating-point numbers, are stored in big-endian byte order (also known as network byte order), with the most significant byte first to ensure portability across diverse hardware architectures. The array elements are ordered in a Fortran-like convention, where the first axis (NAXIS1) varies most rapidly in memory.^[18]

Binary and ASCII Tables

FITS tables provide a mechanism for storing tabular data within extension headers, distinct from the multidimensional arrays used for images. Binary tables and ASCII tables represent the two primary formats for these extensions, enabling the representation of heterogeneous data in rows and columns. Binary tables offer a compact, efficient storage for numerical and complex data types, while ASCII tables prioritize human readability and broad portability across systems. Binary tables are defined by the XTENSION = 'BINTABLE' keyword in the header and use a binary data structure with BITPIX = 8 to indicate byte-level encoding.^[18] The table consists of a two-dimensional array where NAXIS = 2, with NAXIS1 specifying the number of bytes per row and NAXIS2 the number of rows.^[18] The number of columns is given by the TFIELDS keyword, limited to a maximum of 999 fields.^[18] Each column has a fixed width determined by the TFORMn keyword, which encodes the data type and repeat count; for example, '1J' denotes a single 32-bit integer occupying 4 bytes.^[18] Supported data types include logical (L), bit (X), unsigned byte (B), signed integers (I, J, K), single-precision float (E), double-precision float (D), complex numbers (C, M), and character strings (A), with platform-independent big-endian byte order and IEEE-754 floating-point representation.^[18] Variable-length arrays are accommodated through a supplemental heap area, referenced by the PCOUNT keyword for its size and THEAP for offsets via pointer descriptors (P or Q in TFORMn).^[18] Optional keywords such as TTYPEn for field names, TUNITn for units, TSCALn and TZEROn for linear scaling, TNULLn for undefined values, and TDIMn for multidimensional interpretations enhance usability.^[18] In contrast, ASCII tables are specified by XTENSION = 'TABLE' and store data as a character-based array with BITPIX = 8 and NAXIS = 2, where NAXIS1 is the number of characters per row and NAXIS2 the number of rows.^[18] Like binary tables, TFIELDS defines up to 999 columns, but positions are fixed by the TBCOLn keyword, which indicates the starting character position (1-based) for each field, with rows padded to uniform length using spaces.^[18] The TFORMn keyword specifies field width and type, such as 'A10' for a 10-character string, 'I10' for a 10-digit integer, 'F12.3' for a fixed-point float with 12 total digits and 3 decimals, 'E12.3' for exponential notation, or 'D12.3' for double-precision.^[18] Data must conform to 7-bit ASCII encoding, limiting types to text representations of numbers and strings, with no support for binary-specific formats like bits or complexes.^[18] Optional keywords mirror those in binary tables, including TTYPEn, TUNITn, TSCALn, TZEROn, TNULLn, and TDISPn for display formatting, but PCOUNT must be 0 since no heap is used.^[18] Both table types support random access to individual rows due to their fixed row lengths, facilitating efficient querying in large datasets, though the maximum number of rows is constrained by the overall file size limits of the FITS format.^[18] Binary tables excel in efficiency for numerical computations and storage of large volumes of data, as their compact encoding reduces file sizes and enables faster I/O operations compared to the verbose text-based ASCII format.^[18] Conversely, ASCII tables enhance portability by being human-readable and interpretable without specialized software, making them suitable for simple data exchange across diverse platforms, albeit at the cost of increased file size and processing overhead.^[18] The choice between them depends on the balance between performance needs and accessibility requirements in astronomical data handling.^[18]

Specialized Extensions

Specialized extensions in the FITS format enable the representation of complex astronomical data structures tailored to specific observational domains, building upon the core image and table capabilities to accommodate multidimensional or domain-specific datasets. These extensions are formally registered through the IAU FITS Working Group and adhere to the overarching FITS standard, ensuring interoperability across astronomical software and archives. They include mechanisms for handling interferometric visibilities, spherical sky mappings, coordinate projections, data integrity checks, and multi-beam observations in radio astronomy.^[4] The Random Groups structure serves as a specialized primary array format designed primarily for radio interferometry data, where each group consists of a fixed-length parameter array followed by a variable-length data array, facilitating the storage of visibility amplitudes and phases along with associated UV coordinates. This extension uses the GROUPS = T keyword in the header, with NAXIS1 = 0 indicating the absence of a primary data array, and GCOUNT specifying the number of groups while PCOUNT defines the parameter array length. Originally developed for UV FITS in interferometry applications, it allows efficient packing of complex-valued visibilities with metadata like baseline indices and integration times, though it has been deprecated for new uses outside radio interferometry since FITS version 3.0 due to the preference for binary tables in modern implementations.^[19]^[3]^[20] The Hierarchical Equal Area isoLatitude Pixelization (HEALPix) convention extends FITS to represent data on a spherical sky, particularly for full-sky maps such as cosmic microwave background (CMB) radiation intensity or galaxy surveys, by dividing the sphere into equal-area pixels arranged in a hierarchical structure with 12 base pixels subdivided into isosceles triangles. HEALPix FITS files typically employ binary table extensions where each row corresponds to a pixel, with columns for nested or ring-ordered indices, pixel values, and optional metadata like pixel weights or healpix order (Nside parameter, which determines resolution as 12 * Nside^2 pixels). This format supports efficient querying and visualization of spherical data, with headers including keywords like PIXTYPE = 'HEALPIX', ORDERING = 'RING' or 'NESTED', and COORDSYS for equatorial or galactic coordinates, making it a standard for missions like Planck and WMAP.^[21] World Coordinate System (WCS) extensions provide a framework for mapping pixel coordinates to physical celestial coordinates, using a set of standardized header keywords to describe projections, distortions, and transformations in multidimensional images. Central to this are the CTYPE keywords, which specify the coordinate type and projection for each axis; for example, 'RA---TAN' denotes right ascension in a gnomonic (tangent plane) projection, while 'DEC--SIN' indicates declination in an orthographic (sine) projection, enabling accurate astrometry for images from optical to radio telescopes. Additional keywords like CRPIX (reference pixel), CRVAL (reference value), CDELT (scale), and PV (projection parameters) or DIST (distortion) support linear and non-linear transformations, with the full WCS standard encompassing up to 72 keywords for complex cases like spectral or time axes. These extensions, formalized in FITS version 3.0 and refined in subsequent papers, ensure precise alignment of data across instruments and surveys.^[22]^[23] Introduced in FITS version 4.0, the Checksum extensions incorporate DATASUM and CHECKSUM keywords into headers to verify data integrity against corruption during storage or transmission, using 32-bit one's complement checksums computed over the entire Header-Data Unit (HDU). The DATASUM keyword stores a decimal representation of the checksum for the data portion (excluding headers), calculated by summing all 8-bit bytes modulo 2^32, while CHECKSUM holds the value that, when added to the header's checksum (including padding to 2880-byte blocks), yields zero for the full HDU. This mechanism, proposed in 2001 and ratified in 2016, applies recursively to extensions and supports verification tools without altering data content, enhancing reliability for large astronomical datasets archived at facilities like MAST or ESO.^[24]^[8] The Multi-Beam FITS (MBFITS) convention, registered in 2007, standardizes raw data from single-dish millimeter and sub-millimeter telescopes equipped with multi-beam receivers, such as those on the Atacama Pathfinder Experiment (APEX), by organizing observations into binary table extensions with columns for time, beam position, frequency, and spectral data. It employs keywords like MBFITS = T, BEAMCNT (number of beams), and IFCNT (intermediate frequency channels) to describe the array geometry and calibration, accommodating variable-length arrays for spectra per beam and integration. This extension facilitates the interchange of multi-pixel heterodyne data for molecular line surveys and continuum mapping, with headers detailing receiver configurations to ensure compatibility across pipelines like those at ESO or IRAM.^[25]^[26]

Implementation and Tools

Programming Libraries

CFITSIO, developed by the NASA Goddard Space Flight Center starting in 1995 and currently at version 4.6.3 (as of September 2025), serves as a core open-source library written in C (with Fortran bindings) for reading, writing, and manipulating FITS files. It provides essential APIs for accessing and modifying Header Data Units (HDUs), including primary images, binary tables, and ASCII tables, while supporting advanced features like data compression algorithms (e.g., Rice and GZIP) and integration with World Coordinate System (WCS) metadata for spatial transformations.^[27] Widely used as a foundation for higher-level tools, CFITSIO emphasizes portability across platforms and includes utilities for error recovery during file operations.^[27] The Astropy library, a community-driven Python package launched in 2013, incorporates the io.fits module as a high-level interface for FITS handling, building on CFITSIO for underlying I/O operations. This module facilitates seamless reading and writing of FITS data with automatic validation of headers and data integrity, while integrating directly with NumPy arrays to enable efficient in-memory manipulation of multidimensional image and table data.^[28] Astropy's design prioritizes ease of use for scientific workflows, offering convenience functions like fits.info() for summarizing file contents and support for lazy loading to handle large datasets without full immediate decompression.^[28] FITSIO represents a contemporary option in Rust, providing a safe, memory-efficient wrapper around CFITSIO bindings tailored for systems programming needs. Released as an open-source crate, it emphasizes performant parsing with zero-copy access to FITS data structures, allowing direct views into file buffers to minimize overhead in high-throughput applications.^[29] This library supports reading and writing HDUs, headers, and tables while leveraging Rust's ownership model to prevent common errors like buffer overflows during FITS operations.^[29] Support for FITS extends to other languages through dedicated bindings and native implementations. In Java, the nom.tam.fits library offers a pure-Java API for comprehensive FITS file I/O, including HDU creation, header editing, and data extraction, without relying on external C dependencies.^[30] IDL provides built-in procedures such as READFITS and MRDFITS for importing FITS images and tables into IDL arrays, supplemented by the IDLASTRO library for extended utilities like header querying.^[31]^[32] Similarly, MATLAB includes native functions like fitsread and fitswrite for loading and saving FITS data as matrices, with low-level access via the matlab.io.fits package for fine-grained control over file pointers and extensions.^[33]^[34] These libraries share key features to ensure reliability in astronomical data processing, such as robust error handling for non-conformant or corrupted FITS files—often through configurable warning levels and partial read capabilities—and full compliance with the FITS standard up to version 4.0, which includes enhancements for hierarchical groups and checksums.^[27]^[28]^[4] This standardization allows developers to perform cross-language operations on FITS structures like HDUs while maintaining data fidelity.^[4]

Viewing and Analysis Software

SAOImage DS9, developed by the Smithsonian Astrophysical Observatory, is a widely used multi-frame astronomical imaging and data visualization application first released in 1999. It supports the display and analysis of FITS images and binary tables, enabling users to handle multiple frame buffers simultaneously for comparative viewing. Key features include World Coordinate System (WCS) overlays for accurate positional mapping, region-of-interest analysis for measuring fluxes and statistics within defined shapes, and an extensible scripting interface using XPA (X Public Access) for automation and integration with other tools.^[35]^[36] FITS Liberator, a collaborative project by the European Space Agency (ESA), European Southern Observatory (ESO), and NASA, was initially released in 2004 as a plugin for Adobe Photoshop and Illustrator to facilitate the processing of raw astronomical FITS data into enhanced color images. It allows users to adjust scaling, stretch functions, and color mapping directly on FITS headers and data, producing publication-ready TIFF outputs without altering the original files. The software supports batch processing and is optimized for large images, making it accessible for both professional astronomers and outreach efforts to create visually compelling representations of telescope observations.^[37] Aladin Desktop, part of the Aladin Sky Atlas suite developed by the Centre de Données astronomiques de Strasbourg (CDS) since its first distribution in 1997, serves as an interactive tool for exploring the sky by overlaying FITS images onto large-scale surveys. It integrates seamlessly with astronomical databases like Simbad and VizieR, supporting over 20,000 data collections including SDSS and Gaia, and handles FITS files alongside other formats without size limits. Users can perform tasks such as zooming, panning, contour plotting, and photometric measurements, with support for various projections and coordinate systems to aid in source identification and multi-wavelength analysis.^[38]^[39] TOPCAT, a Java-based interactive viewer for tabular data developed by Mark Taylor and first released in 2003, excels in the visualization and manipulation of FITS binary and ASCII tables common in astronomical catalogs. It offers facilities for sorting, subsetting, and defining new columns via an expression language, alongside plotting capabilities in one to three dimensions with options like linear, logarithmic, and asinh scaling. Cross-matching between tables is a core strength, employing algorithms for spatial and parameter-based joins to facilitate correlation studies across datasets.^[40] For web-based access, JS9 provides a browser-compatible astronomical image display tool that supports loading and analyzing FITS images, tables, and multi-extension files via drag-and-drop or URL input. Developed as a lightweight successor to DS9, it enables panning, zooming, colormap adjustments, and basic analysis like region statistics and RGB compositing directly in web environments, with options for server-side extensions and API integration for custom applications.^[41]

Validation and Conversion Tools

Validation tools for FITS files ensure compliance with the standard by examining header keywords, data structures, and overall file integrity. The primary standalone validator developed by NASA's Goddard Space Flight Center (GSFC) is fitsverify, a command-line utility that rigorously tests FITS files against the requirements outlined in the FITS Standard.^[42] Fitsverify scans headers for mandatory keywords like END, verifies their order, data types, and formatting, and flags violations such as illegal characters or deprecated usage.^[42] It also assesses data integrity, including table column widths, variable-length arrays, and HDU (Header Data Unit) structure, categorizing issues as errors (e.g., missing END keyword) or warnings (e.g., non-standard keywords).^[42] When invoked with options like -tchksum and -testdata, fitsverify validates checksums to detect corruption in headers or data.^[42] A web-based version of fitsverify is available through the FITS Support Office for uploading and scanning files without local installation.^[43] Conversion tools facilitate interoperability by exporting FITS data to other formats, preserving metadata where possible. Funtools, a multi-format toolkit built on the CFITSIO library, supports reading FITS binary and ASCII tables and converting them to text-based outputs like ASCII or CSV for easier analysis outside astronomical software.^[27]^[44] For geospatial applications, the Geospatial Data Abstraction Library (GDAL) handles FITS images with World Coordinate System (WCS) headers, enabling conversion to GeoTIFF via the gdal_translate command, which maps FITS arrays to raster layers while retaining georeferencing for latitude-longitude projections.^[45]^[46] GDAL's FITS driver, dependent on CFITSIO, also supports multi-extension FITS (MEF) files and binary tables as vector layers since version 3.2.^[45] These tools enforce rules from the FITS 4.0 standard, particularly the use of checksum keywords for integrity verification. The CHECKSUM keyword provides a 32-bit one's complement checksum over the entire HDU, represented as a 16-character ASCII string that sums to negative zero when validated, while DATASUM computes a 32-bit ones' complement checksum of the data portion alone, including padding.^[8] Validation involves recalculating these values and comparing them to stored keywords; mismatches indicate corruption, and tools like fitsverify can automate this process to ensure files remain reliable through processing steps.^[42]^[8] Including these keywords is strongly recommended for public FITS files to support error detection during archival or transmission.^[8] Common issues addressed by these tools include endianness mismatches and padding errors, which can arise during file creation or transfer across systems. FITS files mandate big-endian byte order for multi-byte data, but reading on little-endian architectures without proper swapping—handled automatically by libraries like CFITSIO—can corrupt numerical values in tables or images.^[47] Padding errors occur when HDUs do not fill complete 2880-byte logical blocks with zeros after the END keyword or data array, violating the standard and potentially causing read failures; validators detect such anomalies to prevent downstream processing issues.^[48]^[8]

Adoption and Applications

Use in Astronomical Observatories

The Flexible Image Transport System (FITS) serves as the de facto standard for data archiving and exchange in major astronomical observatories, enabling the storage and distribution of observational data from both space- and ground-based telescopes. Since the launch of the Hubble Space Telescope (HST) in 1990, all HST images and associated data products have been archived in FITS format through the Mikulski Archive for Space Telescopes (MAST), facilitating long-term preservation and scientific analysis of ultraviolet, optical, and near-infrared observations. This adoption has ensured compatibility across diverse instruments, such as the Advanced Camera for Surveys (ACS) and Wide Field Camera 3 (WFC3), with raw and calibrated datasets readily accessible in multi-extension FITS files. In radio astronomy, FITS is integral to handling raw and reduced data from interferometric arrays. The Atacama Large Millimeter/submillimeter Array (ALMA) delivers both raw visibility data—initially in Algebraic Serial Data Markup (ASDM) format—and reduced products, including calibrated images and cubes, converted to FITS for analysis, supporting multi-wavelength studies of molecular gas and star formation. Similarly, the Karl G. Jansky Very Large Array (VLA) processes raw interferometric visibilities through the Common Astronomy Software Applications (CASA) package, outputting reduced data in FITS format to map sky positions and frequencies across bands from 200 MHz to 50 GHz. These formats preserve the complex metadata required for interferometry, such as antenna positions and phase calibrations. European Southern Observatory (ESO) facilities and the Gemini Observatory standardize pipeline outputs to FITS for multi-wavelength datasets, ensuring interoperability in optical, infrared, and spectroscopic observations. ESO instruments like the Multi-Unit Spectroscopic Explorer (MUSE) produce raw and processed data in multi-extension FITS files, with headers containing detailed provenance for query and reduction. Gemini pipelines for instruments such as the Gemini Multi-Object Spectrograph (GMOS) and Near-Infrared Integral Field Spectrometer (NIFS) generate calibrated spectra and images in FITS, supporting collaborative multi-wavelength campaigns across its northern and southern telescopes. Archival systems worldwide rely on FITS for hosting queryable datasets, promoting open access to observatory outputs. MAST provides FITS-based HST and other mission data through web interfaces, allowing searches by coordinates, object names, or metadata. The Canadian Astronomy Data Centre (CADC) archives FITS files from surveys like the Canada-France-Hawaii Telescope Legacy Survey, enabling programmatic access via tools like Astroquery. The ESO Archive stores complete FITS headers for all telescope observations, supporting advanced queries for raw and reduced products from facilities like the Very Large Telescope (VLT). The widespread adoption of FITS has enabled seamless data sharing across virtually all major astronomical observatories, fostering global collaborations in over 100 institutions by standardizing formats for diverse instruments and wavelengths. This interoperability underpins large-scale projects, such as time-domain surveys and multi-facility follow-ups, without the need for extensive format conversions.

Integration with Data Pipelines

The Flexible Image Transport System (FITS) serves as a foundational format for input/output operations in legacy astronomical data reduction pipelines, such as IRAF and its Python interface PyRAF, where it facilitates essential tasks like image calibration and stacking.^[49]^[50] In IRAF, the FITS kernel enables seamless reading, modification, and creation of FITS files as primary images or extensions, supporting workflows that process raw observational data into calibrated products.^[51] PyRAF extends this capability into Python environments, allowing astronomers to integrate FITS handling with scripting for automated batch processing of multi-extension files common in optical and near-infrared imaging.^[50] Modern pipelines leverage FITS through libraries like AstroPy and the Common Astronomy Software Applications (CASA) for efficient data processing in optical and radio astronomy, respectively. AstroPy's io.fits module provides robust tools for opening, inspecting, and manipulating FITS headers and data arrays, enabling Python-based pipelines to perform end-to-end analysis from raw frames to derived products.^[28] CASA, developed for interferometric data from facilities like the Atacama Large Millimeter/submillimeter Array (ALMA), imports FITS images for calibration and exports results in FITS format, supporting tasks such as visibility-to-image conversion while maintaining compatibility with FITS metadata standards.^[52]^[53] In typical workflows, FITS I/O underpins core reduction steps, including bias subtraction to remove detector readout noise, flat-fielding to correct pixel sensitivity variations, and astrometry to align images with celestial coordinates.^[54]^[55] These operations rely on FITS headers to store instrumental parameters like exposure time and gain, ensuring provenance tracking across pipeline stages. For instance, AstroPy routines can extract bias frames from FITS extensions, subtract them from science images, and update headers with processing history, while CASA applies similar corrections in radio contexts by converting FITS inputs to internal measurement sets.^[28]^[52] Handling large-scale datasets presents challenges for FITS integration, particularly with surveys like the Legacy Survey of Space and Time (LSST) at the Rubin Observatory, which generates approximately 20 terabytes of raw data nightly—as of its early operations starting in October 2025—and is expected to accumulate around 500 petabytes after processing overall in FITS-compatible formats for raw exposures and coadded images.^[56]^[57]^[58]^[59] Pipelines must address issues like file I/O bottlenecks and storage overhead for multi-gigabyte FITS files, often using distributed computing to parallelize processing while preserving FITS's extensible structure for metadata.^[60] The FITS 4.0 standard enhances pipeline reliability through formalized checksum conventions, including the CHECKSUM and DATASUM keywords, which verify HDU integrity against corruption during automated transfers and modifications.^[8]^[25] This incremental update mechanism allows pipelines to detect and recover from errors in high-throughput environments, reducing downtime in observatories processing vast datasets.^[8]

Extensions Beyond Astronomy

While primarily developed for astronomical data, the Flexible Image Transport System (FITS) has been adapted for use in other scientific domains, leveraging its support for multidimensional arrays, tables, and extensible metadata.^[8] In geospatial applications, FITS integrates with libraries like the Geospatial Data Abstraction Library (GDAL), enabling handling of multi-spectral raster data from satellite imagery.^[45] For instance, GDAL's FITS driver supports arbitrary image types and georeferencing via World Coordinate System (WCS) keywords, including latitude-longitude projections and planetary extensions for three-dimensional datums, facilitating processing of non-astronomical Earth observation data such as multi-band satellite scenes.^[45]^[61] In particle physics, FITS finds application through integration with analysis frameworks like ROOT, developed at CERN for handling large-scale experimental data.^[62] ROOT's TFITS interface, built on the CFITSIO library, allows reading and writing FITS files containing images (multidimensional arrays) or tables (row-column structures with mixed data types), which is useful for storing histograms, spectra, and data cubes from detector outputs.^[62] This enables seamless interchange of complex datasets in high-energy physics experiments, where FITS headers provide human-readable metadata to describe data provenance and parameters.^[62] FITS version 4.0 enhances adaptability for non-astronomical contexts by permitting conforming extensions for novel data organizations that cannot fit existing types, such as variable-length arrays in binary tables or compressed datasets.^[8] Custom keywords can be added to headers for domain-specific needs, including non-WCS coordinate descriptions (e.g., via CNAME for spectral axes or user-defined 8-character names), while long-string values are supported through the CONTINUE keyword for detailed annotations.^[8] Examples include snapshots of relational databases using an empty primary Header Data Unit (HDU) followed by table HDUs, demonstrating FITS's portability for tabular scientific data beyond imaging.^[8] Despite these features, FITS adoption remains limited outside astronomy and related sciences due to its specialized structure, which prioritizes self-describing scientific datasets over general-purpose raster formats.^[2] Its emphasis on multidimensional arrays and extensible headers suits niche applications like geospatial multi-spectral analysis or physics event data but lacks broad interoperability with commercial imaging standards.^[2]

History

FITS

Recent from talks

Recent from talks

Contribute something

Contribute something

Media Pages

Timelines

Articles

Notes collections

Notes

Notes

Days in Chronicle

FITS

FITS

History and Development

Origins in Astronomy

Standardization Process

Evolution of Versions

File Format Specifications

Header Structure

Primary and Extension Data Units

Keyword Conventions

Data Types and Extensions

Image and Array Data

Binary and ASCII Tables

Specialized Extensions

Implementation and Tools

Programming Libraries

Viewing and Analysis Software

Validation and Conversion Tools

Adoption and Applications

Use in Astronomical Observatories

Integration with Data Pipelines

Extensions Beyond Astronomy

References

Add your contribution

Related Hubs

Contribute something