Hubbry Logo
logo
Digital object identifier
Community hub

Digital object identifier

logo
0 subscribers
Read side by side
from Wikipedia

Digital object identifier
Full nameDigital Object Identifier
AcronymDOI
OrganisationInternational DOI Foundation
IntroducedOctober 1997; 28 years ago (1997-10)[1]
Example10.1000/182
Websitewww.doi.org/the-identifier/what-is-a-doi/ Edit this at Wikidata

A digital object identifier (DOI) is a persistent identifier or handle used to uniquely identify various objects, standardized by the International Organization for Standardization (ISO).[2] DOIs are an implementation of the Handle System;[3][4] they also fit within the URI system (Uniform Resource Identifier). They are widely used to identify academic, professional, and government information, such as journal articles, research reports, data sets, and official publications.

A DOI aims to resolve to its target, the information object to which the DOI refers. This is achieved by binding the DOI to metadata about the object, such as a URL where the object is located. Thus, by being actionable and interoperable, a DOI differs from ISBNs or ISRCs which are identifiers only. The DOI system uses the indecs Content Model to represent metadata.

The DOI for a document remains fixed over the lifetime of the document, whereas its location and other metadata may change. Referring to an online document by its DOI should provide a more stable link than directly using its URL. But if its URL changes, the publisher must update the metadata for the DOI to maintain the link to the URL.[5][6][7] It is the publisher's responsibility to update the DOI database. If they fail to do so, the DOI resolves to a dead link, leaving the DOI useless.[8]

The developer and administrator of the DOI system is the International DOI Foundation (IDF), which introduced it in 2000.[9] Organizations that meet the contractual obligations of the DOI system and are willing to pay to become a member of the system can assign DOIs.[10] The DOI system is implemented through a federation of registration agencies coordinated by the IDF.[11] The cumulative number of DOIs has increased exponentially over time, from 50 million registrations in 2011 to 391 million in 2025.[12] The rate of registering organizations ("members") has also increased over time from 4,000 in 2011 to 9,500 in 2013, but the federated nature of the system means it is not immediately clear how many members there are in total today.[13] Fake registries have even appeared.[14]

Nomenclature and syntax

[edit]

A DOI is a type of Handle System handle, which takes the form of a character string divided into two parts, a prefix and a suffix, separated by a slash.

prefix/suffix

The prefix identifies the registrant of the identifier and the suffix is chosen by the registrant and identifies the specific object associated with that DOI. Most legal Unicode characters are allowed in these strings, which are interpreted in a case-insensitive manner. The prefix usually takes the form 10.NNNN, where NNNN is a number greater than or equal to 1000, whose limit depends only on the total number of registrants.[15][16] The prefix may be further subdivided with periods, like 10.NNNN.N.[17]

For example, in the DOI name 10.1000/182, the prefix is 10.1000 and the suffix is 182. The "10" part of the prefix distinguishes the handle as part of the DOI namespace, as opposed to some other Handle System namespace,[A] and the characters 1000 in the prefix identify the registrant; in this case the registrant is the International DOI Foundation itself. 182 is the suffix, or item ID, identifying a single object (in this case, the latest version of the DOI Handbook).

DOI names can identify creative works (such as texts, images, audio or video items, and software) in both electronic and physical forms, performances, and abstract works[18] such as licenses, parties to a transaction, etc.

The names can refer to objects at varying levels of detail: thus DOI names can identify a journal, an individual issue of a journal, an individual article in the journal, or a single table in that article. The choice of level of detail is left to the assigner, but in the DOI system it must be declared as part of the metadata that is associated with a DOI name, using a data dictionary based on the indecs Content Model.

Display

[edit]

The official DOI Handbook explicitly states that DOIs should be displayed on screens and in print in the format doi:10.1000/182.[19]

Contrary to the DOI Handbook, Crossref, a major DOI registration agency, recommends displaying a URL (for example, https://doi.org/10.1000/182) instead of the officially specified format.[20][21]. The DOI Foundation guarantees these URLs to be persistent[22] ie. such URLs are PURLs — providing the location of a name resolver which will redirect HTTP requests to the correct online location of the linked item.[10][23]

The Crossref recommendation is primarily based on the assumption that the DOI is being displayed without being hyperlinked to its appropriate URL—the argument being that without the hyperlink it is not as easy to copy-and-paste the full URL to actually bring up the page for the DOI, thus the entire URL should be displayed, allowing people viewing the page containing the DOI to copy-and-paste the URL, by hand, into a new window/tab in their browser in order to go to the appropriate page for the document the DOI represents.[24]

Content

[edit]

Major content of the DOI system currently includes:

In the Organisation for Economic Co-operation and Development's publication service OECD iLibrary, each table or graph in an OECD publication is shown with a DOI name that leads to an Excel file of data underlying the tables and graphs. Further development of such services is planned.[26]

Other registries include Crossref and the multilingual European DOI Registration Agency (mEDRA).[27] Since 2015, RFCs can be referenced as doi:10.17487/rfc....[28]

Features and benefits

[edit]

The IDF designed the DOI system to provide persistent identification. Each DOI name permanently and clearly identifies the object it belongs to (although when the publisher of a journal changes, sometimes all the DOIs will be changed, with the old DOIs no longer working). It also associates metadata with objects, allowing it to provide users with relevant pieces of information about the objects and their relationships. Included as part of this metadata are network actions that allow DOI names to be resolved to web locations where the objects they describe can be found. To achieve its goals, the DOI system combines the Handle System and the indecs Content Model with a social infrastructure.

The Handle System ensures that the DOI name for an object is not based on any changeable attributes of the object such as its physical location or ownership, that the attributes of the object are encoded in its metadata rather than in its DOI name, and that no two objects are assigned the same DOI name. Because DOI names are short character strings, they are human-readable, may be copied and pasted as text, and fit into the URI specification. The DOI name-resolution mechanism acts behind the scenes, so that users communicate with it in the same way as with any other web service; it is built on open architectures, incorporates trust mechanisms, and is engineered to operate reliably and flexibly so that it can be adapted to changing demands and new applications of the DOI system.[29] DOI name-resolution may be used with OpenURL to select the most appropriate among multiple locations for a given object, according to the location of the user making the request.[30] However, despite this ability, the DOI system has drawn criticism from librarians for directing users to non-free copies of documents, that would have been available for no additional fee from alternative locations.[31]

The indecs Content Model as used within the DOI system associates metadata with objects. A small kernel of common metadata is shared by all DOI names and can be optionally extended with other relevant data, which may be public or restricted. Registrants may update the metadata for their DOI names at any time, such as when publication information changes or when an object moves to a different URL.

The International DOI Foundation (IDF) oversees the integration of these technologies and operation of the system through a technical and social infrastructure. The social infrastructure of a federation of independent registration agencies offering DOI services was modelled on existing successful federated deployments of identifiers such as GS1 and ISBN.

Comparison with other identifier schemes

[edit]

A DOI name differs from commonly used Internet pointers to material, such as the Uniform Resource Locator (URL), in that it identifies an object itself as a first-class entity, rather than the specific place where the object is located at a certain time. It implements the Uniform Resource Identifier (Uniform Resource Name) concept and adds to it a data model and social infrastructure.[32]

A DOI name also differs from standard identifier registries such as the ISBN, ISRC, etc. The purpose of an identifier registry is to manage a given collection of identifiers, whereas the primary purpose of the DOI system is to make a collection of identifiers actionable and interoperable, where that collection can include identifiers from many other controlled collections.[33]

The DOI system offers persistent, semantically interoperable resolution to related current data and is best suited to material that will be used in services outside the direct control of the issuing assigner (e.g., public citation or managing content of value). It uses a managed registry (providing both social and technical infrastructure). It does not assume any specific business model for the provision of identifiers or services and enables other existing services to link to it in defined ways. Several approaches for making identifiers persistent have been proposed.

The comparison of persistent identifier approaches is difficult because they are not all doing the same thing. Imprecisely referring to a set of schemes as "identifiers" does not mean that they can be compared easily. Other "identifier systems" may be enabling technologies with low barriers to entry, providing an easy to use labeling mechanism that allows anyone to set up a new instance (examples include Persistent Uniform Resource Locator (PURL), URLs, Globally Unique Identifiers (GUIDs), etc.), but may lack some of the functionality of a registry-controlled scheme and will usually lack accompanying metadata in a controlled scheme.

The DOI system does not have this approach and should not be compared directly to such identifier schemes. Various applications using such enabling technologies with added features have been devised that meet some of the features offered by the DOI system for specific sectors (e.g., ARK).

A DOI name does not depend on the object's location and, in this way, is similar to a Uniform Resource Name (URN) or PURL but differs from an ordinary URL. URLs are often used as substitute identifiers for documents on the Internet although the same document at two different locations has two URLs. By contrast, persistent identifiers such as DOI names identify objects as first class entities: two instances of the same object would have the same DOI name. In May, 2024, an Internet Draft was introduced to register the "doi" scheme,[34]. Many experts were not aware of this draft,[35] and the latest draft has currently expired.

Resolution

[edit]

To resolve a DOI name, it may be input to a DOI resolver, such as one at the official website https://doi.org/.

DOI name resolution is provided through the Handle System, which is an infrastructure developed and operated by CNRI (Corporation for National Research Initiatives), and is freely available to any user encountering a DOI name. Resolution redirects the user from a DOI name to one or more pieces of typed data: URLs representing instances of the object, services such as e-mail, or one or more items of metadata. To the Handle System, a DOI name is a handle, and so has a set of values assigned to it and may be thought of as a record that consists of a group of fields. Each handle value must have a data type specified in its <type> field, which defines the syntax and semantics of its data. While a DOI persistently and uniquely identifies the object to which it is assigned, DOI resolution may not be persistent, due to technical and administrative issues.

Another approach, which avoids typing or copying and pasting into a resolver is to include the DOI in a document as a URL which uses the resolver as an HTTP proxy, such as https://doi.org/ (preferred)[36] or http://dx.doi.org/, both of which support HTTPS. For example, the DOI 10.1000/182 can be included in a reference or hyperlink as https://doi.org/10.1000/182. This approach allows users to click on the DOI as a normal hyperlink. Indeed, as previously mentioned, this is how Crossref recommends that DOIs always be represented (preferring HTTPS over HTTP), so that if they are cut-and-pasted into other documents, emails, etc., they will be actionable.

An interesting consequence of the fact that DOIs depend entirely on CNRI's Handle System infrastructure (whereby CNRI operates the global root servers and wrote the protocol) is that the proxy services DOI.org/<#> and hdl.handle.net/<#> are interoperable. For example, the following URIs resolve to the same publication:
https://doi.org/10.1016/S0021-9258(19)52451-6
https://hdl.handle.net/10.1016/S0021-9258(19)52451-6

There are other DOI resolvers and HTTP Proxies apart from NCRI's Handle System. At the beginning of the year 2016, a new class of alternative DOI resolvers was started by http://doai.io/ (now discontinued [37]). This service was unusual in that it tried to find a non-paywalled (often author archived) version of a title and redirected the user to that instead of the publisher's version.[38] Since then, other open-access favoring DOI resolvers have been created, notably https://oadoi.org/ in October 2016[39] (rebranded in 2017 as https://unpaywall.org/). While traditional DOI resolvers solely rely on the Handle System, alternative DOI resolvers first consult multiple Open Access resources such as institutional libraries with the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH), or indexing services based in OAI-PMH, such as BASE (Bielefeld Academic Search Engine).[37][39]

An alternative to HTTP proxies is to use one of a number of add-ons and plug-ins for browsers, thereby avoiding the conversion of the DOIs to URLs,[40] which depend on domain names and may be subject to change, while still allowing the DOI to be treated as a normal hyperlink. A disadvantage of this approach for publishers is that, at least at present, most users will be encountering the DOIs in a browser, mail reader, or other software which does not have one of these plug-ins installed.

IDF organizational structure

[edit]

The International DOI Foundation (IDF), a non-profit organization created in 1997, is the governance body of the DOI system.[41] It safeguards all intellectual property rights relating to the DOI system, manages common operational features, and supports the development and promotion of the DOI system. The IDF ensures that any improvements made to the DOI system (including creation, maintenance, registration, resolution and policymaking of DOI names) are available to any DOI registrant. It also prevents third parties from imposing additional licensing requirements beyond those of the IDF on users of the DOI system.

Key Information

The IDF is controlled by a Board elected by the members of the Foundation, with an appointed Managing Agent who is responsible for co-ordinating and planning its activities. Membership is open to all organizations with an interest in electronic publishing and related enabling technologies. The IDF holds annual open meetings on the topics of DOI and related issues.

Registration agencies, appointed by the IDF, provide services to DOI registrants: they allocate DOI prefixes, register DOI names, and provide the necessary infrastructure to allow registrants to declare and maintain metadata and state data. Registration agencies are also expected to actively promote the widespread adoption of the DOI system, to cooperate with the IDF in the development of the DOI system as a whole, and to provide services on behalf of their specific user community. A list of current RAs is maintained by the International DOI Foundation. The IDF is recognized as one of the federated registrars for the Handle System by the DONA Foundation (of which the IDF is a board member), and is responsible for assigning Handle System prefixes under the top-level 10 prefix.[42]

Registration agencies generally charge a fee to assign a new DOI name; parts of these fees are used to support the IDF. The DOI system overall, through the IDF, operates on a not-for-profit cost recovery basis.

Standardization

[edit]

The DOI system is an international standard developed by the International Organization for Standardization in its technical committee on identification and description, TC46/SC9.[43] The Draft International Standard ISO/DIS 26324, Information and documentation – Digital Object Identifier System met the ISO requirements for approval. The relevant ISO Working Group later submitted an edited version to ISO for distribution as an FDIS (Final Draft International Standard) ballot,[44] which was approved by 100% of those voting in a ballot closing on 15 November 2010.[45] The final standard was published on 23 April 2012.[2]

DOI is a registered URI under the info URI scheme specified by IETF RFC 4452.[46] info:doi/ is the infoURI Namespace of Digital Object Identifiers.[47]

The DOI syntax is a NISO standard, first standardized in 2000, ANSI/NISO Z39.84-2005 Syntax for the Digital Object Identifier.[48]

The maintainers of the DOI system have registered a DOI namespace for URNs.[49]

See also

[edit]

Notes

[edit]

References

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
A Digital Object Identifier (DOI) is a unique alphanumeric string assigned to a digital object, such as a journal article, book chapter, dataset, or image, to provide persistent and reliable identification and access on the internet, regardless of changes in location or ownership.[1] Developed in the late 1990s by the Association of American Publishers and the Corporation for National Research Initiatives (CNRI), the DOI system was introduced to address the challenges of linking and managing intellectual property in digital environments, building on the Handle System technology for resolution.[2] The system is governed by the International DOI Foundation (IDF), a not-for-profit organization established in 1998, which acts as the ISO Registration Authority for the DOI standard under ISO 26324, first published in 2012 and revised in 2022 and 2025.[3][4] DOIs follow a structured format consisting of a prefix (indicating the registration agency and registrant) and a suffix (unique to the object), separated by a slash, such as 10.1000/abc123, and are resolved through the doi.org domain to locate the associated metadata or content.[1] This persistence is ensured through a federated network of Registration Agencies (RAs) that assign DOIs, maintain metadata, and handle resolutions, with the IDF overseeing policies to prevent obsolescence even if an RA ceases operations.[2] Widely adopted in scholarly publishing, data repositories, and creative industries, DOIs facilitate semantic interoperability by supporting standardized metadata schemas, enabling features like citation tracking, content negotiation, and cross-referencing across platforms.[2] As of 2025, the system resolves billions of DOIs annually, underscoring its role in ensuring long-term accessibility of digital resources.[1]

Overview and Syntax

Definition and Purpose

A Digital Object Identifier (DOI) is a unique alphanumeric string assigned to a digital object to provide persistent identification and facilitate access to it, regardless of changes in its location or ownership. It consists of a prefix identifying the registration agency and a suffix specifying the particular object, formatted as "10.prefix/suffix" (e.g., 10.1000/182). DOIs are implemented using the Handle System, a general-purpose global name service that ensures the identifier remains stable over time.[5] The primary purpose of the DOI system is to address the challenges of identifying and locating content in digital networks, where traditional URLs often become obsolete due to server migrations, content relocation, or archival shifts. Developed as a technical and social infrastructure, it enables reliable resolution to the current location or metadata of an object, supporting long-term persistence without a predefined expiration. This framework, standardized under ISO 26324:2025 (revised from 2022), allows organizations to build applications for content management, discovery, and interoperability across industries such as publishing, data repositories, and multimedia.[5][4] By decoupling the identifier from the object's location, DOIs promote enhanced discoverability and archival stability, solving issues like the "404 not found" errors common on the web. They are managed by the International DOI Foundation through accredited registration agencies, ensuring global uniqueness and free public resolution via the doi.org proxy server. This system extends to any entity—physical, digital, or abstract—fostering value-added services like metadata linking while maintaining backward compatibility with existing identifier schemes.[1][5]

Nomenclature and Syntax

The nomenclature of a Digital Object Identifier (DOI) adheres to a standardized syntax outlined in ISO 26324:2025, which specifies the format for identifying objects of any form—digital, physical, or abstract—while ensuring interoperability and persistence.[4] This syntax is implemented within the Handle System, where a DOI name is structured as a prefix followed by a forward slash (/) and a suffix, forming a unique, alphanumeric string.[6] The complete DOI may be referenced with the "doi:" prefix (e.g., doi:10.1000/182) or resolved via the uniform resource locator https://doi.org/ followed by the DOI name.[1] The prefix defines the DOI namespace and always begins with "10.", where "10" serves as the directory indicator uniquely allocated to the DOI system by the International DOI Foundation (IDF).[6] Following this, the registrant code consists of one or more sequences of digits (and periods for hierarchy), assigned sequentially by the IDF to registration agencies (RAs), which then allocate sub-prefixes to individual registrants such as publishers or institutions.[7] For instance, a prefix like 10.1000 identifies a specific RA or registrant namespace, while hierarchical extensions such as 10.1000.1 allow for subdivided namespaces to manage large volumes of identifiers without overlap. Prefixes are numeric-only (digits and periods) to maintain simplicity and avoid conflicts in resolution.[6] The suffix, separated from the prefix by a slash, is an opaque, locally unique identifier chosen by the registrant to reference the specific digital object within its namespace.[8] It supports printable Unicode characters but, in practice, uses alphanumeric strings (a-z, A-Z, 0-9) along with limited punctuation such as hyphens (-), periods (.), underscores (_), semicolons (;), parentheses (()), and forward slashes (/), interpreted case-insensitively.[9] There is no prescribed length, though guidelines recommend 6–10 characters for brevity and usability, avoiding patterns or embedded metadata (e.g., dates or titles) to prevent predictability or conflicts.[8] Examples include 10.1000/xyz-123 for a generic object or 10.1109/5.771073 for a journal article, ensuring each combination remains globally unique.[6] This flexible yet controlled syntax supports the DOI's core function of persistent identification across diverse content types.[4]

Content and Metadata

Types of Digital Objects

The DOI system is designed to provide persistent identifiers for a wide range of entities, broadly categorized as digital, physical, or abstract objects, with "digital objects" encompassing not only electronic files but also representations of physical items and conceptual entities.[10] According to the DOI Handbook, these objects can include creations—such as intellectual property like inventions, literary works, images, and designs—and parties, such as individuals or organizations involved in their production.[10] This flexibility allows DOIs to support diverse applications beyond traditional publishing, ensuring unique identification across domains like research, entertainment, and standards development.[11]

Scholarly Publications

DOIs are most commonly assigned to scholarly publications, including journal articles, books, book chapters, conference proceedings, and theses, facilitating citation and access in academic contexts.[10] For instance, registration agencies like Crossref manage DOIs for approximately 119 million journal articles and over 10 million book chapters as of 2025, primarily in the humanities, social sciences, and sciences.[10][12] These identifiers link to metadata schemas that describe the publication's structure, authorship, and publication details, enabling interoperability with citation databases.[10]

Research Data and Datasets

Research data, including datasets from scientific experiments, surveys, and simulations, represent a growing category of DOI-identified objects, promoted by agencies like DataCite to enhance data sharing and reproducibility. DOIs for datasets often include metadata on file formats (e.g., CSV, HDF5), access conditions, and related publications, with examples spanning fields like genomics, climate modeling, and social sciences.[10] As of 2025, DataCite has registered over 106 million DOIs for research data, underscoring their role in open science infrastructures, with DataCite now supporting over 1,700 member organizations across 66 countries.[13]

Multimedia and Audiovisual Content

DOIs identify multimedia objects such as audio files, videos, images, and software, particularly in entertainment and creative industries through agencies like the Entertainment Identifier Registry (EIDR).[10] For example, EIDR assigns DOIs to films, television episodes, and video assets, linking them to rights management and distribution metadata. Physical manifestations like CDs or DVDs can also receive DOIs, treating them as structural types under the INDECS framework, which classifies such items as "creations" with physical or digital forms.[10]

Other Entities

Beyond core categories, DOIs can identify abstract or party-based objects, such as events, standards documents, natural history collections, and organizational entities like universities or publishers.[11] For instance, the International DOI Foundation supports DOIs for built environment data (e.g., architectural plans) and talent identities in the creative sector.[11] In addition to organizational entities and talent identifiers, DOIs have also been used experimentally to anchor identity and authorship metadata for non-human digital personas. In the context of AI authorship research, the concept of a Digital Author Persona has been implemented through the AI persona Angela Bogdanova: the semantic JSON-LD specification of this persona is archived on Zenodo under a DOI, and her ORCID record lists her as an official co-author of that schema.[14][15] In such cases the DOI does not identify a conventional article, but an identity and authorship specification that can be cited, versioned, and linked to related publications, illustrating how DOIs can stabilize emerging forms of non-human participation in scholarly communication. These applications leverage the DOI's kernel metadata to specify referent types, ensuring persistent resolution even for non-traditional content like performances or intellectual abstractions.[10]

Metadata Association

Metadata is associated with a Digital Object Identifier (DOI) through a structured registration process managed by Registration Agencies (RAs), where content providers submit descriptive information about the identified entity, known as the referent, alongside the DOI assignment. This association occurs via the DOI System's data model, which is based on the INDECS (Interoperability of Data in E-Commerce Systems) framework and standardized under ISO 26324, ensuring that metadata describes the referent unambiguously and supports interoperability across systems.[10][16] At the core of this association is the requirement for a DOI Kernel Metadata Declaration for every DOI, which provides a mandatory minimum set of elements to identify and administer the referent consistently. Kernel metadata includes essential fields such as the DOI name itself, the date and time of registration, the DOI status (e.g., active or inactive), the referent type (e.g., serial, monograph), and identifiers for the creator or responsible entity. This kernel set, defined in the DOI Kernel Schema—an extensible XML-based standard—ensures semantic equivalence and enables basic recognition and resolution services regardless of the RA or content type.[10][17] Beyond the kernel, additional metadata can be associated to enrich description and functionality, drawing from domain-specific schemas like ONIX for books or custom extensions approved by RAs. During registration, input metadata from providers is processed by RAs, which validate it against the data dictionary and ontology to maintain quality and interoperability; this metadata is then stored in RA-managed repositories and can be output in formats such as XML, JSON, or RDF for querying via services like the DOI Resolution Mechanism or RA-specific APIs. For instance, Crossref and DataCite, as prominent RAs, integrate this metadata to support discovery, citation tracking, and linking across scholarly content.[10][18][13] The association mechanism leverages the Handle System, the underlying technology for DOI resolution, where metadata records are linked to the DOI handle, allowing updates to metadata without altering the persistent identifier itself. This decoupling of identifier from location or content changes ensures long-term stability, while policies mandate timely metadata updates and declarations to prevent obsolescence. Interoperability is further enhanced by mappings between schemas, enabling automated exchange and integration with other persistent identifier systems.[10]

Resolution and Functionality

Resolution Process

The resolution of a Digital Object Identifier (DOI) relies on the Handle System, a global, distributed name resolution infrastructure that assigns, manages, and resolves persistent identifiers for digital content.[2] The DOI system builds upon this foundation by adding specialized organizational and technical enhancements, including a proxy server and metadata frameworks, to ensure semantic interoperability and long-term persistence.[2] To initiate resolution, a user constructs a URI by prefixing the DOI with "https://doi.org/", such as "https://doi.org/10.1000/182" for the DOI Handbook, and submits it via a web browser or application.[19] This directs the request to the DOI proxy server, a load-balanced cluster of web servers that interfaces with the underlying Handle System. The proxy server queries the Handle System's global namespace using the DOI as a handle, retrieving the associated location data—typically a URL—from the handle record maintained by the relevant Registration Agency (RA). The server then issues an HTTP redirect (status code 302 or 303) to the target URL, transferring the user to the digital object or its representation.[19] This process supports secure protocols like HTTPS, IPv6 addressing, and DNSSEC for enhanced reliability and security.[19] The proxy server caches resolved values for up to 24 hours to optimize performance and reduce load on the Handle System, while distributing queries across multiple servers to handle high volumes.[19] If the handle record includes multiple location values (using the handle type "10320/loc"), the system supports multiple resolution, allowing retrieval of several endpoints, services (e.g., email contacts), or typed metadata pieces rather than a single redirect; this enables advanced use cases like appropriate copy selection based on user location or device.[19] In contrast, single resolution returns one primary URL for straightforward access.[19] For programmatic or non-browser use, DOIs in URIs must follow specific encoding rules: the entire string uses UTF-8 character encoding, with reserved characters (e.g., "#", "/", "?") percent-encoded (e.g., "#" becomes "%23") to prevent misinterpretation by parsers.[19] The proxy server also accepts optional query parameters to customize behavior, such as "?noredirect=1" to return handle metadata in JSON format instead of redirecting, "?auth" for authentication checks, or "?locatt=RA:Crossref" to filter by RA-specific locations.[19] Additionally, a RESTful API endpoint at the proxy server allows developers to query DOIs programmatically, returning structured responses for integration into applications.[19] Persistence in resolution is maintained through the DOI system's governance: RAs update handle records as needed (e.g., if a resource moves), and the International DOI Foundation (IDF) oversees the directory and rules to prevent identifier deletion or reassignment, ensuring the DOI remains resolvable indefinitely even if the original owner changes.[2]

Features and Benefits

The Digital Object Identifier (DOI) system provides persistent identification for digital objects, ensuring that each DOI remains a stable reference regardless of changes in the object's location or ownership. This persistence is achieved through the underlying Handle System, which separates the identifier from the location data, allowing updates to the resolution target without altering the DOI itself. As a result, DOIs facilitate long-term accessibility and citation stability, critical for scholarly communication and digital preservation.[1] A core feature of DOIs is their resolvability, where entering a DOI into the resolver at https://doi.org/ directs users to the current resource, such as a journal article, dataset, or book chapter. This process is free for end-users and supports high-volume access, with the system handling an average of over 1,100 resolutions per second globally (as of 2025) and more than 117 billion total resolutions to date.[1] DOIs are designed for both human and machine use, incorporating structured metadata in formats like XML or JSON, which enhances searchability and interoperability across platforms.[1][20] DOIs also enable metadata management and extensibility, allowing registration agencies to associate rich, updatable descriptive information with each identifier. This supports applications beyond simple linking, such as content discovery; while provenance tracking—where actions on a digital object like views or downloads can be recorded throughout its lifecycle—is facilitated by the broader Digital Object Architecture (DOA). For instance, in the publishing industry, DOIs improve internal content management by cataloging assets, enabling efficient retrieval and repurposing across an organization.[21][22] The benefits of DOIs extend to scalability and flexibility, accommodating diverse object types from physical artifacts to abstract concepts, and scaling to billions of resolutions without performance degradation through distributed servers. This architecture promotes interoperability in digital ecosystems, streamlining citations in academic works and reducing link rot, while fostering innovation in services like usage analytics and rights management. Overall, DOIs address key challenges in digital information management by providing a reliable, future-proof framework for identification and access.[20][21]

Comparisons and Alternatives

With Other Identifier Schemes

The Digital Object Identifier (DOI) system differs from other identifier schemes in its emphasis on persistence, interoperability, and metadata integration across diverse digital objects, rather than being tied to specific content types or locators. Unlike Uniform Resource Locators (URLs), which serve primarily as transient addresses for web resources and can change or break, DOIs provide stable, long-term identification decoupled from location, enabling redirection to current access points regardless of changes in hosting or format.[23] Similarly, DOIs contrast with Uniform Resource Names (URNs), which are namespace-based identifiers under the IETF's URI framework but lack a centralized resolution mechanism or mandatory metadata, relying instead on ad hoc resolvers that may not guarantee global accessibility.[23] DOIs build directly on the Handle System, a decentralized resolution infrastructure developed by the Corporation for National Research Initiatives (CNRI) in 1995, which uses a prefix-suffix structure for unique identification and supports multiple services per identifier. While Handles are general-purpose and free for basic use (though requiring local server setup for advanced deployment), DOIs extend this foundation by incorporating a standardized metadata schema (via the DOI Handbook and ISO 26324) and centralized governance through the International DOI Foundation (IDF), ensuring commercial viability and widespread adoption in scholarly publishing.[23] This relationship allows DOIs to inherit Handle's resolution robustness—routing queries through global servers like those at doi.org—but adds layers of registration agency oversight and fee-based minting to fund persistence commitments.[24] In comparison to Archival Resource Keys (ARKs), DOIs prioritize structured metadata and landing pages for published content, whereas ARKs offer greater flexibility for unpublished or evolving resources, including physical objects, with no fees or mandatory metadata. Both schemes support HTTP resolution and use similar prefix-suffix formats (e.g., ARK: ark:/99999/abc123 vs. DOI: 10.99999/abc123), but ARKs emphasize decentralization through Name Assigning Authority Numbers (NAANs) managed by entities like the California Digital Library, allowing custom inflections for metadata queries without central authority.[25] Persistent Uniform Resource Locators (PURLs), maintained by OCLC, function more as redirection services for web links, lacking the object-agnostic scope of DOIs and requiring maintenance of redirect targets, which can introduce single points of failure unlike the DOI's multi-service resolution.[25] DOIs complement rather than compete with content-specific registries like the International Standard Book Number (ISBN) and International Standard Serial Number (ISSN), which provide definitive identification within fixed domains—books and serial publications, respectively—but do not inherently resolve to actionable services or metadata. For instance, an ISBN like 978-1-2345-9999-0 identifies a book edition without built-in persistence guarantees, while a DOI can encapsulate or extend it (e.g., via the ISBN-A service, which assigns DOIs to ISBNs for enhanced functionality like e-commerce integration).[24] This interoperability is facilitated through metadata crosswalks, where DOIs in systems like Crossref or DataCite reference ISBNs or ISSNs, enabling seamless linking across scholarly ecosystems without supplanting these standards.[24]

Limitations and Criticisms

While the DOI system aims to provide persistent identification for digital objects, its persistence is not absolute, as deletions occur due to factors such as duplicate assignments, retractions, or errors in registration. An analysis of Crossref datasets revealed approximately 708,282 deleted DOIs, primarily associated with scholarly articles like journal papers and conference proceedings, which undermines reliable long-term access and affects bibliometric analyses by distorting citation tracking.[26] Although DOIs are designed to be permanent and cannot be deleted, unresolved "dead" links can occur if publishers fail to update resolution targets after content migration or removal, highlighting the need for ongoing maintenance in a dynamic digital environment.[27] Cost barriers represent a significant limitation, particularly for smaller publishers, independent researchers, and institutions in developing regions. Annual registration fees of approximately $1 USD per journal article DOI through agencies like Crossref (with volume discounts), combined with membership fees starting at $275 USD, along with ongoing metadata maintenance expenses, can strain limited budgets and deter widespread adoption.[28] This issue exacerbates global inequalities, as an oligopoly of large publishers (e.g., Elsevier, Springer Nature) controls the majority of DOI prefixes—15 major entities account for most of the top 200—leaving outputs from the Global South underrepresented and less visible in international scholarly communication.[29] Inconsistencies in DOI registration further compromise the system's reliability. Studies of databases like Web of Science and Scopus have identified errors where DOIs present on journal websites are missing or mismatched, affecting up to 24.9% of documents in some samples and hindering discoverability, altmetrics tracking, and citation accuracy.[30] Moreover, not all scholarly outputs receive DOIs, particularly from non-commercial or regional publishers, which limits their integration into global indexing and perpetuates visibility gaps.[29] The DOI's resolution often directs users to subscription-based or paywalled content rather than open access versions, frustrating scholars without institutional access and drawing criticism for reinforcing commercial barriers in scholarly communication. Recent efforts, such as open access metadata enhancements and subsidized registration for low-income countries, aim to address paywall and equity issues, though challenges persist.[31] While large commercial publishers dominate DOI assignment, participation has expanded through non-profit agencies like DataCite, though smaller and non-traditional publishers from the Global South still face barriers. Usability issues can arise from the DOI's alphanumeric format, making manual entry somewhat cumbersome.[32]

Governance and Organization

International DOI Foundation

The International DOI Foundation (IDF) is a not-for-profit organization that serves as the central governance body for the Digital Object Identifier (DOI) system.[3] It operates as the registration authority for the ISO 26324 standard, ensuring the persistent identification and accessibility of digital objects across various communities and industries.[10] Established on October 10, 1997, as a non-stock membership corporation under Delaware law in the United States, the IDF is tax-exempt under Section 501(c)(6) of the U.S. Internal Revenue Code and functions on a self-funding basis through membership fees and operational charges from registration agencies.[10] The IDF's primary role involves coordinating the DOI system's infrastructure, including prefix allocation to registration agencies, maintenance of the DOI Directory, and development of policies to promote interoperability and long-term persistence.[3] It oversees a network of registration agencies (RAs) that manage DOI registries for specific sectors, such as publishing and data management, while ensuring compliance with international standards.[10] Built upon the Handle System developed by the Corporation for National Research Initiatives in the early 1990s, the IDF has scaled the DOI system to handle over 2 billion resolutions per month as of 2025, supporting the identification of diverse digital objects like journal articles, datasets, and multimedia content.[3][1] Governance of the IDF is member-driven, with a Board of Directors elected by its membership, comprising representatives from charter members, RAs, and general members serving three-year terms.[10] The Board includes key officers—a Chair, Vice Chair, and Treasurer—and meets regularly, with an Executive Committee handling interim decisions to maintain operational efficiency.[10] Membership is structured into classes: General Members (open to organizations in digital content and rights management), RAs (accredited agencies managing DOI assignments), Charter Members (founding entities with enhanced rights), and Affiliates (supporting organizations without voting privileges).[10] Annual fees vary by membership class and location, contributing to a cost-recovery model that sustains technical infrastructure and policy initiatives without profit motives.[10] In addition to governance, the IDF promotes the adoption of DOIs through technical support, metadata schema maintenance (such as the DOI Kernel Schema), and collaboration on extensions like DOI Event and DOI Resolution.[3] It safeguards intellectual property rights within the system and ensures metadata interoperability, enabling seamless resolution across global networks.[10] The organization remains committed to evolving the DOI system in alignment with ISO 26324:2022, focusing on scalability and community-driven enhancements.[3]

Registration Agencies

Registration Agencies (RAs) are independent organizations authorized by the International DOI Foundation (IDF) to manage the registration, resolution, and metadata services for Digital Object Identifiers (DOIs) within specific communities or sectors.[33] They allocate DOI prefixes to registrants, register individual DOI names, and ensure the persistence and accessibility of associated metadata, thereby supporting the DOI system's goal of providing long-term, actionable identifiers for digital objects.[33] RAs operate under formal agreements with the IDF that outline their rights, obligations, and intellectual property responsibilities, ensuring compliance with DOI policies and standards.[33] The appointment of an RA begins with a business case submitted to the IDF Board, which evaluates its viability, particularly in underserved sectors or communities; successful applicants, which can be commercial, governmental, or not-for-profit entities, often build on existing registries or collaborate with stakeholders to add DOI functionality.[33] Beyond core registration services, RAs offer tailored value-added features, such as community-specific metadata schemas, resolution enhancements, and integration with sector tools, while maintaining the DOI Handle System for global interoperability.[33] If an RA ceases operations, the IDF coordinates the transfer of responsibilities to another agency to preserve DOI persistence.[33] RAs nominate representatives to the IDF Board and participate in regular meetings to share best practices and address system-wide challenges.[33] As of the latest available data, there are twelve active DOI Registration Agencies, each specializing in distinct domains to meet diverse identification needs.[34] The following table summarizes their key focuses, operational scopes, and services:
Agency NameAcronymPrimary FocusWebsiteKey Services and Scope
Airiti DOI-Scholarly publishing in Chinese-speaking regionshttp://www.airiti.com/en/page_doi.htmlDOI registration, resolution, and cited-by linking for academic materials in Chinese and English.[34]
BSI Identify-Construction product identificationhttps://identify.bsigroup.comPersistent identifiers (UPINs) and metadata storage for construction products to enhance industry digitization and safety.[34]
Chinese DOI-Research content in Chinahttp://dx.chinadoi.cnDOI services for academic journals, datasets, dissertations, and other scholarly outputs, operated jointly by ISTIC and Wanfang Data.[34]
China National Knowledge InfrastructureCNKIChinese information resourceshttps://oversea.cnki.net/index/Management of DOIs for e-journals, newspapers, dissertations, and multidisciplinary databases across sectors.[34]
Crossref-Scholarly communicationshttps://www.crossref.orgRegistration and metadata services for over 200 million research objects from more than 23,000 members in 163 countries as of 2025.[34][35][36]
DataCite-Research data and outputshttps://datacite.orgDOIs for datasets and other research materials to improve discoverability, citation, and metadata interoperability.[34]
Entertainment Identifier RegistryEIDREntertainment and video serviceshttps://www.eidr.orgContent and Video Service Identifiers for audio/video assets and delivery platforms in the media industry.[34]
HAND-Talent identification in entertainment and sportshttps://www.handidentity.com/Universal Talent Identifiers for legal entities, virtual humans, and fictional characters.[34]
Japan Link CenterJaLCScience and technology in Japanhttps://japanlinkcenter.org/top/english.htmlMetadata aggregation and DOI services for Japanese scholarly content, with global distribution via Crossref and DataCite.[34]
Korean DOI Center-Korean science and technologyhttps://www.doi.or.kr/wordpress/#contentDOIs for journal articles, patents, and traditional knowledge resources, managed by KISTI.[34]
multilingual European DOI Registration AgencymEDRAInternet documents and intellectual propertyhttps://www.medra.orgPersistent identifiers for tracking citations and relationships in intellectual property and online documents.[34]
Publications Office of the European UnionOPEU publicationshttps://op.europa.eu/en/homeDOIs for official EU publications, journal articles, datasets, and research grants.[34]
These agencies collectively ensure broad coverage across scholarly, technical, and creative domains, with Crossref and DataCite handling the largest volumes of registrations to support global research infrastructure.[34]

Standardization and History

Standards

The Digital Object Identifier (DOI) system is governed primarily by the international standard ISO 26324, which defines its core functional components and operational principles.[4] Published in its third edition in March 2025, ISO 26324 specifies the syntax for DOI names, the descriptive metadata associated with them, and the resolution mechanisms to locate the identified objects, whether physical, digital, or abstract.[4] This standard establishes general guidelines for the creation, registration, and administration of DOIs, ensuring persistence and interoperability without prescribing specific implementation technologies.[37] The syntax component of ISO 26324 outlines the structure of a DOI name as a prefix-suffix pair, where the prefix includes a directory indicator (historically "10." for DOIs) followed by a registrant code assigned by a Registration Agency, and the suffix provides a unique identifier within that namespace.[37] For example, a DOI might appear as "10.1000/xyz123", with the prefix "10.1000" denoting the registrant and the suffix "xyz123" specifying the object.[1] The 2022 revision (second edition) amended this syntax to remove the mandatory "10" directory indicator, allowing flexibility while maintaining compatibility, a change carried forward into the 2025 edition.[38] As a Uniform Resource Name (URN) namespace registered with the Internet Assigned Numbers Authority (IANA), DOIs follow the formal syntax "urn:doi:" followed by the prefix and suffix, enabling integration with broader URI frameworks.[38] Resolution in the DOI system, as detailed in ISO 26324, relies on the Handle System infrastructure managed by the Corporation for National Research Initiatives (CNRI) and now advanced through the DONA Foundation (Digital Object Numbering Association).[2] Originally specified in RFCs 3650, 3651, and 3652, the Handle System protocols were updated and replaced in 2022 by the Digital Object Identifier Resolution Protocol (DO-IRP), which provides a secure, global name resolution service over the internet.[38] This protocol allows DOI resolution to redirect users to current locations or services associated with the identifier, supporting over 12 billion annual resolutions as of 2023.[38] The standard emphasizes semantic interoperability, permitting DOIs to link to metadata schemas like Dublin Core for enhanced description, though it does not mandate a specific format.[37] Governance aspects in ISO 26324 designate the International DOI Foundation (IDF) as the ISO Registration Authority, responsible for overseeing compliance, prefix assignment, and system evolution through its Registration Agencies.[37] Developed under ISO Technical Committee 46 (Information and documentation), Subcommittee 9 (Identification and description), the standard has evolved from its initial 2012 publication to incorporate advancements in persistent identification needs.[4] Complementary specifications, such as those from DONA for DO-IRP implementation, ensure the system's robustness, with over 390 million DOIs assigned by more than 5,000 organizations worldwide as of 2025.[38][39]

Historical Development

The need for a persistent identification system for digital content emerged in the early 1990s amid the rapid growth of electronic publishing, as traditional identifiers like the ISBN proved inadequate for managing intellectual property in networked environments.[40] Concepts such as the Publisher Item Identifier (PII) and Unique Digital Identifier (UDID) were proposed to address these challenges, aiming to create unique, persistent names for digital objects that could support resolution to locations or services.[40] In 1994, Douglas Armati presented the UDID concept at the Frankfurt Book Fair, prompting the International Association of Scientific, Technical and Medical Publishers (STM) and the International Publishers Association (IPA) to form a task group to explore digital content identification.[40] The following year, in 1995, the Association of American Publishers (AAP) established its Enabling Technologies Committee, with Armati producing key study papers for both AAP and STM that outlined requirements for a robust identifier system.[40] These efforts culminated in a joint statement of support from AAP, IPA, and STM, leading to the announcement of the Digital Object Identifier (DOI) system at the 1997 Frankfurt Book Fair.[40] In the same year, the International DOI Foundation (IDF) was founded as a not-for-profit organization, with Charles Ellis as its first chair, to oversee the system's development and management.[40] From 1998 to 2000, the DOI Foundation collaborated with the INDECS (Interoperable Digital-content Services) project, a European Union-funded initiative, to refine the system's architecture for interoperability in digital content services.[41] Technically, the DOI system was built on the Handle System developed by the Corporation for National Research Initiatives (CNRI), providing a resolution mechanism that linked identifiers to digital objects or metadata services.[41] Early prototypes, such as the 1999 DOI-cross-citation (DOI-X) initiative, demonstrated reference linking capabilities, paving the way for services like CrossRef.[40] The first DOI applications were launched in 2000, coinciding with the publication of the DOI Handbook and the standardization of DOI syntax under ANSI/NISO Z39.84.[40] This marked the system's operational debut, with initial adopters including major publishers like Academic Press, which assigned DOIs to journal articles in its IDEAL platform.[42] By this point, hundreds of thousands of DOIs had been registered, focusing on scholarly content to enable persistent access and citation.[42] Further maturation occurred through international standardization efforts, beginning in 2004, leading to the publication of ISO 26324 in 2012, which formalized the full DOI system as an international standard for persistent identifiers.[40] An update to ISO 26324 followed in 2022, reflecting ongoing refinements to support evolving digital ecosystems.[41]

References

User Avatar
No comments yet.