Hubbry Logo
Amazon S3 GlacierAmazon S3 GlacierMain
Open search
Amazon S3 Glacier
Community hub
Amazon S3 Glacier
logo
7 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Amazon S3 Glacier
Amazon S3 Glacier
from Wikipedia

Amazon S3 Glacier is an online file storage web service that provides storage for data archiving and backup.[2]

Key Information

Glacier is part of the Amazon Web Services suite of cloud computing services, and is designed for long-term storage of data that is infrequently accessed and for which retrieval latency times of 3 to 5 hours are acceptable. Storage costs are a consistent $0.004 per gigabyte per month, which is substantially cheaper than the Simple Storage Service (S3) Standard tier.[3]

Amazon hopes this service will move businesses from on-premises tape backup drives to cloud-based backup storage.[4]

Storage

[edit]

The underlying technology used by Glacier is unknown and subject to speculation.

Amazon officially states in their S3 FAQS:[5]

Q: What is the backend infrastructure supporting the S3 Glacier Flexible Retrieval and S3 Glacier Deep Archive storage class?

We prefer to focus on the customer outcomes of performance, durability, availability, and security. However, this question is often asked by our customers. We use a number of different technologies which allow us to offer the prices we do to our customers. Our services are built using common data storage technologies specifically assembled into purpose-built, cost-optimized systems using AWS-developed software. The S3 Glacier storage classes benefit from our ability to optimize the sequence of inputs and outputs to maximize efficiency accessing the underlying storage.

ZDNet says, that according to private e-mail, Glacier runs on "inexpensive commodity hardware components".[4] In 2012, ZDNet quoted a former Amazon employee as saying that Glacier is based on custom low-RPM hard drives attached to custom logic boards where only a percentage of a rack's drives can be spun at full speed at any one time.[6][7] Similar technology is also used by Facebook.[8]

There is some belief among users that the underlying hardware used for Glacier storage is tape-based, because Amazon has positioned Glacier as a direct competitor to tape backup services (both on-premises and cloud-based).[9] This confusion is exacerbated by the fact that Glacier has archive retrieval delays (3–5 hours before archives are available) similar to that of tape-based systems[dubiousdiscuss] and a pricing model that discourages frequent data retrieval.[10]

The Register claimed that Glacier runs on Spectra T-Finity tape libraries with LTO-6 tapes.[11][12] Others have conjectured Amazon using off-line shingled magnetic recording hard drives, multi-layer Blu-ray optical discs, or an alternative proprietary storage technology.[13]

Data storage consultant Robin Harris speculated that the storage is based on cheap optical disks such as Blu-ray, based on hints from public sources.[14]

Cost

[edit]

Glacier has two costs, one for storage and one for retrieval. Uploading data to Glacier is free. Storage pricing is simple: it currently costs 0.4 cents per gigabyte per month, which is 82% cheaper than S3 Standard. When Glacier launched in 2012, the storage charge was set to 1 cent per gigabyte per month. This was reduced to 0.7 cents in September 2015 and to the current 0.4 cents in December 2016.[15]

Glacier used to charge for retrievals based on peak monthly retrieval rate, meaning that (ignoring the free tier) if you downloaded four gigabytes in four hours, it would cost the same as if you downloaded 720 gigabytes in 720 hours, in a 30-day month. This made it cheaper to spread out data retrievals over a long period of time, but failing to do so could result in a surprisingly large bill. In one case, a user stored 15 GB of data in Glacier, retrieved 693 MB for testing, and ended up being charged for 126 GB due to retrieval rate calculation.[16] This pricing policy was widely regarded as a "time bomb" set to go off on retrieval.[17]

In 2016, AWS revised their retrieval pricing model.[18] The new model bases the retrieval fee on the number of gigabytes retrieved. This can amount to a 99% price cut for users who perform only one Glacier retrieval in a month. At the same time, AWS introduced new methods of retrieval that take different amounts of time. An expedited retrieval costs one cent per request and three cents per gigabyte, and can retrieve data in one to five minutes. A standard retrieval costs five cents per thousand requests and one cent per gigabyte, and takes three to five hours. A bulk retrieval costs 2.5 cents per thousand requests and 0.25 cents per gigabyte, and takes seven to twelve hours. AWS also introduced provisioned capacity for expedited retrievals, each unit of which costs $100 per month and guarantees at least three expedited retrievals every five minutes, and up to 150 MB/s of retrieval bandwidth. Without provisioned capacity, expedited retrievals are done on a capacity available basis.[19]

Data deleted from Glacier less than 90 days after being stored incurs a charge equal to the cost of storage for the remainder of the 90 days. (In effect, the user pays for 90 days minimum.) This move was designed to discourage the service's use in cases where Amazon's other storage offerings (e.g. S3) are more appropriate for real-time access. After 90 days, deletion from Glacier is free.

Retrieving data from Glacier is a two-step process. The first step is to retrieve the data into a staging area, where it stays for 24 hours.[20] The second step is to download the data from the staging area, which may incur bandwidth charges.[21]

Glacier is also available as a "storage class" in S3.[22] Objects can only be put into Glacier by lifecycle rules, which can be configured to put the objects in Glacier once they have reached a certain age. Pricing is the same, but there is no staging area; instead, retrieved objects are simultaneously stored in Glacier and in Reduced Redundancy class for a number of days that the user specifies.

References

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
Amazon S3 Glacier is a family of secure, durable, and low-cost object storage classes within designed for archiving data that is infrequently accessed and retained for long periods, such as backups, compliance records, and media assets. It provides scalable storage with retrieval times ranging from milliseconds to hours or days, depending on the chosen class, enabling organizations to optimize costs for data that does not require immediate access. Launched originally as a standalone service in 2012, Amazon Glacier was fully integrated into S3 as storage classes to simplify management and leverage S3's APIs and tools. The service evolved from its initial release on August 20, 2012, when AWS introduced Amazon Glacier as a dedicated archival storage solution priced at $0.01 per GB per month, emphasizing secure and reliable long-term data preservation. Early integrations allowed automatic transitions from S3 Standard storage via lifecycle policies, and by 2016, pricing dropped to $0.004 per GB per month with added retrieval options including expedited (1-5 minutes) and bulk (5-12 hours) tiers. Key milestones include the 2018 addition of direct PUT support for S3, the 2019 launch of the Deep Archive class for even lower costs, and 2021 updates renaming the original class to Flexible Retrieval while introducing Instant Retrieval for near-real-time access. Starting December 15, 2025, the standalone Amazon Glacier service will no longer accept new customers, directing users to the S3-integrated classes for continued archival needs. Amazon S3 Glacier comprises three primary storage classes tailored to different access patterns: S3 Glacier Instant Retrieval, which offers millisecond access for data needed within 90 days with a 90-day minimum storage duration at up to 68% lower cost than S3 Standard-Infrequent Access; S3 Glacier Flexible Retrieval, providing options from minutes to 12 hours for backups and analytics with minimum storage durations of 90 days; and S3 Glacier Deep Archive, the lowest-cost option at $0.00099 per GB per month for rarely accessed compliance data, with 12-48 hour retrieval and a 180-day minimum. All classes support seamless S3 lifecycle management for automatic tiering and integrate with services like AWS CloudTrail for auditing. These classes deliver 99.999999999% (11 9's) durability across multiple Availability Zones, server-side encryption, and compliance features such as S3 Object Lock for write-once-read-many (WORM retention to meet regulations like HIPAA and SEC Rule 17a-4. Pricing is usage-based with no minimum fees, focusing on storage costs plus retrieval and data transfer fees, making it suitable for petabyte-scale archives across industries including healthcare, , and media.

Overview

Introduction

Amazon S3 Glacier is a low-cost service within (AWS) Simple Storage Service (S3), designed specifically for data archiving and long-term backup of infrequently accessed information. It enables users to securely store large volumes of data, such as backups, media archives, and compliance records, while providing virtually unlimited . The service emphasizes exceptional durability, engineered to deliver 99.999999999% (11 nines) data durability over a given year through redundant storage across multiple devices and facilities. It also offers tailored to archival needs, ensuring reliable access when data is retrieved. Unlike standard S3 storage classes optimized for frequent access, Amazon S3 Glacier prioritizes cost efficiency for data that is rarely retrieved, with restoration times typically spanning from minutes to hours or days rather than milliseconds. This makes it ideal for scenarios where immediate access is not required, such as long-term retention for or disaster recovery. Launched in as part of the AWS S3 ecosystem, it has evolved to include multiple storage classes by 2025, allowing flexible options for varying archival requirements. At its core, Amazon S3 Glacier stores data as objects within S3 buckets, supporting automatic tiering through lifecycle policies to transition objects between storage classes based on access patterns and retention needs. This integration with S3 provides seamless management, including features like versioning, encryption, and access controls, while maintaining the service's focus on economical, durable archival storage.

History

Amazon S3 Glacier was initially launched on August 20, 2012, as a standalone archival storage service by (AWS), designed for secure, durable, and low-cost data archiving at $0.01 per GB per month, utilizing a vault-based architecture with retrieval times of 3 to 5 hours. This service addressed the need for affordable long-term storage separate from the core object storage, enabling customers to archive data without the higher costs of frequent access tiers. In November 2012, AWS introduced integration with Lifecycle policies, allowing automatic transitions of S3 objects to Glacier vaults for cost optimization. By November 2016, AWS enhanced with significant price reductions—to $0.004 per GB per month in key regions—and expanded retrieval options, including expedited (1-5 minutes) and bulk (5-12 hours) tiers to improve flexibility for varying access needs. The service saw further evolution in November 2018 when AWS officially integrated it into as a native storage class, renaming it and enabling direct uploads via S3 APIs, which simplified management and eliminated the need for separate vault operations. This transition marked a shift toward seamless incorporation within the S3 ecosystem, supporting broader adoption for archival workloads. In March 2019, AWS introduced the S3 Glacier Deep Archive storage class, offering even lower costs at $0.00099 per GB per month for data accessed less than once or twice a year, with retrieval times of 12 to 48 hours, targeting ultra-long-term retention scenarios. Advancements continued in November 2021 with the launch of S3 Glacier Instant Retrieval, providing millisecond access for rarely retrieved data at $0.004 per GB per month, alongside renaming the original S3 Glacier class to S3 Glacier Flexible Retrieval to better reflect its variable retrieval options and include free bulk retrievals. As of 2025, AWS announced that the original standalone Amazon Glacier service would cease accepting new customers starting December 15, 2025, fully directing users to the integrated S3 Glacier storage classes, with no disruptions for existing customers and continued support for hybrid cloud integrations via enhanced like AWS DataSync. This update underscores the complete migration to S3-native archival storage, with minor ongoing optimizations in and capabilities but no new storage classes introduced.

Storage Classes

Glacier Instant Retrieval

Amazon S3 Glacier Instant Retrieval is designed for long-lived data that is rarely accessed, typically once per quarter, while providing millisecond retrieval times to support performance-sensitive archival needs. This storage class offers real-time access with the same latency and throughput as S3 Standard-Infrequent Access, enabling near-instant retrieval for objects without the need for restoration processes. It was introduced on November 30, 2021, as a new option within the S3 family to bridge the gap between frequent-access storage classes and traditional archival tiers, allowing users to optimize costs for data that requires occasional but rapid recovery. Key specifications include a minimum storage duration of 90 days, after which early deletion incurs pro-rated charges for the full period. For billing purposes, objects smaller than 128 KB are charged at the 128 KB rate to account for the fixed overhead in archival storage. The class provides high durability of 99.999999999% (11 nines) over a given year, ensuring robust protection against , and an of 99.9% over the same period. This storage class is ideal for active archives where quick recovery is essential, such as media assets, in online file-sharing platforms, , data, and health records. It balances cost efficiency with accessibility, making it suitable for scenarios where data is not frequently retrieved but must be available without significant delays, unlike slower archival options. Trade-offs include higher storage costs compared to deeper archive classes, offset by the elimination of retrieval fees and expedited access for time-critical applications.

Glacier Flexible Retrieval

Amazon S3 Glacier Flexible Retrieval, formerly known as S3 Glacier, is an archival storage class designed for data that requires occasional access with retrieval times measured in minutes to hours. It provides three retrieval options to balance speed and cost: expedited retrievals complete in 1–5 minutes for urgent needs, standard retrievals typically finish within 3–5 hours, and bulk retrievals take 5–12 hours at no additional retrieval cost. These options allow users to provision capacity for faster restores when necessary, making it suitable for scenarios where immediate access is not required but flexibility is valued. Introduced as part of the original S3 Glacier service and renamed in November 2021 to emphasize its retrieval versatility, this class features a minimum storage duration of 90 days and imposes no minimum object size, enabling efficient storage of diverse data sets from small files to large archives. Upon renaming, storage costs were reduced by up to 10% compared to the prior S3 Glacier pricing, further lowering the expense for long-term retention. It maintains high reliability with 99.99% availability over a given year and 99.999999999% (11 9's) durability, achieved through redundant storage across multiple Availability Zones. This storage class is particularly well-suited for backups, disaster recovery, and offsite where data is accessed infrequently, such as one to two times per year, and retrieval within several hours is acceptable. Objects in Glacier Flexible Retrieval require a restore request before access, and it integrates seamlessly with lifecycle policies to automate transitions for cost optimization. Each archived object incurs 40 KB of additional metadata storage—comprising 32 KB at the Glacier Flexible Retrieval rate and 8 KB at the S3 Standard rate—to support efficient retrieval processes.

Glacier Deep Archive

Amazon S3 Glacier Deep Archive is the lowest-cost storage class within the Amazon S3 Glacier family, optimized for the long-term retention of that is accessed infrequently, such as backups and archival records requiring durable, secure storage over extended periods. This class achieves its cost efficiency through optimized placement across multiple AWS Availability Zones, ensuring 99.999999999% (11 9's) durability while prioritizing minimal retrieval needs. It serves as an ideal solution for organizations seeking to store petabytes of without the operational overhead of physical . Retrieval from Glacier Deep Archive is available via two options: standard retrieval, which restores data within 12 hours, or bulk retrieval, which completes within 48 hours; unlike other Glacier classes, expedited retrieval is not supported to maintain the lowest pricing tier. Objects in this class have a minimum storage duration of 180 days, after which early deletion incurs pro-rated charges for the remaining period. Additionally, each archived object requires 40 KB of metadata storage—comprising 32 KB at the Deep Archive rate and 8 KB at the S3 Standard rate. This fixed metadata charge makes storing very small objects (under 40 KB) less cost-efficient per GB. As of 2025, storage in Deep Archive costs $0.00099 per GB-month (or $1 per TB-month), representing the most affordable option and up to 75% lower than the S3 Flexible Retrieval class. Launched on , 2019, this storage class was introduced to deliver competitive against off-premises tape archival services, enabling customers to eliminate the capital and maintenance expenses of on-premises tape libraries while providing millisecond access to metadata for efficient inventory management. Glacier Deep Archive is particularly well-suited for compliance-driven archives in sectors like and healthcare, where long-term regulatory retention is mandatory; efforts in media and entertainment, such as storing historical footage; and as a scalable replacement for traditional tape-based systems, reducing total ownership costs through automated management and global redundancy.

Retrieval and Management

Retrieval Options

For the S3 Glacier Flexible Retrieval and S3 Glacier Deep Archive storage classes, Amazon S3 offers up to three retrieval tiers to access archived data: Expedited, Standard, and Bulk (with Expedited available only for Flexible Retrieval, and Standard and Bulk for both classes). Expedited retrieval provides the quickest turnaround for urgent access needs and is available exclusively for objects in the S3 Glacier Flexible Retrieval storage class, with provisioned capacity options to ensure dedicated throughput. Standard retrieval serves as the default balanced option suitable for most use cases, while Bulk retrieval optimizes for cost efficiency when restoring large volumes of data, often at no additional retrieval fee for objects in S3 Glacier Flexible Retrieval. Additionally, the AWS Free Tier includes up to 10 GB of free data retrieval per month across eligible Glacier storage classes, enabling low-volume access without charges. Retrieval jobs are initiated programmatically using the Amazon S3 RestoreObject operation, where users specify the desired tier (Expedited, Standard, or Bulk) along with the object's key and restoration period. Once initiated, subsequent access occurs via GetObject for full retrieval or GetObject with range parameters for partial downloads of the restored object. For legacy Amazon Glacier vaults, the InitiateJob handles similar retrieval requests, supporting inventory, archive, or retrieval jobs. Upon job initiation, Amazon S3 assigns a unique Job ID for tracking progress. Users monitor status by polling the HeadObject API to check restoration completion or, for vault-based operations, the DescribeJob or ListJobs APIs. Notifications for job completion can be configured via Amazon S3 Event Notifications, which trigger on restore events and integrate with Amazon SNS or AWS Lambda for automated workflows. In legacy Glacier setups, vault notifications publish to SNS topics upon job finalization. Provisioned capacity enhances reliability for Expedited retrievals in S3 Glacier Flexible Retrieval by reserving resources; each unit of capacity supports up to three retrievals every five minutes and a throughput of 150 MB per second, purchasable via the Amazon S3 console, , or AWS CLI. This feature mitigates potential throttling during high-demand periods without altering underlying retrieval times, which vary by storage class. Amazon S3 Select integrates with by allowing SQL-based in-place queries on restored objects in active storage tiers, enabling selective data extraction without downloading entire archives; however, direct queries on unretrieved objects require prior restoration, as standard S3 Select does not support archive tiers natively. (Note: The former S3 Select feature for querying archives without restoration is available only to existing customers, with new access closed since July 2024.) Key limitations include the lack of real-time streaming access to archived objects, necessitating a restore job before any Get or Select operations. Partial retrievals via S3 Select incur charges proportional to the scanned data volume, measured in GB, regardless of the returned results size. All retrievals are subject to API request quotas, such as 1,000 transactions per second per prefix.

Lifecycle Policies and Vaults

Amazon S3 Lifecycle policies enable automated management of by defining rules that transition objects between storage classes based on age, thereby optimizing costs for archival data. These rules allow users to specify transitions from higher-cost classes like S3 Standard to S3 Glacier classes—for example, to S3 Glacier Instant Retrieval after 90 days (with a 90-day minimum storage duration in the class), to S3 Glacier Flexible Retrieval after 30 days (with a 90-day minimum storage duration in the class), or to S3 Glacier Deep Archive after 30 days (with a 180-day minimum storage duration)—subject to the target class's minimum storage duration rather than a fixed prerequisite age for the transition. For instance, a common configuration transitions infrequently accessed objects from S3 Standard-IA to S3 Glacier Flexible Retrieval after 90 days to balance accessibility and cost savings. Vaults in Amazon Glacier represent a legacy organizational structure for the standalone Glacier service, separate from S3-integrated storage classes, where archives are stored without direct S3 integration. Users create vaults to group archives logically, and each vault can hold unlimited data with access controlled via IAM policies. Vault Lock policies, introduced in 2015, provide write-once-read-many (WORM) compliance by allowing users to apply immutable retention rules using IAM policy syntax; once initiated and completed after a 24-hour validation period, the lock cannot be altered or removed. These policies support retention periods up to 100 years to meet regulatory requirements like SEC Rule 17a-4. Vault inventory jobs facilitate by generating point-in-time listings of all within a vault, including metadata such as archive IDs, sizes, and creation dates. Initiated via the Initiate Job , these jobs produce output in (default) or CSV format, with completion typically taking up to a few hours; inventories are updated daily but reflect a snapshot from the job initiation time. The retrieved can be downloaded directly or, for , migrated to an S3 bucket using AWS solutions guidance to enable querying and further processing. Expiration rules within S3 Lifecycle configurations automate the permanent deletion of objects after defined retention periods, helping control storage growth and costs in Glacier classes. For example, objects can be set to expire 3,650 days after creation following a transition to S3 Glacier Flexible Retrieval, but early expiration incurs minimum storage duration charges—90 days for S3 Glacier Instant Retrieval or Flexible Retrieval, and 180 days for S3 Glacier Deep Archive. These rules apply only to current object versions in versioned buckets, with noncurrent versions handled separately via dedicated actions. Best practices for implementing these policies emphasize combining object tags with lifecycle rules for granular control, reducing the number of rules needed across diverse datasets. Tags, such as "TransitionArchive:90" indicating a 90-day shift to S3 Glacier Instant Retrieval, allow filtering rules to apply specific actions to subsets of objects regardless of prefix, enabling flexible management like separate retention for compliance-tagged data. Automate tag application during uploads via the PutObject API or triggers, and for existing objects, use S3 Batch Operations with inventory reports to tag at scale without incurring extra costs.

Pricing

As of November 2025 (US East - N. Virginia), Glacier offers tiered storage pricing based on the selected storage class, designed for long-term archival with costs measured per per month (GB-month). The S3 Glacier Instant Retrieval class charges $0.004 per GB-month, providing access for data that may need quick retrieval. The S3 Glacier Flexible Retrieval class is priced at $0.0036 per GB-month, suitable for infrequently accessed data with retrieval times ranging from minutes to hours. The lowest-cost option, S3 Glacier Deep Archive, costs $0.00099 per GB-month for data expected to remain untouched for extended periods, with retrieval times of 12 hours or more. All Glacier classes apply minimum object size charges to ensure cost efficiency for small files. For Instant Retrieval, objects smaller than 128 KB are charged as 128 KB. Flexible Retrieval and Deep Archive include a 40 KB minimum, comprising 8 KB of metadata at S3 Standard rates and 32 KB at the respective class rates. To encourage long-term retention, each class enforces a minimum storage duration, with pro-rated charges applied for early deletions. Instant Retrieval and Flexible Retrieval require a 90-day minimum, while Deep Archive mandates 180 days; deleting data before these periods incurs fees for the remaining duration. There are no fees for data ingress into S3 Glacier storage classes.
Storage ClassStorage Cost (per GB-month)Minimum Object SizeMinimum Duration
Glacier Instant Retrieval$0.004128 KB90 days
Glacier Flexible Retrieval$0.003640 KB90 days
Glacier Deep Archive$0.0009940 KB180 days

Retrieval and Request Fees

Amazon S3 Glacier incurs retrieval fees based on the chosen retrieval option and the amount of data retrieved, with variations between the Flexible Retrieval and Deep Archive storage classes. For Expedited retrieval (1-5 minutes) in Flexible Retrieval, users pay $10 per 1,000 requests plus $0.03 per GB retrieved. Expedited retrieval is also available for Deep Archive but may take 1-12 hours. Standard retrieval (3-5 hours for Flexible, 12 hours for Deep) costs $0.05 per 1,000 requests plus $0.01 per GB in Flexible Retrieval and $0.10 per 1,000 requests plus $0.02 per GB in Deep Archive, while Bulk retrieval (5-12 hours for Flexible, 48 hours for Deep) is free for both requests and data in Flexible Retrieval and $0.025 per 1,000 requests plus $0.0025 per GB in Deep Archive. A limited free tier allows up to 10 GB of Standard retrievals per month at no charge. Request fees apply to API operations for managing and accessing data in Glacier buckets using Glacier storage classes. PUT, COPY, and POST requests cost $0.005 per 1,000 requests, while GET and SELECT requests are priced at $0.0004 per 1,000 requests. These fees support operations like uploading archives, initiating retrieval jobs, and querying data without full retrieval. Data transfer out from Glacier to the follows a tiered structure to accommodate varying usage volumes. The first 100 GB per month is free, followed by $0.09 per GB for the next 10 TB, with rates decreasing for higher volumes (e.g., $0.085 per GB for the next 40 TB). Transfers within the same AWS , such as to Amazon EC2, incur no additional fees. For Select queries, which enable in-place querying of archived data without full retrieval, fees are based on the data scanned and returned. Standard Select queries charge $0.008 per GB scanned, with higher rates for Expedited queries (e.g., $0.03 per GB scanned plus request fees). Returned data incurs additional transfer costs if downloaded. Vault-specific operations, such as initiating inventory jobs to list archives, cost $0.025 per 1,000 objects retrieved in the inventory report. These jobs help manage large-scale archives but add to operational expenses for frequent audits. Provisioned capacity units can be purchased for predictable high-volume Expedited retrievals at additional fixed monthly fees.
Fee TypeDescriptionPricing (US East, N. Virginia)
Expedited Retrieval (Flexible Retrieval)Per 1,000 requests + per GB$10 + $0.03/GB
Expedited Retrieval (Deep Archive)Per 1,000 requests + per GB$10 + $0.03/GB
Standard Retrieval (Flexible Retrieval)Per 1,000 requests + per GB$0.05 + $0.01/GB
Standard Retrieval (Deep Archive)Per 1,000 requests + per GB$0.10 + $0.02/GB
Bulk Retrieval (Flexible Retrieval)Per 1,000 requests + per GBFree
Bulk Retrieval (Deep Archive)Per 1,000 requests + per GB$0.025 + $0.0025/GB
PUT/COPY/POST RequestsPer 1,000 requests$0.005
GET/SELECT RequestsPer 1,000 requests$0.0004
Data Transfer Out (first 100 GB/month)To Free
Data Transfer Out (next 10 TB/month)To $0.09/GB
Select Queries (Standard, scanned)Per GB scanned$0.008
Vault Inventory JobsPer 1,000 objects$0.025

Use Cases and Integration

Archival Applications

Amazon S3 Glacier is widely utilized for long-term data retention in scenarios requiring durable, low-cost archival storage, such as legal holds, media libraries, and scientific datasets. Organizations employ Glacier's storage classes to preserve records that must be retained for extended periods due to regulatory requirements, ensuring data integrity over decades without frequent access. For instance, media companies archive vast libraries of historical content, like video footage and audio files, leveraging Glacier's high durability of 99.999999999% to safeguard irreplaceable assets. Similarly, research institutions store scientific datasets, including genomic sequences and environmental observations, for future analysis while minimizing ongoing costs. In backup strategies, Glacier supports offsite replication for disaster recovery by integrating with Cross-Region Replication, enabling data to be mirrored across geographic regions for . This approach facilitates tiered retrieval options—ranging from minutes in Flexible Retrieval to 12-48 hours in Deep Archive—allowing organizations to balance recovery point objectives (RPO) and recovery time objectives (RTO) based on data criticality. For example, critical backups can use faster tiers to achieve lower RTOs, while less urgent archives opt for deeper, cheaper storage to reduce overall expenses without compromising availability during recovery. Such strategies enhance resilience against site failures or , as demonstrated in solutions using AWS Storage Gateway for hybrid environments. Glacier offers substantial cost savings for cold data, with storage costs up to 95% lower than Standard, making it economical for infrequently accessed archives. This reduction stems from optimized pricing tiers, where Deep Archive charges as low as $0.00099 per GB per month compared to $0.023 for Standard, enabling organizations to scale petabyte-level storage affordably. In healthcare, enables HIPAA-compliant retention of patient records and imaging data, as seen with providers like hc1.com archiving encrypted datasets for long-term access during audits or legal reviews. Financial institutions, such as , utilize for audit trails and , storing transaction logs and compliance documents to meet retention mandates like SEC rules while optimizing . These applications highlight 's role in supporting industry-specific without excessive overhead. A key challenge in archival applications is planning for retrieval delays, which can range from several minutes to days depending on the storage class, requiring workflows to incorporate asynchronous access patterns. Users must design applications with these latencies in mind, using tools like Lifecycle policies to automate transitions and avoid unexpected costs from expedited restores.

Integration with

Glacier operates as a set of storage classes within the ecosystem, enabling users to store objects directly in S3 buckets while designating them for archival purposes. This integration allows seamless object storage by uploading data to standard S3 buckets and specifying a Glacier storage class—such as S3 Glacier Flexible Retrieval (), S3 Glacier Deep Archive (DEEP_ARCHIVE), or S3 Glacier Instant Retrieval ()—at the time of upload via the AWS Management Console, API, or other interfaces. Unlike the legacy Amazon Glacier service, which required separate vault management, these classes treat archival objects as native S3 objects, supporting full S3 compatibility for metadata, versioning, and tagging. As of December 15, 2025, the standalone Amazon Glacier service no longer accepts new customers, directing users to the S3-integrated classes. Lifecycle enhances this integration by enabling automatic tiering of objects from frequently accessed classes like S3 Standard to classes based on predefined access patterns and age thresholds. Users configure S3 Lifecycle policies in the console or via to transition objects after a specified number of days, optimizing costs without manual intervention—for instance, moving unused for 30 days to GLACIER. This applies across an entire or to specific prefixes/tags, ensuring flows dynamically through the storage tiers. Practical tools facilitate uploads and management within this integrated framework. The AWS (CLI) supports direct designation of Glacier classes during operations, as in the command aws s3 cp file.txt s3://my-bucket/ --storage-class [GLACIER](/page/Glacier), which uploads a local file to an S3 bucket in the Flexible Retrieval class. Similarly, AWS SDKs provide programmatic support; for example, in Python using Boto3, the upload_file method accepts an ExtraArgs dictionary with 'StorageClass': '[GLACIER](/page/Glacier)' to store objects in the archival tier. Hybrid setups combine Glacier classes with S3 Intelligent-Tiering for adaptive storage, where objects automatically move to archival access tiers (e.g., Archive Access after 30 days of inactivity or Deep Archive Access after 180 days) based on real-time access patterns, without predefined rules. This dynamic movement leverages monitoring to balance cost and availability seamlessly within S3. For users with data in legacy Amazon Glacier vaults prior to the 2012 launch of Amazon Glacier and its integration with S3, migration paths involve automated transfer to S3 buckets using AWS solutions like the Data Transfer from Amazon Glacier Vaults to Amazon S3 implementation, which restores and copies archives to a target bucket with the desired Glacier class. This process eliminates the need for separate vault management, aligning legacy data with modern S3 features.

Security Features

Encryption Methods

Amazon S3 Glacier secures data through server-side options for storage at rest, applicable to its storage classes such as S3 Glacier Flexible Retrieval and S3 Glacier Deep Archive. These options include SSE-S3, which employs AWS-managed keys with AES-256 and is automatically enabled as the default for all new objects uploaded to buckets, incurring no additional cost. SSE-KMS utilizes customer-managed keys via AWS Key Management Service (KMS) for enhanced control, allowing integration with compliance requirements, while SSE-C permits customers to supply their own keys, which uses to handle the encryption and decryption processes. Key rotation differs by method: AWS performs automatic rotation for the underlying keys in SSE-S3, whereas SSE-KMS relies on manual rotation configured through the AWS KMS console, , or automated policies. SSE-C requires customers to manage key rotation independently, as keys are provided per request. All methods ensure and using the robust AES-256 standard. For data in transit, Glacier mandates with TLS 1.2 or later for all operations and file transfers, preventing interception during upload, retrieval, or management activities. This applies uniformly to interactions with storage classes and ensures end-to-end protection without additional configuration. In the legacy vault-based Glacier service, archives are automatically encrypted at rest upon upload using AWS-managed AES-256 keys. The legacy service will stop accepting new customers on December 15, 2025, but existing vaults and data remain secure and accessible indefinitely. No customer key options are available. AWS handles key rotation automatically in this model.

Access Controls and Compliance

Amazon S3 Glacier, as a storage class within , utilizes AWS Identity and Access Management (IAM) policies to manage permissions at both the bucket and object levels. These policies are defined in JSON format and allow administrators to specify allow or deny actions, such as s3:GetObject for retrieving data or s3:RestoreObject for initiating retrieval jobs from Glacier storage. Conditions can be included to enforce restrictions, including source ranges via the aws:SourceIp condition key, multi-factor authentication requirements with aws:MultiFactorAuthPresent, or time-based access using aws:CurrentTime. Bucket policies complement IAM by providing resource-based access control directly on S3 buckets containing objects. These policies can grant or deny specific actions, for example, allowing s3:GetObject on objects in the storage class while denying deletions with s3:DeleteObject. Like IAM policies, they support conditions such as IP-based restrictions to limit access from particular networks, ensuring granular control over archival data. Bucket policies are particularly useful for cross-account access, where permissions can be delegated to external AWS accounts without modifying user-level IAM roles. For regulatory compliance, Amazon S3 Glacier supports standards including GDPR, HIPAA, and PCI DSS through features that enable data immutability and auditing. Vault locks, available in the legacy Amazon Glacier service, provide write-once-read-many (WORM) protection by allowing policies to be locked against modification, supporting retention requirements for financial regulations like SEC Rule 17a-4. In the current S3 Glacier storage classes, this is achieved via S3 Object Lock, which enforces WORM semantics in two modes: Governance mode, where authorized users can bypass retention with specific permissions, and Compliance mode, which prevents all modifications or deletions regardless of privileges. Object Lock supports retention periods from 1 day to a maximum of 100 years, or indefinitely, and includes legal hold functionality to indefinitely protect objects without a predefined expiration. These mechanisms help meet immutability needs for compliance frameworks by preventing premature data alteration. Auditing in Amazon S3 Glacier is facilitated through integration with AWS CloudTrail, which logs all calls related to storage actions, including uploads to Glacier classes, retrieval initiations, and changes. CloudTrail captures detailed event , such as the principal identity, request parameters, and response elements, enabling comprehensive trails for compliance verification. Logs can be stored durably in S3 for long-term retention and analyzed using Amazon Athena or other tools to detect unauthorized access attempts. This integration ensures traceability of all interactions with Glacier , supporting requirements for logs in regulations like HIPAA and PCI DSS.

References

Add your contribution
Related Hubs
User Avatar
No comments yet.