Skip to content

Latest commit

 

History

History
250 lines (143 loc) · 22.3 KB

storage-blob-storage-tiers.md

File metadata and controls

250 lines (143 loc) · 22.3 KB
title description services author ms.service ms.topic ms.date ms.author ms.component
Premium, Hot, Cool, and Archive storage for blobs - Azure Storage
Premium, Hot, Cool, and Archive storage for Azure storage accounts.
storage
kuhussai
storage
article
10/18/2018
kuhussai
blobs

Azure Blob storage: Premium (preview), Hot, Cool, and Archive storage tiers

Overview

Azure storage offers different storage tiers which allow you to store Blob object data in the most cost-effective manner. The available tiers include:

  • Premium storage (preview) provides high performance hardware for data that is accessed frequently.

  • Hot storage: is optimized for storing data that is accessed frequently.

  • Cool storage is optimized for storing data that is infrequently accessed and stored for at least 30 days.

  • Archive storage is optimized for storing data that is rarely accessed and stored for at least 180 days with flexible latency requirements (on the order of hours).

The following considerations accompany the different storage tiers:

  • The Archive storage tier is only available at the blob level and not at the storage account level.

  • Data in the Cool storage tier can tolerate slightly lower availability, but still requires high durability and similar time-to-access and throughput characteristics as Hot data. For Cool data, a slightly lower availability SLA and higher access costs compared to Hot data are acceptable trade-offs for lower storage costs.

  • Archive storage is offline and offers the lowest storage costs but also the highest access costs.

  • Only the Hot and Cool storage tiers can be set at the account level. Currently the Archive tier cannot be set at the account level.

  • Hot, Cool, and Archive tiers can be set at the object level.

Data stored in the cloud grows at an exponential pace. To manage costs for your expanding storage needs, it's helpful to organize your data based on attributes like frequency-of-access and planned retention period to optimize costs. Data stored in the cloud can be different in terms of how it is generated, processed, and accessed over its lifetime. Some data is actively accessed and modified throughout its lifetime. Some data is accessed frequently early in its lifetime, with access dropping drastically as the data ages. Some data remains idle in the cloud and is rarely, if ever, accessed once stored.

Each of these data access scenarios benefits from a different storage tier that is optimized for a particular access pattern. With Hot, Cool, and Archive storage tiers, Azure Blob storage addresses this need for differentiated storage tiers with separate pricing models.

Storage accounts that support tiering

You may only tier your object storage data to Hot, Cool, or Archive in Blob storage or General Purpose v2 (GPv2) accounts. General Purpose v1 (GPv1) accounts do not support tiering. However, customers can easily convert their existing GPv1 or Blob storage accounts to GPv2 accounts through a simple one-click process in the Azure portal. GPv2 provides a new pricing structure for blobs, files, and queues, and access to a variety of other new storage features as well. Furthermore, going forward some new features and prices cuts will only be offered in GPv2 accounts. Therefore, customers should evaluate using GPv2 accounts but only use them after reviewing the pricing for all services as some workloads can be more expensive on GPv2 than GPv1. For more information, see Azure storage account overview.

Blob storage and GPv2 accounts expose the Access Tier attribute at the account level, which allows you to specify the default storage tier as Hot or Cool for any blob in the storage account that does not have an explicit tier set at the object level. For objects with the tier set at the object level, the account tier will not apply. The Archive tier can only be applied at the object level. You can switch between these storage tiers at any time.

Premium access tier

Available in preview is a Premium access tier which makes frequently accessed data available via high-performance hardware. Data stored in this tier is stored on solid-state drives, which are optimized for lower latency an higher transactional rates compared to traditional hard drives. The Premium access tier is available via the Block Blob storage account type only.

This tier is ideal for workloads that require fast and consistent response times. Data that involves end-users such as interactive video editing, static web content, online transactions and the like are a good candidates for the Premium access tier. This tier is tailored for workloads that perform many small transactions, such as capturing telemetry data, messaging, and data transformation.

To use this tier, provision a new Block Blob storage account and start creating containers and blobs using the Blob Service REST API, AzCopy, or Azure Storage Explorer.

During preview, the Premium access tier:

  • Is available as locally redundant storage (LRS)
  • Is only available in the following regions: US East 2, US Central, and US West
  • Does not support automatic tiering and data lifecycle management

To learn how to register for the Premium access tier preview, see Introducing Azure Premium Blob Storage.

Hot access tier

Hot storage has higher storage costs than Cool and Archive storage, but the lowest access costs. Example usage scenarios for the Hot storage tier include:

  • Data that is in active use or expected to be accessed (read from and written to) frequently.
  • Data that is staged for processing and eventual migration to the Cool storage tier.

Cool access tier

Cool storage tier has lower storage costs and higher access costs compared to Hot storage. This tier is intended for data that will remain in the Cool tier for at least 30 days. Example usage scenarios for the Cool storage tier include:

  • Short-term backup and disaster recovery datasets.
  • Older media content not viewed frequently anymore but is expected to be available immediately when accessed.
  • Large data sets that need to be stored cost effectively while more data is being gathered for future processing. (For example, long-term storage of scientific data, raw telemetry data from a manufacturing facility)

Archive access tier

Archive storage has the lowest storage cost and higher data retrieval costs compared to Hot and Cool storage. This tier is intended for data that can tolerate several hours of retrieval latency and will remain in the Archive tier for at least 180 days.

While a blob is in Archive storage, it is offline and cannot be read (except the metadata, which is online and available), copied, overwritten, or modified. Nor can you take snapshots of a blob in Archive storage. However, you may use existing operations to delete, list, get blob properties/metadata, or change the tier of your blob.

Example usage scenarios for the Archive storage tier include:

  • Long-term backup, secondary backup, and archival datasets
  • Original (raw) data that must be preserved, even after it has been processed into final usable form. (For example, Raw media files after transcoding into other formats)
  • Compliance and archival data that needs to be stored for a long time and is hardly ever accessed. (For example, Security camera footage, old X-Rays/MRIs for healthcare organizations, audio recordings, and transcripts of customer calls for financial services)

Blob rehydration

To read data in Archive storage, you must first change the tier of the blob to Hot or Cool. This process is known as rehydration and can take up to 15 hours to complete. Large blob sizes are recommended for optimal performance. Rehydrating several small blobs concurrently may add additional time.

During rehydration, you may check the Archive Status blob property to confirm if the tier has changed. The status reads "rehydrate-pending-to-hot" or "rehydrate-pending-to-cool" depending on the destination tier. Upon completion, the Archive status property is removed, and the Access Tier blob property reflects the new Hot or Cool tier.

Blob-level tiering

Blob-level tiering allows you to change the tier of your data at the object level using a single operation called Set Blob Tier. You can easily change the access tier of a blob among the Hot, Cool, or Archive tiers as usage patterns change, without having to move data between accounts. All tier changes happen immediately. However, rehydrating a blob from Archive can take several hours.

The time of the last blob tier change is exposed via the Access Tier Change Time blob property. If a blob is in the Archive tier, it cannot be overwritten, so uploading the same blob is not permitted in this scenario. You can overwrite a blob in a Hot or Cool tier, in which case the new blob inherits the tier of the blob that was overwritten.

Blobs in all three storage tiers can co-exist within the same account. Any blob that does not have an explicitly assigned tier infers the tier from the account access tier setting. If the access tier is inferred from the account, you see the Access Tier Inferred blob property set to "true", and the blob Access Tier blob property matches the account tier. In the Azure portal, the access tier inferred property is displayed with the blob access tier (for example, Hot (inferred) or Cool (inferred)).

Note

Archive storage and blob-level tiering only support block blobs. You also cannot change the tier of a block blob that has snapshots.

Data stored in the Premium access tier cannot be tiered to Hot, Cool or Archive using Set Blob Tier or using Azure Blob Storage lifecycle management. To move data, you must synchronously copy blobs from Premium access to Hot using the Put Block From URL API or a version of AzCopy that supports this API. The Put Block From URL API synchronously copies data on the server, meaning the call completes only once all the data is moved from the original server location to the destination location.

Blob lifecycle management

Blob Storage lifecycle management (Preview) offers a rich, rule-based policy that you can use to transition your data to the best access tier and to expire data at the end of its lifecycle. See Manage the Azure Blob storage lifecycle to learn more.

Blob-level tiering billing

When a blob is moved to a cooler tier (Hot->Cool, Hot->Archive, or Cool->Archive), the operation is billed as a write operation to the destination tier, where the write operation (per 10,000) and data write (per GB) charges of the destination tier apply. If a blob is moved to a warmer tier (Archive->Cool, Archive->Hot, or Cool->Hot), the operation is billed as a read from the source tier, where the read operation (per 10,000) and data retrieval (per GB) charges of the source tier apply.

If you toggle the account tier from Hot to Cool, you will be charged for write operations (per 10,000) for all blobs without a set tier in GPv2 accounts only. There is no charge for this change in Blob storage accounts. You will be charged for both read operations (per 10,000) and data retrieval (per GB) if you toggle your Blob storage or GPv2 account from Cool to Hot. Early deletion charges for any blob moved out of the Cool or Archive tier may apply as well.

Cool and Archive early deletion

In addition to the per GB, per month charge, any blob that is moved into the cool tier (GPv2 accounts only) is subject to a Cool early deletion period of 30 days, and any blob that is moved into the Archive tier is subject to an Archive early deletion period of 180 days. This charge is prorated. For example, if a blob is moved to Archive and then deleted or moved to the Hot tier after 45 days, you will be charged an early deletion fee equivalent to 135 (180 minus 45) days of storing that blob in archive.

Comparison of the storage tiers

The following table shows a comparison of the Hot, Cool, and Archive storage tiers.

Hot storage tier Cool storage tier Archive storage tier
Availability 99.9% 99% N/A
Availability
(RA-GRS reads)
99.99% 99.9% N/A
Usage charges Higher storage costs, lower access and transaction costs Lower storage costs, higher access and transaction costs Lowest storage costs, highest access and transaction costs
Minimum object size N/A N/A N/A
Minimum storage duration N/A 30 days (GPv2 only) 180 days
Latency
(Time to first byte)
milliseconds milliseconds < 15 hrs
Scalability and performance targets Same as general-purpose storage accounts Same as general-purpose storage accounts Same as general-purpose storage accounts

Note

Blob storage accounts support the same performance and scalability targets as general-purpose storage accounts. See Azure Storage Scalability and Performance Targets for more information.

Quickstart scenarios

In this section, the following scenarios are demonstrated using the Azure portal:

  • How to change the default account access tier of a GPv2 or Blob storage account.
  • How to change the tier of a blob in a GPv2 or Blob storage account.

Change the default account access tier of a GPv2 or Blob storage account

  1. Sign in to the Azure portal.

  2. To navigate to your storage account, select All Resources, then select your storage account.

  3. In the Settings blade, click Configuration to view and/or change the account configuration.

  4. Select the right storage tier for your needs: Set the Access tier to either Cool or Hot.

  5. Click Save at the top of the blade.

Change the tier of a blob in a GPv2 or Blob storage account.

  1. Sign in to the Azure portal.

  2. To navigate to your blob in your storage account, select All Resources, select your storage account, select your container, and then select your blob.

  3. In the Blob properties blade, click the Access tier dropdown menu to select the Hot, Cool, or Archive storage tier.

  4. Click Save at the top of the blade.

Pricing and billing

All storage accounts use a pricing model for Blob storage based on the tier of each blob. Keep in mind the following billing considerations:

  • Storage costs: In addition to the amount of data stored, the cost of storing data varies depending on the storage tier. The per-gigabyte cost decreases as the tier gets cooler.
  • Data access costs: Data access charges increase as the tier gets cooler. For data in the Cool and Archive storage tier, you are charged a per-gigabyte data access charge for reads.
  • Transaction costs: There is a per-transaction charge for all tiers that increases as the tier gets cooler.
  • Geo-Replication data transfer costs: This charge only applies to accounts with geo-replication configured, including GRS and RA-GRS. Geo-replication data transfer incurs a per-gigabyte charge.
  • Outbound data transfer costs: Outbound data transfers (data that is transferred out of an Azure region) incur billing for bandwidth usage on a per-gigabyte basis, consistent with general-purpose storage accounts.
  • Changing the storage tier: Changing the account storage tier from Cool to Hot incurs a charge equal to reading all the data existing in the storage account. However, changing the account storage tier from Hot to Cool incurs a charge equal to writing all the data into the Cool tier (GPv2 accounts only).

Note

For more information about pricing for Blob storage accounts, see Azure Storage Pricing page. For more information on outbound data transfer charges, see Data Transfers Pricing Details page.

FAQ

Should I use Blob storage or GPv2 accounts if I want to tier my data?

We recommend you use GPv2 instead of Blob storage accounts for tiering. GPv2 support all the features that Blob storage accounts support plus a lot more. Pricing between Blob storage and GPv2 is almost identical, but some new features and price cuts will only be available on GPv2 accounts. GPv1 accounts do not support tiering.

Pricing structure between GPv1 and GPv2 accounts is different and customers should carefully evaluate both before deciding to use GPv2 accounts. You can easily convert an existing Blob storage or GPv1 account to GPv2 through a simple one-click process. For more information, see Azure storage account overview.

Can I store objects in all three (Hot, Cool, and Archive) storage tiers in the same account?

Yes. The Access Tier attribute set at the account level is the default tier that applies to all objects in that account without an explicit set tier. However, blob-level tiering allows you to set the access tier on at the object level regardless of what the access tier setting on the account is. Blobs in any of the three storage tiers (Hot, Cool, or Archive) may exist within the same account.

Can I change the default storage tier of my Blob or GPv2 storage account?

Yes, you can change the default storage tier by setting the Access tier attribute on the storage account. Changing the storage tier applies to all objects stored in the account that do not have an explicit tier set. Toggling the storage tier from Hot to Cool incurs write operations (per 10,000) for all blobs without a set tier in GPv2 accounts only and toggling from Cool to Hot incurs both read operations (per 10,000) and data retrieval (per GB) charges for all blobs in Blob storage and GPv2 accounts.

Can I set my default account access tier to Archive?

No. Only Hot and Cool storage tiers may be set as the default account access tier. Archive can only be set at the object level.

In which regions are the Hot, Cool, and Archive storage tiers available in?

The Hot and Cool storage tiers along with blob-level tiering are available in all regions. Archive storage will initially only be available in select regions. For a complete list, see Azure products available by region.

Do the blobs in the Cool storage tier behave differently than the ones in the Hot storage tier?

Blobs in the Hot storage tier have the same latency as blobs in GPv1, GPv2, and Blob storage accounts. Blobs in the Cool storage tier have a similar latency (in milliseconds) as blobs in GPv1, GPv2, and Blob storage accounts. Blobs in the Archive storage tier have several hours of latency in GPv1, GPv2, and Blob storage accounts.

Blobs in the Cool storage tier have a slightly lower availability service level (SLA) than the blobs stored in the Hot storage tier. For more information, see SLA for storage.

Are the operations among the Hot, Cool, and Archive tiers the same?

Yes. All operations between Hot and Cool are 100% consistent. All valid Archive operations including delete, list, get blob properties/metadata, and set blob tier are 100% consistent with Hot and Cool. A blob cannot be read or modified while in the Archive tier.

When rehydrating a blob from Archive tier to the Hot or Cool tier, how will I know when rehydration is complete?

During rehydration, you may use the get blob properties operation to poll the Archive Status attribute to confirm when the tier change is complete. The status reads "rehydrate-pending-to-hot" or "rehydrate-pending-to-cool" depending on the destination tier. Upon completion, the Archive status property is removed, and the Access Tier blob property reflects the new Hot or Cool tier.

After setting the tier of a blob, when will I start getting billed at the appropriate rate?

Each blob is always billed according to the tier indicated by the blob's Access Tier property. When you set a new tier for a blob, the Access Tier property immediately reflects the new tier for all transitions. However, rehydrating a blob from the Archive tier to a Hot or Cool tier can take several hours. In this case, you are billed at Archive rates until rehydration is complete, at which point the Access Tier property reflects the new tier. At that point you are billed for that blob at the Hot or Cool rate.

How do I determine if I will incur an early deletion charge when deleting or moving a blob out of the Cool or Archive tier?

Any blob that is deleted or moved out of the Cool (GPv2 accounts only) or Archive tier before 30 days and 180 days respectively will incur a prorated early deletion charge. You can determine how long a blob has been in the Cool or Archive tier by checking the Access Tier Change Time blob property, which provides a stamp of the last tier change. See Cool and Archive early deletion section for more details.

Which Azure tools and SDKs support blob-level tiering and Archive storage?

Azure portal, PowerShell, and CLI tools and .NET, Java, Python, and Node.js client libraries all support blob-level tiering and Archive storage.

How much data can I store in the Hot, Cool, and Archive tiers?

Data storage along with other limits are set at the account level and not per storage tier. Therefore, you can choose to use all of your limit in one tier or across all three tiers. For more information, see Azure Storage Scalability and Performance Targets.

Next steps

Evaluate Hot, Cool, and Archive in GPv2 Blob storage accounts

Check availability of Hot, Cool, and Archive by region

Manage the Azure Blob storage lifecycle

Evaluate usage of your current storage accounts by enabling Azure Storage metrics

Check Hot, Cool, and Archive pricing in Blob storage and GPv2 accounts by region

Check data transfers pricing