Tiered Storage for ClickHouse Clusters
ClickHouse allows for more than one block device to be designated for data storage. This not only allows using additional storage types, such as object storage like AWS S3, but also introduces support for storage tiering as found in typical hot-cold storage setups. It enables data being moved over to a relatively cheaper storage once the local disk is sufficiently filled or the data has sufficiently aged. This approach can significantly reduce overall storage costs. In most situations, for users with a large dataset this is likely the preferable storage configuration that they would want to use.
Instaclustr for ClickHouse lets you provision a ClickHouse cluster with storage tiering enabled.
Limitations
Before you begin, please take note of the following limitations with Tiered storage for ClickHouse:
- Can currently only be used with RIYOA provider account setup. We’re working on adding support for it for RIIA clusters.
- Currently only AWS S3 can be used as remote storage. We will be adding in support for other hyperscalers’ object storage systems as we extend support for those hyperscalers.
- Feature cannot be enabled/disabled on an existing cluster.
- Only available for use by MergeTree and Log family table engines.
Please follow guides on Configure a ClickHouse Cluster for Tiered Storage and Creating a ClickHouse Cluster to start creating and using your ClickHouse cluster with Tiered Storage.