Teradata Hash Index: Minimizing Disk IOs and Optimizing Row Access

04/11/202604/23/2023 by Roland Wenzlofsky

The Teradata Hash Index aims to reduce disk IOs by providing an alternative access path to rows, as with all Teradata index structures.

This can be achieved through three primary mechanisms:

The storage of a vertical subset (columns) of the table in the hash index structure
Selecting a better Primary Index to avoid costly data redistribution during join preparation
The Ordering of rows by value to support range scans

The Teradata Hash Index can cover either all or a portion of the selected columns.

If the Teradata Hash Index does not cover the selected columns, the corresponding base table rows can be retrieved in a subsequent step. The ROWIDs, which point to the base table rows, are carried by each index row. The ROWIDs extracted from the Hash Index are placed into a spool and then utilized to access the base table rows. Base table access can be skipped in the first case.

These characteristics are similar to those of Teradata Secondary Indexes. However, Teradata Hash Indexes allow the creation of a custom Primary Index, enabling row distribution among AMPs to be tailored to workload requirements. Secondary Indexes do not offer this flexibility.

Teradata Hash Index Design

A Teradata hash index can store rows on the same AMP as the base table row through its Primary Index, which is identical to the Primary Index of the hash index. This is akin to a Non-Unique Secondary Index (NUSI).

The system automatically maintains the Teradata Hash Indexes, as with any index. However, updating, inserting, or deleting data incurs overhead, as the base table and index structures require maintenance.

Accurate statistics are essential for the Primary Index columns of Hash Indexing to enable the Teradata Optimizer to estimate Hash Index usage costs accurately.

When deciding between a Single Table Join Index and a Hash Index, note that although the Hash Index can be seen as a variant of the STJI, there are distinct differences.

The most significant constraints when compared to a Single Table Join Index include:

A Hash Index cannot have a Partitioned Primary Index
A Hash Index cannot have a Non-Unique Secondary Index.
Hash Indexes cannot be specified for NOPI or column‑partitioned base tables as they are designed around the Teradata hashing algorithm.
A hash index cannot be column partitioned
A hash index must have a Primary Index; a Single Table Join Index can be created with or without a primary index if a table is column-partitioned (as column stores on Teradata never have a Primary Index)

This is an example of creating a Hash Ordered Index:

CREATE HASH INDEX HASH_IDX (COL1, COL2) ON MYTABLE
BY (COL1) ORDER BY HASH
CREATE HASH INDEX HASH_IDX (COL1, COL2) ON MYTABLE
BY (COL1) ORDER BY VALUE (COL2);

The “by” clause defines the columns utilized for data distribution, similar to the Primary Index. However, it is important to note that the columns used for data distribution must be included in the Hash Index column list. To improve fault tolerance, we can utilize FALLBACK protection in the same manner as it is used for base tables.

The Hash Index row’s second copy will be saved on a backup AMP. Without fallback protection, Hash Index use and updates on the base table are impossible when the primary AMP is unavailable.

However, it is important to note that implementing fallback protection requires twice the storage capacity for the Hash Index structure.

In conclusion, Hash Indexes can be used as Single Table Join Indexes, but their performance is not significantly different. Creating a Hash Index is also easier due to its simpler syntax than a Single Table Join Index.

Please refer to the official Teradata documentation by following this link:

The Teradata Hash Index

More on indexing here: Teradata Indexing

Related Services

⚡ Need Help Optimizing Your Data Platform?

We cut data platform costs by 30–60% without hardware changes. 25+ years of hands-on tuning experience.

Explore Our Services →

📋 Considering a Move From Teradata?

Get a personalized migration roadmap in 2 minutes. We have migrated billions of rows from Teradata to Snowflake, Databricks, and more.

Free Migration Assessment →

▶ Follow me on LinkedIn for daily insights on data warehousing and platform migrations.

Stay Ahead in Data Warehousing

Get expert insights on Teradata, Snowflake, BigQuery, Databricks, Microsoft Fabric, and modern data architecture — delivered to your inbox.

Leave a Comment Cancel reply

DWHPro

Expert network for enterprise data platforms. Senior consultants, project teams built for your challenge — across Teradata, Snowflake, Databricks, and more.

📍Vienna, Austria & Jacksonville, Florida

Quick Links

Services Team Teradata Book Blog Contact Us

Connect

LinkedIn → [email protected]

Newsletter

Join 4,000+ data professionals.
Weekly insights on Teradata, Snowflake & data architecture.