Understanding Teradata Hash Collisions – A Case Study

tune3

To comprehend the issue of Teradata hash collisions, I will briefly explain how rows are allocated. If you are unfamiliar with Teradata Architecture or require a refresher, I suggest reading the following article beforehand: As you know, a hashing algorithm distributes a table’s rows to the AMPs. The Foundations The hashing algorithm accepts one or …

Read more

The Importance of Minimizing Teradata I/O: Understanding Logical vs. Physical IOs and Their Impact on Performance

tune4

Why is Teradata I/O so crucial? Minimizing Teradata I/O is a crucial aspect of performance tuning. IOs involve transferring data from storage to main memory, which is essential for Teradata to process data. Transferring data to the main memory is significantly slower than accessing data in the main memory or CPU cache. Minimizing IOs can …

Read more

Understanding Teradata’s Primary AMP Index for Improved Performance

tune4

Experienced Teradata users are familiar with the concept of a primary index. However, a new term has surfaced with the introduction of Teradata Release 15: Primary AMP Index. This blog post will demonstrate a Primary AMP’s benefits and optimal usage for improving load and query performance. Initially, let us examine Teradata’s approaches throughout the years …

Read more

Shrinking Teradata Tables: Reduce Table Size by 90% with This Simple Trick

tune4

Despite implementing Multivalue Compression to minimize the size of our tables, we still require additional space. Shrinking A Teradata Table To A Minimum Size I once used a trick that reduced an already optimized table with multivalue compression to just 10% of its original size. This unique situation is worth monitoring for potential opportunities. I …

Read more

Optimize Teradata UNION ALL with a Single Table Scan Trick

tune1

The Idea Behind This Trick for Teradata UNION ALL What if you need to apply a UNION ALL operation to distinct columns within a single table? Typically, the process would involve: This method has a drawback: To query the Customer table with just one full table scan and no need for UNION ALL, I sought …

Read more

Teradata Access Paths: 8 Essential Ones to Know

tune1

The optimizer has various access paths to consider when creating the execution plan, each with unique advantages and disadvantages. This article will introduce the 8 most essential access paths. The lack of a hash index may surprise you; however, Teradata has entirely substituted it with the single-table join index and discourages its utilization in the …

Read more

Optimizing Teradata Joins: Handling Skewed Workloads with Partial Redistribution and Partial Duplication

tune4

How does Teradata handle Skew in Joins? The imbalanced workload is a primary issue that must be addressed in performance optimization. Efficient optimization is crucial for join steps due to the possibility of Teradata transferring significant data volumes between AMPs. The optimizer can perform joins separately for primary index values that are skewed and those …

Read more

Choosing the Right Teradata Data Types

tune2

How Do I Select The Appropriate Data Type In Teradata? Converting datatypes incurs substantial costs and demands significant CPU resources when dealing with extensive tables. Incorrect data type selection hinders the execution plan. This article will discuss selecting appropriate data types for optimal performance. Consistency in selecting data types across different tables is crucial, as …

Read more

Teradata Sample Statistics: When, How, and Why to Use Them

tune1

Introduction to Teradata Sample Statistics Discover the optimal utilization of Teradata Sample Statistics, including when, how, and why to implement them. Sample statistics require columns with a high degree of diversity in values. A UPI satisfies this criterion, and only columns with numerous unique values should be considered for collecting sample statistics for the NUPI. …

Read more

Maximizing Performance with Multiple Teradata Sessions

tune2

Teradata Sessions explained in a few Sentences Example 1: Multiple Sessions which do not improve performance The graph shows that all four transactions (T1, T2, T3, T4) are concurrently active on all AMPs. Even for a single-row lookup, these transactions necessitate accessing all AMPs. None of the transactions use a single rowhash for access, resulting …

Read more

DWHPro

Expert network for enterprise data platforms. Senior consultants, project teams built for your challenge — across Teradata, Snowflake, Databricks, and more.

📍Vienna, Austria & Miami, Florida

Quick Links
Services Team Teradata Book Blog Contact Us
Connect
LinkedIn → [email protected]
Newsletter

Join 4,000+ data professionals.
Weekly insights on Teradata, Snowflake & data architecture.