How Join Indexes Can Optimize Performance in a Normalized Data Model

tune2

A normalized data model can increase the complexity of creating performant queries due to the higher number of tables that must be linked compared to a denormalized data model. It is essential to select a primary index precisely to optimize queries and joins, enabling them to have a direct access path. However, relationship tables often …

Read more

Loading Data into Teradata using Python and Fastload

tool3

This article assumes prior basic knowledge of Python and demonstrates the easy process of loading data using Python and the Teradata SQL Driver for Python with Fastload. If you are using Windows, we recommend using WinPython if you don’t have Python installed yet. Because a Jupyter Notebook is included, we use it to show how …

Read more

Teradata SQL Tuning: How Query Rewriting Can Reduce Runtime from 40 Minutes to Seconds

sql2

It’s time to share a new Teradata SQL tuning case study that showcases the impressive impact of query rewriting on performance. We are studying the query below that originally took 40 minutes to run. As a SQL tuning specialist, I always prioritize adding missing statistics and refreshing stale ones. I analyzed the SQL statement that …

Read more

High-Performance Calculations with Teradata Ordered Analytical Functions

sql4

Teradata employs two distinct approaches in Ordered Analytic Functions for preparing the data layout necessary for processing. This article explains both approaches and their respective advantages and disadvantages. Teradata Ordered Analytical Functions Teradata Analytic Functions are versatile tools that allow for a wide range of applications. The ability to retrieve previous and subsequent rows is …

Read more

Understanding Teradata Join Estimation: Heuristics and Importance of Statistics Collection

tune4

What is Teradata Join Estimation? This article demonstrates the functioning of Teradata Join Estimation in the absence of statistics. It presents the heuristics employed to estimate row count and emphasizes the importance of collecting statistics on all join columns. Teradata Join Estimation Heuristics The worst-case scenario involves joining two tables without any collected statistics. We …

Read more

VantageCloud Lake: Turbocharge Your Data Warehousing with Teradata’s Innovative Solution

arch1

Introduction Parallel database architectures have undergone significant advancements over the past four decades, transitioning from shared memory architecture to shared disk architecture and, finally, to the more efficient shared-nothing architecture. Databases designed specifically for cloud environments incorporate elements of shared-disk and shared-nothing architectures. Teradata is a powerful and scalable relational database management system designed to …

Read more

Optimizing Teradata Queries: From No Index to Hashed NUSI

tune3

The initial situation without any index In this blog, I will demonstrate how to optimize a query using Teradata’s tools. We will begin with the following test scenario: The data is evenly distributed. To demonstrate the query’s selectivity for the tested indexes we will define later, I assigned a significant portion of rows the same …

Read more

Teradata Rollbacks: Understanding the Impact on Performance and How to Avoid Them

admin3

How to Abort the Teradata Rollback Executing a DML statement on a sizable table may trigger a prolonged ROLLBACK. In such cases, you must choose between waiting for the ROLLBACK to complete or terminating it. Cancelling a rollback avoids wasting additional resources, particularly when the system cannot run in parallel due to high skew, which …

Read more

The Importance of Minimizing Teradata I/O: Understanding Logical vs. Physical IOs and Their Impact on Performance

tune4

Why is Teradata I/O so crucial? Minimizing Teradata I/O is a crucial aspect of performance tuning. IOs involve transferring data from storage to main memory, which is essential for Teradata to process data. Transferring data to the main memory is significantly slower than accessing data in the main memory or CPU cache. Minimizing IOs can …

Read more

Improving Database Performance and Security with Teradata Write-Ahead Logging (WAL)

arch4

Accessing data has always been the bottleneck of database systems. Once the data resides in the main memory, it can be promptly processed. Although SSDs are now predominantly used instead of hard disks, copying data into the main memory remains the slowest operation. The Teradata Write-Ahead Logging (WAL) feature enhances DML performance and boosts data …

Read more

DWHPro

Expert network for enterprise data platforms. Senior consultants, project teams built for your challenge — across Teradata, Snowflake, Databricks, and more.

📍Vienna, Austria & Jacksonville, Florida

Quick Links
Services Team Teradata Book Blog Contact Us
Connect
LinkedIn → [email protected]
Newsletter

Join 4,000+ data professionals.
Weekly insights on Teradata, Snowflake & data architecture.