Do you work in an environment where a Teradata compression tool is installed that conforms to your satisfaction?
If not, I assume you are probably not very pleased with having to walk through the databases, table by table, to find out what to compress on which column.
Typically, this is time-consuming, fatiguing, and subject to momentary human judgment.
Every reevaluation is costly again. Therefore, Teradata compression activities often stop once the largest decile of tables has been treated, leaving a lot of potential unused. I can offer you a solution to this problem:
What Is The Teradata DWHPRO TITHONIZER?
Inspired by the Greek legend persona of Tithonos, whose fate was to shrink ever more, DWHPRO TITHONIZER determines the best possible and feasible current compression potential for a table given constraints and assumptions.
DWHPRO TITHONIZER determines a set of compression candidate columns and the compression candidate values per column.
It then selects the combination that achieves the highest savings under the given technical constraints and the current table demographics.
It can be used for compressing a table for the first time or an update on given compressions.
It does not apply the compression findings directly. Instead, it stored them in a separate table to be implemented at a later stage.
This allows you to separate the potentially resource-intense analysis from the application work.
Also, DWHPRO TITHONIZER can be applied to one table in isolation or all of a database’s tables. This enables you to perform demo or study runs over representative tables before any mass activity.
The core assumption behind DWHPRO TITHONIZER is that it is possible to determine the one combination of compressions over columns that achieves the highest net benefit under the given circumstances and constraints.
It gathers the following information:
- Is the table populated with enough compression rows to have a savings potential?
- Which columns can be candidates for compression for technical reasons?
- Which columns shall be candidates for compression from a business or policy point of view?
- Which values, given columns, are candidates?
- What is the benefit of saved bytes of compressing one value?
- What is the cost of compressing this one value?
- What is the Presence Byte consumption of a given compression in total?
Compressed value sets for every column are summarized in levels representing one Presence Bit consumed each.
Based on this information, a dominant path for compression is determined by ordering the various levels of columns by the “savings per header cost ratio”.
Highly frequent level 1 values with little cost in bytes for the table header come first in line. Conversely, one will find lengthy and infrequent values with razor-thin or no net benefit.
After potentially adjusting for the net benefit and local Presence Byte break-even points, the dominant path is translated into a series of compression statements stored in the result table TITHONOS. From there, alter table statements can be generated to apply the compression.
For a detailed description of DWHPRO TITHONIZER, please refer to the pdf file attached to this article:
I recommend reading it before you start working with the Stored Procedure.
DWHPRO TITHONIZER was developed and tested for Teradata Versions 13 and 14, 15, and 16.
Improvements, refinements, or modifications of functionality or methodology will occur as experience with it is gained and time to do it is available.
I also hope for valuable input from you as our readers. If you need additional features or find any bugs, send us an email:
Feel free to use or modify DWHPRO TITHONIZER for training or working purposes. But please note in the program code that the original version is from DWHPRO.
Before you start finding out what the Stored Procedure can do for you, let me remind you that the responsibility for using it and any consequence thereof lies wholly in the hand of whoever uses it.
I am not responsible for any damage, misuse, or other negative consequences from DWHPRO TITHONIZER applications.
If you have gathered experience, I would be pleased to hear of any results you achieved with it: