The 15-Year Detour: How the Data Industry Spent Billions Reinventing SQL

Somewhere around 2020, the data world quietly arrived at a conclusion that Teradata engineers could have told you in 1984: SQL on a massively parallel architecture is a pretty good way to process large volumes of data. The path to get there was anything but quiet. It involved billions in capital, an entire generation of …

Introduction to Apache Spark: A Powerful Solution for Big Data Processing and Analytics

Introduction: Processing and analyzing large volumes of data quickly and efficiently is essential in today’s data-driven world. Apache Spark, an open-source big data processing engine, is a leading solution for handling massive datasets, offering a fast and flexible alternative to traditional data processing frameworks like Hadoop’s MapReduce. This article introduces Apache Spark, explores its …

Teradata Rollbacks: Understanding the Impact on Performance and How to Avoid Them

How to Abort the Teradata Rollback: Executing a DML statement on a sizable table may trigger a prolonged ROLLBACK. In such cases, you must choose between waiting for the ROLLBACK to complete and terminating it. Cancelling a rollback avoids wasting additional resources, particularly when the system cannot run in parallel due to high skew, which …

DWHPro

Expert network for enterprise data platforms. Senior consultants, project teams built for your challenge — across Teradata, Snowflake, Databricks, and more.

📍Vienna, Austria & Miami, Florida
