The Medallion Architecture Is Not New. We Just Called It Something Else.

Why data warehouse professionals have been doing “bronze, silver, gold” for over 20 years.

medal

If you have been working in data warehousing for any length of time, the first time you heard about the “medallion architecture” you probably had one reaction: We already do this.

You were right.

The medallion architecture, popularized by Databricks as a core design pattern for their lakehouse platform, defines three layers: bronze for raw data, silver for cleansed and conformed data, and gold for business-ready aggregates. If that sounds familiar, it should. The data warehousing community has followed this exact pattern since the late 1990s.

What Databricks did was give an old idea a catchy name. That is nothing, but it is not the revolution it was marketed as either.

The Pattern We Have Always Used

Every well-designed data warehouse follows a layered approach to data processing. The names vary, the concept does not.

In a traditional data warehouse, data first lands in a staging area. This is the point of entry. Data arrives from source systems with minimal transformation. The goal is speed: get the data in, validate the basics, and make it available for the next step.

From staging, data moves into an integration layer (sometimes called the cleansed layer, the ODS, or the core warehouse). Here, data quality rules are enforced. Duplicates are removed. Data types are standardized. Business keys are resolved. Source system idiosyncrasies are smoothed out.

Finally, data is transformed into a presentation layer: data marts, dimensional models, and aggregated reporting tables. These structures are optimized for consumption by analysts, dashboards, and business users.

Now, rename those three layers. Call them bronze, silver, and gold. You have the medallion architecture.

So What Did Databricks Actually Change?

The logical concept is identical. The differences lie in the technical implementation and some philosophical shifts about how data should be treated at each layer. These differences are real, even if they do not justify calling the whole thing a new architecture.

Persistent raw data. In the traditional approach, staging was often ephemeral. You loaded raw data, processed it, and then truncated or dropped the staging tables before the next load cycle. Storage was expensive, especially on systems like Teradata, Oracle, or DB2, where every gigabyte on disk carried a significant license cost. The medallion philosophy assumes cloud storage is cheap. Bronze tables retain raw data permanently, including malformed records, late-arriving data, and schema drift. The argument is that you might need to reprocess from scratch if business rules change, and having the original raw data available makes that possible without going back to the source systems.

Every layer is a first-class table. In traditional architectures, the staging area was often implemented as flat files, temporary tables, or volatile tables. These had limited transactional guarantees, no versioning, and no easy way to query historical loads. In the medallion architecture, every layer is a full Delta Lake table with ACID transactions, schema enforcement, and time travel. That means you can query any layer at any point in time, audit changes, and roll back if something goes wrong. This is a genuine improvement in operability, even if the logical pattern is unchanged. It is worth noting that this capability is not exclusive to Delta Lake. Snowflake achieves similar operability through its micro-partition architecture, Time Travel, and zero-copy cloning, without Delta Lake at all. The point is that modern cloud platforms, regardless of the specific technology, have made every layer auditable and queryable. That was harder to achieve on traditional systems.

Streaming and incremental processing. Traditional ETL was predominantly batch-oriented. You scheduled nightly or hourly load windows, processed everything in sequence, and the pipeline was idle between runs. The medallion architecture is designed around incremental and streaming-capable processing. Databricks features such as Auto Loader (for bronze ingestion) and Delta Live Tables enable each layer to update continuously and independently. To be clear: incremental and continuous loading is not a new concept. Teradata supported incremental loading via TPump long before Databricks existed. Snowflake offers Snowpipe for serverless continuous ingestion and scheduled Tasks for orchestration. What the medallion architecture changed was not the existence of these capabilities but their accessibility. The tight integration of Auto Loader and Delta Live Tables, combined with the layered pattern, made continuous processing easier for teams that had previously relied on batch-only workflows.

Multi-consumer access. Traditional data warehouses assumed a single, tightly coupled system. You staged, transformed, and served data from the same engine. The medallion architecture assumes decoupled consumers. Data scientists might query Bronze directly to explore raw data. Analysts work with silver. Dashboards consume gold. Different compute engines can access any layer because the data is stored in open formats (Parquet/Delta) on shared cloud storage. This is less a feature of the medallion pattern itself and more a consequence of the lakehouse architecture it sits on.

The Marketing vs. The Reality

Let me be direct: the medallion architecture gained so much traction not because of technical innovation.

The big data movement of the 2010s produced a generation of data engineers who came from software engineering, not from data warehousing. Many of them had never built a traditional ETL pipeline with staging, integration, and presentation layers. When organizations moved to data lakes, they often abandoned layered architectures entirely. Everything was dumped into a single S3 bucket or HDFS directory with no structure, no quality enforcement, and no clear lineage. The industry coined a term for this: the data swamp.

The medallion architecture provided these teams with a framework to restore discipline to their data pipelines. It told them: separate your raw data from your cleaned data from your business data. Use distinct layers. Enforce quality progressively.

That was valuable advice. It was also advice that the data warehousing community had been giving for two decades.

What Databricks did well was packaging. “Bronze, silver, gold” is simple, visual, and memorable. It is easier to explain to a room of engineers than “staging, integration, and presentation layers following Kimball’s dimensional modeling methodology.” Good naming matters. But let us not confuse good marketing with invention.

What Teradata Professionals Should Take Away

If you are coming from a Teradata background, you already understand layered data processing intuitively. When you encounter the medallion architecture in Databricks, Snowflake, or any other cloud platform, do not let the new vocabulary intimidate you.

The principles you have applied for years still hold: land data quickly, clean it systematically, and serve it in structures optimized for the consumer.

What has changed is the economics. On Teradata, storage was expensive, so you minimized what you kept in staging. In the cloud, storage is cheap, so keeping raw data permanently becomes practical. On Teradata, compute was a fixed resource, so you carefully optimized batch windows. In the cloud, compute scales on demand, so incremental and streaming approaches become feasible.

The medallion architecture is not a new idea. It is your old idea, running on new infrastructure.

Roland Wenzlofsky is a data warehouse consultant with over 20 years of experience helping major organizations manage and optimize their analytical platforms. He is the author of “Teradata SQL Tuning” and writes regularly about data warehousing, Teradata, Databricks, and Snowflake.

Related Services

🏗️ Planning a Data Platform Migration?

Architecture-first approach: we design before a single line of code is written. Zero data loss across every migration delivered.

Our Migration Services →

📊 Data Platform Migration Survey

Help us map where the industry is heading. Results are public — see what others chose.

1. What is your current data platform?

2. Where are you migrating to (or evaluating)?

Migrating FROM
Migrating TO

Thanks for voting! Share this with your network.

Follow me on LinkedIn for daily insights on data warehousing and platform migrations.

Stay Ahead in Data Warehousing

Get expert insights on Teradata, Snowflake, BigQuery, Databricks, Microsoft Fabric, and modern data architecture — delivered to your inbox.

Leave a Comment

DWHPro

Expert network for enterprise data platforms. Senior consultants, project teams built for your challenge — across Teradata, Snowflake, Databricks, and more.

📍Vienna, Austria & Jacksonville, Florida

Quick Links
Services Team Teradata Book Blog Contact Us
Connect
LinkedIn → [email protected]
Newsletter

Join 4,000+ data professionals.
Weekly insights on Teradata, Snowflake & data architecture.