My 2 Cents: Netezza 1Q2013

Since my blogs tend to be in response to some stimulus they may not reflect a holistic view on any particular product. The “My 2 Cents” series will try to provide a broader view… Please consider this as you read on… Summary Netezza put a new spin on data warehousing… they made it easy. The Netezza software … Continue reading “My 2 Cents: Netezza 1Q2013”

MPP, IMDB and Moore’s Law

In the post here I listed the units of parallelism (UoP) applied by various products on a single node. Those findings are summarized in the table below. Product Version/HW Cores per Node UoP per Node Notes Teradata EDW 6700H 16 32 Uses hyper-threads. Greenplum DCA UAP Edition 16 8 Recommends 1 Segment for each 2 … Continue reading “MPP, IMDB and Moore’s Law”

Price/Performance of HANA, Exadata, Teradata, and Greenplum

Here is an attempt to build a Price/Performance model for several data warehouse databases. Added on February 21, 2013: This attempt is very rough… very crude… and a little too ambitious. Please do not take it too literally. In the real world Greenplum and Teradata will match or exceed the price/performance of Exadata… and the … Continue reading “Price/Performance of HANA, Exadata, Teradata, and Greenplum”

The Cost of Dollars per Terabyte

Let me be blunt: using price per terabyte as the measure of a data warehouse platform is holding back the entire business intelligence industry. Consider this… The Five Minute Rule (see here and here) clearly describes the economics of HW technology… suggesting exactly when data should be retained in memory versus when it may be moved … Continue reading “The Cost of Dollars per Terabyte”

Teradata, HANA and NUMA

Teradata is circulating a document to customers that claims that the numbers SAP has published in its 100TB PoC white paper (here) demonstrates that HANA suffers from scaling issues associated with the NUMA-effect. The document is so annoyingly inaccurate that I have to respond. NUMA stands for non-uniform-memory-access. This describes an architecture whereby each core … Continue reading “Teradata, HANA and NUMA”

My 2 Cents: Greenplum 1Q2013

Since my blogs tend to be in response to some stimulus they may not reflect a holistic view on any particular product. The “My 2 Cents” series will try to provide a broader view… Please consider this as you read on… Summary From a technical perspective, Greenplum is my favorite data warehouse database. Built on … Continue reading “My 2 Cents: Greenplum 1Q2013”

HANA Support for OLTP and BI In a Single Table

This is a rehash of my post for SAP here… I thought you might find it interesting as it describes the architecture HANA uses to support OLTP and BI against a single table. A couple of points to think about: If you have only one database structure you can optimize for only one query; e.g. … Continue reading “HANA Support for OLTP and BI In a Single Table”

My 2 Cents: Teradata 1Q2013

Since my blogs tend to be in response to some stimulus they may not reflect a holistic view on any particular product. The “My 2 Cents” series will try to provide a broader view… Summary Despite my criticisms of some of their market positions (here, here, here, and here) Teradata provides the single best data … Continue reading “My 2 Cents: Teradata 1Q2013”

A Quick Five Minute Rule Update for In-memory Databases

6/26/2014: I fixed the calculation… I had an error in my spreadsheet. Sorry. The 2012 break-even point is 217 minutes. – Rob Following on to my blog on the Five Minute Rule and in-memory databases here I decided to quickly and informally recalculate the 4KB break-even point based on current technology (rather than use the … Continue reading “A Quick Five Minute Rule Update for In-memory Databases”

The Five Minute Rule and In-memory Databases

I was recently reminded of a couple of papers written by Jim Gray and Gianfranco Putzolu  that calculated the cost of keeping data in memory vs the cost of paging it in from disk. I was happy to see that the thread was being kept alive by Goetz Graefe. These papers used the cost of … Continue reading “The Five Minute Rule and In-memory Databases”