IBM BLU and SAP HANA

As I noted here, I think that the IBM BLU Accelerator is a very nice piece of work. Readers of this blog are in the software business where any feature developed by any vendor can be developed in a relatively short period of time by any other vendor… and BLU certainly moves DB2 forward in…

HANA Memory Utilization

The current release of HANA requires that all of the data required to satisfy a query be in-memory to run the query. Let’s think about what this means: HANA compresses tables into bitmap vectors… and then compresses the vectors on write to reduce disk I/O. Disk I/O with HANA? Yup. Once this formatting is complete…

Who is How Columnar? Exadata, Teradata, and HANA – Part 2: Column Processing

In my last post here I suggested that there were three levels of maturity around column orientation and described the first level, PAX, which provides columnar compression. This apparently is the level Exadata operates at with its Hybrid Columnar Compression. In this post we will consider the next two levels of maturity: early materialized column…

Who is How Columnar? Exadata, Teradata, and HANA – Part 1: Column Compression

There are three forms of columnar-orientation currently deployed by database systems today. Each builds upon the next. The simplest form uses column-orientation to provide better data compression. The next level of maturity stores columnar data in separate structures to support columnar projection. The most mature implementations support a columnar database engine that performs relational algebra…

Memory Trends and HANA

If the Gartner estimates here are correct… then DRAM prices will fall 50% per year per year over the next several years… and then in 2015 non-volatile RAM (see the related articles below) will become generally available. It has been suggested that memory prices will fall slower than data warehouses will grow (see here). That does not seem to…

Some Unaudited HANA Performance Numbers

The following performance numbers are being reported publicly for HANA: HANA scans data at 3MB/msec/core On a high-end 80-core server this translates to 240GB/sec per node HANA inserts rows at 1.5M records/sec/core Or 120M records/sec per node… Aggregates 12M records/sec/core Or 960M records per node… These numbers seem reasonable: A 100X improvement over disk-based scan…

My .2 Cents on HANA

Out of respect for my employer let me repeat what I have said before regarding HANA… HANA provides far and away the best price/performance of any data warehouse… no question. HANA will win the performance portion of every POC just based on the facts detailed in this graphic: Other products database code lives thousands to tens…