Deprecated: Function split() is deprecated in /home/tbriggs/public_html/s9y/plugins/serendipity_event_metadesc/serendipity_event_metadesc.php on line 101

Question: Netezza vs. Vertica

I received the following question last week from Mike T:

curious as to your evaulation between vertica and netezza. i have used netezza for several projects at a couple of clients.


My response, unfortunately, is that I can't quite say yet.
I've been using Netezza for about two years now, and I'm consistently impressed with the people and the performance, if not always the robustness of the product. Now, my software isn't perfect either, so I'm not really throwing stones here (or at least I'm trying not to), but I think it's fair to say that there are so many 'moving parts' in Netezza's system - hardware, software, networking, SQL, APIs, etc. - that it's unrealistic to expect all the pieces to play nicely together all the time. I am thoroughly impressed by what they've done, but let's not claim that it's perfect.

At least not yet. I've always been more impressed with the people than the product - which is saying something, given how much I like the product - so I'm confident that in time the kinks will indeed get worked out.

As for Vertica, my experience has been fairly short-lived (a couple months) and has been disappointingly hands-off (by my choice, not theirs): I've read lots of documentation (you should start here) and provided data sets/query workloads for benchmarks, but I have not yet installed, configured or directly used a Vertica system. The architecture itself is so blindingly obvious that one can't help but believe in it, and the performance results I've seen have backed that up. The cast of characters over at Vertica certainly makes for some good PR as well, so I'm confident that we'll be hearing a lot about them and a lot from them in the future.

So, if you don't mind some unrequested prognostication, here's how I see this playing out: Vertica will be successful because Netezza paved the way for non-traditional database systems. But their success will be somewhat niche - a very large niche, but niche - partially because the system is spectacular for some things but absolutely hideous for others. More importantly, however, Michael Stonebreaker wants all databases to be specialized, and thus having a database that was good for everybody would defeat his larger purpose. So a very large niche it will be.

Netezza, then, will become the general purpose large-scale data appliance of choice. It's overall performance profile, lower administration costs and generally stellar performance characteristics will land it in a somewhat-orthogonal niche to Vertica. A broader niche, but still a niche.

Oh, and lack of the word 'warehouse' in my description of Netezza is not an accident. Part of Netezza's success will come from moving beyond the data warehousing space. Stay tuned. :-)
Trackbacks

Trackback specific URI for this entry

This post over at DBMS2 got me thinking - is Netezza's materialized view functionality an unintended - or maybe just un-promoted - column store? NZ's materialized views work by extracting a column or columns from a table and storing them separately, in
Professor David DeWitt posted the following comments in reply to my Netezza/Vertica predictions. I've reproduced them here for visibility, and so I can respond to them more properly. It's important not to misinterpret Mike Stonebraker's comments about
Well, the word is finally out: Netezza is making user-defined functions available. I've been waiting to blog about this for a while, and I'm rather ticked off that Monash and others beat me to it. Oh well, guess that's what I get for not reading all NZ
Comments
It's important not to misinterpret Mike Stonebraker's comments about database systems and "one size no longer fitting all applications." Commercial data processing needs inspired early database design, and current relational products from mainstream vendors, such as Oracle and Microsoft, reflect this heritage. Furthermore, the development of commercial relational database systems in the late 1980s and early 1990s was driven almost entirely by the debit-credit benchmark (TPC/A and TPC/B). Consequently, traditional commercial offerings do an excellent job at transaction processing and a relatively poor job of handling ad hoc, decision support queries on multi-terabyte database systems (a space that Teradata has dominated at the high end). While the plummeting cost of hardware has enabled more and more organizations to mine their multi-terabyte databases, many companies find Teradata solutions to be out of their price range. As a result, vendors such as Vertica, Greenplum, and Netezza have found traction in the data warehousing space.

Mike's vision of the future in built upon the belief that: 1) the market will trend towards superior performance through specialization, and 2) that different data sets -- or maybe even the same data set -- will be managed by different types of database systems depending on the target application. For example, a stock tick stream might be managed initially by a stream database system in order to provide real time answers to continuous queries, but then it will be loaded into a data warehouse where it will be managed by a system like Vertica or Netezza for historical analysis.

It is a mistake to label a system that is tuned for a specific function such as data warehousing as serving a "niche" market. Just as Oracle and SQL Server are designed for transaction processing, Vertica, Netezza, Greenplum, and Teradata are built specifically for handling ad hoc queries on multi-terabyte data warehouses. And even within this latter space, the different products are specialized in different ways. Rather than relying on the indices used extensively by database systems targeted for the OLTP marketplace, Netezza focuses on very fast sequential scan performance using custom hardware. Vertica takes this specialization to the next performance level by observing that a database system based on a column-oriented architecture is much better for OLAP applications than one that uses a row-oriented architecture -- an architecture designed originally for OLTP applications.
[David DeWitt, professor of computer sciences at the University of Wisconsin and Advisor to Vertica Systems, Inc.]
Add Comment



Enclosing asterisks marks text as bold (*word*), underscore are made via _word_.
Standard emoticons like :-) and ;-) are converted to images.
E-Mail addresses will not be displayed and will only be used for E-Mail notifications.