ReadWriteEnterprise

Vertica, the Analytics Behind All the Zynga Games

So you want to buy a tractor? Build a house? Scrabble? So do the more than 62 million other gamers who play Farmville daily. That's why keeping its communities humming 24 hours a day, seven days a week, and responding to issues in real-time is no game for Zynga.

To get the job done, the company turned to Vertica, which is now owned by HP. Says Ken Rudin, VP of Analytcs at Zynga: "With over 40 million players, 3TB of new data a day and 230 nodes spread across two clusters, Zynga's columnar data warehouse from Vertica is no analytical windup toy."

Jacquelyn Gavron is a Brooklyn, NY-based freelancer who specializes in usability and content for mobile and online apps. Previously, she was Content Manager for Citi Mobile and Executive Editor at PC Magazine. She can be reached at jgavron.scribe4hire@gmail.com.
That's for sure. The moment data hits Vertica, Rudin's team can observe what's going on in a game, then optimize the experience as fast as you can say Farmville. If performance dips during spikes in usage, analysts can add nodes on-the-fly-- a feature known as "elastic computing"--and remove them just as quickly when activity levels out.

Squeezing BIG Data

As Rudin mentioned, unlike traditional RDBMS systems, Vertica stores data in columns instead of rows. (So do other data warehouses including AsterData and ParAccel).

Data stored in columns can be more tightly compressed - 10:1 in the case of Vertica. "We can reduce 10 terabytes of data to 1 terabyte," says Mingsheng Hong, Technical Marketing Specialist at Vertica. This can be critical as the size of data sets shifts from terabyte to petabytes.

In addition, Vertica can operate on this compressed data, which improves performance by reducing CPU, memory and disk I/O at processing time, according to the company's web site. Traditional RDBMS systems, in turn, can't work with compressed data--it's got to be extracted first.

Who Would Guess?

Hong also claims that Vertica's performance is 50x to 100x faster than traditional RDBMS systems. Michael Relich, CIO of Guess Inc., agrees.

"We have 520 stores and buyers go there to check inventory using their iPads," says Relich. "Batch loads took six to seven hours with Oracle, and data wasn't ready for buyers in the morning."

"We looked at a competitive hardware solution but were attracted by commodity (x86) hardware approach," adds Relich. "Queries that took hours were reduced to minutes. Those that took minutes were reduce to seconds." As a result, "Guess will be rolling out Vertica to its South Korean operations, says Relich.

Note: Vertica offers an SDK to write custom methods for analyzing data directly in its database.

Other stories of interest:

Teradata Acquires AsterData

HP Acquires Vertica

Business Analytics Predictions from Gartner and Forrester


ReadWriteWeb encourages comments, but please remember: Keep it nice, keep it clean, and avoid promotional comments. We do pre-moderate some comments with links. For more information, please read our full comment policy.
blog comments powered by Disqus
Recommended Story
RWW SPONSORS



RWW PARTNERS