Google Analytics may be free, but it is still based on proprietary technology - which means you only ever get reports on the things that Google thinks are necessary and some of those reports are aimed at people using Google's other services (managing campaigns on AdWords, for example). Further, using Google Analytics means that you're tied to Google's TOS. Enter Piwik, which aims to be an open source alternative to Google Analytics. It is closely affiliated with OpenX, the open source ad server alternative to Google Ad Manager [Ed: which we just started using on RWW].
While OpenX has been around a while and has good traction, Piwik is fairly new and under the radar. It surfaced first as PHPMyVisites and is only at version 0.1.5, which is a sign that it is still for the highly committed.
Piwik originates from France and almost all the attention so far has been from outside USA. It seems to be catching on in Japan and China, as well as in Europe. The team responsible for Piwik seems to be made up of interns from big companies such as Amazon, Intel and even Google.
You need PHP and MySQL to use Piwik, which is no problem for techies but any widespread adoption will need a hosted version. Some entrepreneur is likely to offer that. Plug-ins to WordPress and other blogging platforms will also be needed, but cannot be that hard.
Open source products usually get traction, even when they are raw, when the alternative is too expensive or restrictive. Google Analytics is free, so it is hard to see rapid adoption by bloggers and small media companies. It is possible that Piwik will get better traction in companies that already use one of the expensive web analytics services from companies such as Coremetrics, Omniture and Visual Sciences/Websidestory.

Larger companies may have some pause for concern before switching to Google Analytics, because of the service's Terms of Service. An analyst in the IT department is likely to point out two clauses (yes big companies DO read this stuff):
"2. FEES AND SERVICES . Subject to Section 15 herein, the Service is provided without charge to You for up to 5 million pageviews per month per account, and if You have an active Adwords campaign in good standing, the Service is provided without charge to You without a pageview limitation.
Google may change its fees and payment policies for the Service from time to time including but not limited to the addition of costs for geographic data, the importing of cost data from search engines, or other fees charged to Google or its wholly-owned subsidiaries by 3rd party vendors for the inclusion of data in the Service reports."
So, don't go above 5 million page views without paying something to Google via AdWords. That may make one a bit uneasy, but it is the "Google may change its fees and payment policies" that will have both IT and Legal aiming to nix the deal.
Then a bit later on you get:
"6. INFORMATION RIGHTS AND PUBLICITY . Google and its wholly owned subsidiaries may retain and use, subject to the terms of its Privacy Policy (located at http://www.google.com/privacy.html , or such other URL as Google may provide from time to time), information collected in Your use of the Service. Google will not share information associated with You or your Site with any third parties unless Google (i) has Your consent; (ii) concludes that it is required by law or has a good faith belief that access, preservation or disclosure of such information is reasonably necessary to protect the rights, property or safety of Google, its users or the public; or (iii) provides such information in certain limited circumstances to third parties to carry out tasks on Google's behalf (e.g., billing or data storage) with strict restrictions that prevent the data from being used or shared except as directed by Google . When this is done, it is subject to agreements that oblige those parties to process such information only on Google's instructions and in compliance with this Agreement and appropriate confidentiality and security measures."
In short, they have the right to use your data. We have lived for a long time in a world where Microsoft and others charged for the software and kept it under a tight IP control, but did not try to take any use of the data. Google reversed that. Google is happy to let you use anything for free, as long as they get to use your data. That is OK for most individuals and small businesses, but totally unacceptable for large companies.
Piwik is working hard to be developer-friendly, not just relying on open source. They claim 4 big advantages:
But the nub of the issue for most big companies will be the data ownership issue. That is why Piwik is really a Google Analytics alternative, even if they will see few firms switching from GA initially. IT groups, under pressure to cut costs, will look at Google Analytics as an alternative to their current analytics software and reject the option based on TOS restrictions. Then some bright spark will Google the term "open source alternative google analytics" and see Piwik on top of the organic listings.
Listed below are links to blogs that reference this entry: Piwik: Open Source Google Analytics Alternative.
TrackBack URL for this entry: http://www.readwriteweb.com/cgi-bin/mt/mt-tb.cgi/3915
Comments
Subscribe to comments for this post OR Subscribe to comments for all Read/WriteWeb posts
I think I'll review the software more detailed in my site. Thanks for the info.
Posted by: Mircea | May 8, 2008 3:51 AMThis looks great! I really want to deploy it, but I can't seem to get it to work on IIS, and there doesn't seem to be much documentation on getting it to work with IIS. Looks like a file permissions issue:
[code]
Piwik couldn't write to some directories.
Try to Execute the following commands on your Linux server:
If this doesn't work, you can try to create the directories with your FTP software, and set the CHMOD to 777 (with your FTP software, right click on the directories, permissions).[/code]
Posted by: Chris | May 8, 2008 8:13 AMMy main problem with Google analytics was the degradation of performance, so I've had it switched off for quite some time.
Hey Chris, please report your problem on the piwik trac. We are aware of the IIS problem and will very soon be fixed :)
Thanks for the article RWW!
Posted by: matt | May 8, 2008 9:42 AMThanks Bernard for such analysis. We really need people like you to help us design the future of what Web Analytics should be.
We hope that we will have plenty of people to help us make Piwik multilingual, which is currently done with Beta testers.
Thierry - Marketing Manager Piwik.org
Posted by: Thierry | May 8, 2008 10:50 AMI'm not one of the big guys. My visits are measured in hundreds, not thousands or (wow) millions.
Nevertheless, I am sorely dissatisfied with the quality of info included in the free Google Analytics package. You'd think that the Giant, being omnipresent on the web, would have accurate numbers. Taking just one metric and comparing to other sites such as MyBlogLog, FeedBurner or Compete shows a huge difference in the stats. Which one is right, if any? If they're not right, then who is closest to being accurate?
I'm going to check this Piwik out. It seems to me that the only way to get accurate metrics is to do it yourself.
Thanks, Bernard.
Posted by: Jon | May 8, 2008 11:24 AMJon, that is an interesting point that I had not considered. I had assume Google Analytics was at least reliable. Does anybody else see this problem?
Posted by: bernard lunn | May 8, 2008 11:36 AMI did what you said Matt, and submitted the issue to trac. Thanks!
Posted by: Chris | May 8, 2008 11:37 AMJon, I use the Firefox extension called Noscript, and I do not allow GA to run. Not sure how that affects the numbers, but according to the Mozilla site Noscript gets 146K downloads per week.
Posted by: tommyboy | May 8, 2008 12:56 PMYou're comparing different things. Google Analytics is a hosted solution, while Piwik is not. Maybe you want to say that Piwik is an alternative to Urchin, although it still has a long way to go.
Posted by: Ionut | May 8, 2008 1:09 PM@ Bernard: Somewhere in the back of my mind I am thinking that the differences I see are due to having such a small audience. Wordout is growing, but in its public form is pretty new. I only opened it up to Googlebots back in Nov 2007. The only other public record of it is from 2004 in the Wayback Machine. So I am thinking that maybe the relative age and readership of the blog on the web has a "weight" assigned within Google's analytics, and that as time passes and the site grows its readership, more information will yield more accurate numbers? If this is true, sites like RWW would never notice, because the differences would be such small percentages that they fit easily within an error factor.
@ tommyboy: i hadn't considered this. i wonder, does NoScript block all the spiders, ie, Compete, Alexa, FeedBurner and the rest? And if so...
@ matt/Thierry: would it also block Piwik?
Posted by: Jon | May 8, 2008 6:39 PMIn addition to piwik, there is a more mature similar product available:
http://www.openwebanalytics.com/
Posted by: Evan Wired | May 8, 2008 6:43 PM@Bernard: Google Analytics underreporting by a significant margin is a common complaint. I've seen it on every forum I frequent... SitePoint, WebHostingTalk, DigitalPoint. And this is in comparison to other hosted services, not just to log analyzers like Webalizer/AWStats.
GA is a really nice package, as was Urchin Live before it, and they've added a lot of really nice ecommerce tracking integration to make it useful for AdWords advertisers. I, however, would never rely on it as a sole source of either statistics or conversion tracking data. It's just not accurate. Install something else and run at least two trackers on any site you want to measure usage of.
I, of course, use my W3Counter for the vanilla web stats, but I also use AdWatcher for conversion tracking.
Posted by: Dan Grossman | May 8, 2008 7:46 PM