<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
   <channel>
      <title>Data Services - ReadWriteWeb</title>
      <link>http://www.readwriteweb.com/archives/data-services/</link>
      <description>Data Services on ReadWriteWeb</description>
      <language>en</language>
      <copyright>Copyright 2012 Richard MacManus</copyright>
      <managingEditor>readwriteweb@gmail.com</managingEditor>
      <lastBuildDate>Sat, 28 Jan 2012 20:46:29 -0800</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=4.35-en</generator>
      <docs>http://blogs.law.harvard.edu/tech/rss</docs> 

      
      <item>
         <title>Data Privacy: What Bill Gates Said 10 Years Ago</title>
		<description><![CDATA[<p><img alt="DataPrivacyDayLogo.jpg" src="http://www.readwriteweb.com/images/DataPrivacyDayLogo.jpg" width="150" height="152" class="mt-image-none" style="" />Today is <a href="http://www.staysafeonline.org/dpd/about">International Data Privacy Day</a>, an event backed by companies like Intel, Ebay, Facebook and Microsoft, and dedicated to educating data owners about best practices in protecting the privacy of consumer data.</p>

<p>The need to keep people from being exploited on account of violations of their privacy is clear, well-known, intuitive and amply articulated by highly capable people.  The up-side of <em>making use of</em> peoples' data is far less so.  The two concerns are closely tied together.  That's something Bill Gates is likely very aware of, if his comments 10 years ago are any indication.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=31557&amp;cb=31557' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=31557&amp;n=31557' border='0' alt='' /></a></p>]]>

<![CDATA[<p>The forthcoming era of computing is all about data.  In as much as that data is associated with people, it's essential that data owners feel secure in the belief that they can make use of their data in computing without concern it will be misused.  </p>

<p>Bill Gates got this about the last era of computing, the first instances of e-commerce and the web.  He wrote <a href="http://www.wired.com/techbiz/media/news/2002/01/49826">a famous company-wide memo</a> ten years ago this month all about the importance of what a controversial hardware-based security paradigm called Trusted Computing.</p>

<blockquote>"If we don't do this, people simply won't be willing -- or able -- to take advantage of all the other great work we do. Trustworthy Computing is the highest priority for all the work we are doing. We must lead the industry to a whole new level of Trustworthiness in computing."</blockquote>

<p>Regarding Privacy in particular, the Gates memo put some things in ways we can relate to today, but other things seem antiquated.</p>

<blockquote>"Users should be in control of how their data is used. Policies for information use should be clear to the user. Users should be in control of when and if they receive information to make best use of their time. It should be easy for users to specify appropriate use of their information including controlling the use of email they send."</blockquote>

<p>Users should be in control of when and if they receive information to make best use of their time!  Can you imagine that?  Info overload as privacy violation.  It makes sense, yet it seems hopelessly antiquated too.</p>

<p>"In the past, we've made our software and services more compelling for users by adding new features and functionality, and by making our platform richly extensible," he wrote. </p>

<blockquote>"We've done a terrific job at that, but all those great features won't matter unless customers trust our software.

<p>"So now, when we face a choice between adding features and resolving security issues, we need to choose security. Our products should emphasize security right out of the box, and we must constantly refine and improve that security as threats evolve."</blockquote></p>

<p>Here's how the International Data Privacy Day organization puts it today.</p>

<blockquote>"In this networked world, in which we are thoroughly digitized, with our identities, locations, actions, purchases, associations, movements, and histories stored as so many bits and bytes, we have to ask - who is collecting all of this data - what are they doing with it  - with whom are they sharing it?  Most of all, individuals are asking 'How can I protect my information from being misused?'  These are reasonable questions to ask - we should all want to know the answers. 

<p>"Data Privacy Day promotes awareness about the many ways personal information is collected, stored, used, and shared, and education about privacy practices that will enable individuals to protect their personal information.  </blockquote></p>

<p>Robert Siciliano, an Online Security Evangelist at McAfee, <a href="http://blogs.mcafee.com/consumer/data-privacy-day-2012">paints a much more negative picture in a blog post yesterday</a> - probably even about the companies participating in International Data Privacy Day.  McAfee is owned by the primary sponsor of the event, though, Intel.  Siciliano speaks for many people when he says:</p>

<blockquote>"Lately, it seems that barely a day goes by when we don't learn about a major Internet presence taking steps to further erode users' privacy. The companies with access to our data are tracking us in ways that make Big Brother look like a sweet little baby sister.

<p>"Typically when we hear an outcry about privacy violations, these perceived violations involve some apparently omnipotent corporation recording the websites we visit, the applications we download, the social networks we join, the mobile phones we carry, the text messages we send and receive, the places we go, the people we're with, the things we like and dislike, and so on.</p>

<p>"How do they do this? By offering us free stuff to consume online and infrastructure for the online communities that tie us together. We gobble up their technologies, download their programs, use their services, and mindlessly click 'I Agree' to terms and conditions we haven't bothered to read."</blockquote></p>

<p>It's a cynical perspective that refers to all the glory of the Interwebs as simply free stuff to consume with mindless clicks.</p>

<p>I think I prefer the description Gates might have offered.  The global computer is now rich with features and opportunities, but those will be put at risk if people don't trust the network.  Please, Mr. Zuckerberg, don't spoil this opportunity. </p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/data_privacy_what_bill_gates_said_10_years_ago.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/data_privacy_what_bill_gates_said_10_years_ago.php</link>
         <guid>http://www.readwriteweb.com/archives/data_privacy_what_bill_gates_said_10_years_ago.php</guid>
         <category>Data Services</category>
         <pubDate>Sat, 28 Jan 2012 20:46:29 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>Why Facebook&apos;s Data Sharing Matters</title>
		<description><![CDATA[<p><img src="http://rww.readwriteweb.netdna-cdn.com/images/thefacebooklogo.jpg">Facebook has cut a deal with political website <a href="http://politico.com">Politico</a> that allows the independent site machine-access to Facebook users' messages, both public and private, when a Republican Presidential candidate is mentioned by name.  The data is being collected and analyzed for sentiment by <a href="https://www.facebook.com/notes/us-politics-on-facebook/politico-facebook-team-up-to-measure-gop-candidate-buzz/10150461091205882">Facebook's data team</a>, then delivered to Politico to serve as the basis of <a href="http://www.politico.com/news/stories/0112/71345.html">data-driven political analysis and journalism</a>.</p>

<p>The move is being <a href="http://mediagazer.com/120113/p7#a120113p7">widely condemned in the press</a> as a violation of privacy but if Facebook would do this right, it could be a huge win for everyone.  Facebook could be the biggest, most dynamic census of human opinion and interaction in history.  Unfortunately, failure to talk prominently about privacy protections, failure to make this opt-in (or even opt out!) and the inclusion of private messages are all things that put at risk any remaining shreds of trust in Facebook that could have served as the foundation of a new era of social self-awareness.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=31258&amp;cb=31258' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=31258&amp;n=31258' border='0' alt='' /></a></p>]]>

<![CDATA[<p><a href="http://www.politico.com/news/stories/0112/71345.html"><img alt="FBPolitico.jpg" src="http://www.readwriteweb.com/images/FBPolitico.jpg" width="610"  class="mt-image-none" style="" /></a></p>

<p>We, ok I, have long argued here at ReadWriteWeb that aggregate analysis of Facebook data is an idea with world-changing potential.  The analogy from history that I think of is about Real estate Redlining.  Back in the middle of the last century, when US Census data and housing mortgage loan data were both made available for computer analysis and cross referencing for the first time, early data scientists were able to prove a pattern of racial discrimination by banks against people of color who wanted to buy houses in certain neighborhoods.  The data illuminated the problem and made it undeniable, thus leading to legislation to prohibit such discrimination.</p>

<p>I believe that there are probably patterns of interaction and communication of comparable historic importance that could be illuminated by effective analysis of Facebook user data.  Good news and bad news could no doubt be found there, if critical thinking eyes could take a look.</p>

<p>"Assuming you had permission, you could use a semantic tool to investigate what issues the users are discussing, what weight those issues have in relation to everything else they are saying and get some insights into the relationships between those issues," writes systemic innovation researcher <a href="https://plus.google.com/112439267620869130664/about">Haydn Shaughnessy</a> in a comment on Forbes privacy writer Kashmir Hill's <a href="http://www.forbes.com/sites/kashmirhill/2012/01/13/from-now-on-your-political-musings-on-facebook-are-being-mined">coverage</a> of the Politico deal. "As far as I can see people use sentiment analysis because it is low overhead; the quickest, cheapest way to reflect something of the viewpoints, however fallible the technique. Properly mined though you could really understand what those demographics care about." </p>

<p>Several years ago I had the privilege to sit with Mark Zuckerberg and make this argument to him, but it doesn't feel like the company has seized the world-changing opportunity in front of it.</p>

<p>Facebook does regularly analyzes its own data of course.  And sometimes it publishes what it finds.  For example, two years ago the company <a href="http://www.readwriteweb.com/archives/facebook_scientists_dissect_facebook_say_its_alive.php">cross referenced the body of its users' names</a> with US Census data that tied last names and ethnicity.  Facebook's conclusion was that the site used to be disproportionately made up of White people - but now it's as ethnically diverse as the rest of America.  Good news!  </p>

<p>But why do we only hear the good news?  That millions of people are talking about Republican Presidential candidates might be considered bad news, but the new deal remains a very limited instance of Facebook treating its user data like the platform that it could be.</p>

<p>It could be just a sign of what's to come, though.  "This is especially interesting in terms of the business relationships--who's allowed to analyze Facebook data across all users?" asks Nathan Gilliatt, principal at research firm <a href="http://socialtarget.com">Social Target</a> and co-founder of <a href="http://analyticscamp.org">AnalyticsCamp</a>.  "To my knowledge, they haven't let other companies analyze user data beyond publicly shared stuff and what people can access with their own accounts' authorization. This says to me that Facebook understands the value of that data. It will be interesting to see what else they do with it."</p>

<p>I've been told that Facebook used to let tech giant HP informally hack at their data years ago, back when the site was small and the world's tech privacy lawyers were as yet unaroused.  That kind of arrangement would have been unheard of for the past several years, though.  Two years ago, social graph hacker Pete Warden pulled down Facebook data from hundreds of millions of users, analyzing it for interesting connections before <a href="http://www.readwriteweb.com/archives/facebook_user_data_analysis.php">planning on releasing it to the academic research community</a>.  Facebook's response was assertive and <a href="http://petewarden.typepad.com/searchbrowser/2010/04/how-i-got-sued-by-facebook.html">came from the legal department</a>.  Warden decided not to give the data to researchers after all.  (Disclosure: I am writing this post from Warden's couch.)</p>

<p>"Like a lot of Facebook's studies, this collaboration with Politico is fascinating research, it's just a real shame they can't make the data publicly available, largely due to privacy concerns" bemoans Warden. "Without reproducability, it loses a lot of its scientific impact. With a traditional opinion poll, anyone with enough money can call up a similar number of people and test a survey's conclusions.  That's not the case with Facebook data."</p>

<p>"Everyone is going 'gaga' over the potential for Facebook," says Kaliya Hamlin, Executive Director of a trade and advocacy group called the <a href="http://personaldataecosystem.org/">Personal Data Ecosystem Consortium</a>.  <br />
<blockquote>"The potential exists only because they have this massive lead (monopoly) so it seems like they should be the ones to do this.</p>

<p>"Yes we should be doing deeper sentiment analysis of peoples' real opinions. But in a way that they are choosing to participate - so that the entities that aggregate such information are trusted and accountable.</p>

<p>"If I had my own personal data store/service and I chose to share say my music listening habits with a ratings service like Neilson - voluntarily join a panel. I have full trust and confidence that they are not going to turn on me and do something else with my data - it will just go in a pool.</p>

<p>"Next thing you know Facebook is going to be selling to the candidate the ability to access people who make positive or negative comments in private messages. Where does it end? How are they accountable and how do we have choice?"</blockquote></p>

<p>Not everyone is as concerned about this from a privacy perspective.  "There are many things in the online world that give me willies for Fourth-Amendment-like reasons," says Curt Monash of data analyst firm <a href="http://monash.com/">Monash Research</a>. "This isn't one of them, because the data collectors and users aren't proposing to even come close to singling out individual people for surveillance."</p>

<p>Monash's primary concern is in the quality of the data. "There's a limit as to how useful this can be," he says. "Online polls and similar popularity contests are rife with what amounts to ballot box stuffing. This will be just another example.  It is regrettable that you can now stuff an online ballot box by spamming your friends in private conversation."</p>

<p>It doesn't just have to be about messages, though.  Social connections, Likes and more all offer a lot of potential for analysis, if it's done appropriately.</p>

<p>"We need trust and accountability frameworks that work for people to allow analysis AND not allow creepiness," says Hamlin.</p>

<p>Two years ago social news site Reddit began giving its users an option to "<a href="http://www.readwriteweb.com/archives/thousands_of_reddit_users_donate_their_data_to_sci.php">donate your data to science</a>" by opting in to have activity data made available for download.  Massive programming Question and Answer site <a href="http://stackoverflow.com">StackOverflow</a> has long made available periodic dumps of its users' data for analysis.  "You never know what's going to come out of it," StackOverflow co-founder Joel Spolsky says about analysis of aggregate user data.</p>

<p>The unknown potential is indicitive not just of how valuable Facebook data is, but potentially of the relationship between data and knowledge generally in the emerging data-rich world.</p>

<p>That's the thesis of author David Weinberger's new book, <a href="http://toobigtoknow.com">Too Big to Know</a>.  "It's not simply that there are too many brickfacts [datapoints] and not enough edifice-theories," he <a href="http://www.theatlantic.com/technology/archive/2012/01/to-know-but-not-understand-david-weinberger-on-science-and-big-data/250820/">writes</a>. "Rather, the creation of data galaxies has led us to science that sometimes is too rich and complex for reduction into theories. As science has gotten too big to know, we've adopted different ideas about what it means to know at all."</p>

<p>The world's largest social network, rich with far more signal than any of us could wrap our heads around, could help illuminate emergent qualities of the human experience that are only visible on the network level.</p>

<p>Please don't mess up our chance to learn those things, Mr. Zuckerberg.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/why_facebooks_data_sharing_matters.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/why_facebooks_data_sharing_matters.php</link>
         <guid>http://www.readwriteweb.com/archives/why_facebooks_data_sharing_matters.php</guid>
         <category>Analysis</category>
         <pubDate>Fri, 13 Jan 2012 19:21:33 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>Automatic File Conversions and More with Dropbox Automator</title>
		<description><![CDATA[<p><img alt="dropbox150.jpg" src="http://www.readwriteweb.com/dropbox150.jpg" width="150" height="150" class="mt-image-none" style="" />Computers keep getting closer and closer to making people obsolete. The latest step towards human obsolescence? <a href="http://dropboxautomator.com/" title="Dropbox Automator">Dropbox Automator</a>, a Web-based tool for setting up actions that happen as soon as you put a file in a Dropbox folder. It&#8217;s not flawless just yet, but it might provide a useful service for many Dropbox users. </p>

<p>The service is powered by <a href="https://app.wappwolf.com/" title="Wappwolf">Wappwolf</a>, an online &#8220;<a href="https://app.wappwolf.com/Start/contact" title="Wappwolf: About Us">action store</a>&#8221; that features a set of <strong>Web actions</strong> that can process files. For example, it has ready made actions to encrypt and decrypt files, extract text from PDFs, convert documents to PDF, generate QR codes and manipulate images. </p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30977&amp;cb=30977' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30977&amp;n=30977' border='0' alt='' /></a></p>]]>

<![CDATA[<h2 id="dropboxautomator">Dropbox Automator</h2>

<p>The Dropbox Automator works by connecting to your Dropbox account and then defining actions based on which folder you place files into. For example, I connected my Dropbox account and created a folder called <strong>Appwolf</strong>. Then I defined actions to convert files placed into that folder into PDFs. </p>

<img alt="like-to-connect-1.jpg" src="http://www.readwriteweb.com/like-to-connect-1.jpg" width="577" height="342" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" />

<p>You can also do things like upload files to Slideshare, sign PDFs, scrape PDFs to text files and even translate files automatically using Bing Translator. It looks like much of Automator&#8217;s functionality just comes from tapping into Web-based services. </p>

<p>You can also automatically upload photos to Facebook or Flickr, add a bug (stamp) to a photo, resize or rotate photos and much more.</p>

<h2 id="afewglitches">A Few Glitches</h2>

<p>I found that the service isn&#8217;t entirely glitch free. It says that it can covert HTML files to PDF, which it does&#8230; but it just converts the text to PDF, so the tags are presented in the document instead of used for formatting. It might be that you need the header information before the service (<a href="http://www.en.conv2pdf.com/" title="conf2pdf">conf2pdf</a>) properly recognizes the file as HTML instead of plain text. </p>

<p>When Dropbox Automator zips files, it uses a format that doesn&#8217;t seem to be recognized on Mac OS X as a zip file. At least not by the <strong>Archive Utility</strong> that comes with OS X Lion.</p>

<img alt="folder-appwolf.jpg" src="http://www.readwriteweb.com/folder-appwolf.jpg" width="599" height="552" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /><em>Converting Files Using Dropbox Automator</em>

<p>It does convert OpenDocument Format (ODF) files OK, when it actually converts them. Of two ODF files I placed in the Appwolf directory, only one was converted. The other was placed in the <strong>processed</strong> folder that Dropbox Automator creates, but no PDF ever materialized. </p>

<p>But it&#8217;s a brand new service and I suspect they&#8217;re still shaking the bugs out. The service, at least for now, is free. How will they make their money? It&#8217;s unclear, but some of the actions you set up for files may cost money. So it&#8217;s possible that the developers will add premium services or charge a fee to other services for connecting users. If it catches on, I do hope that <a href="http://www.readwriteweb.com/cloud/2011/12/2011-the-year-the-free-ride-di.php" title="2011: The Year the Free Ride Died">they start providing paid accounts</a> so users can support the service. </p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/automatic_file_conversions_and_more_with_dropbox_a.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/automatic_file_conversions_and_more_with_dropbox_a.php</link>
         <guid>http://www.readwriteweb.com/archives/automatic_file_conversions_and_more_with_dropbox_a.php</guid>
         <category>News</category>
         <pubDate>Fri, 30 Dec 2011 14:25:00 -0800</pubDate>
<author>Joe Brockmeier</author>
      </item>
      
      <item>
         <title>Can Big Data Be Outsourced? Mu Sigma&apos;s $150 Million in VC Backing</title>
		<description><![CDATA[<p><img alt="musigmalogo.png" src="http://www.readwriteweb.com/images/musigmalogo.png" width="150" height="150" class="mt-image-none" style="" />They say Big Data is going to be big business, big innovation - a big deal.  But how is it going to go down?  Applied math and decision science company <a href="http://www.mu-sigma.com/">Mu Sigma</a> announced more than $100 million in new venture backing yesterday, including from previous investors Sequoia Capital, bringing the company's total investment to $150 million.  Mu Sigma provides big data services to some of the biggest companies in the world.  </p>

<p>How do they do it?  With a combination of math, science, creative thinking and long hours of hard work.  As democratized publishing, network connected devices and the instrumentation of everyday life combine to create a great blue ocean of big data all around us, the latest Mu Sigma funding is a valuable opportunity to get a taste of how one emerging leader in that market combines technology, math and art to engage with this big trend.  Not everyone agrees that outsourcing Big Data work like this is the solution, though.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30968&amp;cb=30968' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30968&amp;n=30968' border='0' alt='' /></a></p>]]>

<![CDATA[<h2>Can Big Data Be Outsourced?</h2>

<p>Mu Sigma says it exists to "enable businesses to institutionalize data-driven decision making."  Its 1300 employees in Chicago and Bangalore help clients with marketing, supply chain and risk analytics.  The firm says it "is arguably the world's largest pure-play decision sciences and analytics services company."</p>

<p>Employee reviews of the company on website <a href="http://www.glassdoor.com/Reviews/Mu-Sigma-Reviews-E253258.htm">GlassDoor</a> paint a picture of hard-driving young employees working grueling hours for low pay, but learning a lot at a young company.</p>

<p>The seven year old firm helps clients with things like customer segmentation and purchase likelihood analysis in marketing, fraud detection and severity and statistical analysis for FDA trials in risk analysis and supply chain work like trend plotting, due date quoting, expedition optimization, location allocation "decisioning", etc.  All based on data.</p>

<p>How can Mu Sigma compete in each of those tasks with other firms that specialize in one or the other?  That's unclear, but the company has developed momentum based on its broad approach.  Mu Sigma says that it's profitable, though the company declined to provide any specific financial numbers.  </p>

<p>Not everyone believes that solutions like Mu Sigma are the answer to Big Data problems and opportunities.  "I'm skeptical of the idea of end to end 'analytics outsourcing' right now," says Peter Skomoroch, of <a href="DataWrangling.com">DataWrangling.com</a>.<br />
 <blockquote>"There is value in having external experts embedded with internal teams to help with big data, but to compete companies will also need to build up in-house talent.   </p>

<p>"It is tough to find good data people, and even more difficult to find ones with business sense and domain knowledge.  Insight and creativity are not likely to be commoditized any time soon.  The competitive advantage in this space will go to companies that build up unique datasets and build teams that know how to leverage them.  Most game changing analytics is going to come from a small set of talented individuals, not an army of contractors."</blockquote></p>

<p>In-house data scientists are incredibly hard to find, though.  Cathy O'Neil, data scientist at ad startup Intent Media, <a href="http://mathbabe.org/2011/12/26/a-good-data-scientist-is-hard-to-find/">says</a> this is in part because "It is far less sexy to try to honestly find the confidence interval of a prediction than it is to model behavior."<br />
<blockquote>"Data scientists are considered magical when they forecast behavior that was hitherto unknown, and they are considered total downers when they tell their CEO, 'hey there's just not enough data to start that business you want to start,' or 'hey this data is actually really fat-tailed and our confidence intervals suck.'</p>

<p>"In other words, it's something like what the head of risk management had to face at a big bank taking risks in 2007. There's a responsibility to warn people that too much confidence in the models is bad, but then there's the political reality of the situation, where you just want to be liked and you don't actually have the power to stop the relevant decisions anyway."</blockquote></p>

<p>Perhaps given that reality, outside big data firm Mu Sigma is clearly a company with some economic wind in its sails.  Deborah Gage at the Wall St. Journal's Venture Wire provides a good look at <a href="http://blogs.wsj.com/venturecapital/2011/12/28/mu-sigma-lands-big-money-for-big-data/?mod=google_news_blog">the company's fast growth and interesting training program</a> in her coverage this morning.  </p>

<h2>Mu Sigma and Innovation</h2>

<p>Reading previous coverage of the company's work elsewhere, one name keep coming up: Zubin Dowlaty, Vice President and head of innovation and development at Mu Sigma.</p>

<p>Dowalty spent the 1990's doing statistical modeling at UPS.  Then he joined the publicly traded InterContinental Hotels Group, where he was first the Director of Analytics in Consumer Insight and then the VP of Decision Sciences.  He was featured prominently in a 2008 New York Times story about <a href="http://www.nytimes.com/2008/04/09/technology/techspecial/09predict.html">corporations using Prediction Markets</a> to surface cost-saving and other ideas from inside their companies.  Dowalty was photographed for the story wearing a wizard's cap and holding a magical looking walking staff in his hands.  </p>

<p>He built an elaborate system to invite the hotel company's employees to submit and vote on ideas, win rewards if theirs were selected and to surface via crowdsourcing strategic initiatives the company could act on. "We wanted to tap the creative class that may not be able to voice their ideas,"  Dowlaty told the Times.</p>

<p>Once at Mu Sigma, Dowlaty has become one of the company's most visible public figures.  His statements, as the head of innovation and development at a firm so focused on innovation, are noteworthy.</p>

<p>In a January 2011 article from <a href="http://tdwi.org/Articles/2011/01/05/Rise-of-Data-Science.aspx?Page=3">The Data Warehousing Institute</a> on the rise of the data scientist, Dowlaty articulates the role of art and of science in big data.<br />
<blockquote>"I'm not a big fan of the spaghetti method.  It makes me nervous when people run a lot of analytic techniques just to get the answer they want, instead of being objective. Doing this job properly requires the rigor of a scientist. The scientist can see things that other people cannot see."</blockquote></p>

<p>As a standalone statement, that doesn't sound particularly creative.  It is important, though.  "The 'spaghetti method,'" cautions Josh Wills, Chief Data Scientist at <a href="http://cloudera.com">Cloudera</a>, "frantically searching for a technique that gives you the answer you want (or potentially, the answer that someone higher up in the org wants), as opposed to using the scientific method. This is a big problem in the industry, and the theory is that using an external firm mitigates that habit to some extent. Being a good data scientist often means telling powerful people stuff that they don't want to hear."</p>

<p>Other statements from Dowlaty help put that sentiment about rigor in creative context.  Mu Sigma itself uses a variety of different analytic models to tackle all the problems they engage with.</p>

<p>Dowlaty <a href="http://www.revolutionanalytics.com/why-revolution-r/case-studies/Portfolio-Strategy-Helps-Mu-Sigma-Maintain-Leadership-Role-in-Competitive-Market.php">told Revolution Analytics</a>, whose R statistics software Mu Sigma makes use of:<br />
<blockquote>"We like to diversify our models...We have a portfolio of about 10 models that we'll run to assess the stability of the coefficient and the predictive capability of that particular model. By running all the models, you can see which ones are the best predictors."</blockquote></p>

<p>Revolution says of Dowlaty's use of R at Mu Sigma, "The benefit of an 'ensemble' approach is that when new analytic techniques emerge, they can be brought into the mix without causing disruption. This makes the R especially valuable to Dowlaty, since the R software library evolves continually as members of the worldwide R community contribute new packages and programs."</p>

<p>In fact, both rigor and flexibility are key to the paradigm Dowlaty advocates. "The trend is toward a multi-disciplinary approach to extracting value from data," he told The Data Warehousing Institute early this year. "It's not just about math anymore. You also need technology skills, but what ultimately separates the analyst from the scientist is the dimension of artistic creativity. It's the soft skills that make the big difference."</p>

<p>That combination of skills is what enables the firm to tackle the incredibly complex work they do.  Dhiraj Rajaram, Mu Sigma's CEO and the man who founded the company in 2004, spoke at the 2010 <a href="http://www.predictiveanalyticsworld.com/">Predictive Analytics World</a> conference on a panel with Mu Sigma customer Walmart.  </p>

<p>Walmart Financial Services, which named Mu Sigma its Supplier of the Year in 2011, works with the big data company to analyze and optimize the marketing of its financial products.</p>

<p>Decision Management analyst James Taylor <a href="http://jtonedm.com/2010/02/16/marketing-mix-modeling-at-walmart-financial-services-pawcon/">blogged the following summary</a> of the conference presentation about the collaboration between Walmart Financial Services and Mu Sigma.  This sounds like very complicated work.<br />
<blockquote>"WFS uses transaction life analysis around run rate and growth, price / mix analysis, financial returns and qualitative analysis of the creative. Marketing Mix modeling, optimization, lets them see the effect of individual campaigns (there's a lot of Walmart stuff going on in the market), account for seasonality and manage at the store level. The idea is to make sure the marketing investment is optimized, targeted and repeatable.</p>

<p>"Marketing mix optimization uses weekly sales, store traits and demographics, event information and macro-economic data to see how effective specific events were and what was the contribution to the overall effect. What was the value or contribution of each element, did they cannibalize each other, did they resonate in specific areas etc."</blockquote></p>

<p>That sounds like a potent combination of math, science and creative thinking.  It's probably more a picture of the sector than of one company alone.  Forrester analyst James Kobielus specializes in big data and says he's done one briefing with Mu Sigma but didn't detect any particular unique flavor to the firm's work relative to others in the sector.   Mu Sigma hasn't yet responded to our request for comment on this article.</p>

<p>Perhaps this company is typical of the sector and the questions to ask about it are more general.</p>

<p>"My caveat with services like MuSigma is that they can analyze your data, but they can't change your business," says Cloudera's Wills.  <br />
<blockquote>"You are free to ignore what they tell you, and it is often the case that the answers they can give you are limited by your business practices and the data that you currently collect. The advantage of having an in-house data scientist, especially one with some programming skill, is that they can develop systems that collect better data so that they can come up with better answers.</blockquote></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/can_big_data_be_outsourced_mu_sigmas_150_million_i.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/can_big_data_be_outsourced_mu_sigmas_150_million_i.php</link>
         <guid>http://www.readwriteweb.com/archives/can_big_data_be_outsourced_mu_sigmas_150_million_i.php</guid>
         <category>Analysis</category>
         <pubDate>Thu, 29 Dec 2011 08:53:59 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>After Years of Missteps, Facebook&apos;s Timeline is an Epic Win</title>
		<description><![CDATA[<p><img src="http://rww.readwriteweb.netdna-cdn.com/images/facebook_logo_square_apr10.jpg">Facebook's <a href="http://www.readwriteweb.com/archives/facebook_timeline.php">new Timeline profile feature</a> is great, even if it is a little strange.  It's narcissistic, but that's a big part of the fun of it, and I'm not sure that other peoples' timelines are nearly as interesting as mine is to me. </p>

<p>It's an incredibly feature-rich new type of social network profile. It's a re-imagination of what a profile can be.  It makes me want to use Facebook more, to share more data with Facebook so that it can be preserved and displayed so nicely, years into the future.  While other Facebook features have pushed users into posting publicly by default, or posted their activities from other places they didn't understand would become part of the public record, I think Timeline is a genuine value add to incentivize users to share more.  I think it's great.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30778&amp;cb=30778' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30778&amp;n=30778' border='0' alt='' /></a></p>]]>

<![CDATA[<p><img alt="FBTimeline.png" src="http://www.readwriteweb.com/images/FBTimeline.png" width="610" height="379" class="mt-image-none" style="" /><br />
Data is at the heart of the Facebook Timeline, your data - about your life, about your activities as recorded on Facebook and about  your social connections.  The music you listen to, the places you go and the things you do.  Insights and experiences built on top of data are going to be a big part of the future of human/computer interactions.  Facebook Timelines are a great first look at that idea for hundreds of millions of people. They are also something that Twitter can never do, for both technical and cultural reasons.</p>

<p>It's one thing to see this data all in a News Feed as Facebook has long showed it, it's fundamentally different to see Yourself and Others presented like a work of art in this new Timeline layout.  </p>

<p>By highlighting the content you've published that has received the most social engagement, in the form of comments and Likes, your Facebook Timeline takes its best shot at presenting your Best Self to the world.  The mundane updates are hidden in the background and the highlights of your life, if you posted about them on Facebook, are programmatically discoverable and now displayed in an attractive page layout.</p>

<p>It doesn't work perfectly, my Timeline says that I married my wife 3 times on 3 different dates, but generally speaking it works really well.  It looks great on m.facebook.com too.</p>

<p>The Facebook Timeline represents the Instrumentation of Your Life, making things measurable and then building on top of those measurements.  It's a big deal in the world of social software.</p>

<p>That Facebook launched such a bold new implementation of every user's data about themselves just months after getting slapped with a 20 year privacy audit requirement from the US government is bold.</p>

<h2>As Not Seen on Twitter</h2>

<p>Meanwhile, over on Twitter, that competing social network can't remember what you did two weeks ago.  It does remember, it just won't let you remember.  Historical content on Twitter is severely limited.  </p>

<p>The company has said officially that's because Twitter is all about the here and now, it's real-time.  Unofficially it's said though that the root of the problem was in a series of database creation decisions that were made years ago.  It would now be super expensive to change that. </p>

<p>There is something about Twitter that's more conversational, more News focused and less conducive culturally to something like Timeline.  </p>

<p>For the vast majority of its users, I'd also guess that Twitter accounts post fewer messages and get fewer responses that can be measured to determine highlights than is the case on Facebook.  </p>

<p>Facebook also has a lot of structured data in the user's profile and changes to that become events, which social activity swarms around and which then become notable points in your life.  You changed your marital status?  That's probably going to get a lot of discussion.  There is no equivalent on Twitter.  Were Twitter to highlight your biggest tweets, they would likely be the wittiest quips you've made over the years, not the real life events.</p>

<p>Twitter is working on convincing people that tweets are great for reading, that it's largely a reading experience.  Facebook, on the other hand, has always wanted you to share, share, share.  </p>

<p>Many of us are doing things outside of Facebook, though.  A lot of that is being shared back into our Newsfeed, but not all of it.  I am very impressed with what Facebook has done, but I wish there was some more effective competition out there.  There are various startups who have tried to do this, though none anywhere near as well as Facebook's hired and acquired team of world-beating design pros. </p>

<p>I joined Facebook 5 years ago this Fall, according to my Timeline.  It's cool to see all that history presented so nicely and it makes me want to put more content into Facebook so I can see it later.  I imagine that's the point. </p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/after_years_of_missteps_facebooks_timeline_is_an_e.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/after_years_of_missteps_facebooks_timeline_is_an_e.php</link>
         <guid>http://www.readwriteweb.com/archives/after_years_of_missteps_facebooks_timeline_is_an_e.php</guid>
         <category>Data Services</category>
         <pubDate>Fri, 16 Dec 2011 09:05:15 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>Engag.io: A Tool to Track All Your Conversations Online in One Place</title>
		<description><![CDATA[<p><img alt="engagiologo.png" src="http://www.readwriteweb.com/images/engagiologo.png" width="150" height="150" class="mt-image-none" style="" />Social media is supposed to be all about engagement and authenticity, but sometimes it can feel so distributed and overwhelming that conversations get lost.  A new web app called <a href="http://Engag.io">Engag.io</a> has tackled this classic problem and offers a pretty good solution that I think you'll want to check out.  It's in private alpha right now but we've got an invite code at the bottom of this post.  That someone is making an app like this gives me hope that there are still great ideas that can be built on top of the most basic building blocks of the social web.</p>

<p>Engag.io, which gets its name from being the place for your online engagement input and output, is like an inbox for all your conversations on Twitter, Facebook, Google Plus, Foursquare and blog comments.  It's an inbox with analytics.  It's built by the team behind content curation company <a href="http://www.eqentia.com/">Eqentia</a>.  Eqentia is ambitious but a little too complicated; Engag.io is very simple and the value of it will be immediately obvious to many people.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30695&amp;cb=30695' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30695&amp;n=30695' border='0' alt='' /></a></p>]]>

<![CDATA[<p><img alt="engagioscreen.png" src="http://www.readwriteweb.com/images/engagioscreen.png" width="610" height="289" class="mt-image-none" style="" /></p>

<p>In order to get started with Engagio, you have to authenticate with different services you use around the web.  Fortunately, this has become super easy to do and very secure with just a few clicks.  The open authentication standards that have been developed over recent years make mashups like Engagio really easy to implement and that's awesome.</p>

<p>You can log in with your accounts on Twitter, Facebook, Foursquare, Google Plus, Disqus, Hacker News and Tumblr.   Then Engagio will watch for comments posted to and from you on any of those services and give you one unified inbox to track the conversations inside of.  </p>

<p>"We believe that having a universal Conversation Inbox could become a daily time saver," says the Engagio blog.  "It will save you time because you don't have to check the multiple source sites where you have placed your comments. And you can for example focus on replies first, before you get to other commenting."</p>

<div class="pullquote">That's pure gold, right there - but a few days later I'd already forgotten who said it to me, where to find it, etc.  Enter Engagio Comment Search and boom!  All my problems are solved.</div>The ability to search your comments is really nice too.  It's already coming in handy for me.  The other day on Twitter I was talking about the concept of the Project Triangle: Fast, good, cheap - pick two.  I was saying that I've been thinking about how different companies in my life relate to that equation and author-from-the-future <a href="https://twitter.com/#!/toddsattersten">Todd Sattersten</a> says to me, "@marshallk dropping one to get the other two is a faulty construct. Vary 4th element Scope to allow all 3 #agile...My review of @kmaney Trade-off http://t.co/RrejjqJQ and check out my ebook Fixed to Flexible for more use http://t.co/6nygC7OX."

<p>That's pure gold, right there - but a few days later I'd already forgotten who said it to me, where to find it, etc.  Enter Engagio Comment Search and boom!  All my problems are solved.</p>

<p><img alt="engagioscreen2.png" src="http://www.readwriteweb.com/images/engagioscreen2.png" width="610" height="347" class="mt-image-none" style="" /></p>

<p>It's a great idea and I've been returning to the site daily to try and stay engaged with people who took the time to respond to me around the web.  It's mostly Twitter conversations and some Google Plus threads in my experience, but I hope that Engagio will help me be all the more...in touch with conversations in other places too.</p>

<p>The analytics part of the service could really use some UI work, but the idea is that Engagio will show you who you're interacting with the most.  You might be surprised who some of your top responders are - and those are people you should probably engage with all the more.  Or at least know, if you're going to be as social as you might want to be in the social media.</p>

<p>The Engagio team could use someone to sit down with them and go through some real-life commenting experiences because I think the user flow could really be improved.  Site founder William Mougayar is a commenting machine, he posts comments all the time everywhere, but I suspect his experiences are different from the way other people would want to use a service like this.</p>

<p>Super blogger and tech investor Fred Wilson, a man who gets more and more intelligent comments in response to his online activity than probably anyone else you'll ever meet, has been<a href="http://www.avc.com/a_vc/2011/12/engagio.html"> a cheerleader for Engag.io</a>.  Wilson says he urged Mougayar to "make it like gmail for social conversations."   Gmail is deceptively simple though and Engagio will take more work to get close to that level of usefulness.  As <a href="http://www.fakegrimlock.com/">FAKE GRIMLOCK</a> put it, "IS MVP. UGLY OK FOR NOW."  A minimum viable product it is, but one that I think many people will want to see developed further.</p>

<p>That this is a tool designed to make the living social graph more transparent and sticky is exciting.  I absolutely love the idea.  Several users have pointed out that a mobile interface would suit real user behavior especially well and I agree with that.  </p>

<p>A small number of people can jump in and kick the tires now, using the code "rwwengage" at <a href="http://Engag.io">Engag.io</a>.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/engagio_a_tool_to_track_all_your_conversations_onl.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/engagio_a_tool_to_track_all_your_conversations_onl.php</link>
         <guid>http://www.readwriteweb.com/archives/engagio_a_tool_to_track_all_your_conversations_onl.php</guid>
         <category>Data Services</category>
         <pubDate>Mon, 12 Dec 2011 17:09:30 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>Easy-to-Use Mashup Tool ifttt Gets Betaworks Backing</title>
		<description><![CDATA[<p><img src="http://rww.readwriteweb.netdna-cdn.com/images/lead-images/ifttt150.png">Point and click web mashup startup <a href="http://IFTTT.com">ifttt</a> ("if this then that") has raised financing from cutting-edge tech incubator <a href="http://Betaworks.com">Betaworks</a>.  News of the funding came to us via <a href="http://neuvc.com/labs/vcdelta/">NeuVC's bot</a> watching the firm's portfolio page, which is fitting given the nature of the startup.</p>

<p>ifttt allows anyone to set up a chain of conditional actions between a wide variety of web services, like "If I post a photo to Flickr, save it to my Dropbox."  The company calls these "recipes."  We wrote about the service when <a href="http://www.readwriteweb.com/archives/how_to_back_up_your_life_automatically_with_ifttt.php">it launched to the public in September</a>.  Microsoft's Scott Hanselman also <a href="http://www.hanselman.com/blog/EssentialIFTTTIfThisThenThatProgrammingWorkflowsForHumansUsingTheWebsSocialGlue.aspx">wrote up a nice review of the service</a> and says "this is going to be huge."  ifttt isn't just a single service, though, and it isn't even just an amalgamation of multiple services strung-together; it's a great example of a whole paradigm of DIY mashups.  As Blogger and WordPress were to self-publishing and YouTube was to video publishing, so ifttt could be to working with interlinked web applications for everyday people.  Can this startup herald a new era of lay hackers?  The UI is good, the only question is whether there's really enough demand for such a service.<br />
</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30650&amp;cb=30650' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30650&amp;n=30650' border='0' alt='' /></a></p>]]>

<![CDATA[<p>ifttt was started by Linden Tibbets, a computer scientist formerly at design powerhouse <a href="http://www.ideo.com/">IDEO</a>, film artist Alexander Tibbets and designer Jesse Tane, also formerly of IDEO.  </p>

<p>Here's how the startup introduced itself at launch:<br />
<blockquote>"We began with the theory that as our digital tools became more domain specific and easier to use, there would be vast amounts of creative potential in how any two tools might be used in tandem. We knew that with this immense potential came a problem of equal proportions. There just aren't enough developers and designers in the world to craft all these connections. A million developers at a million laptops wouldn't even make a dent. So we set out to build an incredibly simple tool that anyone could use to define creative, event-driven tasks that fit the pattern 'if this then that.'"</blockquote></p>

<p>If there's anyone who can pull something like this off, having experience at IDEO is great background from which to give it a shot.</p>

<p>The most popular ifttt recipes are <a href="http://ifttt.com/recipes?sort=popular">here</a>; co-founder Linden Tibbets's are <a href="http://ifttt.com/people/linden">here</a>.</p>

<p>Is this something that a whole lot of people are going to be interested in and go to the trouble to do?  I know I am and I wouldn't be surprised to see that you are, RWW readers, but it will be interesting to see how this becomes a business.  Either way, it's great to see one of the web's most interesting investors back something so focused on generating creative use of online tools.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/easy-to-use_mashup_tool_ifttt_gets_betaworks_backi.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/easy-to-use_mashup_tool_ifttt_gets_betaworks_backi.php</link>
         <guid>http://www.readwriteweb.com/archives/easy-to-use_mashup_tool_ifttt_gets_betaworks_backi.php</guid>
         <category>Data Services</category>
         <pubDate>Fri, 09 Dec 2011 11:03:13 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>In A Data Driven World, Tablet Publishers Have An Evolving Toolset</title>
		<description><![CDATA[<p><img alt="newspapers150.jpg" src="http://www.readwriteweb.com/newspapers150.jpg" width="150" height="150" class="mt-image-none" style="" />The media and news industry, after 10 years of disruption and economic torture, finally thought that it had gotten a step ahead. Publishers were in on the ground floor when the tablet revolution started with products ready to go even before Steve Jobs introduced us to the original iPad. The marriage of tablets to publishing would be a boon for everybody. </p>

<p>The honeymoon has not been sweet. </p>

<p>Publishers did not have the tools to create fully functional magazines from the very start. Sure, they were nice looking, but that was about it. Over the last two years, though, publishers and developers have created dynamic tools that allow the news media to create apps that do not just meet user expectations, but go beyond them. </p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30418&amp;cb=30418' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30418&amp;n=30418' border='0' alt='' /></a></p>]]>

<![CDATA[<p>The growth of the toolset for publishers directly coincides with the rise of the startup community that has emerged to add new functionality to major mobile operating systems. For instance, Urban Airship, a cloud services provider for mobile, was probably not ready to fully unleash its suite for publishers when the iPad was first announced. Localytics, a mobile analytics company out of DogPatch labs in Boston, was just starting. Other cloud services like Kinvey, StackMob and Parse were nothing more than ideas. </p>

<p>What can these services offer now that was more difficult to do when publishers initially came around to the idea of tablet magazines and newspapers? Quite a bit, actually. Let's look at what<a href="http://urbanairship.com/blog/2011/11/29/the-future-is-bright-for-newsstand-publishers/"> Urban Airship can do for publishers</a> looking to put magazine apps in Apple's iOS Newsstand.</p>

<blockquote><ul>
	<li>Newsstand library support: configures newsstand apps for auto-renewable subscriptions and single issue purchases through Apple's in-app purchasing toolset.</li>
	<li>Content delivery: Hosts and delivers content available to users on any iOS device.</li>
	<li>Push notifications: This is a big one and perhaps one of the main items missing from original tablet apps. The ability to create easy push notifications enables publishers to put news right under the noses of users, increasing engagement and page views and alerting readers of new issues. The evolution of push notifications for mobile and tablet apps will be a critical piece in how publishers utilize the devices. </li>
	<li>Audience segmentation: Who is reading what? Where do you want to send certain readers? Urban Airship has "tags and aliases" features to associate mobile users with publishers CRM and personalize the experience.</li>
	<li>Free seed content: Users hate to download a news app and then automatically be forced to subscribe before getting content. Publishers need to sweeten the deal a little bit and give users free content, such as back issues, listed next to subscription services. </li>
	<li>Reports: How are users interacting with the news app? Analytics is the heart of understanding mobile users and there are a variety of companies out there, from Urban Airship, Localytics, Flurry, Apsalar and others that can help administrators reach the core goals of engagement and behavioral analysis.</li>
</ul></blockquote>

<p><img alt="newsstand_ios5.jpg" src="http://www.readwriteweb.com/newsstand_ios5.jpg" width="600" height="313" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></p>

<p>What it comes down to is that publishers need to start looking at tablet apps like a developer would look at any other app, like a game or a utility. The goal is to gain reach, engage users, analyze behavior and monetize it. The Guardian and Future Publishing use Urban Airship and large publishers employ Flurry, Apsalar and Localytics for engagement and advertising insights. </p>

<p>There are also design tools that publishers can turn to for creating news apps that have evolved in the last year or so. <a href="http://www.businesswire.com/news/home/20111129005588/en/Mag-Expands-Tablet-Publishing-Platform-Enterprise-Offering">Mag+</a> recently announced a new suite of publishing tools and pricing tiers for the smallest publisher to the largest magazine. Designed by editors and designers for editors and designers, it provides three tiers of publishing tools. There are also solutions from companies like Daylife, which launched <a href="http://www.readwriteweb.com/archives/real-time_live_website_editing_coming_the_daylife.php">real-time media management in June. </a></p>

<p>Apps are a data driven ecosystem, tied to the cloud, in the pockets of users on the go. News publishers need to understand the tools that can help them make the jump for the decaying old media standard to flexible and responsive new media technology companies. <br />
</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/tablet_publishers_evolving_toolset.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/tablet_publishers_evolving_toolset.php</link>
         <guid>http://www.readwriteweb.com/archives/tablet_publishers_evolving_toolset.php</guid>
         <category>Mobile</category>
         <pubDate>Wed, 30 Nov 2011 09:00:00 -0800</pubDate>
<author>Dan Rowinski</author>
      </item>
      
      <item>
         <title>Education-Specific HTML to Be Submitted to Search Engines Soon</title>
		<description><![CDATA[<p><img alt="LRMIlogo.jpg" src="http://www.readwriteweb.com/images/LRMIlogo.jpg" width="150" height="150" class="mt-image-none" style="" />Students, educators and others interested in finding the best published content, events and experts for learning new things will be heartened to learn that a new metadata markup standard is in the works to make discovery of learning materials easier than ever.  Perhaps more importantly, it will make those materials easier for machines to find.  Once finding the right content is a solved problem, many new things could become possible.</p>

<p>The <a href="http://www.lrmi.net">Learning Resource Metadata Initiative</a> (LRMI), a project co-led by the Association of Educational Publishers and Creative Commons, <a href="http://creativecommons.org/weblog/entry/30463">today took the next step</a> towards submitting its specification to <a href="http://schema.org">Schema.org</a>, the collaboration between Google, Yahoo and Bing that maps out 100 different types of content online in a standardized format.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30278&amp;cb=30278' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30278&amp;n=30278' border='0' alt='' /></a></p>]]>

<![CDATA[<p>The LRMI 0.5 spec lets publishers communicate in a page's HTML things like the competencies taught, the competencies required, the type of educational materials and the typical age range of intended users for anything educational published online.  Time required for completion, degree of interactivity and a small number of other ways of describing educational content are included in the spec.</p>

<p>Active participants working to figure out how to construct LRMI and how to integrate it into Schema.org include people from small non-profits like open curriculum community <a href="http://www.curriki.org/">Curriki</a>, corporate education technology giant <a href="http://www.pearson.com">Pearson</a>, international information standards group <a href="http://dublincore.org/">Dublin Core</a> and intellectual property law group <a href="http://creativecommons.org">Creative Commons</a>, among others.</p>

<center><iframe width="480" height="360" src="http://www.youtube.com/embed/c_cI34l3t2k" frameborder="0" allowfullscreen></iframe></center>

<p>Participants debate on the official mailing list over new terminology, balancing concerns like coherence with Schema.org, ease of input by people who will enter metadata to go with resources being published online and specificity gained or lost by the way that metadata fields are named and framed.</p>

<p>While some semantic technologies are <a href="http://www.readwriteweb.com/archives/the_top-down_semantic_web.php">able to assert categorization from the top down</a>, whether content publishers participate or not, it seems likely that the kind of data that will be communicated in LRMI will require informed participation by the producers of the content themselves.  Requiring participation in categorization could pose a challenge to hopes the spec will gain meaningful adoption.</p>

<p>The LRMI effort doesn't seem well-known yet outside its own ranks, either; the official website has almost no inbound links indexed by Google yet and none of the education technology blogs we track here at ReadWriteWeb have mentioned LRMI yet.  The project was just announced last month though and in the education market, a month isn't a very long time.</p>

<p>LRMI isn't alone though, either.  Nathan Angell, a Board Director at the collaborative open education software community <a href="http://sakaiproject.org/sakai-foundation">Sakai Foundation</a> and a Product Manager at <a href="http://www.rsmart.com/">rSmart</a>, calls LRMI "another welcome intervention in growing list of data specifications for education."</p>

<blockquote>"These days we have access to an unbelievable number of learning resources--both open and proprietary--but it's still hard to find the right ones, quality resources, suited to your needs, when you need them.

<p>"For example, in the Sakai community, we have built a new platform--the Open Academic Environment--that helps people create and tag learning materials, and most importantly, share them openly by default. </p>

<p>"With the LRMI specification, we can help people tag their materials with exactly the right information that will make them easy for others to find and use...and even better, we can augment the suggested content widgets we already have in place to discover resources in the moment that match the very specific needs of a particular educator or student."</blockquote></p>

<p>Angell, who isn't associated with LRMI in particular, sees data specifications like this as potential game changers.  Those suggested content widgets are really shorthand for computation that can begin at a higher level of abstraction if the hard work of content categorization and description has already been done in a standardized way.  That means education technology providers, search engines and others don't have to invest time and energy into understanding educational resources online - they can begin with a pre-existing understanding of that content and then offer higher-level features and services on top of already-organized information.   </p>

<p>"LRMI helps set the stage for the hive mind that will help our children's children learn faster and better than we ever thought possible," Angel says. "In comparison, school today will look like drawing pictures in the dirt with a stick."<br />
</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/education-specific_html_to_be_submitted_to_search.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/education-specific_html_to_be_submitted_to_search.php</link>
         <guid>http://www.readwriteweb.com/archives/education-specific_html_to_be_submitted_to_search.php</guid>
         <category>Data Services</category>
         <pubDate>Mon, 21 Nov 2011 23:31:33 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>How the DC&apos;s Metro Opened Up Its Data</title>
		<description><![CDATA[<p><img alt="metro-150.jpg" src="http://www.readwriteweb.com/metro-150.jpg" width="150" height="150" class="mt-image-none" style="" />Three years ago, the <a href="http://www.wmata.com">Washington Metropolitan Area Transit Authority</a> looked lost, and so did many of its riders.</p>

<p>Those who hadn't memorized Metro's schedules had to employ its persnickety <a href="http://www.wmata.com/rider_tools/tripplanner/tripplanner_form_solo.cfm">Trip Planner</a>, a clunky Web form that not only won't let you click on a map to specify your location but also chokes on cities, states, Zip codes and even commas if you add them to a street address. Meanwhile, other U.S. cities had enjoyed transit directions from sites like Google Maps since at least <a href="http://googleblog.blogspot.com/2005/12/public-transit-via-google.html">2005</a>. But not DC.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30225&amp;cb=30225' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30225&amp;n=30225' border='0' alt='' /></a></p>]]>

<![CDATA[<p>Worse yet, after the first step to share schedules, converting data to the standard <a href="http://www.wmata.com/rider_tools/tripplanner/tripplanner.cfm">General Transit Feed Specification</a> format, Metro had halted the effort. In December 2008, a spokesman <a href="http://greatergreaterwashington.org/post/1495/metro-refuses-to-participate-in-google-transit/">told the urban-development blog Greater Greater Washington</a> that continuing it was "not in our best interest from a business perspective." That left Metro riders with kludgey, screen-scraping workarounds like <a href="http://dcist.com/2006/05/09/introducing_las.php">DCist's text-messaging service</a>.<br />
<a href="http://www.readwriteweb.com/trip.jpg"><img alt="trip.jpg" src="http://www.readwriteweb.com/assets_c/2011/11/trip-thumb-610x381-36121.jpg" width="610" height="381" class="mt-image-left" style="float: left; margin: 0 20px 20px 0;" /></a><br />
But now, Metro rail and bus directions are clicks or taps away in third-party sites and applications, allowing passengers to benefit from such innovations as <a href="http://googlemobile.blogspot.com/2011/07/google-maps-57-for-android-introduces.html">Google's stop-by-stop transit navigation on Android phones</a>. The changes in between suggest a road map for other organizations having their own open-data debates.</p>

<p><b>Start lobbying to open a conversation.</b> Greater Greater Washington editor David Alpert quickly had <a href="http://greatergreaterwashington.org/post/1503/irate-riders-flooding-wmata-mailboxes/">hundreds of signatures on a petition</a> protesting the decision. That persuaded Metro to detail its objections: <a href="http://washingtonexaminer.com/local/2008/12/metro-involvement-google-transit-held-details">fear of losing $68,000</a> in yearly ad revenue from the Trip Planner page, a wish to be paid for its data, and the legalese around data sharing. That then widened the discussion from an argument over APIs to one over the proper use of taxpayer dollars.</p>

<p>"David came along and started this campaign," said Christopher Zimmerman, an Arlington County Board member who chaired Metro's board in 2008. "It wouldn't have happened without the public pressure." Alpert could also lobby Metro from closer in after <a href="http://greatergreaterwashington.org/post/1636/joining-the-riders-advisory-council/">joining its Riders' Advisory Council in January of 2009</a>.</p>

<div class="super-pullquote"><em><a href="mailto:rob@robpegoraro.com">Rob Pegoraro</a> worked for more than a decade covering technology for the Washington Post. His <a href="http://robpegoraro.com">blog can be found here.</a></em></div>

<p><b>Flipping the debate from potential profits to actual expenses. </b>Gordon Linton, a former head of the Federal Transit Administration who served alongside Zimmerman on the Metro board, focused on the opportunity cost of giving data to other sites. He said the agency had given away resources - for example, <a href="http://www.thecommondenominator.com/041706_news2.html">free parking for car-sharing services</a> - that it later realized could yield income.</p>

<p>"While we were raising fares and cutting service if we were investing staff time and energy for a product that would reap some financial benefit for those who would use it and sell it, then we in turn should get some money back," Linton said.</p>

<p>Zimmerman took the opposite argument: the ease of upgrading one aspect of the Metro experience. "We were having a lot of difficulties," he said. "If some of these things don't cost us anything or don't cost us a lot [to fix], we should do them right away."<br />
<a href="http://www.readwriteweb.com/WMATAmap.jpg"><img alt="WMATAmap.jpg" src="http://www.readwriteweb.com/assets_c/2011/11/WMATAmap-thumb-610x458-36124.jpg" width="610" height="458" class="mt-image-left" style="float: left; margin: 0 20px 20px 0;" /></a><br />
Alpert suggested this debate encouraged Metro staffers to rethink things. "Staff may have felt they were under orders from the board to maximize revenue. Zimmerman gave them permission not to worry about that."</p>

<p>Oh, and Google never had any interest in paying for a schedule feed. Wrote spokeswoman Anne Espiritu: "We do not pay agencies for their data."</p>

<p><b>A change in leadership can help. </b>All of this effort got sidetracked on June 22, 2009 when <a href="http://www.washingtonpost.com/wp-srv/special/metro/red-line-crash/">two Red Line trains collided</a> and killed nine passengers. Things were set back further when general manager John Catoe <a href="http://voices.washingtonpost.com/getthere/2010/01/dc_metro_system_general_manage.html">unexpectedly resigned in early 2010</a> and his interim replacement Richard Sarles had to focus on safety upgrades.</p>

<p>But Sarles' earlier employer, NJ Transit, had provided rail schedules to Google <a href="http://www.njtransit.com/tm/tm_servlet.srv?hdnPageAction=PressReleaseTo&amp;PRESS_RELEASE_ID=2405">back in 2008</a>. He wanted to follow suit here, said Metro spokesman Dan Stessel.</p>

<p>Metro and Google <a href="http://greatergreaterwashington.org/post/8993/google-and-wmata-signed-google-transit-agreement-in-july/">signed a data-sharing agreement in July of 2010</a>, once Google had dropped earlier demands for an indemnification clause. Metro directions <a href="http://www.bing.com/community/site_blogs/b/maps/archive/2010/09/16/bing-maps-gets-transit-directions.aspx">showed up on Microsoft's Bing Maps in September 2010</a>; they <a href="http://www.wmata.com/about_metro/news/PressReleaseDetail.cfm?ReleaseID=4919">arrived on Google in May</a> after a final shove from Sarles following his appointment as <a href="http://www.wmata.com/about_metro/news/PressReleaseDetail.cfm?ReleaseID=4819">WMATA's full-time general manager in January</a>.</p>

<p>(But even now, you must return to Metro's sites or a few third-party apps for bus and train arrival predictions. Google only <a href="http://googlecode.blogspot.com/2011/08/introducing-gtfs-realtime-to-exchange.html">announced a standard format for that data, GTFS-realtime</a>, in August, and Metro is still weighing support for that. And some regional bus systems that Metro's Trip Planner includes have yet to provide their own GTFS data to mapping sites.)</p>

<p>What about the original financial arguments? We may never know how the math worked out: without detailed surveys, you can't draw a line from clicking on maps to boarding trains. But Stessel, who didn't provide a dollar cost for the work involved, suggested that the rationale merchants invoke to invest in intangibles like store or site designs works for public transit too: "Our primary motivation is improving the customer experience." Zimmerman had a more philosophical justification for how a government agency should act: "Putting information in the public domain is part of what we do."</p>

<p>Sometimes, being open isn't easy. You could say that there were many stops on the route that Metro took.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/three_years_ago_the_washington.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/three_years_ago_the_washington.php</link>
         <guid>http://www.readwriteweb.com/archives/three_years_ago_the_washington.php</guid>
         <category>Data Portability</category>
         <pubDate>Fri, 18 Nov 2011 03:00:00 -0800</pubDate>
<author>Rob Pegoraro</author>
      </item>
      
      <item>
         <title>What a Tweet Can Tell You</title>
		<description><![CDATA[<p><img src="http://rww.readwriteweb.netdna-cdn.com/images/datasift_aug10.jpg">Imagine a tiny little sun, just bursting with heat and light, but trapped inside a hard metal cover with a few holes to let beams of energy stream out from inside.  Now imagine there were millions of those little suns, maybe the size of basketballs or tennis balls, all rolling down an assembly line one after another, each with a unique pattern of holes and beams of light streaming out into the world.</p>

<p>That's what Twitter is.  Inside every unborn tweet you can find infinite potential - someone will be in a place, with social context and they will say something, anything, and give that potential a form.  They will say something and it will be instantly available to anyone in the world who's subscribed. Each tweet <a href="http://www.readwriteweb.com/archives/this_is_what_a_tweet_looks_like.php">has more than 30 fields of metadata</a> under the hood; the value populating each of those fields makes up the unique patterns of holes in the metal cover that lets the light out from inside.  <strong>A company launched today that lets you control a robot that drills holes in the metal covers trapping the infinite potential of the sun inside.</strong></p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30202&amp;cb=30202' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30202&amp;n=30202' border='0' alt='' /></a></p>]]>

<![CDATA[<center><img src="http://rww.readwriteweb.netdna-cdn.com/images/map_of_a_tweet.png"></center>

<p>"One could spend months mining Twitter using @DataSift," said Paul M. Watson, <strike>CEO</strike> CTO of stream curation startup Storyful, today on Twitter. "Great balance of usability & power in the CSDL." (Curated Stream Definition Language)</p>

<p>Were one to look into the screaming firehose of hundreds of millions of tweets on Twitter and call out to the robot gods of data sifting, "Give me the tweets by self-proclaimed South Africans, living in Ireland, with positive sentiment and that have been retweeted by data-loving tech investor Roger Ehrenberg!"  Were one to dip into that river in search of a sliver like that, which may or may not exist (it does, Watson's fits the bill), then the freshly launched startup <a href="http://datasift.com">Datasift</a> would be the tool one would use to do so. It would cost you pennies, too.</p>

<p>Years in the making, Datasift launched today as the second licensed reseller of tweets.  The startup doesn't just resell tweets like yesterday's news, though.</p>

<p>Datasift lets anyone parse the full firehose of Twitter messages with its simple Curated Stream Definition Language and see the resulting flow of messages that fit the criteria described.</p>

<p><img alt="datasiftpic1.jpg" src="http://www.readwriteweb.com/images/datasiftpic1.jpg" width="610" height="446" class="mt-image-none" style="" /></p>

<p>You wouldn't likely ask for all the tweets from self-proclaimed South Africans living in Ireland, but you might ask for all the tweets posted by women living in a particular state in the US and using any of a list of keywords.  Market research, political monitoring, news reporting, there are all kinds of use cases.  Whether the set of would-be customers intersects substantially with the set of people curious enough to think of the right questions to ask the data is a big question.</p>

<p>Anyone can say anything on Twitter, and with Datasift you can ask whether anything is being said. 80%+ of retweets of @justinbieber are from Females, the company says.</p>

<p>Datasift just opened to the public today, so despite its best efforts there are still some kinks that need to be worked out.  While the team attended its launch party, the preview rendering functionality stopped working for a little while.  The pricing, while explained a number of different ways, still needs more clarification and development, founder Nick Halstead said today on Twitter.</p>

<p>The query and filtering tool isn't as easy to use as the company would like it to be, either.  "It is decidedly not easy to use," says recent University of Washington Masters of Science in Information Management graduate and former ReadWriteWeb researcher Emily Cunningham.  "I find their UI clumsy.  Creating streams is confusing, not intuitive at all, hard to understand."</p>

<p>But the potential here is huge.  It's a simple proposition, too:  Twitter is now an incredibly rich source of information about all kinds of topics.  "I learn about most things through Twitter," DataSift's new CEO Rob Bailey told <a href="http://vator.tv/news/2011-11-16-closer-look-at-a-ceo-datasifts-rob-bailey">Vator.tv</a> this week. "I spend more time on that platform than I do on any cable channel . . . I've learned so much more about events like the occupy movement that cable news just wasn't covering . . . and I was able to see that the RIM audience was getting more and more frustrated leading up to the outages they experienced. It's such a powerful news source."</p>

<p>Those are words that many of us can relate to, but Bailey now leads a team of engineers building a tool that aims to make it relatively simple for anyone to create filters to capture the messages about the events that we all catch wind of on Twitter.</p>

<p>Anyone who's willing to pay for some tweets, that is, even at a low low price.  Datasift is intended for businesses users.</p>

<p>Can Datasift build a business serving a broad range of business users interested in juggling those spheres of light, filtering by patterns and in turn using that information to create new levels of value?  Time will tell.  It's a very ambitious undertaking.</p>

<p><em>Disclosure: The author leads a stealth startup that works in the social media data space as well, more likely a Datasift customer than a competitor though.</em></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/what_a_tweet_can_tell_you.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/what_a_tweet_can_tell_you.php</link>
         <guid>http://www.readwriteweb.com/archives/what_a_tweet_can_tell_you.php</guid>
         <category>Data Services</category>
         <pubDate>Thu, 17 Nov 2011 00:15:56 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>New 5 Billion Page Web Index with Page Rank Now Available for Free from Common Crawl Foundation</title>
		<description><![CDATA[<p><img alt="commoncrawllogo.jpg" src="http://www.readwriteweb.com/images/commoncrawllogo.jpg" width="150" height="150" class="mt-image-none" style="" />A freely accessible index of 5 billion web pages, their page rank, their link graphs and other metadata, hosted on Amazon EC2, was announced today by the <a href="http://www.commoncrawl.org/">Common Crawl Foundation</a>.  "It is crucial [in] our information-based society that Web crawl data be open and accessible to anyone who desires to utilize it," writes Foundation director <a href="http://www.linkedin.com/in/lisagreen">Lisa Green</a> on the organization's blog.</p>

<p>The Foundation is an organization dedicated to leveraging the falling costs of crawling and storage for the benefit of "individuals, academic groups, small start-ups, big companies, governments and nonprofits."  It's lead by Gilad Elbaz, the <a href="http://www.readwriteweb.com/archives/adsense_forefather_makes_14_million_business_listi.php">forefather of Google AdSense</a> and the CEO of data platform startup <a href="http://Factual.com">Factual</a>.  Joining  Elbaz on the Foundation board is internet public domain champion <a href="http://en.wikipedia.org/wiki/Carl_Malamud">Carl Malamud</a> and semantic web serial entrepreneur <a href="http://www.novaspivack.com/">Nova Spivack</a>.  Director Lisa Green came to the Foundation by way of Creative Commons.<br />
</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30013&amp;cb=30013' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30013&amp;n=30013' border='0' alt='' /></a></p>]]>

<![CDATA[<p>The Foundation explains the scope of the project thusly.</p>

<blockquote>"Common Crawl is a Web Scale crawl, and as such, each version of our crawl contains billions of documents from the various sites that we are successfully able to crawl. This dataset can be tens of terabytes in size, making transfer of the crawl to interested third parties costly and impractical. In addition to this, performing data processing operations on a dataset this large requires parallel processing techniques, and a potentially large computer cluster.

<p>"Luckily for us, Amazon's EC2/S3 cloud computing infrastructure provides us with both a theoretically unlimited storage capacity coupled with localized access to an elastic compute cloud."</blockquote></p>

<p>The organization was formed three years ago, just now started talking about itself publicly and believes that free access to all this information could lead to "a new wave of innovation, education and research."</p>

<p>Open Web Advocate <a href="http://walkah.net/james-walker">James Walker</a> agrees:  "An openly accessible archive of the web - that's not owned and controlled by Google - levels the playing field pretty significantly for research and innovation."</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/common_crawl_foundation_announces_5_billion_page_w.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/common_crawl_foundation_announces_5_billion_page_w.php</link>
         <guid>http://www.readwriteweb.com/archives/common_crawl_foundation_announces_5_billion_page_w.php</guid>
         <category>Data Services</category>
         <pubDate>Mon, 07 Nov 2011 15:42:48 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>Twitter Starts to Monetize the Right to Repost Tweets Online</title>
		<description><![CDATA[<p><img src="http://rww.readwriteweb.netdna-cdn.com/twitter-tv.jpg"><a href="http://twitter.com">Twitter</a> is cutting deals with third-party providers of services that re-syndicate Tweets online, the company <a href="https://dev.twitter.com/blog/platform-partner-spotlight-mass-relevance-and-crimson-hexagon">announced today</a>, and the first one is Austin, Texas-based <a href="http://www.massrelevance.com/">Mass Relevance</a>.  Mass Relevance has access to the full Twitter fire hose and offers its customers a filtering, curation and display technology to add Tweets about a TV show, political campaign or other event to their web pages.</p>

<p>The potential for syndicated Tweets is big, but hopefully Twitter won't go after everyone else in the world who puts Tweets on other websites as a part of their business.  The company doesn't seem to be welcoming interested parties to license those rights either.  We've asked Twitter for comment on the prospect of enforcement of <a href="https://dev.twitter.com/terms/api-terms">the prohibition against unofficial resyndication of Tweets</a> (who said this stuff was free as the wind?) but haven't heard back from the company yet.  (Update: Twitter's comment below.)</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=30003&amp;cb=30003' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=30003&amp;n=30003' border='0' alt='' /></a></p>]]>

<![CDATA[<p>Mass Relevance looks like a cool service and it seems pretty straight forward.  There's no indication of how much the company paid for the right to resell Tweets, but they probably paid dearly.  Twitter's terms of service read "You will not attempt or encourage others to:<br />
sell, rent, lease, sublicense, redistribute, or syndicate access to the Twitter API or Twitter Content to any third party without prior written approval from Twitter."</p>

<p><em>Below: No unauthorized Tweets on TV!  An example from licensee Mass Relevance's website.</em><br />
<img alt="massrel-1.jpg" src="http://www.readwriteweb.com/images/massrel-1.jpg" width="610" height="478" class="mt-image-none" style="" /></p>

<p>Twitter as a developer platform has traveled a rocky road, lots of highs and lots of lows.  Enabling serious business use of Tweets is going to be an important next step.  Hopefully that will happen in a way that's accessible to small developers and allows them to create the fabulous things a broad ecosystem can produce better than one consisting of a limited set of high-end companies.</p>

<p>A number of the leaders of Twitter's partnership team came from Current.TV, where in the last Presidential election there was some great work done showing live Tweets with live video of Presidential debates.  Will use cases like that have to pay up in the future?  Will it be a price that won't stifle experimentation?  For what it's worth, were we all posting messages on a distributed, open source, microblogging protocol this probably wouldn't be happening.</p>

<p>Writing on the Twitter developers' blog, Twitter's Jason Costa wrote today, "Expect to see additional partnerships of this kind as we look for new ways to help everyone get the best out of Twitter."</p>

<p>ORLY? That doesn't sound like "click here to buy a license."  That sounds like Twitter is going to drive this themselves, the company is looking for new ways to help everyone get the best out of Twitter.  Isn't the lesson of a platform that no single company will ever be able to produce as much innovation as a larger ecosystem of independent developers?</p>

<p>Am I the only one feeling uncomfortable about this?</p>

<p><strong>Update:</strong> In response to this article and an email inquiry, Twitter's Jodi Olsen offered the following comment.  "We're not disclosing terms of the deal, but this is about growth--not a revenue play for us. Our goal with this partnership, which we expect to be the first of many, is to empower the ecosystem and help media grow."<br />
</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/twitter_starts_to_monetize_the_right_to_repost_twe.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/twitter_starts_to_monetize_the_right_to_repost_twe.php</link>
         <guid>http://www.readwriteweb.com/archives/twitter_starts_to_monetize_the_right_to_repost_twe.php</guid>
         <category>Data Services</category>
         <pubDate>Mon, 07 Nov 2011 12:31:20 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>How Google Calendar Could Be Smarter</title>
		<description><![CDATA[<p><img src="http://rww.readwriteweb.netdna-cdn.com/images/google_calendar_logo_jan09.png">Google Calendar <a href="http://gmailblog.blogspot.com/2011/11/suggested-times-in-google-calendar.html">announced a new feature today called Suggested Times</a>; it allows users to co-ordinate their available times to have a meeting together.  I haven't got it turned on in my account yet, but there's a screenshot below.  It looks ok.  The description of how it works isn't at all clear in the announcement.</p>

<p>The idea is great, though: <strong>calendar data as the basis of new features.</strong>  People say that your telephone contact list is your ultimate social network - and they say that your email inbox is a potent platform for potential products and services to be developed on top of.  What about your calendar, though?  The new Google Calendar feature is just the tip of the iceberg - so much more could be made possible if the data latent in our calendars were made programmatically available and turned into a platform for development.  Like what?  A few ideas are below, I'd love to learn about any other ideas you've got, too.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=29965&amp;cb=29965' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=29965&amp;n=29965' border='0' alt='' /></a></p>]]>

<![CDATA[<center><img alt="gcalsug.jpg" src="http://www.readwriteweb.com/images/gcalsug.jpg" width="528" height="359" class="mt-image-none" style="" /></center>

<p>Imagine...<br />
<ul><li>When did you last have a meeting scheduled with the same person you're scheduling something with now?  Could meeting notes or other assets be linked?</li><br />
<li>Who else might you want to invite - like Gmail's who else to email recommendation feature.</li><br />
<li>"Would you like to schedule a Google Hangout at this time?"</li><br />
<li>Data visualization of historical patterns of your schedule.  Did you know that you tend to schedule far fewer meetings on afternoons later in the week than in the mornings early in the week?  I'd like to see a visualization of my contacts' patterns of busy and free so I know when a good time to suggest a meeting might be.</li><br />
<li>Are you about to schedule something that will conflict with an event on a public events calendar and that you're likely to be interested in?  That would be awesome to be notified about.  ReadWriteWeb staff hacker Tyler Gillies has built a <a href="http://Plancast.com">Plancast</a> layer on <a href="http://Geoloqi.com">Geoloqi</a> -it triggers a push notification to peoples' phones whenever they are in a part of town where a public event with the category "nerdout" has been planned.  That's pretty awesome and paints a picture of what the future might look like.</li></ul></p>

<p><a href="https://plus.google.com/115305365013962529268/about">John Girard</a>, a Portland entrepreneur, says on Google+ that he wants to see calendar events and location integrated.  "Who is in the office and who isn't; who will dial in; who is traveling and unavailable etc.," he writes. "One of my big issues with <a href="http://tungle.com">Tungle</a> has been that it's still not good at helping me to schedule in person meetings when I am traveling. The temporal is only one aspect of meetings - geography / location (whether virtual or actual) is almost completely neglected as of yet."</p>

<div class="pullquote">See also: <a href="http://www.readwriteweb.com/archives/plancast_future_as_platform.php">he Future as Platform: Mark Hendrickson's Vision for Plancast</a> from 18 months ago!</div>Recommendations don't just have to be based on raw calendar availability, either.  "I'd imagine that someone has researched the best time and day of week to hold meetings (other than never) for given results," says Canadian open data geo geek <a href="https://plus.google.com/116298590841525009265/about">Jason Birch</a>.  "For instance, if you want a meeting to be brief hold it right before lunch or at end of day, if you want to alert participants then hold it mid-morning, etc, etc.  It would be great if this kind of research (again, assuming that it exists) were taken into account in time order as an optional sort vs earliest possible time."

<p>Some people just want Google Calendar to function like other apps do.  "I was hoping for a public events offering from Google," says gold spinner at Gravity.com <a href="https://plus.google.com/111513514746627349681/about">Robbie Coleman</a>.  "Tired of evite and not all of my friends/family have (or want) Facebook accounts."</p>

<p>Time is a factor that can be cross referenced with almost anything else to return interesting new data points.  Ought our calendars not do more for us than sit on our computers and our phones, all alone and boring?  I think they ought to.  What more would you like to see your calendar do?</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/how_google_calendar_could_be_smarter.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/how_google_calendar_could_be_smarter.php</link>
         <guid>http://www.readwriteweb.com/archives/how_google_calendar_could_be_smarter.php</guid>
         <category>Data Services</category>
         <pubDate>Thu, 03 Nov 2011 21:12:49 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>GeoIQ Releases Real-Time Streaming Social &amp; Device Data &amp; Mapping API Platform </title>
		<description><![CDATA[<p><img alt="GeoIQlogo.jpg" src="http://www.readwriteweb.com/images/GeoIQlogo.jpg" width="147" height="151" class="mt-image-none" style="" />Location data provider <a href="http://GeoIQ.com">GeoIQ</a> today <a href="http://blog.geoiq.com/2011/11/02/streaming-data/">detailed a new offering called GeoIQ Social</a>. This is a real-time streaming API that delivers location-enabled data from Twitter, Pachube-enabled sensor hardware and other platforms into a map-friendly output format that can be updated as the data changes.  Boom!</p>

<p>Sentiment analysis, user ranking, data from sensors and potentially much more can all be taken into account in requesting data from the API.  Connectors have been built for "all sorts of databases including PostgreSQL, MySQL, Oracle, HBase, and MongoDB as well as an even newer types of databases and APIs like Google Fusion Tables."  Awesome.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d.ads.readwriteweb.com/ck.php?n=29921&amp;cb=29921' target='_blank'><img src='http://d.ads.readwriteweb.com/avw.php?zoneid=14&amp;cb=29921&amp;n=29921' border='0' alt='' /></a></p>]]>

<![CDATA[<p>The company concludes its discussion of the new API by saying it intends to extend far beyond where it is today in dynamic data and dynamic mapping.  "Without giving it all away," writes Chris Helm, Head of Analytics at GeoIQ, "we're thinking along the lines of realtime analytics, dynamic event alerting and more tools for easy collaboration."</p>

<p>That sounds fabulous to me.  When physical place and the real-time social web come together in the form of streaming data APIs, the possibilities for augmenting time, place, civil society and the meaning of the web are substantial.</p>

<center><iframe src="http://player.vimeo.com/video/31266476?title=0&amp;byline=0&amp;portrait=0" width="600" height="325" frameborder="0" webkitAllowFullScreen allowFullScreen></iframe></center>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/geoiq_releases_real-time_streaming_social_data_map.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/geoiq_releases_real-time_streaming_social_data_map.php</link>
         <guid>http://www.readwriteweb.com/archives/geoiq_releases_real-time_streaming_social_data_map.php</guid>
         <category>Data Services</category>
         <pubDate>Wed, 02 Nov 2011 12:41:34 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
   </channel>
</rss>
