<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
   <channel>
      <title>Interviews - ReadWriteWeb</title>
      <link>http://www.readwriteweb.com/archives/interviews/</link>
      <description>Interviews on ReadWriteWeb</description>
      <language>en</language>
      <copyright>Copyright 2009 Richard MacManus</copyright>
      <managingEditor>readwriteweb@gmail.com</managingEditor>
      <lastBuildDate>Thu, 12 Nov 2009 14:19:58 -0800</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=4.23-en</generator>
      <docs>http://blogs.law.harvard.edu/tech/rss</docs> 

      
      <item>
         <title>Twitter Data &amp; the Future of TweetDeck</title>
		<description><![CDATA[<p><img alt="dodsworth150.jpg" src="http://www.readwriteweb.com/images/dodsworth150.jpg" ><strong>An Interview With TweetDeck Founder Iain Dodsworth</strong></p>

<p><font style="float: right; margin-left: 10px;"><script type="text/javascript"><br />
tweetmeme_url = 'http://www.readwriteweb.com/archives/the_value_of_twitter_data_the_future_of_tweetdeck.php';<br />
tweetmeme_source = 'rww';<br />
</script><script type="text/javascript" src="http://tweetmeme.com/i/scripts/button.js"></script></font>A small startup company called <a href="http://infochimps.org">InfoChimps</a> released for sale yesterday three very large sets of data extracted from 500 million Twitter messages.  Included in the offering are the senders and recipients of 1 billion @ messages, Retweets and Favorites.  We wrote in-depth about the release <a href="http://www.readwriteweb.com/archives/twitter_data_dump_infochimp_puts_1b_connections_up.php">late last night</a>.  This morning we interviewed <a href="http://twitter.com/iaindodsworth">Iain Dodsworth</a>, creator of the most popular Twitter client, <a href="http://tweetdeck.com">TweetDeck</a>, about the value he might find in that data and the direction he's aiming to take TweetDeck in the future.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=17098&amp;cb=17098' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=17098&amp;n=17098' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p><img alt="dodsworthdeck.jpg" src="http://www.readwriteweb.com/images/dodsworthdeck.jpg" ><br />
<center><img alt="tweetdeckcaption2.jpg" src="http://www.readwriteweb.com/images/tweetdeckcaption2.jpg" ></center></p>

<p><strong>Dodsworth:</strong> Straight off the bat - an archive of tweets could form the basis of a profiler and that's very interesting. Sentiment analysis (which I am ALL over) requires that kind of base corpus.</p>

<p><strong>RWW:</strong> InfoChimps isn't releasing full text yet, but they would do a custom slice if you wanted it.</p>

<p><strong>Dodsworth:</strong> It's the historical element that a large number of services are missing and where they will fall flat - analysis based on the last few hundreds tweets is almost pointless.</p>

<p><strong>RWW:</strong> I'm curious what "a profiler" might mean to you and what this data could help make possible in those terms.</p>

<p><strong>Dodsworth:</strong> For me a true profiler would be akin to the holy grail - we would analyse who a person converses with, who RTs them the most, essentially all interactions.  Then we would track activity metrics (how many tweets sent, replies) and then we would analyse language patterns (usage of certain words) to ascertain how they express themselves and pinpoint sentiment.   Off the top of my head this could lead to elements of intention prediction and I'm steering TweetDeck to have this kind of very very basic Artificial Intelligence at its heart.</p>

<p>I'm currently researching intent predicition inside high frequency trading systems and it's fascinating and could directly relate to TweetDeck and social media systems/services in general.</p>

<p>[Dodsworth's <a href="http://uk.linkedin.com/in/iaindodsworth">background</a> is in developing for financial services, at places like Prudential Financial and PricewaterhouseCoopers.]</p>

<p><strong>RWW:</strong> What would intention prediction look like in this context? On twitter?</p>

<p><strong>Dodsworth:</strong> At its most basic if TweetDeck could predict what the user was probably about to require next, based on current activity, then it could start to collate that data in the background - cross twitter/facebook/linkedin data for example.  I'm looking at it right now from a cross-service data gathering perspective where our servers do the gathering and hopefully get around the issues of API limits for example.</p>

<p>This is based on future functionality we're mapping out now which is a lot more complex than looking at someone's profile or seeing how many RTs one of your tweets has.</p>

<p>I'm thinking the scope is full social graph rather than just twitter/facebook.</p>

<p><strong>RWW:</strong> I guess I'm having a hard time imagining "what the user was probably about to require next, based on current activity, then TweetDeck could start to collate that data in the background - cross twitter/facebook/linkedin data for example" might look like.  Like, if I'm looking at a person's profile, I'd probably like to see their LinkedIn data?</p>

<p><strong>Dodsworth:</strong> Good example...or see how a certain person you're tweeting with right now stacks up against "similar" people you've spoken to - a box could pop up mid-conversation and give you a tonne of metrics on this person.  How full of [crap] are they? Are they a social media guru? Would you be wise to tell this person anything sensitive?  Based on previous language patterns, is the person you're tweeting with right now probably lying? A bit out there but possible in theory.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/the_value_of_twitter_data_the_future_of_tweetdeck.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/the_value_of_twitter_data_the_future_of_tweetdeck.php</link>
         <guid>http://www.readwriteweb.com/archives/the_value_of_twitter_data_the_future_of_tweetdeck.php</guid>
         <category>Interviews</category>
         <pubDate>Thu, 12 Nov 2009 14:19:58 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>Kiva&apos;s Causemopolitan on World Tour: Social Media for Social Good</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/berrent.jpg">It's been a long and winding road for serial volunteer and social media philanthropist <a href="http://www.thecausemopolitan.com/">Sloane Berrent</a>.</p>

<p>Since her unplanned departure from an L.A.-based startup in 2008, <font style="float: right; margin-left: 10px;"><script type="text/javascript"><br />
tweetmeme_url = 'http://www.readwriteweb.com/archives/kivas_causemopolitan_on_social_media_for_social_go.php';<br />
tweetmeme_source = 'rww';<br />
</script><script type="text/javascript" src="http://tweetmeme.com/i/scripts/button.js"></script></font>Berrent has traveled through eight countries, documenting and publicizing the struggles of those in developing areas through her blog posts, tweets, images, videos, and her own presence at events at home and abroad. From post-Katrina New Orleans to a trash dump in Manila to a monastery in Burma, read on for her story of trying to achieve social good through social media.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16577&amp;cb=16577' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16577&amp;n=16577' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p><strong>RWW: "Social media for social good" has become the catchphrase du jour, it seems. What does it actually mean; how much can social media users affect social change, and how?</strong></p>

<p>I am a strong believer in the idea that the things you do online are meant to facilitate your offline interactions. People are so fast to click a button, and that can be great. Retweeting, forwarding, and Facebook walls are great engagements. But what's more difficult is the donate button. That's the big hurdle and disconnect. I'm trying to provide these inspirational opportunities in timeboxed campaigns. Social media is slowly catching on, but there's a lot of noise. Standing out is hard; it's important to have an offline component.</p>

<p><object width="610" height="361.14"><param name="movie" value="http://www.youtube.com/v/yRShYkNb6fk&hl=en&fs=1&color1=0x5d1719&color2=0xcd311b"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/yRShYkNb6fk&hl=en&fs=1&color1=0x5d1719&color2=0xcd311b" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="610" height="361.14"></embed></object> <em>Berrent was visibly disturbed by what she witnessed at this Manila trash dump, where she saw shoeless children running through piles of debris.</em></p>

<p><strong>RWW: Tell me about your experiences with Kiva borrowers. What kinds of people and enterprises have you seen? In your opinion, does microlending have a measurable impact on struggling local economies?</strong></p>

<p>Kiva is really unique. It has a lot of power users - more than any nonprofit I've ever seen. One man has made a thousand loans. It's individual stories, and people really connect. You get updates on that person, and people say it's their favorite email of the month. As a microlending company, Kiva is one spoke in the larger wheel of microfinance. On a global scale, it has a very big impact.</p>

<p>Typically, when you go to a village or province, certain industries are prevalent. In a fishing community, maybe the borrower bought a fishnet or a fishing boat. In an area with a lot of bamboo, it's going to be crafts. I worked in eleven branch offices. I met over 40 different female borrowers individually and over 250 in my time there.</p>

<p>I can see that the money Kiva provides makes a difference. Microfinance is a very slow process, and there are gems and sparks of people who break through the poverty cycle. When you see villages changing, it's really something. It's like watching grass grow, but it's really beautiful grass.</p>

<p><img src="http://www.readwriteweb.com/kiva2.jpg"> <em>This woman is a pig farmer and a recipient of funds from a <a href="http://www.kiva.org/about/aboutPartner?id=124">Kiva-affiliated organization</a>.</em></p>

<p><strong>RWW: Now you're working on a <a href="http://causeitsmybirthday.com/">seven-day, seven-city tour</a> to raise awareness and funds for malaria prevention through bed nets. Where did this idea come from?</strong></p>

<p>It's a city-by-city competition on who can raise the most money for malaria nets, but also an <a href="http://causeitsmybirthday.com/donate.html">opportunity for anyone to donate</a> who wants to get involved. The tour starts this Saturday night in New York City and continues for the next seven days in Miami, New Orleans, Chicago, Seattle, San Francisco, and ends in Los Angeles on Friday...</p>

<p>I'd just finished Kiva training, and I was going to the Philippines for three months. And all I could think was, "When I come back, I'm going to be <em>thirty</em>." I've honed in a lot on my direction - using the Internet to help people. And what if I could use this opportunity to give back, involving people in different parts of the country - something really ambitious?</p>

<p>I wanted it to be about saving lives. I wanted to say, "I saved this many lives on my birthday." I've done a lot of work in HIV and AIDS; I looked into that and polio and malaria, and that's what stuck with me. The campaign has no administrative fees. One hundred percent of the funds go to malaria... in rural northern Ghana. Providing malaria nets will really be a part of saving lives there.</p>

<p><img src="http://www.readwriteweb.com/kiva1.jpg"> <em>Berrent <a href="http://www.thecausemopolitan.com/teaching-of-the-buddhas/">met this monk</a> in Burma and spent the afternoon pagoda-hopping with him.</em></p>

<p><strong>RWW: What needs or gaps do you see in philanthropic efforts online?</strong></p>

<p>I think it's not having a strategy to begin with, not knowing the tools in your toolbox before you start. There's a lot to be said for jumping in and having fun, but nonprofits don't have the resources to play around online. They think it's about getting interns and getting followers and fans without figuring out why a medium is important and how to make it successful for them.</p>

<p><strong>RWW: What's one surprise - good or bad - that you've come across since you started working with Kiva? What did you not expect from this experience, and what did you learn?</strong></p>

<p>I learned that it's much more complicated than the website makes it seem. There's an entire division devoted to foreign exchange currency. The operational cost analysis, the challenges of technology in the developing world, the processes of remittance - it's incredibly complex. There are regional specialists. On the site, you can make a loan in five clicks, but a lot of machinery comes together to make it that way.</p>

<p><strong>RWW: What's next for you? Is there more globe-trotting in your immediate future? How do you think the web will continue to be part of your life and career?</strong></p>

<p>One of the best parts of this past year has been that I've gone through long periods where I didn't have Internet access. That's brought me a heightened and renewed sense of my purpose in the world and my authentic desire to make the world a better place. I'd like to be able to continue to support campaigns - even for-profit ventures - that I believe in, and I think social business is a wonderful intersection of the two.</p>

<p>I want to explore avenues with online and offline components, while continuing to blog and tell stories I'm passionate about.</p>

<p><img src="http://www.readwriteweb.com/kiva3.jpg"> <em>Follow Berrent's next adventures on <a href="http://twitter.com/sloane">Twitter</a> or at <a href="http://www.thecausemopolitan.com/">her blog</a>.</em></p>

<p>And all this is just the tip of the iceburg that is Sloane Berrent's fascinating story. For a fuller look at her travels and timeline, check out this list of her <a href="http://www.thecausemopolitan.com/9-favorite-posts-of-the-past-6-months/">nine favorite posts</a> on her blog, The Causemopolitan, covering humanitarianism, her work in New Orleans, the phenomenon of serendipity in international travel, and much more.</p>

<p>Many thanks to Sloane Berrent for the use of her <a href="http://www.youtube.com/user/sloaneberrent">videos</a> and <a href="http://www.flickr.com/photos/answerwithaction/">images</a> as well as for sharing her story with us and our readers.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/kivas_causemopolitan_on_social_media_for_social_go.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/kivas_causemopolitan_on_social_media_for_social_go.php</link>
         <guid>http://www.readwriteweb.com/archives/kivas_causemopolitan_on_social_media_for_social_go.php</guid>
         <category>Interviews</category>
         <pubDate>Tue, 29 Sep 2009 22:16:49 -0800</pubDate>
<author>Jolie O&apos;Dell</author>
      </item>
      
      <item>
         <title>How Tim O&apos;Reilly Aims to Change Government</title>
		<description><![CDATA[<center><img alt="oriellyingermany.jpg" src="http://www.readwriteweb.com/images/oriellyingermany.jpg" width="610" height="403"></center>Some people go to Washington to try to make the government more honest; others try to make it smaller.  Technologist Tim O'Reilly is spending time in Washington, and bringing Washington officials to San Francisco, to do something different - perhaps something more realistic.  O'Reilly is trying to help government become a platform for innovation.  A "government as platform" would supply raw digital data and other forms of support for private sector innovators to build on top of.

<p>Tim O'Reilly is a publisher of technical books, the organizer of a series of conferences on diverse topics, an investor in web startup companies and smart electrical grid technologies.  He's credited with shepharding the term Web 2.0 into public consciousness and he regularly uses his extensive influence to call on technologists to "do something worthy," especially in the face of ecological and political crisis.  Now he's brokering meetings of Obama administration officials and bleeding-edge geeks.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16122&amp;cb=16122' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16122&amp;n=16122' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>"What I learned when I went to Washington," O'Reilly told me by phone as he drove down a California highway earlier this month, "was how much the dialogue is determined by the companies that go there."  O'Reilly is a man in the habit of helping determine dialogue around important issues and the opportunity the Obama administration offers to change government is no different.  "[Google CEO] Eric Schmidt told me - 'tell a big story - talk to people and then share what you've learned.'"</p>

<p>O'Reilly is talking to people, but he's helping people talk to eachother as well.  He's introducing officials like Vivek Kundra, the new CIO of the Federal government, and Federal CTO Aneesh Chopra to ground-breaking hackers like geek rennaisance man <a href="http://factoryjoe.com">Chris Messina</a> and <a href="http://ycombinator.com">YCombinator</a> founder Paul Graham.  He's bringing together geospatial visionaries and the government officials that provide them the GPS data they work with.  </p>

<p>"What I've learned from all these conversations," O'Reilly says,"is about government as a platform.  It's not just social media use by government, or government using wikis.  No, it's something more profound. How do you think like a platform provider?  We've moved our government from a lean vehicle for collective action, and over the last 200 years it has become so strong that it's now 40% of GDP. I want to go back to the original vision of the role of government: a convener of things that we as individuals and companies can't do alone. Standard setting, pilot programs; government providing enabling technologies for citizens to serve themselves.</p>

<p>"This morning we did a call with the White House and some geohackers, talking about what's wrong with government geodata now and how could it be fixed.  The government people said we need to translate this into real projects that will appeal to politicians. 'If you fix this kind of geodata then we'll be able to provide this service - street safety, education attainment, public policy objectives,' was what they wanted to hear from the hackers.  It's really about social innovation, building better tools for us as a nation to use technology to focus on real problems."</p>

<p>Healthcare, education and innovation policy are the three sectors O'Reilly says have the most momentum when it comes to government as platform.</p>

<p>"The old model," O'Reilly argues, "said we'll build services ourselves or we'll make deals with a few prefered providers that we'll then offer to our customers.  This is very similar to what we saw recently in the cell phone market. Rather than providing all the apps themselves, Apple provided a platform and said to developers 'go build on it.' That's where I think the government is trying to go. Instead of offering a website, here's an API [application programming interface]. Can we spark innovation against what we're doing? It's not about picking a provider or partner and then your conduit to the private sector is them, instead its about evangelizing your platform so far more people develop on top of it."</p>

<p>"There are absolutely other companies coming to Washington and saying otherwise, to stick with old model," he says,"but there's an opportunity for government to say if people want to build services on this then we need the data we make public to be granular and timely. We should not be publishing updates once a month.  Real time, local, responsive to users - those are new thinking for government.  It's just like the 90's when government was discovering websites, now they are discovering web services and we're saying this is what they need to look like."</p>

<p>That conversation will become very public when O'Reilly hosts the <a href="http://www.gov2summit.com/">Gov 2.0 conference</a> next month.  The lineup of geeks and people from the government is already intruiging and O'Reilley has said that some of the holes in the schedule are placeholders for very high-profile speakers who haven't yet sent final confirmation.</p>

<p>"Vivek [Kundra, US CIO] says he wants to make working for government sexy," O'Reilly says.  "It's a huge part of our economy and there's a lot of opportunity for entreprenuers. Why are we letting beltway bandits get away with overchanging government to do work?  We're missing opportunities to get our best thinking into government planning."</p>

<p>Making work for the government sexy is going to be a very big challenge.  If there's a person and a paradigm that just might be able to do it, though, Tim O'Reilly and this vision of "government as platform" might be the right combination.</p>

<p><em>Photo: <a href="http://www.flickr.com/photos/adders/2962853002/">O'Reilly at Web 2.0 Expo Berlin</a>, by Adam Tinworth</em></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/how_tim_oreilly_aims_to_change_government.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/how_tim_oreilly_aims_to_change_government.php</link>
         <guid>http://www.readwriteweb.com/archives/how_tim_oreilly_aims_to_change_government.php</guid>
         <category>Interviews</category>
         <pubDate>Thu, 20 Aug 2009 13:44:01 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>ReadWriteWeb Interview With Tim Berners-Lee, Part 2: Search Engines, User Interfaces for Data, Wolfram Alpha, And More...</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/tbl_may08.jpg" />In part 2 of my one-on-one interview with Tim Berners-Lee, we explore a variety of topics relating to Linked Data and the Semantic Web. If you missed it, in <a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">Part 1 of the interview</a> we covered the emergence of Linked Data and how it is being used now even by governments. </p>
<p><font style="float: right; margin-left: 10px;"><script type="text/javascript">
tweetmeme_url = 'http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php';
tweetmeme_source = 'rww';
</script><script type="text/javascript" src="http://tweetmeme.com/i/scripts/button.js"></script></font>In Part 2 we discuss: how previously reticent search engines like Google and Yahoo have begun to participate in the Semantic Web in 2009, user interfaces for browsing and using data, what Tim Berners-Lee thinks of new computational engine Wolfram Alpha, how e-commerce vendors are moving into the Linked Data world, and finally how the Internet of Things intersects with the Semantic Web.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=15658&amp;cb=15658' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=15658&amp;n=15658' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<h2>Semantic Web and Search Engines Like Google, Yahoo</h2>
<p><em>RWW: You've been talking about the Semantic Web for many years now. Generally the view is that Semantic Web is great in theory, but we're still <a href="http://www.readwriteweb.com/archives/rdf_semantic_web_apps.php">not seeing a large number of commercial web apps that use RDF</a> (we've seen a number of scientific or academic ones). However we have <a href="http://www.readwriteweb.com/archives/understanding_the_new_web_era_web_30_linked_data_s.php">begun to see some traction with RDFa</a> (embedding RDF metadata into XHTML Web content), for example <a href="http://www.readwriteweb.com/archives/search_options_google_search_evolves.php">Google's Rich Snippets</a> and <a href="http://www.readwriteweb.com/archives/semtech_making_the_web_searchable_searchmonkey.php">Yahoo's SearchMonkey</a>. Has the takeup of RDFa taken you by surprise?</em></p>
<p>TBL: Not really, but the takeup by the <strong>search engines</strong> is interesting. In a way I was happy to see that, it was a milestone for those things to come out of the search engines. The search engines had typically not been keen on the Semantic Web - maybe you could argue that their business is making order out of chaos, and they're actually <em>happy</em> with the chaos. And if you provide them with the order, they don't immediately see the use of it. </p>
<div class="pullquote">
<p>"The search engines have not been keen on the Semantic Web [...] their business is making order out of chaos, and they're actually happy with the chaos."</p>
</div>
<p>Also I think there was misunderstanding in the search engine industry that the Semantic Web meant metadata, and metadata meant keywords, and keywords don't work because people lie. Because traditionally in information retrieval systems, keywords haven't proven up to the task of finding stuff on the Web. One of the reasons is that people lie, the other is that they can't be bothered to enter keywords. So keywords have gotten a bad reputation, then metadata in general was tarred with this 'keywords don't work' brush. Because a lot of Semantic Web data included metadata, then people thought that with Semantic Web data -- again, that people will lie and won't have the time to produce it. </p>
<p><img src="http://www.readwriteweb.com/images/rich-snippets.png" /><br />
<em>Google rich snippets example; image credit: Matt Cutts</em></p>
<p><em>Now</em> I think there's a realization that when you're putting data online, that people are motivated NOT to lie. For example when your band is going to produce its next album, or when your band is going to play next downtown, you're motivated to put that information up there on the Semantic Web. There's an awful lot of cases when actually data is really important to people; and it's on the web anyway. So I think it's great that some of the search engine companies are starting to read RDFa. </p>
<p>Does this mean that they [search engines] will start to absorb the whole RDF data model? If they do, then they will be able to start pulling all of the linked data cloud in. </p>
<div class="pullquote">
<p>"The web of linked data and the web of documents actually connect in both directions, with links."</p>
</div>

<p>Will they know what to do with it? Because when it's data in a very organized form, I think some people have been misunderstanding the Semantic Web as being something that tries to make a better search engine - i.e. when you type something into a little box. But of course the great thing about the Semantic Web is that you can query it, you can ask a complicated query of the Semantic Web, like a SQL query (we call it a SPARQL query), and that's such a different thing to be able to do. It really doesn't compare to a search engine. </p>
<p>You've got search for text phrases on one side (which is a useful tool) and querying of the data on the other. I think that those things will connect together a lot. </p>
<p>So I think people will search using a search text engine, and find a webpage. On the front of the webpage they'll find a link to some data, then they'll browse with a data browser, then they'll find a pattern which is really interesting, then they'll make their data system go and find all the things which are like that pattern (which is actually doing a query, but they'll not realize it), then they'll be in data mode with tables and doing statistical analysis, and in that statistical analysis they'll find an interesting object which has a home page, and they'll click on that, and go to a homepage and be <em>back</em> on the Web again. </p>
<p>So the web of linked data and the web of documents actually connect in both directions, with links.</p>

<h2>User Interfaces for Semantic Content</h2>
<p><em>RWW: At the recent SemTech conference, <a href="http://www.readwriteweb.com/archives/the_state_of_the_market_in_semantic_technologies.php">Tom Tague of Thomson Reuters' Calais project suggested</a> that user interfaces for semantic content are key in getting more take-up. With that in mind, I wonder if you've seen some great interfaces or designs for semantic applications in recent months - if so which ones and why did they impress you?</em></p>
<p>TBL: I think that whole area is very exciting at the moment. The only piece of hacking I've done over the past few years has been on a thing called <a href="http://www.w3.org/2005/ajar/tab">the Tabulator</a> [a data browser and editor], which is addressing exactly that. Partly because I wanted to be able to look at this data. And now there are lots of different ways that people need to be able to look at data. You need to be able to <strong>browse through it</strong> piece by piece, exploring the world of data. You need to be able to look for <strong>patterns</strong> of particular things that have happened. Because this is data, we need to be able to use all of the power that traditionally we've used for data. When I've pulled in my chosen data set, using a query, I want to be able to do [things like] maps, graphs, analysis, and statistical stuff. </p>
<p><img src="http://www.readwriteweb.com/images/tabulator_july09.jpg" /><br />
<em>W3C Tabulator, a data browser/editor; Image credit: <a href="http://www4.wiwiss.fu-berlin.de/bizer/d2r-server/publishing/">wiwiss.fu-berlin.de</a></em></p>
<p>So when you talk about user interfaces for this, it's really very very broad. Yes I think it's important. There's also the distinction we can make between the <strong>generic interfaces</strong> and the <strong>specific interfaces</strong>. </p>
<p>There will always be specific interfaces; for example if you're looking at calendar data, there's nothing else like a calendar that understands weeks, months and years. If you're looking at a genome, it's good to have a genetics-specific user interface. </p>
<div class="pullquote">
<p>"I want to be able to do maps, graphs, analysis, and statistical stuff."</p>
</div>
<p>However you also need to be able to connect that data, through generic interfaces. So if my genome data was taken during an experiment which happened over a particular period, I need to be able to look at that in the calendar - so I can connect the genetics to the calendar. </p>
<p>So one of the things I hope to see is domain-specific things for various different domains, <em>and</em> the generic user interfaces. And hopefully the generic interfaces will be able to tie together all of the domains.</p>

<p><b><em>Next Page: Wolfram Alpha; e-Commerce and Linked Data</em></b></p>

<!--nextpage-->

<h2>Wolfram Alpha and Natural Language Interfaces</h2>
<p><em>RWW: An interesting new product was <a href="http://www.readwriteweb.com/archives/wolframalpha_our_first_impressions.php">launched this year</a> called <a href="http://www.wolframalpha.com/">Wolfram|Alpha</a>, described as a 'computational knowledge engine.' It's kind of a mix between Google (search) and Wikipedia (knowledge), and its key attribute is that enables you to compute something. The founders think that 'computing' things on the fly is something we're going to see a lot of in future. What's your take on Wolfram|Alpha?</em></p>
<p>TBL: There are two parts to that sort of technology. One of them is a sort of stilted natural language interface. We've seen those sort of natural language queries for years. Boris Katz [from W3C] created a system called <a href="http://start.csail.mit.edu/start-system.html">START</a> <em>[a software system designed to answer questions that are posed to it in natural language]</em>. I think with the Semantic Web out there, those sorts of interfaces are going to become important, very valuable, because people will be able to ask more complicated things. The search engine has traditionally been limited to just a phrase, but some of the search engines are now starting to realize that  if they put data behind them and have computation engines, then you can ask things like 'what's this many pounds in dollars?' and so on. So yes, those interfaces will become important. </p>
<div class="pullquote">
<p>"Those sorts of interfaces will become important [...] people will be able to ask more complicated things."</p>
</div>

<p>Conversational interfaces have always been a really interesting avenue. We've had voice browser work in W3C, that has been an interesting alternative avenue. It's possible that as compute power goes up, we'll see a prolifieration of machines capable of doing voice. It'll move from the mainframe to being able to run on a laptop or your phone. As that happens, we'll get actual voice recognition and pattern natural language at the front end. That will perhaps be an important part of the Semantic Web. </p>
<p><img src="http://www.readwriteweb.com/images/wolfram_football.jpg" /></p>
<p>We talked before about what a great challenge the Semantic Web is going to be from a user interface point of view. Conversational interfaces are going to be part of [solving] that. Of course it's also going to be really valuable to have compositional interfaces - for the visually impaired and so on. </p>
<p>Wolfram|Alpha is also a large curated database of data sets. Obviously I'm interested in the big data set which is out there, which is Linked Data. This everybody can connect to. I don't really know a lot about the internals of Wolfram|Alpha's data set. I don't know whether they're likely to put any of it out on the web as Linked Data - that might be an interesting addition. I imagine that quite a lot of it may have come <em>from</em> the web of Linked Data.</p>

<h2>e-Commerce and Linked Data</h2>
<p><em>RWW: There have been <a href="http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html">reports recently</a> that both Google and Yahoo will be supporting the Good Relations ontology and linked data for e-commerce. Companies such as Best Buy are already putting out product information in RDFa. What would be your advice to e-commerce vendors right now, to help them transition to this world of structured data on the Web. The same question could be asked across many verticals, but e-commerce seems like one area which has some momentum right now. Would you advise them just to put out their data as Linked Data?</em></p>
<p>TBL: Yup! Certainly this year is the year to do it. I've been advising governments to do it and when you look at an enterprise, you find that a lot of the issues are the same. But when you put your data from government or enterprise out there, make sure you don't disturb existing ecosystems. Don't threaten those systems, because you've spent years building them up.  </p>

<p>Maybe there's an analogy with when the Web first started and the first bookshops went online. They were more or less a flyer, saying 'hey we have a great bookshop at 23 Main St, come on down!'. Let's say that a person named Joe owned one of these early online bookshops. If somebody had suggested to Joe that he should put his catalog online, Joe would've felt that that was very proprietary data. And he'd be worried that other bookshops would see where he was weak, so they'd be able to advertise themselves as filling that niche he's weak in. </p>
<div class="pullquote">
<p>"When you put your data out there, make sure you don't disturb existing ecosystems."</p>
</div>

<p>But when his competitors Fred and Albert put their catalogs online, then Joe can check which books people are browsing at Fred and Albert's websites. So Joe would [finally] be pursuaded to put his book catalog up online. But he doesn't put up the prices... until Albert and/or Fred does. And even if catalog and pricing is up there, <em>nobody</em> puts their stock levels online. And there was a period of time when nobody [i.e. online booksellers] had their stock levels up. But people got fed up with ordering stuff that wasn't in stock. So the first book shop to actually tell you about stock levels suddenly was then unbelievably attractive to its customers. </p>
<p>So there's this syndrome of <strong>progressive competitive disclosure</strong>. This happens when people realize that if you're going to do business with somebody, if you're going to have your partners up and down the supply chain, really it's useful to check the data web - and life goes much more quickly and open. </p>

<p>Best Buy may be what starts the ball rolling [among e-commerce vendors]. Now if I want to look out for what [products are] available, I can write a program to see what there is. If somebody wants to compete with Best Buy, to my program they'll be invisible unless they can get their data up in RDF. Doesn't matter whether they use RDFa or RDF XML, as long as it maps in a standard fashion to the RDF model, then they will be visible.</p>

<p><b><em>Next Page: Internet of Things; Conclusion</em></b></p>

<!--nextpage-->

<h2>The Internet of Things</h2>
<p><em>RWW: I'm fascinated by how the Internet is becoming more and more integrated into the real world. For example the <a href="http://www.readwriteweb.com/archives/5_companies_building_the_internet_of_things.php">Internet of Things</a>, where everyday objects become <a href="http://www.readwriteweb.com/archives/pachube_internet-enabled_environments.php">Internet connected via sensors</a>. Have you been following this trend closely too, and if so what impact do you think this will have on the Web in say 5 years time?</em></p>
<p>TBL: It connects very much with Semantic Web [and] with linked data. With Linked Data you've got the ability to give a thing a URI. So I can give a URI to my phone, and I can say <em>that's</em> my phone in Linked Data. And also the company that made it can give a URI to the model of the phone. They can also put online all the specs of the phone, and then I can make a link to say that my phone is an example of that product. So now any system which is dealing with me and has access to that data will be able to figure out the sorts of things I can do with my phone, which actually is really valuable. Especially if the phone breaks. </p>
<div class="pullquote">
<p>"The Semantic Web is a web of things, conceptually. Tying an actual thing down to a part of the web is the last mile."</p>
</div>

<p>The Semantic Web has already given URIs to things, and to types of things. When the things themselves have an RFID chip in them, then I think it's a very exciting world. One can take that RFID chip, go to the Internet and find out the data about the thing. Whether we'll be able to do that, whether the manufacturers will be open enough to <em>allow me</em> to turn data about the identifier of the thing into data <em>about the thing</em>, is yet to be seen. But it's a very exciting idea. </p>
<p><img src="http://www.readwriteweb.com/images/pachube.png" /><br /><em><a href="http://www.pachube.com/">Pachube</a>, an example of the Internet of Things (see <a href="http://www.readwriteweb.com/archives/pachube_internet-enabled_environments.php">ReadWriteWeb profile</a>)</em></p>
<p>Similarly, I'd like to be able to scan a barcode and get back nutritional information about what's in - for example - a can of food. But we don't have that yet. To get that sort of thing, which is very powerful, we need to build look-up systems, which allow you to translate an RFID code or a barcode into an HTTP address. </p>
<p>The Semantic Web is a web of things, conceptually. Tying an actual thing down to a part of the web is the last link - the last mile. Give the thing a notion of its own identity in the web.</p>


<h2>Conclusion</h2>
<p>RWW: The over-riding message in both <a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">Part 1</a> and 2 of our interview with Tim Berners-Lee, is for companies and organizations to <strong>make their data available online</strong>. Preferably as Linked Data, which uses a subset of Semantic Web technologies. But Berners-Lee noted, in Part 1 of our interview, that he'd even be happy with the data in CSV (comma separated values) format. </p>
<p>It's clear that we've seen a lot of progress in linked data already in 2009. In upcoming posts on ReadWriteWeb, we'll continue to track this trend and explain how organizations can contribute their data.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php</link>
         <guid>http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php</guid>
         <category>Interviews</category>
         <pubDate>Thu, 09 Jul 2009 06:00:00 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
      <item>
         <title>ReadWriteWeb Interview With Tim Berners-Lee, Part 1: Linked Data</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/tbl_may08.jpg" />During my recent trip to Boston, I had the opportunity to visit MIT. At the end of a long day of meetings with various MIT tech masterminds, I made my way to the funny shaped building (see photo right-below) where the World Wide Web Consortium (W3C) and its director Tim Berners-Lee work. <font style="float: right; margin-left: 10px;"><script type="text/javascript">
tweetmeme_url = 'http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php';
tweetmeme_source = 'rww';
</script><script type="text/javascript" src="http://tweetmeme.com/i/scripts/button.js"></script></font>Berners-Lee is of course the man who invented the World Wide Web 20 years ago.</p> 
<p>This was my first meeting with the Web's creator, whose work and philosophy was a direct inspiration for me when I launched ReadWriteWeb back in 2003.<sup>1</sup></p>
]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=15639&amp;cb=15639' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=15639&amp;n=15639' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p><img src="http://www.readwriteweb.com/images/w3c_building.jpg" align="right" />After shaking hands, I told Tim Berners-Lee that this blog's name was in part inspired by the first browser, which he developed, called &quot;<em>WorldWideWeb</em>&quot;. That was a read/write browser; meaning you could not only browse and read content, but  create and edit content too. It was a shame then when Mosaic, a read-only browser, became the first mainstream web browser in the mid-90s. It wasn't until the rise of Web 2.0 that the read/write philosophy gained widespread acceptance.<sup>2</sup> On that note, we launched into the interview... </p>
<p><strong>Note:</strong> the interview will be published in two parts, with Part 1 today on the topic of Linked Data. Part 2 will explore other topics and will run tomorrow.</p>

<p><strong>UPDATE: <a href="http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php">Part 2 of this interview</a></strong> is now available</b>.</p>
<h2>How Linked Data Relates to The Semantic Web</h2>
<p><em>RWW: Earlier this year you gave <a href="http://www.readwriteweb.com/archives/linked_data_is_blooming_why_you_should_care.php">an inspiring talk at TED about Linked Data</a>. You described Linked Data as a sea change akin to the invention of the WWW itself - i.e. we've gone from a Web of documents to a Web of data. Can you please explain though how Linked Data relates to the Semantic Web, is it a subset of it?</em></p>
<p>TBL: They fit in completely, in that the linked data actually uses a small slice of all the various technologies that people have put together and standardized for the Semantic Web. </p>
<div class="pullquote">
<p>Linked Data uses a small slice of the technologies that make up the Semantic Web.</p>
</div>
<p>We started off with the Semantic Web roadmap, which had lots of languages that we wanted to create. [However] the community as a whole got a bit distracted from the idea that <em>actually</em> the most important piece is the interoperability of the data. The fact that things are identified with URIs is the key thing. </p>
<p>The Semantic Web and Linked Data connect because when we've got this web of linked data, there are already lots of technologies which exist to do fancy things with it. But it's time now to concentrate on getting the web of linked data out there. </p>
<p><img src="http://www.readwriteweb.com/images/tbl_rgm_july09.jpg" /><br />
<em>Web inventor Tim Berners-Lee and ReadWriteWeb founder Richard MacManus</em></p>
<h2>How Linked Data Has Evolved via Grassroots</h2>
<p><em>RWW: Linked Data has had a lot of grassroots support, which you mentioned in your TED speech. This is something Semantic Web technologies, such as RDF, have struggled to get over the years. Has the W3C been pushing the more bottom-up Linked Data world, because of the frustration over <a href="http://www.readwriteweb.com/archives/rdf_semantic_web_apps.php">lack of take-up of top-down Semantic Web</a>?</em></p>
<p>TBL: A lot of the initial RDF and OWL projects came out of the academic world; and some of them were projects to show what you could do in a closed world. And the files were zipped up and left on a disc. While they were interesting projects, and while the systems were useful systems, the semantic web community maybe missed the point of the 'web' bit and  focused too much on the 'semantic'. However the work that's been done in the Semantic Web, the standards, was really valuable. It's relatively recently for example that <a href="http://www.w3.org/TR/rdf-sparql-query/">SPARQL</a> [an RDF query language] has been developed. </p>
<div class="pullquote">
<p>"It's time now to concentrate on getting the web of linked data out there."</p>
</div>
<p>Somebody drew an analogy the other day: can you imagine trying to promote a world of databases without SQL? Even though it's not an interoperable protocol, it's just a query language. So similarly, all that's been put into RDF, rdfs and OWL is very valuable to the linked data community. </p>
<p>The Linked Data community tend to use a subset of that [Semantic Web technologies], of OWL for example. But they certainly use SPARQL. So you could argue that really it wasn't ready to be deployed widely. </p>
<p>Linked Data started as a very informal <a href="http://www.w3.org/DesignIssues/LinkedData.html">Design Issues note</a> that I put in; it was a grassroots movement from very early on. So <em>yes</em> W3C has been emphasizing the importance of Linked Data. It's been the <a href="http://www.w3.org/2001/sw/interest/">Semantic Web Interest Group</a> of course, and various [other Semantic Web] activities, which has been pushing it. But also Linked Data has been <em>seized on</em> - a group of people for example put together <a href="http://dbpedia.org/">DBpedia</a>.<sup>3</sup> That wasn't commissioned, that was that they just thought it would be a really cool idea.</p>
<p><img src="http://www.readwriteweb.com/images/web_data_apr09c.jpg" /><br />
<em>Graph of Linked Data sets on the Web, as at March 2009</em></p>
<h2>Linked Data and Governments<br />
</h2>
<p><em>RWW: In <a href="http://www.w3.org/DesignIssues/GovData.html">a recent Design Issues note</a>, you urge <a href="http://www.readwriteweb.com/archives/open_goverment_berners-lee_and_the_uk_to_show_obam.php">governments to put their data online</a> as Linked Data (although you'd also be happy for governments to just make available the raw data - presumably so that others can then structure it). What do you realistically expect, for example, the U.S. or U.K. governments to do over the next year? And in the near future, do you foresee different governments interconnecting their Linked Data sets? </em></p>
<p>TBL: One can't generalize, governments are (like most big organizations) fascinatingly diverse inside them. So you'll find that there are places inside governments where you get a champion who <em>gets</em> linked data and who's just written a script and produced some linked data. So in the UK government for example, you'll find there's RDFa [in the code of its website] for civil service jobs. So if somebody wants to make a database of all the jobs, they can do that very easily. </p>
<div class="pullquote">
<p>"The first step of actually putting the data out there is the one that nobody else can do."</p>
</div>
<p>There are other cases where the easiest thing for somebody to do is to just put data up in whatever form it's available. Comma separated values (CSV) files are remarkably popular. They're exported sometimes from spreadsheets. It's remarkable how much information is in spreadsheets. Or sometimes pulled out of a database and then put up on the web. It's not as good, not as useful to the community, as if Linked Data had been put up there and linked. But the first step of actually putting the data out there is the one that nobody else can do. </p>
<p><img src="http://www.readwriteweb.com/images/Data.govscreen.jpg" /><br />
  <em><a href="http://data.gov/">Data.gov</a>, a catalog of public data, was <a href="http://www.readwriteweb.com/archives/datagov_finally_launches_looks_nice_but_short_on_d.php">launched in May by the U.S. government</a></em></p>
<p>The way to go is for government departments to go the extra step and convert [their data] into Linked Data. One of the nice things about Linked Data, when they have a pile of it, is that they could run a SPARQL server on it. SPARQL servers are a commodity product,  a solution for all of the people who say 'but actually I wanted to have XML.' A SPARQL server will  generate an XML file [and] allow somebody to write out, effectively, a URL for the XML file. </p>
<div class="pullquote">
<p>"Linked Data is the backplane, it's the thing that you connect to in both directions."</p>
</div>
<p>In fact, I don't see why SPARQL servers shouldn't provide CSV files, something which as far as I know isn't in the standards. But I'd recommend it, certainly in government context, because CSV files are what people have and what people want. </p>
<p>So the message [for government] is to use RDF. Linked Data is the <a href="http://en.wikipedia.org/wiki/Backplane">backplane</a>, it's the thing that you connect to in both directions. As a [web] producer your job is to make sure that you produce Linked Data one way or another. And as a consumer, there are lots of ways to consume that data once it's out there as Linked Data. </p>
<p><em><strong>In <a href="http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php">Part 2 of this interview</a></strong> we discuss: how previously reticent search engines like Google and Yahoo have begun to participate in the Semantic Web in 2009, user interfaces for browsing and using data, what Tim Berners-Lee thinks of new computational engine Wolfram Alpha, how e-commerce vendors are moving into the Linked Data world, and finally how the Internet of Things intersects with the Semantic Web. <strong><a href="http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php">Read Part 2 here</a></strong>.</em></p>
<p><strong>Footnotes:</strong></p>
<p>1. The <a href="http://www.readwriteweb.com/archives/the_readwrite_w.php">very first sentence written on this blog</a>, on 20 April, 2003, was: &quot;The World Wide Web in 2003 is beginning to fulfill the hopes that Tim Berners-Lee had for it over 10 years ago when he created it.&quot;</p>
<p>2. For more on read/write browsers, you can read another early RWW post entitled <a href="http://www.readwriteweb.com/archives/what_became_of.php">What became of the Browser/Editor</a>.</p>
<p>3. DBpedia is a community project to extract structured information from Wikipedia; see <a href="http://www.readwriteweb.com/archives/where_to_find_open_data_on_the.php">ReadWriteWeb's profile</a> of this and similar resources.</p>
]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php</link>
         <guid>http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php</guid>
         <category>Interviews</category>
         <pubDate>Wed, 08 Jul 2009 06:00:00 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
      <item>
         <title>How Mathew Ingram Manages a News Site That Gets 5,000 Comments a Day</title>
		<description><![CDATA[<p><img alt="ingramgoodpic150.jpg" src="http://www.readwriteweb.com/images/ingramgoodpic150.jpg" ><a href="http://www.mathewingram.com/work/">Mathew Ingram</a> is the Communities Editor at the Toronto-based <a href="http://www.theglobeandmail.com/">Globe And Mail</a>, Canada's biggest newspaper. He's a traditionally-trained reporter, but he's got years of experience blogging and using experimental new services, so he has one foot planted firmly in each world.   We interviewed Mathew as part of our first premium report, <a href="http://www.readwriteweb.com/reports/">The ReadWriteWeb Guide to Online Community Management</a>, where you'll find interviews and gleaned wisdom from 40 top experts in the field.  The following is an excerpt from that interview that we thought would be of general interest to readers; it's about online community, transitioning from traditional to social media and it's about Twitter (what isn't these days?). We hope you enjoy it.</p>

<p>This is historically important stuff.  "The transition from one-way to two-way media is not something that newspapers are used to doing," Ingram told us. "It's a big change." </p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=15323&amp;cb=15323' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=15323&amp;n=15323' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>This is such a big change and Ingram is doing such a good job of making the most of it that when Jennifer Preston was appointed <a href="http://www.readwriteweb.com/archives/nytimes_appoints_social_media_editor.php">the first Social Media Editor of the New York Times</a> last month, several observers (us included) recommended that she look to Ingram's work for inspiration.</p>

<p>"The earliest version of community we had was comments on news stories," Ingram told us. "For anyone who runs a blog, you take that for granted; but for us, that was a big step. We were the first newspaper to do that in 2005. It crept up for us; there weren't that many people commenting. Now we're getting five, six, seven thousand comments a day. On good or bad days we can get up to ten thousand comments.  [We're sure he gets help dealing with all of those!]</p>

<p>"I like to call that community 1.0 or 1.5, because they all just sit in a big heap at the bottom of the story. It's like a petri dish of a community; it's little micro-organisms that could become community. You see people who reply to each other, good and bad commenters who return, people who assist each other. One thing I want to encourage more is writers responding to comments and using comments as a resource. That's commenting 2.0, I think. </p>

<p>"Community is great because it makes people feel good, democratizes the process, but also delivers value. One of our writers wrote a story, and the comments pointed out that she only talked to one guy about one aspect of the story. She said 'I read the comments and thought F*!@ you. I wrote a story. Go write your own.' But then she admitted it was true, phoned someone else, and updated the story. For me, that's a gigantic win for us and for readers as well. That's where the feedback should be. " </p>

<p>"I've also seen a noticeable change in tone in comments and other interactive forums, like <a href="http://Coveritlive.com">Coveritlive.com</a>. As soon as someone from the paper steps in and makes a comment, the whole tone changes. If you just give people a blank wall and a spray paint can, you get a predictable outcome. But as soon as anyone says we should stick to the topic or knock off the personal attacks, it has a noticeable effect. </p>

<p>"Comments are the base level of interaction. I've been thinking of other ways to enhance that. We've got live blog, a wiki project, and hopefully we've got groups and forums around a particular issue. </p>

<p>"One of the biggest things we need to do is identify and encourage members of the community who are thoughtful, intelligent, and produce comments of value -- encouraging them to contribute more, elevating what they do and suppressing some of the noise. I'm hoping our new Web publishing system that lets people vote on comments will help with that. I'm trying to think of more ways to use the volunteer fire department principle. Identify key members, ask them to contribute more, and incentivize them. Making their comments look different, giving them a title, giving them different tools. There's no way we can moderate all these comments every day, and the only way to do it is take advantage of our community. I think a task or a goal helps a community gel." </p>

<p><strong>Does the Globe use TwItter?</strong> It sure does. "I have been using it as a way to connect with people and push out features," Ingram told us. "You can pull Twitter feeds into Coveritlive.com. We did an Oscar one, an Obama visit, covered a shooting in the subway. I was looking for people commenting on Twitter on those topics, pulling in what people say. I've retweeted, approved users, or approved with hashtags. There is a surprising number of everyday people on Twitter; the Mayor of Toronto is on it. But something like that for raw information delivery is always going to be valuable. You may be touching only 1% or .1% of the population, but they are reaching ten times that many people." </p>

<p>Ingram's closing thoughts on the changing media landscape: "Sometimes you do things, like the policy wiki we set up to get people's input on serious issues, the first issue we got a lot of input on and the second one we got a lot less input on. It's the ghost-town phenomenon. Or they are talking about what you want them to talk about but someplace else. You can build a cool night club and tell people about it, but if people don't want to come, if they want to go to an empty warehouse, then that's what they are going to do. As a big media entity, we used to have the audience; now you have to win over an audience to pay attention to you. I don't know how to solve that one either."</p>

<p><em>Mathew Ingram is an active participant in conversations on Twitter about international media and technology; you can connect with him at <a href="http://twitter.com/mathewi">@mathewi</a>.  Learn more about <a href="http://www.readwriteweb.com/reports/">The ReadWriteWeb Guide to Online Community Management</a> via <a href="http://www.readwriteweb.com/reports/">this link</a>.</em></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/how_mathew_ingram_manages_a_news_site_that_gets_50.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/how_mathew_ingram_manages_a_news_site_that_gets_50.php</link>
         <guid>http://www.readwriteweb.com/archives/how_mathew_ingram_manages_a_news_site_that_gets_50.php</guid>
         <category>Interviews</category>
         <pubDate>Tue, 09 Jun 2009 15:40:37 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>How Tough Is It Today Being a VC? 10 Questions for Two Early-Stage Stars</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/clavier_hornik_mar09a.png" width="150" height="83" />Pity the poor venture capitalist. Times were... well, so cushy. Money was flowing, deals were being done in record time, monetization was something one worried about later, and Silicon Valley was bursting at the seams. The sweet smell of wealth creation was everywhere. But suddenly, money got tight and the portfolio companies of many VC firms went on life support. So let's hear from a couple of well-known early-stage investors, each with close to a decade under his belt, who we learned are largely undaunted by the current melancholia.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=14455&amp;cb=14455' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=14455&amp;n=14455' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>We spoke with Jeff Clavier, founder and managing partner of <a href="http://www.softtechvc.com/">SoftTech VC</a>, Palo Alto, CA, and Dave Hornik, a partner at <a href="http://www.augustcap.com/">August Capital</a>, Menlo Park, CA. Both have enviable records of successful investments, and both still find new startups worthy of their funds (though they agree the pace has slowed). We conducted the interviews by email after chatting with both gentlemen at the recent DEMO '09 conference. (The questions for each are identical, though the interviews were conducted separately.)</p>
<p><strong>Question:</strong> How would you describe the current state of early-stage VC in general?</p>

<p><img src="http://www.readwriteweb.com/images/clavier_hornik_mar09c.jpg" align="left" width="200" height="224" /><strong>Clavier:</strong> Early-stage technology investments have traditionally been made by a mix of business angels, which invest their own money, and venture capital firms, which invest from funds they raise. Over the past four years, a few seed-stage firms have become prominent players in early-stage investing: <a href="http://www.firstround.com/">First Round Capital</a>, <a href="http://www.maplesinvestments.com/">Maples Investments</a>, Baseline Ventures, <a href="http://www.trueventures.com/">True Ventures</a>, <a href="http://alsop-louie.com/">Alsop Louie</a>, <a href="http://www.kpgventures.com/">KPG Ventures</a>, and my firm, <a href="http://www.softtechvc.com/">SoftTech VC</a>. In the current environment, all of these firms are actively investing and so are "professional" business angels. More casual angels have disappeared from the market since October 2008, when the public markets started unraveling.</p>

<p><strong>Hornik:</strong> I think it is a very tough time for venture capital in general. The public markets are closed up tight as a drum. But more importantly, the financing environment is very tough. It is particularly hard to assume that a company you finance today will be able to remain financed in the future. So venture capitalists are being very careful about what companies they fund, with a particularly careful eye on capital efficiency.</p>

<p><strong>Question:</strong> What would you say about the state of VC investing specifically in the digital content or digital content technology space? Do VCs look on this space favorably right now?</p>

<p><strong>Clavier:</strong> VCs continue to invest broadly in digital content, technology, services, etc. The bar has just gone much higher on what companies have to demonstrate to show themselves worthy of an investment: traction, revenues, market potential, etc.</p>

<p><img src="http://www.readwriteweb.com/images/clavier_hornik_mar09b.jpg" align="right" width="299" height="200" /><strong>Hornik:</strong> There is no question a number of exciting companies in the digital media space will have no problem getting financed. But many others will not be able to find backers. There is a general sense that online advertising is slowing down. Because a large number of the digital media opportunities are monetized with advertising, many of the companies coming up for funding will face a fairly skeptical venture community and have a tough time getting funded.</p>

<p><strong>Question:</strong> What have been your most notable deals in this space since you started investing?</p>

<p><strong>Clavier:</strong> I started investing in the consumer Internet space five years ago, at the very beginning of Web 2.0, and since then have closed more than 50 deals. Five of them got acquired: <a href="http://www.truveo.com/">Truveo</a>, <a href="http://www.userplane.com/">Userplane</a>, <a href="http://www.mybloglog.com/">MyBlogLog</a>, <a href="http://www.kaboodle.com/">Kaboodle</a>, and <a href="http://www.mayasmom.com/">Maya's Mom</a>. In my angel portfolio, I have companies like <a href="http://www.mint.com/">Mint</a>, <a href="http://www.kongregate.com/">Kongregate</a>, and <a href="http://www.buzznet.com/">Buzznet</a>. More recently, I invested in <a href="http://tapulous.com/">Tapulous</a>, maker of the #1 game on the iPhone, as well as <a href="http://www.circleofmoms.com/">Circle of Moms</a>, <a href="http://getsatisfaction.com/">GetSatisfaction</a>, and <a href="http://www.socialmedia.com/">SocialMedia</a>. All of these companies have millions of users, but it is still the beginning for them.</p>

<p><strong>Hornik:</strong> The partners at August Capital were the earliest investors in such companies as Microsoft, Sun, Compaq, Intuit, Symantec, Seagate, Skype, and many others. I have had the good fortune to invest in such exciting digital media companies as <a href="http://www.evite.com/">Evite</a>, <a href="http://web.tickle.com/">Tickle</a>, <a href="http://www.sixapart.com/">Six Apart</a>, <a href="http://www.videoegg.com/">VideoEgg</a>, etc.</p>

<p><strong>Question:</strong> Have you done any digital content deals in the past six months?</p>

<p><strong>Clavier:</strong> I typically don't talk about recent deals. Funding announcements are most often done when companies launch. In the past six months, I have invested in Outright, <a href="http://foodzie.com/">Foodzie</a>, and <a href="http://textdigger.com/">TextDigger</a>, and I am also about to close two deals in digital content infrastructure.</p>

<p><strong>Hornik:</strong> We have invested in a couple but, unfortunately, we aren't talking about either publicly. Stay tuned.</p>

<p><strong>Question:</strong> Is now a good time to be a VC investor?</p>

<p><strong>Clavier:</strong> I personally consider the current environment a great time for investing. Opportunities I see tend to be stronger; there are stronger talent pools around companies; and the general focus is on building sustainable businesses, with revenues being part of the short-term plan, as opposed to an afterthought.</p>

<p><strong>Hornik:</strong> I think it is a great time to be a VC at August Capital. We have had the good fortune to invest successfully in a number of really interesting companies in past down economies. We believe that great entrepreneurs are undaunted by the challenging economy. And a number of things make it easy to build a company in these difficult times: plentiful talent, cheaper rents, less competition, etc.</p>

<p><strong>Question:</strong> What is the mindset of your investing partners right now, including other VC firms you often invest with? What percentage of them are positive?</p>

<p><strong>Clavier:</strong> We are all busy looking at deals and investing right now. It is, however, fair to say that the pace has slowed down a little bit compared to last year.</p>

<p><strong>Hornik:</strong> I certainly think that plenty of venture investors feel quite daunted by the market conditions. Not only is it harder to raise money for their portfolio companies, it is harder for them to raise money for their own firms. But plenty of folks out there have seen these up-and-down cycles before and remain enthusiastic about venture investing. I personally remain quite optimistic about the future of venture investing; so long as there is technical innovation, there will be great opportunities in venture capital.</p>

<p><strong>Question:</strong> Is VC investing more difficult in the current environment?</p>

<p><strong>Clavier:</strong> I would not say it is more or less difficult than it was in the past. The big question for us is, what type of companies are likely to be successful in these challenging times. There is a flight to quality in terms of management teams and a bigger focus on short-term revenue. But otherwise, it is pretty much the same.</p>

<p><strong>Hornik:</strong> It is definitely more difficult. If capital is the lubricant of markets, then we are facing some pretty serious challenges. But there will be opportunity to succeed despite the markets. And those firms that have a long history of success will be able to weather the storm far more easily.</p>

<p><strong>Question:</strong> What types of digital content solutions or tools are you interested in funding? What's on your wish list?</p>

<p><strong>Clavier:</strong> I don't maintain a wish-list of tools or companies, to be honest. My investment strategy is sector-based, within the realm of consumer Internet. That includes social media, gaming, search and discovery, monetization and ad networks, and consumer and cloud infrastructure. I am currently looking at mobile deals as well, on new platforms like the iPhone and Android.</p>

<p><strong>Hornik:</strong> I don't really have a wish list. I'm always looking for smart, talented entrepreneurs who can tell me what is interesting. The entrepreneurs know way more than the venture community, so I have always followed their lead.</p>

<p><strong>Question:</strong> Has deal flow slowed down for first-round financing? And can we assume that requests for follow-on financings are way up? On which are you spending most of your time these days?</p>

<p><strong>Clavier:</strong> Deal flow has slowed down a little bit, but I find what comes my way is of higher quality. I have closed six follow-on rounds in the portfolio in the last six months: companies I had seed-funded that received capital from new investors in a subsequent round, and I see that as very positive. It was a lot of hard work; it took more time than in the past; and valuations were in line with market realities. But these deals got done. I have allocated 75% of my time to my existing portfolio, and 25% to looking at new deals.</p>

<p><strong>Question:</strong> Any further thoughts on the subject?</p>

<p><strong>Clavier:</strong> We're in the most challenging economic and financing environment of the last decade. I have been an investor for nine years, but I'm certain that we'll see fantastic companies emerge from these difficult times. And I am excited to be involved in the early-stage community that will help build them.</p>

<p><strong>Hornik:</strong> I joined the venture business in June of 2000, which was a challenging time in its own right. But I found some very interesting companies to invest in then. And I'm sure I will find some interesting companies to invest in over the next couple of years as well. I'm looking forward to it.</p>

<em><p>Graeme Thickins is an independent writer, consultant, and blogger based in Minneapolis, MN, and San Clemente, CA. His main blog is <a href="http://www.tech-surf-blog.com/">www.Tech-Surf-Blog.com</a>, where he recently posted some 23 interviews he conducted at the DEMO '09 conference in early March.</em></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/tough_today_being_vc_10_questions_for_two_early_stage_stars.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/tough_today_being_vc_10_questions_for_two_early_stage_stars.php</link>
         <guid>http://www.readwriteweb.com/archives/tough_today_being_vc_10_questions_for_two_early_stage_stars.php</guid>
         <category>Interviews</category>
         <pubDate>Wed, 01 Apr 2009 09:00:00 -0800</pubDate>
<author>Graeme Thickins</author>
      </item>
      
      <item>
         <title>The Future of Firefox: Interview With Mozilla&apos;s Chief Innovation Officer</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/firefox_logo_jan09.jpg" />In my recent visit to Silicon Valley,  I got the chance to visit the <a href="http://www.mozilla.org/">Mozilla</a> headquarters. Among others at the organization, I  spoke to <a href="http://cbeard.typepad.com/">Chris Beard</a> - Mozilla's Chief Innovation Officer and the person overseeing its efforts to bring new concepts to the browser, a.k.a. <a href="http://labs.mozilla.com/">Mozilla Labs</a>. We discussed where Firefox is heading and how it compares to Google Chrome in particular. We also talked about Mozilla's new mobile browser Fennec, the add-on platform, and how recent innovations by Mozilla - such as Weave and Ubiquity - fit into the big picture. In this post we'll focus on the near future of Firefox.</p>
]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=14368&amp;cb=14368' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=14368&amp;n=14368' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<h2>Firefox vs Chrome</h2>
<p>Chris Beard and I first discussed what Mozilla is doing to keep Firefox, its flagship product, competitive  in the latest generation of the 'browser wars'. Google, whose headquarters are just up the road from Mozilla and who I also visited  on the same day, upped the ante in the browser industry in September last year when it <a href="http://www.readwriteweb.com/archives/weekly_wrapup_chrome_edition.php">launched</a> a brand new browser called <a href="http://www.readwriteweb.com/tag/chrome">Chrome</a>. Not only that, but Google went out of its way to claim that Chrome represents the next generation of browsers, because (according to Google) it is much better than existing browsers at managing the increasingly sophisticated  web apps we see on the Web nowadays. </p>
<p>Beard noticeably bristled at the suggestion that Chrome performs better with heavy duty web apps. He noted that Firefox is also working hard to make highly interactive web apps run smoothly. Regarding Google's claim that Chrome's isolated tab processes mean a more stable browser, Beard replied that Firefox too is very stable and that it doesn't crash much these days. And to be fair, in this author's experience the latest production versions of Firefox have indeed performed much better than they used to. I still get the odd browser crash though.</p>
<h2>What's the Vision for Firefox?</h2>
<p>But arguments about browser stability aren't going to differentiate the two browsers, Firefox and Chrome, in the eyes of the general public. So I asked Chris Beard to explain Mozilla's vision for the future of Firefox. Beard replied that the vision for Firefox is to <strong>help users navigate and manage an increasingly complex world</strong>. Beard likened this concept to intelligent agents; and he also used the term 'trusted assistant'. Beard told me that the browser will be &quot;tied to services&quot; - he mentioned the current activity happening in the Linked Data and Semantic Web communities.</p>
<p>Add-ons are a huge part of the current Firefox experience and Chris Beard said that some of those add-ons will become more integrated into the core browser. While that isn't a new trend, I noted that it sounds similar to <a href="http://www.readwriteweb.com/archives/flock_20_launches_adds_myspace.php">what Flock has done</a>. <a href="http://www.flock.com/">Flock</a> is a browser built on the Mozilla platform that integrated many social web elements into the browsing experience (Flickr, YouTube, etc). I suggested that Firefox may want to offer bundles of add-ons, so that users don't have to go hunting around for various individual add-ons. Beard said that yes, this is in the works. He said that users will be able to create add-on &quot;lists&quot; and offer them as a single click to other users - much like Amazon's wish lists. However he noted that there are usability issues to overcome, because some add-ons aren't necessarily compatible with others. He said that currently Firefox has around 8000 add-ons and that we can expect this bundling feature to come out in the next couple of months.</p>
<p>As for other upcoming changes to Firefox, Beard  told me that many aspects of the current Firefox experience could be in the cloud - for example bookmarks and the <a href="http://www.mozilla.com/en-US/firefox/features/">"Awesome Bar"</a> (Mozilla's term for its adaptive learning URL bar). Beard said that portability of the user experience is important in this era of the Web and so they'll be looking to offer certain functionality and data in the cloud.</p>
<p>Another part of Mozilla's strategy for Firefox going forward is to integrate aspects from some of its associated products, such as <a href="https://wiki.mozilla.org/Labs/Ubiquity">Ubiquity</a> (an experimental Firefox add-on that gives your browser a context sensitive command-line - see <a href="http://www.readwriteweb.com/archives/ubiquity_gets_an_update_prettier_faster.php">ReadWriteWeb's most recent write-up</a>) and its sync product <a href="http://wiki.mozilla.org/Labs/Weave/0.2/Release_Notes">Weave</a> (<a href="http://www.readwriteweb.com/archives/mozilla_releases_weave_02.php">our write-up</a>). Beard told me that all Mozilla products are designed to be extended, but this may include making them part of the core Firefox browser. Ubiquity, for example, may end up being baked into Firefox in the future.</p>
<p>In my next post, we'll explore Mozilla's strategy for Fennec (its new mobile browser) and we'll look at recent developments in other Mozilla products such as Ubiquity and Weave.</p>
]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/the_future_of_firefox_chris_beard.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/the_future_of_firefox_chris_beard.php</link>
         <guid>http://www.readwriteweb.com/archives/the_future_of_firefox_chris_beard.php</guid>
         <category>Interviews</category>
         <pubDate>Tue, 24 Mar 2009 18:42:43 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
      <item>
         <title>Being Harry Potter, While You Walk to Work</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/wetellstories.jpg"><a href="http://danhon.com/">Dan Hon</a> is building a radical new future for one of humanity's oldest activities - the telling of stories.  The modest young UK CEO's design company <a href="http://www.sixtostart.com/">Six to Start</a> won Best in Show at this week's SXSW Web Awards.  The company's project, called <a href="http://wetellstories.co.uk/">Telling Stories</a>, is a six part experiment with the book publisher Penguin.</p>

<p>Hon's vision of the future is sci-fi influenced, cross-platform and web-native.  He mocks the "urban games" of online hipsters but believes there will soon be a layer of "Harry Potter ether" that we can dip in and out of while we're walking to work.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=14285&amp;cb=14285' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=14285&amp;n=14285' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>I talked with Hon on a plane ride away from SXSW.  He was on his way to the Canadian equivalent, Interactive '09.</p>

<h2>Making Books a Different Animal</h2>

<p>The Telling Stories project transformed the work of six UK book authors into six different web experiences.  Hon said the authors were mainstream writers whose reactions ranged from indifferent to bemused when they were first approached.  After participating, all six are now enthusiastic to do more on the web, he said.</p>

<p>Hon's favorite of the six parts was a mystery thriller written about the streets of London that his company transformed into a Google Maps overlay; the map marker became a flying first-person narrator for the bird's-eye readers. Book chapters unfolded as map annotations.<br />
<center><img src="http://www.readwriteweb.com/images/westorriesscreen2.jpg"></center><br />
Another section of Telling Stories put a husband and wife team of novelists on a website where visitors could watch their keystrokes in real time, including the delete key.</p>

<p>Another author's book was serialized into 140 character abridged lines and delivered over months to followers on Twitter.</p>

<p>The whole Telling Stories project has been applauded as a great example of book publisher Penguin boldly stepping into a new medium.  Hon says the authors were assured that visiting emissaries from the internet had not come to destroy them.</p>

<p>The project has brought the authors creative opportunity and substantial exposure.  Personalized social serendipity service <a href="http://stumbleupon.com">StumbleUpon</a> has brought in half of the traffic to Telling Stories, Hon says; sometimes up to 10,000 people will Stumble on to the site on a seemingly random day.</p>

<p>Those visitors are encouraged to jump media and buy the full dead-tree version of the web-ified stories.  Hon says though that he thinks the division between media types will become much less clear in the near future.</p>

<h2>The Future of Stories</h2>

<p><img alt="dan hon CC by Dan Taylor on Flickr.jpg" src="http://www.readwriteweb.com/images/dan%20hon%20CC%20by%20Dan%20Taylor%20on%20Flickr.jpg" width="220" height="211" align="right">This CEO and I didn't talk much about monetization - emergent forms of creativity shaking up the old are more exciting.  We didn't tackle the debasement of literature by Twitter because Twitter's awesome potential is more interesting.</p>

<p>We talked just hours after the iPhone OS 3.0 announcement was made and Hon was excited that the new Bluetooth connectivity could mean a vastly improved interface for glucose monitors, for example.  He said that developments like this could be the stepping stones toward a future of <em>ubiquitous computing</em>.</p>

<p>"Soon people will realize that there is no 'mobile internet' - there is only the Internet," he says.  "And stories are everywhere."  Hon says web content today is like the early days of TV, when all anyone could think to do was broadcast actors from the theater in the new medium.  But new types of media enable fundamentally new types of content and experiences.</p>

<p>For example, we're just beginning to learn how to leverage the web's social connections, Hon says.  He points to the first iteration of "urban games" as something rudimentary that won't last: groups of people organizing online to meet in person dressed, let's say, as Pac-man characters, running through city streets and posting videos of their adventures on YouTube.   "Those games ask people to get up and do something they don't really want to do," Hon says.  </p>

<p>Instead, he believes that the future of interactive story telling will be pervasive - it will be available throughout your typical day.  Walking to work, even while at work.</p>

<p>"I have no idea what we can produce in this medium," he said, "but I think it's going to be like turning the whole world into Disney Land."</p>

<p>Just remember, Dan, how much free time you said you discovered when you quit playing World of Warcraft.  Turning the whole world into Disney Land is nothing to take lightly.    That said, I'll see you when we meet up in the Harry Potter ether.  I won't be surprised if you and your team help build it.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/being_harry_potter_while_you_walk_to_work.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/being_harry_potter_while_you_walk_to_work.php</link>
         <guid>http://www.readwriteweb.com/archives/being_harry_potter_while_you_walk_to_work.php</guid>
         <category>Authoring Tools</category>
         <pubDate>Wed, 18 Mar 2009 11:43:08 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>Vint Cerf: We Still Have 80 Per Cent of the World to Connect</title>
		<description><![CDATA[<p><img alt="vint_cerf_playing_Spacewar_feb_09.jpg" src="http://www.readwriteweb.com/images/vint_cerf_playing_Spacewar_feb_09.jpg" width="113" height="168" />"By 2010 we will have run out of IP addresses if we don't do something about it," <a href="http://www.google.com/corporate/execs.html#vint">Vint Cerf</a>, Google's chief Internet evangelist and the man commonly referred to as "the father of the Internet," told ReadWriteWeb last month. (Video embedded below.)</p>

<p>With the number of Internet-enabled devices particularly mobile phones soaring, very few IP addresses remain vacant, and with only about 20 per cent of the world connected to the Net, that's a problem.  And consumers, if you think this doesn't affect you, think again.  That latest gadget you bought - is it IPv6 compatible?</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=13952&amp;cb=13952' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=13952&amp;n=13952' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<h2>TCP/IP: So, what's it all about anyway?</h2>

<p>To fully understand <a href="http://www.join.uni-muenster.de/Dokumente/Howtos/IPv6_for_Beginners.php?lang=en">IPv6</a> we need to take a look at <a href="http://www.livinginternet.com/i/ii_tcpip.htm">TCP/IP</a> and this means a quick trip back in time.</p>

<p>It all started way back in 1969, when the Advanced Research Projects Agency Network (ARPANET) was using a transmission protocol known as the <a href="http://www.livinginternet.com/i/ii_ncp.htm">Network Control Protocol</a>(NCP) to transmit data across networks.  Protocols, if you think of them as languages, are needed so that networks and computers can talk to one another.</p>

<p>Expensive, cumbersome and slow, NCP was found to be limiting and in 1973, the U.S. Defense Advanced Research Projects Agency (<a href="http://www.darpa.mil/">DARPA</a>) initiated a research program, known as the <a href="http://www.isoc.org/internet/history/cerf.shtml">Internetting project</a>, to develop a better communication protocol.</p>

<p>The networks which emerged from this research became the basis for what we know as the Internet, and the protocols developed during this time became known as the TCP/IP Protocol Suite.</p>

<p>At its most basic level, the IP part ensured packets were routed to the right place by providing unique identifying numbers to all hosts connecting to the network, and the TCP part managed the transfer of that data. </p>

<p><img alt="synsynackack_feb_09.jpg" src="http://www.readwriteweb.com/images/synsynackack_feb_09.jpg" width="337" height="138" /></p>

<p>On January 1, 1983 NCP was deemed obsolete when the ARPANET switched over to the new TCP/IP protocol suite, and as a result, marked this <a href="http://blogoscoped.com/archive/2008-01-01-n53.html">date</a> as the official birth date [for some] of the Internet.</p>

<h2>Getting to V1 from V6</h2>

<p>According to the <a href="http://www.livinginternet.com/">Living Internet</a>, after Vint Cerf and Robert Kahn designed TCP/IP, DARPA contracted with three sites to develop operational versions: <a href="http://www.bbn.com/">BBN</a>, <a href="http://www.stanford.edu/">Stanford</a> and the <a href="http://www.ucl.ac.uk/">University College London</a>, and four increasingly better versions of TCP/IP were developed: TCPv1, TCPv2, which then split into TCPv3 and IPv3. Stability finally arrived with TCPv4 and IPv4; the standard protocol we know and use today.</p>

<p>IPv4 uses 32-bit addresses, which limits the address space to 4,294,967,296 (232) possible unique addresses. But, as some of these are <a href="http://www.iana.org/assignments/ipv4-address-space/">reserved</a> for specific purposes, it reduces the total number available. </p>

<p>IPv6 with its <a href="http://www.iana.org/assignments/ipv6-address-space">128 bit addresses</a> increases the number of potential unique addresses to 3.4e+38 (a little bit more than 340 trillion, trillion, trillion).  Additionally, it is designed to rectify issues found with IPv4 such as data <a href="http://www.ipv6.com/articles/security/IPsec.htm">security</a>. </p>

<p>IPv6 is expected to slowly replace IPv4, with the two protocol systems expected to run simultaneously for many years.</p>

<h2>But, what happened to IPv5?</h2>

<p>Typically, the most often asked question when talking about IPv4 and IPv6 is what happened to IPv5?  IPv5 was known as an experimental streaming audio/video protocol.  According to <a href="http://www.oreillynet.com/onlamp/blog/2003/06/what_ever_happened_to_ipv5.html">Raffi Krikorian</a>, a protocol named ST, the <a href="http://www.faqs.org/rfcs/rfc1819.html">Internet Stream Protocol</a> was created in the late 1970's and two decades later revised to become ST2, at which point it was implemented in commercial projects by IBM, NeXT, Apple and Sun.  ST and ST2 were already given that magical "5" notes Krikorian.  Given it had little to do with the fundamental structure of IP addressing, IPv5 is not commonly recognized.</p>

<h2>We're running out of IP addresses</h2>

<p>While the establishment of a single networking protocol was an important step toward maintaining order in the then new internetworked world, no one could have guessed the growth of the Internet, nor the number of IP addresses required to cover the ever growing demand.</p>

<p>"My only defense is that decision was made in 1977, at a time when it was uncertain if the Internet would work," <a href="http://www.techworld.com.au/article/264531/big_changes_ahead_internet_says_vint_cerf">Cerf said recently</a>, adding that a "128-bit address space seemed excessive back then."</p>

<p>Watch our video below to get Cerf's take on IPv6 - and why switching over is so important.</p>

<p><object width="425" height="344"><param name="movie" value="http://www.youtube.com/v/z1UwPxefFYk&hl=en&fs=1&rel=0"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/z1UwPxefFYk&hl=en&fs=1&rel=0" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="344"></embed></object></p>

<p><br />
<em>Recorded at <a href="http://searchmarketingexpo.com/west">SMX West</a> 2009 by ReadWriteWeb<br />
Vint Cerf image: Vint Cerf playing <a href="http://flickr.com/photos/joi/494396202/">Spacewar on PDP-1</a><br />
Credit: Flickr <a href="http://flickr.com/photos/joi/">Joi</a></em></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/vint_cerf_we_still_have_80_per.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/vint_cerf_we_still_have_80_per.php</link>
         <guid>http://www.readwriteweb.com/archives/vint_cerf_we_still_have_80_per.php</guid>
         <category>Interviews</category>
         <pubDate>Sat, 21 Feb 2009 14:00:00 -0800</pubDate>
<author>Lidija Davis</author>
      </item>
      
      <item>
         <title>ReadWriteWeb France Catches Up with Matt Mullenweg </title>
		<description><![CDATA[<p><img alt="wordpress_logo_jan_09.jpg" src="http://www.readwriteweb.com/wordpress_logo_jan_09.jpg" width="144" height="124"  />While <a href="http://ma.tt/2009/02/paris-wordcamp/">Matt Mullenweg</a> was in France for <a href="http://wordcamp.fr/">WordCamp Paris</a> 2009, the team from <a href="http://fr.readwriteweb.com/2009/02/19/entrevues/matt-mullenweg-media-sociaux-et-politique/">ReadWriteWeb France</a> took the opportunity to catch up with him and ask him a few questions about open source, WordPress, and the future. <br />
</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=13922&amp;cb=13922' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=13922&amp;n=13922' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>Below, you'll find the first interview in a five-part series. The <a href="http://www.youtube.com/view_play_list?p=C7F391D94AA6A7AF">entire interview</a> has Matt discussing open source, Creative Commons, developing WordPress with the community, social media in politics, the Obama campaign, and what Fabrice Epelboin of ReadWriteWeb France described as "an upcoming global translator social network service for an open source project." </p>

<center><object width="480" height="385"><param name="movie" value="http://www.youtube.com/v/D_Q9FlBGgOY&hl=en&fs=1&color1=0x5d1719&color2=0xcd311b"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/D_Q9FlBGgOY&hl=en&fs=1&color1=0x5d1719&color2=0xcd311b" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="480" height="385"></embed></object></center>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/readwriteweb_france_matt_mullenweg.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/readwriteweb_france_matt_mullenweg.php</link>
         <guid>http://www.readwriteweb.com/archives/readwriteweb_france_matt_mullenweg.php</guid>
         <category>Interviews</category>
         <pubDate>Thu, 19 Feb 2009 02:30:45 -0800</pubDate>
<author>Rick Turoczy</author>
      </item>
      
      <item>
         <title>Techmeme&apos;s New Editor: An Interview with Megan McCarthy</title>
		<description><![CDATA[<p><img alt="meganpic2.jpg" src="http://www.readwriteweb.com/images/meganpic2.jpg" width="151" height="136"><a href="http://techmeme.com">Techmeme</a> is a semi-automated site that tracks the hottest conversations among tech blogs each day, with updates every five minutes.  It's one of the most innovative efforts in news gathering today.  In December, Techmeme <a href="http://www.readwriteweb.com/jobwire/2008/12/techmeme-hires-megan-mccarthy.php">hired its first human editor</a>, freelance writer Megan McCarthy.</p>

<p>McCarthy tends the gears of Techmeme, makes sure the content on the site remains of high quality and helps ensure the inclusion of new and important voices.  It sounds like an awesome job and one that has probably never existed before - a half woman, half robot, news gathering machine.   How can you get your blog on Techmeme?  What's in the future for the site?  We asked Megan in the following interview.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=13822&amp;cb=13822' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=13822&amp;n=13822' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<h2>The Techmeme Editor's Job Each Day</h2>

<p><strong>Marshall Kirkpatrick:</strong> What do you do all day?  I imagine you standing next to one of the most awesome news discovery machines available, tending it, making sure it keeps running smoothly, and looking out beyond its reaches to feed it things it hasn't gotten to yet itself.  Is that an accurate picture?</p>

<p><strong>Megan McCarthy:</strong>  That is fairly accurate, actually.  I make sure that the news on Techmeme represents an accurate, current, and full overview of what's happening in technology right now. So, that's trimming back stories that aren't relevant, adding in viewpoints that ought to be heard, etc.</p>

<p><strong>Marshall:</strong>  Can you tell us a little bit about your personal background?  </p>

<p><strong>Megan:</strong>  My personal background is a little varied. Prior to [writing for] Valleywag, I bounced around a few different jobs and places and never really found a niche. I lived in Hawaii for a few years, had various office drone jobs and other gigs to pay the bills (Nanny, bartender, coffee server).  But I loved following technology and reading about what was happening in silicon valley - and I've been a news junkie since I was young.</p>

<h2>News Selection and Twitter Tips on Techmeme</h2>

<p><img alt="Techmemesidebar.jpg" src="http://www.readwriteweb.com/images/Techmemesidebar.jpg" width="329" height="435" align="right"><strong>Marshall:</strong>  So, did your coming on board "break" the "objectivity" of the site?</p>

<p><strong>Megan:</strong>  Techmeme is biased and has been so for a while.  If you read <a href="http://news.techmeme.com/081203/automated">Gabe's post announcing the addition of an editor</a>, he makes that point.</p>

<p>What do you think, though? What changes have you noticed since I joined?</p>

<p><strong>Marshall:</strong>  I have noticed no changes to story selection, perhaps less wonky stuff.  I've always considered Techmeme a very reliable source of news  and I think you're doing a good job continuing that tradition - but there were certainly some people who grumbled about the human touch being formally introduced, an editor.</p>

<p><strong>Megan:</strong>  I think some of those people might grumble about anything.</p>

<p><strong>Marshall:</strong>  How can new bloggers get indexed on Techmeme?</p>

<p><strong>Megan:</strong>  We <a href="http://news.techmeme.com/090128/twitter-tips">just introduced a program</a> where people can tip relevant posts to us through Twitter.  Anyone can tip any post they think is relevant to us.</p>

<center><img alt="TechmemeTwittercredit.jpg" src="http://www.readwriteweb.com/images/TechmemeTwittercredit.jpg" width="540" height="132" ></center>

<p><strong>Marshall:</strong>  How is the new Twitter tips program working out?  I see a lot of stories go up with thanks to Twitter, quite a lot - is it changing the face of the site substantially?  Changing the content?</p>

<p>I see a handful of people getting thanks over and again, I imagine there's limited participation so far but how does the algorithm determine whose tips to accept and whose not to?</p>

<p>Also, a lot of people are sending tips regarding their own stories - is that ok?  Even mainstream media outlets.</p>

<p><strong>Megan:</strong>  I don't think it's changing the content overall. Many of the stories that are tipped are ones which are worthy of a Techmeme headline. Not everything that gets tipped to us gets on the site. There are two situations that I can think of where the tip program does affect the content: It can help surface breaking stories faster, and if there are two similar stories from different outlets and someone cares enough to tip a certain one, that will probably effect which one ends up as a headline on Techmeme.</p>

<p>As for people tipping their own stories... personally I'm not completely opposed to it. If a writer has a breaking story that he or she wants to let us know right away, that's a good way to do it.  But, they should keep in mind that their twitter handle will be credited with tipping us to the story.  If "Thanks: Marshall" showed up next to every Techmeme headline you get, people might put two and two together and think that you really like your work.</p>

<p>To my knowledge, the identity of the person tipping the story has no effect on whether or not it will show up on the page. It's about the post itself.</p>

<p><strong>Marshall:</strong>  Well, if shame and loads of people saying "you're an f*ing jackass" was sufficient deterrent to anti-social behavior in social media, then...[indecipherable, record of this part of the conversation lost forever.]</p>

<p><strong>Megan:</strong>  Ha.  Is he though?</p>

<p><strong>Marshall:</strong>  Oh I'm sure he is.  ANYWAY.  Is accuracy taken into account on Techmeme?</p>

<p><strong>Megan:</strong>  Accuracy is absolutely taken into account on Techmeme.  That's one of my goals, anyway.  If there's a post which has a lot of buzz around it, which turns out not to be true...</p>

<p><strong>Marshall:</strong>  What does that look like?  Are you like "Steve Jobs is NOT out at Apple, I don't believe those reports! Story...gone!"</p>

<p><strong>Megan:</strong>  Or, a story that says "Steve Jobs NOT out at Apple" gets published next to the earlier, erroneous rumor.</p>

<p><strong>Marshall:</strong>  Then you yank the false story?</p>

<p><strong>Megan:</strong>  Either yank it or surround it with stories pointing out *why* it's false.  Sometimes the false rumor becomes a story itself  and yanking it can be jarring.  We want our readers to be able to visit the site and know what's going on in technology - to know what people are talking about.  The earlier rumor would probably be replaced as the top story by one with the correct information, but yanking it without giving our readers full context of the overall arc might be a bit jarring.</p>

<p><strong>Marshall:</strong>  You have to be reading a lot of these stories in great detail.  What time does your work day start and end?</p>

<p><strong>Megan:</strong>  I start around 7:30ish and end later than that.  News never stops!</p>

<h2>The Future of Techmeme and Other Aggregators</h2>

<p><strong>Marshall:</strong>  So, everyone wants to be an aggregator these days.  All the young kids are like "mommy, I'm going to grow up to find recommended stories for an online news publisher."    <br />
 <br />
What kinds of things do you foresee becoming points of leverage for content aggregators and news discovers in the future?</p>

<p><strong>Megan:</strong>  I think a reliable real-time web is going to have the greatest impact on aggregation services. I'd love to be able to see stories from sites as they're published, without a lag.</p>

<p>I hope that quality, accurate, and speedy stories get rewarded by receiving more attention - and that new voices are discovered and make the media chorus sound fuller and stronger.</p>

<p>You were asking me about my electric sheep dreams.</p>

<p><strong>Marshall:</strong>  Are you a cyborg?</p>

<p><strong>Megan:</strong>  Depends on my mood.</p>

<p><strong>Marshall:</strong>  At least between 7am and 7pm?</p>

<p><strong>Megan:</strong>  That sounds about right.  This is super-nerdy, but reading an overwhelming amount of news is something that I rather enjoy doing.</p>

<p><em>Thanks to Megan McCarthy and <a href="http://techmeme.com">Techmeme</a> for doing this interview and doing the things they do each day - help us find the hottest conversation in technology.  We appreciate it.  You can find <a href="http://twitter.com/megan">Megan on Twitter</a> as well.   Photo at top by <a href="http://laughingsquid.com">Scott Beale</a></em></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/techmemes_new_editor.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/techmemes_new_editor.php</link>
         <guid>http://www.readwriteweb.com/archives/techmemes_new_editor.php</guid>
         <category>Analysis</category>
         <pubDate>Thu, 12 Feb 2009 17:24:32 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>Recommendation Systems: Interview with Satnam Alag</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/satnam_alag_feb09a.jpg" width="150" height="189" />In a recent post, we looked at <a href="http://www.readwriteweb.com/archives/recommender_systems.php">recommendation systems</a>, briefly reviewing how Amazon and Google have implemented their own systems for recommending products and content to their users.</p>

<p>We had the opportunity to speak with Satnam Alag, author of the recently published <a href="http://www.manning.com/alag/">Collective Intelligence in Action</a>, about what makes for a good recommendation system, where the technology is heading, and why Netflix is finding it so hard to improve its own system.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=13716&amp;cb=13716' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=13716&amp;n=13716' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p><i><strong>Disclosure:</strong> I wrote the forward to 'Collective Intelligence in Action', however I have absolutely no financial interest in the book.</i></p>

<p><em><strong>ReadWriteWeb</strong>: In our recent <a href="http://www.readwriteweb.com/archives/netflix_prize_2009.php">post about Netflix</a>, we identified four main approaches to recommendations: Personalized recommendation: based on prior behavior of the user; Social recommendation: based on prior behavior of similar users; Item recommendation: based on the item itself; And a combination of all three. Do you agree with the four approaches we laid out in our article?</em></p>

<p><strong>Satnam</strong>: Those four categories are pretty comprehensive. I present an alternate classification of recommendation systems in <a href="http://www.manning.com/alag/">my book</a>. I lay out two fundamental approaches. The first approach, item-based analysis, determines items that are related to a particular item. When a user likes a particular item, related ones are recommended. The second approach, user-based analysis, first determines users who are similar to that user.</p>

<p>Further, there are two main approaches to finding similar items and similar users. For the first, content-based analysis, content associated with the item, especially text, is used to compute similarity. In the second, the collaborative approach, actions such as ratings, bookmarking, and so forth are used to find similar items. For the second, user-based analysis, a number of approaches have been taken, including ones based on profile information, user actions, and lists of the user's friends or contacts. Of course, you can combine any these item/user and content/collaborative approaches to build a recommendation system.</p>

<p>The dimensions of the particular item and user space are helpful in deciding whether to use an item-based or user-based approach. Typically, an item-based approach is used to bootstrap one's application when the number of users is small. As the user base grows, the item-based approach is augmented by a user-based approach.</p>

<p><em><strong>ReadWriteWeb</strong>: Other than Amazon and Netflix, which Internet companies have most impressed you in their implementation of recommendation systems?</em></p>

<p><strong>Satnam</strong>: Other than Amazon and Nextflix, Google News' personalization is my personal favorite. Google News is a good example of building a scalable recommendation system for a large number of users (several million unique visitors per month) and a large number of items (several million new stories every two months), with constant item churn. This is different from Amazon's, whose rate of item churn is much lower. Google decided to use collaborative filtering for its recommendation system mainly because of its access to the data of its large user base and because this same approach could be applied to other applications, countries, and languages. A content-based recommendation system perhaps could have worked just as well, but may have required language- or location-specific tweaking. Google also wanted to leverage the same collaborative filtering technology to be able to recommend images, videos, and music, for which it's more difficult to analyze the underlying content.</p>

<p>Among start-ups, my personal favorite is the one we are developing at my current company, <a href="http://www.nextbio.com/">NextBio</a>. It's not available yet but should be next month. The key point about this particular recommendation engine is its strong use of an ontology, similar in concept to tags, to develop a common vocabulary for items and users. The system then makes use of profile information and user interactions, both short- and long-term, to provide recommendations. The system leverages both item- and user-based approaches.</p>

<p><em><strong>ReadWriteWeb</strong>: What commercial opportunities do you forsee with recommendation systems over the next few years?</em></p>

<p><strong>Satnam</strong>: A good personalized recommendation system can mean the difference between a successful and a failed website. Given that most applications now invite users to interact and to leverage user-generated content, new content is being generated at a phenomenal rate. Showing the right content to the right user at the right time is key to creating a sticky application. I would be surprised if most successful websites did not leverage recommendation systems to provide personalized experiences to their users.</p>


<p><em><strong>ReadWriteWeb</strong>: Your book includes a discussion of collaborative filtering. Can you tell us a bit about how this fits into the overall picture of recommendation systems?</em></p>

<p><strong>Satnam</strong>: In recent years, an increasing amount of user interaction has provided applications with a large amount of information that can be converted into intelligence. This interaction may be in the form of ratings, blog entries, item tagging, user connections, or shared items of interest. This has led to the problem of information overload. What we need is a system that can recommend items based on the user's interests and interactions. This is where personalization and recommendation engines come in.</p>

<p>In my book, I take a holistic view of adding intelligence to one's application, a recommendation engine being one way to do it. The book focuses on both content-based and collaborative approaches to building recommendation systems. It focuses on capturing relevant information about the user, information from both within and outside one's application, and converting it into recommendations. One of the things you mentioned in your write-up on recommendation systems is that you would like to apply such a system to your website to recommend things to users. Someone reading my book should be able to create such a system using the techniques I demonstrate.</p>

<p><em><strong>Next Page: Satnam's thoughts on the Netflix Prize and whether the 10% mark will ever be reached.</strong></em></p>

<!--nextpage-->

<p><em><strong>ReadWriteWeb</strong>: <a href="http://www.readwriteweb.com/archives/netflix_prize_2009.php">Netflix is offering $1 million</a> to the team that can improve its recommendation algorithm by 10%. It's been over 2 years now, with the leading company at 9.63%. There is some skepticism, though, that 10% will be reached anytime soon, because now the contestants are making only incremental progress. Do you expect the 10% mark to be reached soon?</em></p>

<p><strong>Satnam</strong>: Netflix's recommendation engine, Cinematch, uses an item-to-item algorithm (similar to Amazon's) with a number of heuristics. Given that Netflix' recommendation system has been very successful in the real world, it is pretty impressive that teams have been able to improve on it by as much as 9.63%. Of course, the Netflix competition doesn't take into account speed of implementation or the scalability of the approach. It simply focuses on the quality of recommendations in terms of closing the gap between user rating and predicted rating. So, it isn't clear whether Netflix will be able to leverage all of the innovation coming out of this competition. Also, the Netflix data doesn't contain much information to allow for a content-based approach; it's for this reason that teams are focusing on collaborative-based techniques.</p>

<p>The challenges to reaching the 10% mark are:</p>

<p><strong>Skewed data:</strong> The data set for the competition consists of more than 100 million anonymous movie ratings, using a scale of one to five stars, made by 480,000 users for 17,770 movies. Note that the user-item data set for this problem is sparsely populated, with nearly 99% of user-item entries being zero. The distribution of movies per user is skewed. The median number of ratings per user is 93. About 10% of users rated 16 or fewer movies, while 25% of users rated 36 or fewer. Two users rated as many as 17,000 movies. Similarly, the ratings per movie are also skewed: almost half the user base rated one popular movie (Miss Congeniality); about 25% of movies had 190 or fewer ratings; and a handful of movies were rated fewer than 10 times.</p>

<p><strong>The approach:</strong> The winning team, BellKor, spent more than 2,000 combined hours poring over data to find the winning solution. The winning solution was a linear combination of 107 sets of predictions. Many of the algorithms involved either the nearest-neighbor method (k-NN) or latent factor models, such as SVD/factorization and Restricted Boltzmann Machines (RBMs).</p>

<p>The winning solution uses k-NN to predict the rating for a user, using both the Pearson-r correlation and cosine methods to compute the similarities, with corrections to remove item-specific and user-specific biases. Latent semantic models are also widely used in the winning solution.</p>

<p>The BellKor team found it important to use a variety of models that compensated for each other's shortcomings. No one model alone could have gotten the BellKor team to the top of the competition. The combined set of models achieved an improvement of 8.43% over Cinematch, while the best model -- a hybrid of k-NN applied to output from RBMs -- improved the result by 6.43%. The biggest improvement by LSI methods was 5.1%, with the best pure k-NN model scoring below that. (K for the k-NN methods was in the range of 20 to 50.) The BellKor team also applied a number of heuristics to further improve the results.</p>

<p>The BellKor team demonstrates a number of guidelines for building a winning solution to this kind of competition:</p>

<ul><li>Combining complementary models helps improve the overall solution. Note that a linear combination of three models, one each for k-NN, LSI, and RBM, would have yielded fairly good results, an improvement of 7.58%.</li>
<li>A principled approach is needed to optimize the solution.</li>
<li>The key to winning is building models that can accurately predict when there is sufficient data, without over-applying in the absence of adequate data.</li></ul>

<p>The final solution will be along the same lines, combining multiple models with heuristics. Contestants will probably reach the magic 10% mark in the next year or two.</p>

<p><em><strong>ReadWriteWeb</strong>: Some people think the 10% mark can't be reached with algorithms alone, but that the "human" element is required. For example, ClerkDogs is a service that hires actual former video-store clerks to "create a database that is much richer and deeper than the collaborative filtering engines." It's a similar approach to that of Pandora, which has 50 employees who listen to and tag songs. How far do you think algorithms can go in making recommendations?</em></p>

<p><strong>Satnam</strong>: Recommendation systems are not perfect. A number of elements go into making successful ones, including approach, the speed of computing results, heuristics, the exploration and exploitation of coefficients, and so on. But it has been shown in the real world that the more personalized you can make recommendations, the higher the click-through rate, the stickier the application, and the lower the bounce rate.</p>

<p>Using humans to form a rich database for recommendations may work for small applications, but it would probably be too expensive to scale. I don't see them competing against each other, human versus machine. Even with human/expert recommendations, one first needs to find a human/expert with tastes similar to those of the user, especially if you want to go after the long tail.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/recommendation_systems_interview_satnam_alag.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/recommendation_systems_interview_satnam_alag.php</link>
         <guid>http://www.readwriteweb.com/archives/recommendation_systems_interview_satnam_alag.php</guid>
         <category>Filtering Services</category>
         <pubDate>Sun, 08 Feb 2009 21:25:37 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
      <item>
         <title>Interview with Google Policy Analyst Derek Slater</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/google-logo.jpg" />We interviewed Google's Derek Slater at the <a href="http://www.sanfranmusictech.com/">SanFran MusicTech Summit</a> yesterday. Slater is a Policy Analyst at Google and in our interview he discusses net neutrality, copyright and other public policy issues Google deals with on a daily basis.</p> 

<p>Derek Slater now runs public policy for Google, but before that he worked with the Berkman Center and the Electronic Frontier Foundation.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=12228&amp;cb=12228' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=12228&amp;n=12228' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>Some of the topics covered in the interview:</p>

<ul>
	<li>Google's philosophy that copyright is supposed to benefit the public.</li>
	<li>Support for rights holders and ways to get them high Google rankings.</li>
<li>Google's history, starting in a garage and how their innovation thrived.</li>
<li>Copyright issues and Comcast.</li>
</ul>

<p><object width="400" height="300">	<param name="allowfullscreen" value="true" />	<param name="allowscriptaccess" value="always" />	<param name="movie" value="http://vimeo.com/moogaloop.swf?clip_id=2025001&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=CC0000&amp;fullscreen=1" />	<embed src="http://vimeo.com/moogaloop.swf?clip_id=2025001&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=CC0000&amp;fullscreen=1" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" width="400" height="300"></embed></object><br /><a href="http://vimeo.com/2025001?pg=embed&amp;sec=2025001">An Interivew with Google Policy Analyst Derek Slater</a> from <a href="http://vimeo.com/user843198?pg=embed&amp;sec=2025001">alex williams</a> on <a href="http://vimeo.com?pg=embed&amp;sec=2025001">Vimeo</a>.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/google_public_policy_interview.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/google_public_policy_interview.php</link>
         <guid>http://www.readwriteweb.com/archives/google_public_policy_interview.php</guid>
         <category>Interviews</category>
         <pubDate>Tue, 21 Oct 2008 13:40:19 -0800</pubDate>
<author>Alex Williams</author>
      </item>
      
      <item>
         <title>Interview With Last.fm Founder Richard Jones: Part 3, Design &amp; Features</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/lastfm-logo.jpg" alt="last.fm" />This week we interviewed one of the founders of online music service <a href="http://last.fm">last.fm</a>, Richard &quot;Mr Scrobble&quot; Jones. We're running the interview in 3 parts, over 3 days. This is Part 3 about design and features; following on from <a href="http://www.readwriteweb.com/archives/interview_with_lastfm_founder_richard_jones_part1.php">Part 1 about last.fm and its competition</a> and <a href="http://www.readwriteweb.com/archives/interview_with_lastfm_founder_part2_business_models.php">Part 2 about business models</a>.</p>
<p>In this post we explore last.fm's feature set, how it compares to MySpace Music, and what we can expect to see from last.fm in the near future. Richard Jones also discusses how last.fm has managed to avoid the legal difficulties that have plagued Pandora.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=12061&amp;cb=12061' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=12061&amp;n=12061' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p><strong>RWW: One of the enduring features of last.fm has been its <a href="http://www.readwriteweb.com/archives/top_10_lastfm_mashups.php">mashups</a>, based on your awesome Audioscrobbler database. RJ, what are a couple of your 
  favorite recent last.fm mashups that you've seen (external apps or internal)?</strong></p>
<p>RJ: Well, we liked the <a href="http://tv.timbormans.com/">Last.fm/YouTube mashup</a> Tim Bormans made so much that we hired him! Internally we've been working on multi-tag search which is available at our <a href="http://playground.last.fm/multitag">Playground</a>, which allows you to search for music using multiple genres (folk + rock + gabba, etc); also on Playground we've been tweaking our <a href="http://playground.last.fm/neighbours">Musical Soulmates</a> app, which you could possibly consider the prototype for a future Last.fm dating service ;)</p>
<p>And of course, scrobbling continues to be an almost default setting for new music services now - from Hype Machine to Muxtape (RIP) to Blip.fm to the new version of VLC (which has 4 million downloads already since launch a couple of weeks ago), it seems like everything has to integrate scrobbling now. Great for us obviously, and great for our recommendations which will continue to improve as more people scrobble. People are scrobbling at a rate of 800 million times a month currently.</p>
<p><strong>RWW: In terms of features, Pandora is similar to last.fm, in that both services have great recommendations and allow the user to discover new music. Both are streaming music services, yet it just seems to be Pandora - of all the 4 major services we've discussed so
  far - that has been having <a href="http://www.readwriteweb.com/archives/pandora_save_internet_radio.php">legislation issues</a>. Can you clarify for our readers how last.fm has managed to avoid those sticky issues, when
Pandora hasn't? I think many people are confused about that.</strong></p>
<p>Last.fm is about more than just online radio. We've got millions of tracks available free-on-demand as well, and beyond that there's a
  massive social network element to the site. We also offer videos, the biggest events listings on the web (personalised to your taste), and our
  own audio and video content under the Last.fm/Presents banner. So the point is, online radio is only one of the things we do, so the
  legislation affects us only in one particular area of the Last.fm experience. As online radio is pretty much the single focus for Pandora, it understandably hits them harder.</p>
<p>The wider issue here, of course, is that royalty rates <em>are</em> high, and the debate around this needs to continue so we can reach a mutually beneficial and economically workable resolution. We don't want to see legitimate online broadcasters stifled by this - it's not good for music
  fans, artists or the wider music industry.</p>
<p><strong>RWW: Lastly, one of the most interesting aspects of online music is its
  ability for new artists to be discovered. It's something MySpace has
  done well in the past, but we get the sense the perfect solution
hasn't been found yet. As our own <a href="http://www.readwriteweb.com/archives/perfect_music_streaming_service.php">Marshall Kirkpatrick asked recently</a>:</strong></p>
<blockquote>
  <p><strong>&quot;How about a service that scans my iTunes library and my online listening history, determines my genres of interest and then never plays music from artists I've already listened to. Or makes sure to play some that I haven't.&quot;</strong></p>
</blockquote>
<p><strong>Does last.fm have a feature like that coming up? ;-)</strong></p>
<p>RJ: Do you think Myspace has done this well in the past? As you can tell from my reply to your first question <em>[see <a href="http://www.readwriteweb.com/archives/interview_with_lastfm_founder_richard_jones_part1.php">Part 1</a>]</em>, I would argue that Myspace has been a rather difficult site to navigate unless you're after popular stuff - and it's possibly going to be more of the same on Myspace Music, as the major labels jostle for frontpage real estate and push more indie/obscure music off the page.</p>
<p>Our recommendation system is being constantly refined to give music fans the best music discovery service on the web. I think we've got that covered. What's equally important is that these artists being discovered, if they're Long Tail or DIY, get the same kind of licensing and royalty breaks that more established artists get, which is why our Artist Royalty Program exists.</p>
<p>We've been doing this for 6 years, as I said, which is why it's kind of funny to be talking about this now because of Myspace Music. They're just catching up to free-on-demand after we pioneered the model almost a year ago. Now they've got to figure out how to make it easy to discover music that suits your taste (sharing playlists is one thing, but how do you find that music to share in the first place?), which we've been doing since 2002. After that, maybe they'll start paying unsigned artists. I would hope all this will come to Myspace Music at some point in the future - but it's happening on Last.fm now.</p>
<p><strong>See also:</strong> <a href="http://www.readwriteweb.com/archives/interview_with_lastfm_founder_richard_jones_part1.php">Interview With Last.fm Founder Richard Jones: Part 1, The Competition</a> and <a href="http://www.readwriteweb.com/archives/interview_with_lastfm_founder_part2_business_models.php">Part 2, Business Models</a></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/interview_with_lastfm_founder_part3_design_features.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/interview_with_lastfm_founder_part3_design_features.php</link>
         <guid>http://www.readwriteweb.com/archives/interview_with_lastfm_founder_part3_design_features.php</guid>
         <category>Interviews</category>
         <pubDate>Fri, 03 Oct 2008 12:45:00 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
   </channel>
</rss>