<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
   <channel>
      <title>Semantic Web - ReadWriteWeb</title>
      <link>http://www.readwriteweb.com/archives/semantic-web/</link>
      <description>Semantic Web on ReadWriteWeb</description>
      <language>en</language>
      <copyright>Copyright 2009 Richard MacManus</copyright>
      <managingEditor>readwriteweb@gmail.com</managingEditor>
      <lastBuildDate>Mon, 26 Oct 2009 12:00:06 -0800</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=4.23-en</generator>
      <docs>http://blogs.law.harvard.edu/tech/rss</docs> 

      
      <item>
         <title>GetGlue.com: Distributed Networking &amp; Recommendations Made Simple &amp; Fun</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/getglue.jpg">Once just a browser add-on that allowed users to surf smarter across several verticals, AdaptiveBlue's <a href="http://getglue.com">Glue</a> is now a site-centric product that acts as both a hub and a spoke of the social web.</p>

<p>Glue's <a href="http://www.readwriteweb.com/archives/echo_creator_khris_loux_on_the_ties_that_bind_the.php">synaptic web</a>-esque technology is based on a user's browsing across common sites such as Amazon, Wikipedia and YouTube, and those visits and any interactions (comments, "likes," etc.) feeding back to automatically create a taste profile and a web of affinity with other users and recommendations of other items or content across about a dozen categories, including music, books and movies. So, can this be done without violating users' privacy or - worse yet - frustrating and boring them into attrition?</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16893&amp;cb=16893' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16893&amp;n=16893' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p><em>[Disclosure: AdaptiveBlue founder and CEO Alex Iskold is a ReadWriteWeb writer.]</em></p>

<p>AdaptiveBlue VP Fraser Kelton thinks both aims can be accomplished. In a phone interview this morning, he shared that Glue has a three-tiered set of privacy controls to ensure that items are shared only when a user wants them to be. As the privacy/permissions inverse of Facebook Beacon, an <a href="http://www.readwriteweb.com/archives/facebook_beacon_apology.php">ill-fated</a> system for socially powered product recommendation, the Glue system is completely opt-in for users and opt-out for publishers.</p>

<p><img src="http://www.readwriteweb.com/glue1.jpg"><em>Glue's all-new user profiles are rolling out today at GetGlue.com.</em></p>

<p><br />
"Nobody's confortable sharing all of their stream," said Kelton. "Financial sites, adult content - we don't want any of that stuff shared." In addition to having a set list of sites from which to pull browsing and interaction data, Kelton said, "The secondary privacy controls can prevent visits to a website from being shared unless the user has a specific interaction on that site. You can also opt to have a private profile so only your friends can see the items you interact with." Finally, Kelton told us that users can delete or edit their own shared content on an item-by-item basis.</p>

<p>Our other concern was whether Glue would be a semantic web geek-fest or whether the average end user would also be able to use, enjoy, and return to the site. Kelton said the site went over well with non-techie beta testers, many of whom started out using the GetGlue.com website and eventually also installed the add-on for an even richer experience.</p>

<p>One of the factors we think makes Glue so much fun for end users, from geeks to complete Luddites, is its Foursquare-like system of achievement and rewards. "We're big fans of game interaction driving the user experience," said Kelton, who also revealed that the product team considered Xbox 360 as inspiration for the Glue system. Users can earn stickers on their profiles based on how many items they browse or interact with, how many users discover content through them, and how much of an expert they are on a particular topic.</p>

<p><img src="http://www.readwriteweb.com/glue2.jpg"><em>Glue's game-like system of badges or stickers as rewards for user interactions and discovery makes for addictive browsing.</em></p>

<p>Each item indexed by Glue can have a guru, and only one guru at a time. The result is friendly competition, increased interactions and a very sticky website that's compelling and entertaining for even the least technical end user.</p>

<p>The site also allows for content to be pushed to a user's existing social networks, such as Facebook and Twitter. Here's a demo video showing more about how the site works:</p>

<p><object width="610" height="366.95"><param name="movie" value="http://www.youtube.com/v/udDRJGHPJ4k&hl=en&fs=1&rel=0&color1=0x5d1719&color2=0xcd311b&hd=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/udDRJGHPJ4k&hl=en&fs=1&rel=0&color1=0x5d1719&color2=0xcd311b&hd=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="610" height="366.95"></embed></object></p>

<p>As previously mentioned, the user experience of Glue - both the site and the revamped add-on - is one of synapses and nodes, not back-and-forth browsing and search for content. As data and content from all around the web is associated with films, books, music, and other items, users are able to quickly scan and consume related content without further navigation. This type of curated experience is something the AdaptiveBlue team calls "shortcut search."</p>

<p><img src="http://www.readwriteweb.com/glue3.jpg"><em>As a social network, GetGlue.com is a hub; but the way it curates content makes it a series of spokes, as well.</em></p>

<p>"You don't have to go to Rotten Tomatoes or YouTube. We know there's a subset of information users want, and we're bringing it in," said Kelton, who is very excited about Glue's position on distributed social networking and the semantic web. "Jeremiah [Owyang] uses the term 'social colonization.' I also think we're implementing what Charlene Li talks about when she uses the phrase, 'The network is all around us.'"</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/getgluecom_distributed_networking_recommendation.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/getgluecom_distributed_networking_recommendation.php</link>
         <guid>http://www.readwriteweb.com/archives/getgluecom_distributed_networking_recommendation.php</guid>
         <category>Semantic Web</category>
         <pubDate>Mon, 26 Oct 2009 12:00:06 -0800</pubDate>
<author>Jolie O&apos;Dell</author>
      </item>
      
      <item>
         <title>The Web of Services: Machine-Accessible Services</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/web_data_apr09a.jpg" height="169" width="150" align="left">
In the last two posts in this series, we discussed the <a href="http://www.readwriteweb.com/archives/web_of_data_machine_accessible_information.php">Web of data</a>, which makes structured interlinked data sets machine-accessible, and the <a href="http://www.readwriteweb.com/archives/web_of_identities_making_machine-accessible_people_data.php">Web of identities</a>, which makes data about people machine-accessible while addressing privacy and data volatility.</p>

<p>This time, we'll focus on the Web of services, which makes services accessible to and processable for machines. These Webs all have a semantic architecture in common and follow basic <a href="http://www.w3.org/DesignIssues/Principles.html">Web principles</a>, such as being decentralized, modular, simple, addressable via URIs, and built for machines.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16801&amp;cb=16801' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16801&amp;n=16801' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>The services sector has become the world's biggest business sector, accounting for 64% of the worldwide gross domestic product. The sector has pressure on it to make its services easier and more widely accessible, as well as to quickly adapt to ever faster changes in the market environment.</p>

<p>The effort to standardize such things as service-oriented architectures (SOA) and Web services has taken years, but still we have no clear definition of what constitutes a service at a conceptual level. The interface, which is the format of what goes in and out of the service, is often described formally, but what the service is actually doing, semantically speaking, is not. While there are a number of different approaches to semantically describing Web services, such as <a href="http://en.wikipedia.org/wiki/Owl-s">OWL-S</a>, <a href="http://en.wikipedia.org/wiki/WSMO">WSMO</a> and <a href="http://en.wikipedia.org/wiki/Web_Services_Semantics">WDSL-S</a>, none so far has managed to break out of its academic confines.</p>

<p>Today, there are already all kinds of services with different levels of complexity, and their number is expected to grow exponentially. The services follow different standards, and a lot of them are proprietary, uni-directional and designed to be used by humans to mash up something new. Editorial catalogs such as <a href="http://www.programmableweb.com">ProgrammableWeb</a> and search engines for Web services such as <a href="http://webservices.seekda.com">seekda</a> are designed for humans who are searching for a particular service for that reason. For tasks that are unsolvable for machines, there are even Web services such as <a href="http://www.mturk.com">Amazon's Mechanical Turk</a>, which have humans in the back end answering tricky queries.</p>

<p>The problem with all of this is that each of the tens of thousands of services is <em>accessible</em> but not <em>findable</em> by a machine without a machine-understandable description. Thus, every service nowadays has to be wired to a machine by hand. So, what would machines be capable of if services were annotated with semantic descriptions?</p>

<ul>
<li><strong>Service discovery</strong><br />
Given an index of Web services, a machine charged with finding the right service for a particular problem could choose one among those that have been indexed.</li>

<li><strong>Contracting and execution</strong><br />
Once a service has been selected, a machine could look up its terms and decide on contracting and execution details. How often would the service be needed? And what would be the cheapest contract then?</li>

<li><strong>Billing or revenue sharing</strong><br />
Depending on the autonomy of the machine, one could imagine something like an <a href="http://en.wikipedia.org/wiki/Software_agent">Autonomous Agent</a>, which automatically makes the best deal with the service provider on such things as billing or revenue sharing for service usage.</li>

<li><strong>Replacement on failure, based on experience</strong><br />
Of course, the machine would be able to replace a failing service with an equivalent one. It could also rate a service and publish it.</li>

<li><strong>Service orchestration</strong><br />
A machine could, given enough intelligence, split a task into sub-tasks and then discover, contract and orchestrate services to solve these sub-tasks. And after the sub-tasks have been addressed, the main task would be solved. Such orchestration could involve the parallelization of tasks, for speeding up or redundancy purposes, or chaining services (whereby the output of one service is inputted into the next).</li>
</ul>

<p>Research projects such as <a href="http://www.tripcom.org">TripCom</a>, <a href="http://www.ip-super.org">SUPER</a>, <a href="http://www.shape-project.eu">SHAPE</a> and <a href="http://www.soa4all.eu">SOA4All</a> are dealing with these ideas and scenarios.</p>

<p>Future scenarios are limited only by our imagination: machines could autonomously pursue goals on behalf of their master user or company, according to a specified level of freedom. These agents could solve increasingly complex problems and be granted increasingly more autonomy (finally ending up as <a href="http://en.wikipedia.org/wiki/Skynet_%28Terminator%29">Skynet</a>).</p>

<p>In the next and final post in this series, we will discuss how all of these scenarios could become a reality with the arrival of all three Webs: a revolution in the ability of machines to access, process and apply information.</p>

<p>Do you also count the Web of services as a third Web? Where do you see its limits?</p>

<p><em>(Photo by <a href="http://www.flickr.com/photos/zorro_art/2748852945/">zorro-art</a>.)</em></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/web_of_services_machine-accessible_services.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/web_of_services_machine-accessible_services.php</link>
         <guid>http://www.readwriteweb.com/archives/web_of_services_machine-accessible_services.php</guid>
         <category>Semantic Web</category>
         <pubDate>Fri, 16 Oct 2009 12:00:39 -0800</pubDate>
<author>Alexander Korth</author>
      </item>
      
      <item>
         <title>Screencasts of Twine&apos;s Facelift; Does It Live Up to the Hype?</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/twine/twine_logo.jpg">We've chronicled semantic web service Twine's <a href="http://www.readwriteweb.com/archives/twine_first_mainstream_semantic_web_app.php">birth</a>, <a href="http://www.readwriteweb.com/archives/twine_disappoints.php">checkered youth</a>, and <a href="http://www.readwriteweb.com/archives/twine_traffic_falls_make_or_break_time.php">recent woes</a> in terms of traffic waning and criticism waxing.</p>

<p>We've been given screencasts of the new version of this knowledge management application - screencasts of both the consumer- and developer-facing facets of the site. Take a look, and let us know if the new Twine lives up to expectations. This new version, we are told, will be live by the end of the year.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16456&amp;cb=16456' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16456&amp;n=16456' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>The consumer product promises to supplant keyword search by treating the web like a huge database, with filtering capabilities that allow users to pare down search results to only the most relevant, applicable, and useful links.</p>

<p><object width="610" height="493.74"><param name="movie" value="http://www.youtube.com/v/jWF3m14i7Vk&hl=en&fs=1&color1=0x5d1719&color2=0xcd311b"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/jWF3m14i7Vk&hl=en&fs=1&color1=0x5d1719&color2=0xcd311b" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="610" height="493.74"></embed></object></p>

<p>Developers and other techies can check out this screencast exploring Twine's collaboratively authored ontologies:</p>

<p><object width="610" height="v"><param name="movie" value="http://www.youtube.com/v/Uto0OigDaQU&hl=en&fs=1&color1=0x5d1719&color2=0xcd311b"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/Uto0OigDaQU&hl=en&fs=1&color1=0x5d1719&color2=0xcd311b" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="610" height="493.74"></embed></object></p>

<p>The Twine folks see the new version as a realization of Tim Berners-Lee's vision of the semantic web. So what do ReadWriteWeb readers think; is the new Twine worth the wait? Does it live up to the hype? Leave your expert comments below.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/screencasts_of_twines_facelift_does_it_live_up_to.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/screencasts_of_twines_facelift_does_it_live_up_to.php</link>
         <guid>http://www.readwriteweb.com/archives/screencasts_of_twines_facelift_does_it_live_up_to.php</guid>
         <category>Semantic Web</category>
         <pubDate>Fri, 18 Sep 2009 15:00:44 -0800</pubDate>
<author>Jolie O&apos;Dell</author>
      </item>
      
      <item>
         <title>Twine Traffic Falls - New Version Coming, But it&apos;s Make or Break Time</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/twine/twine_logo.jpg">Since it was <a href="http://www.readwriteweb.com/archives/twine_first_mainstream_semantic_web_app.php">first unveiled to ReadWriteWeb back in October 2007</a>, Semantic Web application <a href="http://www.twine.com/">Twine</a> has traveled a <a href="http://www.readwriteweb.com/archives/twine_disappoints.php">rocky road</a>. The product is a knowledge management service, in practical terms similar to social bookmarking site <a href="http://www.delicious.com">Delicious</a>. However, almost from the start there have been vocal critics of Twine. </p>
<p>The latest critique is a scathing post by Semantic Web consultant Greg Boutin, entitled <a href="http://www.semanticsincorporated.com/2009/09/twine-in-freefall.html">Twine in Freefall?</a>. Boutin argues that Twine's traffic has taken a dive recently. We followed up with Twine founder Nova Spivack for his response. He admits that traffic has declined &quot;20-25%,&quot; but says that Twine is focusing on an all-new version of its product.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16413&amp;cb=16413' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16413&amp;n=16413' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>Over the years Nova Spivack hasn't been shy about hyping his company, sometimes <a href="http://www.readwriteweb.com/archives/twine_public_launch.php">dissing other products in the process</a>. On that note, we should note that Spivack and Boutin appear to have had a personal squabble. In Boutin's post he says that earlier this year Spivack &quot;decided to libel me on twitter, through tweets he has since deleted.&quot; We don't know the specifics of this, but clearly this battle with Spivack has  colored Boutin's view of Twine now. Nevertheless, the statistics Boutin points to are valid.</p>
<p>The premise of Greg Boutin's post is that Twine had earlier in the year trumpeted passing Delicious and Friendfeed in traffic - ReadWriteWeb was <a href="http://www.readwriteweb.com/archives/twine_could_soon_surpass_delicious_prepares_ontolo.php">the first to cover the Delicious trend</a>, back in March. However now Boutin points to statistics from Compete, Alexa and Quancast showing a marked drop in traffic. Compete was the source we used in our March post, so below we've pasted comparison charts from then and now:</p>
<p><img src="http://www.readwriteweb.com/images/twinetraffic.jpg" /><br />
  <em>Compete chart from March '09 showing that Twine was trending upwards, while Delicious growth appeared to have tapered off.</em></p>
<p><img src="http://www.readwriteweb.com/images/twine_traffic_sep09.jpg" /><br />
<em>Compete chart showing that Twine did indeed pass Delicious in March '09, however over June-July Twine's traffic has plummeted but Delicious held steady.</em></p>
<h2>Twine: Yes Traffic Has Dropped, But We're Focusing on Version 2...</h2>
<p>ReadWriteWeb questioned Twine about these statistics and the company admits that &quot;our internal data shows us down 20-25%.&quot; </p>
<p>Twine appears to put some of this down to problems with version 1 of its product. The company told us that it is putting all of its focus and marketing efforts into a brand new version (more on that below). Therefore the drop in traffic is something Twine and its investors are comfortable with, for now. </p>
<p>Nova Spivack also told us that Twine had indexing problems with Google over the summer. If this was the case, that may be a big reason for the decline in Compete. Spivack explained that "we have about 500K pages that should be indexed. They [Google] are only indexing 140K pages, but we're basically not worrying about it, since T2 <em>[version 2, see below for details]</em> will change the game and the way we deal with Google anyway. . .&quot;</p>
<h2>Twine 2.0: Make or Break</h2>
<p>Twine says that it is busy working on a new version of its product, which is why it hasn't been active on the PR front in the last few months. The company is hoping the new version gets its momentum back. </p>
<p>Nova Spivack elaborated on the new version to ReadWriteWeb, which we'll quote in its entirety because it illuminates Spivack and company's marketing approach:</p>
<blockquote>
  <p>"In the last 9 months we have made a breakthrough with the new version of Twine that changes the economics of vertical search and navigation on the Web. This new technology enables Twine to provide Web-scale faceted navigation and search across numerous vertical search categories. We are able to index structured data (like recipes, products, reviews, or any kind of database driven or XML content) with search-engine performance and scalability. This is a huge leap beyond what we were able to deliver in the first version of Twine."</p>
  <p>"As a result of this breakthrough, we have made a strategic decision to focus all of our resources on bringing Twine version 2 (T2) to market by the end of the year. Version 1 of Twine will remain online until we are able to cutover to version 2. We are doing no further work on version 1 and no marketing for it, either. We are of course still supporting it from a technical and user perspective, however. But all our focus is on T2 moving forward."</p>
</blockquote>
<p>Twine also told us that it has signed deals with nearly a dozen major content providers and brands to integrate the new search capabilities into their online services. See <a href="http://www.readwriteweb.com/archives/twine_could_soon_surpass_delicious_prepares_ontolo.php">our March post</a> for more context about the new version.</p>
<p>So what do we  think about this latest twist to the Twine saga? With language like &quot;breakthrough,&quot; &quot;changes the economics&quot; and &quot;huge leap beyond,&quot; once again Twine is hyping itself up. While there have always been signs that Twine is at least partly delivering on its clear promise, the fact is that <strong>Twine continues to struggle to deliver a product the market wants</strong>. This accounts for its inconsistent growth and much of the criticism of usability which Twine has endured.</p>
<p>We continue to cheer for Twine, but it does seem that version 2 is make or break for the company.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/twine_traffic_falls_make_or_break_time.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/twine_traffic_falls_make_or_break_time.php</link>
         <guid>http://www.readwriteweb.com/archives/twine_traffic_falls_make_or_break_time.php</guid>
         <category>Products</category>
         <pubDate>Tue, 15 Sep 2009 14:07:32 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
      <item>
         <title>IBM&apos;s New Image Recognition-Based Search</title>
		<description><![CDATA[<p><img alt="ibm_search_sept09.jpg" src="http://www.readwriteweb.com/images/ibm_search_sept09.jpg" width="150" height="55">We've all seen photos of ourselves in locations we can't quite remember. Often they're from exotic travels or from days long past. Regardless of the reason for your memory loss, IBM is working on a tool that can help. In collaboration with the European Union consortium, the company is testing <a href="http://sapir.isti.cnr.it/index">SAPIR</a> (Search in Audio-Visual Content Using Peer-to-peer Information Retrieval). The image matching search technology allows users to pull results from large collections of audio-visual content without using tags for search. Instead, users can upload images and match them to similar ones - perhaps even ones with signage and labels. The system analyzes everything from digital photographs, to sound files to video. From here it automatically indexes and ranks the media for retrieval. </p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16358&amp;cb=16358' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16358&amp;n=16358' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>A recent <a href="http://www.scribd.com/doc/15748837/IDC-Multimedia-White-Paper-As-the-Economy-Contracts-the-Digital-Universe-Expands">IDC white paper</a> reports, "The digital universe is messy...95% of the data in the digital universe is unstructured, meaning its intrinsic meaning cannot be easily divined by simple computer programs. There are ways to imply meaning to unstructured data, and the semantic web project is promising to develop the tools to help us do that in the future." </p>

<p>Two such "divining" projects include <a href="http://cophir.isti.cnr.it/">CoPhIR</a> (Content-based Photo Image Retrieval) Test-Collection and IBM's <a href="http://mufin.fi.muni.cz/imgsearch/">MUFIN</a> (Multi-Feature Indexing Network). These projects tie into SAPIR's back end by extracting data from the <a href="http://flickr.com">Flickr archive</a> and indexing features such as scalable color, color structure, color layout, shape edges and texture. </p>

<p><object width="560" height="340"><param name="movie" value="http://www.youtube.com/v/n43fIpOGbd4&hl=en&fs=1&"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/n43fIpOGbd4&hl=en&fs=1&" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="560" height="340"></embed></object></p>

<p>As shown in the video of Madrid's Plaza de España, SAPIR identifies matching media in the same way that humans derive intrinsic value from visual and sensory clues. Users can also choose to combine search terms with additional text to further drill down in search results. As is the case with regular search, if you already know the city where your image was taken, you're one step closer to finding your result. Additionally, SAPIR also has the ability to index sound and video files. </p>

<p>While the catalogue of media is still very limited, theoretically we may one day be able to search for almost anything using this technique. If Ashton Kutcher wears a pair of sunglasses we like, we can scan the image and search for the storefronts stocking them. If we're looking for the name of a town square, we can find it in the tags of similar images. And finally, if we're looking to self-diagnose we can compare photos of ourselves against jaundice or malaria patients. </p>

<p>The advantage of this tool is that we may one day have a chance to collect up the disparate bits in the digital ether and identify them as useful points of information. To test SAPIR in its early research stage, visit the <a href="http://sapir.isti.cnr.it/index">homepage</a>. You may also want to test out <a href="http://mufin.fi.muni.cz/imgsearch/">MUFIN</a> to compare results. </p>

<p><img alt="ibm_search_sept09a.jpg" src="http://www.readwriteweb.com/images/ibm_search_sept09a.jpg" width="610" height="287"></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/ibms_image_recognition_powers_sapir_search.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/ibms_image_recognition_powers_sapir_search.php</link>
         <guid>http://www.readwriteweb.com/archives/ibms_image_recognition_powers_sapir_search.php</guid>
         <category>Search Services</category>
         <pubDate>Thu, 10 Sep 2009 12:23:00 -0800</pubDate>
<author>Dana Oshiro</author>
      </item>
      
      <item>
         <title>Eqentia Launches Semantic Portals - Competes with OpenCalais, Evri</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/eqentia_logo.jpg" />At the SemTech conference in June I met with William Mougayar, founder and CEO of a <a href="http://portal.eqentia.com">semantic news platform</a> called <a href="http://www.eqentia.com/">Eqentia</a>. At the time the product was in development, but it is officially launching today. In a nutshell, Eqentia aggregates content into topics using semantic technology. In that respect it is similar to <a href="http://www.opencalais.com/">OpenCalais</a> (<a href="http://www.readwriteweb.com/archives/calais_4_linked_data.php">our coverage</a>) and <a href="http://www.evri.com/">Evri</a> (<a href="http://www.readwriteweb.com/archives/how_does_the_web_feel_evri_tells_you.php">our coverage</a>). While all three products have different focuses, each semantically tags and aggregates content in a contextual manner. </p>
<p>The difference, claims Eqentia, is that &quot;with Evri or OpenCalais, the onus is on the programmer.&quot; Eqentia says that with its product, &quot;the content is already semanticized and all you have to do is to place it on your portal while preserving your SEO.&quot; The other two companies may disagree with that, but let's take a closer look at Eqentia.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16250&amp;cb=16250' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16250&amp;n=16250' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p><em><strong>Disclosure:</strong> We have decided to use this product on ReadWriteWeb, to fuel our upcoming topic pages. Expect this feature to launch within a few weeks.</em></p>
<p>At its heart, Eqentia is an aggregation platform. It promotes itself as &quot;an aggregator of context, not just content.&quot; The way it does this is to add context in the <strong>navigation</strong>. Each portal has its own taxonomy, which Mougayar described as &quot;a bit like a hierarchical tagging structure.&quot; He said that &quot;we basically wrap any content with a semantic wrapper.&quot;</p>
<h2>How it Works</h2>
<p>Under the hood, Eqentia does &quot;content harvesting&quot; from social media sites such as Twitter, blogs and more. Currently Eqentia is getting content from over 13,000 feeds, collecting an estimated 65,000 articles daily. </p>
<p>Eqentia told us that it's indexed 20 million articles so far. The largest topic currently is Outsourcing, with 90,000 articles. Other topics include: Cloud Computing: 60,000; Supply Chain Management: 40,000; Twitter: 20,000; Social Media: 11,000.</p>
<p>Eqentia then does &quot;text mining and filtering&quot; and the	results are run through an &quot;Aggregation Engine&quot; (which has rules for sources and filters). Finally there is what Eqentia calls &quot;Semantics Management&quot; - including entity extractions, taxonomy definition, controlled vocabulary.</p>
<h2>What The User Gets</h2>
<p>Eqentia is starting off with a focus on &quot;professional&quot; content topics. It will target business and technology content, ignoring more mainstream topics like current affairs, sports, entertainment.</p>
<p><img src="http://www.readwriteweb.com/images/eqentia_screenshot1.jpg" /></p>
<p>Eqentia is launching with 3 products:</p>
<p><strong>1)	Out-of-the-box portals. </strong>These will give general users free access to topic streams (of which there are 12 at launch, with more coming). There will be email options, widgets and RSS feeds available.</p>
<p><strong>2)	Personalized portal. </strong>These can be private or public. <em>[note: this is what ReadWriteWeb has signed up for]</em></p>
<p><strong>3)	Enterprise.</strong> A SaaS platform that can be customized. A stated use case is for large companies to &quot;disseminate organized news intelligence for their employees across distinct groups or market segments.&quot;</p>
<h2>Conclusion: Tough Competition, But Important Market</h2>
<p>The proof will be in the pudding as to how Eqentia compares to  OpenCalais and Evri. We've been very impressed with both OpenCalais and Evri in our previous coverage, so Eqentia has high standards to live up to. In particular Eqentia is going to have to nail the User Experience, because it is relying on its interface  a lot to give value to the user.</p>
<p>Finally, Mougayar noted to us that &quot;if web 2.0/social media rewarded the socially savvy user, the semantic web/web 3.0 will reward the research oriented user.&quot; It's a nice marketing line, but we are apt to agree that products like Eqentia, OpenCalais and Evri are bringing much needed smarts to the oceans of content in the Web.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/eqentia.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/eqentia.php</link>
         <guid>http://www.readwriteweb.com/archives/eqentia.php</guid>
         <category>Products</category>
         <pubDate>Wed, 02 Sep 2009 06:00:00 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
      <item>
         <title>Machine-Powered Medical Info: HealthBase Semantic Search</title>
		<description><![CDATA[<p><img alt="healthbase_semantic_aug09.jpg" src="http://www.readwriteweb.com/images/healthbase_semantic_aug09.jpg" width="150" height="38">We've all seen how semantic technologies improve search results, but rarely do we see those results put to use in such a targeted way. Jens Tellefsen, VP of Marketing and Product Strategy at <a href="http://netbase.com/index.php">NetBase Solutions</a> spoke to ReadWriteWeb about today's launch of <a href="http://healthbase.netbase.com/">healthBase</a> - a medical search and discovery application. Using a variety of semantic indexing techniques, the company crawls the web's leading medical and health players including the Mayo Clinic, PubMed (US National Library of Medicine) WedMd, Medical News Today and Discovery Health. What makes this a truly unique technology is that rather than requiring any data manipulation from humans, Netbase's search results are completely automated. </p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16246&amp;cb=16246' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16246&amp;n=16246' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>Says Tellefsen, "Rather than using keywords or basic entities to search through billions of documents, NetBase can actually read and extract linguistic meaning from entire sentences and concepts." According to Tellefsen, healthBase can determine causal relationships, treatments and conditions and automatically aggregate that data into meaningful answers. Given the fact that more than 75% of the population seeks out online health information, a semantic tool with sentence-level understanding can potentially help dispel medical myths on a massive scale. </p>

<p><img alt="healthbase_semantic_aug09a.jpg" src="http://www.readwriteweb.com/images/healthbase_semantic_aug09a.jpg" width="610" height="361">NetBase employs the same principals across a variety of enterprise tools, but healthBase is its first foray into consumer-facing products. While the company is used to powering corporate, federal and market research, healthBase allows NetBase to show off its content intelligence tool in a way that gives us insight into our selves and our bodies. </p>

<p>Because NetBase is not reliant on manual annotation or custom taxonomies, the system is also very scalable. It took roughly 2 days to produce all of the data in healthBase - a feat that would never be possible by a combination human and machine system. </p>

<p>"It's important for us to address real issues with semantic technologies outside of a lab," Says Tellefsen. To try healthBase visit <a href="http://healthbase.netbase.com/">healthbase.netbase.com</a></p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/machine-powered_medical_info_healthbase_semantic_s.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/machine-powered_medical_info_healthbase_semantic_s.php</link>
         <guid>http://www.readwriteweb.com/archives/machine-powered_medical_info_healthbase_semantic_s.php</guid>
         <category>Semantic Web</category>
         <pubDate>Wed, 02 Sep 2009 05:00:00 -0800</pubDate>
<author>Dana Oshiro</author>
      </item>
      
      <item>
         <title>Dygest: Survey All The Hottest News Coverage in One Digest</title>
		<description><![CDATA[<p><img alt="dygestlogo.jpg" src="http://www.readwriteweb.com/images/dygestlogo.jpg" width="150" height="47" ><a href="http://dygest.net">Dygest.net</a> is a fascinating new service that looks over hundreds of topical sources online, finds out what the hot topics are and publishes excerpts of coverage that help tell the whole story succinctly on one page.  It's like Cliffs Notes for meme-tracking and it appears to work quite well.</p>

<p>At first glance Dygest looks like the work of one more computer scientist who's trained a robot to hunt for hot links.  There are a lot of those now.  This one uses natural language processing algorithms instead of tracking links between sources, though, and the summaries from multiple sources are enjoyable to read.  Demo sites have been set up for <a href="http://dygest.net/tech">technology</a>, <a href="http://dygest.net/green">environmental</a>, <a href="http://dygest.net/film">film</a>, <a href="http://dygest.net/chicago">Chicago-area</a> and <a href="http://dygest.net/bayarea">San Francisco area</a> news.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16065&amp;cb=16065' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16065&amp;n=16065' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p><img alt="TechDygest.jpg" src="http://www.readwriteweb.com/images/TechDygest.jpg" width="610" height="430" ></p>

<p>The digests aren't perfect but they are surpassingly coherent.  The site isn't particularly eye catching at all, but as a proof of concept it's very interesting.  You can either get a broad understanding of a story by reading highlighted paragraphs from various sources, or pick one of those sources to dive deep into its particular coverage. The opportunity to sample different writing styles, angles and key points makes Dygest fun to read casually and for in-depth exploration of the news.</p>

<p>The team behind Dygest says they think the service would be well suited for mobile consumption, but white labeling the technology is high on their list of ideas as well.  <a href="http://www.onespot.com/">OneSpot</a> comes to mind as an immediate competitor, but there are others.</p>

<p>Dygest has pre-seeded each of its demo sites with a few hundred sources, but the possibilities the analysis engine presents are many.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/dygest_survey_all_the_hottest_news_coverage_in_one.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/dygest_survey_all_the_hottest_news_coverage_in_one.php</link>
         <guid>http://www.readwriteweb.com/archives/dygest_survey_all_the_hottest_news_coverage_in_one.php</guid>
         <category>Semantic Web</category>
         <pubDate>Fri, 14 Aug 2009 13:21:04 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>How Does the Web Feel? Evri&apos;s New Sentiment API Tells You</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/evri-logo.png">Semantic search engine <a href="http://www.evri.com">Evri</a> can now understand how the web feels with the launch of their new <a href="http://www.evri.com/developer/rest#API-GetSentimentInformation">sentiment web API</a>. While busy scouring the net for people, places, and things and determining the relationships between them, the search engine is now able to understand the <em>feelings</em> associated with these entities, too, be them positive or negative. Using the API, developers can build applications for things like market intelligence, market research, sports and entertainment, brand management, product reviews and more.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=16061&amp;cb=16061' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=16061&amp;n=16061' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[

<h2>Not Just Good or Bad, but Who, What, and Why, Too</h2>

<p>At first we thought Evri's API would simply rank things as positive or negative, much like the Twitter tracker <a href="http://twendz.waggeneredstrom.com/">twendz</a> does today, highlighting positive, negative, and neutral items. However, the sentiment API does so much more, allowing you deeper insight into the "who's," and "what's," and "why's" associated with the particular expression or feeling.</p>

<p>To be more specific, according to the <a href="http://blog.evri.com/index.php/2009/08/11/sentiment-api-exposes-webs-feelings/">announcement</a>, Evri lets you: </p>

<ul>
  <li>Find the percentage of positive and negative expressions of sentiment made <em>by</em> an entity, or <em>about</em> an entity. For example, find out what percentage of things being written about the iPhone are positive and which percent are negative. </li>

  <li>Discover <strong>who</strong> is <em>criticizing</em> and who is <em>praising</em> a particular person, place or thing. For example, see who is <em>criticizing</em> and <em>praising </em>Microsoft right now. </li>

  <li>Read <strong>what</strong> <em>praisers</em> and <em>critics</em> are saying about an entity. For example, see what the GOP are saying about the Democrats. </li>

  <li>Discover <strong>who</strong> or what your favorite entity is <em>bashing</em> and <strong>why</strong>. For example, see who Lance Armstrong is complaining about. </li>

  <li>Discover <strong>who</strong> or <strong>what</strong> your favorite entity is <em>praising</em> and why. For example, see <a href="http://api.evri.com/v1/sentiment/summary/about?sentimentSource=/organization/world-health-organization-0x4abf4&amp;includeSummaryDetails=true&amp;sort=date">who</a> the World Health Organization is commending and <a href="http://api.evri.com/v1/sentiment/about?entityURI=/location/mexico-0x3acdd&amp;sentimentSource=/organization/world-health-organization-0x4abf4&amp;sort=date">why</a>. </li>
</ul>

<p>When unleashed upon the web as a whole, this could unearth a veritable goldmine of information. Just thinking of how many different ways it could be used is enough to blow anyone's mind. Of course, marketers will be the first to jump on board, looking for practical ways to track the feelings about their companies, clients, and brands and why they're changing, but an engine that understands sentiment could do so much more than just this. It can literally take the pulse of the web the way we take the pulse of Twitter using apps like the above-mentioned twendz to rank trends as positive or negative. </p>

<h2>Demo: The "Vibology Meter"</h2>

<p>To demonstrate what <a href="http://www.evri.com">Evri</a> can do, the company created a widget called the "Vibology Meter." (Sadly, no link is provided).  The widget not only ranks the good or bad "vibes" about a particular entity (in the example, Barack Obama), but also explores topics associated with that entity and whether or not the primary entity feels positively or negatively towards them. For example, the widget shows Obama is negative towards the GOP and Rush Limbaugh but feels positive about Michele Obama. (Well, that's good!)</p>

<p><img src="http://www.readwriteweb.com/images/evri_sentiments_1.png"></p>

<p>When you click on any one of the associated topics (or click on "anything" to see all topics of either positive or negative slant), you're then presented with a sidebar of information. Here, snippets from articles found on the web display along with a title, link, and timestamp. </p>

<p><img src="http://www.readwriteweb.com/images/evri_sentiments_2.png"></p>

<p>Of course, this is just a simple example of the <a href="http://www.evri.com/developer/rest#API-GetSentimentInformation">Evri API</a> in action. We're sure the developers out there can think up even better ideas than this.</p>

<h2>Challenges Ahead</h2>

<p>The challenge now for <a href="http://www.evri.com">Evri</a> is to keep expanding its index in order to track more sources to rank. At the moment, the engine doesn't track a large slice of the web the way a typical search engine like Google does - in fact they don't even claim to <em>be </em>a search engine...despite what that "Go to" box on their homepage would have you believe. Instead, Evri looks specifically at the people, places, and things on the web and maps the connections between them. </p>

<p>To determine these connections - and now, the associated sentiments as well - <a href="http://www.evri.com">Evri</a> pulls from a limited number of "highly regarded" sources. That means you'll definitely see a site like CNN used to rank a person like Obama, but the myriad of tiny politico blogs will be ignored. That's actually a shame, since delving into this "long tail" of the web could give a better overall picture of how <em>all </em>people really feel, not just the sentiments expressed on high-profile sites written by top bloggers and journalists. Still, we know indexing and parsing this long tail is something that's much easier said than done. </p>

<p>In the end, what Evri's doing, even on this smaller scale, is definitely interesting. We hope to see <a href="http://www.evri.com/developer/rest#API-GetSentimentInformation">the new API</a> put to good use in the near future. </p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/how_does_the_web_feel_evri_tells_you.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/how_does_the_web_feel_evri_tells_you.php</link>
         <guid>http://www.readwriteweb.com/archives/how_does_the_web_feel_evri_tells_you.php</guid>
         <category>Semantic Web</category>
         <pubDate>Fri, 14 Aug 2009 07:37:20 -0800</pubDate>
<author>Sarah Perez</author>
      </item>
      
      <item>
         <title>Spicing Up Your Blog: Apture vs. Zemanta Balloons</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/apture_logo_feb09.jpg">Pop-up info windows: someone had to do it right, right?  After years of pushy, worthless little window overlays that pop up when you hover over a link, there are now a number of companies trying to offer bloggers and their readers a whole lot of value in what could be a handy format.</p>

<p>Below we briefly review two of these services, <a href="http://apture.com">Apture</a> and <a href="http://zemanta.com">Zemanta's</a> Balloons.  Is this kind of product really worth using?  Once you add a pop-up of someone's LinkedIn profile next to their name as you type it, you may never want to go back to not having a tool like this at your disposal.  </p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=15983&amp;cb=15983' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=15983&amp;n=15983' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>The best-known startup in this space right now is Apture, a company that launched last year and lets you fill pop-ups with all kinds of multimedia content.  The newest entrant to this market is Zemanta, a semantic web company that's used by bloggers to add related links to their posts all over the web.  Last week Zemanta released a product called Balloons; it looks a lot like Apture but it's open source, semantically smart and standards-based.  We decided to put both products to the test, and here are the pros and cons we found in each.</p>

<h2>Apture</h2>

<p>We started by testing out Apture's WordPress plug-in (on <a href="http://marshallk.com">my personal blog</a>) and were very happy with the results.   It takes just a few minutes to install, and learning to use it is quite intuitive.  We wrote <a href="http://www.readwriteweb.com/archives/apture_popups_media.php">an extensive review of Apture</a> in February.</p>

<p><img alt="apturereal1.jpg" src="http://www.readwriteweb.com/images/apturereal1.jpg" width="610" height="409" ></p>

<p><img alt="apture2-2.jpg" src="http://www.readwriteweb.com/images/apture2-2.jpg" width="610" height="404" ></p>

<p><strong>Pros:</strong><br />
<ul><li>It's beautiful. From the admin section to the pop-up windows, design has been emphasized at Apture.</li><br />
<li>Lots of user control.  The amount of control users have over what's included in their pop-ups is amazing.  You can choose between assets with a few clicks, or you can pick out start and end timestamps in an embedded YouTube video.  The list of options is big and keeps getting bigger, as evidenced by the recent addition of really nice LinkedIn and Twitter profile options.</lI><br />
<li>You can now include multiple tabs in one link, making it easy to pack a lot of information inside.</li><br />
<li>The user experience is solid, and the product is pretty well baked.</li><br />
</ul></p>

<p><strong>Cons:</strong><br />
<ul><li>Apture is proprietary software offered by one company, unlike Zemanta's standards-based offering, which was built as part of a consortium of developer- and community-minded companies.</li><br />
<li>Sometimes it hangs on the UI.  We found one bug that the company has since fixed, but pop-up loading is sometimes slower than we'd like.</li><br />
</ul></p>

<h2>Zemanta</h2>

<p><a href="http://zemanta.com">Zemanta</a> is a feature-rich service for bloggers and has a great API that developers can use to automatically discover keywords in bodies of text in lots of different scenarios.  You should check it out.  It's quite easy to use.  Last week the company released a feature that competes with Apture, called <a href="http://www.zemanta.com/balloons/">Balloons</a>.  Balloons is now automatically included in the blogging plug-in from Zemanta, which is very easy to get started with.  </p>

<p>To be frank, we would recommend installing the core Zemanta plug-in for the rest of its features but using Apture for info pop-ups instead.  The way the two products are administered is very different; Zemanta detects key concepts in the text of your post and suggests Balloon links you can add with a click.  You're limited to adding just those handful of Balloons; you can't link up just any text you want.</p>

<p><img alt="zemantascreen1.jpg" src="http://www.readwriteweb.com/images/zemantascreen1.jpg" width="609" height="370"><br />
<center><img alt="zemanta2.jpg" src="http://www.readwriteweb.com/images/zemanta2.jpg" width="477" height="413" ></center></p>

<p><strong>Pros:</strong><br />
<ul><li>Zemanta is open source and standards-based.  It feels good to use.</li><br />
<li>Zemanta works with the rest of the tech community and has some <a href="http://www.readwriteweb.com/archives/change_the_world_with_one_click_zemanta_adds_auto-.php">awesome tools for supporting non-profit organizations</a>.  Did we mention that it feels good to use Zemanta?</li><br />
<li>The auto-detection of key concepts -- just click on the buttons and they're linked to resources -- makes Zemanta a little bit faster to use than Apture.  It takes fewer clicks.</li><br />
</ul></p>

<p><strong>Cons:</strong><br />
<ul><li>This tool isn't nearly as pretty.  In fact, the pop-ups are almost the opposite of pretty.</li><br />
<li>You have far less control over the sources of information you can include.  Zemanta's Balloons is tied to the ambitious <a href="http://www.readwriteweb.com/archives/common_tag_brings_standards_to_metadata.php">CommonTags standards effort</a> and apparently does not include anything outside the world of standards.  That's noble but limiting.</li><br />
<li>Most of the links Zemanta inserts are to <a href="http://www.freebase.com/">FreeBase</a>, which is like a machine-readable version of Wikipedia but also a noble, well-funded mess.  Thus the pop-ups you get from Zemanta are quite hit and miss.</li><br />
<li>There are Amazon affiliate ads in the Zemanta product; Apture's business strategy appears to be to serve bloggers for free and ad-free and charge big publishers to white-label the service.  Zemanta's Amazon ads might get on your nerves.</li><br />
<li>This is a very early product, having just launched last week.  We hope it is further developed.</li></ul></p>

<p>That's our experience so far with these tools.  If you've tried either or both, we'd love to know about your experience as well.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/spicing_up_your_blog_apture_vs_zemanta_balloons.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/spicing_up_your_blog_apture_vs_zemanta_balloons.php</link>
         <guid>http://www.readwriteweb.com/archives/spicing_up_your_blog_apture_vs_zemanta_balloons.php</guid>
         <category>Blogging</category>
         <pubDate>Fri, 07 Aug 2009 09:08:38 -0800</pubDate>
<author>Marshall Kirkpatrick</author>
      </item>
      
      <item>
         <title>Siri: Virtual Personal Assistant Prepares For Debut</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/siri_logo_july09.jpg" /><a href="http://siri.com/">Siri</a> has been getting a lot of hype over the past year. It's an as yet unreleased product that aims to be a &quot;Virtual Personal Assistant&quot; (VPA). At the recent SemTech conference in San Jose, I sat down with two of the founders of Siri: Dag Kittlaus (CEO) and Tom Gruber (CTO). I was informed that the product will launch end of summer U.S. time - starting out as an iPhone app, but later other platforms will be supported.  The iPhone app will go into private beta July/August time period, then launch in Q4 2009 or Q1 2010.</p>
<p>Siri has been preparing for this for some time now. While the product is still more promise than substance, I at least got to look at some real-life iPhone demos in San Jose.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=15690&amp;cb=15690' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=15690&amp;n=15690' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p>Before we go any further: there is bad news for international users. Siri told me that the product will be <em><strong>US only</strong></em> for some time, which they claimed was due to the ecosystem of services it will utilize (Yelp, Google Maps and similar services). For international users, including this author, sadly we won't get to use Siri until the end of 2010 at the earliest!</p>
<h2> What Users Can Expect (U.S. Ones at Least...)</h2>
<div class="pullquote">
<p>Siri wants to enable a "person-centric world," rather than the device-centric one we have today.</p>
</div>
<p>Last October it <a href="http://www.readwriteweb.com/archives/semantic_stealth_startup_siric.php">raised $8.5 million</a> and in November ReadWriteWeb named it as <a href="http://www.readwriteweb.com/archives/10_more_semantic_apps_to_watch.php">a Semantic App to watch</a>, purely based on the promise of the app. Siri was spun out of SRI International and its core technology is based on the ambitious <a href="http://caloproject.sri.com/">CALO</a> artificial intelligence project. Siri is aiming to be a &quot;personalized assistant that learns.&quot;</p>
<p>I asked Kittlaus and Gruber what specifically is in store for users of Siri? They told me that Siri will start as a mobile application, but it will become available over IM, email and other communication platforms too. Kittlaus added that you will be able to &quot;access your assistant in the manner you choose.&quot; The iPhone app will be the first one launched, at the end of summer, however the pair were button-lipped about which device would be next. </p>
<p><img src="http://www.readwriteweb.com/images/siri_screenshot_july09.jpg" align="right" />Siri also sees a future for their product in <strong>smart devices</strong>, for example VCRs - where they could replace the instruction manual. The overall message is that Siri will be &quot;a systems play in everyday systems.&quot; Later in the interview, Kittlaus and Gruber described Siri to me as an &quot;intelligent interface&quot;.</p>
<p>Siri wants to enable a &quot;person-centric world,&quot; rather than the device-centric one we have today. In other words, they want Siri to serve the user and act as an intermediary between the person and the device. In the demos I saw, you could type or speak instructions into an iPhone - e.g. &quot;What is a good movie about to start near my current location?&quot; - and the product would deliver a textual and graphical answer.</p>
<h2>Siri vs Chandler</h2>
<p>I asked if there are any similarities between Siri and another 'Personal Assistant' product, the Mitch Kapor inspired <a href="http://chandlerproject.org/">Chandler</a> - an open source project which <a href="http://www.readwriteweb.com/archives/chandler_future.php">re-defined the phrase 'scope creep'</a> for the web 2.0 era (it finally reached version 1.0 status in August 2008, after more than 7 years development). </p>
<p>Kittlaus and Gruber replied that Siri will be very task-focused and will be much more pragmatic than Chandler. They claimed that Siri will &quot;focus on domains we understand,&quot; such as movies and restaurants. They also noted that the time and place for a personal assistant product is <em>now</em>, due to rise of the Mobile Internet. So Chandler suffered from bad timing too.</p>
<h2>How Will Siri Make Money?</h2>
<p>As noted above, one business model for Siri will be partnerships with companies that make personal appliances such as VCRs. What other ways will Siri make money, I asked? </p>
<div class="pullquote">
<p>Siri will have affiliate agreements on transactions, such as movie tickets.</p>
</div>
<p>Kittlaus and Gruber told me that they will have affiliate agreements on transactions, for example movie tickets. What's more, their model will be CPA based - where advertisers define what they want Siri to do and pay for the outcomes. An example might be Netflix paying for every new customer.</p>
<h2>Browser Technology to the Fore</h2>
<p>We spoke a bit about the problems of designing Siri for many different devices. Siri is looking for browser technology to come to their rescue, pointing to <a href="http://webkit.org/">WebKit</a>'s progress <a href="http://www.readwriteweb.com/archives/browser_wars.php">in the mobile browsing world</a>. Siri is also looking for upcoming browsers to integrate functionality such as speech, which Siri will be able to leverage; inputting commands into Siri by voice is expected to be a big use case. As a lot of Siri's processing is done in 'the cloud,' and browser technology will be an important factor in the success or otherwise of Siri. </p>
<p>I think this is a pretty good bet by Siri, given all of the competition and advances in the browser space right now - check out what IBM is doing in a <a href="http://www.readwriteweb.com/archives/ibm_blue_spruce_first_look.php">browser-related project called Blue Spruce</a>, as just one example.</p>
<p><img src="http://www.readwriteweb.com/images/siri_graph.jpg" /></p>
<h2>Mobile a Big Part of Siri's Vision</h2>
<p><div class="pullquote">
<p>The demographic that Siri is targeting initially is 18-35 urban and social.</p>
</div>Mobile will drive a lot of Siri's vision. Gruber mentioned that mobile gives developers &quot;a sensible closed world.&quot; Then the pair commented that a number of Siri's use cases are distinctly mobile. </p>
<p>The demographic that Siri is targeting initially, 18-35 urban and social, are early adopters of the iPhone and mobile in general. This user base is also a multi-tasking one. That again plays to Siri's goal  to deliver personalized information in context, which these days usually means on a mobile device. </p>
<p>The proof - as always - will be in the pudding when Siri is launched at end of the summer to U.S. users. We think it's a great shame that international users won't be able to enjoy this product too, until probably 2011. In any case, you can see Tom Gruber give a demo of Siri in <a href="http://www.semanticuniverse.com/semtech-keynote-game-changer-siri-virtual-personal-assistant.html?utm_source=streamsend&utm_medium=email&utm_content=4953022&utm_campaign=Highlights%20from%20the%20Semantic%20Technology%20Conference%20-%20Video%2C%20Audio%20and%20Blogs">this SemTech video</a> (slides <a href="http://tomgruber.org/writing/Siri-SemTech09.pdf">here</a>):</p>
<p>
<object width="601" height="451"><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="movie" value="http://vimeo.com/moogaloop.swf?clip_id=5424527&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=cc0000&amp;fullscreen=1" /><embed src="http://vimeo.com/moogaloop.swf?clip_id=5424527&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=cc0000&amp;fullscreen=1" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" width="601" height="451"></embed></object>
</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/siri_virtual_personal_assistant_prepares_for_debut.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/siri_virtual_personal_assistant_prepares_for_debut.php</link>
         <guid>http://www.readwriteweb.com/archives/siri_virtual_personal_assistant_prepares_for_debut.php</guid>
         <category>Products</category>
         <pubDate>Mon, 13 Jul 2009 04:30:00 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
      <item>
         <title>ReadWriteWeb Interview With Tim Berners-Lee, Part 2: Search Engines, User Interfaces for Data, Wolfram Alpha, And More...</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/tbl_may08.jpg" />In part 2 of my one-on-one interview with Tim Berners-Lee, we explore a variety of topics relating to Linked Data and the Semantic Web. If you missed it, in <a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">Part 1 of the interview</a> we covered the emergence of Linked Data and how it is being used now even by governments. </p>
<p><font style="float: right; margin-left: 10px;"><script type="text/javascript">
tweetmeme_url = 'http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php';
tweetmeme_source = 'rww';
</script><script type="text/javascript" src="http://tweetmeme.com/i/scripts/button.js"></script></font>In Part 2 we discuss: how previously reticent search engines like Google and Yahoo have begun to participate in the Semantic Web in 2009, user interfaces for browsing and using data, what Tim Berners-Lee thinks of new computational engine Wolfram Alpha, how e-commerce vendors are moving into the Linked Data world, and finally how the Internet of Things intersects with the Semantic Web.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=15658&amp;cb=15658' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=15658&amp;n=15658' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<h2>Semantic Web and Search Engines Like Google, Yahoo</h2>
<p><em>RWW: You've been talking about the Semantic Web for many years now. Generally the view is that Semantic Web is great in theory, but we're still <a href="http://www.readwriteweb.com/archives/rdf_semantic_web_apps.php">not seeing a large number of commercial web apps that use RDF</a> (we've seen a number of scientific or academic ones). However we have <a href="http://www.readwriteweb.com/archives/understanding_the_new_web_era_web_30_linked_data_s.php">begun to see some traction with RDFa</a> (embedding RDF metadata into XHTML Web content), for example <a href="http://www.readwriteweb.com/archives/search_options_google_search_evolves.php">Google's Rich Snippets</a> and <a href="http://www.readwriteweb.com/archives/semtech_making_the_web_searchable_searchmonkey.php">Yahoo's SearchMonkey</a>. Has the takeup of RDFa taken you by surprise?</em></p>
<p>TBL: Not really, but the takeup by the <strong>search engines</strong> is interesting. In a way I was happy to see that, it was a milestone for those things to come out of the search engines. The search engines had typically not been keen on the Semantic Web - maybe you could argue that their business is making order out of chaos, and they're actually <em>happy</em> with the chaos. And if you provide them with the order, they don't immediately see the use of it. </p>
<div class="pullquote">
<p>"The search engines have not been keen on the Semantic Web [...] their business is making order out of chaos, and they're actually happy with the chaos."</p>
</div>
<p>Also I think there was misunderstanding in the search engine industry that the Semantic Web meant metadata, and metadata meant keywords, and keywords don't work because people lie. Because traditionally in information retrieval systems, keywords haven't proven up to the task of finding stuff on the Web. One of the reasons is that people lie, the other is that they can't be bothered to enter keywords. So keywords have gotten a bad reputation, then metadata in general was tarred with this 'keywords don't work' brush. Because a lot of Semantic Web data included metadata, then people thought that with Semantic Web data -- again, that people will lie and won't have the time to produce it. </p>
<p><img src="http://www.readwriteweb.com/images/rich-snippets.png" /><br />
<em>Google rich snippets example; image credit: Matt Cutts</em></p>
<p><em>Now</em> I think there's a realization that when you're putting data online, that people are motivated NOT to lie. For example when your band is going to produce its next album, or when your band is going to play next downtown, you're motivated to put that information up there on the Semantic Web. There's an awful lot of cases when actually data is really important to people; and it's on the web anyway. So I think it's great that some of the search engine companies are starting to read RDFa. </p>
<p>Does this mean that they [search engines] will start to absorb the whole RDF data model? If they do, then they will be able to start pulling all of the linked data cloud in. </p>
<div class="pullquote">
<p>"The web of linked data and the web of documents actually connect in both directions, with links."</p>
</div>

<p>Will they know what to do with it? Because when it's data in a very organized form, I think some people have been misunderstanding the Semantic Web as being something that tries to make a better search engine - i.e. when you type something into a little box. But of course the great thing about the Semantic Web is that you can query it, you can ask a complicated query of the Semantic Web, like a SQL query (we call it a SPARQL query), and that's such a different thing to be able to do. It really doesn't compare to a search engine. </p>
<p>You've got search for text phrases on one side (which is a useful tool) and querying of the data on the other. I think that those things will connect together a lot. </p>
<p>So I think people will search using a search text engine, and find a webpage. On the front of the webpage they'll find a link to some data, then they'll browse with a data browser, then they'll find a pattern which is really interesting, then they'll make their data system go and find all the things which are like that pattern (which is actually doing a query, but they'll not realize it), then they'll be in data mode with tables and doing statistical analysis, and in that statistical analysis they'll find an interesting object which has a home page, and they'll click on that, and go to a homepage and be <em>back</em> on the Web again. </p>
<p>So the web of linked data and the web of documents actually connect in both directions, with links.</p>

<h2>User Interfaces for Semantic Content</h2>
<p><em>RWW: At the recent SemTech conference, <a href="http://www.readwriteweb.com/archives/the_state_of_the_market_in_semantic_technologies.php">Tom Tague of Thomson Reuters' Calais project suggested</a> that user interfaces for semantic content are key in getting more take-up. With that in mind, I wonder if you've seen some great interfaces or designs for semantic applications in recent months - if so which ones and why did they impress you?</em></p>
<p>TBL: I think that whole area is very exciting at the moment. The only piece of hacking I've done over the past few years has been on a thing called <a href="http://www.w3.org/2005/ajar/tab">the Tabulator</a> [a data browser and editor], which is addressing exactly that. Partly because I wanted to be able to look at this data. And now there are lots of different ways that people need to be able to look at data. You need to be able to <strong>browse through it</strong> piece by piece, exploring the world of data. You need to be able to look for <strong>patterns</strong> of particular things that have happened. Because this is data, we need to be able to use all of the power that traditionally we've used for data. When I've pulled in my chosen data set, using a query, I want to be able to do [things like] maps, graphs, analysis, and statistical stuff. </p>
<p><img src="http://www.readwriteweb.com/images/tabulator_july09.jpg" /><br />
<em>W3C Tabulator, a data browser/editor; Image credit: <a href="http://www4.wiwiss.fu-berlin.de/bizer/d2r-server/publishing/">wiwiss.fu-berlin.de</a></em></p>
<p>So when you talk about user interfaces for this, it's really very very broad. Yes I think it's important. There's also the distinction we can make between the <strong>generic interfaces</strong> and the <strong>specific interfaces</strong>. </p>
<p>There will always be specific interfaces; for example if you're looking at calendar data, there's nothing else like a calendar that understands weeks, months and years. If you're looking at a genome, it's good to have a genetics-specific user interface. </p>
<div class="pullquote">
<p>"I want to be able to do maps, graphs, analysis, and statistical stuff."</p>
</div>
<p>However you also need to be able to connect that data, through generic interfaces. So if my genome data was taken during an experiment which happened over a particular period, I need to be able to look at that in the calendar - so I can connect the genetics to the calendar. </p>
<p>So one of the things I hope to see is domain-specific things for various different domains, <em>and</em> the generic user interfaces. And hopefully the generic interfaces will be able to tie together all of the domains.</p>

<p><b><em>Next Page: Wolfram Alpha; e-Commerce and Linked Data</em></b></p>

<!--nextpage-->

<h2>Wolfram Alpha and Natural Language Interfaces</h2>
<p><em>RWW: An interesting new product was <a href="http://www.readwriteweb.com/archives/wolframalpha_our_first_impressions.php">launched this year</a> called <a href="http://www.wolframalpha.com/">Wolfram|Alpha</a>, described as a 'computational knowledge engine.' It's kind of a mix between Google (search) and Wikipedia (knowledge), and its key attribute is that enables you to compute something. The founders think that 'computing' things on the fly is something we're going to see a lot of in future. What's your take on Wolfram|Alpha?</em></p>
<p>TBL: There are two parts to that sort of technology. One of them is a sort of stilted natural language interface. We've seen those sort of natural language queries for years. Boris Katz [from W3C] created a system called <a href="http://start.csail.mit.edu/start-system.html">START</a> <em>[a software system designed to answer questions that are posed to it in natural language]</em>. I think with the Semantic Web out there, those sorts of interfaces are going to become important, very valuable, because people will be able to ask more complicated things. The search engine has traditionally been limited to just a phrase, but some of the search engines are now starting to realize that  if they put data behind them and have computation engines, then you can ask things like 'what's this many pounds in dollars?' and so on. So yes, those interfaces will become important. </p>
<div class="pullquote">
<p>"Those sorts of interfaces will become important [...] people will be able to ask more complicated things."</p>
</div>

<p>Conversational interfaces have always been a really interesting avenue. We've had voice browser work in W3C, that has been an interesting alternative avenue. It's possible that as compute power goes up, we'll see a prolifieration of machines capable of doing voice. It'll move from the mainframe to being able to run on a laptop or your phone. As that happens, we'll get actual voice recognition and pattern natural language at the front end. That will perhaps be an important part of the Semantic Web. </p>
<p><img src="http://www.readwriteweb.com/images/wolfram_football.jpg" /></p>
<p>We talked before about what a great challenge the Semantic Web is going to be from a user interface point of view. Conversational interfaces are going to be part of [solving] that. Of course it's also going to be really valuable to have compositional interfaces - for the visually impaired and so on. </p>
<p>Wolfram|Alpha is also a large curated database of data sets. Obviously I'm interested in the big data set which is out there, which is Linked Data. This everybody can connect to. I don't really know a lot about the internals of Wolfram|Alpha's data set. I don't know whether they're likely to put any of it out on the web as Linked Data - that might be an interesting addition. I imagine that quite a lot of it may have come <em>from</em> the web of Linked Data.</p>

<h2>e-Commerce and Linked Data</h2>
<p><em>RWW: There have been <a href="http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html">reports recently</a> that both Google and Yahoo will be supporting the Good Relations ontology and linked data for e-commerce. Companies such as Best Buy are already putting out product information in RDFa. What would be your advice to e-commerce vendors right now, to help them transition to this world of structured data on the Web. The same question could be asked across many verticals, but e-commerce seems like one area which has some momentum right now. Would you advise them just to put out their data as Linked Data?</em></p>
<p>TBL: Yup! Certainly this year is the year to do it. I've been advising governments to do it and when you look at an enterprise, you find that a lot of the issues are the same. But when you put your data from government or enterprise out there, make sure you don't disturb existing ecosystems. Don't threaten those systems, because you've spent years building them up.  </p>

<p>Maybe there's an analogy with when the Web first started and the first bookshops went online. They were more or less a flyer, saying 'hey we have a great bookshop at 23 Main St, come on down!'. Let's say that a person named Joe owned one of these early online bookshops. If somebody had suggested to Joe that he should put his catalog online, Joe would've felt that that was very proprietary data. And he'd be worried that other bookshops would see where he was weak, so they'd be able to advertise themselves as filling that niche he's weak in. </p>
<div class="pullquote">
<p>"When you put your data out there, make sure you don't disturb existing ecosystems."</p>
</div>

<p>But when his competitors Fred and Albert put their catalogs online, then Joe can check which books people are browsing at Fred and Albert's websites. So Joe would [finally] be pursuaded to put his book catalog up online. But he doesn't put up the prices... until Albert and/or Fred does. And even if catalog and pricing is up there, <em>nobody</em> puts their stock levels online. And there was a period of time when nobody [i.e. online booksellers] had their stock levels up. But people got fed up with ordering stuff that wasn't in stock. So the first book shop to actually tell you about stock levels suddenly was then unbelievably attractive to its customers. </p>
<p>So there's this syndrome of <strong>progressive competitive disclosure</strong>. This happens when people realize that if you're going to do business with somebody, if you're going to have your partners up and down the supply chain, really it's useful to check the data web - and life goes much more quickly and open. </p>

<p>Best Buy may be what starts the ball rolling [among e-commerce vendors]. Now if I want to look out for what [products are] available, I can write a program to see what there is. If somebody wants to compete with Best Buy, to my program they'll be invisible unless they can get their data up in RDF. Doesn't matter whether they use RDFa or RDF XML, as long as it maps in a standard fashion to the RDF model, then they will be visible.</p>

<p><b><em>Next Page: Internet of Things; Conclusion</em></b></p>

<!--nextpage-->

<h2>The Internet of Things</h2>
<p><em>RWW: I'm fascinated by how the Internet is becoming more and more integrated into the real world. For example the <a href="http://www.readwriteweb.com/archives/5_companies_building_the_internet_of_things.php">Internet of Things</a>, where everyday objects become <a href="http://www.readwriteweb.com/archives/pachube_internet-enabled_environments.php">Internet connected via sensors</a>. Have you been following this trend closely too, and if so what impact do you think this will have on the Web in say 5 years time?</em></p>
<p>TBL: It connects very much with Semantic Web [and] with linked data. With Linked Data you've got the ability to give a thing a URI. So I can give a URI to my phone, and I can say <em>that's</em> my phone in Linked Data. And also the company that made it can give a URI to the model of the phone. They can also put online all the specs of the phone, and then I can make a link to say that my phone is an example of that product. So now any system which is dealing with me and has access to that data will be able to figure out the sorts of things I can do with my phone, which actually is really valuable. Especially if the phone breaks. </p>
<div class="pullquote">
<p>"The Semantic Web is a web of things, conceptually. Tying an actual thing down to a part of the web is the last mile."</p>
</div>

<p>The Semantic Web has already given URIs to things, and to types of things. When the things themselves have an RFID chip in them, then I think it's a very exciting world. One can take that RFID chip, go to the Internet and find out the data about the thing. Whether we'll be able to do that, whether the manufacturers will be open enough to <em>allow me</em> to turn data about the identifier of the thing into data <em>about the thing</em>, is yet to be seen. But it's a very exciting idea. </p>
<p><img src="http://www.readwriteweb.com/images/pachube.png" /><br /><em><a href="http://www.pachube.com/">Pachube</a>, an example of the Internet of Things (see <a href="http://www.readwriteweb.com/archives/pachube_internet-enabled_environments.php">ReadWriteWeb profile</a>)</em></p>
<p>Similarly, I'd like to be able to scan a barcode and get back nutritional information about what's in - for example - a can of food. But we don't have that yet. To get that sort of thing, which is very powerful, we need to build look-up systems, which allow you to translate an RFID code or a barcode into an HTTP address. </p>
<p>The Semantic Web is a web of things, conceptually. Tying an actual thing down to a part of the web is the last link - the last mile. Give the thing a notion of its own identity in the web.</p>


<h2>Conclusion</h2>
<p>RWW: The over-riding message in both <a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">Part 1</a> and 2 of our interview with Tim Berners-Lee, is for companies and organizations to <strong>make their data available online</strong>. Preferably as Linked Data, which uses a subset of Semantic Web technologies. But Berners-Lee noted, in Part 1 of our interview, that he'd even be happy with the data in CSV (comma separated values) format. </p>
<p>It's clear that we've seen a lot of progress in linked data already in 2009. In upcoming posts on ReadWriteWeb, we'll continue to track this trend and explain how organizations can contribute their data.</p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php</link>
         <guid>http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php</guid>
         <category>Interviews</category>
         <pubDate>Thu, 09 Jul 2009 06:00:00 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
      <item>
         <title>ReadWriteWeb Interview With Tim Berners-Lee, Part 1: Linked Data</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/tbl_may08.jpg" />During my recent trip to Boston, I had the opportunity to visit MIT. At the end of a long day of meetings with various MIT tech masterminds, I made my way to the funny shaped building (see photo right-below) where the World Wide Web Consortium (W3C) and its director Tim Berners-Lee work. <font style="float: right; margin-left: 10px;"><script type="text/javascript">
tweetmeme_url = 'http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php';
tweetmeme_source = 'rww';
</script><script type="text/javascript" src="http://tweetmeme.com/i/scripts/button.js"></script></font>Berners-Lee is of course the man who invented the World Wide Web 20 years ago.</p> 
<p>This was my first meeting with the Web's creator, whose work and philosophy was a direct inspiration for me when I launched ReadWriteWeb back in 2003.<sup>1</sup></p>
]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=15639&amp;cb=15639' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=15639&amp;n=15639' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<p><img src="http://www.readwriteweb.com/images/w3c_building.jpg" align="right" />After shaking hands, I told Tim Berners-Lee that this blog's name was in part inspired by the first browser, which he developed, called &quot;<em>WorldWideWeb</em>&quot;. That was a read/write browser; meaning you could not only browse and read content, but  create and edit content too. It was a shame then when Mosaic, a read-only browser, became the first mainstream web browser in the mid-90s. It wasn't until the rise of Web 2.0 that the read/write philosophy gained widespread acceptance.<sup>2</sup> On that note, we launched into the interview... </p>
<p><strong>Note:</strong> the interview will be published in two parts, with Part 1 today on the topic of Linked Data. Part 2 will explore other topics and will run tomorrow.</p>

<p><strong>UPDATE: <a href="http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php">Part 2 of this interview</a></strong> is now available</b>.</p>
<h2>How Linked Data Relates to The Semantic Web</h2>
<p><em>RWW: Earlier this year you gave <a href="http://www.readwriteweb.com/archives/linked_data_is_blooming_why_you_should_care.php">an inspiring talk at TED about Linked Data</a>. You described Linked Data as a sea change akin to the invention of the WWW itself - i.e. we've gone from a Web of documents to a Web of data. Can you please explain though how Linked Data relates to the Semantic Web, is it a subset of it?</em></p>
<p>TBL: They fit in completely, in that the linked data actually uses a small slice of all the various technologies that people have put together and standardized for the Semantic Web. </p>
<div class="pullquote">
<p>Linked Data uses a small slice of the technologies that make up the Semantic Web.</p>
</div>
<p>We started off with the Semantic Web roadmap, which had lots of languages that we wanted to create. [However] the community as a whole got a bit distracted from the idea that <em>actually</em> the most important piece is the interoperability of the data. The fact that things are identified with URIs is the key thing. </p>
<p>The Semantic Web and Linked Data connect because when we've got this web of linked data, there are already lots of technologies which exist to do fancy things with it. But it's time now to concentrate on getting the web of linked data out there. </p>
<p><img src="http://www.readwriteweb.com/images/tbl_rgm_july09.jpg" /><br />
<em>Web inventor Tim Berners-Lee and ReadWriteWeb founder Richard MacManus</em></p>
<h2>How Linked Data Has Evolved via Grassroots</h2>
<p><em>RWW: Linked Data has had a lot of grassroots support, which you mentioned in your TED speech. This is something Semantic Web technologies, such as RDF, have struggled to get over the years. Has the W3C been pushing the more bottom-up Linked Data world, because of the frustration over <a href="http://www.readwriteweb.com/archives/rdf_semantic_web_apps.php">lack of take-up of top-down Semantic Web</a>?</em></p>
<p>TBL: A lot of the initial RDF and OWL projects came out of the academic world; and some of them were projects to show what you could do in a closed world. And the files were zipped up and left on a disc. While they were interesting projects, and while the systems were useful systems, the semantic web community maybe missed the point of the 'web' bit and  focused too much on the 'semantic'. However the work that's been done in the Semantic Web, the standards, was really valuable. It's relatively recently for example that <a href="http://www.w3.org/TR/rdf-sparql-query/">SPARQL</a> [an RDF query language] has been developed. </p>
<div class="pullquote">
<p>"It's time now to concentrate on getting the web of linked data out there."</p>
</div>
<p>Somebody drew an analogy the other day: can you imagine trying to promote a world of databases without SQL? Even though it's not an interoperable protocol, it's just a query language. So similarly, all that's been put into RDF, rdfs and OWL is very valuable to the linked data community. </p>
<p>The Linked Data community tend to use a subset of that [Semantic Web technologies], of OWL for example. But they certainly use SPARQL. So you could argue that really it wasn't ready to be deployed widely. </p>
<p>Linked Data started as a very informal <a href="http://www.w3.org/DesignIssues/LinkedData.html">Design Issues note</a> that I put in; it was a grassroots movement from very early on. So <em>yes</em> W3C has been emphasizing the importance of Linked Data. It's been the <a href="http://www.w3.org/2001/sw/interest/">Semantic Web Interest Group</a> of course, and various [other Semantic Web] activities, which has been pushing it. But also Linked Data has been <em>seized on</em> - a group of people for example put together <a href="http://dbpedia.org/">DBpedia</a>.<sup>3</sup> That wasn't commissioned, that was that they just thought it would be a really cool idea.</p>
<p><img src="http://www.readwriteweb.com/images/web_data_apr09c.jpg" /><br />
<em>Graph of Linked Data sets on the Web, as at March 2009</em></p>
<h2>Linked Data and Governments<br />
</h2>
<p><em>RWW: In <a href="http://www.w3.org/DesignIssues/GovData.html">a recent Design Issues note</a>, you urge <a href="http://www.readwriteweb.com/archives/open_goverment_berners-lee_and_the_uk_to_show_obam.php">governments to put their data online</a> as Linked Data (although you'd also be happy for governments to just make available the raw data - presumably so that others can then structure it). What do you realistically expect, for example, the U.S. or U.K. governments to do over the next year? And in the near future, do you foresee different governments interconnecting their Linked Data sets? </em></p>
<p>TBL: One can't generalize, governments are (like most big organizations) fascinatingly diverse inside them. So you'll find that there are places inside governments where you get a champion who <em>gets</em> linked data and who's just written a script and produced some linked data. So in the UK government for example, you'll find there's RDFa [in the code of its website] for civil service jobs. So if somebody wants to make a database of all the jobs, they can do that very easily. </p>
<div class="pullquote">
<p>"The first step of actually putting the data out there is the one that nobody else can do."</p>
</div>
<p>There are other cases where the easiest thing for somebody to do is to just put data up in whatever form it's available. Comma separated values (CSV) files are remarkably popular. They're exported sometimes from spreadsheets. It's remarkable how much information is in spreadsheets. Or sometimes pulled out of a database and then put up on the web. It's not as good, not as useful to the community, as if Linked Data had been put up there and linked. But the first step of actually putting the data out there is the one that nobody else can do. </p>
<p><img src="http://www.readwriteweb.com/images/Data.govscreen.jpg" /><br />
  <em><a href="http://data.gov/">Data.gov</a>, a catalog of public data, was <a href="http://www.readwriteweb.com/archives/datagov_finally_launches_looks_nice_but_short_on_d.php">launched in May by the U.S. government</a></em></p>
<p>The way to go is for government departments to go the extra step and convert [their data] into Linked Data. One of the nice things about Linked Data, when they have a pile of it, is that they could run a SPARQL server on it. SPARQL servers are a commodity product,  a solution for all of the people who say 'but actually I wanted to have XML.' A SPARQL server will  generate an XML file [and] allow somebody to write out, effectively, a URL for the XML file. </p>
<div class="pullquote">
<p>"Linked Data is the backplane, it's the thing that you connect to in both directions."</p>
</div>
<p>In fact, I don't see why SPARQL servers shouldn't provide CSV files, something which as far as I know isn't in the standards. But I'd recommend it, certainly in government context, because CSV files are what people have and what people want. </p>
<p>So the message [for government] is to use RDF. Linked Data is the <a href="http://en.wikipedia.org/wiki/Backplane">backplane</a>, it's the thing that you connect to in both directions. As a [web] producer your job is to make sure that you produce Linked Data one way or another. And as a consumer, there are lots of ways to consume that data once it's out there as Linked Data. </p>
<p><em><strong>In <a href="http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php">Part 2 of this interview</a></strong> we discuss: how previously reticent search engines like Google and Yahoo have begun to participate in the Semantic Web in 2009, user interfaces for browsing and using data, what Tim Berners-Lee thinks of new computational engine Wolfram Alpha, how e-commerce vendors are moving into the Linked Data world, and finally how the Internet of Things intersects with the Semantic Web. <strong><a href="http://www.readwriteweb.com/archives/readwriteweb_interview_with_tim_berners-lee_part_2.php">Read Part 2 here</a></strong>.</em></p>
<p><strong>Footnotes:</strong></p>
<p>1. The <a href="http://www.readwriteweb.com/archives/the_readwrite_w.php">very first sentence written on this blog</a>, on 20 April, 2003, was: &quot;The World Wide Web in 2003 is beginning to fulfill the hopes that Tim Berners-Lee had for it over 10 years ago when he created it.&quot;</p>
<p>2. For more on read/write browsers, you can read another early RWW post entitled <a href="http://www.readwriteweb.com/archives/what_became_of.php">What became of the Browser/Editor</a>.</p>
<p>3. DBpedia is a community project to extract structured information from Wikipedia; see <a href="http://www.readwriteweb.com/archives/where_to_find_open_data_on_the.php">ReadWriteWeb's profile</a> of this and similar resources.</p>
]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php</link>
         <guid>http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php</guid>
         <category>Interviews</category>
         <pubDate>Wed, 08 Jul 2009 06:00:00 -0800</pubDate>
<author>Richard MacManus</author>
      </item>
      
      <item>
         <title>Faviki&apos;s Social Bookmarking Tool Makes Semantic Tagging Even Easier</title>
		<description><![CDATA[<p><img src="http://www.readwriteweb.com/images/faviki_001.png">When <a href="http://www.readwriteweb.com/archives/semantic_tagging_with_faviki.php">we first looked</a> at <a href="http://www.faviki.com/">Faviki</a>, a social bookmarking application which made its debut last year, we were intrigued by their idea of "semantic tagging." What makes Faviki different from its competitors, services like <a href="http://del.icio.us">del.icio.us</a>, <a href="http://www.diigo.com/">Diigo</a>, and the now-defunct <a href="http://ma.gnolia.com/">Ma.gnolia</a>, is the way the service suggests tags to its users. The suggestions don't come from the community of Faviki users and their tagging history - they come from structured info extracted from the <a href="http://wikipedia.org">Wikipedia</a> database. </p>

<p>Today, Faviki is releasing an upgrade to their service which will give you even better control over the tagging process, making bookmarking even easier than before. They're also announcing support for OpenID. </p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=15580&amp;cb=15580' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=15580&amp;n=15580' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[

<h2>A Better Tagging Interface</h2>

<p>The biggest upgrade today is Faviki's enhanced tagging interface. In the past, <a href="http://www.faviki.com">Faviki</a> struggled with some of the tag suggestions pulled out of Wikipedia because they were too long and too hard to enter for practical use. Plus, users wanted to use tags of their own creation, not the tag suggestions. </p>

<p>For example, if someone is tagging an article about the soccer player "Filippo Inzaghi," they may want to tag it by the player's nickname "Pippo." Before, this was not possible. But now, if Faviki doesn't understand a tag, it will pull in possible matches and ask you "What exactly do you mean by ______?" After you pick your selection, Faviki will remember your choice. </p>

<p><img src="http://www.readwriteweb.com/images/faviki_tagging1.png"></p>

<p>This is an important change for the service because it means users can tag web pages any which way they want, but they're still linked to the structured data on the back-end. That way, when someone searches through Faviki's community tags, all the web pages for that particular item or concept will appear, even if people tagged them using their own personal keywords. </p>

<h2>Beyond Wikipedia</h2>

<p>Another change in <a href="http://www.faviki.com">Faviki's</a> service is the ability to define new tags. Prior to today, the service was limited to searching Wikipedia for tag suggestions, but now it has the whole web at its fingertips. If a tag is entered which doesn't match anything from Wikipedia, Faviki will search Google for relevant URLs and then ask if the links presented represent the same tag. As multiple users go through this process, Faviki learns what URLs best represent that concept and adds the new tags created by the users to its database.</p>

<p><img src="http://www.readwriteweb.com/images/faviki_tagging2.png"></p>

<h2>API, OpenID, and More</h2>

<p><a href="http://www.faviki.com">Faviki</a> has also just launched a <a href="http://faviki.wikia.com/wiki/API">Save/Edit API</a> that provides a way to save and edit bookmarks from other applications. In addition, they've introduced support for OpenID. Other new features arriving today include a smarter autocomplete list, the ability to convert tags, spam control, the ability to export/backup your bookmarks, and a new tag description tooltip. </p>

<p>The only issue we have with <a href="ttp://www.faviki.com">Faviki</a> is the same one we had before: there's still no import function available. That means you'll have to leave your extensive bookmark collection behind if you want to use this service. We suppose that it could be difficult to properly tag and match all of our old bookmarks, but without this feature, Faviki doesn't have the best shot at attracting the heaviest users of social bookmarking services. </p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/favikis_social_bookmarking_tool_makes_semantic_tagging_easier.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/favikis_social_bookmarking_tool_makes_semantic_tagging_easier.php</link>
         <guid>http://www.readwriteweb.com/archives/favikis_social_bookmarking_tool_makes_semantic_tagging_easier.php</guid>
         <category>Products</category>
         <pubDate>Thu, 02 Jul 2009 06:04:01 -0800</pubDate>
<author>Sarah Perez</author>
      </item>
      
      <item>
         <title>Search and Rescue: 6 Approaches to Semantic Data Collection</title>
		<description><![CDATA[<p><img alt="semantic_search_logo_jun09.jpg" src="http://www.readwriteweb.com/semantic_search_logo_jun09.jpg" width="150" height="150">It's been more than ten years since Tim Berners-Lee first spoke about the semantic web and computers indexing all web-based data. He said, "The day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The 'intelligent agents' people have touted for ages will finally materialize." Since then a handful of companies have attempted to tackle the issue of machine-based indexing and language interpretation. None of them are perfect. Below are 6 unique approaches to semantic data collection.</p>]]>
<![CDATA[<p align="right"><em>Sponsor</em><br /><a href='http://d1.openx.org/ck.php?n=15511&amp;cb=15511' target='_blank'><img src='http://d1.openx.org/avw.php?zoneid=11205&amp;cb=15511&amp;n=15511' border='0' alt='' align="right" /></a></p>]]>

<![CDATA[<h2>1. <a href="http://powerset.com">Powerset</a></h2>
<img alt="semantic_search_bing_jun09.jpg" src="http://www.readwriteweb.com/semantic_search_bing_jun09.jpg" width="610" height="237">
This site was one of the first to publicly apply machine-based natural language processing to a consumer search engine. Nevertheless, because public expectations were so high, when Powerset launched a Wikipedia-only beta, <a href="http://www.readwriteweb.com/archives/powerset_vs_google.php">reviewers were harsh.</a> The site was acquired by Microsoft shortly after the initial launch and the team has been low key ever since.  While Powerset is one of the definitive semantic engines in existence, Microsoft is currently concentrating on using Powerset's technology to index Wikipedia pages in Bing. Powerset's search result pages actually contain a "Try this on Bing Reference" note in the sidebar of the site.  
<br />
<br />
<h2>2. <a href="http://www.cuil.com/">Cuil</a></h2>
<img alt="semantic_search_cuil_jun09.jpg" src="http://www.readwriteweb.com/semantic_search_cuil_jun09.jpg" width="610" height="210">
This team touted its language processing product as being much faster to index pages than Google; however, consumers rarely covet speed over quality and the site <a href="http://www.readwriteweb.com/archives/cuil_publicity.php">was criticized right from the start</a>. Expectations were not met as Cuil's claim to 120 billion pages indexed did not match up to the results on <a href="http://www.readwriteweb.com/archives/google_hits_one_trillion_pages.php">Google's reported 1 trillion unique URLs.</a> However, what Cuil did right was separate related search results from regular web results. That being said, without any human intervention, the related results are often bizarre and irrelevant. For instance, my name produces the rankings of Ultimate Fighting Challenge Champions. 
<br />
<br />
<h2>3. <a href="http://hakia.com">Hakia</a></h2>
<img alt="semantic_search_hakia_jun09.jpg" src="http://www.readwriteweb.com/semantic_search_hakia_jun09.jpg" width="610" height="282">
This is a natural language search engine where sponsored results, regular web results and "credible" web results are broken down visually into separate categories. Similar to Wikipedia, Hakia <a href="http://www.readwriteweb.com/archives/hakia_relaunches_with_credible.php">employs a community monitoring system for credibility</a> and "credible" results must be peer reviewed and seemingly free of corporate interest. One of the great features of Hakia is that users can tab over the site to show only images or news. 
<br />
<br />
<h2>4. <a href="http://worio.com/search/">Worio</a></h2>
<img alt="semantic_search_worio_jun09.jpg" src="http://www.readwriteweb.com/semantic_search_worio_jun09.jpg" width="610" height="282">
Worio is considered a "discovery engine" as it is not technically a search engine destination site. While users are still required to visit the <a href="http://worio.com/search/">Worio destination</a>, search is actually powered by Yahoo, Google or Windows Live search. Regular web results appear in the larger left-side column and natural language-based "discoveries" appear on the right. These discoveries are further refined by personal bookmarks and shared relevancy with Facebook friends. 
<br />
<br />
<h2>5. <a href="http://labs.mozilla.com/2008/08/introducing-ubiquity/">Ubiquity</a></h2>
<object width="400" height="298"><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="movie" value="http://vimeo.com/moogaloop.swf?clip_id=1561578&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" /><embed src="http://vimeo.com/moogaloop.swf?clip_id=1561578&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" width="400" height="298"></embed></object><p><a href="http://vimeo.com/1561578">Ubiquity for Firefox</a> from <a href="http://vimeo.com/user532161">Aza Raskin</a> on <a href="http://vimeo.com">Vimeo</a>.</p>
Ubiquity is perhaps the opposite of a semantic web engine, but it serves a similar function for those looking to aggregate useful data. The Firefox plugin allows users to create command lines that incorporate natural language search with a series of mashups. Users can then combine relevant data from Craigslist, translation tools, maps, reviews  and social networks for easy user visualization. While the end product is an extremely useful document, users may not be ready for the drastic behavioral change of using command lines for semantic data collection. 
<br />
<br />
<h2>6. <a href="http://www.semanti.com/">Semanti</a></h2>
<img alt="semantic_search_semanti_jun09.jpg" src="http://www.readwriteweb.com/semantic_search_semanti_jun09.jpg" width="610" height="229">
From a consumer standpoint, Semanti sits somewhere on the spectrum between Worio and Ubiquity. ReadWriteWeb <a href="http://www.readwriteweb.com/readwritestart/2009/06/semantic-search-engine-gets-he.php">reviewed the product earlier this week</a> and like Ubiquity it is a Firefox plug-in rather than a destination site. However, like Worio, it employs leading search engines, bookmarking and Facebook friends to produce results. Semanti's key difference is that it prompts users to choose from multiple definitions prior to completing the search. Decision-making is actually human-powered rather than machine-powered. CEO, Bruce Johnson, said, "I tried machine-based semantic tagging, but my priority has always been a faster search experience." While this is not the "use of intelligent agents" that Berners-Lee suggested, it is a "semantic" tool in that it helps the user distill meaning and relevancy from language.

<p>If you've got more examples of semantic data collection tools, list them in the comments below. </p>]]>
<![CDATA[<strong><a href="http://www.readwriteweb.com/archives/search_and_rescue_6_approaches_to_semantic_data_collection.php#comments-open">Discuss</a></strong>]]>

</description>
         <link>http://www.readwriteweb.com/archives/search_and_rescue_6_approaches_to_semantic_data_collection.php</link>
         <guid>http://www.readwriteweb.com/archives/search_and_rescue_6_approaches_to_semantic_data_collection.php</guid>
         <category>Semantic Web</category>
         <pubDate>Thu, 25 Jun 2009 15:45:41 -0800</pubDate>
<author>Dana Oshiro</author>
      </item>
      
   </channel>
</rss>