ReadWriteWeb

Funding the Semantic Web: Dapper's Ad Network Plan

Written by Marshall Kirkpatrick / February 6, 2008 9:15 AM / 2 Comments

The founders of the data extraction and API creation service Dapper announced this week that their aim is to leverage Dapper in the service of ad networks and derive a semantic index of pages around the web from that activity. They will launch their ad powering product at Ad:Tech in April. Essentially, it will perform ad funded indexing of the semantic web.

Here's how it will work: Dapper lets users identify and tag particular fields on any page. It then extracts the value in that field and makes it available in XML. As a result of this advertising activity, Dapper believes a substantial quantity of pages around the web could have fields of interest delineated and tagged with relevant terms. Relationships between pages and fields and terms and tags can all be extracted and analyzed from this aggregated activity.

The company has already built a demonstration semantic search engine based on Dapper activity and its ability to parse search results by semantic meaning and detail is quite sophisticated. The potential applications of a semantic index built by Dapper are really exciting to consider.

Dapper currently has 35,000 extraction functions (Dapps) created, but they are betting that a clear profit motive will incentivize advertisers to create many, many more. Advertisers will pay to have web content delineated by field and categorized.

The company argues that advertisers see substantially increased relevance and click-through if ads can be served based on very specific fields of content on a page. Early prototypes run on top music site Pitchfork and book summary site Shvoong saw 100 to 500% increases in CTR.

While Dapper's approach would likely leave the vast majority of fields on a page unindexed, it could also rack up a whole lot of semantic knowledge by riding the profit motive to discover the semantic meaning of the most monetizable fields on a far greater number of pages than would likely be analyzed otherwise. What better way to analyze the web than to ride along with ad networks? I can't think of any better way.

I think Dapper has a shot at helping fund the semantic analysis of much of the web. What will they do with the data other than use it to contextualize ads? That's another question, but an interesting one to consider.

Dappercamp was a great event this week and the tool itself is one I highly recommend. It's in startup mode and I'll be frank - many of the output formats simply don't work and there are a number of errors throughout the site. None the less, I derive significant value for my work every time I engage with it. Here's a screencast tutorial I recorded on the service. Several Dapps, Dapper-created data extractions, have become daily go-to sources of information for me - but I also recognize that only so many people are going to be as excited about this technology for research purposes. For the rest of the world, for the viability of the company, and for the potentially gigantic secondary benefit of widespread semantic indexing - I think putting Dapper in service of ad networks is a plan of simple brilliance.

Comments

Subscribe to comments for this post OR Subscribe to comments for all Read/WriteWeb posts

  • Are there any open source Semantic projects like Dapper ?

    Should an private company be the dominate force in collecting data and adding Semantic information to it ?

    Would and do we trust Microsoft or Google if they were doing this?

    Can I be asked if Dapper can crawl my content ?

    Can I tell them I dont want my content crawled ?

    Can I delete my content from Dappers servers ?

    Posted by: william | February 7, 2008 7:33 AM


  • Hi William,

    You pose excellent questions. Let's see if I can get you some good answers:

    > Are there any open source Semantic projects like Dapper ?

    To our knowledge, there aren't really any projects like Dapper, open-source or otherwise. While our extraction engine isn't open-source, many of the tools around it are. As we grow and add more semantic features, many of those are likely to be open-source as well.

    > Should an private company be the dominate force in collecting data and adding Semantic information to it ?

    I think that the last ten years of the Web have shown that there needs to be an economic incentive for the Semantic Web to come into existence. While the utopic notion of the Semantic Web being created by every website owner is a nice concept, we've seen that there just isn't incentive for site owners to create the semantic layer. Dapper gives the power to the people who have the incentive (the users of the content, as opposed to the site administrator).

    > Would and do we trust Microsoft or Google if they were doing this?

    If their intentions were, like ours, to revolutionize and democratize the flow of content on the web, I think yes. Making money along the way is a necessary ingredient in order to fund such an ambitious goal.

    > Can I be asked if Dapper can crawl my content ?

    Dapper is currently opt-out.

    > Can I tell them I dont want my content crawled ?

    Yes, see:

    http://www.dapper.net/control-access.php

    > Can I delete my content from Dappers servers ?

    Yes, from the above URL. We don't actually store any content - we only channel it back, but you can prevent that from being allowed.

    I hope this helps!

    Jon

    Posted by: Jon Aizen | February 7, 2008 10:47 AM




RECENT JOBS


RWW READERS


TEXT LINK ADS


RWW PARTNERS

adaptiveblue

Yahoo Buzz