semantic search - ReadWriteWeb http://www.readwriteweb.com/feeds/search/semantic search en Copyright 2009 Richard MacManus readwriteweb@gmail.com Tue, 24 Nov 2009 10:13:22 -0800 http://www.sixapart.com/movabletype/?v=4.23-en http://blogs.law.harvard.edu/tech/rss Quintura Launches Site Search - Private Beta Offer for RWW Readers QuinturaRussian search engine Quintura (one of our sponsors) has released a private beta of a custom search service for blogs or websites. It's similar to the Eurekster swicki we use on ReadWriteWeb. The defining feature of Quintura's service is a tag cloud that moves fluidly when you click it - see it in action on our network blog AltSearchEngines.

If you'd like to check out Quintura's private beta, click here. Read on for some background on the semantic technology powering Quintura...

]]>Sponsor

]]> Quintura has been marketing itself lately as a Google killer. But something I hadn't realised until recently is that Quintura uses Semantic Web technologies to power its search engine. Yakov Sadchikov, President and CEO of Quintura, told me that "as opposed to Google's PageRank, the Quintura algorithm utilizes an 'active' semantic networking technology. The Quintura algorithm involves a lot of parallel calculations that only became possible with today's computer processing power." He went on to say that "Quintura indexes a set of documents and builds a semantic network of them. The network is displayed visually as a dynamic tag cloud."

Theoretically this means that search results are based on context, rather than on how many people linked to it - as Google's PageRank essentially is. There is a lot of search innovation happening outside of Google, but semantic search is seen as The Holy Grail solution in some quarters. I still find Google's super-fast search to be more than adequate for my daily search needs, but as we move into this new Web era of Semantic Apps and open data, it's good to see new types of search emerging.

]]>Discuss]]>
http://www.readwriteweb.com/archives/quintura_launches_site_search.php http://www.readwriteweb.com/archives/quintura_launches_site_search.php Products Fri, 25 Jan 2008 10:40:02 -0800 Richard MacManus
Hakia Licenses its Semantic Search Technology Semantic search engine hakia is announcing today, at the Search Engine Strategies conference in New York, that it is licensing its proprietary OntoSem technology to other companies. This will enable third parties to build semantic search applications. The first such customer to be made public is RiverGlass, Inc, a provider of real-time analytics. RiverGlass will integrate hakia's OntoSem technology into its analysis software.

]]>Sponsor

]]> This is an interesting development by hakia - and has some parallels to the young Google, which you'll recall started out by licensing its search technology to the likes of Yahoo. But the parallels end there, because this move by hakia is more about licensing their underlying search technology to power the proprietary applications of other companies - whereas Google was a branded search app integrated into Yahoo's front-end.

According to hakia, this is what their OntoSem technology does:

  • information retrieval, analysis, and distribution
  • text summarization
  • information assurance and security
  • machine translation
  • ontology support
  • terminology standardization
  • supply chain automation

Essentially, it will enable third parties to find and use "the meaning of language" in their applications. Hakia's definition of 'semantic search' by the way differs from the traditional Semantic Web definition, in that hakia search aims to automatically determine meaning from search queries using its algorithms - whereas Semantic Web is all about adding metadata to information to enable connections between data.

At this early stage there aren't any visuals from RiverGlass showing how they're using hakia technology, but the company told us that "we will see the biggest boon in increased relevancy of results".

Disclosure: hakia is a RWW sponsor

]]>Discuss]]>
http://www.readwriteweb.com/archives/hakia_licenses_semantic_search.php http://www.readwriteweb.com/archives/hakia_licenses_semantic_search.php Alt Search Engines Tue, 18 Mar 2008 08:00:00 -0800 Richard MacManus
Web 3.0 Manifesto Published - Suggest a Web 3.0 App and Win a Defrag Ticket! Project10X has just published a "Web 3.0 Manifesto". It's a kind of sequel to their Semantic Wave 2008 report released in January this year. Mills Davis, Managing Director of Project10X, told us via email that the new manifesto "reveals how semantic technologies will drive product and service opportunities in the next stage of the internet."

You can download the Executive Summary here. We got a look at the whole report and it is packed full of great data, including the two top 10 lists of Semantic Web opportunities detailed below.

Also in this post we're giving away 3 free tickets to Defrag for the best suggestions in the comments for 'web 3.0' apps. See below for more details.

]]>Sponsor

]]> Top-10 List [of Semantic technology opportunities] -- Consumer
1 Interest networking
2 Semantic social networking
3 Semantic bookmarks
4 Semantic search & QA
5 Semantic desktop / webtop
6 Semantic blogs, wikis
7 Semantic identity management
8 Semantic mobility
9 Semantic email & IM
10 Reality browsing, avatars, & context-aware games

Top-10 List -- Enterprise
1 Information sharing
2 Semantic search, discovery, & navigation
3 Semantic mashups and composite applications
4 Semantic infrastructure / middleware SSOA, SBPM, SWS, virtualization, policy-based computing
5 Semantic business intelligence
6 Semantic ERP applications CRM, PLM, SCM, HRM
7 Semantic governance, compliance, & risk
8 Semantic web sites, wikis, collaboration, interest networking, & collective knowledge systems
9 Semantic advertising, marketing, personalization, & customization
10 Intelligent systems knowledge-based research, design, engineering, simulation, planning, scheduling, optimization, & decision support.

Win a Ticket to Defrag

Totally unrelated to Project10X's report, but we just happen to have 3 tickets to the Defrag conference in Denver Nov 3-4 to give away. To win one, simply leave a comment in this post detailing what kind of 'Web 3.0' app you wish to see developed. You can use the above top 10 lists as inspiration, or wing it ;-)

The best 3 as chosen by our editors will win a full ticket to Defrag.

UPDATE: Thanks everyone for the great comments. The 3 winners of the Defrag passes are:

Scott Brinker
Jesse Wilkins
Edward Benson

]]>Discuss]]>
http://www.readwriteweb.com/archives/web_30_manifesto_published.php http://www.readwriteweb.com/archives/web_30_manifesto_published.php Semantic Web Thu, 16 Oct 2008 00:22:46 -0800 Richard MacManus
Evri Beta Launches: Search Less - Understand More

Evri, a Paul Allen backed semantic search engine, is launching into a limited beta tonight. Evri was first shown publicly at the D6 conference. Evri's CEO Neil Roseman likes to talk about Evri in terms of organizing content instead of calling it a search engine. At its core, however, Evri definitely is a search engine, though it adds a very sophisticated semantic layer on top of its results that emphasizes the relationships between different search terms.

]]>Sponsor

]]> In its early stages, Evri is only going to start out with a limited set of results and possible search terms, based on what it considers to be the most popular terms and people. This approach of starting with only the most popular terms is reminiscent of Mahalo. However, unlike Mahalo, which relies on paid editors and volunteers to create its results, Evri completely relies on its algorithms to create connections between people, products, concepts, and events.

Evri especially prides itself for having developed a system that can distinguish between grammatical objects such subjects, verbs, and objects to create these connections. In his demo at D6, Roseman described the system as being similar to "an army of 7th grade grammar students graphing the Web."

evri-screen.png

Evri is entering in direct competition with a number of recent entries to the semantic search market, especially Powerset and Hakia. Powerset, however, only indexes Wikipedia articles, while Hakia tries to index all of the web, but focuses less on the relationships between objects and more on providing highly organized results for a given term.

You can sign up for invites to Evri on their homepage. The first wave of users should be receiving invites tonight.

For a more in-depth look at the state of semantic search, see also Alex Iskold's article on the myth and reality of semantic search.

]]>Discuss]]>
http://www.readwriteweb.com/archives/evri_beta_launches_search_less.php http://www.readwriteweb.com/archives/evri_beta_launches_search_less.php News Tue, 24 Jun 2008 21:01:00 -0800 Frederic Lardinois
Do Semantic Search Companies Need a Semantic Map? It's All Semantics... This week we reported that Cognition had announced "the largest commercially available Semantic Map of the English language." In our interview with Cognition CEO Scott Janus, we asked him to compare Cognition's technologies to those of other semantic search companies Hakia and Powerset. Janus pointed to their large Semantic Map as the main differentiator. Indeed he told us that semantic search companies "must include a comprehensive semantic map" to be successful.

Is this true? We sought a response from both Hakia and Microsoft-owned Powerset on this semantically charged question.

]]>Sponsor

]]> Cognition claims that its Semantic Map has over 10 million semantic connections, including "over 4 million semantic contexts (word meanings that create contexts for specific meanings of other related words)".

Hakia CEO Riza C. Berkan responded in the comments to the original article that "hakia is deploying Ontological Semantics (OntoSem)", which he described as "a network of concepts reflecting ontology." He went on to say that hakia covers "over [a] million words in English".

However Berkan noted that the size of a Semantic Map does not necessarily matter: "the sheer size of the collection of words or concepts does not represent, by any means, the capability of the system." Hakia's position is that "there is no silver bullet for a semantic solution that will succeed", as long as the system developed is scalable and imposes "minimum reliance on 'words'".

Semantopoly: Advance token to nearest Semantic Context

At this point we were still confused. Cognition uses the term "semantic map" and said it was necessary to have. One of the commenters on the original post agreed with that assumption. Yet Hakia's Riza Berkan didn't use the term "semantic map". So we asked Hakia in a follow-up email, does it or does it not have a semantic map? Dr. Christian Hempelmann, Hakia's Chief Scientific Officer, responded:

"The term sometimes comes up in the context of data integration, but "Semantic map" is not a term used in linguistics. I can only speculate that it is what is commonly called an ontology. To the degree that they let us on about it in the documentation on their website, Cognition operates with only 2 main relations, much like WordNet: hyperonymy/hyponymy (e.g. cat is-a feline is-a mammal; their "taxonomy") and synonymy (e.g., "buy" means almost the same as "purchase"; their "thesaurus"). Furthermore, this map is not independent of English, cannot grow into other languages. hakia, on the other hand, has an ontology with many more relations, effectively raising our "semantic map" to the size of a higher power, and can and is already growing into other languages."

We also tried to get a comment from Powerset, but as of writing we haven't received it.

So, are we all clearer now on what is a Semantic Map, is it needed, and does size matter? Er, it depends. If you think you know the answers, tell us in the comments please!

]]>Discuss]]>
http://www.readwriteweb.com/archives/do_semantic_search_companies_need_a_semantic_map.php http://www.readwriteweb.com/archives/do_semantic_search_companies_need_a_semantic_map.php Analysis Fri, 19 Sep 2008 15:05:28 -0800 Richard MacManus
Comment of the Day: Semantic Marketing Today's winning comment comes from Alex Iskold's must-read post Semantic Web Patterns: A Guide to Semantic Technologies. In the post Alex identifies the patterns that are beginning to emerge in the Semantic Web, classifies the different trends, and examines what the future holds. One of the comments to the post introduced us to the term "semantic marketing". Scott Brinker is curious about "how marketing will evolve to take advantage of the semantic web, whether it's in consumer or B2B plays."

]]>Sponsor

]]> Scott lists 7 possible missions for semantic marketing, leading with: "Marketing becomes the champion of generating the underlying data." (hmmm, that's similar to the issue of content management on corporate Intranets!). Here is Scott's full list of suggestions for semantic marketing, which you can also read here:

For participating in the discussion, Scott you've won a $30 Amazon voucher - courtesy of our competition sponsors AdaptiveBlue and their Netflix Queue Widget.

Great post.

One of the questions I find most fascinating is how marketing will evolve to take advantage of the semantic web, whether it's in consumer or B2B plays. I think this is more than a linear extension of how marketers have been optimizing the web today, but something qualitatively different. I suggest that SEO + Semantic Web = SEO++ (after all, it is sort of an object-oriented paradigm shift).

Here are 7 possible missions for "semantic marketing":

1. Marketing becomes the champion of generating the underlying data.
2. Marketing views categorization, metadata, RDF graphs, relevant microformats, etc., as a new kind of market positioning and placement -- "semantic branding", if you will.
3. Marketing takes a much broader view of distribution and promotion of its semantic web data in search engines and vertical networks (SEO++), including the sponsorship or creation of new niche semantic networks.
4. Marketing comes up with new ways to incentivize the conversion of semantic web interactions in real business objectives.
5. Marketing will have a real challenge with tracking and attributing distributed data in the semantic web to measure its impact -- from multi-touch marketing to micro-touch marketing. Hard problem but entrepreneurial ingenuity will prevail.
6. Marketing will want to leverage other people's data in their own value-add mash-ups (interesting "joint venture" semantic data partnerships), as well as for internal-only apps focused on market research and competitive intelligence.
7. Marketing will need to be concerned with brand protection in the semantic web: quality control to watch for bad data, conflicting data, competitive misuse, etc.

If you're interested, http://www.chiefmartec.com/2008/03/marketing-in-th.html is the full post. Would love feedback from other marketers and semantic web afficionados.

]]>Discuss]]>
http://www.readwriteweb.com/archives/semantic_marketing.php http://www.readwriteweb.com/archives/semantic_marketing.php Comments Competition Wed, 26 Mar 2008 23:00:00 -0800 Richard MacManus
Machine-Powered Medical Info: HealthBase Semantic Search healthbase_semantic_aug09.jpgWe've all seen how semantic technologies improve search results, but rarely do we see those results put to use in such a targeted way. Jens Tellefsen, VP of Marketing and Product Strategy at NetBase Solutions spoke to ReadWriteWeb about today's launch of healthBase - a medical search and discovery application. Using a variety of semantic indexing techniques, the company crawls the web's leading medical and health players including the Mayo Clinic, PubMed (US National Library of Medicine) WedMd, Medical News Today and Discovery Health. What makes this a truly unique technology is that rather than requiring any data manipulation from humans, Netbase's search results are completely automated.

]]>Sponsor

]]> Says Tellefsen, "Rather than using keywords or basic entities to search through billions of documents, NetBase can actually read and extract linguistic meaning from entire sentences and concepts." According to Tellefsen, healthBase can determine causal relationships, treatments and conditions and automatically aggregate that data into meaningful answers. Given the fact that more than 75% of the population seeks out online health information, a semantic tool with sentence-level understanding can potentially help dispel medical myths on a massive scale.

healthbase_semantic_aug09a.jpgNetBase employs the same principals across a variety of enterprise tools, but healthBase is its first foray into consumer-facing products. While the company is used to powering corporate, federal and market research, healthBase allows NetBase to show off its content intelligence tool in a way that gives us insight into our selves and our bodies.

Because NetBase is not reliant on manual annotation or custom taxonomies, the system is also very scalable. It took roughly 2 days to produce all of the data in healthBase - a feat that would never be possible by a combination human and machine system.

"It's important for us to address real issues with semantic technologies outside of a lab," Says Tellefsen. To try healthBase visit healthbase.netbase.com

]]>Discuss]]>
http://www.readwriteweb.com/archives/machine-powered_medical_info_healthbase_semantic_s.php http://www.readwriteweb.com/archives/machine-powered_medical_info_healthbase_semantic_s.php Semantic Web Wed, 02 Sep 2009 05:00:00 -0800 Dana Oshiro
50+ Semantic Web Pros to Follow on Twitter Here at ReadWriteWeb, we find the Semantic Web fascinating. We write about it a lot. What is the semantic web? The way we explain it is that it's a paradigm advocating that the meaning of content on the web be made machine readable.

Why would you want to do that? Because once the "meaning" of text is automatically discernible, there's a whole new world of things we can do with content on the web. Far out things that full text search for the mere presence of keywords would never be able to accomplish. Who's working on the semantic web and how can you meet them? Read on.

]]>Sponsor

]]> In November, 2007 we published a list of 10 Semantic Web companies to watch. Then, one year later, we published a new list for 2008 of Semantic Web companies to watch.

Based on those lists, and reader suggestions in comments of other companies that should be watched, we present to you a list of 50+ Twitter users who work at Semantic Web companies. If you find this sector as interesting as we do, you might want to add some of these people to your microblogging community. You can click through the arrows in the iframe below to scroll through all the accounts and add the people listed. RSS readers who'd like to see the list should click through to the full post.

Mashery

A handful of these are company accounts, but most are accounts from individual employees. Want to suggest anyone we missed? (We know there are lots we've missed!) Let us know in comments. You can also meet the RWW crew on Twitter.

If this iFrame is driving you batty, see also this old list of links to all the accounts displayed below.

]]>Discuss]]>
http://www.readwriteweb.com/archives/50_semantic_web_pros_to_follow_on_twitter.php http://www.readwriteweb.com/archives/50_semantic_web_pros_to_follow_on_twitter.php Semantic Web Mon, 19 Jan 2009 18:48:45 -0800 Marshall Kirkpatrick
Sponsor Announcement: Web 3.0 Conference The Web 3.0 Conference & Expo is happening October 16-17, Santa Clara, CA. ReadWriteWeb is a media sponsor of the event.

Web 3.0 Conference and Expo will explore the strategies, tools, technologies and the big ideas necessary for building impactful, socially relevant, and profitable Web 3.0 products, services and companies. So whether you are a designer, developer, entrepreneur, strategist, or venture capitalist, if you are thinking about the next generation of the Web, the Web 3.0 Conference is the place to be.

]]>Sponsor

]]> There will be keynote addresses by Scott Prevost of Powerset and Amaid Solomon of Peer39. Featured Business/Marketing track sessions include:

* Monetization Implications of Web 3.0 (Semantic Advertising, etc.)
* Product Marketing, Key Biz Strategies
* Business Risks of Web 3.0. What Risks?
* Business models, strategies, valuation
* Semantic Startup 101 - Successes, challenges, strategic decisions

Featured Technology track sessions include:

* From Web 2.0 to 3.0 - Tales from the Trenches
* Opportunity Costs of ODBC and relational data models
* Knowledge Discovery from semantic metadata
* Web 3.0 Channel Demo - mobile, video, social apps
* plus Semantic Search & Services Demos

Register by Oct 1 and save.

]]>Discuss]]>
http://www.readwriteweb.com/archives/sponsor_announcement_web_30_conference.php http://www.readwriteweb.com/archives/sponsor_announcement_web_30_conference.php Events Sat, 20 Sep 2008 04:59:58 -0800 Admin
A Productive Application of Semantic Search Noesis is a new semantic web search engine that helps scientists studying the environment access and retrieve the research data they need. Developed at the University of Alabama in Huntsville, the new engine has the potential to enable scientists and researchers everywhere to perform more productive and focused searches thanks to the semantic technology Noesis uses.

]]>Sponsor

]]> About Noesis

The Noesis search engine (PDF) is different than regular search engines because it employs the use of semantics to help its users better shape their search queries. The results of this lead to better, more accurate, and more complete sets of search results. Those results can then be refined even further by Noesis' end users if necessary.

The goal of the Noesis project is to provide scientists working in the field of Atmospheric Science a way to better search through the "hidden web" of scientific catalogs that traditional search engines cannot reach. Because these catalogs are built using a standard vocabulary, the most efficient searches on the catalogs involve using specific terminology.

To create Noesis, researchers simply annotated those specific vocabulary terms with ontologies - the machine-readable definitions for the words that help computers understand the concept of the term and its relationship to other terms. Of course, annotations alone do not make a semantic web search engine. The ontologies must be coupled with a tool that's capable of searching through them. To that end, Noesis employs something they call the Ontology Interface Service (OIS), a SOAP-based web service interface to an inference engine. When a user performs a search, the OIS is also immediately searched for associated concepts. The Specializations and Generalizations discovered are returned in a tree structure which the user can navigate further. Synonyms and related terms are also shown, and, using checkboxes, they can be appended to the original query to refine it further.

Although the project was designed for use in one select area of science, its framework could easily be replicated in other scientific fields of study.

The Semantic Web: Better in Niches?

The main problem with the semantic web today is that the assignment of those above-mentioned ontologies - the pieces of code that allow machines to grasp meanings that humans innately understand - is that there's no solid way to automate their assignment. At the present time, no automatic or semi-automatic processes to do so have been achieved...at least, not to the point that a true vision of a new, intelligent web can be realized.

Most of the time, annotating web resources must be done using manually inserted bits of code placed into various web pages. Obviously, that's a challenge when you consider the size of the internet - it would be impossible to manually annotate this ever-growing resource. Unfortunately, without automated methodologies, a true semantic web will remain an unrealized dream.

However, in smaller communities, the semantic web can easily become a reality. Scientific data catalogs only represent small portions of the web as whole. Because of their limited size, manually annotating the resources they contain is a manageable feat. This is the case with Noesis. It shows there is promise for the semantic web after all - if only in small niches.

Image credit: rule100

]]>Discuss]]>
http://www.readwriteweb.com/archives/a_productive_application_of_semantic_search.php http://www.readwriteweb.com/archives/a_productive_application_of_semantic_search.php Semantic Web Wed, 14 Jan 2009 08:01:38 -0800 Sarah Perez
Semantic Web Wish List 2009 At the end of last year we presented our list of the top 10 Semantic Web Apps of 2008. ReadWriteWeb reader Zoltán Andrejkovics wrote in to us afterwards, suggesting that we do a post looking at what Semantic Web apps we'd like to see emerge in 2009. Zoltán gave us 5 apps he wants to see this year, and we also asked our Twitter friends for their views (you can follow ReadWriteWeb on Twitter here).

We at ReadWriteWeb are tracking the Semantic Web space closely - so far we've identified 20 products (see our first 10, then 10 more) that we're paying particular close attention to. But we know there is a lot of opportunity yet for commercializing the Semantic Web, so we encourage you to add your wish list in the comments.

]]>Sponsor

]]> At ReadWriteWeb, we look for more commercial Web apps - whether they be consumer or enterprise. So here are 5 of those we'd like to see emerge and/or grow during 2009:

  1. Microsoft makes a very bold play with Powerset technology and starts to challenge Google in search (despite Google's attempts to use semantic web technology, we'd love Microsoft to ramp it up in search - competition is good for consumers!).
  2. Semantic Web advertising apps for publishers - we have our eye on Dapper MashupAds in this sector, but we'd like to see others take up this challenge too.
  3. Semantic apps for managing your finances - makes connections between transactions, things that you wouldn't normally pick up.
  4. Semantic apps for health industry - there are many opportunities here, but in general there is much the Semantic Web could do to organize the maze of data in the health indsutry.
  5. A Personalized Memetracker - Techmeme founder Gabe Rivera would be astonished if this happens, but we'd love to see a product that can give a Techmeme-like organization and layout to one's RSS feeds. So at a glance, you can see which stories in your own set of RSS feeds are hot and who's linking to them. Whether Semantic Web technology can achieve that, we don't know ;-)

Zoltán Andrejkovics, who suggested this topic, is a PhD student at Corvinus University of Budapest and his 5 wishes as a researcher are:

  1. Smart notes; easy to find/browse notes, using NLP search.
  2. Smart RSS; automatic article-collecting app based on my own interests.
  3. Mind writing; using not only words, but "thought" objects, that the NLP engine puts into words.
  4. Assistant; "my mirror", learns from my words, behavior on the net, and supports my work, handles calendar, etc.
  5. Smart bookmarks; works like smart notes.

Here are some reactions from RWW readers on Twitter (it was very short notice before this post was published, so if we missed you please add your wish list to the comments):

superphoebe: "I'd love to see more semantic blogging tools like Zemanta, but with more sources, really great search and a super simple dashboard."

Marcelo Sánchez: "Zigtag for bringing real semantic tagging and Freebase as the next Wikipedia"

garlin: "I'd like an app that uses semantic tech to identify/analyze the political bias in a particular article/piece of writing."

kevin grandia: "would love to see a better way of submitting content to conversion services like Calais."

Rama Mamuaya: "language based search engines like Hakia or Powerset should be rising fast. Should evolve from search engine, to answer engine."

Jean-Jacques Halans: "mobile safari reading microformats, for adding to calendar, contacts, lookup address on map"

Stephen Edgar: "More on the Semantic Wiki app's and API's such as http://tinyurl.com/27vnno"

Chris Saad: "my hope is to see APML import AND export from more apps ;)"

Tell us your Semantic Web App Wish List for 2009.

]]>Discuss]]>
http://www.readwriteweb.com/archives/semantic_web_wish_list_2009.php http://www.readwriteweb.com/archives/semantic_web_wish_list_2009.php Semantic Web Wed, 07 Jan 2009 19:59:10 -0800 Richard MacManus
SemanticHacker Offers Cynical Bounty for Semantic Apps Contextual ad platform Textwise launched SemanticHacker.com today, a contest for applications and business plans built on the company's semantic analysis technology. While the new API is offered for free, the contest will award three winners with $100k and one winner could score up to $1 million based on subsequent commercialization.

Semantic APIs only make sense, superior contextual advertising is clearly the most obvious cash-cow of semantics and a bounty sounds like a good idea - but there's something about this contest that doesn't quite feel right.

]]>Sponsor

]]>

"One Meeelion Dollars"...A Cynical Gimmick?

As is probably always the case, this bounty looks like a very cheap way to acquire some development that will pay off in a far larger sum. Textwise has spent years developing their technology so perhaps that's fair.

Taking the technology for a test run on SemanticHacker.com doesn't produce very exciting results, though. You can try it yourself right on the front page of the site. While the super-hyped Twine at least attempts to add all kinds of value to the text it analyzes, the Textwise engine's results are clearly suited only to general, topical and contextual advertising.

The polite way to put this, as the company did in its press outreach, is as follows:

The Textwise technology is focused on discovery as opposed to the “extraction” and metadata-based approaches of most semantic web technologies. This is not the same as the “who/what/when/where/how” thread that is based around RDF, OWL and other similar ontologies and approaches. No - this is a semantic discovery tool that can take in huge chunks of text and grok their essential "aboutness". The core technology (with origins in DARPA and intelligence agency stuff) is designed to decode the "DNA" of documents and be able to tell you what they're about.

Like we said, suited far better for advertising than for anything else more interesting. If semantic technology delivers nothing more than spy-agency-built ad networks then that's going to be a real tragedy.

Semantic web expert Paul Miller points out that the contest's limitation to US participants is also a real loss of opportunity and says he'd be much more interested in a bounty not tied to any particular vendor's semantic API.

Market Context

Semantic markup is ready to hit the big time now that Yahoo! has announced that it will index the stuff. A long list of other startups are doing something similar to what TextWise is doing today. Hakia started licensing its semantic search technology yesterday. We've written before about Dapper's forthcoming semantic ad network.

We've offered extensive coverage of the Reuters Open Calais semantic API extravaganza here, as well.

There's a lot of movement in this space; the market is downright crowded. A search through our "top blogs in the semantic web" custom search engine (from the RWW toolkit for 2008) shows that none of the 60 blogs we follow on this space have ever written about TextWise or the SemanticExchange network they participate in. Maybe that shows the limitations of our list of sources, but maybe it says that TextWise just got a whole lot more publicity for a relatively small amount of money.

]]>Discuss]]>
http://www.readwriteweb.com/archives/semantichacker_offers_bounty.php http://www.readwriteweb.com/archives/semantichacker_offers_bounty.php Semantic Web Wed, 19 Mar 2008 10:09:33 -0800 Marshall Kirkpatrick
The State of the Market in Semantic Technologies Tom Tague from Thomson Reuters' OpenCalais team did a keynote speech today at SemTech in San Jose. His presentation was a wonderful wrapup of current semantic technology trends, and what we can expect over the next few years.

To open, he said that where we are now in the evolution of the Web is content rich, but information poor - plus "experientially deficient". He suggested that 'web 3.0' is about cleaning up the mess of web 2.0 and improving interfaces. In terms of semantic technology, he explained that over the past 5 years it has evolved from invention of standards to a period of commercial innovation on top of those inventions. While standards are still being worked on, now "we are at an inflection point where innovation is exploding."

]]>Sponsor

]]> Tague called Calais, the project he leads at Thomson Reuters, "a web service a.k.a. plumbing". They've had 13 releases, talked with 100+ customers about Calais, have 13,000 registered developers. He put the ideas that he's been talking about with customers and developers into 6 buckets, which we've listed with sub-categories below.

Tools

  • Semantic data mgmt
  • Semantic data generation
  • Databases
  • Integration and workflow

Tague said that tools are important, particularly in the enterprise. He sounded a note of caution to tools vendors: they need to simplify their stories, along with have "simple basic tools."

Social

  • Semantics-powered link sharing
  • Network mining
  • News sharing
  • Tweet mining

Tague said that we shouldn't focus on providing "frosting" on top of current social Web tools. He advised to focus on commercial imperatives, such as the categories above.

Advertising

  • Semantic ad placement
  • Contextual ad placement
  • Semantically driven landing pages
  • Mashup ads

There are clearly opportunities to improve advertising using semantic technology, said Tague.

Search

Tague noted that semantic search may be "the answer to the question nobody is asking." He said that we should look at general "semantic search" vs domain specific semantically-enhanced search. The latter is where the commercial opportunity actually is, but he questioned the economics of general semantic search.

Publishing

He put this into 3 sub-categories:

  • A-Content Producers - from back office to user experience
  • B-Editorial + Aggregation Publishing Models
  • C-Robotic publishing - aggregation only

Tague explained that Calais has really focused on this over the last 8-9 months. He said that classic publishers can get an enormous amount of value from this. Right now the big focus is "back in the bolier room," for example to cut editors from 3 to 2. He expects that later on more focus will go on enhancing the user experience.

Tague thinks that B is the biggest opportunity, using Huffington Post as an example. He said that it gives a "near newspaper like experience" at perhaps a 5th of the cost. It's an area where they're seeing adoption of Calais.

Interface

Tague noted that gaming is a huge industry that the semantic technology industry can learn from. He listed these attributes:

  • Great story line
  • High interactivity, immediate responsiveness
  • No interuptions
  • Graphically engaging
  • Seamless
  • Fun

So he asked who out there is trying to really change the user experience in semantic technology? He listed 4 companies (all of whom we've profiled on ReadWriteWeb):

  • Zemanta
  • Apture
  • Feedly
  • Glue

Tague told the audience that the next big innovation in interface will be something that stays with the user where they are, which will be mobile and in the browser.

To sum up, Tague suggested that semantic technologies vendors should decide whether they care about semantics or about user value. If it's semantics, then be a tools vendor. He said the basic building blocks are out there already, so focus on user experience.

Disclosure: SemTech has been a recent sponsor of ReadWriteWeb

]]>Discuss]]>
http://www.readwriteweb.com/archives/the_state_of_the_market_in_semantic_technologies.php http://www.readwriteweb.com/archives/the_state_of_the_market_in_semantic_technologies.php Conferences Tue, 16 Jun 2009 09:23:17 -0800 Richard MacManus
Cognition Announces "World's Largest Semantic Map" Cognition Technologies, a Semantic Web company that specialises in Natural Language Processing (NLP) search, is today announcing the release of what it claims is "the largest commercially available Semantic Map of the English language." We interviewed Cognition CEO Scott Janus to find out what this means.

We also discovered that Cognition, which currently licenses its technology to other organizations, is planning to build a general consumer search engine - which will compete with Google and others.

]]>Sponsor

]]> What is a Semantic Map?

A Semantic Map is kind of like a dictionary, in that it's a representation of Cognition's ability to define things. Cognition claims that its Semantic Map has over 10 million semantic connections; over 4 million semantic contexts (word meanings that create contexts for specific meanings of other related words); over 536,000 word senses (word and phrase meanings); 75,000 concept classes (or synonym classes of word meanings); 7,500 nodes in the technology's ontology or classification scheme; and 506,000 word stems (roots of words) for the English language.

Image from Cognition

The company says that its Semantic Map "is more than double the size of any other computational linguistic dictionary for English".

Cognition Technologies has been working on its technology for 24 years, with a lot of input from lexicographers and linguists over that time. Because they've used a mix of algorithms and human input, Cognition has been able to discern relevancy, meaning, synonymy. Scott Janus told us that one of Cognition's strengths is that it can disambiguate words and phrases, which Janus says differentiates them from the keyword and pattern matching algorithms of Google, Yahoo and others.

For example Janus told us that Cognition's technology can find results even if direct words are not used - which he says Google can't do.

Cognition Plans General Search Engine

The comparisons to Google led us to ask the obvious question: does Cognition's semantic technology have a more general application? In other words, does Cogition plan to take on Google by creating a search engine for consumers? CEO Scott Janus replied that yes they do plan to "one day offer search on the general web". However he said that they need more capital funding to index the entire Web, put infrastructure in place, etc.

As of now Cognition will continue to license its semantic technology to verticals like law and health. Janus told us that Cognition is "good for complex content where lot of synonyms are used", so right now data-intensive industries are where it is aiming.

Cognition's current applications include legal (e.g. LexisNexis Concordance's case management), health (e.g. MEDLINE), and a semantically charged version of Wikipedia.

Image from Cognition

Cognition vs Powerset and Hakia

Two other Semantic search engines we've been tracking closely on ReadWriteWeb are Powerset and Hakia. We asked CEO Scott Janus what makes Cognition different from those two products?

In a nutshell, Janus says that its Semantic Map is bigger and better.

Specifically, he said that Powerset is actually "not so similar" to Cognition. According to Janus, Powerset does "parsing" - which it licensed from Xerox Parc. That is 20-25% of the solution, said Janus, but Powerset "doesn't have a good semantic map". Cognition went so far as to write a white paper (pdf) explaining why it thinks Powerset "misses the point".

As for Hakia, Janus said that as far as he can see Hakia is focused on "ontological classifications" - classifying words and concepts together. But he says Hakis doesn't have as full a semantic map as Cognition, so he thinks Cognition has "a better understanding" compared to Hakia.

In summary, Janus told us that semantic search companies "must include a comprehensive semantic map" to be successful. We're sure that Powerset and Hakia will have different opinions on what makes a successful semantic search company, but it does make for a good differentiator for Cognition.

Open Question

Tell us in the comments what you think of Cognition and whether you think it can compete with Google in the long run?

]]>Discuss]]>
http://www.readwriteweb.com/archives/cognition_semantic_map.php http://www.readwriteweb.com/archives/cognition_semantic_map.php Semantic Web Tue, 16 Sep 2008 09:55:00 -0800 Richard MacManus
And Nerds Became Kings: Yahoo! to Announce Semantic Web Support TechCrunch and Search Engine Land are reporting this morning that Yahoo! will now be indexing Semantic Web and Microformats markup from around the web and will use that information to display more structured search results. Here is the Yahoo! post about the news.

We asked last month how vulnerable Google is in search and the leveraging of standards-based structured data may be the most obvious approach to improving on the search industry's current best practices. As Tim Berners-Lee said just weeks ago the time for the semantic web is now.

]]>Sponsor

]]> What Does This Mean?

Here's one example of what that could mean: Today, a web service might work very hard to scour the internet to discover all the book reviews written on various sites, by friends of mine, who live in Europe. That would be so hard that no one would probably try it. The suite of technologies Yahoo! is moving to support will make such searches trivial. Once publishers start including things like hReview, FOAF and geoRSS in their content then Yahoo!, and other sites leveraging Yahoo! search results, will be able to ask easily what it is we want to do with those book reviews. Say hello to a new level of innovation.

This has been really geeky stuff for a long time, with little market traction and a whole lot of promises from academic research and outlying innovators. That will now change.

The basic idea behind Semantic Web technology is that by signaling what kind of content you are publishing on an item-by-item or field-by-field basis, publishers can help make the meaning of their text readable by machines. If machines are able to determine the meaning of the content on a page, then our human brains don't have to waste time determining, for example, which search results go beyond containing our keywords and actually mean what we are looking for.

Publishers will now be able to clearly designate content on a page as related to other particular content, as business card type information, as a calendar event, a review or as many other types of content. It will make Yahoo! a lot smarter and should shake up the world of Search Engine Optimization and web publishing, a lot.

Who Does the Markup?

Many observers of the Semantic Web, including us at times, have argued that it's unrealistic to expect web publishers to markup their own content and that a more realistic path to market for technologies based on semantics is to build applications that can parse the semantics out of other peoples' content from outside.

In my interview with Mark Zuckerberg last week, for example, the Facebook CEO expressed disinterest in participating in the Semantic Web. I didn't publish it in the interview, but he indicated such a move would be up to a third party site organizing information via the Facebook Platform if it was going to happen at all. He will probably change his tune now, as adding hCard support to Facebook public profiles will now be a no-brainer. Other publishers will be faced with similar questions.

Semantic web markup will quickly become standard practice though for all CMS/publishing systems and we'll wonder what we ever did without it or why it seemed so hard.

Google Will Soon Follow

This move by Yahoo! will likely be followed up by Google, it's just too much opportunity for any search engine to pass up. Semantic markup is like a content-level site map, something all the search engines have agreed on a standard for already. Semantic web technology is next. There will be big job opportunities, more than there are for SEO in the short term, for people who can help publishers implement Semantic Web markup retroactively and into the future.

The Semantic Web was one of a handful of topics that we identified as key themes for the coming year in our RWW Toolkit for 2008. Check that toolkit out for resources you can use to follow this important topic as it unfolds.

]]>Discuss]]>
http://www.readwriteweb.com/archives/yahoo_supports_semantic_web.php http://www.readwriteweb.com/archives/yahoo_supports_semantic_web.php Semantic Web Thu, 13 Mar 2008 09:35:11 -0800 Marshall Kirkpatrick