<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" 
      xmlns:thr="http://purl.org/syndication/thread/1.0">
  <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php" />
  <link rel="self" type="application/atom+xml" href="http://www.readwriteweb.com/atom.xml" />
  <id>tag:,2008:/1/tag:72.47.210.69,2007://1.3641-</id>
  <updated>2008-07-02T20:25:28Z</updated>
  <title>Comments for Is Google a Semantic Search Engine?</title>
  
  <generator uri="http://www.sixapart.com/movabletype/">Movable Type 4.1</generator>
  <entry>
    <id>tag:72.47.210.69,2007://1.3641</id>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php" />
    <link rel="service.edit" type="application/atom+xml" href="http://www.readwriteweb.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=1/entry_id=3641" title="Is Google a Semantic Search Engine?" />
    <published>2007-03-26T20:00:03Z</published>
    <updated>2007-12-16T23:11:16Z</updated>
    <title>Is Google a Semantic Search Engine?</title>
    <summary><![CDATA[Written by Phill Midwinter, a search engineer from the UK. This is a great follow-up to our article last Friday, Hakia Takes On Google With Semantic Technologies. What is&nbsp;a Semantic Engine? Semantics are said to be ‘the next big thing’ in search engine technology. We technology bloggers routinely drum up articles about it and sell...]]></summary>
    <author>
      <name>Guest Author</name>
      
    </author>
    
    <category term="Analysis" />
    
    <content type="html" xml:lang="en" xml:base="http://www.readwriteweb.com/">
      <![CDATA[<p><i>Written by <a href="http://phillmidwinter.wordpress.com/">Phill Midwinter</a>,
a search engineer from the UK. This is a great follow-up to our article last
Friday, <a href="http://www.readwriteweb.com/archives/hakia_takes_on_google_semantic_search.php">Hakia
Takes On Google With Semantic Technologies</a>.</i></p>
<h2>What is&nbsp;a Semantic Engine?</h2>
<p>Semantics are said to be ‘the next big thing’ in search engine
technology. We technology bloggers routinely drum up articles about it and
sell it to you, the adoring masses, as a product that will change your web
experience forever. Problem is, we often forget to tell you exactly what
semantics are - we just get so excited. So let's explore this...</p>
<p>Wikipedia says:</p>
<blockquote>
<p>“<em><b>Semantics</b></em> (<a href="http://en.wikipedia.org/wiki/Ancient_Greek" title="Ancient Greek">Greek</a>
<i>semantikos</i>, giving signs, significant, symptomatic, from <i>sema</i>, <a href="http://en.wikipedia.org/wiki/Sign" title="Sign">sign</a>)
refers to the aspects of <a href="http://en.wikipedia.org/wiki/Meaning_(linguistic)" title="Meaning (linguistic)">meaning</a>
that are expressed in a <a href="http://en.wikipedia.org/wiki/Language" title="Language">language</a>,
<a href="http://en.wikipedia.org/wiki/Code" title="Code">code</a>,
or other form of representation. Semantics is contrasted with two other aspects
of meaningful expression, namely, <i><a href="http://en.wikipedia.org/wiki/Syntax" title="Syntax">syntax</a></i>,
the construction of complex signs from simpler signs, and <i><a href="http://en.wikipedia.org/wiki/Pragmatics" title="Pragmatics">
pragmatics</a></i>, the practical use of signs by <a href="http://en.wikipedia.org/wiki/Agent" title="Agent">agents</a>
or <a href="http://en.wikipedia.org/wiki/Community" title="Community">communities</a>
of interpretation in particular circumstances and contexts. By the usual
convention that calls a study or a theory by the name of its subject matter, <b><i>semantics</i></b>
may also denote the theoretical study of meaning in systems of signs.”</p>
</blockquote>
<p>...which is absolutely no help.</p>]]>
      <![CDATA[<p>Semantics as it relates to our topic, search engines, actually covers a few
closely related fields. As a basic example, the question is whether a
computer can discern a link between two words, such as <em>cat</em> and
<em>dog</em>. You and I both know that cats and dogs are common household pets,
and can be categorized as such. The human brain seems to comprehend this easily,
but for a computer it is a much more complex task, and one I won’t go into here
because it&nbsp;would most likely&nbsp;bore you.</p>
<p>If we take it as read, then, that the search engine now has semantic
functionality, how does that enable it to refine its search
capability?</p>
<ul>
  <li>It can automatically place pages into dynamic categories, or tag them
    without human intervention. Knowing what topic a page relates to is
    invaluable for returning relevant results.</li>
  <li>It can offer related topics and keywords to help you narrow your search
    successfully. With a keyword like <em>sport</em> the engine
    would offer you a list of sports, and perhaps sports-related&nbsp;news
    and&nbsp;blogs as well.</li>
  <li>Instead of offering you the related keywords, the engine can directly
    incorporate them back into the search with less weight than the
    user-entered ones. It’s still contested whether this will produce better
    results or just more varied ones.
  </li>
  <li>If the engine uses statistical analysis to retrieve its semantic
    matches to&nbsp;a keyword (as Google is likely to do), then it’s likely that
    keywords currently associated with hot news topics will bring those in as
    well. For example, using my engine to search for the keyword <em>police</em> brought
    up <em>peerages</em> (relating to the UK’s recent cash for honors scandal).</li>
</ul>
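<p>As a rough illustration of the last two points, here is a minimal Python
sketch of how an engine might derive related keywords statistically and fold
them back into a query at reduced weight. It is only a toy under stated
assumptions: the four-document corpus, the function names
<code>related_keywords</code> and <code>expand_query</code>, and the 0.5 weight
are all hypothetical, and it makes no claim about how Google or any real engine
works.</p>

```python
from collections import Counter

# Hypothetical toy corpus: each document is reduced to a set of keywords.
DOCUMENTS = [
    {"police", "peerages", "scandal"},
    {"police", "peerages", "honors"},
    {"police", "crime", "report"},
    {"sport", "football", "news"},
]


def related_keywords(term, documents, top_n=3):
    """Rank other keywords by how many documents they share with `term`."""
    counts = Counter()
    for doc in documents:
        if term in doc:
            counts.update(doc - {term})
    # Sort by count (descending), then alphabetically so ties are stable.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:top_n]]


def expand_query(term, documents, related_weight=0.5):
    """Return a weighted query: the user's term at 1.0, related terms lower."""
    weights = {term: 1.0}
    for word in related_keywords(term, documents):
        weights[word] = related_weight
    return weights


print(related_keywords("police", DOCUMENTS))
print(expand_query("police", DOCUMENTS))
```

<p>In this toy corpus <em>peerages</em> ranks first for <em>police</em> simply
because the two words share the most documents, which is the hot news effect
described in the last bullet.</p>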
<p>So, according to me:</p>
<p><b>“A semantic search engine is a search engine that takes the <em>sense</em>
of a word as a factor in its ranking algorithm <em>or</em> offers the user a
choice as to the <em>sense</em> of a word or phrase.”</b></p>
<p>This is not in line with the purists of what is known as ‘The Semantic
Web’, who believe that for some reason we should spend all our time tagging
documents, pages and images to make them acceptable for a computer to read.
Well, I’m sorry but I’m not going to waste my time tagging when a computer
is able to derive context and do it for me. I may have offended Tim Berners-Lee
by saying this, but as the creator of the Web he should know better.</p>
<h2>How does Google match up?</h2>
<p>Until extremely recently, Google’s semantic technology (which they’ve had
now for quite a while) was limited to matching those AdSense blocks to your
website’s content. This is neat, and a good practical example of the
technology - but not relevant to their core search product. However, if you&nbsp;make
a single keyword search today, chances are you may spot a block like this
at the bottom of your results page:</p>
<p><img border="0" src="http://www.readwriteweb.com/images/semantics_citizen.jpg" width="463" height="82"></p>
<p>This is more or less exactly what I was just writing about. They’re
offering you alternatives based upon your initial search, which in this case was
obviously for <em>citizen</em>. Citizen is a bank, a watchmaker and (if I’m not
mistaken) it means you’re a member of a country or something. This is the first
clear example of Google employing a semantic engine that works by analyzing the
context of words in their index and returning likely matches for sense.
</p>
<p>Some of you may be wondering why they aren’t doing this for multiple
keyword phrases, which I can take a guess at from some of my own work. Analyzing
the context of a word statistically is intensive and slow; and if you try and
analyze two, you slow the process further and so on. It is likely they have
problems doing so for more than one keyword currently, and Google as ever is cautious
about changing their interface too radically too quickly. This
implementation of semantics gives hope that they haven’t adopted the purist
view of ‘The Semantic Web’ where everything is tagged and filed neatly into
nice little packages.</p>
<p>Google is all too aware of the following very large problems with that idea:</p>
<ul>
  <li>Users are stupid.</li>
  <li>Users are lazy.</li>
  <li>Redefining the way they’ve indexed what is assumed to be petabytes of
    data would require them to effectively start again.</li>
  <li>It’s not as powerful or dynamic.</li>
</ul>
<h2>How Google can utilize Semantic technologies</h2>
<p>It’s my belief that Google will increasingly tie this technology into their
core search experience as it improves in speed and reliability. It has some
phenomenally powerful uses and I’ve taken the liberty of laying out a few of
my suggestions on where they can go with this:</p>
<p>Self aware pages</p>
<ul>
  <li>Tagging pages with keywords has always been used on the internet to let
    search engines know what kind of content the page contains.</li>
  <li>Using a Google API we can generate the necessary keywords on the fly as
    the page loads. This cuts out a large amount of work for SEO.</li>
  <li>A Google API enabled engine wouldn’t even need to look at these
    keywords; it could generate them itself.</li>
  <li>It is not only pages that can be self aware these days; people tag everything,
    including links. The Google API could conceivably be used to tag every
    single word on a page, creating a page that covers every single keyword
    possibility. This is overkill - but a demonstration of the power available.</li>
</ul>
<p>Narrow Search</p>
<ul>
  <li>When you begin a search, you enter just one or two keywords in the topic
    you’re interested in.</li>
  <li>Related keywords appear, which you can then select from to target your
    search and remove any doubts about dual meanings of a word, for example.</li>
  <li>This step repeats every time you search. Also possible is <i>opinionated
    search.</i></li>
</ul>
<p>Opinionated Search</p>
<ul>
  <li>Because of the way Google statistically finds the senses of keywords from
    the mass of pages in its index, what in fact it finds is the majority
    opinion from those pages of what the sense of a word is.</li>
  <li>At the base level, you can select from the average opinion of related
    keywords and subjects from its entire index.</li>
  <li>You can find the opinion at other levels as well though, and this is where
    the power comes in, in terms of really targeting what the user is looking for
    quickly and efficiently. All the following mean that this is the first true
    example of social search:</li>
  <ul type="circle">
    <li>Find the opinion over a range of dates, good for current events, modern
      history, changes in trends.</li>
    <li>Find the opinion over areas of geography, or by domain extension (.co.uk,
      .com).</li>
    <li>Find the opinion over a certain group of websites, or just one website
      in particular - compare that with another site.</li>
    <li>Find the opinion not only over the above things but also subjects,
      topics, social and religious groups.</li>
    <li>At the most ridiculous example level, you could even find what topics 18
      year olds on MySpace living in Leeds most talk about - but that I could
      probably guess. The point is that this is targeting demographics on a
      really unprecedented level.</li>
  </ul>
  <li>Add the sites or web pages to your personal profile that you think most
    closely reflect your opinions; this data can then be taken into account in
    all future searches, returning greater personal relevancy.</li>
</ul>
<h2>Conclusion</h2>
<p>Google is using semantic technology, but is not yet a fully fledged semantic
search engine. It does not use NLP (Natural Language Processing), but this is not
a barrier to producing some truly web changing technology with a bit of thought
and originality. NLP may well be (I hate myself for writing this) web 4.0 and
semantics is web 3.0 - they are in fact different enough to be classified as
such in my eyes and the technology <a href="http://www.readwriteweb.com/archives/hakia_takes_on_google_semantic_search.php"> Hakia is developing</a> is certainly markedly
distinct from Google’s semantic efforts.</p>
<p>There are barriers that Google needs to overcome. Is it capable of becoming
fully semantic without modifying its index too drastically? Can Google
continue to keep the results simple and navigable for its varied user base? Most
importantly, does Google intend to become a fully semantic search engine and to do
so within a timescale that won’t damage their position and reputation? I like to think that
although the dragon is sleeping, that
doesn’t mean it’s not dreaming!</p>]]>
    </content>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30755</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30755" />
    <title>Comment from Hashim on 2007-03-26</title>
    <author>
        <name>Hashim</name>
        <uri>http://www.hiphop-blogs.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.hiphop-blogs.com">
        <![CDATA[<p>"Some of you may be wondering why they aren’t doing this for multiple keyword phrases, which I can take a guess at from some of my own work."</p>

<p>I've been seeing this for plenty of multi-word phrases. Search 'hip hop music' in Google. It gives you these refinements at the bottom:</p>

<p>music<br />
hip hop music history<br />
hip hop music online<br />
hip hop songs<br />
hip hop artists<br />
bet<br />
hip hop radio</p>]]>
    </content>
    <published>2007-03-26T21:06:17Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30756</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30756" />
    <title>Comment from Phill Midwinter on 2007-03-26</title>
    <author>
        <name>Phill Midwinter</name>
        <uri>http://www.surrch.eu</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.surrch.eu">
        <![CDATA[<p>You're right. I've been playing with this a little more today - not only does it work for one word searches but also for pre categorised phrases it seems... such as car insurance, or indeed hip hop music. I'll find out as much as I can but on first impression it's not an entirely automated process yet.</p>]]>
    </content>
    <published>2007-03-26T21:18:28Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30757</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30757" />
    <title>Comment from Phill Midwinter on 2007-03-26</title>
    <author>
        <name>Phill Midwinter</name>
        <uri>http://www.surrch.eu</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.surrch.eu">
        <![CDATA[<p>The real question here I was driving at is why it's not available on almost all searches as opposed to a select few.</p>]]>
    </content>
    <published>2007-03-26T21:20:00Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30758</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30758" />
    <title>Comment from Ali on 2007-03-26</title>
    <author>
        <name>Ali</name>
        <uri>http://www.everybodygoto.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.everybodygoto.com">
        <![CDATA[<p>Is this the same as AI search, perhaps?</p>]]>
    </content>
    <published>2007-03-26T21:34:42Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30759</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30759" />
    <title>Comment from Mark Johnson on 2007-03-26</title>
    <author>
        <name>Mark Johnson</name>
        <uri>http://www.philosophygeek.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.philosophygeek.com">
        <![CDATA[<p>When I relate "cat" and "dog", I don't go looking through all of the instances of "cat" and "dog" in my life and see how often they co-occur.  From your examples, that's all that Google is doing: they're looking for keywords that show up in documents frequently and then allow you to refine your query based on that.  That seems like a very weak form of semantics.  For example, it would be difficult to classify dogs and cats as animals using that method -- and it doesn't seem like Google can do that.  Also, you reference document classification (e.g. auto-tagging) and it's not clear that Google can do that either.</p>

<p>So how is Google a semantic engine?</p>]]>
    </content>
    <published>2007-03-26T21:50:43Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30760</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30760" />
    <title>Comment from Clyde Smith on 2007-03-26</title>
    <author>
        <name>Clyde Smith</name>
        <uri>http://www.prohiphop.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.prohiphop.com">
        <![CDATA[<p>bad:</p>

<p>"the purist view of ‘The Semantic Web’ where everything is tagged and filed neatly into nice little packages."</p>

<p>good:</p>

<p>"The Google API could conceivably be used to tag every single word on a page, creating a page that covers every single keyword possibility."</p>

<p>So, I take it you have problems with the nice little packages?  Doesn't every categorization system including Google's ultimately have similar problems?</p>

<p>In any case, I have trouble seeing ways that having a coherent, consistent system for tagging would be a bad thing since that does not block other systems for doing things.</p>

<p>On a related matter, you're right to say bad things about yourself for using the term Web 4.0.  I have yet to encounter a serious usage of Web 3.0 or any other such variation that spoke to a shift of the depth and complexity of Web 2.0.</p>

<p>Not to sound like a religious nut or anything!</p>]]>
    </content>
    <published>2007-03-26T21:56:26Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30761</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30761" />
    <title>Comment from Ryan Fujiu on 2007-03-26</title>
    <author>
        <name>Ryan Fujiu</name>
        <uri>http://www.pronetadvertising.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.pronetadvertising.com">
        <![CDATA[<p>I am a smarter person now that I've read this post.</p>]]>
    </content>
    <published>2007-03-26T22:02:26Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30762</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30762" />
    <title>Comment from Ken Ewell on 2007-03-26</title>
    <author>
        <name>Ken Ewell</name>
        <uri>http://www.readware.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.readware.com">
        <![CDATA[<p>That was a great piece Phil.  </p>

<p>I agree with your insight on Google and your generalization of semantics as sense information.  I am not very impressed with Hakia as there is too much empty jargon in their explanations of what they are doing to make any kind of evaluation. I also like your definition of semantics as well.</p>

<p>As you have implied, word sense can be explicated. Yet if you are someone who asserts that meaning is personal and takes place in the mind, that essentially means that meaning is ignored for you when using today's technology.</p>

<p>Proclaim that the semantic web is about making sense of what words mean to individual browsers of resources on the Internet, and almost every computer professional involved in the semantic web would say that is not what the semantic web is about. This is because one of the tenets of the symbol grounding problem in computing suggests a fundamental decision be taken to neglect meaning and concepts of the mind altogether.  It will take another fundamental decision to change the status quo.</p>

<p>You probably would agree, Phil, that mind does not register or record every word imagined and do statistics on them. It works with much smaller sets of cognitivistic concepts that such large numbers of words attempt to represent in some ways. For this reason, I do not think Google, or any other search engine using keyword based indexes and some NLP or AI, could achieve an opinion search as you have outlined.  Readware software can.</p>

<p>Readware is a mature semantic framework that delivers and utilizes sense information from words. It can also 'discern' concepts and topics and categories from the discourse on web pages read by the analyst server.  While Readware was not listed among the alternate search engines listed by Read/Write articles, it may be the only search engine that realizes how to seek, identify and compute semantic relevance from the words used in discourse and context.</p>

<p>While many researchers and developers may cringe at the prospect that the mind is where meaning is found, we can show that it is possible to use cognitive objects and a conceptual model to identify sense information and precisely weigh and measure relevance among variable word forms and grammars in various forms of text and discourse.</p>]]>
    </content>
    <published>2007-03-26T22:07:02Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30763</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30763" />
    <title>Comment from Jochem Prins on 2007-03-26</title>
    <author>
        <name>Jochem Prins</name>
        <uri>http://www.jochemprins.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.jochemprins.com">
        <![CDATA[<p>I completely agree with Mark Johnson. The only thing Google is doing at the moment is using statistical information to help people specify or narrow down their search queries. In my opinion, this has little to do with semantics.</p>

<p>To use your "citizen" example: if Google would ask me if I am looking for the financial institution "Citizen bank" or for the crystal manufacturer "Citizen crystals", then I would call this (the first step of) semantic search.</p>

<p>By the way, I don't think that manual tagging / annotation is necessary for the semantic web to become reality. A lot of work has already been done in order to perform automatic annotation on webpages. Performance is getting better and better and I think it will be a matter of years before the big search engines start annotating webpages using (semantic) ontologies.</p>]]>
    </content>
    <published>2007-03-26T22:09:06Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30764</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30764" />
    <title>Comment from Mark on 2007-03-26</title>
    <author>
        <name>Mark</name>
        <uri>http://www.theatomicweb.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.theatomicweb.com">
        <![CDATA[<p>Aren't we already tagging the web?  </p>

<p>From my site:  </p>

<p>I'm reading about the semantic web and ontologies in last week's Economist (Tim Berners-Lee) and a question arose in my head.  Aren't we already creating this?  Aren't del.icio.us and technorati examples of tools that allow for the emergence of an organized structure of information the same way a mass brain would?  Doesn't the tagging going on today by millions of users ultimately allow for and supply the underpinnings of the pattern recognition algorithms our brain uses every day?  The principles described in Jeff Hawkins's book "On Intelligence" have direct implications to this and may very well redefine how we get information from the web both personally and on a machine to machine basis.</p>

<p>see: www.theatomicweb.com</p>]]>
    </content>
    <published>2007-03-26T22:54:45Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30765</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30765" />
    <title>Comment from Paul Jensen on 2007-03-26</title>
    <author>
        <name>Paul Jensen</name>
        <uri></uri>
    </author>
    <content type="html" xml:lang="en" xml:base="">
        <![CDATA[<p>Check out Google Sets, a product under Labs. You'll be surprised by some of the results you get from it.</p>]]>
    </content>
    <published>2007-03-26T22:54:53Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30766</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30766" />
    <title>Comment from Adrian Keys on 2007-03-26</title>
    <author>
        <name>Adrian Keys</name>
        <uri>http://www.jollyjo.org</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.jollyjo.org">
        <![CDATA[<p>Even if Google had jumped ahead and implemented semantic search technology...I am not sure how effective this would be. As far as I have read...effective semantic search will need websites that have utilised semantic technology.</p>

<p>I don't see semantic search becoming a reality until the basic infrastructure of the web is changed.</p>]]>
    </content>
    <published>2007-03-26T23:06:42Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30767</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30767" />
    <title>Comment from Terry Steichen on 2007-03-26</title>
    <author>
        <name>Terry Steichen</name>
        <uri></uri>
    </author>
    <content type="html" xml:lang="en" xml:base="">
        <![CDATA[<p>Phil,</p>

<p>I think that another way of describing the semantic engine is a tool that allows you to search at the topic level.  Once you have the topic of interest, then drill down to the specific document.</p>

<p>Just how you categorize documents by topic is the big unknown, but that's the basic concept as I see it.</p>

<p>Make sense?</p>]]>
    </content>
    <published>2007-03-27T02:06:58Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30768</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30768" />
    <title>Comment from Phil Butler on 2007-03-26</title>
    <author>
        <name>Phil Butler</name>
        <uri>http://www.profy.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.profy.com">
        <![CDATA[<p>Well I like drumming up articles about worthless search engines designed by rocket scientists so that narrow minded Google worshipers have something to negate :) </p>

<p>Some of these engines are quite excellent, and compared to LORD GRAND POOPA of GOOGLE, possibly even more effective. <br />
I shudder to think how much naysaying we would have if we were talking about inventing something that could stream text and pictures through little wires over thousands of miles at the speed of light! </p>

<p>Is there anyone out there that was not made to feel superior or cool because they had the distinction of being the first in their neighborhood to type Google? </p>

<p>OMFG. I am so amazed that millions of people can talk, inspect, research and finger manipulate a new cell phone ring tone or widget so insignificant it will be worthless in 3 months, yet people attempt to create something astounding and some geeks just want their little cubicle to stay safe. </p>

<p>Take some time, email the people involved, get them to explain what they are doing and perhaps understand that anything is possible still. Go to hakia or Powerset and talk with the people, perhaps they can delineate themselves from delicio lmao.</p>

<p>BTW the basic infrastructure of the web is still the page, which typically has words on it. So, if one can make several data sets from those combinations of words and use semantics to return less than the "billions" of permutations that Google's algorithms produce, then it is possible to do a much more effective and even faster search. <br />
 <br />
Always, Phil</p>]]>
    </content>
    <published>2007-03-27T03:36:11Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30769</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30769" />
    <title>Comment from Acronyms on 2007-03-26</title>
    <author>
        <name>Acronyms</name>
        <uri>http://www.all-acronyms.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.all-acronyms.com">
        <![CDATA[<p>Are they trying to be one? Yes. Are they? Not at all.</p>]]>
    </content>
    <published>2007-03-27T04:58:16Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30770</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30770" />
    <title>Comment from Phill Midwinter on 2007-03-27</title>
    <author>
        <name>Phill Midwinter</name>
        <uri>http://www.surrch.eu</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.surrch.eu">
        <![CDATA[<p>"As far as I have read...effective semantic search will need websites that have utilised semantic technology.</p>

<p>I don't see a semantic search a reality until the basic infrastructure of the web is changed."</p>

<p>Yes, this is exactly what the evangelists tell us. However, it's not necessary! The basic structure of the web is not going to change and that's a good thing.</p>

<p>"Aren't we already tagging the web?"</p>

<p>Yes we are, del.icio.us is a good example - but ultimately, everything delicious does could be done by a machine. I've successfully programmed applications that will automatically tag in the same way Google suggests alternatives.</p>

<p>"You probably would agree, Phil, that mind does not register or record every word imagined and do statistics on them. It works with much smaller sets of cognitivistic concepts that such large numbers of words attempt to represent in some ways. For this reason, I do not think Google, or any other search engine using keyword based indexes and some NLP or AI, could achieve an opinion search as you have outlined. Readware software can."</p>

<p>I very much like this comment, and a few of you have said similar things. In fact, this is a psychological argument, and when I was programming my first semantic tools it was something I grappled with a lot. I came to the conclusion that the human brain, in say a young child just learning to talk, begins to derive senses from words in two basic ways.</p>

<p>- Association<br />
- Questioning</p>

<p>Questioning is of course... asking what a word means or what an object is called.</p>

<p>Association is the statistical part: when a child frequently hears the words cat and kitten together, then, combined with questioning, they build their own semantic ontology, which changes and is redefined with age. This is a very important reason for programmers NOT to use restrictive, predefined ontologies that need them to 'restructure the web' to 'solve' the problem. Instead they need to rely more heavily on association, along with the limited questioning (through search queries) they receive.</p>]]>
    </content>
    <published>2007-03-27T08:01:20Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30771</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30771" />
    <title>Comment from Alex on 2007-03-27</title>
    <author>
        <name>Alex</name>
        <uri></uri>
    </author>
    <content type="html" xml:lang="en" xml:base="">
        <![CDATA[<p>They cannot be "doing this for multiple keyword phrases", because multiple words ARE semantics. <br />
If two words stand together and make some sense, that means they have proper relationships, and relationships between words are semantics; this is one of the two core principles of semantics.<br />
So, my idea is that they simply haven't overcome this step yet. Because it's like "semantics inside semantics", which, for now at least, is probably too much for them. ))</p>]]>
    </content>
    <published>2007-03-27T08:02:58Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30772</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30772" />
    <title>Comment from Phill Midwinter on 2007-03-27</title>
    <author>
        <name>Phill Midwinter</name>
        <uri>http://www.surrch.eu</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.surrch.eu">
        <![CDATA[<p>Yes, there's a snag there, you're right. There are a number of ways to do it, but which is best?</p>

<p>You could run the same semantic search on each keyword.</p>

<p>You could run the semantic search over the keywords as a phrase.</p>

<p>You could do both and correlate results from each.</p>

<p>I know how mine works, but I couldn't tell you if it was 'right' because I don't exactly know how we as human beings do it yet.</p>]]>
    </content>
    <published>2007-03-27T08:24:00Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30773</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30773" />
    <title>Comment from Alex on 2007-03-27</title>
    <author>
        <name>Alex</name>
        <uri></uri>
    </author>
    <content type="html" xml:lang="en" xml:base="">
        <![CDATA[<p>Well, linguistics as a science explains that as well. <br />
I believe the approach they're "confessing" is wrong somewhere deep, much deeper than just selecting from those three options. <br />
It's like when you're drowning and you think about what to choose to save yourself: to shout for help, to learn how to swim, or to drown.. When the right answer is "you shouldn't have drunk that much")))) Sorry, that may be a very strange example, but it was the first to occur to me)) to explain that maybe they missed something at an earlier stage. <br />
Anyway, I don't think they'll become totally semantic. Just because today it's semantic, tomorro...??? <br />
The most they could do is make a whole new "googled" search engine, but does that make any sense?</p>]]>
    </content>
    <published>2007-03-27T08:37:55Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30774</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30774" />
    <title>Comment from Rob Scherer on 2007-03-27</title>
    <author>
        <name>Rob Scherer</name>
        <uri>http://robscherer.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://robscherer.com">
        <![CDATA[<p>You lost me at:</p>

<p>"...tagging documents, pages and images to make them acceptable for a computer to read. Well, I'm sorry but I'm not going to waste my time tagging when a computer is able to derive context and do it for me."</p>

<p>I'm sorry, but documents must be marked up appropriately so that they can be consumed by a spider in a manner in which the spider is able to garner some information about the content of the document. The Semantic Web has absolutely nothing to do with tagging in the sense of the word that you have used.</p>]]>
    </content>
    <published>2007-03-27T11:10:35Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30775</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30775" />
    <title>Comment from Phill Midwinter on 2007-03-27</title>
    <author>
        <name>Phill Midwinter</name>
        <uri>http://www.surrch.eu</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.surrch.eu">
        <![CDATA[<p>Documents don't need to be marked up appropriately, though it does help if they're written in good English, with varied vocabulary, etc.</p>

<p>I program web spiders and search engines for a living and, at the risk of offending purists such as you, it's been done by many others besides myself without the need to tag everything in sight.</p>

<p>I think I made the point quite clearly that 'The Semantic Web' has nothing to do with tagging in the sense of the word that I have used. I don't think it should have anything to do with it because it's a useless idea.</p>]]>
    </content>
    <published>2007-03-27T11:18:01Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30776</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30776" />
    <title>Comment from Alex on 2007-03-27</title>
    <author>
        <name>Alex</name>
        <uri></uri>
    </author>
    <content type="html" xml:lang="en" xml:base="">
        <![CDATA[<p>Phill, I think it IS quite possible to do it without tags (though I think that tags are not the most senseless option))), but I really cannot understand: is there any place in your article for Latent Semantic Indexing? Or is it implied "by default"? Is it used at all or not? It seems that some points in your article are quite deeply intertwined with LSI.<br />
It would be a pleasure to hear your opinion on that, as I seem to be a bit lost. <br />
Thanks, Alexandra.</p>]]>
    </content>
    <published>2007-03-27T15:36:21Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30777</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30777" />
    <title>Comment from Phill Midwinter on 2007-03-27</title>
    <author>
        <name>Phill Midwinter</name>
        <uri>http://www.surrch.eu</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.surrch.eu">
        <![CDATA[<p>What I'm saying is that this is latent semantic indexing  (some argue that Google already incorporate it into their algorithm) or that it is the most likely method Google are using to do this. It's also the method I specialise in and advocate strongly as opposed to NLP or 'The Semantic Web' of course :)</p>

<p>Taking what we know from early Google white papers and other research, when Google spiders the web it breaks a page down into 'barrels', which are essentially just a collection of all the words on the page.</p>

<p>If you then take these barrels and see how frequently words occur together on a page over, say, 1000 pages, you build up a statistical picture of words that are likely to be related. Of course, the more pages you do this across, the higher the statistical probability is that they actually are related.</p>

<p>If you also store the position in the page at which each word occurs, you can factor in the distance between the words. </p>

<p>You can use this to speed up your algorithm substantially, because effectively your database select statement only has to pull in, say, the four words before and after the one you're looking for semantic links to, i.e. the words occurring within sentence range of your keyword on a page.</p>

<p>I think I'm the first to relate that this in fact is generating a mass average opinion of a keyword's context and that this is itself something that could be used to show different context over subgroups of society, or even through dates.</p>]]>
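        <![CDATA[<p>A minimal sketch of the windowed co-occurrence counting described above, under stated assumptions: a 'barrel' is treated as nothing more than the ordered list of words on a page, the window is the "say four words" mentioned above, and the names <code>tokenize</code>, <code>cooccurrence_counts</code> and <code>related_words</code> are hypothetical. It only counts raw co-occurrences; it does not perform the singular value decomposition that latent semantic indexing in the strict sense relies on, and it is not a description of how Google actually does it.</p>
<pre>
```python
from collections import Counter
import re


def tokenize(text):
    """Lower-case a page and split it into words (a crude stand-in for a 'barrel')."""
    return re.findall(r"[a-z']+", text.lower())


def cooccurrence_counts(pages, window=4):
    """Count how often two different words occur within `window` positions of each other."""
    counts = Counter()
    for page in pages:
        words = tokenize(page)
        for i, word in enumerate(words):
            # Looking only forward counts each nearby pair once per occurrence.
            for other in words[i + 1 : i + 1 + window]:
                if other != word:
                    counts[frozenset((word, other))] += 1
    return counts


def related_words(counts, keyword, top=3):
    """Return the words that most often occur near `keyword`, most frequent first."""
    scores = Counter()
    for pair, n in counts.items():
        if keyword in pair:
            (other,) = pair - {keyword}
            scores[other] += n
    return [word for word, _ in scores.most_common(top)]
```
</pre>
<p>Called on a list of page texts, <code>related_words(cooccurrence_counts(pages), "cat")</code> returns the words seen most often near "cat". The more pages that go in, the less any single page's quirks affect the ranking.</p>]]>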
    </content>
    <published>2007-03-27T16:02:12Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30778</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30778" />
    <title>Comment from Ken Ewell on 2007-03-27</title>
    <author>
        <name>Ken Ewell</name>
        <uri>http://www.readware.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.readware.com">
        <![CDATA[<p>"If you then take these barrels and see how frequently words occur together on a page over say a 1000 pages, you build up a statistical picture of words that are likely to be related."</p>

<p>I don't think you were very clear in that last post Phil.  Word statistics across these barrels may uncover co-occurrences (i.e. this is where a semantic relation exists) but little else.  They do not tell you how those co-occurrences fit into the rest of the semiotics of the individual and his or her world: culture, society or clique, i.e. their 'worldview'.</p>

<p>Say, for example, you want to use your search engine to explicate your query for 'a good car' well-enough that it can include documents about 'high performance automobiles' and those that say 'clean, one-owner vehicles', 'a perfect Ford for...' , 'a hot Chevy with low mileage', and documents with relative idioms I don't care to apprehend-- any methods  depending on co-occurrence, noun or verb phrases, etc., LSI-wise or otherwise will fail and need to be re-trained for every case.  Am I right or wrong?</p>]]>
    </content>
    <published>2007-03-27T23:21:08Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30779</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30779" />
    <title>Comment from Ken Ewell on 2007-03-27</title>
    <author>
        <name>Ken Ewell</name>
        <uri>http://www.readware.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.readware.com">
        <![CDATA[<p>"If you then take these barrels and see how frequently words occur together on a page over say a 1000 pages, you build up a statistical picture of words that are likely to be related."</p>

<p>I don't think you were very clear in that last post Phil.  Word statistics across these barrels may uncover co-occurrences (i.e. this is where a semantic relation exists) but little else.  They do not tell you how those co-occurrences fit into the rest of the semiotics of the individual and his or her world: culture, society or clique, i.e. their 'worldview'.</p>

<p>Say, for example, you want to use your search engine to explicate your query for 'a good car' well-enough that it can retrieve pages/resources about 'high performance automobiles' and those that say 'clean, one-owner vehicles', 'a perfect ford for...' , 'a hot Chevy with low mileage', and pages with many of the relative idioms I don't care to apprehend-- any methods depending on co-occurrence, noun or verb phrases, etc., LSI-wise or otherwise, will fail and need to be re-trained or re-indexed for every case. </p>

<p>Developers should also remember that meaning is built up from smaller elements.  There are document semantics and sentence semantics, and there must also be semantics at all levels of composition.  Consider how the prefix /re/ changes the meaning of the words above.</p>

<p>Now there are those that will argue that linguists know all about that, and of course they do.  What they do not do is consider or model how the addition of the prefix links to human perception.  Neither do search engines. </p>

<p>While search engines do a great amount of indexing, they do not know what it means to index let alone what the significance or consequence (cause or effect) may be of having to reindex.  I'll bet Google engineers do.</p>]]>
    </content>
    <published>2007-03-27T23:37:52Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30780</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30780" />
    <title>Comment from Phill Midwinter on 2007-03-28</title>
    <author>
        <name>Phill Midwinter</name>
        <uri>http://www.surrch.eu</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.surrch.eu">
        <![CDATA[<p>I think you're over-analysing. In the single case of one web page you are completely correct, but when you take the mean over data sets as huge as those available to a search engine, these things can be virtually ignored.</p>]]>
    </content>
    <published>2007-03-28T08:33:06Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30781</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30781" />
    <title>Comment from Ken Ewell on 2007-03-28</title>
    <author>
        <name>Ken Ewell</name>
        <uri>http://www.readware.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.readware.com">
        <![CDATA[<p>Isn't that the problem--sensibility is ignored?   Isn't that what people are expecting from a semantic search engine: sensibility in the results?</p>

<p>It is not too much analysis; we do it in milliseconds while indexing. The problem with non-semantic search engines is that they do not register or index or link the semantics in phrases such as "a reliable car" and "this Chevy is dependable transportation" for example.  Not in one or many pages...</p>

<p>In this example, reliable and dependable refer to the same concept. A semantic web product using WordNet should be able to discern that much.  In addition, there is a pragmatic relation between car and Chevy that any search engine claiming to be semantic should be capable of capturing.</p>

<p>The difficulty is that there are millions, zillions, of words and phrases.  They are not distilled or filtered by finding their mean in actuality.  They are distilled into topics and filtered into categories using a keen sense of semantics to recognize the essence of the message from the cultural code of the language.</p>]]>
    </content>
    <published>2007-03-28T15:02:53Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30782</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30782" />
    <title>Comment from Ken Ewell on 2007-03-28</title>
    <author>
        <name>Ken Ewell</name>
        <uri>http://www.readware.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.readware.com">
        <![CDATA[<p>Isn't that the problem--sensibility is ignored?   Isn't that what people are expecting from a semantic search engine: sensibility in the results?</p>

<p>It is not too much analysis; we do it in milliseconds while indexing. The problem with non-semantic search engines is that they do not register or index or link the semantics in phrases such as "a reliable car" and "this Volvo is dependable transportation" for example.  Not in one or many pages...</p>

<p>In this example, reliable and dependable refer to the same concept. A semantic web product using WordNet should be able to discern that much.  In addition, there is a pragmatic relation between car and Chevy that any search engine claiming to be semantic should be capable of capturing.  Tagging, btw, has nothing whatsoever to do with any of this.  Tagging is purely a data-processing crutch (that helps a little).</p>

<p>Think of the name Chevy as a tag, a socio-cultural tag, for a car.  Names are tags.  Names and tags are arbitrary.  Individually indexing each and every occurrence or averaging all of them for their mean does what-- is useful for something? Finding phrases like those above by discerning the nature and import of the relations between specific names and other specific tags (in context) is a semantic process useful for discovery.  Discovery is on the weak side of search engines</p>

<p>The difficulty is that there are millions, zillions, of words and phrases.  They are not distilled or filtered by finding their mean, most frequent or least frequent, in actuality.  To my mind, all that is read is distilled into topics and filtered into categories using a keen sense of semantics to recognize the essence of the message from the cultural code of the language. </p>

<p>Without going too far here, let me just say that it must be true that individuals may have different sensibilities, different modes of expression, different views, affinities, biases and beliefs.  Each individual can have their own special way of interpreting what is going on while thoughts and perceptions race through their minds. Yet every individual has the same perceptual apparatus. </p>

<p>It is also true that shared (interpersonal) perceptions do not present themselves differently to each culture or society, let alone each individual.  I mean that every individual in the world is subject to the same existential affairs.  Unlike the (imperfect, changing) language, the semantics of such affairs must be universal, so that each individual may interpret them as they will.</p>]]>
    </content>
    <published>2007-03-28T16:03:22Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30783</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30783" />
    <title>Comment from Ken Ewell on 2007-03-28</title>
    <author>
        <name>Ken Ewell</name>
        <uri>http://www.readware.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.readware.com">
        <![CDATA[<p>Ooops, sorry for the double post. I thought I had canceled the post #27 from sending, sorry.</p>]]>
    </content>
    <published>2007-03-28T16:07:17Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30784</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30784" />
    <title>Comment from Phill Midwinter on 2007-03-28</title>
    <author>
        <name>Phill Midwinter</name>
        <uri>http://www.surrch.eu</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.surrch.eu">
        <![CDATA[<p>Nothing needs to be filtered.</p>

<p>This is statistics: the more times you perform an operation, the higher the probability that the answer is correct.</p>

<p>It doesn't matter that people have imperfect language, or individuals have different semantics.</p>]]>
    </content>
    <published>2007-03-28T16:22:46Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30785</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30785" />
    <title>Comment from CoStas on 2007-03-29</title>
    <author>
        <name>CoStas</name>
        <uri>http://www.vimeo.com/clip:160321</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.vimeo.com/clip:160321">
        <![CDATA[<p>Let's compare the query "iPhone" on Google and Quintura (a semantic SE) and see who shows a more comprehensive context so far. Only when Google understands that the iPhone logo belongs not only to Cisco or Apple, will it be a semantic search engine.</p>]]>
    </content>
    <published>2007-03-29T10:47:53Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30787</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30787" />
    <title>Comment from Phil Butler on 2007-03-30</title>
    <author>
        <name>Phil Butler</name>
        <uri>http://www.profy.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.profy.com">
        <![CDATA[<p>I did not notice if anyone had been to the hakia labs site, so in case I missed it please forgive me. Here is the link for those of you interested:</p>

<p><a href="http://labs.hakia.com/" rel="nofollow">http://labs.hakia.com/</a></p>

<p>I spoke with Dr. Berkan about some more graphic representations and he told me he would get me some video and other information to supplement my pea brain lol. This is exciting stuff :) </p>

<p>Phil</p>]]>
    </content>
    <published>2007-03-31T05:13:28Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30788</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30788" />
    <title>Comment from Enrique Torrejon on 2007-04-04</title>
    <author>
        <name>Enrique Torrejon</name>
        <uri>http://www.bitext.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.bitext.com">
        <![CDATA[<p><br />
If you want to check how NLP can improve current search engines when analyzing users' queries, check out our demo of NaturalFinder integrated with MSN Search for English:</p>

<p><a href="http://demos.bitext.com/MSNen" rel="nofollow">http://demos.bitext.com/MSNen</a>    <br />
user: readwriteweb<br />
pw: bitext</p>

<p>Bitext's technology can be easily integrated with any search engine like Google Search Appliance, Autonomy, dtSearch, Lucene, etc.   </p>

<p>For more info, check out www.bitext.com</p>]]>
    </content>
    <published>2007-04-04T11:21:55Z</published>
  </entry>

  <entry>
    <id>tag:72.47.210.69,2007://1.3641-comment:30789</id>
    <thr:in-reply-to ref="tag:72.47.210.69,2007://1.3641" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/is_google_a_semantic_search_engine.php#c30789" />
    <title>Comment from Tim Wintle on 2007-04-05</title>
    <author>
        <name>Tim Wintle</name>
        <uri>http://www.timwintle.co.uk</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.timwintle.co.uk">
        <![CDATA[<p>I think that many people here are misunderstanding the definition of semantics, or, more fundamentally, of a language:</p>

<p>Mark Johnson - <br />
"It would be difficult, to classify dogs and cats as animals using that method - and it doesn't seem like Google can do that."<br />
No, Google may not be able to explicitly decide that they belong to "animals" as such; however, Google can realise that they belong to a group that relates to what we would call "animals" (see Google Sets).</p>

<p>"Also, you reference document classification (e.g. auto-tagging) and it's not clear that Google can do that either."<br />
Google wouldn't be great at auto-tagging documents with tags that were English words, as the information about the page does not measure the meaning of the page using English words as semantics. It does "auto-tag" the document with semantics, and it is these semantics that are used in search.</p>

<p>"That seems like a very weak form of semantics." <br />
Actually, compared to the raw semantics of language, this does a remarkably good job. This is because the layout of the text is taken into consideration, which effectively creates new semantics.</p>

<p>Phill Midwinter:<br />
I think that the main problem you have seen is a problem with linguistics, not with the semantic technology behind it.<br />
The concept space is (it can be proved analytically, assuming it uses an SVD) as good an approximation as possible. Likewise, we are only searching for the overall concept of a document, so the concept vector must be a very good representation of the document. The problem is in the search query, and in the wide distribution of how people use a language.</p>

<p>This is clearly what Google's personalised search is going to try to do: determine how you use language, in order to build a map into the concept space that is more characteristic of your use of the language than the average map. Eventually they will probably be able to do this for the map from a website into the concept space too.</p>

<p>It's been good to come across you all, Phill - phillmidwinter.wordpress.com has gone in my RSS feeds, and I am sure I will be commenting on your blog as I am working on my own semantic search algorithms.</p>

<p>If you want to read my blog, click the link on my name.</p>]]>
    </content>
    <published>2007-04-05T18:14:50Z</published>
  </entry>

</feed>