<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" 
      xmlns:thr="http://purl.org/syndication/thread/1.0">
  <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/uclassify_create_your_own_text_classifiers.php" />
  <link rel="self" type="application/atom+xml" href="http://www.readwriteweb.com/atom.xml" />
  <id>tag:,2009:/1/tag:www.readwriteweb.com,2008://1.12852-</id>
  <updated>2009-10-30T13:11:32Z</updated>
  <title>Comments for uClassify: Create Your Own Text Classifiers</title>
  
  <generator uri="http://www.sixapart.com/movabletype/">Movable Type 4.23-en</generator>
  <entry>
    <id>tag:www.readwriteweb.com,2008://1.12852</id>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/uclassify_create_your_own_text_classifiers.php" />
    <link rel="service.edit" type="application/atom+xml" href="http://www.readwriteweb.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=1/entry_id=12852" title="uClassify: Create Your Own Text Classifiers" />
    <published>2008-12-07T22:42:12Z</published>
    <updated>2008-12-07T23:06:25Z</updated>
    <title>uClassify: Create Your Own Text Classifiers</title>
    <summary>Ever wanted to know the language of a Web site? Or whether the text within it is considered spam? Well, it&apos;s a lot easier since the launch of uClassify, the free Web service and API out of Sweden that lets you create and train your own text classifiers. According to uClassify&apos;s about page, a text...</summary>
    <author>
      <name>Lidija Davis</name>
      
    </author>
    
    <category term="Products" />
    
    <content type="html" xml:lang="en" xml:base="http://www.readwriteweb.com/">
      <![CDATA[<p><img alt="uclassify_dec_08.jpg" src="http://www.readwriteweb.com/uclassify_dec_08.jpg" width="189" height="57" />Ever wanted to know the language of a Web site?  Or whether the text within it is considered spam?  Well, it's a lot easier since the launch of <a href="http://www.uclassify.com">uClassify</a>, the free Web service and <a href="http://www.uclassify.com/ApiDocumentation.aspx">API</a> out of Sweden that lets you create and train your own <a href="http://en.wikipedia.org/wiki/Document_classification">text classifiers</a>.  </p>

<p>According to uClassify's about page, a text classifier answers the question: "To which predefined category is this text most likely to belong?" Text classifiers can be used to create spam filters, categorize Web pages, detect languages, classify a batch of blog posts, and more. </p>]]>
      <![CDATA[<h2>How uClassify Works</h2>

<p>While there are different types of classifiers, uClassify is a machine learning classifier meaning you need to train it before it can start to classify documents.</p>

<p>Training it is simple enough.  You manually set up two or more classes, for instance spam and legitimate, and then manually attach documents (known as the training corpus) to the class they belong to.  This supervised training helps the classifier understand the characteristics of various classes.</p>

<p>Once trained, the classifier will determine which of the predefined classes a previously unseen document is most likely to belong to and return a percentage based answer.  While you can continue training uClassify once you start classifying, the longer you spend training uClassify, the more accurate the results will be.  </p>

<p>uClassify now has a "click 'n' classify" GUI so you don't need programming skills to create classifers.  All you need to do is create an account, log in and it will walk you through the three step process.</p>

<p><img alt="train_1_dec_08.jpg" src="http://www.readwriteweb.com/train_1_dec_08.jpg" width="589" height="306" /></p>

<p><em>Image: Copy and paste text or enter a URL to train uClassify</em></p>

<p>To date, users have created over 200 classifiers.  Three fun sites include:</p>

<p><a href="http://www.typealyzer.com/en/about?lang=en">Typealyzer</a>: Classifies blog personality using a psychological text analysis. </p>

<p><a href="http://genderanalyzer.com">Genderanalyzer</a>: Decides if a page is written by a man or woman </p>

<p><a href="http://www.ofaust.com">oFaust</a>: Determines which classical author your text is most like</p>

<p>Started by Jon Kagstrom in 2004 uClassify was formed to share classifier technology with the masses.</p>]]>
    </content>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2008://1.12852-comment:119191</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2008://1.12852" type="text/html" href="http://www.readwriteweb.com/archives/uclassify_create_your_own_text_classifiers.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/uclassify_create_your_own_text_classifiers.php#c119191" />
    <title>Comment from Hüseyin Erkmen on 2008-12-07</title>
    <author>
        <name>Hüseyin Erkmen</name>
        <uri>http://www.iamlittle.net</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.iamlittle.net">
        <![CDATA[<p>Typealyzer really useful but sometimes give bugs ı dont understand.<br />
www.iamlittle.net</p>]]>
    </content>
    <published>2008-12-07T23:22:43Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2008://1.12852-comment:119198</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2008://1.12852" type="text/html" href="http://www.readwriteweb.com/archives/uclassify_create_your_own_text_classifiers.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/uclassify_create_your_own_text_classifiers.php#c119198" />
    <title>Comment from Mr M on 2008-12-07</title>
    <author>
        <name>Mr M</name>
        <uri>http://49things.blogspot.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://49things.blogspot.com">
        <![CDATA[<p>I tried various uclassify classifiers on 10 really big blogs. The results where pretty impressive and this could very well be a very handy tool for classifying social reputation on the web.</p>]]>
    </content>
    <published>2008-12-08T00:31:07Z</published>
  </entry>

</feed>