audio search - ReadWriteWeb http://www.readwriteweb.com/feeds/tag/audio search en Copyright 2009 Richard MacManus readwriteweb@gmail.com Sun, 22 Nov 2009 08:05:49 -0800 http://www.sixapart.com/movabletype/?v=4.23-en http://blogs.law.harvard.edu/tech/rss Audio Indexing: EveryZing's New Video Player Points Towards the Future of Online Video everyzing_logo.jpgWhile online video is a great tool, automatically indexing these videos and making them searchable is still a very hard task. EveryZing, a Massachusetts-based company that focuses on media indexing through speech-to-text and natural language processing, released a new version of its media player today that allows users to search online videos on third-party sites for keywords and tags. Currently, however, this technology is only available to EveryZing's corporate partners, with the Dallas Cowboys being the first to make use of these new capabilities.

]]>Sponsor

]]> While testing the new player, we came away thoroughly impressed with the accuracy of the transcripts and the ease of use of the interface.

MetaPlayer

As EveryZing's CEO Tom Wilde explained to us, EveryZing integrates the 'chromeless' players (which strip away the user interface and only deliver the video itself) that YouTube and many of its rivals make available, but which are not being used very often by third-party developers yet.

EveryZing's clients choose which videos they want to appear in the MetaPlayer on their sites.

The player itself is very nifty. Not only does EveryZing automatically tag every post, but it also makes every video searchable. There is also a strong SEO aspect to this new player, as EveryZing provides its partners with a full text transcript of the videos, which search engines can then index.

everyzing_player.jpg

Sharing

EveryZing's MetaPlayer features most of the standard sharing options you would expect from a modern video player, including an easy way to embed clips from longer videos.

Comments are the only area where the MetaPlayer falls a bit behind its competition, as other companies like Viddler already allow their users to leave comments which then appear in the video itself, while the MetaPlayer does not feature any commenting functions yet.

Consumers?

everyzing_embed_clips.pngWe asked Tom Wilde if EveryZing was planning to release a consumer version of this product, but for now, EveryZing is focused on a licensing the MetaPlayer to its corporate partners. However, Wilde did not rule out the possibility that EveryZing could release a consumer oriented product in the future.

We know that Google, which has a vested interested in making videos searchable, is also working on a speech-to-text product for YouTube. The company has already made some videos related to the U.S. election searchable http://www.readwriteweb.com/archives/google_audio_indexing_keyword.php , and chances are the Google will roll this feature out to a wider range of its users in the future.

]]>Discuss]]>
http://www.readwriteweb.com/archives/everyzings_speech_to_text_metaplayer.php http://www.readwriteweb.com/archives/everyzings_speech_to_text_metaplayer.php Products Wed, 22 Oct 2008 09:24:03 -0800 Frederic Lardinois
Google Audio Indexing: Keyword Searches for Political Videos google_audio_logo.jpgTwo months ago, Google released a widget for iGoogle that allowed users to search for keywords in election related speeches on YouTube. Today, Google expanded the scope of this search to a larger selection of political videos and the widget graduated to become a full-blown Google Labs product named GAUDI. Interestingly, the tagline in Google Labs describes the product as a search for "what people are saying inside YouTube videos," even though its focus so far is very narrow. Over time, we expect Google to expand this service to cover a larger selection of videos or even all of YouTube.

]]>Sponsor

]]> Only Politics

Google currently only indexes videos in YouTube's "Politicians" channel, which includes both speeches and political advertisements. As Google explains, political speeches are a relatively easy target for speech recognition software because they have received a lot of academic and industry attention and are known to perform well. However, in the GAUDI faq, Google also hints at the fact that Google expects to expand this product beyond these political speeches in the future.

google_audio_example.png

Great Start, But Interface Needs Improvements

While the interface is very well designed, we can image a number of improvements. It would be extremely useful, for example, to be able to sort videos by date. Right now, they are always sorted by relevance, though in this fast-paced political season, it is often more important to know when somebody last used a certain phrase.

Also, while the variation on the YouTube video player Google uses here is well suited for this task, you can not embed the videos on your own site. All you get is a link to the video on the Audio Indexing site.

As of now, Google also doesn't offer an API for this service. Given Google's record, however, we assume that it will make an API available in the near future.

At times, like all speech recognition systems, GAUDI will also make mistakes. Overall, however, we came away impressed with the quality of the transcriptions.

]]>Discuss]]>
http://www.readwriteweb.com/archives/google_audio_indexing_keyword.php http://www.readwriteweb.com/archives/google_audio_indexing_keyword.php Products Tue, 16 Sep 2008 09:25:34 -0800 Frederic Lardinois
You Play a Game, Computers Get Smarter, AI Starts to Work Last week a new site called Gwap was launched by Carnegie Mellon's School of Computer Science. The site offers an array of multi-player games that have a benefit beyond just that of momentary distraction or amusement. These games are helping improve image and audio searches, teaching computers to see, and enhancing AI. However, all that won't matter to the players because, as it turns out, these games are actually fun.

]]>Sponsor

]]> About Gwap

Nicholas Carr blogged about Gwap a couple of days after its launch, noting that "one thing the Internet enables, which wasn't possible before, at least not on anywhere near the same scale, is the transfer of human intelligence into machine intelligence." In Gwap, which stands for "Games With a Purpose," that transfer of intelligence is done by getting people to do the routine chores that computers don't know how to do - chores like tagging photos, describing songs, and outlining objects, as well as transferring a good bit of human common sense to the machine. The trick to getting people to do these things is to make the work fun. Hence the games.

The creator of these games is Luis von Ahn, winner of a 2006 MacArthur Foundation "genius grant" and a pioneer in the field of human computation. Ahn is most notable for helping to develop CAPTCHAs (Completely Automated Public Turing Test to Tell Computers and Humans Apart), those somewhat annoying but rather effective distorted letter puzzles used millions of times each day. Last year, he also introduced the "reCAPTCHA," where CAPTCHAs were used to gain access to a web site while also helping digitize old books.

Gwap homepage

The Games

Gwap currently features five games, one of which is an old classic called the ESP Game. In the ESP game, two players view the same image and try to guess words that the other player would use to describe it. Google licensed this technology and launched Google Image Labeler to help improve the quality of their image search results.

The four new games include:

Matchin, a game in which players judge which of two images is more appealing, is designed to eventually enable image searches to rank images based on which ones look the best.
Tag a Tune, in which players describe songs so that computers can search for music other than by title - such as happy songs or love songs.
Verbosity, a test of common sense knowledge that will amass facts for use by artificial intelligence programs.
Squigl, a game in which players trace the outlines of objects in photographs to help teach computers to more readily recognize objects.

According to the Carnegie Mellon announcement, von Ahn plans to add a lot of games to the site, saying "we have three more that we'll be launching in the coming months." He hopes that by having all the games on the same site it will encourage players to try several different ones. Players also have a single sign-on and password, Top Player rankings, and online chats, said von Ahn.

The Human Processor

In his whitepaper entitled "Invisible Computing," von Ahn compared game design to to algorithm creation, saying:

"...it must be proven correct, its efficiency can be analyzed, a more efficient version can supersede a less efficient one, and so on. Instead of using a silicon processor, these "algorithms" run on a processor consisting of ordinary humans interacting with computers over the Internet."

In other words, we're the processor. The machine is us.

This concept isn't entirely new - Amazon's Mechanical Turk, for example, pays people to contribute their time to work on small, simple tasks called "Human Intelligence Tasks," or HITs. However, unlike HITs, which can sometimes be boring or tedious, the games on Gawp are actually fun - and they don't feel like work.

Some believe that human powered processing is the next big wave for computing. You could argue that Mahalo, the human-powered search engine is an example of this. (Though others call it a human-powered link farm.) Perhaps a better example is ChaCha, the mobile Q&A service that uses human guides to respond to questions called or texted in from your cell phone. We've also covered other human-powered services on RWW in the past, like the Galaxy Zoo and Stardust@Home project, among other (our coverage here). Many of these efforts have tried to incorporate an element of "fun" into what is actually work.

Whether Gwap will actually gain momentum and get a large number of people involved is yet to be seen, but it is definitely has potential to help teach computers the things they can't do for themselves....yet.

]]>Discuss]]>
http://www.readwriteweb.com/archives/you_play_a_game_computers_get_smarter.php http://www.readwriteweb.com/archives/you_play_a_game_computers_get_smarter.php Products Fri, 23 May 2008 05:00:00 -0800 Sarah Perez