voice - ReadWriteWeb http://www.readwriteweb.com/feeds/tag/voice en Copyright 2009 Richard MacManus readwriteweb@gmail.com Tue, 24 Nov 2009 07:47:40 -0800 http://www.sixapart.com/movabletype/?v=4.23-en http://blogs.law.harvard.edu/tech/rss Enjoysthin.gs: A Dazzling New Way to Share and Discover Multimedia thingslogo3.jpgThe internet is a visual medium, so it's no surprise that visual bookmarking services are becoming increasingly popular. The newest one we've discovered, called Enjoysthin.gs, takes the cake so far.

Enjoysthin.gs lets you share all kinds of media, from images to videos to quotes to audio you call in and record from your phone. The interface is beautiful and the site is a real joy to use.

]]>Sponsor

]]> The service was created by Ted Roden, formerly of artists' video site Vimeo and now a "creative technologist" at the New York Times.

enjoysthingsscreen1.jpg

The user experience on the site is a real home run; from browsing through the beautifully displayed items from other users to interacting with the site's basic functionality. Creating an account is remarkably smooth and the plush, over-sized displays make us want to jump up and down and clap our hands.

Anyone can tag items, even those saved by other people, and you can view recent shared items globally by tag. Recent activities by your friends are displayed as another item in a big box when you are logged in. There's even an API for integration with outside services; though we dare anyone to try to make a cooler interface, it would be nice to see Enjoysthin.gs on FriendFeed.

There are clearly more features that are still needed, there are no RSS feeds or ways to view the most popular items, for example. Roden is hard at work improving the site, he added 31 new features in December alone. The new feature that lets you call in a voice recording is very cool - a message we recorded was posted to our profile page in just a few minutes.

You can befriend me at Enjoysthin.gs on this page. I'll probably be using the service for at least the next few days. Not because it's particularly popular, secure, supportive of data portability or anything like that - just because it's so damn much fun to use.

]]>Discuss]]>
http://www.readwriteweb.com/archives/enjoysthings_visual_bookmarking.php http://www.readwriteweb.com/archives/enjoysthings_visual_bookmarking.php Multimedia Wed, 04 Feb 2009 10:01:38 -0800 Marshall Kirkpatrick
Cartoon: Talking Web The word on the street this month is speech. (Which makes it the spoken word on the street.) Speech-based iPhone apps are just a throat-clearing for the stream of oratory that IBM says we can expect from computer users within the next five years.

It may finally be time I did something about that compulsive swearing issue, unless I want some seriously skewed search results. But that may only be the beginning.

]]>Sponsor

]]> Just as a lot of us have sore arms, shoulders and necks from using mice and keyboards, we may find unexpected consequences from training our voices to use precise diction, simple sentences and clear directives. For instance, we may all end up sounding like voice-mail systems in real life. ("Darling, will you marry me?" "It sounds like you're asking me to marry you. I can help you with that.")

My biggest fear, though, is that we'll lose something far more precious than the nuance and subtlety of the human voice - and that's the ability to yell at a recalcitrant device with impunity. There's profound satisfaction in telling your computer to go screw itself, without worrying that it might attempt to comply with the instruction.

More Noise to Signal

]]>Discuss]]>
http://www.readwriteweb.com/archives/cartoon_talking_web.php http://www.readwriteweb.com/archives/cartoon_talking_web.php Cartoons Sun, 30 Nov 2008 14:14:11 -0800 Rob Cottingham
IBM: Talking Web Will be Commonplace in 5 Years Every year IBM releases a "Next Five in Five" list, a list of innovations that "have the potential to change the way people work, live and play over the next five years". This is the third such list, and it mentions a "Talking Web" among the 5 items. You will talk to the Web and the Web will talk back, according to IBM. In the future "you will be able to surf the Internet, hands-free, by using your voice - therefore eliminating the need for visuals or keypads."

In fact this is already starting to happen, as recent iPhone releases from Google and Say Where show.

]]>Sponsor

]]> We can definitely see the potential in a Talking Web - responding to emails quickly using voice, searching the web by barking orders into your computer / phone, composing blog posts by dictating, and so on. The shift to voice will happen in some places for cultural reasons and as a by-product of the rise in popularity of mobile phones to access the Web. IBM notes that in India the spoken word is more prominent than the written word in education, government and culture, so "talking" to the Web is set to usurp all other interfaces. IBM predicts that this change will be driven by new technology, with speech instead of text as the main interface. IBM calls this "VoiceSites," noting that "people without access to a personal computer and Internet, or who are unable to read or write, will be able to take advantage of all the benefits and conveniences the Web has to offer."

Will all this happen in 5 years? While at least one Slashdot commenter thinks it'll be more like 15 years, we see plenty of evidence of voice recognition software on the Web already. google mobileJust a week or so ago Google released an update of its Google Mobile App for the iPhone (iTunes link), which included voice recognition to translate voice commands into search queries. In our tests, we found the voice recognition to be very accurate. Google also offers voice search through GOOG-411 and Yahoo and other information providers offer similar services. There are a whole host of talking search engines in fact. Also, we're seeing voice apps from startups - such as the Say Where iPhone application (our review).

Here are the full 5 predictions from IBM:

  • Energy saving solar technology will be built into asphalt, paint and windows
  • You will have a crystal ball for your health
  • You will talk to the Web . . . and the Web will talk back
  • You will have your own digital shopping assistants
  • Forgetting will become a distant memory
]]>Discuss]]>
http://www.readwriteweb.com/archives/ibm_talking_web.php http://www.readwriteweb.com/archives/ibm_talking_web.php Trends Thu, 27 Nov 2008 06:00:00 -0800 Richard MacManus
Google Mobile App Update With Voice Recognition Finally Available - And it Works google_mobile_app_logo.pngGoogle's highly anticipated update of its Google Mobile App for the iPhone (iTunes link) finally arrived in the App Store this afternoon. The app now uses voice recognition to translate voice commands into search queries. The new version also gives you easy access to all your other Google apps, though the star of this update is obviously the voice recognition, which, in our tests, was amazingly accurate.

Note: the description in iTunes has not been updated yet, but if you follow this link, you can just download and install the new version.

]]>Sponsor

]]> It Actually Works

This new capability takes the Google Mobile App for the iPhone from a rather bland search application to a whole new level. Now, you can just say something like "sushi in Portland" or "movie showtimes" and the application will return a highly relevant set of results. In order to be useful, voice recognition has to be extremely accurate, and here, the new Google Mobile App delivers.

The app also makes use of the iPhone's built-in GPS and always returns local results together with the standard Google search results whenever these could be relevant. For us, this worked especially well when we searched for restaurants or stores.

google_mobile_app_sushi.pngSurprisingly, the app also did quite well when we search for relatively complex addresses, though it didn't automatically take us to Google Maps, which would have been really nice.

It also helps that Google has updated the way it displays search results in the app, with easy ways to get directions or call a restaurant directly from the app.

Interestingly, the apps seems to be doing the voice recognition on the phone itself. Most of Google's competitors first record a sound file, then send it to a server, and then return results. Doing the number crunching locally makes for a faster search experience, especially when using the EDGE network.

The only functionality that is missing is the ability to search your contacts and make phone calls directly from the app, but chances are that there are technical reasons for this (names tend to be a hard problem for voice recognition programs).

Easier Than Typing

Google, of course, already offers voice search through GOOG-411 (and Yahoo and other information providers offer similar services). Also, other iPhone applications like Say Where (iTunes link) already offer voice recognition (though our experience with that application was decisively mixed).

Overall, however, Google's iPhone application works better and faster than any of its competitors. Most voice recognition programs have some issues with unusual accents, and the Google Mobile App is no exception. However, when it works, it works extremely well and using it to search for information is actually easier and faster than having to type a query.

 

]]>Discuss]]>
http://www.readwriteweb.com/archives/google_mobile_app.php http://www.readwriteweb.com/archives/google_mobile_app.php Products Mon, 17 Nov 2008 21:15:25 -0800 Frederic Lardinois
Google Launches Video and Voice Chat for Gmail gmail_logo_nov08.pngGoogle today announced a major update for Gmail: video and voice chat. To make this work, users only have to install a small browser plugin. Google will start enabling these new features for all Gmail and Google Apps users at 12pm PST today. As is typical for new releases in Gmail, Google will roll this feature out to all users over the "next day or so."

The new chat functionality will work on PCs with Windows XP or Vista, as well as on Macs with OS X 10.4 or later. The plugin will work with all the major browsers, including Google's own Chrome, Firefox, Internet Explorer, and Safari.

]]>Sponsor

]]> From what we have seen so far, the videos integrate directly with the standard chat function in Google. To get started, you have to first enable this new functionality from the 'Options' menu at the bottom of a Gmail chat window. After that, all you have to do is select the contact you want to speak with and click on 'Video & more' in the lower-left part of the chat window. From there, you can then start a voice or video chat.

We have not been able to test the video and voice quality of these chats yet, but we will update this post once we get a chance to try it out for ourselves.

gmail_video_chat.png

Skype?

Interestingly, Google Talk, Google's desktop chat program does not have any video functionality yet, though Google enabled voice chat for Google Talk a long time ago. By adding voice and video chat, Google is clearly going up against Skype, which has made video chat one of its main features in the latest version of its software.

gmail_video_chat_2.png

]]>Discuss]]>
http://www.readwriteweb.com/archives/google_launches_video_and_voic.php http://www.readwriteweb.com/archives/google_launches_video_and_voic.php Products Tue, 11 Nov 2008 12:00:01 -0800 Frederic Lardinois
New Translation Services Come to MS Office and JaJah babel-logo.pngAutomated translation services seem to be getting more and more traction these days. Today, we saw announcements about new translation related products from both Microsoft and telephony service JahJah. Microsoft announced that it will be giving its users a free update that will integrate Windows Live Translator into MS Office 2003 and 2007, while JaJah is now offering free voice translations from Mandarin into English through JaJah Babel.

]]>Sponsor

]]> While JaJah doesn't specifically pitch this new service in the context of the Olympics, it is obviously releasing this just in time for the opening ceremonies.

Microsoft Office

Out of the two announcements, Microsoft's is probably the least exciting, but, on the other hand, there is a good chance that it will see a lot more actual use than JaJah's voice translation. The Microsoft Research Machine Translation team has just released this update to MS Office 2003 and 2007 to the Office team for integration, but they already offer instructions on their blog for setting this up yourself without having to wait for the official update.

The integration with Windows Live Translator allows you to translate English texts into Chinese, Japanese, Korean, French, German, Italian, Arabic, Dutch, Portuguese, and Spanish, as well as vice versa. We have tested the Windows Live Translator and the translations were generally about as accurate as you would expect from machine translations. There are various mistakes and words it doesn't recognize, but overall, the translation is relatively readable and gives you at least some impression of the original text.

office-translate.png

JaJah Babel

JaJah Babel is clearly the sexier product of the two. You can call access numbers in the U.S., England, or Australia, and after a voice prompt, you simply speak the text you want to be translated into Mandarin. The service will then replay your message, you acknowledge the accuracy of the input, and after a short delay, you will hear the translation. Given our general lack of knowledge when it comes to Mandarin here, we can't vouch for the accuracy of the translation, but the service itself worked very well and seemed to understand at least our initial input accurately.

If you are in China, of course, the fact that you have to call an international number to get this to work is a bit of a limitation.

jajah-numbers.png

Other Translation Services

There seems to be quite an interest in working on consumer oriented translation services right now. Just yesterday, we wrote about Mloovi, which translates RSS feeds trough Google Translate, and earlier last month, we wrote about the collaborative dictionary and translation service Lingro.

Babel Fish

JaJah's product is especially interesting here because it takes speech as its input and it will get even more interesting once it works for other languages beyond Mandarin as well. JaJah is offering this service based on IBM's technology, and given IBM's expertise in doing voice-to-voice translation, it will probably only be a matter of time before we see support for more languages. Besides other projects, IBM already supports the U.S. Army with an English to Iraqi Arabic translation service.

There has always been a lot of hype around the possibilities of instant voice translations, but very few products were ever good enough to make it in the consumer/business market. JaJah represents a major step forward here, even if its voice prompts make the service a bit less frictionless than the science-fiction ideal if autmated, instant translation.

]]>Discuss]]>
http://www.readwriteweb.com/archives/translation_comes_to_ms_office.php http://www.readwriteweb.com/archives/translation_comes_to_ms_office.php News Thu, 07 Aug 2008 12:05:50 -0800 Frederic Lardinois
Utterz Expands Media Messaging Service Multimedia messaging service Utterz is launching an expanded offering Monday morning that will allow users from 17 additional countries outside of the US to post to the service with their mobile phones. Utterz combines voice, video, photos and text to facilitate conversations either on the Utterz site, through Twitter or on your own blog off-site.

In addition to the local phone numbers for 17 additional countries, Utterz is also launching threaded conversations (small but important), webcam video capture and a newly designed site. The cow motif will likely stay, but whatever. (Update: Upon seeing the relaunched site, there's actually a whole lot less cow action! I kind of miss the cow, now that it's gone.)

]]>Sponsor

]]> There's a lot of nice little touches here, check out this embedded player from Utterz for example. That's pretty cool. Except the text is too small.

On Seesmic

The most logical company to compare Utterz to is Loic Le Meur's Seesmic, which I wrote last week transcends comparisons with the leading micromessaging social platform Twitter.

Utterz has far more features than Seesmic and is also very well thought out, at times. It's not as slick and usable as Seesmic. You can fall off a log and participate in Seesmic, once you've gotten access to the closed alpha at least (and gotten over any aversion you have to Silicon Valley hype). The feature gap is big enough, really, that the two may as well be different services. Seesmic is a good place to go and have short video conversations. Utterz is a service you can use to have more complicated and flexible conversations in mixed media. With Utterz you can post an audio message first to your account, then edit the message to add images and text, then have it all appear as a blog post on your off-site blog ten minutes later. That's pretty cool.

Growing Utterz

For whatever reason, Utterz is also growing much slower than Seesmic, despite the fact that there's no invitation required. Utterz says, and I agree, though that there are so many people in this world with a cell phone that there's not much use squabbling over whether one startups few thousand early users are more than another's.

When I asked Utterz though what their path to market would be, they told me it would be "focusing on a particular set of topical interests, like political dialog." Snore.

The strangely disconcerting anti-hero cow mascot and the general clunkyness of the site aside, though, Utterz is a good service. Once users get used to using it, though, I think many will like it quite a bit. Enabling users in 17 additional countries to come on board is a great move and one I'm sure RWW readers will appreciate.

The list of newly included countries follows:
Australia, Belgium, Canada, Finland, France, Ireland, Israel, Italy, Japan, Mexico, New Zealand, Netherlands, Norway, Spain, Sweden, Switzerland and the United Kingdom.

]]>Discuss]]>
http://www.readwriteweb.com/archives/utterz_expands_media_messaging_service.php http://www.readwriteweb.com/archives/utterz_expands_media_messaging_service.php Products Mon, 04 Feb 2008 05:00:11 -0800 Marshall Kirkpatrick