LaunchBox - ReadWriteWeb http://www.readwriteweb.com/feeds/tag/LaunchBox en Copyright 2012 Richard MacManus readwriteweb@gmail.com Tue, 14 Feb 2012 15:30:00 -0800 http://www.sixapart.com/movabletype/?v=4.35-en http://blogs.law.harvard.edu/tech/rss Google Acquires reCAPTCHA to Fight Spam and Improve Google Books OCR recaptcha_logo_dec08.pngGoogle just announced that it has acquired reCAPTCHA, one of the leading providers of CPATCHAs, the hard-to-read puzzles you often have to solve before you can sign up for a new web service. Google, of course, isn't so much interested in owning software that can generate CAPTCHAs - that's an easy problem to solve - but is looking at reCAPTCHA as a way to improve the optical character recognition (OCR) software it uses for large scale text scanning projects like Google Books and the Google News Archive Search.

]]> According to Google, reCAPTCHA is currently in use on over 100,000 websites to prevent spam and fraud. the reCAPTCHA team, which is currently based at Carnegie Mellon University, will join Google.

Solving CAPTCHAs to Transcribe Books

recaptcha_book.pngWe took detailed looks at reCAPTCHA and how it works last September and in early 2007. In short, reCAPTCHA has found a nifty way to crowdsource book transcriptions. When users solve a CAPTCHA through reCAPTCHA, the software will give users two words: one with a known answer (the control word) and one where the OCR software wasn't quite sure what the word was. Once a certain number of users have solved the suspicious word with the same result, it becomes a control word itself and the OCR software can learn this word.

Now, Google will be able to use this same technology to improve its own OCR efforts. Google currently makes over 1 million out-of-copyright books available for download through Google Books and one of the main arguments against these books has been the fact that these texts are not edited and include a lot of OCR errors. With reCAPTCHA, Google could potentially bring the error rate down dramatically and make Google Books even more useful.

]]> Discuss]]>
http://www.readwriteweb.com/archives/google_acquires_recaptcha.php http://www.readwriteweb.com/archives/google_acquires_recaptcha.php News Wed, 16 Sep 2009 09:58:19 -0800 Frederic Lardinois
The Future of Mobile (Live from the Web 2.0 Expo) This morning at the Web 2.0 Expo in San Francisco, Jason Grigsby of Cloud Four, a mobile and web development firm, presented at a session about the mobile web's future. Specifically, he focused on the different types of mobile applications we have today - native apps, mobile web apps, and hybrid apps - and the challenges of developing across multiple platforms.

]]> The question that Grisby raised during his presentation is an important one: are native applications really the future of the mobile web? Applications built specifically for one device, such as the iPhone, aren't necessarily any more capable than those built using web technology. In fact, they can even tap into the functionality of the phone's hardware itself like the accelerometer and geolocation features. (Really!)

In fact, the ability for mobile web developers to get their apps on a number of platforms without having to start from scratch each time has been made even easier as of late thanks to a new project called Phonegap. This open source development tool is actually a web-based platform that lets developers build apps in HTML and JavaScript while also allowing those apps to take advantage of the core features in the iPhone, Android, and Blackberry.

When you look at the type of applications built using this technology, you may be surprised. If you didn't already know they were made using PhoneGap, you may have thought that they were native applications. Take the Blok-Buster game (iTunes URL) for example, a Tetris-inspired game that involves clearing similarly-colored blocks from the screen. You can tilt the phone horizontally, vertically, or even flip it upside down and it behaves just as a native application would. Yet it is just a web app using the PhoneGap's functionality.

Technology like this can speed up the process of getting applications deployed across a number of platforms. That's something that is more critical than ever as we move forward with mobile web development. Today, there are just too many languages and platforms to code for and in the future there could be even more. So, perhaps we shouldn't be focused solely on native applications as the future of the mobile web after all. Maybe the future of the mobile web will just be the future of the web.

There are a number of fascinating figures and observations from Grigsby's presentation (embedded below) about the state of the mobile web and the challenges ahead. It's definitely worth a look.

]]> Discuss]]>
http://www.readwriteweb.com/archives/the_future_of_mobile_web_20_expo.php http://www.readwriteweb.com/archives/the_future_of_mobile_web_20_expo.php Trends Thu, 02 Apr 2009 12:32:20 -0800 Sarah Perez
Amazon Rents Out MapReduce Power with EC2, S3 and Hadoop Amazon announced today that it is bridging two of its web computing services, EC2 and S3, with Hadoop, an open-source project that brings the same distributed data processing power as Google's MapReduce. In fact, it is calling the new service Amazon Elastic MapReduce. The new service will allow its EC2 customers to perform distributed MapReduce queries on enormous datasets stored in S3, paying only for the computation time they need.

]]> Hadoop has been an open-source project in the making for the last few years, inspired by Google's white paper on its version of MapReduce. The technology is an almost perfect fit with Amazon's growing web services, matching distributed CPU time with vast data storage requirements, both things that fit well with the cloud model.

The way MapReduce works is a fairly straightforward concept: You take a problem that requires working with a giant (and we're talking massive - sometimes petabytes) dataset, distribute working with the dataset over thousands of separate processes (called mapping) and then taking the thousands of results you get back and reducing those results into a single master result. For certain tasks, MapReduce can vastly improve the efficiency of these types of tasks, and adding more computing power gives you a linear improvement in speed.

Yahoo! has been using its own version of Hadoop for a while now. And even before this offering, larger Amazon Cloud Computing customers have already begun to use Hadoop in EC2. This is from Wikipedia's article on Hadoop:

As an example The New York Times used 100 Amazon EC2 instances and a Hadoop application to process 4TB of raw image TIFF data (stored in S3) into 1.1 million finished PDFs in the space of 24 hours at a computation cost of about $240 (not including bandwidth).

As Amazon says on its blog, "After a while [developers] tend to report that they begin to think in terms of the new style, and then see more and more applications for it." Which we believe means that MapReduce is the new, big hammer, and as developers start looking around, every dataset starts looking like a nail. This is good news for Amazon as it only stands to profit.

]]> Discuss]]>
http://www.readwriteweb.com/archives/amazon_rents_out_mapreduce_power_with_ec2_and_hado.php http://www.readwriteweb.com/archives/amazon_rents_out_mapreduce_power_with_ec2_and_hado.php News Thu, 02 Apr 2009 11:00:00 -0800 Phil Glockner
OneRiot Launches Alternative Twitter Search Engine oneriot_logo_mar09.pngOneRiot, a relatively new real-time search engine, launched a new Twitter search engine this morning that takes a very different approach to Twitter search from similar ventures we have seen lately. Instead of surfacing specific tweets, OneRiot focuses on shared links instead of just doing a keyword search on Twitter. While Twitter's own search, for example, will show you the conversation around the leaked copy of Wolverine, OneRiot will actually find the latest shared links about this topic on Twitter.

]]> oneriot_twitter_wolverine.png

On the search results page, OneRiot emphasizes links, though you can also click on a small link at the bottom of each result to see the tweets that included this link. From there, you can also reply to a tweet. If OneRiot finds new tweets that fit your search, it will automatically prompt you to reload the page.

To get data about current trends on Twitter, OneRiot uses both Twitter's own API, as well as data it gathers from users who have installed OneRiot's own toolbar in their browsers.

Looking for Feedback

As OneRiot's GM Tobias Peggs told us yesterday, the company decided to test its new Twitter search feature on a separate page for now (twitter.oneriot.com), in order to get feedback from customers.

We, for one, would like to see the ability to organize results by how many users have linked to a certain page, for example. It would also be nice if OneRiot gave us the option to search for conversations around a specific link by copying and pasting a URL into OneRiot's search box.

oneriot_expanded.png

Overall, we like OneRiot's approach to searching Twitter. In some ways, it is similar to MicroPlaza's search functionality, though OneRiot's results focus more on the real-time experience.

]]> Discuss]]>
http://www.readwriteweb.com/archives/oneriot_launches_alternative_twitter_search_engine.php http://www.readwriteweb.com/archives/oneriot_launches_alternative_twitter_search_engine.php Product Reviews Thu, 02 Apr 2009 10:06:05 -0800 Frederic Lardinois
Guide to Seed Fund Incubators (Y Combinator Clones) They say imitation the most sincere form of flattery. If that's true, then Paul Graham must be about to drown from all the praise. His Y Combinator project, which has funded nearly 60 startups since 2005 and has arguably inspired a new emphasis on smaller scale investments at traditional venture capital firms, has collected a cadre of imitators. Lots of them, from all over the world. While Graham may not like it, there are a large number of start up incubators following the model he created with Y Combinator and handing out microinvestments in web startups in return for a small stake.

]]> If you're a startup founder looking for a bit of seed funding to let you quit your day job and finish your web app or service, our guide to seed fund incubators will help you figure out where to apply. (Be sure to also read this great account of what it's actually like at one of these programs.)

Name: Y Combinator
Location: Cambridge, MA and Bay Area, CA
Investment: $5,000 + $5,000 per founder (i.e., $15,000 for two founders, $20,000 for three)
Stake Taken: 2-10% (usually about 6%)
Companies funded: Too many to list (about 58), but many you've heard of, like Reddit, Scribd, and Xobni (Wikipedia has a full list)
Next application deadline: April 2

Name: TechStars
Location: Boulder, CO
Investment: $5,000 per founder, up to $15,000 (3 founders)
Stake Taken: 5%
Companies funded: 9 so far, including Villij, and Intense Debate
Next application deadline: March 31

Name: SeedCamp
Location: London, UK
Investment: 50,000€ (about US$74,000)
Stake Taken: 10%
Companies funded: 6 so far, including Tablefinder
Next application deadline: August 12

Name: YEurope
Location: Vienna, Austria
Investment: 5,000€ per founder, up to 15,000€ (3 founders)
Stake Taken: 2-10%
Companies funded: Soup.io
Next application deadline: None

Name: Summer@Highland
Location: Lexington, MA
Investment: $7,500 for individuals or $15,000 for teams (split evenly)
Stake Taken: ?
Companies funded: 8
Next application deadline: Not yet announced

Name: LaunchBox
Location: Washington, DC
Investment: Between $15,000 and $30,000
Stake Taken: 4-8%
Companies funded: None yet
Next application deadline: March 14

Name: DreamIt Ventures
Location: Philadelphia, PA
Investment: Between $10,000 and $30,000
Stake Taken: 4-8%
Companies funded: None yet
Next application deadline: March 12

Name: Bootup Labs
Location: Vancouver, BC, Canada
Investment: ?
Stake Taken: ?
Companies funded: None yet
Next application deadline: None / not yet launched

Name: Bootphase
Location: Atlanta, GA
Investment: ?
Stake Taken: ?
Companies funded: None
Next application deadline: None / Not yet launched

See also: Charles River Ventures' QuickStart loan program, in which seed round startups receive a loan of up to $250,000 against a future Series A venture round (which CRV has the option to participate in). And see the annual Google Summer of Code program, in which stipends are awarded to students working on open source projects.

]]> Discuss]]>
http://www.readwriteweb.com/archives/guide_to_seed_fund_incubators.php http://www.readwriteweb.com/archives/guide_to_seed_fund_incubators.php Trends Mon, 04 Feb 2008 11:43:48 -0800 Josh Catone