data sharing - ReadWriteWeb http://www.readwriteweb.com/feeds/tag/data sharing en Copyright 2012 Richard MacManus readwriteweb@gmail.com Mon, 13 Feb 2012 19:17:22 -0800 http://www.sixapart.com/movabletype/?v=4.35-en http://blogs.law.harvard.edu/tech/rss What Can You Do With Government Data? Bust Politicians, That's What Prisons as Progressive Punishment by Flickr user Publik15.jpgPublic data from the government - is it an opportunity for innovation and essential accountability or a snoozer that no one really cares about? Government transparency advocacy group The Sunlight Foundation offers one example today of something that can be done with government data that is clearly worth doing - but the data they used hasn't been made available on the Obama administration's anemic new data repository Data.gov.

]]> Virginia Senator, Jim Webb, has a bill going through hearings that would create a National Commission to study and make recommendations on the reform of US criminal justice laws and practices. Twenty eight other Senators have signed on in support of the bill, but cross referenced statistics shed some interesting light on those Senators still standing on the sidelines.

PrisonMap.jpg

Sunlight downloaded a CSV list of prison populations in all 50 US states and created a map to visualize which states have the largest prisoner populations. Cross reference those numbers with the list of supporters who have singed on to the National Prison Commission Bill and what do you see? Surprise, surprise, not a single Senator from the three states with the biggest prison populations (California, Texas and Florida) are excited to see the the current criminal justice system studied and reformed.

That disinterest could be a matter of economics or satisfaction with the status quo. Or the aversion to studying the matter may represent a desire to avoid embarrassment. The explanation can't be drawn from the data, but it is good data to have. It's not data that Data.gov will help you get your hands on. That catalog of public data remains limited to some of the least controversial data sets online. Three weeks after it's launched, Data.gov keeps adding more and more data sets to its collection - but a search for the word "prison" still brings up zero results.

Take a country with the highest documented incarceration rate and total documented prison population in the world, where under-represented ethnic groups are incarcerated multiple times more than the dominate majority population. Isn't that the kind of situation where you'd like to know whether or not a politician supports just studying the matter? Who does and who doesn't? Easy access to government and public data can help shed important light on current events like this and that's why people get excited about calls to open data up to the public in better and better ways.

]]> Discuss]]>
http://www.readwriteweb.com/archives/what_can_you_do_with_government_data.php http://www.readwriteweb.com/archives/what_can_you_do_with_government_data.php Analysis Fri, 12 Jun 2009 11:55:54 -0800 Marshall Kirkpatrick
Despite New Openness, Facebook Remains Fundamentally Closed What are people saying on Facebook about the swine flu? Facebook knows, but they won't tell you. The company made a major move today to open up some of the data on the site in some interesting ways - but the conversation on Facebook remains fundamentally closed due to extensive privacy limitations and the company's disinterest in overcoming those limitations in an appropriate way.

Ask Twitter what people are saying on that site about the swine flu and you can get the full story to parse until you're blue in the face. The new Facebook openness is like interoperability between different telephone handset manufacturers but conversation remains closed between individuals. Conversation on Facebook is no more easy to analyze today than it was yesterday; that's the real opportunity here, not just the ability to send and receive Facebook messages through different applications.

]]> It's a networked world and we should be able to have our cake and eat it too in terms of secure conversation and network effects anonymously analyzed in aggregate.

The Good News

Facebook users will now be able to send and recieve all kinds of data through other websites and applications. Think Tweetdeck both reading from and writing to Facebook. It is good news that Facebook has decided to implement the work it's been participating in with the Activity Streams working group.

fbstreamsmeetup.jpg

This should enable Facebook messaging through different interfaces to suit different users' needs and desires. It should help cut back on user criticism about UI changes to the site - there will now be different services that serve up the data in different ways - take your pick.

Facebook told us today that every field displayed in the Facebook Newsfeed will be made available through the Activity Streams API, for example, content, username, timestamp, and source if we're talking about something pulled in from offsite. Facebook's, David Swain, told us he expects to "see a lot of innovation on top of the stream. We expect developers to filter it, remix it, and serve it up in new and interesting ways."

The Not So Good News

Unfortunately, the data that developers are able to work with is severely limited. They will simply be able to make a call for a user to Facebook and get back the friends' streams that this particular user has the permission to see. The data can't be cached for later analysis, the company isn't sure yet how far back in the archives access will be permitted (though they say the goal is to expose all the way back), and the Terms of Service will prohibit eyes outside of a user's Facebook friends from seeing the massive amounts of friend-limited data.

In other words, this is permission to build more interfaces for Facebook. That's cool, but that's not really what the world needs - more interfaces for giving Facebook love.

Facebook holds a mind-blowing amount of conversational data. The company is analyzing it extensively and it has an omniscient view of conversations across all the networks of friends and privacy restrictions. It uses that aggregate data analysis to make business decisions and to sell advertisements.

facebookisclosed.jpg

The rest of us are only allowed to give Facebook more data and to get back a sliver per user that will facilitate more user-level participation in amassing more data at Facebook.

The most common analogy used when standards are discussed is the railroads. When railroads agreed to use the same width of rail, then trains could travel from network to network and transcontinental transport became possible. Today's announcement by Facebook is something like that, and that's great. But imagine the analytic value that could be drawn from getting reports about train times and rail conditions across the line. That's the kind of data that allows for more intelligent and interesting decisions to be made. That's explicitly prohibited, in this case, by Facebook.

How is global climate change being dealt with? It's being dealt with through global sharing of data for analysis. How is science being advanced? In some of the most exciting cases, it's being advanced by public sharing of huge data stores that anyone can then analyze for patterns. What's been some of the most important data in the United States over the last few centuries? It's census data that has been made publicly available in anonymous aggregate but in a format that could be cross-referenced with geography, commercial, and other data to better understand the world we live in.

The data that Facebook controls, conversations and social connections, could be used for analysis of real-time social patterns which could lead to world-shaking new insights. Do we get access to that data? No.

Why not? We don't get that access because Facebook was built on a fundamental promise of privacy and a complex system of privacy controls. Privacy is good, it's very good. But, the census gathers and exposes personal data without violating privacy. Lots of systems do.

twitternet.jpg

Facebook needs to too. The data the network controls is just too valuable to keep locked up for only the company's own analysis.

Facebook's David Swain says that privacy on Facebook is evolving and the company is interested in facilitating this kind of analysis, but that Facebook is "not hearing users clamor for interesting analysis." That's a disingenuous argument coming from a company that is famous for pushing through important technologies against the wishes of its users, much less without their request.

The era of user-published content is still catching up to millions of people, but innovation is fast moving beyond that point. In order to deal with pressing global problems and truly leverage what's possible with contemporary internet technology, we need data-centric hooks extended for developers to grab hold of and build the future. Facebook is one of the biggest stores of data that innovation could be built upon. Once that kind of openness is achieved, then we'll say that Facebook is truly opening.

]]> Discuss]]>
http://www.readwriteweb.com/archives/despite_new_openness_facebook_remains_fundamentall_1.php http://www.readwriteweb.com/archives/despite_new_openness_facebook_remains_fundamentall_1.php Analysis Mon, 27 Apr 2009 10:27:12 -0800 Marshall Kirkpatrick
Newspaper as a Platform: Guardian Launches API guardian_open_platoform_logo_mar09.pngThe Guardian just launched a new API which will allow third-party developers to access and reuse the Guardian's content database in their own applications. The new API is part of the Guardian's new Open Platform, which, as of today, consists of the API and a Data Store, but the Guardian also announced that it plans to offer more services in the near future. The Data Store is a collection of high quality data sets which are curated by the Guardian and hosted on Google Docs.

]]> This move by the Guardian comes just about a month after the New York Times opened up its Article Search API, which also gives developers access to a complete set of the paper's content (Disclosure: RWW is a syndication partner of the NYT). The New York Times, however, provides access to a more varied set of data and includes specialized APIs for accessing its movie reviews, breaking news, or information about the U.S. Congress. Other news organizations like the BBC or NPR also offer similar APIs.

Data Store Uses Google Docs

The Data Store and its accompanying blog are more directly aimed at consumers. The Data Store houses a collection of all of the statistics that the newspaper has published. Unlike the New York Times, however, the Guardian does not yet provide an equivalent to the Data Visualization Lab. Interestingly, the Guardian chose Google Docs as its repository for this data. Here, for example, is the Guardian's spreadsheet with the U.S. public debt since 2001.

Future of the Newspaper?

We have already seen a number of very interesting uses of the NYT APIs, and we will surely see a lot of interesting and useful applications that will now make use of the Guardian's data. What is more interesting, however, is that some newspaper are clearly beginning to understand where the future of their business lies (the Guardian now also exposes full-text RSS for all of its sections).  If anything, this feels like a far smarter way to go than the MediaNews Group's misguided attempt to put proprietary printers into readers' living rooms.

]]> Discuss]]>
http://www.readwriteweb.com/archives/newspaper_as_a_platform_guardian_announces_apis.php http://www.readwriteweb.com/archives/newspaper_as_a_platform_guardian_announces_apis.php News Tue, 10 Mar 2009 09:21:02 -0800 Frederic Lardinois
Yawnlog: A Social Sleep Tracker YawnLogLogo.jpgEvery night we lay down in our beds, our consciousness enters a different state and then we stay that way for the next 8 hours. It's pretty amazing if you think about it! It can be no surprise then that someone has created a way to track and share details about such a big part of our lives. Some people sleep with their friends, now the rest of us can track our sleep - with our friends.

Yawnlog is a wacky new site that lets you track how much sleep you're getting, note how good the sleep was, record your dreams and compare all of that information with your friends. This is no laughing matter! Imagine cross referencing aggregate sleeping hours and moods with a timeline of historically significant events. Silly as this service might sound, we think it sounds pretty cool, too.

]]> Some people like to track everything, and we do imagine that it could be helpful to see longer periods of poor or insufficient sleep than we realized we were experiencing.

YawnLogScreen.jpg

Unfortunately the site doesn't seem particularly well constructed, but we sure love the idea. (Update: Lead developer Matt Blake emailed to let us know that the site was put together in a 24 hour hackathon and is only a few days old. Fair enough, hopefully Blake will be able to continue developing it.) We found Yawnlog over on KillerStartups, where you'll find a lot of apps that look like they were written by people who aren't getting enough sleep but some good ones too.

Imagine the possibilities! Say you're thinking of taking a job at one place that reports an aggregate of restless, stunted sleep patterns - but another potential employer's people have an unusually high rate of dreams about flying! Which job are you going to take? Say Republicans slept better after Obama's first major speech last night than Democrats did after Bush's first speech in office. Talk about tangible evidence of bi-partisan comfort with the Presidency! How much sleep, on average, does the staff of ReadWriteWeb get compared to other big tech blogs? More and better sleep, I'd bet.

Of course all of that would require far more adoption of a technology like Yawnlog than is ever going to happen, but we do like to fantasize. This kind of aggregate sleep pattern analysis will probably be a bullet point in the brochure for the forthcoming Google Brain Implant.

In the meantime, we got a kick out of Yawnlog and will see how long we can stick with using it. It's proof that you can socialize anything online. If you'd like to put our sleepy heads together, my profile is here, friend me up and let's share some dreams. I've added Yawnlog to my Morning Coffee Firefox plug-in, so I'll be reminded to make an entry first thing each morning.

]]> Discuss]]>
http://www.readwriteweb.com/archives/yawnlog_a_social_sleep_tracker.php http://www.readwriteweb.com/archives/yawnlog_a_social_sleep_tracker.php NYT Thu, 26 Feb 2009 10:19:41 -0800 Marshall Kirkpatrick