Data.gov, the US federal government's new catalog of sets of public data for outside developers to mashup and analyze, now faces some friendly competition. The Sunlight Foundation, a non-partisan non-profit organization dedicated to government transparency, has announced that it will launch a National Data Catalog to go above and beyond what Data.gov offers.
We've been critical of the sparse offering from Data.gov but Sunlight's plan looks like it could surpass what even the most ambitious government program is likely capable of.
Here's how Clay Johnson, head of Sunlight Labs, described the need for this project:
Because of politics and scale there's only so much the government is going to be able to do. There are legal hurdles and boundaries the government can't cross that we can. For instance: there's no legislative or judicial branch data inside Data.gov and while Data.gov links off to state data catalogs, entries aren't in the same place or format as the rest of the catalog. Community documentation and collaboration are virtual impossibilities because of the regulations that impact the way Government interacts with people on the web.We think we can add value on top of things like Data.gov and the municipal data catalogs by autonomously bringing them into one system, manually curating and adding other data sources and providing features that, well, Government just can't do.
Johnson says the site, which is being developed with publicly visible open source code, will allow users to contribute sets of data, documentation to go along with data and links to places around the web that are using any set of data cataloged. It sounds like a ProgrammableWeb type site for public data.
In addition to government information, Sunlight will also be including derivative data sets, like information from the campaign contribution tracker OpenSecrets. That's something it's very hard to imagine a government website including.
Can Sunlight build a one-stop-shopping destination for public data, and will people make use of that? Time will tell, but it sounds like a very important project.
Comments
Subscribe to comments for this post OR Subscribe to comments for all ReadWriteWeb posts
Sunlight Labs thinks it can top what Data.gov is doing, and the plans look good http://bit.ly/4nAcHn [from http://twitter.com/marshallk/statuses/2674072723]
Posted by: Marshall Kirkpatrick
|
July 16, 2009 12:11 PM
It seems so obvious that you should use MindTouch to implement this.
MindTouch consists of an easy to edit wiki-like collaborative web interface for content creation (not so important for this project), a composition of RESTful web-services that provide machine access to data, an ACL, a scalable and extensible message bus and a web-service orchestration engine.
The MindTouch platform is WOA (RESTful design applied to 'www' tech), which means it's inherently standards compliant, easy to scale (everything is over HTTP--can cache the hell out of it) and easy to extend (stateless and resource oriented--don't have to understand the entire system to extend).
MindTouch is platform and technology agnostic. You can extend it ANY programming language (it is written in mostly C# on .NET/Mono, but most extend it in PHP and Java) and it runs equally well on Linux, Windows, BSD even Solaris!
Seems like a no-brainer. It is as if MindTouch were perfectly designed for this project. I should add: MindTouch is open source (GPL v2 with some Apache components).
thank you for your imformation...........