8 Principles for open government data & beyond

Internet can add power to the right to information, but it depends on how government websites are designed

I argued in a recent post that digital technologies can add power to the right to information movement by making information easily, quickly and cheaply accessible.  But for online data to be effective, government websites have to be suitably designed.  Here are some thoughts on this question:

The 8 principles

A good starting point is a set of far-reaching principles developed in meeting held at California:

Complete: All public data are made available. Public data are data that are not subject to valid privacy, security or privilege limitations.

Primary: Data are collected at the source, with the finest possible level of granularity, not in aggregate or modified forms.

Timely: Data are made available as quickly as necessary to preserve the value of the data.

Accessible: Data are available to the widest range of users for the widest range of purposes.

Machine processable: Data are reasonably structured to allow automated processing.

Non-discriminatory: Data are available to anyone, with no requirement of registration.

Non-proprietary: Data are available in a format over which no entity has exclusive control.

License-free: Data are not subject to any copyright, patent, trademark or trade secret regulation. Reasonable privacy, security and privilege restrictions may be allowed.

Beyond 8 Principles

These principles can have radical implications for how we use governmental data in India.  But to make websites an effective tool for right to information, we have to supplement these with a few other principles:

Authenticity: Information should be automatically certified like any document made available through a formal request, so government is legally accountable to stand by information provided.

Digitising at source: Some reasonably well designed websites contain little or no information in them since a lot information is not digitised.  Digitising information at source and integrating it systematically will avoid duplication and ensure completeness of information.  This needs reliable systems, infrastructure at the grassroots and a suitable legal framework.

Efficient data systems: Information should not be duplicated, and should be gathered at source in a convenient format.

Digitising all basic information: Maps, census data, village infrastructure, etc. are used in a lot of analysis.  Making this information available will increase the ways in which we can use such data dramatically.

I will illustrate the implications of these principles with concrete examples in a series of posts shortly.  Watch this space.

This post is a part of the series on technology and governance in India.

You may also be interested in:

  1. Usability: Giving information is not enough Data, data everywhere, not a bit to use: Why the Budget website is informative but nearly useless, and how to make it usable The Government...
  2. List of data sources for social science research My bookmarks on data sources that is continually updated An index of websites that provide data useful for social science research.  Given my interests, these...
  3. Creating a Yelp for the government In 2004 website called yelp.com was created to help people find local businesses such as small restaurants, salons, etc. The idea of the website was...
  4. Will the Unique ID project improve accountability? Improving accountability depends on political and social initiatives, and not technological solutions. But technology can shape society and politics in limited ways. This article looks...
  5. Politics, technology & accountability I Is accountability a political problem on which technology cannot have an impact? Some thoughts....

Related posts brought to you by Yet Another Related Posts Plugin.

About Vivek S.

I work with the Liberation Technology Program at Stanford University. My interests are cosmopolitan and this blog will reflect many such interests. For more, click on "Me" above.