Data Formats

asiches's picture
Submitted by asiches on Sun, 2012-05-06 16:05

Although opening data is important the format of this data is key to make it useful.

I usually hear people saying things like “they have opened some cool data, they rock!”. But when you look into that data what you see is a bunch of unrelated datasets, even partial data with some internal references to who-knows-what. So, it is a nightmare making that data useful. As a consequence these kinds of data are less used than expected. We need to reduce the consumption effort as much as possible.

What do you think about this?

Group audience: 
Interesting!
1 user has voted.

Comments

alorza's picture
Submitted by alorza on Mon, 2012-05-07 08:59

It's about time we reach an agreement on releasing some datasets, so every public body do it with the same structure and format. By doing so, we will be able to check the real use of public data.

Interesting!
0 users have voted.

rebentisch's picture
Submitted by rebentisch on Thu, 2012-05-24 03:12

The new fashionable phrase seems to be "machine-readable" data. Easy convertable formats do matter. Single data sets get aligned with others in unexpected ways. It doesn't make much sense to develop complicated highly specified public data formats when the purpose of the re-use is unknown. Simplicity matters. Multiple "universal" formats enable the re-use community to deliver their solutions.

Interesting!
1 user has voted.

Oscar Wijsman's picture
Submitted by Oscar Wijsman on Mon, 2012-05-07 23:08

Having the same structure for data does make things a lot easier but it is more important to have it first. Without access or sharing data we cannot do anything at all. Standards take time. So let's first get it all and in parallel work on the standards. In the mean time we can use big data tools to sort out all the unstructured data.

Interesting!
0 users have voted.

asiches's picture
Submitted by asiches on Wed, 2012-05-09 20:26

It's indeed important having the data first. Having it with a well-accepted format and a common structure is the inmediate following step, though. Doing this almost in parallel is the best scenario I can think about to reduce the effort from the Openers and from the Consumers.

Doing it in two different steps will have an impact hard to overcome specially for the Public Administrations with no easy way to face new projects. And of course dealing with backwards compatibility ensures unpleasent times and unwanted side costs for Openers and Consumers.

Interesting!
0 users have voted.

People

casang2's picture
Hensley Peterson's picture
Loankanassy's picture
Valentina Bazzarin's picture
katarzyna.szkuta's picture
rebentisch's picture
JacintaArcadia's picture
uzurutuza's picture
Kasper Peters's picture
lpujol's picture
ozanamblog's picture
annalisa.deluca's picture
Digital Agenda Assembly engagement
glqxz9283 sfy39587stf02 mnesdcuix8
glqxz9283 sfy39587stf03 mnesdcuix8
glqxz9283 sfy39587stf04 mnesdcuix8