Just given a talk at Riken about metadata. People seemed very positive, there is clearly a desire to do this and to get more data types out there. I got the question about requiring too much metadata to understand an experiment; most of the rest were people saying "have you thought about using…?".

The one that I hadn’t thought about is provide metadata for gold standard, generated (non-experimental) data. My initial response is to say that we should be storing the service for producing the data, rather than the data, although there are purposes for standard generated data — enabling deterministic behaviour of tools over "random" data.

Originally published on my old blog site.