-
Notifications
You must be signed in to change notification settings - Fork 5
Related Projects
There are several projects which have attempted to define standards for tabular information exchange. Some of these may be useful in defining the OMERO.features API, or potentially form a backend storage mechanism.
- This is a very recent project started by Max Ogden, GitHub repo
- What is dat
-
git
for open-data - Manages data sychronisation and collaborative modifications to data
- Aims to supports billions of rows and real-time access
Some of the GitHub issues look really relevant, such as common data structure for tabular data
Contains some interesting ideas, could be relevant for sharing data across multiple OMERO servers.
- Variables: a multi-dimensional array, has a name, type and shape
- Attributes: has a name, type, length and value (can be an array); can be global (per dataset) or per variable
- Supports user-defined types including compound types
We're creating lightweight standards and tooling to make it effortless to share and get data
An Open Knowledge Foundation project to publish useful datasets in easy to use formats, and attempting to define simple interchange formats
- Data Packages
- Tabular data Packages
- For example each dataset could consist of a JSON file describing the dataset, a CSV file containing the data, and a JSON file to provide additional metadata for each CSV column
Key ideas include keeping the format simple so that everyone can use it without rewriting their tools.
Defines the structure of data based on triplets, Subject - Predicate - Object.