With great data, comes great data responsibility. Genomic data files are being generated at a fast pace and more and more laboratory groups are required a method to manage this fast growing pile of data. Song provides a metadata management and storage system to easily track and manage files in a secure and validated environment, against your established data model. Some features are particularly tailored towards genomic files, but Song supports any data type!
In conjunction with the data upload tool Score, Song provides:
Song manages a lifecycle of data publication from initial upload, to publication, and even eventual removal of data.
UNPUBLISHED
state. PUBLISHED
state.SUPPRESSED
state, making it unavailable for search and download. We recognize that there are a multitude of use cases for how different institutions may collect data elements. With that in mind, Song is ultimately built to be flexible for any type of data model. There is a small "base" data model that all Song deployments follow to track basic patient identifiers (in the context of genomic data), but beyond that any desired business rules can be encoded within Song's Dynamic Schemas, which are based on JSON Schema.
As a metadata management system, Song does not handle the complexities of cloud file upload. To handle this, Song is built to interact with a required companion application, Score, which manages secure and fast file upload & download, as well as standard genomic file applications, for example viewing with samtools
to view or download portions of genomic files with BAM Slicing
.
As part of the larger Overture.bio software suite, Song can be optionally used with additional integrations, including: