Score facilitates the transfer and storage of your data seamlessly and flexibly for cloud-based projects. This storage and transfer system helps you manage data upload and download with powerful features such as file bundling and resumable downloads.
Score uses the concept of pre-signed URLs (see Amazon S3 definition here) to manage data transfer to and from your cloud storage provider. As such, Score can be thought of as a broker between an object storage system (such as Amazon S3) and the user authorization system, with the responsibility of validating user access and generating the pre-signed URLs required for object access.
Working together, Song and Score enable secure and distributed data management. Score works with object-based storage including Amazon Web Services S3, Azure Storage, and Openstack Ceph to enable file upload and download that can be parallelized into multiple parts and easily resumed with high integrity for a fault-tolerant data transfer. Specific features to support genomic data have been built into Song and Score: file bundling to match genomic files with their index files, and slicing of a sequencing read file for a targeted region instead of downloading the whole file.
Score currently supports data transfer with several popular cloud-based storage providers:
To enable high performance transfers, Scores supports multipart file uploads and downloads. By implementing a multipart transfer solution, Score provides several key benefits:
Score performs standard MD5 validation against all file uploads and downloads to check for corrupted files and ensure data integrity.
Similar to other products in the Overture software suite, Score has particularly useful applications in the field of Genomics, including the following features:
As a data transfer management system, Score is focused on managing data upload and download, and does not handle the complexities of file metadata validation. To handle this, Score is built to interact with a required companion application, Song. Song is responsibe for validating file metadata, assigning unique global identifiers for data management, assigning permisssions for open (public) versus controlled (authentication required) file access, and so on.