Storage and Interoperability (SI)

SI1: Facilitate distributed data management using Fedora
The Fedora-SRB database integration module was developed as part of the SI1 DART work package. The aim of the work package is to enable Fedora to handle large datasets natively, exploring the use of the Storage Resource Broker (SRB) as a backend storage manager for Fedora repositories, and exploring distributed data management using Fedora.

SI2: Improve interoperability between SRB and Fedora
The DART project worked with Fedora-based and SRB-based repositories. It was anticipated that not all large dataset environments should be, or would want to be, integrated into Fedora. The project therefore considered it necessary to ensure interoperability between these non-Fedora repositories and existing institutional repositories.

SI3: Support richer metadata for effective discovery
The Semantic Search Engine for SRB has been developed as part of the SI3 DART work package. The Storage Resource Broker (SRB) is a data grid application developed by the San Diego Supercomputer Center. It is middleware aimed at federating collections of distributed data and presenting them to the user as a coherent collection.

SI4: Secure transfer of data from sensors/instruments to repositories
The Grid data storage network will have a secure data access and transfer setup in place to meet the basic security requirements of the entire Grid network. The secure data transfer mechanism will cover sensor/instrument-to-Grid communication, intra-Grid communication, communication between users and the Grid in both directions, and inter-Grid communication.

SI5: Abstraction layer that supports data replication systems
SI5 concerns the development of a higher-level abstraction for replication services, called the Grid Replication Framework (GRF). The GRF allows the user to interact with a single API while using several different replication services.
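
The idea behind SI5, one API in front of several replication services, can be illustrated with a small sketch. This is not the GRF API: the class names, method names, backend labels and file names below are all hypothetical, and the backends only record replicas in memory rather than moving any data.

```python
from abc import ABC, abstractmethod


class ReplicationService(ABC):
    """Hypothetical common interface; NOT the actual GRF API."""

    @abstractmethod
    def replicate(self, logical_name: str, site: str) -> None:
        """Create a replica of logical_name at site."""

    @abstractmethod
    def locate(self, logical_name: str) -> list[str]:
        """Return the sites currently holding a replica."""


class InMemoryService(ReplicationService):
    """Stand-in backend: records replicas in a dict instead of moving data."""

    def __init__(self) -> None:
        self._replicas: dict[str, set[str]] = {}

    def replicate(self, logical_name: str, site: str) -> None:
        self._replicas.setdefault(logical_name, set()).add(site)

    def locate(self, logical_name: str) -> list[str]:
        return sorted(self._replicas.get(logical_name, set()))


class ReplicationFacade:
    """Single entry point that forwards each call to a named backend."""

    def __init__(self, services: dict[str, ReplicationService]) -> None:
        self._services = services

    def replicate(self, service: str, logical_name: str, site: str) -> None:
        self._services[service].replicate(logical_name, site)

    def locate(self, logical_name: str) -> dict[str, list[str]]:
        # Ask every backend and keep only those that hold a replica.
        found: dict[str, list[str]] = {}
        for name, svc in self._services.items():
            sites = svc.locate(logical_name)
            if sites:
                found[name] = sites
        return found


# Backend labels and dataset names are placeholders for illustration.
facade = ReplicationFacade(
    {"service-a": InMemoryService(), "service-b": InMemoryService()}
)
facade.replicate("service-a", "run42/output.dat", "site-a")
facade.replicate("service-b", "run42/output.dat", "site-b")
print(facade.locate("run42/output.dat"))
```

The caller only ever touches the facade, so a new replication service could be added by writing one more subclass of the assumed interface without changing client code.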

SI6: Simulation data dynamic retrieval or regeneration
In SI6 we propose a data life cycle, which starts when the data is first generated and tracks its progress through replication, distribution, deletion and possible re-computation.

SI7: Data pre-processing system for the secondary storage
The aim of this work package is to establish a cost-effective data pre-processing system. This system will be used for refining, integrating and storing synchronous and asynchronous data streams from instruments and sensors in secondary storage for later use in research.

SI8: Long-distance high speed and secure data transfer
This work package focuses on an optimised long-distance data transfer service, including AARNet and GrangeNet, network architecture, and hardware and software configuration.

SI9: Scope and pilot storage infrastructure requirements
This work package focuses on the storage infrastructure, including High Performance Computing data centres and the design of primary and secondary stores for synchronous and asynchronous data streams.
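
The life cycle proposed under SI6 (generation, replication, distribution, deletion, possible re-computation) can be pictured as a small state machine. The sketch below is only an illustration: the stage names follow the summary above, but the allowed transitions, the class name and the dataset name are assumptions, since the summary does not say in which order the stages may occur.

```python
from enum import Enum


class Stage(Enum):
    GENERATED = "generated"
    REPLICATED = "replicated"
    DISTRIBUTED = "distributed"
    DELETED = "deleted"


# ASSUMED transitions: the source lists the stages but not their permitted order.
ALLOWED = {
    Stage.GENERATED: {Stage.REPLICATED, Stage.DISTRIBUTED, Stage.DELETED},
    Stage.REPLICATED: {Stage.DISTRIBUTED, Stage.DELETED},
    Stage.DISTRIBUTED: {Stage.REPLICATED, Stage.DELETED},
    # Re-computation is modelled as a deleted dataset being generated again.
    Stage.DELETED: {Stage.GENERATED},
}


class DatasetLifecycle:
    """Hypothetical tracker that records each stage a dataset passes through."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.stage = Stage.GENERATED
        self.history = [Stage.GENERATED]

    def advance(self, new_stage: Stage) -> None:
        # Reject any move that the assumed transition table does not list.
        if new_stage not in ALLOWED[self.stage]:
            raise ValueError(
                f"{self.stage.value} -> {new_stage.value} is not allowed"
            )
        self.stage = new_stage
        self.history.append(new_stage)


ds = DatasetLifecycle("simulation-run-1")  # placeholder dataset name
for step in (Stage.REPLICATED, Stage.DISTRIBUTED, Stage.DELETED, Stage.GENERATED):
    ds.advance(step)
print([s.value for s in ds.history])
```

Keeping the full history, rather than only the current stage, is what would let a system decide later whether a deleted dataset is cheaper to retrieve from a replica or to regenerate.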
