Swestore-dCache

From SNIC Documentation
Revision as of 14:21, 14 July 2016 by Jens Larsson (NSC) (talk | contribs) (Acquire an eScience client certificate (for all users))
Jump to: navigation, search


Swestore is a robust, flexible and expandable short and medium term storage system aimed at storing large amounts of data produced by various Swedish research projects. It is based on the dCache software.

Swestore is distributed across the SNIC centres C3SE, HPC2N, Lunarc, NSC and Uppmax. Data is stored in two copies with each copy at a different SNIC centre. This enables the system to cope with a multitude of issues ranging from a simple crash of a storage element to losing an entire site while still providing access to the stored data.

One of the major advantages to the distributed nature of Swestore is the excellent aggregated transfer rates possible. This is achieved by bypassing a central node and having transfers going directly to/from the storage elements if the protocol allows it. Swestore can achieve aggregated transfer rates in excess of 100 Gigabit per second, but in practice this is limited by connectivity to the end user, each university or a limited number of files (typically max 1 Gbit/s per file/connection).

To protect against silent data corruption the dCache storage system checksums all stored data and periodically verifies the data using this checksum.

The dCache system does NOT yet provide protection against user errors like inadvertent file deletions and so on.

Getting access

Apply for storage (for the PI)

All normal Swestore projects are managed using the SNIC User and Project Repository portal, SUPR. Please follow the instructions on the Apply for storage on Swestore page to apply for storage.

Acquire an eScience client certificate (for all users)

Swestore authentication is done using certificates. All project members needs to acquire a certificate. Follow the instructions on the Grid certificates page. This step can be performed while waiting for the storage application to be approved and processed. Of course, if you already have a valid eScience certificate you don't need to acquire another one.

Manage your eScience client certificate (for all users)

For Digicert certificates
If intending to access Swestore from a SNIC resource, please make sure you also export the certificate, transfer it to the intended SNIC resource and prepare it for use with grid tools (not necessarily needed with ARC, see proxy certificates using Firefox credential store).
For Nordugrid certificates
Please make sure to also install your client certificate in your browser.

All project members have to register in SUPR and be added to the approved project by the PI. All users also have to register their certificate in SUPR. This information is used by Swestore to authenticate the users when accessing the storage area. Registering the certificate is easy though. Make sure your certificate is stored in your browser, log in to SUPR , click "Personal Information" in the left menu, click "Register Client Certificate" and follow the instructions.

Support

If you have any issues using Swestore, please do not hesitate to contact support@swestore.se.

Using Swestore

Access protocols

Currently supported protocols
HTTP/WebDAVProtocol
http://webdav.swestore.se/ (unauthenticated, read-only)
https://webdav.swestore.se/ (authenticated, read-write)
GridFTP - gsiftp://gsiftp.swestore.se/
(NFS4.1)
(Storage Resource Manager - srm://srm.swegrid.se/)

Download and upload data

Interactive access using a web browser
Swestore is accessible in your web browser as a simple directory index interface at https://webdav.swestore.se/
To browse private data you need to have your certificate installed in your browser (default with Digicert certificates, see above). Projects are organized under the /snic directory as https://webdav.swestore.se/snic/YOUR_PROJECT_NAME/.
Transfer and delete data from the command line

There are several tools that are capable of using the protocols provided by Swestore. For interactive usage on SNIC clusters we recommend using the ARC tools which should be installed on all SNIC resources.

As an integration point for building scripts and automated systems we suggest using the curl program and library.

Use the ARC client. Please see the instructions for Accessing Swestore with the ARC client. Recommended method when logged in on SNIC resources.
Use lftp. Please see the instructions for Accessing Swestore with lftp.
Use cURL. Please see the instructions for Accessing Swestore with cURL.
Use globus-url-copy. Please see the instructions for Accessing Swestore with globus-url-copy.

Tools and scripts

There exists a number of tools and utilities developed externally that can be useful. Here are some links:

Slides and more

Slides and material from seminar for Lund users on April 18th

Usage monitoring