Difference between revisions of "Swestore-dCache"

From SNIC Documentation
Jump to: navigation, search
m (moved SweStore to Swestore over redirect)
Line 1: Line 1:
 
[[Category:Storage]]
 
[[Category:Storage]]
 
[[Category:SweStore]]
 
[[Category:SweStore]]
SNIC is building a storage infrastructure to complement the computational resources.
 
  
Many forms of automated measurements can produce large amounts of data. In scientific areas such as high energy physics (the Large Hadron Collider at CERN), climate modeling, bioinformatics, bioimaging etc., the demands for storage are increasing dramatically. To serve these and other user communities, SNIC has appointed a working group to design a storage strategy, taking into account the needs on many levels and creating a unified storage infrastructure, which is now being implemented.
+
Many experiments produce large amounts of data. In scientific areas such as high energy physics (the Large Hadron Collider at CERN), climate modeling, bioinformatics, bioimaging etc., the demands for storage are increasing dramatically. To serve these and other user communities, SNIC has created a unified storage infrastructure, Swestore.
  
 
Swestore is in collaboration with [http://www.ecds.se/ ECDS], [http://snd.gu.se/ SND], [http://www.bioimaging.se/swedish_bioimaging_network/Welcome.html Bioimage Sweden], [http://www.bils.se/ BILS], [http://www.uppnex.uu.se/ UPPNEX],[http://wlcg.web.cern.ch/ WLCG], [http://www.nrm.se/ NaturHistoriska RiksMuseet].
 
Swestore is in collaboration with [http://www.ecds.se/ ECDS], [http://snd.gu.se/ SND], [http://www.bioimaging.se/swedish_bioimaging_network/Welcome.html Bioimage Sweden], [http://www.bils.se/ BILS], [http://www.uppnex.uu.se/ UPPNEX],[http://wlcg.web.cern.ch/ WLCG], [http://www.nrm.se/ NaturHistoriska RiksMuseet].
  
= National storage "Swestore"=
+
Swestore is a robust, flexible and expandable short and medium term storage system aimed at storing large amounts of data produced by various Swedish research projects. It is based on the [http://www.dcache.org dCache].
The Swestore Nationally Accessible Storage, commonly called just Swestore, is a robust, flexible and expandable long term storage system aimed at storing large amounts of data produced by various Swedish research projects. It is based on the [http://www.dcache.org dCache] and  [http://www.irods.org iRODS]
 
storage systems.
 
  
Swestore is distributed across the SNIC centres [http://www.c3se.chalmers.se/ C3SE], [http://www.hpc2n.umu.se/ HPC2N], [http://www.lunarc.lu.se/ Lunarc], [http://www.nsc.liu.se/ NSC], [http://www.pdc.kth.se PDC] and [http://www.uppmax.uu.se Uppmax]. Data is stored in two copies with each copy at a different SNIC centre. This enables the system to cope with a multitude of issues ranging from a simple crash of a storage element to losing an entire site while still providing access to the stored data.  
+
Swestore is distributed across the SNIC centres [http://www.c3se.chalmers.se/ C3SE], [http://www.hpc2n.umu.se/ HPC2N], [http://www.lunarc.lu.se/ Lunarc], [http://www.nsc.liu.se/ NSC] and [http://www.uppmax.uu.se Uppmax]. Data is stored in two copies with each copy at a different SNIC centre. This enables the system to cope with a multitude of issues ranging from a simple crash of a storage element to losing an entire site while still providing access to the stored data.  
  
One of the major advantages to the distributed nature of dCache and iRODS ([[Swestore-irods]]) is the excellent aggregated transfer rates possible. This is achieved by bypassing a central node and having transfers going directly to/from the storage elements if the protocol allows it. The Swestore Nationally Accessible Storage system can achieve aggregated transfer rates in excess of 100 Gigabit per second, but in practice this is limited by connectivity to each University (usually 10 Gbit/s) or a limited number of files (typically
+
One of the major advantages to the distributed nature of dCache is the excellent aggregated transfer rates possible. This is achieved by bypassing a central node and having transfers going directly to/from the storage elements if the protocol allows it. Swestore can achieve aggregated transfer rates in excess of 100 Gigabit per second, but in practice this is limited by connectivity to the end user, each university or a limited number of files (typically max 1 Gbit/s per file/connection).
max 1 Gbit/s per file/connection).
 
  
 
To protect against silent data corruption the dCache storage system checksums all stored data and periodically verifies the data using this checksum.
 
To protect against silent data corruption the dCache storage system checksums all stored data and periodically verifies the data using this checksum.
  
The dCache system does NOT yet provide protection against user errors like inadvertent file deletions and so on. The [[Swestore-irods]] system provides this protection. Deleted files are moved to a trashcan.
+
The dCache system does NOT yet provide protection against user errors like inadvertent file deletions and so on.
  
 
== Getting access ==
 
== Getting access ==
  
'''Swestore currently has a lack of resources. New projects will not be allocated until Spring 2016. In case you have any questions, please contact the SNIC office (office@snic.se).
+
'''The Call for Proposals is currently closed, but new Proposals will accepted mid June 2016 with new Projects being approved in mid July.'''
'''
 
 
 
  
 
; Apply for storage
 
; Apply for storage
 
: Please follow the instructions on the [[Apply for storage on SweStore]] page.
 
: Please follow the instructions on the [[Apply for storage on SweStore]] page.
: iRODS is in <span style="color:#FF0000"> pilot phase </span><br>
 
: The iRODS system dosen't have the uptime and performance that our production system have.
 
: We are still working with iRODS to get it into production.
 
 
;Difference between dCache and iRODS user authentication
 
:SweStore's dCache system uses eScience client certificates.
 
:SweStore's iRODS system uses [http://www.yubico.com/products/yubikey-hardware/yubikey/ Yubikey] one-time passwords (OTP). With a simple touch of a button, a 44 character one-time password is generated and sent to the system. The user will be provided with a SweStore yubikey.
 
:Yubikey has a status as pilot now. It can be changed in the future.
 
  
 
; dCache usage - How to acquire an eScience client certificate
 
; dCache usage - How to acquire an eScience client certificate
Line 92: Line 78:
  
 
=== Usage monitoring ===
 
=== Usage monitoring ===
* [http://status.swestore.se/munin/monitor/monitor/ Per Project Monitoring of Swestore usage]
+
* On the project page in SUPR: https://supr.snic.se
 
+
* On the monitoring server for Swestore: http://status.swestore.se/munin/monitor/monitor/ (only accessible from .se-domains.
== iRODS ==
 
 
 
Documentation of the SNIC iRODS system: [[Swestore-irods]].
 

Revision as of 17:01, 17 June 2016


Many experiments produce large amounts of data. In scientific areas such as high energy physics (the Large Hadron Collider at CERN), climate modeling, bioinformatics, bioimaging etc., the demands for storage are increasing dramatically. To serve these and other user communities, SNIC has created a unified storage infrastructure, Swestore.

Swestore is in collaboration with ECDS, SND, Bioimage Sweden, BILS, UPPNEX,WLCG, NaturHistoriska RiksMuseet.

Swestore is a robust, flexible and expandable short and medium term storage system aimed at storing large amounts of data produced by various Swedish research projects. It is based on the dCache.

Swestore is distributed across the SNIC centres C3SE, HPC2N, Lunarc, NSC and Uppmax. Data is stored in two copies with each copy at a different SNIC centre. This enables the system to cope with a multitude of issues ranging from a simple crash of a storage element to losing an entire site while still providing access to the stored data.

One of the major advantages to the distributed nature of dCache is the excellent aggregated transfer rates possible. This is achieved by bypassing a central node and having transfers going directly to/from the storage elements if the protocol allows it. Swestore can achieve aggregated transfer rates in excess of 100 Gigabit per second, but in practice this is limited by connectivity to the end user, each university or a limited number of files (typically max 1 Gbit/s per file/connection).

To protect against silent data corruption the dCache storage system checksums all stored data and periodically verifies the data using this checksum.

The dCache system does NOT yet provide protection against user errors like inadvertent file deletions and so on.

Getting access

The Call for Proposals is currently closed, but new Proposals will accepted mid June 2016 with new Projects being approved in mid July.

Apply for storage
Please follow the instructions on the Apply for storage on SweStore page.
dCache usage - How to acquire an eScience client certificate
Follow the instructions on Requesting a certificate to get your client certificate. This step can be performed while waiting for the storage application to be approved and processed. Of course, if you already have a valid eScience certificate you don't need to acquire another one.
For Terena certificates
If intending to access SweStore from a SNIC resource, please make sure you also export the certificate, transfer it to the intended SNIC resource and prepare it for use with grid tools (not necessarily needed with ARC 3.x, see proxy certificates using Firefox credential store).
For Nordugrid certificates
Please make sure to also install your client certificate in your browser.
Request membership in the SweGrid VO
Follow the instructions on Requesting membership in the SweGrid VO to get added to the SweGrid Virtual Organisation (VO) and request membership to your allocated storage project.
iRODS usage - How to acquire a SweStore yubikey
Please send an email to support@swestore.se and provide the shipping address to where the yubikey should be sent.
Yubikey has a status as pilot now. It can be changed in the future.

Support

If you have any issues using SweStore please do not hesitate to contact support@swestore.se.

dCache

Access protocols

Currently supported protocols
GridFTP - gsiftp://gsiftp.swestore.se/
Storage Resource Manager - srm://srm.swegrid.se/
Hypertext Transfer Protocol (read-only), Web Distributed Authoring and Versioning - http://webdav.swestore.se/ (unauthenticated), https://webdav.swestore.se/
NFS4.1

For authentication eScience certificates are used, which provides a higher level of security than legacy username/password schemes.

Download and upload data

Interactive browsing and manipulation of single files
SweStore is accessible in your web browser in two ways, as a simple and reliable directory index interface at https://webdav.swestore.se/ and with a richer interactive file manager at https://webdav.swestore.se/browser/. Note that the interactive file manager has a lot of features and functions not supported in SweStore, only the basic file transfer features are supported.
To browse private data you need to have your certificate installed in your browser (default with Terena certificates, see above). Projects are organized under the /snic directory as https://webdav.swestore.se/snic/YOUR_PROJECT_NAME/.
Upload and delete data interactively or with automation

There are several tools that are capable of using the protocols provided by SweStore national storage. For interactive usage on SNIC clusters we recommend using the ARC tools which should be installed on all SNIC resources. As an integration point for building scripts and automated systems we suggest using the curl program and library.

Use the ARC client. Please see the instructions for Accessing SweStore national storage with the ARC client. Recommended method when logged in on SNIC resources.
Use lftp. Please see the instructions for Accessing SweStore national storage with lftp.
Use cURL. Please see the instructions for Accessing SweStore national storage with cURL.
Use globus-url-copy. Please see the instructions for Accessing SweStore national storage with globus-url-copy.

Tools and scripts

There exists a number of tools and utilities developed externally that can be useful. Here are some links:

Slides and more

Slides and material from seminar for Lund users on April 18th

Usage monitoring