XRootD
XRootD is high-performance storage service providing robust data access to any kind of file format. Its scalability makes it an excellent solution for data access, even at the peabyte level.
XRootD was originally developed by SLAC/SCCS for the purpose of the BaBar collaboration in order to access ROOT files. Since then, XRootD is part of the ROOT standard distribution.
Note
That said, XRootD also provides performant access to other data formats.
Generic instance
CC-IN2P3 provides a general-purpose XRootD instance from the interactive and computing servers, enabling transparent read access to files stored on the HPSS mass storage system, without the need for client-level RFIO commands. The XRootD servers belonging to this instance act as disk cache, reducing access latency to HPSS.
Access to HPSS data through this XRootD service is preferable to using the native rfio
commands for two reasons:
HPSS access performance is optimized through an internal scheduling mechanism;
The data that is present (cached) on the XRootD servers allows direct access without the need for a request to the HPSS service.
Example of data recovery after a connection on an interactive server:
% module load xrootd
% xrdcp root://ccxroot.in2p3.fr:1999//hpss/in2p3.fr/<path> <my file>
You can delete files stored on the XRootD disk cache by using the script xrdRemoteClean available on the interactive servers at CC-IN2P3:
% /usr/bin/xrdRemoteClean /hpss/in2p3.fr/<path>
This operation is only recommended if you have overwritten files with the same name in HPSS.
It is also possible to query Xrootd to find out if a file exists in the cache or not:
% xrdfs ccxroot.in2p3.fr:1999 stat /hpss/in2p3.fr/<path>
ALICE Instance
The ALICE experiment bases its data management model on the XRootD protocol. It benefits from two native XRootD instances at CC-IN2P3:
a pure disk storage instance.
Storage Element :
ALICE::CCIN2P3::SE
an instance serving as a cache in front of the tape system HPSS.
Storage Element :
ALICE::CCIN2P3::TAPE
These two instances are used as part of the LHC computing grid.