Community File System (CFS)¶
The Community File System (CFS) is a global file system available on all NERSC computational systems. It allows sharing of data between users, systems, and the "outside world".
Every MPP repository has an associated Community directory and unix group. Community directories are created in
/global/cfs/cdirs. All members of the project have access through their membership in the unix group. There is an environment variable
$CFS (which expands to
/global/cfs/cdirs/) that can be used to access your CFS directory:
nersc$ cd $CFS/<your_project_name>
Multiple Directories Per Project¶
Occasionally there are cases where the single directory per project model is too limiting. For example, large projects with multiple working groups may wish to have separate Community directories with separate quotas for each working group. In these cases, a PI or PI Proxy for a repository may request an additional Community directory with a specific name via the Iris Storage tab. Because of the way quotas are managed, these directories can only be "top" level directories. For instance, you can create
/global/cfs/cdirs/new_directory_name with a separately managed quota, but you can not create
/global/cfs/cdirs/existing_directory_name/new_directory_name with a separately managed quota. If you wish to present your users with a single directory path to work with, you can create links to these other directories inside your main directory:
nersc$ ls -l /global/cfs/cdirs/existing_directory_name drwxrws--- 3 elvis nstaff 4.0K Feb 18 21:19 random_directory lrwxrwxrwx 1 elvis nstaff 7 Feb 18 21:20 new_directory_name -> /global/cfs/cdirs/new_directory_name
All of a repository's Community directories share one quota. The PI or PI Proxy can adjust the relative quota amounts in the Iris Storage tab.
Quotas on the Community File System are determined by DOE Program Managers based on information PIs supply in their yearly ERCAP requests. If you need a mid-year quota increase on the Community File System, please use the Disk Quota Increase Form and we will pass the information along to the appropriate DOE Program Manager for approval.
See quotas for detailed information about inode, space quotas, and file system purge policies.
The system has a peak aggregate bandwidth of at least 100 GB/sec bandwidth for streaming I/O. While user applications that depend on high-bandwidth for streaming large files can use the Community File System, it is recommended to use Cori scratch or the Burst Buffer instead.
All NERSC users should backup important files on a regular basis. Ultimately, it is the user's responsibility to prevent data loss. However, NERSC provides some mechanisms in protecting against data loss.
Community directories use a snapshot capability to provide users a seven-day history of their contents. Every directory and sub-directory in a Community directory contains a ".snapshots" entry.
.snapshotsis invisible to
findand similar commands
- Contents are visible through
ls -F .snapshots
- Can be browsed normally after
- Files cannot be created, deleted or edited in snapshots
- Files can only be copied out of a snapshot
Community directories will remain in existence as long as the owning project is active. Projects typically "end" at the end of a NERSC Allocation Year. This happens when the PI chooses not to renew the project, or DOE chooses not to provide an allocation for a renewal request. In either case, the following steps will occur following the termination of the project:
-365 days - The start of the new Allocation Year and no Project renewal
The data in the Community directory will remain available on the Community File System until the start of the next Allocation Year.
+0 days - The start of the following Allocation Year
PIs notified that the affected Community directory will be archived, and then removed from the file system in 90 days.
The Community directory will become read-only.
The full pathname to the Community directory will be modified. Automated scripts will likely fail.
User access to the directory will be terminated. The directory will then be archived in HPSS, under ownership of the PI, and subsequently removed from the file system. The PI can request this data by opening a ticket.