[Clas12_software] Fw: Nov 19: Transition of the Farm /cache filesystem to read-only from farm and ifarm
Nathan Baltzell
baltzell at jlab.org
Mon Oct 28 16:07:12 EDT 2024
FYI, looks like it may really happen next month:
________________________________________
From: Jlab-scicomp-briefs <jlab-scicomp-briefs-bounces at jlab.org> on behalf of Wesley Moore via Jlab-scicomp-briefs <jlab-scicomp-briefs at jlab.org>
Sent: Monday, October 28, 2024 4:02 PM
To: Laura Hild via Jlab-scicomp-briefs
Subject: [Jlab-scicomp-briefs] Nov 19: Transition of the Farm /cache filesystem to read-only from farm and ifarm
Transition of the Farm /cache filesystem to read-only from farm and ifarm
What is changing?
On November 19, 2024, during monthly maintenance, the /cache filesystem will be changed to read-only access from farm, ifarm, Globus, and XRootD Data Transfer Nodes. After that date, files will only be copied to /cache via in the following ways:
*
The jcache command, which reads files from tape and writes them into /cache
*
The jput command, which writes files to tape and can optionally place a copy immediately in /cache when the -cache flag is specified
*
Data ingest from the experimental halls using the jmirror command with a regular expression pattern match for data retention in cache.
Why is this happening?
*
The new system will ensure that files move to tape promptly and that /cache is an accurate subset of files stored on tape.
*
In the current /cache filesystem, there are a commonly cases where files are in conflict with tape storage, leading to work slow downs.
*
Small file handling has been a historic problem, and many small files in /cache were not stored on tape or backed up in any way
How will this affect farm job workflows?
*
Jobs that are part of a SWIF workflow with an output specification to /cache will continue to work.
*
Jobs that attempt to write directly to cache using open(), cp, mv, or other POSIX tools will fail. Output from slurm jobs that are not part a SWIF workflow should be stored on /volatile and will need to be moved to tape manually using jput on ifarm. Generally, slurm workflows that need to interact with tape would be better implemented using SWIF.
*
Note that jput is not available on (non-interactive) farm nodes because it may queue, stalling the farm node and potentially timing out the job.
What is not changing?
*
Cache deletion policy: remains unchanged.
*
Cache file pinning: continues to work as before.
*
jcache client: continues to work as before.
* SWIF outputs to /cache: continues to work as before.
References:
*
KBA: Migration to read-only cache<https://jlab.servicenowservices.com/scicomp?id=kb_article_view&sysparm_article=KB0015468>
*
KBA: Computing Coordinators<https://jlab.servicenowservices.com/scicomp?id=kb_article_view&sysparm_article=KB0014686>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: ATT00001.txt
URL: <https://mailman.jlab.org/pipermail/clas12_software/attachments/20241028/f4b90f9e/attachment.txt>
More information about the Clas12_software
mailing list