1. Configuration
  2. Storage Efficiency 
  3. Scrubbing of Public Disks
  4. Dedicated Project Disk Space

1. Configuration

As of  



Quotas per user



DiskSpace

No. of files



Purpose

Name

size

soft/hard

soft/hard

Snapshots?

Scrubbed?Public disks
see note (1)(2)(3)(4)(5)(6)

/home

25T

480/512G

9/10M

yes: 4w 

no

For your basic configuration files, scripts and job files

  • your limit is low but you can recover old stuff up to 4 weeks.

/data/{sao|genomics}

/data/{biology|data_science|fellows|nasm}

80T

5T

1.9/2.0T

0.9/1.0T

4.8/5M

2.0/2.5M

yes: 2w

no

 For important and smaller files like final results, configurations, etc.

  • your limit is medium, you can recover old stuff, so disk space is not released right away.

/pool/{sao|genomics}

/pool/{data_science|fellows|nasm}

/pool/biology

120T

5T

2T

1.9/2.0T

0.9/1.0T

4/5M

2.0/2.5M

no

yes

For storing large files

  • your limit is high, and disk space is released right away.

/scratch/genomics

/scratch/sao (nasm)

/scratch/fellows

/scratch/{biology|data_science}

300T

140T

30T

5T

9/10T

25/26M

no

yes

For storing very large file (more space than in /pool)

  •  your limit is highest,
  • the underlying system is fastest 


  

 Project specific disks (/pool)
/pool/kozakk11T10.5/11.0T26/278MnonoSTRI/Krzysztof Kozak
/pool/nmnh_ggi21T15.0/15.8T37/40MnonoNMHN/GGI
/pool/sao_access21T15.0/15.8T37/40MnonoSAO/ACCESS
/pool/sao_rtdc11T2.8/3.0T2.5/3.0MnonoSAO/RTDC
/pool/sylvain30T29/30T71/75Mnono

SAO/Sylvain Korzennik







Project specific disks (/scratch)
/scratch/bradys

25T

--nonoNMNH/Seán Brady/BRADY_LAB
/scratch/kistlerl100T98/100T254/260MnonoNMNH/Logan Kistler
/scratch/meyerc25T24/25T52/62MnonoNMNH/Christopher Meyer
/scratch/nmnh_corals20T18/20T52/54M
noMNNH/Corals
/scratch/nmnh_ggi77T75/77T195/200MnonoNMNH/GGI
/scratch/nmnh_lab25T4/5T10/12MnonoNMNH/LAB
/scratch/nmnh_mammals25T24/25T66/68M
noNMNH/Mammals
/scratch/nzp_ccg35T34/35T89/92MnonoNZP/Michael Campana/CCG
/scratch/sao_atmos250T245/250T252/261MnonoSAO/ATMOS
/scratch/sao_cga25T7/8T18/20MnonoSAO/CGA
/scratch/sao_tess50T36/40T94/210MnonoSAO/TESS
/scratch/schultzt25T--nonoNMNH/Ted Schultz/SCHULTZ_LAB
/scratch/stri_ap25T4/5T10/12MnonoSTRI/W. Owen McMillan/STRI_AP
/scratch/sylvain70T68/70T176/182MnonoSAO/Sylvain Korzennik
/scratch/usda_sel25T24/25T52/62MnonoNMNH/Christopher Owen/USDA_SEL
/scratch/wrbu40T38/40T99/100M nonoWRBU






Near line (/store)
/store/admin

20T

-n/ayes: 8wnoSys Admin
/store/public175T5/5Tn/ayes: 8wnoPublic, available upon request
/store/bradys

40T

-n/ayes: 8wnoNMNH/Sean Brady/BRADY_LAB
/store/nmnh_ggi90T-n/ayes: 8wnoNMNH/GGI
/store/sao_atmos300TB-n/ayes: 8wnoSAO/ATMOS
/store/sylvain100TB-n/ayes: 8wnoSAO/Sylvain Korzennik
/store/schultzt40TB-n/ayes: 8wnoNMNH/Ted Schultz/SCHULTZ_LAB
/store/wrbu40TB-n/ayes: 8wnoWRBU






Extra

/data/admin

/pool/admin

/scratch/admin

/store/admin

2T

10T

20T

20T



nonosys admin
/scratch/dbs10T

nonodata bases
/scratch/tmp100T

nonoscratch space for jobs that need a big temp, via -l bigtem=XX

Notes

  1. The notation /x/{y|z} stands for /x/y or /x/z
  2. Sizes are indicative, since we adjust them as needed
  3. the notation 1.8/2.0TB means that the soft limit is 1.8TB and the hard limit is 2.0TB of disk space
  4. the notation 4/5M means that the soft limit is 4 million inodes and the hard limit is 5 millions
  5. Snapshots allow to restore deleted files, up to that many weeks
  6. Public space under /pool  and /scratch is scrubbed weekly - files older than 180 days are removed.

2. Storage Efficiency

  • It is inefficient to store a slew of small files and if you do you may reach your inodes quota before your space quota (too many small files).
    •  Some of the disk monitoring tools show the inode usage.
    • If your %(inode)>%(space) your disk usage is inefficient,
      consider archiving your files into zip or tar-compressed sets.

  • While some of the tool(s) you use may force you to be inefficient while jobs are running, you should remember to
    • remove useless files when jobs have completed,
    • compress files that can benefit from compression (with gzip, bzip2 or compress), and
    • archive a slew of files into a zip or a tar-compressed set, as follows:
         % zip archive.zip dir/
      or
         % tar -czf archive.tgz dir/
      both examples archive the content of the directory dir/ into a single zip or a tgz file. You can then delete the content of dir/ with
         % rm -rf dir/
  • You can unpack each type of archive with
       % unzip archive.zip
    or
       % tar xf archive.tgz

3. Scrubbing of Public Disks

  • We have implement a FIFO (first in first out) scrubbing model, where old files are deleted, aka scrubbed, to keep free space on the public disks.
  • In any case, we ask you to remove from /pool and /scratch files that you do not need for active jobs.

4. Dedicated Project Disk Space

  • For projects that would benefit from dedicated disk space, such space can be secured with project's specific funds when we expand the disk farm (contact us).

Last Updated SGK/PBF.

  • No labels