Deduplicated File System

Overview

The Deduplicated File System section shows information about your UCAR garbage collection system.

Deduplicated File System section

For clients using deduplication, the UCAR system runs a garbage collection process every day to find and purge any files that are no longer referenced.

Some ways that data can become the non-referenced garbage are when clients are deleted without their jobs being purged or when old jobs were not removed completely. It is recommended to run garbage collection after deleting jobs to insure the data is cleared completely. This is similar to the jobs with the non-referenced data. (See Unreferenced Data.)

The garbage collection will be deferred for up to 12 hours before it terminates the process. If the process times out, it will be retried at its next regular time. There is one exception. In the event the system is running low on space, the garbage collection will proceed if there are jobs deduplicating or not.

Unique Content-Addressable Repository

Unique Content-Addressable Repository

NameDescription
Garbage CollectionStart the garbage collection
Garbage Collection Time of DaySet the time of day when the garbage collection will run automatically
Compact OnlineStart Online DDFS Compact manually
Verify UCARVerify the UCAR integrity. This will systematically read all the files in UCAR, and verify if their computed signature matches the recorded one. If not, the file will be quarantined. The process is extremely I/O intensive and can take weeks to run to completion on systems with large amount of the stored data. Use only when told by the Infrascale Support.
Request Missing BlocksRequest and upload the replicated data blocks missing on the appliance

This group shows the following real-time processing data:

DataDescription
Total UCAR Bytes 
Processed Files 
Processed Bytes 
Duplicate Files 
Duplicate Bytes 
Quarantined Bytes 
Quarantined Files 

Garbage Collection History

Garbage Collection History

This group shows the following data:

ColumnDescription
DateDate of processing
Files RemovedNumber of files removed
Bytes RemovedNumber of bytes removed
Total FilesTotal number of unique files
Unreferenced FilesNumber of files found in UCAR, but not in the database
DB ErrorsNumber of the database errors encountered while processing
FS ErrorsNumber of the file system errors encountered while processing
Missing FilesNumber of files referenced by backups, but not present in UCAR
Elapsed TimeTotal processing time

Block Deduplication

Block Deduplication

Block Deduplication Statistics

Block Deduplication Statistics

This group shows the following data:

DataDescription
Blocks WrittenTotal number of full blocks that have been written into DDFS since it was configured initially
Block SizeThe size of the blocks files are divided into during the deduplication process. This option is not configurable.
Total BlocksThe total number of blocks that have been written to DDFS since it was configured initially. It includes both full and partial blocks.
Total BytesThe number of partial blocks that have been written to DDFS since it was configured initially
Partial BlocksThe total of the size of all the partial blocks that have been written to DDFS. Partial blocks happen at the end of a file that does not evenly divide into blocks. For example, a 96 kB file will be divided into 64 kB full block, and 32 kB partial block.
Partial BytesThe number of times a block already existed in the block store and did not need to be written again, thus saving space
Duplicate BlocksThe number of bytes that did not have to get written to the RAID because we already had a copy of a block
Duplicate BytesThe sum of the size of all of the blocks marked as free in the block store
Free BlocksA counter of times blocks have been read back from the DDFS
Free BytesThe sum of the size of all of the blocks marked as free in the block store
Blocks ReadA counter of times blocks have been read back from the DDFS
Allocated BytesThe size of the block stores. Includes both the used and the free blocks.

Shredder Statistics

Shredder Statistics

This group shows the following data:

DataDescription
ETA 
Signature per sec 
Queue length (files) 
Queue length (signatures) 
The current file being processed 
Amount of files, blocked by the appliance (busy) 

Block Address Map Statistics

Block Address Map Statistics

This group shows the following data:

DataDescription
Size 
Modified 
Incore Locked 
Incore 
Signature per sec 

App Statistics

App Statistics

This group shows the following data:

DataDescription
Heap Used 
Heap Free 
Heap Maximum 

Previous

Boot Verification

Next

Downloads