Accumulo 4.x Documentation >> Configuration >> Metrics Documentation (4.x)
Metrics Documentation (4.x)
Edit this pageBelow are the metrics used to monitor various components of Accumulo. Metrics emitted by Accumulo should contain the folowing tags: ‘instance.name’, ‘resource.group’, ‘process.name’, ‘host’ and ‘port’. Metrics emitted by Accumulo may contain additional tags where we think it makes sense to capture per-object metrics, for example on a table or tablet basis in the ScanServer and TabletServer, or on a per-queue basis in the CompactionCoordinator.
Table of Contents
- General Server Metrics
- Compaction Metrics
- Compactor Metrics
- Fate Metrics
- Garbage Collection Metrics
- Tablet Server Metrics
- Scan Metrics
- Scan Server Metrics
- Block Cache Metrics
- Manager Metrics
General Server Metrics
Metrics that are generated across all server types.
accumulo.detected.low.memory
Type: GAUGE
Description: Reports 1 when process memory usage is above the threshold, reports 0 when memory is okay.
accumulo.server.idle
Type: GAUGE
Description: Indicates if the server is idle or not. The value will be 1 when idle and 0 when not idle.
accumulo.thrift.execute
Type: DISTRIBUTION_SUMMARY
Description: Time to execute an RPC request.
accumulo.thrift.idle
Type: DISTRIBUTION_SUMMARY
Description: Time waiting to execute an RPC request.
Compaction Metrics
Metrics specific to compactions, both minor and major. Metrics for major compactions will likely have a ‘queue.id’ tag. The CompactionCoordinator component in the Manager creates a queue for each CompactionService in the configuration. The ‘queue.id’ tag may map directly to the name of a Compactor resource group.
accumulo.compaction.entries.read
Type: FUNCTION_COUNTER
Description: Number of entries read by all compactions that have run on this compactor (majc) or tserver (minc).
accumulo.compaction.entries.written
Type: FUNCTION_COUNTER
Description: Number of entries written by all compactions that have run on this compactor (majc) or tserver (minc).
accumulo.compaction.majc.in_progress
Type: GAUGE
Description: Indicator of whether a compaction is in-progress (value: 1) or not (value: 0). An in-progress compaction could also be stuck.
accumulo.compaction.majc.stuck
Type: LONG_TASK_TIMER
Description: Number and duration of stuck major compactions.
accumulo.compaction.minc.paused
Type: COUNTER
Description: Number of paused minor compactions.
accumulo.compaction.minc.queued
Type: TIMER
Description: Queued minor compactions time queued.
accumulo.compaction.minc.running
Type: TIMER
Description: Minor compactions time active.
accumulo.compaction.minc.stuck
Type: LONG_TASK_TIMER
Description: Number and duration of stuck minor compactions.
accumulo.compaction.queue.count
Type: GAUGE
Description: Number of priority queues for compaction jobs.
accumulo.compaction.queue.jobs.avg.age
Type: GAUGE
Description: Average age of currently queued jobs in seconds.
accumulo.compaction.queue.jobs.dequeued
Type: GAUGE
Description: Count of dequeued jobs.
accumulo.compaction.queue.jobs.exit.time
Type: TIMER
Description: Tracks time a job spent in the queue before exiting the queue.
accumulo.compaction.queue.jobs.max.age
Type: GAUGE
Description: Maximum age of currently queued jobs in seconds.
accumulo.compaction.queue.jobs.min.age
Type: GAUGE
Description: Minimum age of currently queued jobs in seconds.
accumulo.compaction.queue.jobs.priority
Type: GAUGE
Description: Lowest priority queued job.
accumulo.compaction.queue.jobs.queued
Type: GAUGE
Description: Count of queued jobs.
accumulo.compaction.queue.jobs.rejected
Type: GAUGE
Description: Count of rejected jobs.
accumulo.compaction.queue.jobs.size
Type: GAUGE
Description: Size of queued jobs in bytes.
accumulo.compaction.svc.misconfigured
Type: GAUGE
Description: A value of 1 indicates a misconfiguration in the compaction service, while a value of 0 indicates that the configuration is valid.
accumulo.minc.queued
Type: GAUGE
Description: Number of queued minor compactions.
accumulo.minc.running
Type: GAUGE
Description: Number of active minor compactions.
accumulo.minc.total
Type: GAUGE
Description: Total number of minor compactions performed.
Compactor Metrics
Metrics that are generated by the Compactor processes.
accumulo.compaction.majc.paused
Type: COUNTER
Description: Number of paused major compactions.
Fate Metrics
Metrics that are generated by the Fate component in the Manager process.
accumulo.fate.errors
Type: GAUGE
Description: Count of errors that occurred when attempting to gather fate metrics.
accumulo.fate.ops
Type: GAUGE
Description: Number of all the current FATE ops in any state.
accumulo.fate.ops.activity
Type: GAUGE
Description: Count of the total number of times fate operations are added, updated, and removed.
accumulo.fate.ops.in.progress.by.type
Type: GAUGE
Description: Number of FATE operations in progress. The op type is designated by the op.type
tag.
accumulo.fate.tx
Type: GAUGE
Description: The state is now in a tag (e.g., state=new, state=in.progress, state=failed, etc.).
Garbage Collection Metrics
Metrics that are generated by the Garbage Collector process.
accumulo.gc.candidates
Type: GAUGE
Description: Number of files that are candidates for deletion.
accumulo.gc.deleted
Type: GAUGE
Description: Number of candidate files deleted.
accumulo.gc.errors
Type: GAUGE
Description: Number of candidate deletion errors.
accumulo.gc.finished
Type: GAUGE
Description: Timestamp GC file collect cycle finished.
accumulo.gc.in.use
Type: GAUGE
Description: Number of candidate files still in use.
accumulo.gc.post.op.duration
Type: GAUGE
Description: GC metadata table post operation duration in milliseconds.
accumulo.gc.run.cycle
Type: GAUGE
Description: Count of gc cycle runs. Value is reset on process start.
accumulo.gc.started
Type: GAUGE
Description: Timestamp GC file collection cycle started.
accumulo.gc.wal.candidates
Type: GAUGE
Description: Number of files that are candidates for deletion.
accumulo.gc.wal.deleted
Type: GAUGE
Description: Number of candidate wal files deleted.
accumulo.gc.wal.errors
Type: GAUGE
Description: Number candidate wal file deletion errors.
accumulo.gc.wal.finished
Type: GAUGE
Description: Timestamp GC WAL collect cycle finished.
accumulo.gc.wal.in.use
Type: GAUGE
Description: Number of wal file candidates that are still in use.
accumulo.gc.wal.started
Type: GAUGE
Description: Timestamp GC WAL collection cycle started.
Tablet Server Metrics
Metrics that are generated by the TabletServer processes.
accumulo.ingest.bytes
Type: GAUGE
Description: Ingest byte count. The rate can be derived from this metric.
accumulo.ingest.entries
Type: GAUGE
Description: Ingest entry (a key/value) count. The rate can be derived from this metric.
accumulo.ingest.hold
Type: GAUGE
Description: Duration for which commits have been held in milliseconds.
accumulo.tablets.assignments.warning
Type: GAUGE
Description: Number of tablet assignments that are taking longer than the configured warning duration.
accumulo.tablets.files
Type: GAUGE
Description: Number of files per tablet.
accumulo.tablets.ondemand.online
Type: GAUGE
Description: Number of online on-demand tablets
accumulo.tablets.ondemand.unloaded.lowmem
Type: GAUGE
Description: Number of online on-demand tablets unloaded due to low memory
accumulo.tablets.online
Type: GAUGE
Description: Number of online tablets.
accumulo.tablets.opening
Type: GAUGE
Description: Number of opening tablets.
accumulo.tablets.unopened
Type: GAUGE
Description: Number of unopened tablets.
accumulo.tserver.entries
Type: GAUGE
Description: Number of entries assigned to a TabletServer.
accumulo.tserver.entries.mem
Type: GAUGE
Description: Number of entries in memory.
accumulo.updates.check
Type: TIMER
Description: Average time taken for conditional mutation to check conditions.
accumulo.updates.commit
Type: TIMER
Description: Average time taken to commit a mutation.
accumulo.updates.commit.prep
Type: TIMER
Description: Average time taken to prepare to commit a single mutation.
accumulo.updates.error
Type: GAUGE
Description: Count of errors during tablet updates. Type/reason for error is stored in the type
tag (e.g., type=permission, type=unknown.tablet, type=constraint.violation).
accumulo.updates.lock
Type: TIMER
Description: Average time taken for conditional mutation to get a row lock.
accumulo.updates.mutation.arrays.size
Type: DISTRIBUTION_SUMMARY
Description: Batch size of mutations from client.
accumulo.updates.walog.write
Type: TIMER
Description: Time taken to write a batch of mutations to WAL.
Scan Metrics
Metrics specific to scans, which can be executed in the ScanServer or the TabletServer.
accumulo.scan.busy.timeout.count
Type: COUNTER
Description: Count of the scans where a busy timeout happened.
accumulo.scan.close
Type: COUNTER
Description: Number of calls to close a scan or multiscan.
accumulo.scan.continue
Type: COUNTER
Description: Number of calls to continue a scan or multiscan.
accumulo.scan.files.open
Type: GAUGE
Description: Number of files open for scans.
accumulo.scan.paused.for.memory
Type: COUNTER
Description: Count of scans paused due to server being low on memory.
accumulo.scan.queries
Type: GAUGE
Description: Number of queries made during scans.
accumulo.scan.query.results
Type: GAUGE
Description: Query count. The rate can be derived from this metric.
accumulo.scan.query.results.bytes
Type: GAUGE
Description: Query byte count. The rate can be derived from this metric.
accumulo.scan.query.scanned.entries
Type: GAUGE
Description: Count of scanned entries. The rate can be derived from this metric.
accumulo.scan.result
Type: GAUGE
Description: Results per scan.
accumulo.scan.return.early.for.memory
Type: COUNTER
Description: Count of scans that returned results early due to server being low on memory.
accumulo.scan.start
Type: COUNTER
Description: Number of calls to start a scan or multiscan.
accumulo.scan.times
Type: TIMER
Description: Scan session lifetime (creation to close).
accumulo.scan.yields
Type: GAUGE
Description: Counts scans that have yielded.
accumulo.scan.zombie.threads
Type: GAUGE
Description: Number of scan threads that have no associated client session.
Scan Server Metrics
Metrics that are generated by the ScanServer processes.
accumulo.scan.reservation.conflict.count
Type: COUNTER
Description: Count of instances where file reservation attempts for scans encountered conflicts.
accumulo.scan.reservation.total.timer
Type: TIMER
Description: Time to reserve a tablet’s files for scan.
accumulo.scan.reservation.writeout.timer
Type: TIMER
Description: Time to write out a tablets file reservations for scan.
accumulo.scan.tablet.metadata.cache
Type: CACHE
Description: Scan server tablet cache metrics.
Block Cache Metrics
Metrics specific to RFile block cache usage in the ScanServer and TabletServer processes.
accumulo.blockcache.data.evictioncount
Type: FUNCTION_COUNTER
Description: Data block cache eviction count.
accumulo.blockcache.data.hitcount
Type: FUNCTION_COUNTER
Description: Data block cache hit count.
accumulo.blockcache.data.requestcount
Type: FUNCTION_COUNTER
Description: Data block cache request count.
accumulo.blockcache.index.evictioncount
Type: FUNCTION_COUNTER
Description: Index block cache eviction count.
accumulo.blockcache.index.hitcount
Type: FUNCTION_COUNTER
Description: Index block cache hit count.
accumulo.blockcache.index.requestcount
Type: FUNCTION_COUNTER
Description: Index block cache request count.
accumulo.blockcache.summary.evictioncount
Type: FUNCTION_COUNTER
Description: Summary block cache eviction count.
accumulo.blockcache.summary.hitcount
Type: FUNCTION_COUNTER
Description: Summary block cache hit count.
accumulo.blockcache.summary.requestcount
Type: FUNCTION_COUNTER
Description: Summary block cache request count.
Manager Metrics
Metrics that are generated by the Manager process.
accumulo.balancer.migrations.needed
Type: GAUGE
Description: The number of migrations that need to complete before the system is balanced.
accumulo.tabletmgmt.meta.errors
Type: GAUGE
Description: Error count encountered by the TabletGroupWatcher for the META data level.
accumulo.tabletmgmt.root.errors
Type: GAUGE
Description: Error count encountered by the TabletGroupWatcher for the ROOT data level.
accumulo.tabletmgmt.user.errors
Type: GAUGE
Description: Error count encountered by the TabletGroupWatcher for the USER data level.