# Metrics

Status: Accepted

Authors:

* [Almog Gavra](https://github.com/agavra)

## Summary

Replace SlateDB's `StatRegistry` with a recorder-based metrics system that supports labels,
histograms, and user-pluggable backends (Prometheus, OTLP, etc.). The new system uses a
`MetricsRecorder` trait that users can implement to bridge metrics to their observability
stack. When no recorder is configured, a zero-cost `NoopMetricsRecorder` is used by default.

## Motivation

The current system (`StatRegistry` with `Counter`/`Gauge`, flat `"prefix/suffix"` names)
has several limitations:

1. **No labels/dimensions.** Cache hits and misses are tracked as 8 separate counters (`dbcache/filter_hit`,
   `dbcache/filter_miss`, `dbcache/index_hit`, ...) instead of one metric with `{entry_kind, result}`
   labels. This makes it hard to aggregate in Prometheus/Grafana.

2. **No histograms.** Latency-sensitive operations (object store requests, DB gets) have no
   distribution tracking.

3. **Hard to integrate.** Users must poll `StatRegistry`, match on string names, and manually
   bridge to their observability stack. There's no hook to receive events as they happen.

## Goals

- Support multi-dimensional metrics (labels) to reduce metric proliferation
- Add histogram support for latency/size distributions
- Provide a pluggable `MetricsRecorder` trait for Prometheus/OTLP/custom backends
- Default to a zero-cost no-op recorder when no user recorder is configured
- No new external dependencies

## Non-Goals

- Providing built-in Prometheus/OTLP recorder implementations (users bring their own)

## Design

### Core traits

```rust
// slatedb/src/metrics.rs

/// User-implemented trait to bridge SlateDB metrics to their observability system.
/// Passed via DbBuilder. If not provided, a NoopMetricsRecorder is used.
pub trait MetricsRecorder: Send + Sync {
    fn register_counter(&self, name: &str, description: &str, labels: &[(&str, &str)]) -> Arc<dyn CounterFn>;
    fn register_gauge(&self, name: &str, description: &str, labels: &[(&str, &str)]) -> Arc<dyn GaugeFn>;
    fn register_up_down_counter(&self, name: &str, description: &str, labels: &[(&str, &str)]) -> Arc<dyn UpDownCounterFn>;
    fn register_histogram(&self, name: &str, description: &str, labels: &[(&str, &str)], boundaries: &[f64]) -> Arc<dyn HistogramFn>;
}

pub trait CounterFn: Send + Sync {
    fn increment(&self, value: u64);
}

pub trait GaugeFn: Send + Sync {
    fn set(&self, value: i64);
}

pub trait UpDownCounterFn: Send + Sync {
    fn increment(&self, value: i64);
}

pub trait HistogramFn: Send + Sync {
    fn record(&self, value: f64);
}

/// Controls which metrics are active. Metrics with a level below the configured
/// threshold are replaced with zero-cost no-op handles at registration time.
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
pub enum MetricLevel {
    /// High-frequency or high-cardinality metrics on the hot path.
    /// Only active when the configured level is `Debug`.
    Debug,
    /// Standard operational metrics. Always active at the default level.
    Info,
}
```

**Key decisions:**
- **Labels are `&[(&str, &str)]`** fixed at registration time. Label identity is an unordered set of unique key/value pairs, not the input slice order. Duplicate keys are invalid. SlateDB canonicalizes labels during registration (for example by sorting by key then value) so each unique `(name, labels)` combo maps to exactly one handle.
- **Separate gauge and up/down counter.** Following OpenTelemetry semantics, `GaugeFn` supports only absolute `set` operations (e.g. current memory usage, queue depth snapshots), while `UpDownCounterFn` supports additive `increment` with negative values (e.g. active connection count, in-flight requests). This maps cleanly to OTel's instrument model and avoids forcing backend implementors to guess which semantic the caller intended.
- **Counters and gauges stay integer; histograms use floating point.** `CounterFn` uses `u64`, `UpDownCounterFn` uses `i64`, and `GaugeFn` uses `i64`, while `HistogramFn` uses `f64`. This preserves natural counter/gauge semantics while simplifying the trait surface vs. the current generic `Gauge` (which has separate impls for `i64`, `u64`, `i32`, `bool`).
- **Histogram boundaries are always explicit.** Both the `MetricsRecorder` trait and the internal builder require boundaries upfront — there are no implicit defaults. `register_histogram` accepts `boundaries: &[f64]` specifying the upper-exclusive bucket boundaries. SlateDB provides standard boundary constants (`LATENCY_BOUNDARIES`, `SIZE_BOUNDARIES`) that internal callers select from. This follows the Prometheus philosophy that bucket boundaries are always a deliberate choice.
- **Metric levels for hot-path control.** Each metric is registered with a `MetricLevel` (`Info` or `Debug`). The configured level threshold (set via `Settings::metric_level`, default `Info`) determines which metrics are active. Metrics at or above the threshold are registered normally; metrics below it get static no-op handles, so the hot path performs no allocation and no recording work, only a call into an empty method through the handle. This is decided once at registration time, not checked per-operation. The `MetricsRecorder` trait is not aware of levels; the builder short-circuits before calling the trait for filtered-out metrics. Because `metric_level` lives in `Settings`, it is serializable and supports env variable overrides.
- **No external dependency.** SlateDB owns all trait definitions.
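
The label-identity rule above can be sketched as follows. This is a minimal illustration, not the actual SlateDB implementation: `canonicalize_labels` and `metric_key` are hypothetical helper names, and sorting by key alone is an assumption that is sufficient here because duplicate keys are rejected.

```rust
use std::collections::BTreeMap;

/// Hypothetical helper: returns the canonical form of a label set
/// (unique keys, sorted by key), or an error on a duplicate key.
pub fn canonicalize_labels(labels: &[(&str, &str)]) -> Result<Vec<(String, String)>, String> {
    let mut sorted = BTreeMap::new();
    for (key, value) in labels {
        // Duplicate keys are invalid, even when the values are identical.
        if sorted.insert(key.to_string(), value.to_string()).is_some() {
            return Err(format!("duplicate label key: {key}"));
        }
    }
    // BTreeMap iterates in key order, so the input slice order is erased.
    Ok(sorted.into_iter().collect())
}

/// Hypothetical registry key: registrations with the same name and the same
/// label set (in any order) yield equal keys and can share one handle.
pub fn metric_key(
    name: &str,
    labels: &[(&str, &str)],
) -> Result<(String, Vec<(String, String)>), String> {
    Ok((name.to_string(), canonicalize_labels(labels)?))
}
```

With this shape, `[("entry_kind", "filter"), ("result", "hit")]` and `[("result", "hit"), ("entry_kind", "filter")]` produce the same key, while `[("op", "get"), ("op", "scan")]` is rejected.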

### Internal architecture

```
DbBuilder {
    metrics_recorder: Option<Arc<dyn MetricsRecorder>>,
}
Settings {
    metric_level: MetricLevel,  // default: Info; supports env override
}
         |
         v
  +------------------------+
  | MetricsRecorderHelper  |  (wraps recorder + level filtering)
  +------------------------+
              |
              v
     User's recorder              OR    NoopMetricsRecorder
     (Prometheus, OTLP, etc.)           (default, zero-cost)
```

When `DbBuilder` opens a database, it creates a `MetricsRecorderHelper` wrapping either the
user-provided `MetricsRecorder` or a `NoopMetricsRecorder` (the default). The helper is passed
to all internal components for metric registration.

SlateDB does not install a `DefaultMetricsRecorder` internally, and there is no `db.metrics()`
snapshot API. Users who want to read metric values must provide their own recorder, such as a
Prometheus/OTLP recorder in production.

For convenience and testing, `slatedb-common` provides an atomic-backed `DefaultMetricsRecorder`.
It is opt-in: users can pass it as their recorder if they want snapshot-based access to metric
values.
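
To make "atomic-backed" and "snapshot-based access" concrete, here is a rough, counters-only sketch of such a recorder. It is not the `DefaultMetricsRecorder` from `slatedb-common`: `SnapshotRecorder`, `series_key`, and the `name{k=v}` key format are all assumptions made for illustration, and the `CounterFn` trait is repeated only so the block is self-contained.

```rust
use std::collections::HashMap;
use std::sync::atomic::{AtomicU64, Ordering};
use std::sync::{Arc, Mutex};

// Mirrors the CounterFn trait from the design above.
pub trait CounterFn: Send + Sync {
    fn increment(&self, value: u64);
}

struct AtomicCounter(AtomicU64);

impl CounterFn for AtomicCounter {
    fn increment(&self, value: u64) {
        // Hot path: a single relaxed atomic add, no locking.
        self.0.fetch_add(value, Ordering::Relaxed);
    }
}

/// Hypothetical counters-only recorder that supports snapshots.
#[derive(Default)]
pub struct SnapshotRecorder {
    counters: Mutex<HashMap<String, Arc<AtomicCounter>>>,
}

/// Renders `name{k1=v1,k2=v2}` with labels sorted, so label order is irrelevant.
fn series_key(name: &str, labels: &[(&str, &str)]) -> String {
    let mut sorted = labels.to_vec();
    sorted.sort();
    let rendered: Vec<String> = sorted.iter().map(|(k, v)| format!("{k}={v}")).collect();
    format!("{name}{{{}}}", rendered.join(","))
}

impl SnapshotRecorder {
    /// Registration takes the lock once; the returned handle is lock-free.
    pub fn register_counter(&self, name: &str, labels: &[(&str, &str)]) -> Arc<dyn CounterFn> {
        let mut counters = self.counters.lock().unwrap();
        let counter = counters
            .entry(series_key(name, labels))
            .or_insert_with(|| Arc::new(AtomicCounter(AtomicU64::new(0))))
            .clone();
        counter
    }

    /// Point-in-time copy of every counter value, keyed by series.
    pub fn snapshot(&self) -> HashMap<String, u64> {
        let counters = self.counters.lock().unwrap();
        let values = counters
            .iter()
            .map(|(key, counter)| (key.clone(), counter.0.load(Ordering::Relaxed)))
            .collect();
        values
    }
}
```

A test would register a counter, drive some operations, and then assert on `snapshot()`; registering the same name and labels twice returns handles that share one underlying value.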

### Migration

The naming convention is dot-separated, `slatedb.<subsystem>.<metric_name>`, to conform with
typical OpenTelemetry standards (Prometheus users can easily convert when they register).
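
As an illustration of the conversion a Prometheus recorder might apply at registration time, the sketch below rewrites a dot-separated name into the classic Prometheus character set (`[a-zA-Z0-9_:]`). `to_prometheus_name` is a hypothetical helper, not something SlateDB is stated to provide.

```rust
/// Hypothetical helper: replaces every character that is not an ASCII letter,
/// digit, underscore, or colon with an underscore.
pub fn to_prometheus_name(name: &str) -> String {
    name.chars()
        .map(|c| {
            if c.is_ascii_alphanumeric() || c == '_' || c == ':' {
                c
            } else {
                '_'
            }
        })
        .collect()
}
```

For example, `slatedb.db.request_count` becomes `slatedb_db_request_count`.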

Here is an AI-assisted listing of the current metrics and proposed mapped metrics:

| Current name | New name | Labels |
|-------------|----------|--------|
| `db/get_requests` | `slatedb.db.request_count` | `{op=get}` |
| `db/scan_requests` | `slatedb.db.request_count` | `{op=scan}` |
| `db/flush_requests` | `slatedb.db.request_count` | `{op=flush}` |
| `db/write_ops` | `slatedb.db.write_ops` | |
| `db/write_batch_count` | `slatedb.db.write_batch_count` | |
| `db/backpressure_count` | `slatedb.db.backpressure_count` | |
| `db/immutable_memtable_flushes` | `slatedb.db.immutable_memtable_flushes` | |
| `db/wal_buffer_flushes` | `slatedb.db.wal_buffer_flushes` | |
| `db/wal_buffer_estimated_bytes` | `slatedb.db.wal_buffer_estimated_bytes` | |
| `db/total_mem_size_bytes` | `slatedb.db.total_mem_size_bytes` | |
| `db/l0_sst_count` | `slatedb.db.l0_sst_count` | |
| `db/sst_filter_false_positives` | `slatedb.db.sst_filter_false_positive_count` | |
| `db/sst_filter_positives` | `slatedb.db.sst_filter_positive_count` | |
| `db/sst_filter_negatives` | `slatedb.db.sst_filter_negative_count` | |
| `dbcache/filter_hit` | `slatedb.db_cache.access_count` | `{entry_kind=filter, result=hit}` |
| `dbcache/filter_miss` | `slatedb.db_cache.access_count` | `{entry_kind=filter, result=miss}` |
| `dbcache/index_hit` | `slatedb.db_cache.access_count` | `{entry_kind=index, result=hit}` |
| `dbcache/index_miss` | `slatedb.db_cache.access_count` | `{entry_kind=index, result=miss}` |
| `dbcache/data_block_hit` | `slatedb.db_cache.access_count` | `{entry_kind=data_block, result=hit}` |
| `dbcache/data_block_miss` | `slatedb.db_cache.access_count` | `{entry_kind=data_block, result=miss}` |
| `dbcache/stats_hit` | `slatedb.db_cache.access_count` | `{entry_kind=stats, result=hit}` |
| `dbcache/stats_miss` | `slatedb.db_cache.access_count` | `{entry_kind=stats, result=miss}` |
| `dbcache/get_error` | `slatedb.db_cache.error_count` | |
| `gc/manifest_count` | `slatedb.gc.deleted_count` | `{resource=manifest}` |
| `gc/wal_count` | `slatedb.gc.deleted_count` | `{resource=wal}` |
| `gc/compacted_count` | `slatedb.gc.deleted_count` | `{resource=compacted}` |
| `gc/compactions_count` | `slatedb.gc.deleted_count` | `{resource=compactions}` |
| `compactor/bytes_compacted` | `slatedb.compactor.bytes_compacted` | |
| `compactor/last_compaction_timestamp_sec` | `slatedb.compactor.last_compaction_timestamp_sec` | |
| `compactor/running_compactions` | `slatedb.compactor.running_compactions` | |
| `compactor/total_bytes_being_compacted` | `slatedb.compactor.total_bytes_being_compacted` | |
| `compactor/total_throughput_bytes_per_sec` | `slatedb.compactor.total_throughput_bytes_per_sec` | |
| `oscache/part_hits` | `slatedb.object_store_cache.part_hit_count` | |
| `oscache/part_access` | `slatedb.object_store_cache.part_access_count` | |
| `oscache/cache_keys` | `slatedb.object_store_cache.cache_keys` | |
| `oscache/cache_bytes` | `slatedb.object_store_cache.cache_bytes` | |
| `oscache/evicted_keys` | `slatedb.object_store_cache.evicted_keys` | |
| `oscache/evicted_bytes` | `slatedb.object_store_cache.evicted_bytes` | |

**Standard label vocabulary:**
- `op`: `get`, `put`, `delete`, `scan`, `flush`
- `entry_kind`: `filter`, `index`, `data_block`, `stats`
- `result`: `hit`, `miss`
- `resource`: `manifest`, `wal`, `compacted`, `compactions`

### Registration ergonomics

Internal code does not call `MetricsRecorder` trait methods directly. Instead, a thin
builder layer resolves defaults so that the trait always receives fully-specified parameters:

```rust
/// Internal helper that wraps a MetricsRecorder and the configured MetricLevel.
/// Provides a builder API that resolves defaults and handles level filtering.
impl MetricsRecorderHelper {
    pub fn new(recorder: Arc<dyn MetricsRecorder>, level: MetricLevel) -> Self;
    pub fn counter(&self, name: &str) -> CounterBuilder;
    pub fn gauge(&self, name: &str) -> GaugeBuilder;
    pub fn up_down_counter(&self, name: &str) -> UpDownCounterBuilder;
    pub fn histogram(&self, name: &str, boundaries: &[f64]) -> HistogramBuilder;
}

impl HistogramBuilder {
    pub fn description(self, desc: &str) -> Self;
    pub fn labels(self, labels: &[(&str, &str)]) -> Self;
    pub fn level(self, level: MetricLevel) -> Self;        // optional; defaults to Info
    pub fn register(self) -> Arc<dyn HistogramFn> {
        // If this metric's level is below the configured threshold, return a
        // static no-op handle (no allocation, no recording work on the hot path,
        // no call to the user's MetricsRecorder).
        // Otherwise, call recorder.register_histogram(...)
    }
}
// CounterBuilder, GaugeBuilder, UpDownCounterBuilder follow the same pattern
// (without boundaries).
```
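
The level check performed inside `register` could look roughly like the sketch below. It relies on the derived `Ord` of `MetricLevel` (`Debug < Info`, following declaration order). The names `is_active`, `noop_counter`, and `register_counter_at` are hypothetical, and `make_real` stands in for the call to the user's `MetricsRecorder`.

```rust
use std::sync::{Arc, OnceLock};

#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
pub enum MetricLevel {
    Debug,
    Info,
}

pub trait CounterFn: Send + Sync {
    fn increment(&self, value: u64);
}

struct NoopCounter;

impl CounterFn for NoopCounter {
    fn increment(&self, _value: u64) {}
}

/// A metric is active when its level is at or above the configured threshold.
pub fn is_active(metric: MetricLevel, threshold: MetricLevel) -> bool {
    metric >= threshold
}

/// Shared no-op handle: allocated once, then only cloned (a refcount bump).
fn noop_counter() -> Arc<dyn CounterFn> {
    static NOOP: OnceLock<Arc<dyn CounterFn>> = OnceLock::new();
    NOOP.get_or_init(|| Arc::new(NoopCounter) as Arc<dyn CounterFn>)
        .clone()
}

/// Hypothetical stand-in for `CounterBuilder::register`. `make_real` plays the
/// role of `recorder.register_counter(...)` and runs only for active metrics.
pub fn register_counter_at(
    metric: MetricLevel,
    threshold: MetricLevel,
    make_real: impl FnOnce() -> Arc<dyn CounterFn>,
) -> Arc<dyn CounterFn> {
    if is_active(metric, threshold) {
        make_real()
    } else {
        noop_counter()
    }
}
```

Because the decision happens once at registration, callers hold the same handle type either way and never branch on the level afterwards.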

Standard boundary constants (callers pick the appropriate set):

```rust
pub const LATENCY_BOUNDARIES: &[f64] = &[0.001, 0.005, 0.01, 0.025, 0.05, 0.1, 0.25, 0.5, 1.0, 2.5, 5.0, 10.0];
pub const SIZE_BOUNDARIES: &[f64] = &[128.0, 256.0, 512.0, 1024.0, 4096.0, 16384.0, 65536.0, 262144.0, 1048576.0, 4194304.0];
```
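
For readers implementing a recorder, the following sketch shows one way a recorded value could be assigned to a bucket. It assumes sorted boundaries and the upper-exclusive convention described earlier, plus a final overflow bucket; `bucket_index` and `bucket_counts` are hypothetical names.

```rust
/// Returns the bucket index for `value`, assuming `boundaries` is sorted
/// ascending. Bucket `i` covers `[boundaries[i - 1], boundaries[i])`, and
/// index `boundaries.len()` is the overflow bucket.
pub fn bucket_index(boundaries: &[f64], value: f64) -> usize {
    // Counts how many boundaries are <= value, i.e. finds the first boundary
    // strictly greater than the value.
    boundaries.partition_point(|upper| *upper <= value)
}

/// Tallies values into `boundaries.len() + 1` buckets.
pub fn bucket_counts(boundaries: &[f64], values: &[f64]) -> Vec<u64> {
    let mut counts = vec![0u64; boundaries.len() + 1];
    for &value in values {
        counts[bucket_index(boundaries, value)] += 1;
    }
    counts
}
```

A value equal to a boundary lands in the bucket above it under this convention. Backends such as Prometheus treat bucket bounds as upper-inclusive (`le`), so a recorder bridging to them may need to account for that difference.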

The builder is an internal convenience — it is not part of the public API.

Internal registration looks like:

```rust
// In DbStats::new(recorder: &MetricsRecorderHelper)
let get_requests = recorder.counter("slatedb.db.request_count")
    .description("Number of DB requests")
    .labels(&[("op", "get")])
    .register();

let request_latency = recorder.histogram("slatedb.db.request_duration_seconds", LATENCY_BOUNDARIES)
    .description("DB request latency")
    .labels(&[("op", "get")])
    .register();

// Debug-level metric: no-op unless Settings::metric_level is MetricLevel::Debug
let sst_filter_check_latency = recorder.histogram("slatedb.db.sst_filter_check_duration_seconds", LATENCY_BOUNDARIES)
    .description("Per-SST bloom filter check latency")
    .level(MetricLevel::Debug)
    .register();

// Hot path (unchanged pattern from today — no branching, no level checks)
self.db_stats.get_requests.increment(1);
self.db_stats.request_latency.record(elapsed.as_secs_f64());
self.db_stats.sst_filter_check_latency.record(elapsed.as_secs_f64()); // no-op if Debug not enabled
```

Each stats struct changes its field types:

```rust
// Before:
pub(crate) struct DbStats {
    pub(crate) get_requests: Arc<Counter>,
    pub(crate) wal_buffer_estimated_bytes: Arc<Gauge<i64>>,
}

// After:
pub(crate) struct DbStats {
    pub(crate) get_requests: Arc<dyn CounterFn>,
    pub(crate) wal_buffer_estimated_bytes: Arc<dyn UpDownCounterFn>,
}
```

### User integration examples

#### Prometheus

```rust
use prometheus_client::metrics::counter::Counter as PromCounter;
use prometheus_client::metrics::family::Family;
use prometheus_client::metrics::gauge::Gauge as PromGauge;
use prometheus_client::metrics::histogram::Histogram as PromHistogram;
use prometheus_client::registry::Registry;

// Sketch only: the recorder should keep one Family per metric definition and
// materialize the labeled child metric for each (name, labels) registration.
struct PrometheusRecorder {
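    registry: Mutex<Registry>,
    counters: Mutex<HashMap<String, Family<PromLabels, PromCounter>>>,
    gauges: Mutex<HashMap<String, Family<PromLabels, PromGauge>>>,
    histograms: Mutex<HashMap<String, Family<PromLabels, PromHistogram>>>,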
}

impl MetricsRecorder for PrometheusRecorder {
    fn register_counter(&self, name: &str, desc: &str, labels: &[(&str, &str)]) -> Arc<dyn CounterFn> {
        let labels = PromLabels::from_slice(labels);
        let family = self.lookup_or_register_counter_family(name, desc);
        let counter = family.get_or_create(&labels).clone();
        Arc::new(PrometheusCounter(counter))
    }

    fn register_gauge(&self, name: &str, desc: &str, labels: &[(&str, &str)]) -> Arc<dyn GaugeFn> {
        let labels = PromLabels::from_slice(labels);
        let family = self.lookup_or_register_gauge_family(name, desc);
        let gauge = family.get_or_create(&labels).clone();
        Arc::new(PrometheusGauge(gauge))
    }

    fn register_up_down_counter(&self, name: &str, desc: &str, labels: &[(&str, &str)]) -> Arc<dyn UpDownCounterFn> {
        // Prometheus has no native up/down counter; map to a gauge.
        let labels = PromLabels::from_slice(labels);
        let family = self.lookup_or_register_gauge_family(name, desc);
        let gauge = family.get_or_create(&labels).clone();
        Arc::new(PrometheusUpDownCounter(gauge))
    }

    fn register_histogram(&self, name: &str, desc: &str, labels: &[(&str, &str)], boundaries: &[f64]) -> Arc<dyn HistogramFn> {
        let labels = PromLabels::from_slice(labels);
        let boundaries = boundaries.to_vec();
        let family = self.lookup_or_register_histogram_family(
            name,
            desc,
            move || PromHistogram::new(boundaries.iter().copied()),
        );
        let histogram = family.get_or_create(&labels).clone();
        Arc::new(PrometheusHistogram(histogram))
    }
}

let db = Db::builder("my_db", object_store)
    .with_metrics_recorder(Arc::new(PrometheusRecorder::new()))
    // metric_level is set via Settings (supports env override), not on DbBuilder
    .open()
    .await?;
```

Because SlateDB passes labels at registration time, a Prometheus recorder should treat each
registration as one labeled time series within a metric family, not as a completely separate
top-level metric. `prometheus-client`'s `Family` is the natural fit here.

#### OpenTelemetry

```rust
use opentelemetry::metrics::MeterProvider;
use opentelemetry_sdk::metrics::SdkMeterProvider;

struct OtelRecorder { meter: opentelemetry::metrics::Meter }

impl OtelRecorder {
    fn new(provider: &SdkMeterProvider) -> Self {
        Self { meter: provider.meter("slatedb") }
    }
}

impl MetricsRecorder for OtelRecorder {
    fn register_counter(&self, name: &str, desc: &str, labels: &[(&str, &str)]) -> Arc<dyn CounterFn> {
        let attrs: Vec<opentelemetry::KeyValue> = labels.iter()
            .map(|(k, v)| opentelemetry::KeyValue::new(k.to_string(), v.to_string()))
            .collect();
        let counter = self.meter.u64_counter(name).with_description(desc).build();
        Arc::new(OtelCounter { counter, attrs })
    }

    fn register_gauge(&self, name: &str, desc: &str, labels: &[(&str, &str)]) -> Arc<dyn GaugeFn> {
        let attrs: Vec<opentelemetry::KeyValue> = labels.iter()
            .map(|(k, v)| opentelemetry::KeyValue::new(k.to_string(), v.to_string()))
            .collect();
        let gauge = self.meter.f64_gauge(name).with_description(desc).build();
        Arc::new(OtelGauge::new(gauge, attrs))
    }

    fn register_up_down_counter(&self, name: &str, desc: &str, labels: &[(&str, &str)]) -> Arc<dyn UpDownCounterFn> {
        let attrs: Vec<opentelemetry::KeyValue> = labels.iter()
            .map(|(k, v)| opentelemetry::KeyValue::new(k.to_string(), v.to_string()))
            .collect();
        let counter = self.meter.i64_up_down_counter(name).with_description(desc).build();
        Arc::new(OtelUpDownCounter { counter, attrs })
    }

    fn register_histogram(&self, name: &str, desc: &str, labels: &[(&str, &str)], boundaries: &[f64]) -> Arc<dyn HistogramFn> {
        let attrs: Vec<opentelemetry::KeyValue> = labels.iter()
            .map(|(k, v)| opentelemetry::KeyValue::new(k.to_string(), v.to_string()))
            .collect();
        let histogram = self.meter.f64_histogram(name)
            .with_description(desc)
            .with_boundaries(boundaries.to_vec())
            .build();
        Arc::new(OtelHistogram { histogram, attrs })
    }
}

// OTel exporters (OTLP, stdout, Prometheus bridge) are configured via SdkMeterProvider.
// See: https://docs.rs/opentelemetry/latest/opentelemetry/metrics/index.html
let provider = SdkMeterProvider::builder()
    .with_periodic_exporter(opentelemetry_otlp::MetricExporter::builder().build()?)
    .build();

let db = Db::builder("my_db", object_store)
    .with_metrics_recorder(Arc::new(OtelRecorder::new(&provider)))
    .open()
    .await?;
```

Because SlateDB separates `GaugeFn` (absolute `set`) from `UpDownCounterFn` (additive
`increment`), the OTel recorder maps directly to the corresponding OTel instruments
(`f64_gauge` and `i64_up_down_counter`) with no adapter logic needed. Prometheus recorders
can map both to Prometheus gauges since Prometheus doesn't distinguish the two.

### UniFFI integration

The UniFFI bindings will expose the `MetricsRecorder` trait via callback interfaces, allowing
Go, Java, and Python users to plug in their own recorders just like Rust users. This requires
additional design work around callback handle lifetimes, callback error handling, and the
hot-path cost of forwarding every metric operation across FFI, and will be handled as
separate follow-up work.

## Impact Analysis

### Core API & Query Semantics

- [ ] Basic KV API (`get`/`put`/`delete`)
- [ ] Range queries, iterators, seek semantics
- [ ] Range deletions
- [ ] Error model, API errors

### Consistency, Isolation, and Multi-Versioning

- [ ] Transactions
- [ ] Snapshots
- [ ] Sequence numbers

### Time, Retention, and Derived State

- [ ] Time to live (TTL)
- [ ] Compaction filters
- [ ] Merge operator
- [ ] Change Data Capture (CDC)

### Metadata, Coordination, and Lifecycles

- [ ] Manifest format
- [ ] Checkpoints
- [ ] Clones
- [x] Garbage collection
- [ ] Database splitting and merging
- [ ] Multi-writer

### Compaction

- [x] Compaction state persistence
- [ ] Compaction filters
- [ ] Compaction strategies
- [ ] Distributed compaction
- [ ] Compactions format

### Storage Engine Internals

- [ ] Write-ahead log (WAL)
- [x] Block cache
- [x] Object store cache
- [x] Indexing (bloom filters, metadata)
- [ ] SST format or block format

### Ecosystem & Operations

- [ ] CLI tools
- [x] Language bindings (Go/Python/etc)
- [x] Observability (metrics/logging/tracing)

## Operations

### Performance & Cost

- **Hot path:** One virtual dispatch per metric operation (through the recorder handle). When no user recorder is configured, `NoopMetricsRecorder` returns static no-op handles — zero allocation, zero work on the hot path. Debug-level metrics that are filtered out also use static no-op handles.
- **Registration:** May take a lock depending on the user's recorder implementation, but only during startup. No contention on the hot path since handles are cached.
- **Memory:** Minimal when using `NoopMetricsRecorder` (only no-op handles such as `Arc<dyn CounterFn>`). Slightly more per metric when a user recorder is present.
- No impact on object-store request patterns, space/read/write amplification.

### Observability

- **Configuration changes:** New `DbBuilder::with_metrics_recorder(Arc<dyn MetricsRecorder>)` method (runtime component, not serializable). New `Settings::metric_level: MetricLevel` field (default `Info`, serializable, supports env variable override).
- **New components:** `metrics.rs` module in `slatedb-common` (~300 lines). `stats.rs` and `db_metrics.rs` are removed.
- **Metrics:** All 39 existing metrics preserved with new names/labels (see naming table above). Histogram support added as a new metric type. Default is no-op (no metrics collected unless a recorder is configured).
- **Logging:** No changes.

### Compatibility

- **On-disk/object storage formats:** No impact. Metrics are purely in-memory.
- **Public API:** Breaking change:
  - Removed: `db.metrics()` method entirely
  - Removed: `StatRegistry`, `ReadableStat`, `Counter`, `Gauge`, `stat_name!` macro
  - Removed: Public stat name constants (`db_stats::GET_REQUESTS`, etc.)
  - New: `MetricsRecorder`, `CounterFn`, `GaugeFn`, `UpDownCounterFn`, `HistogramFn`, `MetricLevel`, `NoopMetricsRecorder`
  - New: `DefaultMetricsRecorder` in `slatedb-common` (opt-in, not used by default)
- **Bindings:** The `MetricsRecorder` trait will be exposed via UniFFI callback interfaces as follow-up work, enabling Go/Java/Python users to plug in their own recorders.

## Testing

- **Unit tests:** `metrics.rs` tests for default recorder (counter/gauge/up-down-counter/histogram tracking, histogram bucket counts), composite recorder (forwards to both), edge cases (empty histogram). Builder tests for metric level filtering (Debug metrics produce no-ops at Info level, active at Debug level).
- **Integration tests:** Register a `DefaultMetricsRecorder`, perform DB operations, verify handles receive correct calls with expected names and labels via `snapshot()`.
- **Port existing tests:** All existing tests that read metrics (`compactor::test_compactor_compressed_block_size`, db cache hit/miss tests, `sst_iter` bloom filter tests) updated to use `DefaultMetricsRecorder` + `snapshot()` instead of `StatRegistry`.
- Fault-injection/chaos tests: N/A
- Deterministic simulation tests: N/A
- Formal methods verification: N/A
- Performance tests: N/A

## Rollout

This is a direct, one-time breaking change that will ship in a 0.X.X release.

## Alternatives

### 1. Adopt the `metrics` crate facade

The Rust [`metrics`](https://crates.io/crates/metrics) crate provides a global recorder pattern
similar to what we're proposing. We rejected this because:
- It adds an external dependency that may conflict with users' own `metrics` usage.
- Global state doesn't work well with multiple `Db` instances in the same process.
- UniFFI bindings need to own the trait definitions.

### 2. Keep `StatRegistry` and add labels as a map field

We could extend `StatRegistry` to store `HashMap<String, String>` labels per metric. This was
rejected because:
- It doesn't solve the pluggable-backend problem (users still poll).
- The `ReadableStat` trait returning `i64` can't represent histograms or `f64` gauges.
- The `Gauge` generic makes FFI difficult.

### 3. Status quo

Do nothing. Users continue to poll `StatRegistry` and manually bridge metrics. Rejected because
this doesn't meet the goal of first-class Prometheus/OTLP support, and the label/histogram gaps
make operational use painful.

### 4. Avoid dynamic dispatch on metric handles

The proposed `Arc<dyn CounterFn>` fields mean every hot-path `increment` call goes through a
vtable. Two alternatives were considered to avoid this:

- **Enum dispatch.** Replace `Arc<dyn CounterFn>` with a concrete enum:
  ```rust
  enum CounterHandle {
      Default(Arc<DefaultCounter>), // concrete atomic-backed counter; type name is illustrative
      Composite { default: Arc<DefaultCounter>, user: Arc<dyn CounterFn> },
  }
  ```
  This avoids one vtable indirection for the no-user-recorder case (the common path), but
  still needs `dyn CounterFn` for the user handle since its type is unknown. It also leaks the
  composite abstraction into every stats struct.

- **Generic stats structs.** Make `DbStats` generic over the recorder.
  This eliminates all dynamic dispatch but requires monomorphization through 20+ files and
  makes the `Db` type itself generic, which is a non-starter for the public API and FFI.

Rejected because a vtable call costs ~2-5ns on modern CPUs, which is negligible on paths that
do I/O. If profiling shows it matters, we can switch to enum dispatch for the composite handles
without changing any external API.

## Open Questions

1. ~~**Histogram default boundaries:** For the default recorder, we track only count/sum/min/max. Should we also store a lightweight fixed-bucket histogram for richer `db.metrics()` output?~~ **Resolved:** The default recorder now maintains bucket counts using boundaries always provided at registration time, and the `Histogram` variant in `MetricValue` includes `boundaries` and `bucket_counts`.

1. ~~**Standard boundaries per metric:** What default boundary sets should SlateDB define for its internal histograms?~~ **Resolved:** Boundaries are always explicit — required as a parameter to `recorder.histogram(name, boundaries)`. SlateDB provides `LATENCY_BOUNDARIES` and `SIZE_BOUNDARIES` constants that internal callers select from. Exact values may be tuned during implementation.

## References

- Current stats module: `slatedb/src/stats.rs`
- Rust `metrics` crate (prior art): https://crates.io/crates/metrics
- Prometheus client_rust: https://crates.io/crates/prometheus-client
- OpenTelemetry Rust SDK: https://crates.io/crates/opentelemetry

## Updates

(none yet)