Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Disk Footprint & Indices

A fully-indexed satd node (-txindex=1 -addressindex=1 -blockfilterindex=basic) uses more disk for its indices than a bitcoind + electrs/Fulcrum + esplora stack uses summed together. This is by design, and this chapter explains where the bytes go and what you get for them.

This is the one place the manual's "shared index / no duplicate index" language can be misread. To be unambiguous:

"Shared" means a single source of truth: one RocksDB, updated atomically with the chain, that every API surface reads. satd indexes more relationships, in more directions, to serve more protocols from one consistent store, so the aggregate on-disk index is larger. You trade disk for consistency and single-process operation.

If you only need a validating node, none of this applies: a consensus-only satd (-txindex=0 -addressindex=0, filters off) has a chainstate comparable to Core's and carries none of the index column families below.

Where the bytes go

satd keeps everything in one RocksDB with multiple column families (CFs). The indices are append-mostly: rows are added as blocks connect and removed only on disconnect (reorg), so there is no tombstone debt accumulating over time. The figures below are representative of a fully-indexed mainnet node in mid-2026; your numbers will track the chain's growth.

Column familyRoleKeyed byRow sizeApprox. on disk
addr_funding_v2every output paying a scriptscripthash[16] ‖ height ‖ txid ‖ vout64 B~200 GB
tx_indextxid → containing blocktxid[32]64 B~140 GB
addr_spending_v2every input spending a scriptscripthash[16] ‖ height ‖ txid ‖ vin92 B~140 GB
outpoint_spendUTXO → the input that spent itprev_txid[32] ‖ vout76 B~100 GB
block_filter / _headerBIP 158 compact filterstype ‖ height~30 KB / 37 B~30 GB
coinsthe live UTXO settxid[32] ‖ vout~28 B varint~tens of MB
undoper-block disconnect datablock_hash[32]~28 B / inputsmall (rolling)

The three address/txid indices plus outpoint_spend are the bulk. The UTXO set itself (coins) is tiny — it lives mostly in the in-memory coin cache and serializes to a few tens of MB on disk.

Note on mid-reindex sizes. During a -reindex/-reindex-chainstate, RocksDB's write-ahead compaction falls behind the write firehose, so tx_index in particular can read substantially larger than its settled size (uncompacted L0 SSTs + bloom filters + index blocks). Measure the per-CF footprint after the node has idled and background compaction has drained — see Compaction below.

Why it is larger than bitcoind + electrs + esplora

Three structural reasons, all deliberate:

1. satd stores the spend graph in both directions

Every spend writes two rows:

  • addr_spending_v2 — keyed by script (scripthash ‖ height ‖ …), answering "show me everything address A spent."
  • outpoint_spend — keyed by outpoint (prev_txid ‖ vout), answering "what input spent this specific UTXO" in a single keyed read.

electrs/Fulcrum keep essentially one spend representation and derive the other direction on demand. satd pays the disk to keep both materialized so both queries are O(1). This is the single biggest source of the overage — and, ironically, the thing the "no duplicate index" tagline understates: the duplication is internal and intentional.

2. satd indexes a superset of what any one external tool does

The often-quoted "30–180 GB" figure is electrs/Fulcrum's address index alone. satd's address index alone (addr_funding + addr_spending) already exceeds that range — and satd additionally carries a Core-style tx_index, an outpoint_spend reverse index, and BIP 158 filters in the same database, because one binary serves Electrum and Esplora and getrawtransaction and compact-filter clients. You are not comparing satd's index to electrs's index; you are comparing it to electrs plus Core's txindex plus a spend index plus a filter index, fused into one store.

3. satd trades pointer compactness for self-containment

tx_index stores the full 32-byte block hash as its value, where Core's txindex stores a ~12-byte on-disk position (CDiskTxPos). That is ~20 extra bytes per transaction (~24 GB across the chain) and one extra indirection on read — in exchange, the index is independent of block-file layout and survives block-file re-packing. satd's keys are also fixed-width binary tuned for prefix seeks rather than byte-minimal, which costs a little space and buys fast range scans.

What satd already does to keep it down

The schema is near the information-theoretic floor for what it indexes:

  • 16-byte scripthash prefix, not 32. Address rows key on the first half of sha256(scriptPubKey), halving the dominant field of every address row. Collisions are vanishingly unlikely and are resolved against the full script on read.
  • Varint-packed UTXOs. The coins CF uses a compact varint encoding (~28 B typical vs ~43 B for a naive struct).
  • Fixed-width keys, no delimiters. Heights are big-endian so range scans return in chain order with no secondary sort.

So the size is row_count × ~70 B, and row_count is "every output and every spend in Bitcoin's history" — genuine data, not per-row bloat or dead weight.

What the disk buys you

Propertysatd (shared store)bitcoind + electrs/Fulcrum
Index vs. tip consistencyAlways atomic — index update is in the same WriteBatch as the blockIndex lags the node; reorg-window races are possible
Build costIndex built inside connect_block validationSecond process re-scans every block to build a parallel DB
Lookup pathO(1) keyed read, in-process function callCross-process RPC + the indexer's own lookup
Spend-by-outpointO(1) (outpoint_spend)Often derived / scanned
Operational surfaceOne process, one config, one backup, one reindexTwo+ daemons to wire, monitor, and keep in lockstep
TLS / authNative on every surfaceUsually a separate reverse proxy
DiskLarger in aggregateSmaller per-tool, but you run several

The headline is consistency and operational simplicity, bought with disk: a read on any surface — Electrum, Esplora, JSON-RPC — can never observe an index out of sync with the chain tip, because there is no second copy to fall behind. You scale read throughput by running more nodes, not more index processes (see API Scaling & Runtimes).

Choosing what to index

The indices are opt-in per surface. Match the disk to what you actually serve:

You want…FlagsHeavy CFs pulled in
Validating node only(defaults; indices off)none
getrawtransaction <txid> anywhere-txindex=1tx_index
Electrum / Esplora address history-addressindex=1 (implies -txindex=1 for Electrum)addr_funding_v2, addr_spending_v2, outpoint_spend, tx_index
BIP 157/158 light-client service-blockfilterindex=basic -peerblockfilters=1block_filter, block_filter_header

Turning a surface off means its CF is never written, and the disk is never spent.

Compaction

RocksDB background compaction runs continuously and is not disabled by satd's bulk-load reindex mode (only the WAL is). When reindex writes stop, the background jobs drain the L0 backlog on their own — no manual step is required. satd additionally force-compacts only the coins CF on a timer (compaction_interval_secs, default 30 min, L0-triggered); there is no satd-level forced full compaction of the large index CFs — they rely on RocksDB auto-compaction.

Because the index CFs are append-mostly (little to no deletion outside reorgs), expect compaction to reclaim the reindex-era L0/overlap debt — a moderate drop — not a collapse. Most of the footprint is genuine index data. satd logs a per-CF compaction diagnostic (pending-compaction-bytes) every compaction_diag_interval_secs (default 60 s); let those settle toward zero before taking a "true" size measurement.