did-btcr2-js

Aggregation

Multi-party coordination for aggregated BTCR2 updates, implementing the Aggregate Beacon protocol.

This guide is a step-by-step walkthrough. If you’ve never used the aggregation subsystem before, read it top to bottom. If you already know what you’re doing, jump to the Service Step-by-Step or Participant Step-by-Step sections.

What Aggregation Solves
Spec Roles and Class Mapping
Two-Layer Architecture
Setting Up the Transport
Service Step-by-Step
Participant Step-by-Step
Decision Callbacks Reference
Events Reference
Error Handling
Power-User: State Machine Layer
Running the E2E Demos
Production Deployment Notes
Extension Points

What Aggregation Solves

Without aggregation, every BTCR2 DID update has to be broadcast individually as its own Bitcoin transaction via a Singleton Beacon. That doesn’t scale: every DID controller pays Bitcoin fees for every update, and the chain fills up with one-shot transactions.

Aggregation lets a coordinator (the Aggregation Service) batch updates from many DID controllers (the Aggregation Participants) into a single Bitcoin transaction. All participants jointly sign that transaction using MuSig2 (an n-of-n Schnorr multi-signature scheme), so the on-chain footprint is a single signature regardless of how many participants joined.

The result: every participant’s DID update is committed on-chain, but the cost and bandwidth are amortized across the cohort.

Spec Roles and Class Mapping

The spec defines three roles:

Spec Role	Description	Runner Class	State Machine Class
Aggregation Service	Coordinator that runs the protocol and broadcasts the final tx	`AggregationServiceRunner`	`AggregationService`
Aggregation Participant	DID controller submitting an update	`AggregationParticipantRunner`	`AggregationParticipant`
Aggregation Cohort	The group of accepted participants	(data)	`AggregationCohort`

Two beacon types are supported:

CASBeacon: Aggregated updates form a CAS Announcement Map (DID to updateHash). The service publishes the map to a Content-Addressed Store (e.g., IPFS) and commits its hash on-chain.
SMTBeacon: Aggregated updates form a Sparse Merkle Tree. The SMT root is committed on-chain; participants get individual Merkle proofs.
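
To make the CASBeacon idea concrete, here is a minimal sketch of a DID-to-updateHash map with a deterministic serialization. The type names, the helper functions, and the sorted-pairs encoding are assumptions made for illustration; the library’s real canonicalization and hashing are not shown in this guide.

```typescript
// Hypothetical shapes for illustration; the library's real types may differ.
type Did = string;
type UpdateHashHex = string;

/** Build a DID -> updateHash map from (did, updateHash) pairs, rejecting duplicates. */
function buildAnnouncementMap(
  entries: ReadonlyArray<[Did, UpdateHashHex]>,
): Map<Did, UpdateHashHex> {
  const map = new Map<Did, UpdateHashHex>();
  for (const [did, hash] of entries) {
    if (map.has(did)) throw new Error(`duplicate update for ${did}`);
    map.set(did, hash);
  }
  return map;
}

/** Serialize as DID-sorted pairs so every party derives identical bytes. */
function canonicalizeMap(map: ReadonlyMap<Did, UpdateHashHex>): string {
  const sorted = Array.from(map.entries()).sort(([a], [b]) => (a < b ? -1 : a > b ? 1 : 0));
  return JSON.stringify(sorted);
}

/** A participant's check: is my update hash recorded under my DID? */
function includesUpdate(
  map: ReadonlyMap<Did, UpdateHashHex>,
  did: Did,
  hash: UpdateHashHex,
): boolean {
  return map.get(did) === hash;
}
```

Hashing the serialized string (for example with SHA-256) would give the value that is committed on-chain. Because the serialization is order-independent, the service and every participant arrive at the same hash.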

Two-Layer Architecture

The aggregation subsystem is built in two layers, both public:

┌────────────────────────────────────────────────────┐
│  Runner layer (default API)                        │
│  - AggregationServiceRunner                        │
│  - AggregationParticipantRunner                    │
│  - Wires Transport to state machine                │
│  - Decision callbacks + progress events            │
│  - Use this 95% of the time                        │
└────────────────────────────────────────────────────┘
                         │
                         ▼
┌────────────────────────────────────────────────────┐
│  State machine layer (advanced)                    │
│  - AggregationService (sans-I/O)                   │
│  - AggregationParticipant (sans-I/O)               │
│  - Pure logic, explicit action methods             │
│  - Use for tests, custom transports, debugging     │
└────────────────────────────────────────────────────┘
                         │
                         ▼
┌────────────────────────────────────────────────────┐
│  Transport layer (pluggable)                       │
│  - Transport interface                             │
│  - NostrTransport              (production)        │
│  - HttpClientTransport + HttpServerTransport       │
│                                (production)        │
│  - DidCommTransport            (stub)              │
│  - MockTransport               (in tests)          │
└────────────────────────────────────────────────────┘

Pick the Runner unless you have a reason not to. It’s the default API. The state machine layer exists so the protocol logic stays testable in isolation and so you can build your own integrations on top.
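
The split between the two upper layers can be illustrated with a toy example. Everything below (ToyServiceMachine, ToyRunner, the Msg shape) is hypothetical and far simpler than the real classes; it only shows the pattern of a sans-I/O machine that returns messages and a runner that is the only piece touching I/O.

```typescript
// All names here are hypothetical; this is a toy, not the library's classes.
interface Msg { type: string; from: string; to?: string }

/** Sans-I/O: receive() updates state and returns messages to send. No sockets, no timers. */
class ToyServiceMachine {
  private readonly did: string;
  private readonly minParticipants: number;
  private readonly accepted = new Set<string>();

  constructor(did: string, minParticipants: number) {
    this.did = did;
    this.minParticipants = minParticipants;
  }

  receive(msg: Msg): Msg[] {
    // Ignore anything that is not a first-time opt-in.
    if (msg.type !== 'COHORT_OPT_IN' || this.accepted.has(msg.from)) return [];
    this.accepted.add(msg.from);
    const out: Msg[] = [{ type: 'COHORT_OPT_IN_ACCEPT', from: this.did, to: msg.from }];
    if (this.accepted.size === this.minParticipants) {
      // Cohort is full: tell every accepted participant it is ready.
      for (const to of this.accepted) out.push({ type: 'COHORT_READY', from: this.did, to });
    }
    return out;
  }
}

/** Runner: forwards whatever the machine returns to an injected send function. */
class ToyRunner {
  private readonly machine: ToyServiceMachine;
  private readonly send: (m: Msg) => void;

  constructor(machine: ToyServiceMachine, send: (m: Msg) => void) {
    this.machine = machine;
    this.send = send;
  }

  onIncoming(msg: Msg): void {
    for (const out of this.machine.receive(msg)) this.send(out);
  }
}
```

Because the machine never performs I/O, a test can feed it messages and assert on the returned array without a relay or HTTP server.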

Setting Up the Transport

Every actor (service or participant) needs a Transport to send and receive messages. Two production transports ship, both implementing the same Transport interface:

NostrTransport: relay-based. Censorship-resistant via multi-relay redundancy. NIP-44 envelope encryption for directed messages. Requires at least one running relay.
HttpClientTransport + HttpServerTransport: HTTP/REST. Operator-hosted. Browser-compatible clients via fetch + fetch-based SSE. Framework-agnostic server (caller mounts handleRequest / handleSse in Hono / Express / Fastify / Workers / Bun). TLS-only confidentiality; signed-envelope authenticity per request.

Runners and state machines are transport-agnostic: you pick a transport at construction time and the rest of the system doesn’t care. Pick NostrTransport when you want censorship resistance and multi-relay redundancy; pick the HTTP pair when you want operational familiarity, browser-native clients, or REST-tooling interop.

This section walks through the Nostr setup. For the HTTP walkthrough (wire protocol, Hono snippet, CORS and rate-limit configuration), see http-transport.md.

A single Transport instance can serve multiple registered actors. In production each process typically registers exactly one actor (its own identity); in tests one transport often serves several actors at once for easy in-process round-trips.

import { DidBtcr2, NostrTransport, SILENT_LOGGER } from '@did-btcr2/method';
import { SchnorrKeyPair } from '@did-btcr2/keypair';

// 1. Generate identity
const keys = SchnorrKeyPair.generate();
const did = DidBtcr2.create(keys.publicKey.compressed, {
  idType  : 'KEY',
  network : 'mutinynet',
});

// 2. Create transport pointing at one or more Nostr relays
const transport = new NostrTransport({
  relays              : ['wss://relay.damus.io', 'wss://nos.lol'],
  // Optional: pass SILENT_LOGGER to suppress transport diagnostics, or a
  // custom Logger to route to pino/winston/etc. Defaults to CONSOLE_LOGGER.
  logger              : SILENT_LOGGER,
  // Optional: `since` filter window (ms) on the COHORT_ADVERT subscription.
  // Some relays don't backfill historical events to late subscribers; this
  // gives them a bounded window of recent events to replay. Default 5 min.
  broadcastLookbackMs : 5 * 60 * 1000,
});

// 3. Register the actor (DID + keys) with the transport
transport.registerActor(did, keys);

// 4. Open relay connections
transport.start();

// 5. When done, cleanly tear down (releases relay subscriptions + handlers)
transport.unregisterActor(did);

That’s the entire transport setup. From here on, you only interact with the Runner: the transport is plumbing.

`NostrTransportConfig` options

Option	Default	Purpose
`relays`	`DEFAULT_NOSTR_RELAYS` (4 public relays)	Array of relay URLs to connect to
`logger`	`CONSOLE_LOGGER`	Injectable `Logger` for transport diagnostics
`broadcastLookbackMs`	5 min	`since` filter on broadcast (COHORT_ADVERT) subscription. Set to `0` to disable.

Choosing between Nostr and HTTP

Both transports carry identical protocol semantics. Differences that matter at deployment time:

Dimension	Nostr	HTTP
Wire infrastructure	One or more Nostr relays	One HTTP server (operator-hosted)
Censorship resistance	High (multi-relay redundancy)	Low-to-medium (single operator)
Confidentiality against the wire	Strong (NIP-44)	TLS only (operator sees plaintext)
Browser participants	Requires a Nostr library	Native (`fetch` + streaming)
Ops / debugging surface	Relay internals	curl, access logs, OpenAPI
Test harness	Needs a relay	Plain HTTP mock or in-process parity test

Factory invocation for each:

// Each call below is independent; distinct names avoid redeclaring `transport`.

// Nostr
const nostrTransport = TransportFactory.establish({
  type   : 'nostr',
  relays : ['wss://relay.damus.io', 'wss://nos.lol'],
});

// HTTP client (participant side)
const httpClientTransport = TransportFactory.establish({
  type    : 'http',
  role    : 'client',
  baseUrl : 'https://aggregator.example.com/',
});

// HTTP server (operator side): mount handleRequest + handleSse in a web framework
const httpServerTransport = TransportFactory.establish({
  type : 'http',
  role : 'server',
});

`Transport` lifecycle methods

Method	Purpose
`registerActor(did, keys)`	Register an actor identity
`unregisterActor(did)`	Detach an actor: close its relay subscriptions, drop handlers + keys
`registerMessageHandler(did, type, handler)`	Install a per-(actor, type) handler
`unregisterMessageHandler(did, type)`	Remove a handler. Also called automatically by `runner.stop()`
`sendMessage(msg, sender, recipient?)`	Publish once
`publishRepeating(msg, sender, intervalMs, recipient?)`	Publish once immediately, then on an interval; returns an idempotent stop handle
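
For intuition about these lifecycle methods, the following is a minimal in-memory sketch with the same method names. It is not the library’s MockTransport: the ToyMessage and Handler types are assumptions, keys and encryption are omitted, and delivery is synchronous.

```typescript
// Hypothetical, simplified types: the real Transport interface and message shapes may differ.
interface ToyMessage { type: string; body?: unknown }
type Handler = (msg: ToyMessage, sender: string) => void;

class InMemoryTransport {
  // actor DID -> (message type -> handler)
  private readonly actors = new Map<string, Map<string, Handler>>();

  registerActor(did: string): void {
    if (!this.actors.has(did)) this.actors.set(did, new Map());
  }

  unregisterActor(did: string): void {
    this.actors.delete(did); // drops the actor together with all of its handlers
  }

  registerMessageHandler(did: string, type: string, handler: Handler): void {
    const handlers = this.actors.get(did);
    if (!handlers) throw new Error(`unknown actor ${did}`);
    handlers.set(type, handler);
  }

  unregisterMessageHandler(did: string, type: string): void {
    this.actors.get(did)?.delete(type);
  }

  /** Directed when `recipient` is given; otherwise delivered to every other actor. */
  sendMessage(msg: ToyMessage, sender: string, recipient?: string): void {
    const targets = recipient
      ? [recipient]
      : Array.from(this.actors.keys()).filter((did) => did !== sender);
    for (const did of targets) this.actors.get(did)?.get(msg.type)?.(msg, sender);
  }

  /** Publish now, then every `intervalMs`; the returned stop function is safe to call twice. */
  publishRepeating(msg: ToyMessage, sender: string, intervalMs: number, recipient?: string): () => void {
    this.sendMessage(msg, sender, recipient);
    let timer: ReturnType<typeof setInterval> | undefined =
      setInterval(() => this.sendMessage(msg, sender, recipient), intervalMs);
    return () => {
      if (timer !== undefined) {
        clearInterval(timer);
        timer = undefined;
      }
    };
  }
}
```

The stop function clears its timer reference after the first call, which is what makes a second call a no-op.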

Service Step-by-Step

The service is the coordinator. It creates a cohort, advertises it, accepts participants, finalizes keygen, distributes aggregated data, and drives MuSig2 signing.

Step 1: Construct the runner

import { AggregationServiceRunner } from '@did-btcr2/method';

const runner = new AggregationServiceRunner({
  transport,                     // from previous section
  did       : serviceDid,
  keys      : serviceKeys,

  config: {
    minParticipants : 2,
    network         : 'mutinynet',
    beaconType      : 'CASBeacon',
  },

  // REQUIRED: provide the Bitcoin transaction to be MuSig2-signed
  onProvideTxData: async ({ cohortId, beaconAddress, signalBytes }) => {
    // In production: query Bitcoin for a UTXO at beaconAddress, build a tx
    // that spends it with an OP_RETURN containing signalBytes.
    return await buildBeaconTransaction(beaconAddress, signalBytes, bitcoin);
  },

  // Optional hardening. maxUpdateSizeBytes and advertRepeatIntervalMs have
  // defaults; cohortTtlMs and phaseTimeoutMs are disabled unless set.
  maxUpdateSizeBytes     : 256 * 1024,   // reject oversized updates (default 256 KiB)
  cohortTtlMs            : 10 * 60_000,  // overall deadline, emits cohort-failed on expiry
  phaseTimeoutMs         : 2 * 60_000,   // per-phase stall deadline
  advertRepeatIntervalMs : 60_000,       // re-publish COHORT_ADVERT until keygen complete
});

The config object follows the spec’s “Aggregation Cohort definition”: at minimum you need a minimum participant count, a network, and a beacon type. Policies the config doesn’t carry (max participants, time windows, fees) can be enforced in your onOptInReceived and onReadyToFinalize callbacks.

`AggregationServiceRunnerOptions` (optional hardening)

Option	Default	Purpose
`maxUpdateSizeBytes`	256 KiB	Reject `SUBMIT_UPDATE` payloads whose canonicalized size exceeds this cap. Dropped submissions are surfaced via `message-rejected`.
`cohortTtlMs`	unset (disabled)	Overall deadline for `run()`. On expiry: emit `cohort-failed`, reject `run()`, GC cohort state.
`phaseTimeoutMs`	unset (disabled)	Per-phase stall deadline. Reset automatically on each observed phase transition.
`advertRepeatIntervalMs`	60 s	Re-publish `COHORT_ADVERT` on this cadence until keygen completes, fails, or the runner stops. `0` disables re-publish (single publish).

Step 2: Subscribe to events (optional)

Events are useful for logging, UI updates, or driving side-effects:

runner.on('cohort-advertised', ({ cohortId }) => console.log('advertised', cohortId));
runner.on('opt-in-received',   (optIn)        => console.log('opt-in from', optIn.participantDid));
runner.on('participant-accepted', ({ participantDid }) => console.log('accepted', participantDid));
runner.on('keygen-complete',   ({ beaconAddress })     => console.log('beacon:', beaconAddress));
runner.on('update-received',   ({ participantDid })    => console.log('update from', participantDid));
runner.on('data-distributed',  ()                      => console.log('aggregated data sent'));
runner.on('validation-received', ({ participantDid, approved }) => console.log(participantDid, approved));
runner.on('signing-started',   ()  => console.log('starting MuSig2'));
runner.on('nonce-received',    ({ participantDid }) => console.log('nonce from', participantDid));
runner.on('signing-complete',  (result) => console.log('signature:', result.signature));
runner.on('error',             (err)    => console.error(err));

Subscribing is purely optional. Skipping events doesn’t change the protocol: they’re a side channel for observability.

Step 3: Run the protocol to completion

const result = await runner.run();
console.log('Final signature:', result.signature);
console.log('Signed tx hex:',   result.signedTx.toHex());
console.log('Cohort id:',       result.cohortId);

runner.run() returns a Promise that resolves with the final aggregation result once the MuSig2 signing session completes. That’s it: internally it has just driven the cohort through all four spec steps.

What `run()` actually does internally

So you understand what’s happening behind the scenes:

Cohort Formation (spec Step 1)
- Creates an AggregationCohort and emits cohort-advertised
- Sends a COHORT_ADVERT broadcast over the transport
- For each incoming COHORT_OPT_IN message:
  - Calls onOptInReceived(optIn): your decision callback
  - If accepted, registers the peer’s communication key, sends COHORT_OPT_IN_ACCEPT, emits participant-accepted
- Once minParticipants is reached, calls onReadyToFinalize: your decision callback
- If finalize: true, computes the n-of-n MuSig2 Taproot beacon address, sends COHORT_READY to all participants, emits keygen-complete
Update Submission (spec Step 2)
- For each incoming SUBMIT_UPDATE message: calls session.receive() and emits update-received
- Once all participants have submitted, the cohort phase becomes UpdatesCollected
Aggregate & Validate (spec Step 3)
- Builds the CAS Announcement Map (or SMT tree, depending on beaconType)
- Sends DISTRIBUTE_AGGREGATED_DATA to all participants, emits data-distributed
- For each incoming VALIDATION_ACK: tracks approval, emits validation-received
- Once all participants approve, calls onProvideTxData: your data callback
- Sends AUTHORIZATION_REQUEST to all participants with the unsigned tx, emits signing-started
Signing & Broadcast (spec Step 4)
- For each incoming NONCE_CONTRIBUTION: tracks the nonce, emits nonce-received
- Once all nonces are in, sends the aggregated nonce back via AGGREGATED_NONCE
- For each incoming SIGNATURE_AUTHORIZATION (a partial signature): adds it to the session
- Once all partial signatures are in, generates the final 64-byte Schnorr signature, attaches it to the tx, emits signing-complete, and resolves run()
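
Several of the steps above wait until every participant has contributed (updates, validation acks, nonces, partial signatures). A minimal sketch of that n-of-n bookkeeping follows; ContributionTracker is a hypothetical helper, not a class from the library.

```typescript
/** Tracks one contribution per expected participant (hypothetical helper). */
class ContributionTracker<T> {
  private readonly expected: ReadonlySet<string>;
  private readonly received = new Map<string, T>();

  constructor(participantDids: Iterable<string>) {
    this.expected = new Set(participantDids);
  }

  /** Returns true if the contribution was recorded, false if it was ignored. */
  add(did: string, value: T): boolean {
    // Ignore senders outside the cohort and repeat contributions.
    if (!this.expected.has(did) || this.received.has(did)) return false;
    this.received.set(did, value);
    return true;
  }

  /** True once every expected participant has contributed exactly once. */
  get complete(): boolean {
    return this.received.size === this.expected.size;
  }

  /** DIDs that have not contributed yet, useful for timeout diagnostics. */
  missing(): string[] {
    return Array.from(this.expected).filter((did) => !this.received.has(did));
  }
}
```

One tracker per phase (for example one for nonces and one for partial signatures) keeps the "once all are in" condition explicit and makes it easy to report who is stalling.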

You can broadcast the resulting signed transaction yourself using result.signedTx.toHex(): the runner doesn’t broadcast for you (that requires a Bitcoin connection, which is outside its scope).

Participant Step-by-Step

The participant joins cohorts advertised by services, submits its own DID update, validates the aggregated data, and contributes its share of the MuSig2 signature.

Step 1: Construct the runner

import { AggregationParticipantRunner, Resolver, Update } from '@did-btcr2/method';

const runner = new AggregationParticipantRunner({
  transport,                    // from the transport setup section
  did  : myDid,
  keys : myKeys,

  // Filter discovered cohorts. Default rejects all - you MUST override this.
  shouldJoin: async (advert) => advert.beaconType === 'CASBeacon' && advert.network === 'mutinynet',

  // REQUIRED: build and sign an update for the cohort's beacon
  onProvideUpdate: async ({ cohortId, beaconAddress }) => {
    // Resolve the current document, then construct an update that adds
    // the cohort's beacon address as a CASBeacon service entry.
    const doc = Resolver.deterministic({
      genesisBytes : myKeys.publicKey.compressed,
      hrp          : 'k',
      idType       : 'KEY',
      version      : 1,
      network      : 'mutinynet',
    });

    const unsigned = Update.construct(doc, [{
      op    : 'add',
      path  : '/service/-',
      value : {
        id              : `${myDid}#beacon-cas`,
        type            : 'CASBeacon',
        serviceEndpoint : `bitcoin:${beaconAddress}`,
      }
    }], 1);

    return Update.sign(myDid, unsigned, doc.verificationMethod![0], myKeys.raw.secret!);
  },
});

Two callbacks are required for the participant:

shouldJoin: filter which cohorts to join. Default is “reject everything”; you MUST override it.
onProvideUpdate: build the BTCR2 update you want included in the aggregated batch.

Two more callbacks are optional:

onValidateData: approve/reject aggregated data. Default approves if the data correctly includes your update.
onApproveSigning: approve/reject the actual signing. Default approves.

Step 2: Subscribe to events (optional)

runner.on('cohort-discovered',  (advert)             => console.log('found:', advert.cohortId));
runner.on('cohort-joined',      ({ cohortId })       => console.log('joined:', cohortId));
runner.on('cohort-ready',       ({ beaconAddress }) => console.log('beacon:', beaconAddress));
runner.on('update-submitted',   ({ cohortId })       => console.log('update sent'));
runner.on('validation-requested', (info)             => console.log('validating', info.cohortId));
runner.on('signing-requested',  (req)                => console.log('signing requested', req.sessionId));
runner.on('cohort-complete',    ({ beaconAddress }) => console.log('done:', beaconAddress));
runner.on('cohort-failed',      ({ reason })         => console.error('failed:', reason));
runner.on('error',              (err)                => console.error(err));

Step 3: Start the runner

Unlike the service, the participant runner is long-running. It listens indefinitely for new cohort adverts and processes each accepted cohort in parallel.

await runner.start();
// runner is now listening - control returns immediately
// Cohort processing happens via callbacks and events as messages arrive

To stop the runner cleanly:

runner.stop();

Step 3b: One-shot mode

For tests, demos, or scripts that should exit after joining a single cohort, use the joinFirst static helper:

const result = await AggregationParticipantRunner.joinFirst({
  transport,
  did             : myDid,
  keys            : myKeys,
  shouldJoin      : async (advert) => advert.serviceDid === expectedServiceDid,
  onProvideUpdate : async ({ beaconAddress }) => buildAndSignUpdate(beaconAddress),
});

console.log(`Joined cohort ${result.cohortId}, beacon: ${result.beaconAddress}`);

joinFirst waits for a matching cohort, joins it, drives the protocol to completion, and resolves with { cohortId, beaconAddress }.

What the participant runner does internally

Cohort Discovery (spec Step 1)
- Listens for COHORT_ADVERT broadcasts on the transport
- For each advert: emits cohort-discovered and calls shouldJoin(advert)
- If the filter accepts, registers the service’s communication key, sends COHORT_OPT_IN, emits cohort-joined
- Waits for COHORT_OPT_IN_ACCEPT (acknowledgment)
- Waits for COHORT_READY containing the cohort’s MuSig2 beacon address; validates the address against a locally computed value, emits cohort-ready
Update Submission (spec Step 2)
- Calls onProvideUpdate({ cohortId, beaconAddress }): your data callback
- Sends the resulting SignedBTCR2Update via SUBMIT_UPDATE, emits update-submitted
Validation (spec Step 3)
- Waits for DISTRIBUTE_AGGREGATED_DATA containing the CAS announcement (or SMT proof)
- Verifies that the aggregated data correctly includes the participant’s submitted update
- Calls onValidateData(info): your decision callback (default: approve if hash matches)
- Sends VALIDATION_ACK (approved or rejected)
- On rejection, emits cohort-failed and stops processing this cohort
Signing (spec Step 4)
- Waits for AUTHORIZATION_REQUEST containing the unsigned tx
- Calls onApproveSigning(req): your decision callback (default: approve)
- On rejection, emits cohort-failed and stops
- On approval: generates a MuSig2 nonce, sends NONCE_CONTRIBUTION
- Waits for AGGREGATED_NONCE from the service
- Generates and sends the SIGNATURE_AUTHORIZATION (partial signature)
- Emits cohort-complete
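
The approve path of the flow above can be written as a small transition table. The phase names below are hypothetical labels chosen for this sketch; the message types are the ones listed above. Rejection paths (which end in cohort-failed) are left out.

```typescript
// Phase names are hypothetical; message types come from the walkthrough above.
type Phase = 'OptedIn' | 'Accepted' | 'Ready' | 'Validating' | 'NonceSent' | 'Complete';

interface Transition {
  expects: string;  // incoming message type that advances this phase
  next: Phase;      // phase after the message is handled
  reply?: string;   // message type the participant sends in response
}

const TRANSITIONS: Partial<Record<Phase, Transition>> = {
  OptedIn:    { expects: 'COHORT_OPT_IN_ACCEPT',       next: 'Accepted' },
  Accepted:   { expects: 'COHORT_READY',               next: 'Ready',      reply: 'SUBMIT_UPDATE' },
  Ready:      { expects: 'DISTRIBUTE_AGGREGATED_DATA', next: 'Validating', reply: 'VALIDATION_ACK' },
  Validating: { expects: 'AUTHORIZATION_REQUEST',      next: 'NonceSent',  reply: 'NONCE_CONTRIBUTION' },
  NonceSent:  { expects: 'AGGREGATED_NONCE',           next: 'Complete',   reply: 'SIGNATURE_AUTHORIZATION' },
};

/** Advance one step; out-of-order or unknown messages leave the phase unchanged. */
function step(phase: Phase, incoming: string): { phase: Phase; reply?: string } {
  const t = TRANSITIONS[phase];
  if (!t || t.expects !== incoming) return { phase };
  return { phase: t.next, reply: t.reply };
}
```

Starting from OptedIn (the state after COHORT_OPT_IN is sent), feeding the five service messages in order walks the participant to Complete and yields the four replies it sends along the way.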

Decision Callbacks Reference

Service callbacks

Callback	Required?	Default	Purpose
`onProvideTxData`	✅ yes	n/a	Build the Bitcoin tx to MuSig2-sign once validation completes
`onOptInReceived`	optional	auto-accept	Decide whether to admit each participant who opts in
`onReadyToFinalize`	optional	finalize at minParticipants	Decide whether to finalize keygen now or wait for more

Participant callbacks

Callback	Required?	Default	Purpose
`onProvideUpdate`	✅ yes	n/a	Build and sign the update to include in this cohort
`shouldJoin`	required in practice	reject all	Filter which advertised cohorts to join. The default joins nothing, so you must override it
`onValidateData`	optional	approve if hash matches	Approve/reject the aggregated data
`onApproveSigning`	optional	approve	Approve/reject signing the Bitcoin tx

The defaults are designed so the simplest possible runner (provide only the required callbacks) works correctly for the happy path. Override the optional callbacks when you need user prompts, custom filters, fee policies, or audit logs.
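
As an example of overriding the optional service callbacks, here is a sketch of an allowlist admission policy and a "wait for a target size" finalize policy. The argument and return shapes are assumptions: this guide only shows that opt-ins carry participantDid and that finalizing is signalled with finalize: true, so the accepted field and the acceptedCount argument are hypothetical.

```typescript
// Hypothetical shapes: only `participantDid` and `finalize` appear in this guide.
interface OptInLike { participantDid: string }
interface OptInDecision { accepted: boolean }
interface FinalizeDecision { finalize: boolean }

/** Admit only allowlisted DIDs, up to a maximum cohort size. */
function makeOptInPolicy(
  allowlist: ReadonlySet<string>,
  maxParticipants: number,
): (optIn: OptInLike) => OptInDecision {
  const admitted = new Set<string>();
  return (optIn) => {
    const did = optIn.participantDid;
    const hasRoom = admitted.has(did) || admitted.size < maxParticipants;
    const accepted = allowlist.has(did) && hasRoom;
    if (accepted) admitted.add(did);
    return { accepted };
  };
}

/** Hold keygen until a target cohort size is reached. */
function makeFinalizePolicy(
  targetParticipants: number,
): (state: { acceptedCount: number }) => FinalizeDecision {
  return (state) => ({ finalize: state.acceptedCount >= targetParticipants });
}
```

Both policies are plain synchronous functions, so they can be unit-tested on their own and then wrapped in whatever callback signature the runner expects, for example onOptInReceived: async (optIn) => policy(optIn).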

Events Reference

`AggregationServiceRunner` events

Event	Payload	Fires when
`cohort-advertised`	`{ cohortId }`	After the cohort is created and advert is queued for broadcast
`opt-in-received`	`PendingOptIn`	A participant opts in (before `onOptInReceived` is called)
`participant-accepted`	`{ participantDid }`	After the operator accepts an opt-in
`keygen-complete`	`{ cohortId, beaconAddress }`	MuSig2 keygen finalizes: beacon address is now known
`update-received`	`{ participantDid }`	A participant submits a signed update
`data-distributed`	`{ cohortId }`	CAS announcement / SMT tree built and sent for validation
`validation-received`	`{ participantDid, approved }`	A participant’s validation ack arrives
`signing-started`	`{ sessionId }`	MuSig2 signing session begins (auth requests sent)
`nonce-received`	`{ participantDid }`	A participant’s MuSig2 nonce arrives
`signing-complete`	`AggregationResult`	Final signature computed (also resolves `run()`)
`cohort-failed`	`{ cohortId, reason }`	Cohort entered a terminal failure state (validation rejection, TTL/phase timeout). Also rejects `run()`.
`message-rejected`	`Rejection & { cohortId }`	The state machine silently dropped an incoming message. `code` distinguishes `WRONG_VERSION`, `UPDATE_TOO_LARGE`, `UPDATE_MALFORMED`, `UPDATE_VERIFICATION_FAILED`.
`error`	`Error`	Protocol or transport error (rejects `run()` for fatal errors)

`AggregationParticipantRunner` events

Event	Payload	Fires when
`cohort-discovered`	`CohortAdvert`	A new cohort advert arrives (before `shouldJoin`)
`cohort-joined`	`{ cohortId }`	After opt-in is sent
`cohort-ready`	`{ cohortId, beaconAddress }`	Cohort keygen finalizes
`update-submitted`	`{ cohortId }`	After the signed update is sent
`validation-requested`	`PendingValidation`	Aggregated data arrives (before `onValidateData`)
`signing-requested`	`PendingSigningRequest`	Auth request arrives (before `onApproveSigning`)
`cohort-complete`	`{ cohortId, beaconAddress }`	Partial signature sent: this participant is done
`cohort-failed`	`{ cohortId, reason }`	Validation rejected, signing rejected, or protocol error
`error`	`Error`	Non-fatal error

Error Handling

The runners distinguish fatal errors (which reject run() for the service or trigger cohort-failed for participants) from non-fatal errors (which only emit error events).

runner.on('error', (err) => {
  // Non-fatal: log and continue
  console.warn('aggregation warning:', err.message);
});

try {
  const result = await runner.run();  // service
  // success
} catch(err) {
  // Fatal: protocol couldn't complete
  console.error('aggregation failed:', err);
}

For participants, cohort-failed is the per-cohort failure signal. The runner stays alive and continues processing other cohorts even if one fails:

runner.on('cohort-failed', ({ cohortId, reason }) => {
  console.error(`cohort ${cohortId} failed: ${reason}`);
  // runner is still listening for other cohorts
});

Common fatal errors:

The transport rejects an outgoing message (no relays accepted it)
A required callback throws
An incoming message references unknown state (e.g., signing without prior keygen)
MuSig2 signature aggregation fails (e.g., a participant submitted an invalid partial signature)

Power-User: State Machine Layer

For tests, custom transports, or fine-grained control, drop down to the sans-I/O state machines directly. They have no transport coupling: every action method returns BaseMessage[] for you to send via whatever mechanism you choose.

Service state machine

import { AggregationService } from '@did-btcr2/method';

const session = new AggregationService({ did: serviceDid, keys: serviceKeys });

// Step 1: Cohort Formation
const cohortId = session.createCohort({ minParticipants: 2, network: 'mutinynet', beaconType: 'CASBeacon' });
const advertMsgs = session.advertise(cohortId);
// You're responsible for sending advertMsgs over your transport

// When opt-ins arrive, feed them in:
session.receive(optInMessage);

// Inspect state:
const pending = session.pendingOptIns(cohortId);

// Accept and finalize:
const acceptMsgs = session.acceptParticipant(cohortId, participantDid);
const readyMsgs = session.finalizeKeygen(cohortId);

// Step 2: collect updates
session.receive(submitUpdateMessage);

// Step 3: aggregate, distribute, and collect validation acks
const distributeMsgs = session.buildAndDistribute(cohortId);
session.receive(validationAckMessage);

// Step 4: signing
const authMsgs = session.startSigning(cohortId, txData);
session.receive(nonceContributionMessage);
const aggNonceMsgs = session.sendAggregatedNonce(cohortId);
session.receive(signatureAuthorizationMessage);

// Result
const result = session.getResult(cohortId);

Participant state machine

import { AggregationParticipant } from '@did-btcr2/method';

const session = new AggregationParticipant({ did: myDid, keys: myKeys });

// Receive an advert
session.receive(cohortAdvertMessage);

// Inspect discovered cohorts:
const discovered = session.discoveredCohorts;

// Join one:
const optInMsgs = session.joinCohort(cohortId);

// After receive() COHORT_READY, inspect joined cohort:
session.receive(cohortReadyMessage);
const joined = session.joinedCohorts.get(cohortId);

// Submit update:
const submitMsgs = session.submitUpdate(cohortId, signedUpdate);

// After receive() DISTRIBUTE_AGGREGATED_DATA, inspect validation:
session.receive(distributeMessage);
const validation = session.pendingValidations.get(cohortId);

// Approve:
const ackMsgs = session.approveValidation(cohortId);

// Signing
session.receive(authorizationRequestMessage);
const nonceMsgs = session.approveNonce(cohortId);
session.receive(aggregatedNonceMessage);
const partialSigMsgs = session.generatePartialSignature(cohortId);

Mixed mode

The Runner exposes the underlying state machine via runner.session, so you can use the Runner for the heavy lifting and reach into the session for advanced inspection:

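const runner = new AggregationServiceRunner({ /* ... */ });

// Capture the cohort id once the cohort has been advertised
let cohortId: string | undefined;
runner.on('cohort-advertised', (info) => { cohortId = info.cohortId; });

runner.on('opt-in-received', () => {
  if (!cohortId) return;
  // Read directly from the underlying session
  const pending = runner.session.pendingOptIns(cohortId);
  console.log('Opt-ins so far:', pending.size);
});

await runner.run();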

Running the E2E Demos

Five runnable scripts in lib/operations/aggregation/ demonstrate the runner API in four deployment configurations:

Script	Description
`e2e-nostr-transport.ts`	Single process, each actor has its own `NostrTransport` pointing to the same relay. Tests real Nostr signing/encryption.
`e2e-http-transport.ts`	Single process, local `node:http` server hosting `HttpServerTransport`, participants connect as `HttpClientTransport` via real `fetch`. Full MuSig2 round over loopback HTTP. No relay, no external service.
`e2e-verify-signing.ts`	Like `e2e-nostr-transport.ts` but additionally verifies the aggregated signature with `@noble/curves` BIP-340 `schnorr.verify`. Exits non-zero on any assertion failure. The canonical “does MuSig2 still work end-to-end” check.
`aggregation-service.ts` + `aggregation-participant.ts`	Two truly separate processes connecting to a relay. Production-realistic.

# HTTP transport: zero external dependencies, runs in ~1 second
PORT=8080 bun lib/operations/aggregation/e2e-http-transport.ts

# Nostr: single process, real relay (requires a local relay)
RELAY=ws://localhost:7777 bun lib/operations/aggregation/e2e-nostr-transport.ts

# Nostr with cryptographic signature verification (suitable for CI)
RELAY=ws://localhost:7777 bun lib/operations/aggregation/e2e-verify-signing.ts

# Multi-process: run each in its own terminal
RELAY=ws://localhost:7777 bun lib/operations/aggregation/aggregation-service.ts
RELAY=ws://localhost:7777 SERVICE_DID=<from above> bun lib/operations/aggregation/aggregation-participant.ts

All scripts exercise the same protocol and produce a 64-byte Schnorr signature on the same dummy P2TR transaction. They differ only in deployment topology and transport, not in protocol logic.

Why verify the signature cryptographically?

e2e-verify-signing.ts captures the Taproot-tweaked x-only pubkey and the BIP-341 witness-v1 sighash inside onProvideTxData, then calls schnorr.verify(signature, sighash, tweakedPk) after run() resolves. If that check passes, the aggregated signature would be accepted by any BIP-340 verifier (including a Bitcoin node) without needing a funded UTXO to actually broadcast. This is the cheapest way to catch a regression anywhere in the MuSig2 pipeline: key aggregation, TapTweak, nonce aggregation, partial-sig pre-verify, or partial-sig aggregation.

Production Deployment Notes

One transport per process

In production, each actor (service or participant) runs in its own process, with its own transport instance (Nostr or HTTP) registering exactly one actor. Sharing one transport across actors is a testing convenience: don’t do it in production unless you have a specific reason.

Relay selection (Nostr transport only)

Aggregation requires reliable delivery, especially for the encrypted directed messages (kind 1059 events with NIP-44 encryption). Use 2–3 relays for redundancy. Public relays may rate-limit or drop kind 1059 events under load. If you need guaranteed delivery, run your own relay.

Observed behavior against common public relays (as of the e2e-verify-signing runs in the refactor/aggregation branch):

Relay	Status	Notes
`ws://localhost:7777` (nostr-rs-relay)	✅ Works	Default for local demos
`wss://nostr-pub.wellorder.net`	✅ Works	Backfills historical events; serves kind 1059
`wss://relay.damus.io`	⚠️ Rate-limited	Aggressive anti-spam for anonymous writes: fine for small cohorts, fails under repeat publish
`wss://nos.lol`	❌ Hangs after advert	Service publishes advert successfully; participants never receive it. Not fixed by `broadcastLookbackMs` or advert re-publish.
`wss://relay.snort.social`	❌ Hangs after advert	Same failure mode as nos.lol.

The reliable production paths are (a) your own relay, or (b) a relay you’ve verified end-to-end with e2e-verify-signing.ts against your expected message volume. The broadcastLookbackMs filter and advertRepeatIntervalMs re-publish are hedges that help on some flaky relays but can’t overcome a relay that simply doesn’t route kind 1 events between subscribers the way the protocol expects.

Reliability hedges (Nostr transport)

The Nostr transport and runner expose two mechanisms to improve delivery on imperfect relays:

NostrTransportConfig.broadcastLookbackMs (default 5 min): applies a since filter to the COHORT_ADVERT subscription so relays that don’t auto-backfill still return recent adverts to late subscribers.
AggregationServiceRunnerOptions.advertRepeatIntervalMs (default 60 s): re-publishes the advert on a fixed cadence until keygen completes. Gives participants whose subscription was still settling when the initial advert went out a chance to catch a later copy.

Set either to 0 to opt out.
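
A Nostr subscription filter expresses since as a Unix timestamp in seconds, so a millisecond lookback window has to be converted. The helper below is a sketch of that conversion, assuming the bound is simply "now minus the window"; the function name and its exact role inside NostrTransport are assumptions.

```typescript
/** Convert a lookback window in ms into a Nostr `since` bound (Unix seconds). */
function sinceFilter(nowMs: number, broadcastLookbackMs: number): { since?: number } {
  // A window of 0 disables the bound, matching the "set to 0 to opt out" rule.
  if (broadcastLookbackMs <= 0) return {};
  return { since: Math.floor((nowMs - broadcastLookbackMs) / 1000) };
}

// Example: the 5-minute default, evaluated at the current time.
const advertFilter = sinceFilter(Date.now(), 5 * 60 * 1000);
console.log('advert subscription filter:', advertFilter);
```

The resulting object would be merged into the subscription filter for adverts, so a late subscriber asks the relay for events from the last five minutes instead of only new ones.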

The HTTP transport doesn’t need either hedge: the server retains the current advert and emits it to every new broadcast-SSE subscriber on connect; the inbox SSE stream replays buffered messages via Last-Event-ID on reconnect. See http-transport.md for the HTTP transport’s own deployment knobs (CORS, rate limiting, inbox buffer size, advert TTL).

Cohort TTL and phase timeouts

Unchecked, a stalled cohort (e.g. a participant goes offline mid-signing) keeps state machine entries alive forever. Two knobs on the runner bound that:

cohortTtlMs: overall wall-clock budget from run() to signing-complete. On expiry, the runner emits cohort-failed, rejects run() with a timeout error, and calls session.removeCohort(cohortId).
phaseTimeoutMs: maximum time allowed without a phase transition. Reset automatically on every observed phase change.

Both are unset by default: enable them in any production deployment.
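To make the difference between the two budgets concrete, here is a small model of their semantics with an injected clock. StallWatchdog and StallVerdict are hypothetical names for this sketch; the runner's real timers are internal to the library and are not reproduced here. The model assumes that an unset knob never fires and that a budget fires once the elapsed time reaches it.

```typescript
// Illustrative model only; not the runner's implementation.
type StallVerdict = 'ok' | 'phase-timeout' | 'cohort-ttl';

class StallWatchdog {
  private readonly startedAt: number;
  private lastPhaseChangeAt: number;

  constructor(
    private readonly cohortTtlMs: number | undefined,
    private readonly phaseTimeoutMs: number | undefined,
    nowMs: number,
  ) {
    this.startedAt = nowMs;
    this.lastPhaseChangeAt = nowMs;
  }

  // A phase change resets the phase timer, never the overall TTL.
  onPhaseChange(nowMs: number): void {
    this.lastPhaseChangeAt = nowMs;
  }

  check(nowMs: number): StallVerdict {
    if (this.cohortTtlMs !== undefined && nowMs - this.startedAt >= this.cohortTtlMs) {
      return 'cohort-ttl';
    }
    if (this.phaseTimeoutMs !== undefined && nowMs - this.lastPhaseChangeAt >= this.phaseTimeoutMs) {
      return 'phase-timeout';
    }
    return 'ok';
  }
}

// 10-minute overall budget, 2-minute per-phase budget, clock starts at 0.
const watchdog = new StallWatchdog(10 * 60_000, 2 * 60_000, 0);
watchdog.onPhaseChange(90_000);       // phase advanced at 1.5 min
console.log(watchdog.check(180_000)); // 'ok': 1.5 min since the last transition
console.log(watchdog.check(210_000)); // 'phase-timeout': 2 min without a transition
console.log(watchdog.check(600_000)); // 'cohort-ttl': overall budget exhausted
```

In this model a cohort that keeps making slow but steady progress is only ever stopped by cohortTtlMs, which is why setting both knobs is the safer choice.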

Message size limits

AggregationService enforces maxUpdateSizeBytes (default 256 KiB) on the canonicalized size of each SUBMIT_UPDATE body, before the expensive BIP-340 proof verification. Oversized payloads are dropped silently at the state-machine level and surfaced as message-rejected events with code UPDATE_TOO_LARGE. This protects against cheap bandwidth/CPU exhaustion from a hostile participant. Tune downward if your DID documents are typically smaller.
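The check itself is cheap, which is why it runs before proof verification. A minimal sketch of that kind of gate is below. checkUpdateSize, SizeCheck and DEFAULT_MAX_UPDATE_SIZE_BYTES are hypothetical names, the input is assumed to be the already-canonicalized body, and treating a body of exactly the limit as acceptable is an assumption of this sketch.

```typescript
// Stand-in for the service's check; only the UPDATE_TOO_LARGE code comes from the guide.
const DEFAULT_MAX_UPDATE_SIZE_BYTES = 256 * 1024; // 256 KiB

interface SizeCheck {
  ok: boolean;
  sizeBytes: number;
  code?: 'UPDATE_TOO_LARGE';
}

function checkUpdateSize(
  canonicalBody: string,
  maxBytes: number = DEFAULT_MAX_UPDATE_SIZE_BYTES,
): SizeCheck {
  // Measure UTF-8 bytes, not string length: non-ASCII characters take more than one byte.
  const sizeBytes = new TextEncoder().encode(canonicalBody).length;
  if (sizeBytes > maxBytes) {
    return { ok: false, sizeBytes, code: 'UPDATE_TOO_LARGE' };
  }
  return { ok: true, sizeBytes };
}

const verdict = checkUpdateSize('x'.repeat(300 * 1024));
console.log(verdict.ok, verdict.code, verdict.sizeBytes); // false UPDATE_TOO_LARGE 307200
```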

Injectable logger

NostrTransport takes an optional logger: Logger in its config. CONSOLE_LOGGER is the default; SILENT_LOGGER is provided for tests and environments that want to suppress transport diagnostics. Implement the Logger interface (debug/info/warn/error) to route to pino, winston, Sentry, or a structured-logging pipeline.
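A custom logger can be very small. The sketch below writes one JSON line per call to a sink of your choosing. The guide only names the four methods, so the (message, ...args) signature declared here is an assumption, and jsonLineLogger is a hypothetical helper; check the exported Logger type before relying on it.

```typescript
// Assumed shape of the Logger interface; the real signature may differ.
interface Logger {
  debug(message: string, ...args: unknown[]): void;
  info(message: string, ...args: unknown[]): void;
  warn(message: string, ...args: unknown[]): void;
  error(message: string, ...args: unknown[]): void;
}

type Level = 'debug' | 'info' | 'warn' | 'error';

// Builds a Logger that emits one JSON line per call to `sink`.
function jsonLineLogger(sink: (line: string) => void): Logger {
  const emit = (level: Level) => (message: string, ...args: unknown[]): void => {
    sink(JSON.stringify({ level, message, args }));
  };
  return {
    debug: emit('debug'),
    info: emit('info'),
    warn: emit('warn'),
    error: emit('error'),
  };
}

// Route to stdout here; swap the sink for pino, winston, or a log shipper.
const logger = jsonLineLogger((line) => console.log(line));
logger.warn('relay slow', { relay: 'wss://relay.damus.io' });
```

Keeping the sink separate from the formatting makes the same logger usable in tests, where the sink can simply push lines into an array.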

Beacon address funding

The onProvideTxData callback receives the cohort’s beaconAddress and signalBytes. Before MuSig2 signing can succeed, that address must already hold a UTXO. There are two common patterns:

Pre-funded address: the service operator funds the address out-of-band (e.g., via a watcher process) before the cohort starts signing. Simpler operationally.
First-update funding: a participant’s first update funds the beacon address itself by sending a small change output to it. Requires a more complex onProvideTxData that builds a tx with both the funding input and the beacon’s spending input.

Either way, the runner doesn’t manage funding: it only signs the tx you provide.

Transaction broadcasting

The runner produces a signed transaction but does NOT broadcast it. After runner.run() resolves, you broadcast result.signedTx.toHex() yourself via your Bitcoin connection. This separation keeps the runner free of Bitcoin RPC dependencies.

const result = await runner.run();
const txid = await bitcoin.rest.transaction.send(result.signedTx.toHex());
console.log('Broadcast:', txid);

CAS publishing

For CASBeacon cohorts, the CAS Announcement Map needs to be published to a content-addressed store (typically IPFS) so that resolvers can fetch it via the on-chain hash. This is also the operator’s responsibility: read it from runner.session.getCohort(cohortId).casAnnouncement after signing-complete and publish via your CAS client.

Decision callback latency

Decision callbacks are awaited inline in the protocol flow. If your onOptInReceived or onProvideTxData is slow (e.g., waiting for human review), the protocol blocks until the callback resolves. For UIs, you typically want to:

Surface the decision in the UI immediately (via the corresponding event)
Hold the callback open until the user clicks
Resolve the callback with the user’s decision

This is exactly the pattern an interactive client app should follow.
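One way to hold a callback open is a small deferred wrapper: the callback returns its promise, and the UI's click handler resolves it. PendingDecision is a hypothetical helper, and the boolean decision type is a stand-in for whatever your callback is actually expected to return.

```typescript
// Hypothetical helper; not part of the library.
class PendingDecision<T> {
  readonly promise: Promise<T>;
  private resolveFn!: (value: T) => void;
  private done = false;

  constructor() {
    // The executor runs synchronously, so resolveFn is set before the constructor returns.
    this.promise = new Promise<T>((resolve) => {
      this.resolveFn = resolve;
    });
  }

  get settled(): boolean {
    return this.done;
  }

  // Called from the UI click handler. Calls after the first are ignored.
  resolve(value: T): void {
    if (this.done) return;
    this.done = true;
    this.resolveFn(value);
  }
}

const pending = new PendingDecision<boolean>();

// Inside the decision callback: surface the request in the UI, then hand back the promise.
const onDecision = (): Promise<boolean> => pending.promise;

// Later, from the click handler:
pending.resolve(true);
onDecision().then((accepted) => console.log('user accepted:', accepted));
```

Because the protocol blocks while the promise is pending, it is worth pairing this with cohortTtlMs or phaseTimeoutMs so an unanswered prompt cannot hold a cohort open forever.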

Error recovery

There is no automatic retry for participant dropout. If a participant goes offline mid-protocol the cohort will either stall out until phaseTimeoutMs/cohortTtlMs fires, or deadlock indefinitely if neither is set. Your application should:

Set cohortTtlMs and/or phaseTimeoutMs so stalls terminate in bounded time
Listen for cohort-failed to drive operator UI / retry logic
Listen for message-rejected to surface individual submission failures back to the offending participant
Use a generous minParticipants floor so a single dropout isn’t fatal
For long-running participant runners, restart on the error event if necessary
Track which cohorts have completed via signing-complete / cohort-complete

Wire-format version

Every BaseMessage carries a version: number field. The current version is exposed as AGGREGATION_WIRE_VERSION. Receive handlers reject any message whose version doesn’t match: there is no implicit backward compatibility. Bumping the constant is a breaking protocol change; coordinate across all participants and any relay / transport middleware before changing it.
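For custom middleware that wants to apply the same rule before forwarding, the gate is a strict equality check. In this sketch WIRE_VERSION is a placeholder value standing in for AGGREGATION_WIRE_VERSION, and VersionedMessage models only the fields the example needs (the type field is an assumption).

```typescript
// Placeholder: use the library's AGGREGATION_WIRE_VERSION in real code.
const WIRE_VERSION = 1;

interface VersionedMessage {
  version: number;
  type: string; // assumed field, present only to make the example readable
}

// Strict equality: older and newer versions are both rejected.
function acceptsVersion(msg: VersionedMessage, supported: number = WIRE_VERSION): boolean {
  return msg.version === supported;
}

console.log(acceptsVersion({ version: 1, type: 'COHORT_ADVERT' })); // true
console.log(acceptsVersion({ version: 2, type: 'COHORT_ADVERT' })); // false
```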

Extension Points

Beacon strategy registry

New beacon types can be added without touching the service or participant state machines. Implement AggregateBeaconStrategy and call registerBeaconStrategy(strategy).

import type { AggregateBeaconStrategy } from '@did-btcr2/method';
import { registerBeaconStrategy } from '@did-btcr2/method';

const MyBeaconStrategy: AggregateBeaconStrategy = {
  type : 'MyBeacon',

  // Service-side: build the aggregated data on the cohort.
  buildAggregatedData(cohort) {
    // Populate cohort.signalBytes and any strategy-specific state.
  },

  // Service-side: per-participant payload for DISTRIBUTE_AGGREGATED_DATA.
  getDistributePayload(cohort, participantDid) {
    return { /* custom body fields */ };
  },

  // Participant-side: verify the aggregated view reflects the local update.
  validateParticipantView({ participantDid, submittedUpdate, expectedHash, body }) {
    // Placeholder result: compute this from body and expectedHash.
    return { matches: false };
  },
};

registerBeaconStrategy(MyBeaconStrategy);

Once registered, passing beaconType: 'MyBeacon' in CohortConfig routes through the new strategy on both the service and participant sides.

Discriminated message types

For consumers writing custom transports or middleware, every message type has a narrow body interface and a type guard exported from @did-btcr2/method:

import type { AggregationMessage, CohortOptInBody } from '@did-btcr2/method';
import {
  isCohortOptInMessage,
  isSubmitUpdateMessage,
  isAuthorizationRequestMessage,
} from '@did-btcr2/method';

function route(msg: AggregationMessage) {
  if(isCohortOptInMessage(msg)) {
    // msg.body is narrowed to CohortOptInBody here
    const { cohortId, participantPk } = msg.body;
    // ...
  } else if(isSubmitUpdateMessage(msg)) {
    // msg.body is narrowed to SubmitUpdateBody
    // ...
  }
}

AggregationMessage is the full discriminated union of every well-formed message variant.

Sans-I/O introspection helpers

Some state-machine methods added for runner plumbing are also useful from custom transports:

Method	Purpose
`session.drainRejections(cohortId)`	Pop the list of silent drops recorded since the last drain. The runner forwards these as `message-rejected` events.
`session.removeCohort(cohortId)`	Remove a cohort from `#cohortStates` once it’s finished. Used by the runner on successful completion and on failure.
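The drain-on-read behaviour described for drainRejections can be modelled as follows. RejectionLog and the Rejection shape are assumptions made for this sketch, not the library's types; the point is only that each drain returns what was recorded since the previous one and leaves nothing behind.

```typescript
// Model of drain-on-read semantics; the Rejection shape is an assumption.
interface Rejection {
  cohortId: string;
  code: string;
}

class RejectionLog {
  private readonly byCohort = new Map<string, Rejection[]>();

  record(rejection: Rejection): void {
    const list = this.byCohort.get(rejection.cohortId) ?? [];
    list.push(rejection);
    this.byCohort.set(rejection.cohortId, list);
  }

  // Returns everything recorded since the previous drain and clears it.
  drain(cohortId: string): Rejection[] {
    const list = this.byCohort.get(cohortId) ?? [];
    this.byCohort.delete(cohortId);
    return list;
  }
}

const log = new RejectionLog();
log.record({ cohortId: 'c1', code: 'UPDATE_TOO_LARGE' });
console.log(log.drain('c1').length); // 1
console.log(log.drain('c1').length); // 0: already drained
```

A custom transport that bypasses the runner would need to call the real drainRejections on its own cadence, since nothing else forwards those drops as events.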

Quick Reference

Task	Where to look
Create a service runner	Service Step-by-Step
Create a participant runner	Participant Step-by-Step
Decide what callbacks to override	Decision Callbacks Reference
Wire up event listeners	Events Reference
Handle errors	Error Handling
Work with the raw state machine	Power-User: State Machine Layer
Run example scripts	Running the E2E Demos
Deploy to production	Production Deployment Notes
Add a new beacon type	Beacon strategy registry
Narrow an incoming message	Discriminated message types
Pick a public Nostr relay	Relay selection
Configure the HTTP transport	http-transport.md
Bound a stalled cohort	Cohort TTL and phase timeouts
