StableLens

Methodology behind the StableLens Score

Version v2.0 · Model v1.1.0 · Locked 26 May 2026

The StableLens Score is a proprietary analytical composite scored 0–100 across the venue-aware framework. This page documents the full audit trail — who reviews the Score, where the data comes from, what we don't cover, and how to dispute or cite the output.

The StableLens Score is a proprietary analytical composite reflecting reserve composition, audit transparency, redemption rights, issuer licensing, consumer protection, and regulatory classification. It is not a credit rating. StableLens is not a registered Nationally Recognized Statistical Rating Organization (NRSRO) and the Score does not constitute investment advice, an offer to buy or sell any security, or a recommendation of any kind. Allocators are responsible for their own diligence.

1. Mission

Stablecoins now settle more daily volume than Visa. They underpin DeFi, on-ramp/off-ramp infrastructure, and an increasing share of institutional treasury operations. But there is no neutral, third- party rating system for them — only issuer marketing, sporadic journalism, and DeFi dashboards focused on TVL.

StableLens exists to fill that gap. We score every tracked stablecoin on a fixed six-dimension compliance framework, monitor pegs continuously, and surface yield opportunities with an honest risk-adjusted view. We differ from generalist on-chain analytics platforms (Glassnode, Nansen, DeFiLlama) on three fronts: stablecoin-specialized, multi-jurisdictional regulatory coverage, and methodology-versioned with an audit trail per score (with the multi-agent audit trail described in Section 2 shipping incrementally; current scoring is deterministic and reproducible).

2. Compliance scoring framework — six dimensions

The compliance scoring framework below is a separate layer from the venue-aware yield-risk model surfaced in section 2.5 below. Compliance scores rate the asset (issuer transparency, regulatory posture, treasury composition); yield-risk scores rate a position in a pool (smart-contract surface, exit cost, chain finality). They share inputs in places but compose independently.

Composite scores are 0–100, mapped to a 10-step letter grade ladder — full thresholds in section 3 below. The mapping is deliberately tighter at the extremes: AAA requires 95+; D triggers below 20.

DimensionWeightSpecialist agentStatus
Treasury & Reserves25%specialist.reserve_compositionlive
Regulatory Status20%specialist.regulatory_statuslive
Peg Stability20%specialist.peg_stabilitylive
Audit & Code Security15%specialist.audit_securityproxy
Issuer Operational Health10%specialist.operational_healthproxy
Historical Behavior10%specialist.historical_behaviorlive

Per-dimension detail

Treasury & Reserves (25%)

What it measures: What backs the stablecoin: cash, T-bills, commercial paper, crypto collateral, or algorithmic.

Inputs: Issuer attestations (monthly), audit reports (quarterly), banking-partner credit ratings (S&P Global), on-chain reserve verification for transparent stables (USDC, DAI).

Regulatory Status (20%)

What it measures: Which jurisdictions the issuer is licensed or registered in; pending enforcement actions; compliance with MiCA, NYDFS, MAS, etc.

Inputs: NYDFS Trust Charter list, FinCEN MSB registry, MAS DPT licensee list, EU MiCA white paper publications, news flow flagged by the regulatory watcher.

Peg Stability (20%)

What it measures: Historical and current deviation from $1.00 (or anchor target). Recovery time after stress events.

Inputs: On-chain price feeds (every 10 minutes), CoinGecko spot prices, depeg event history.

Audit & Code Security (15%)proxy

What it measures: Smart-contract audit coverage, audit-firm tier, exploit history, time-to-disclosure, governance attack surface.

Inputs: Audit reports from Trail of Bits / OpenZeppelin / Certora / Spearbit / Halborn, exploit databases (Rekt, DeFi Llama Hacks), GitHub commit history.

Issuer Operational Health (10%)proxy

What it measures: Engineering team size and continuity, redemption infrastructure uptime, customer-support responsiveness, transparency cadence.

Inputs: GitHub contributor activity, redemption-portal availability monitoring, public communication frequency, employee-count trends from LinkedIn / Crunchbase.

Historical Behavior (10%)

What it measures: Track record of past commitments: did they redeem 1:1 during stress events? Did attestations match on-chain reality?

Inputs: Depeg event archive (this product), public statements vs. on-chain outcomes, attestation-to-reality reconciliation history.

Honest about proxies: dimensions marked proxy use a heuristic input until the proper data pipeline lands. For example, Audit & Code Security currently aggregates audit-firm tier and exploit history but doesn't yet read individual audit reports for finding-severity. Each proxy dimension carries its own roadmap milestone in the engineering backlog.

How StableLens rates DeFi vs CeFi yield

A yield product's risks depend on where it lives. A DeFi pool exposes you to smart-contract code, oracle dependencies, bridge surfaces, and AMM liquidity. A CeFi platform exposes you to custody arrangements, withdrawal gating, the platform's solvency, and the integrity of its proof-of-reserves. Grading them with the same dimension set forces inapplicable signals to average out to neutral — which is dishonest about the actual risks.

The applicability principle: a dimension that doesn't apply to a venue is removed from the model for that row, not neutral-filled. The remaining dimensions are re-normalized so the weights still sum to 1.00. Two consequences:

  • A CeFi row carries zero contribution from smart-contract, oracle-bridge, settlement-chain, AMM-liquidity, or emission-yield dimensions — those are not in its profile at all.
  • A DeFi row carries zero contribution from custody, PoR cadence, withdrawal-gating, or platform-licensing dimensions — same reason.

DeFi risk profile

17 dimensions. The 12 universal dimensions (asset-level) + 5 DeFi-specific dimensions (code, oracle/bridge, chain, AMM liquidity, real-yield sustainability). Weights are preserved from the v1.1 model (Code Session U Phase U-2 calibration against the 12 historical-failure backtest).

DeFi profile · 17 dimensions · weights sum to 1.00
DimensionWeightWhat it measuresPublic data source
Peg stability9%Magnitude + persistence of price deviation from $1.00 (or anchor target).On-chain price feeds + CoinGecko spot prices (every 10 minutes).
Issuer / asset quality10%Issuer charter, treasury composition, banking partners; underlying asset composition for tokenized funds.Public issuer disclosures, attestations, audit reports, banking-partner credit ratings (S&P Global).
Counterparty (coin)5%Coin-issuer counterparty risk distinct from any venue. CDP/synthetic stables expose collateral providers.On-chain collateral graph + issuer governance docs (public).
Regulatory standing (coin)5%Coin-issuer registration + licensure standing across the major jurisdictions.NYDFS Trust Charter list, FinCEN MSB registry, MAS DPT list, EU MiCA white paper publications.
Counter-stress3%Behavior under ETH-down-40%, USDC-depeg, regime-change scenarios — pre-computed stress vectors.On-chain price + supply history; deterministic stress engine.
Collateral quality9%Penalizes reflexive collateral (LUNA→UST, BCT→IRON, FTT→FTX). Among the most predictive failure signals.On-chain collateral composition + circulating-supply graph.
Concentration6%Venue + counterparty concentration of holdings, settlement, and exit liquidity (Voyager←3AC, deUSD).On-chain wallet clustering + DEX/CEX volume share data.
Banking partner7%Concentration in custodial banking relationships (USDC/SVB precedent).Public custodian disclosures + bank financial reports (FDIC, SEC).
Contagion / composability5%Inherited risk from upstream protocols and stablecoins (deUSD ← xUSD, USDe cascade).Curated public dependency graph (lib/alerts/contagion-graph), every edge cited to a primary source.
Transparency / off-chain dependency3%Reliance on opaque off-chain managers or unverifiable backings (xUSD, Celsius).Public disclosures + attestation methodology + verifiability of reserve claims.
Reserve liquidity2%Liquidity of reserves under redemption stress (USDR illiquid-real-estate precedent).Public reserve composition + asset-class liquidity benchmarks.
Governance attack surface2%Vulnerability to governance attacks + adverse parameter changes (Beanstalk, sUSD SIP-420).On-chain governance contracts + public proposal history.
Smart-contract12%Code-level risk: audit count, audit recency, critical findings, exploit history.Public audit reports + exploit databases (Rekt News, DefiLlama hacks).
Oracle + bridge4%Oracle manipulation surface + bridge dependency for cross-chain settlement.On-chain oracle providers + bridge contract analysis (public).
Settlement chain4%Chain-specific risk: validator decentralization, finality guarantees, bridge censoring history.Public chain telemetry + validator/staker distribution.
Liquidity + exit cost8%AMM pool depth, slippage to exit at the target size, withdrawal capacity.On-chain AMM pool snapshots (DefiLlama, on-chain quoters).
Sustainability (real-yield ratio)6%Real yield (lending interest, LP fees) vs token-emission yield. Higher emission share = less sustainable.DefiLlama apyBase vs apyReward decomposition.

CeFi risk profile

17 dimensions. The same 12 universal dimensions + 5 CeFi-specific dimensions (custody, proof-of-reserves cadence, platform solvency, withdrawal gating, platform regulatory standing). The 5 DeFi-only dimensions are removed; the 0.34 weight they carried is redistributed across the new CeFi-only dimensions.

Interim status: CeFi-profile weights ship as a starting calibration. The per-profile backtest gate (Y-6 of the venue-aware methodology spec) must validate them against historical platform stress events before any CeFi grade is published publicly. Today CeFi rows render “limited data” until both the backtest passes and platform-level public inputs (custody disclosures, PoR snapshots, etc.) are ingested.
CeFi profile · 17 dimensions · weights sum to 1.00 (calibration-pending)
DimensionWeightWhat it measuresPublic data source
Peg stability9%Magnitude + persistence of price deviation from $1.00 (or anchor target).On-chain price feeds + CoinGecko spot prices (every 10 minutes).
Issuer / asset quality10%Issuer charter, treasury composition, banking partners; underlying asset composition for tokenized funds.Public issuer disclosures, attestations, audit reports, banking-partner credit ratings (S&P Global).
Counterparty (coin)5%Coin-issuer counterparty risk distinct from any venue. CDP/synthetic stables expose collateral providers.On-chain collateral graph + issuer governance docs (public).
Regulatory standing (coin)5%Coin-issuer registration + licensure standing across the major jurisdictions.NYDFS Trust Charter list, FinCEN MSB registry, MAS DPT list, EU MiCA white paper publications.
Counter-stress3%Behavior under ETH-down-40%, USDC-depeg, regime-change scenarios — pre-computed stress vectors.On-chain price + supply history; deterministic stress engine.
Collateral quality9%Penalizes reflexive collateral (LUNA→UST, BCT→IRON, FTT→FTX). Among the most predictive failure signals.On-chain collateral composition + circulating-supply graph.
Concentration6%Venue + counterparty concentration of holdings, settlement, and exit liquidity (Voyager←3AC, deUSD).On-chain wallet clustering + DEX/CEX volume share data.
Banking partner7%Concentration in custodial banking relationships (USDC/SVB precedent).Public custodian disclosures + bank financial reports (FDIC, SEC).
Contagion / composability5%Inherited risk from upstream protocols and stablecoins (deUSD ← xUSD, USDe cascade).Curated public dependency graph (lib/alerts/contagion-graph), every edge cited to a primary source.
Transparency / off-chain dependency3%Reliance on opaque off-chain managers or unverifiable backings (xUSD, Celsius).Public disclosures + attestation methodology + verifiability of reserve claims.
Reserve liquidity2%Liquidity of reserves under redemption stress (USDR illiquid-real-estate precedent).Public reserve composition + asset-class liquidity benchmarks.
Governance attack surface2%Vulnerability to governance attacks + adverse parameter changes (Beanstalk, sUSD SIP-420).On-chain governance contracts + public proposal history.
Custody arrangement11%Legal segregation of customer funds; cold-storage share; on-chain wallet attestations.Public custody disclosures (ToS, regulator filings), on-chain multisig proofs.
Proof-of-reserves cadence8%Frequency + methodology (merkle / third-party attested / self-reported) of public PoR snapshots.Published PoR snapshots + auditor methodology docs (public).
Counterparty solvency (platform)8%Public solvency signals — lawsuits, layoffs, on-chain wallet outflow velocity. Distinct from the coin-level counterparty dim.PACER court records, WARN-Act filings, on-chain wallet clusters.
Withdrawal gating4%Historical record of pausing, freezing, or delaying withdrawals (single event = material signal).Platform status pages + regulator orders + public press record.
Platform regulatory (licensing)3%Licensing + registration standing from public registries only. Strictly NOT a CLARITY/GENIUS securities classification — that work is counsel-gated and lives separately.NYDFS BitLicense, FinCEN MSB, NMLS state-MSB registry, FCA, MAS, MFSA — all PUBLIC.

Strict boundary: the platform regulatory dimension grades licensing + registration standing from public registries only (NYDFS BitLicense, FinCEN MSB, NMLS state-MSB, FCA, MAS, MFSA). It is NOT a CLARITY / GENIUS securities classification — that work is counsel-gated and lives separately in the regulatory-scoring module. The two never share inputs or weights.

Category → profile routing

The unified yield_sources universe carries a category discriminator on every row. The grader maps each category to its profile:

CategoryProfileNote
defi_poolDeFi profileOn-chain, non-custodial pools.
cefi_rateCeFi profileCustodial-venue rate products.
tokenized_treasuryCeFi profile (interim)Routes to CeFi until a dedicated tokenized-treasury profile lands.
fintech_rewardCeFi profile (interim)Custodial-fintech rewards programs; routes to CeFi until a dedicated profile lands.
bank_stablecoinCeFi profile (interim)Chartered-bank-issued stablecoins; routes to CeFi until a dedicated profile lands.

Confidence, coverage, and “limited data”

Every grade ships with three numbers:

  • Composite (0–100) — the weighted average of present sub-scores after re-normalization.
  • Confidence (0–1) — coverage-weighted average of per-dimension confidences, scaled by the coverage ratio.
  • Coverage (0–1) — the fraction of the profile's applicable weight that's actually present (not missing) on this row.

A row is flagged limited data — surfaced as “—” across the UI rather than a numeric grade — when:

  • composite confidence < 0.50, or
  • coverage < 0.70 (less than 70% of the profile's weight is supported by real data).

When a row is limited-data, its cross-venue sort tier becomes ungraded, so a poorly-supported high score never outranks a well-supported A in the safest-first sort. This is the no-fabrication contract operationalized end-to-end.

Versioning

Profile weights carry an explicit version stamp:PROFILE_WEIGHTS_VERSION = “2.0.0. Every persisted grade records the profile it was scored against and the weights version, so a future calibration bump (2.0.0 → 2.1.0, etc.) re-grades cleanly and the audit trail stays intact.

How StableLens benchmarks stablecoin yield against risk-free cash

A stablecoin yield is only interesting in comparison to what a US-dollar holder can earn risk-free. StableLens shows every stablecoin yield as a spread over the 3-month Treasury bill, paired with the venue's independent risk tier. The investor judges the trade-off: +170 bps for an A-tier custody risk vs +40 bps for the same are very different decisions.

The risk-free anchor

The primary anchor is the 3-month Treasury bill yield (FRED series DGS3MO), pulled live by the existing macro pipeline. Additional context rates surface on /macro: the Federal Funds rate (DFF), 2- and 10-year Treasury yields (DGS2, DGS10), and the 10Y-2Y curve (T10Y2Y).

Two reference numbers anchor the “cash” band of the spectrum and are reviewed weekly from public sources:

  • FDIC national-average savings rate — the “doing nothing” baseline, published weekly by the FDIC.
  • Top insured-cash reference — a single curated public reference for the upper bound of FDIC-insured cash yields available to a US saver. One number, not a maintained directory. StableLens does NOT publish a per-institution rate list, does NOT show “open account” links, and does NOT accept compensation from any deposit institution.

Excess yield (spread over risk-free)

For any stablecoin yield product:

excess_yield(row) = row.yield_apy − risk_free_rate

UI surfaces show excess yield in basis points alongside the row's StableLens grade and venue tier. The two are never collapsed into a single number — that would obscure the trade-off the product exists to expose. Cross-venue sorting uses excess yield (not raw APY) as the within-tier tie-breaker, so a 4.2% yield when T-bills pay 3.7% (+50 bps) ranks below a 5.4% yield (+170 bps) at the same tier.

The dollar-yield spectrum

The spectrum strip on /yields and /optimizer lays out the cash-to-DeFi ladder visually: risk-free cash → bank-issued stablecoins → tokenized treasuries → CeFi rates → DeFi pools. Each rung shows its top yield, the StableLens grade for that venue, and the spread over the 3-month Treasury.

Rungs with no graded venue yet render an honest “—”. Independence and provenance constraints apply uniformly: every rung's data is sourced from public / on-chain / regulatory records only, and TradFi rungs carry zero affiliate relationships of any kind.

3. Composite → letter grade & risk-adjusted APY

Each yield product's sub-scores from the venue profile above are weighted-averaged into a 0–100 composite, which maps to a letter grade with deliberately-tighter bands at the extremes. Source: lib/risk/grade.ts.

AAA95Highest-scored under model
AA90Highest-scored under model
A80Lower-risk under methodology
BBB70Lower-risk under methodology
BB60Moderate under methodology
B50Moderate under methodology
CCC40Elevated risk
CC30Elevated risk
C20High risk
D0Severe risk

Risk-adjusted APY

Risk-adjusted APY is computed as radjAPY = APY × confidence × (1 − damping(grade)). Damping factors are tier-graded: AAA = 0 (no damping), AA = 0.05, A = 0.15, BBB = 0.30, BB = 0.50, B = 0.65, CCC = 0.75, CC = 0.80, C = 0.83, D = 0.85. A 10% APY pool with a BBB grade and 0.9 confidence becomes 6.3% risk-adjusted.

Source of truth lives in-repo at lib/risk/ with full unit-test coverage. Weights are sanity-checked at module load — the test suite fails immediately if they don't sum to 1.0.

4. Peg deviation calculation

Peg status is one of three values:

  • STABLE — deviation from anchor < 0.5% over the most recent 10-minute window. Outlier ticks (single-block flash deviations) are excluded if they don't persist for two consecutive 10-minute windows.
  • WARNING — deviation between 0.5% and 2.0% sustained across two consecutive windows.
  • DEPEG — deviation > 2.0% sustained across two consecutive windows. Triggers depeg-event archival and alerts on Pro / Team / Institutional.

Anchor selection: USD-pegged stablecoins anchor to $1.00 (no FX adjustment; we report the deviation, the user interprets it). Algorithmic stables that drift by design (rebasing, collateral ratios) are reported with their actual peg-target if disclosed by the issuer.

Data sources: Alchemy RPC for on-chain price feeds (Chainlink + Pyth where available; falls back to DEX TWAPs), CoinGecko spot for cross-check. Disagreement > 0.3% between sources is flagged for human review and the peg status holds at the previous state until resolved.

5. Data sources

Every input to the scoring system, with refresh cadence and maturity status. We are deliberately transparent about which sources are live vs. planned — overclaiming data coverage is a trust-killer.

CategorySourceTypeCadenceUsed inStatus
Stablecoin metadataCoinGecko Pro APIAggregatorHourlyAllLive
Yield + TVLDeFiLlama Pro APIAggregatorHourlyYield Risk · LiquidityLive
On-chain pegAlchemy RPC (5 chains)Primary10 minutesPeg StabilityLive
Macro indicatorsFRED (Federal Reserve)PrimaryDailyMacro contextLive
Reserve attestationsTether / Circle / Paxos / Gemini reportsPrimaryMonthly–QuarterlyTreasury & ReservesLive
NY DFS Trust CharterNY Department of Financial ServicesPrimaryOn changeRegulatory StatusLive
EU MiCA filingsESMA white paper registerPrimaryOn changeRegulatory StatusLive
Audit reportsTrail of Bits / OpenZeppelin / CertoraPrimaryOn publishAudit & Code SecurityLive
Banking partner ratingsS&P Global Capital IQPrimaryOn changeTreasury & ReservesPlanned Q3 2026
Exploit databaseDeFi Llama Hacks + RektAggregatorDailyAudit & Code SecurityLive
Redemption uptimeIn-house monitoringPrimary5 minutesOperational HealthPlanned Q3 2026
Layer-2 RPCsAlchemy (Arbitrum / Base / Optimism / Polygon / Mainnet)PrimaryOn queryChain Risk · Peg StabilityLive

6. Update frequency by tier

TierPeg refreshCompliance refreshAlertsAPI
Free10-min cycleHourlyEmail · 5 alerts100 req/day
Starter10-min cycleHourlyEmail · daily summary1,000 req/day
Pro10-min cycleHourlyEmail · sub-60s delivery on detection10,000 req/day
Team10-min cycleHourlyEmail + Slack + webhook · sub-60s delivery25,000 req/day per seat
Institutional10-min cycleHourly + on-eventAll channels + webhook · sub-60s deliveryUnlimited · custom SLA

7. Methodology versioning

Methodology changes are versioned (e.g., v1.0 → v2.0). When the framework changes, the historical scores produced by previous versions are preserved at the version that produced them. The score page shows which version produced which grade.

Material methodology updates (new dimensions, weight changes > 5%, threshold adjustments) trigger a public changelog entry and a 30-day notice before the new version takes effect on production scores. Issuers and customers can comment during the notice period.

The model version is bumped in lib/risk/version.ts whenever per-dimension logic, weights, or grade thresholds change. Current methodology version: v2.0 (locked 2026-05-26); current scoring-engine version: v1.1.0. See methodology changelog (placeholder; populated as the methodology evolves).

8. Confidence and human review

Every sub-score carries a confidence value between 0 and 1, computed from input freshness, source agreement, and historical specialist accuracy. The composite confidence is a weighted average across dimensions. A dimension with no relevant fields on the pool returns confidence 0.2–0.3 with a neutral score — this lets the UI distinguish "we know this is risky" from "we don't have enough data to say".

Look for the LOW CONF tag in the breakdown card on each yield's detail page — that's the UI surface for low-confidence composites.

Publication thresholds

  • Confidence ≥ 0.90 — the score auto-publishes.
  • Confidence 0.70 – 0.89 — auto-publishes but flagged for QA-auditor review within 7 days.
  • Confidence 0.50 – 0.69 — held in queue. A human reviewer must approve before the score updates publicly.
  • Confidence < 0.50 — auto-rejected. The previous score holds until enough evidence accumulates. The dispute process is the path forward.

Public score pages show the agent reasoning with citation links — the entire chain of evidence that led to the grade is visible to any reader, on every score, always.

9. Issuer dispute process

Issuers can dispute any score on their stablecoin. The process:

  1. Issuer submits evidence at /methodology/dispute (placeholder for the Phase 2 form).
  2. A human reviewer evaluates the evidence within 5 business days.
  3. If the score is revised, an explicit changelog entry on the score page records: the prior grade, the new grade, the evidence accepted, the reviewer, the timestamp.
  4. If the score is not revised, the issuer receives a written rationale and can re-submit with new evidence.

We do not silently revise scores. Every revision is documented publicly. This is a pre-requisite for the page being trustworthy.

10. Limitations and caveats

What StableLens is honest about not covering:

  • We are not auditors. We aggregate third-party audit reports; we don't perform smart-contract code review ourselves.
  • We don't make subjective business-quality judgments about issuer teams beyond the operational-health signal.
  • We don't cover items not in the documented dimensions. If a dimension matters and isn't listed in section 2, it isn't in our score.
  • Scoring is informational, not investment advice. A AAA grade is not a recommendation to buy. A D grade is not a recommendation to short. Stablecoin and DeFi markets carry material risk including total loss of capital.
  • Methodology is opinionated. We make explicit choices about weights and thresholds. Reasonable people will disagree. Our defense is transparency: every choice is documented above.

11. The multi-agent compliance system — future roadmap

Current state vs roadmap. Today’s production scores come from the deterministic, rule-based scorer described in Section 2 (six regulatory dimensions plus capped fallbacks for issuers without a regulatory profile) and the venue-aware DeFi/CeFi profiles in Section 2.5 above (currently scoring model v1.1.0). The multi-agent system described below is future work, post-methodology-v2.0 — the public.agent_runs audit-trail table exists but is currently empty in production. We’re shipping the agent layer incrementally and will only retire the deterministic scorer once the agent system passes QA against historical labels.

Once shipped, every StableLens score will be produced by a multi-agent system, not a single LLM call. The architecture exists to make scores reproducible, auditable, and resistant to single-point bias.

Continuous watchers will ingest evidence from primary sources — regulator filings, issuer attestations, on-chain price feeds, audit reports, exploit databases, news flow. Specialist analyst agents will interpret each dimension (one specialist per dimension; see the framework in Section 2). Each specialist will produce a sub-score with explicit reasoning and inline citations.

Two adversarial review agents — Devil's Advocate and Bull Advocate — will actively challenge proposed scores before they reach a final grade. A synthesis agent will compose the sub-scores into a composite letter grade. An editorial agent will generate the public-facing reasoning report.

A citation verification agent will confirm every source URL resolves and the quoted excerpt matches the live source. A QA auditor agent will sample 5% of scores weekly to catch systematic drift. A methodology agent will propose framework updates (new dimensions, weight changes); these require explicit human approval before deployment.

Once the agent layer ships, every score will have a complete audit trail stored append-only in public.agent_runs: which agents ran, which model was used, which inputs they saw, what reasoning produced the output, which citations supported the claims, what confidence value was assigned. Disputes can be reproduced from this trail.

Agent roles (roadmap)

RoleWhat it does
watchersContinuous evidence ingestion. One per data category — regulatory.us, regulatory.eu, regulatory.asia, reserve_attestations, audit_reports, exploit_feed, news_flow, on_chain_pegs.
specialistsPer-dimension scoring. One per compliance + yield-risk dimension. Reads from the watchers and produces a structured sub-score with reasoning + citations.
devil's advocateAdversarial review. Actively challenges proposed scores, surfaces evidence the specialists may have under-weighted, lowers confidence when warranted.
bull advocateCounter-adversarial review. Catches over-conservative scoring driven by stale data or fear-of-missing-issues bias.
synthesizerComposes specialist sub-scores into the final composite letter grade. Enforces sum-to-1 weight invariant.
editorGenerates the public-facing reasoning report. Plain-language, no hedging, with citations inline.
citation verifierConfirms every cited URL still resolves and the quoted excerpt matches the live source. Flags broken citations for human review.
qa auditorAudits a random 5% sample of scores weekly. Catches systematic drift, calibration errors, and individual specialist failures.
methodologyProposes methodology updates (new dimensions, weight changes, threshold adjustments). Requires human approval before deployment.

12. Methodology committee

Methodology changes require explicit human approval. The committee that signs methodology updates:

  • Dom (founder, methodology lead) — solo signatory until Q4 2026.
  • Fractional advisors — onboarding planned at $200K ARR. Target composition: one stablecoin-issuer veteran, one regulator-facing lawyer, one quant. Names will be published here when they sign on.

13. Citation and republication policy

Journalists, researchers, and integrators are welcome to cite the StableLens Score. Recommended format:

StableLens Score: USDC, 92/100 (A · Highly Compliant), retrieved 2026-05-26 from stablelens.com/stablecoins/USDC, methodology v2.0.

Bulk republication of scores or data feeds requires a written agreement (contact partnerships@stablelens.com). Individual scores in journalism, research, or non-competing products are fair use with attribution.