Methodology

How CommodityNode Works

Our approach to calculating impact scores, correlations, sensitivity ratings, and confidence levels across every Signal Report and Impact Map.

Research brief

CommodityNode methodology explains how commodity shocks are mapped into forecast ranges, model agreement, exposed sectors, company sensitivity, and research-only scenario memos. The platform is research-only — not investment advice, not trading signals, not brokerage, and not order execution.

The numbers behind the consensus

Walk-forward score, holdout RMSE, no-harm routing rationale, and the most recent 8-window benchmark — pinned here so methodology stops being a black box.

Walk-forward score

8-window walk-forward backtest across the 5-commodity RL universe. Higher = stronger out-of-sample reward vs hold-only baseline.

0.62 vs 0.41 hold-only · seed 42

Holdout RMSE (90-day)

Median absolute % error on the 15% held-out test split, averaged across 5 commodities.

8.4% Chronos-2 + TimesFM 2.5 ensemble

No-harm routing

Consensus only adopts a model's path when it improves expected reward without harming worst-window calibration. The rejected path is logged for transparency.

  • Chronos-2 win rate: 58%
  • TimesFM 2.5 win rate: 42%
  • Tie / blend cases: 31%

Recent 8-window benchmark

How the policy performed in each of the last 8 walk-forward windows. Two underperformed in elevated-vol regimes — disclosed, not hidden.

Impact Score Calculation

Each node on a CommodityNode Impact Map displays an impact percentage — the estimated price response of that asset to a 10% move in the underlying commodity.

  • Level 1 (Direct Impact): Calculated from 5-year rolling regression of asset returns against commodity price changes, using weekly data. We use the beta coefficient scaled to a 10% commodity move.
  • Level 2 (Secondary Impact): Derived from supply-chain input-output relationships. If Company A sources 30% of its COGS from a Level 1 commodity, and the commodity moves 10%, we estimate the margin impact after typical hedging and pass-through rates.
  • Level 3 (Tertiary Impact): Estimated using industry-level sensitivity models. These capture second-order effects like demand destruction, substitution, and capex cycle changes.
  • Level 4 (Macro Factors): Based on historical macro-commodity relationships — e.g., the oil-CPI transmission coefficient, USD-commodity inverse correlation, and freight cost indices.

Impact scores are directional estimates, not precise predictions. They represent the most likely magnitude and direction of response based on historical patterns.

Data Sources

CommodityNode draws on multiple data sources to build each Signal Report:

  • Price data: Yahoo Finance API for real-time and historical commodity, equity, and ETF prices
  • Fundamental data: SEC filings (10-K, 10-Q) for revenue breakdowns, cost structures, and hedging disclosures
  • Industry reports: IEA, USDA, World Bureau of Metal Statistics, LME, and other commodity-specific reporting bodies
  • Supply chain data: Corporate supply chain disclosures, Bloomberg supply chain mapping, and industry association data
  • Macro data: Federal Reserve (FRED), BLS, and central bank publications for inflation, employment, and monetary policy context

Data Types

CommodityNode uses three types of price data:

Important: when an upstream feed becomes unreliable because of futures rollover, unit discontinuity, frozen prints, or proxy distortion, we may temporarily suppress the displayed day-over-day change instead of showing a misleading figure. We treat missing data as preferable to false precision.

Type Description Examples
Direct FuturesExchange-traded commodity futuresCrude Oil (CL=F), Gold (GC=F), Copper (HG=F)
ETF ProxyETF tracking commodity sectorUranium (URA), Steel (SLX)
Equity ProxyIndividual company stock as indirect indicatorLithium via Albemarle (ALB), Iron Ore via Vale (VALE)

Proxy prices reflect the performance of the proxy asset, not the underlying commodity spot/futures price directly. They may diverge from commodity prices due to company-specific factors, currency effects, or fund composition. All proxy assets are clearly labeled on their respective hub pages.

Correlation Methodology

Correlation values displayed on each node represent the Pearson correlation coefficient between the asset's weekly returns and the commodity's weekly price changes over a rolling 3-year window.

  • Range: -1.0 (perfect inverse) to +1.0 (perfect positive)
  • Update frequency: Recalculated quarterly using the most recent 156 weeks of data
  • Significance: We only display correlations where p-value < 0.05 (95% confidence)
  • Lag adjustment: For secondary and tertiary impacts, we test correlations at 0, 1, 2, and 4-week lags and report the strongest statistically significant relationship

Sensitivity Analysis

Sensitivity ratings classify how responsive an asset or sector is to commodity price movements:

  • Very High: Impact > 10%, Correlation > 0.80 — asset moves almost in lockstep with the commodity
  • High: Impact 5–10%, Correlation 0.60–0.80 — strong, reliable transmission
  • Medium: Impact 2–5%, Correlation 0.40–0.60 — meaningful but moderated by other factors
  • Low: Impact < 2%, Correlation < 0.40 — commodity is one of many drivers

Sensitivity accounts for hedging ratios (from 10-K disclosures), contractual pass-through mechanisms, and inventory buffer effects that dampen or amplify raw price transmission.

Confidence Levels

Each Signal Report carries a confidence assessment based on data quality and model reliability:

  • High: 5+ years of clean price data, well-documented supply chain, multiple confirming data sources, statistically significant correlations (p < 0.01)
  • Medium-High: 3–5 years of data, established commodity-equity relationship, some hedging uncertainty
  • Medium: 2–3 years of data, or newer commodity/equity with less historical context. Correlations may be regime-dependent.
  • Low-Medium: Limited historical data, emerging commodity (e.g., lithium pre-2020), or structural market changes that may invalidate historical patterns

Node Types

Impact Map nodes are classified by their relationship to the commodity:

  • Commodity (gold center) — the primary commodity being analyzed
  • ETF (purple) — exchange-traded funds providing direct or sector exposure
  • Producer (orange) — companies that extract or produce the commodity
  • Processor (cyan) — companies that refine or transform the commodity
  • Consumer (orange-light) — companies or sectors that consume the commodity as input
  • Supplier (green-light) — upstream suppliers to the commodity's production chain
  • Substitute (yellow) — alternative commodities or technologies
  • Regional (blue-light) — geographic or country-level exposure factors
  • Macro (purple-light) — macroeconomic factors like FX, inflation, policy
  • Policy (pink) — regulatory, tariff, or government policy factors

Data Sources — Complete List

CommodityNode integrates data from the following sources, updated on the schedules noted:

Source Data Type Update Frequency
Yahoo Finance APICommodity, equity, ETF prices (historical & real-time)Daily
FRED (Federal Reserve Economic Data)CPI, employment, interest rates, money supplyMonthly
US Bureau of Labor Statistics (BLS)Producer Price Index, Consumer Price Index subcategoriesMonthly
SEC EDGAR Filings10-K, 10-Q — revenue breakdowns, cost structures, hedging disclosuresQuarterly/Annual
EIA (US Energy Information Administration)Oil/gas storage, production, consumption, trade flowsWeekly
USDA (US Dept of Agriculture)Crop reports, WASDE supply/demand estimates, export dataMonthly
IEA (International Energy Agency)Global oil/gas supply-demand balances, energy outlooksMonthly
World Bureau of Metal StatisticsMetal supply-demand balances, production dataMonthly
LME (London Metal Exchange)Metal warehouse stocks, settlement pricesDaily
COMEX / CME GroupFutures prices, open interest, COT positioning dataWeekly
World Gold CouncilCentral bank purchases, gold demand/supply dataQuarterly
World Nuclear AssociationReactor pipeline, uranium supply-demandSemi-annual

Correlation Calculation — Technical Detail

Our Pearson correlation coefficients are computed as follows:

  1. Data preparation: Weekly closing prices for both the commodity and the related asset, aligned to Friday close. Missing data points are forward-filled (last observation carried forward). Weeks with holidays in either market are excluded.
  2. Return calculation: Log returns — ln(Pt/Pt-1) — to normalize for compounding effects and ensure statistical stationarity.
  3. Window: Rolling 156-week (3-year) window, recalculated quarterly. We use a 3-year window as a balance between statistical significance (minimum ~100 observations) and recency (capturing recent structural shifts).
  4. Significance test: T-test with H₀: ρ = 0. We display correlations only when p-value < 0.05. The t-statistic is computed as t = r√(n-2) / √(1-r²), where r is the sample correlation and n is the number of observations.
  5. Lag testing: For Level 2+ nodes, we test correlations at 0, 1, 2, and 4-week lags and report the lag with the strongest statistically significant relationship. This captures delayed transmission effects (e.g., oil → refiner margins have a 1-2 week lag).

Node Graph Layout Logic

Impact maps use D3.js force-directed graph simulation with the following configuration:

  • Center force: The commodity node is pinned at the canvas center with a strong gravitational pull (forceCenter).
  • Radial force: Nodes are assigned to concentric rings based on their level (1-6). A radial force pushes each node toward its assigned ring radius, creating the ripple-ring visual structure.
  • Link force: Edges between connected nodes exert spring forces that keep parent-child relationships visually proximate.
  • Collision force: Nodes repel each other at short range to prevent label overlap.
  • Charge force: A mild electrostatic repulsion (forceManyBody) ensures even distribution within each ring.
  • Node sizing: Node radius is proportional to |impact|, so high-impact nodes are visually larger.
  • Color encoding: Node type determines color (see Node Types above). Edge opacity is proportional to |correlation|.

The simulation runs for 300 ticks on page load. Users can drag nodes to adjust layout; dragged nodes are pinned to their new position.

Ripple Chain Methodology

Ripple Chains trace multi-hop impact paths from a commodity shock to downstream effects. Each hop represents a documented transmission mechanism:

  • Hop 1: Direct price sensitivity (commodity → producer/consumer)
  • Hop 2: Cost pass-through (producer → downstream industry)
  • Hop 3: Second-order demand effects (industry → end consumer/macro)
  • Hop 4: Macro feedback loops (inflation → policy → commodity)

Ripple Chain strength diminishes with each hop. We typically see 60–80% of the signal transmitted at Hop 1, 30–50% at Hop 2, and 10–25% at Hop 3+.

Transparency & Limitations

Historical Backtesting Bias

Correlation coefficients and sensitivity betas are calculated from historical data and may not reflect future relationships, particularly during structural breaks or regime changes.

Proxy Symbol Limitations

26 of our 60 commodity hubs use ETF or equity proxies rather than direct futures prices. These proxies introduce basis risk and may not perfectly track the underlying commodity. All proxy hubs display a "Proxy" badge.

Research-only intelligence

CommodityNode reports and forecasts are research-only analytical tools: not investment advice, not trading signals, not brokerage, not order execution, and not guaranteed outcomes.

Data Freshness

Price data is updated daily via automated scripts. AI forecasts are recalculated weekly. Correlation matrices refresh on a 30/60/90-day rolling basis. All data displays its last-updated timestamp.

Disclaimer

CommodityNode provides research-only market intelligence and analytical tools for educational, informational, and business-planning purposes. Outputs are not investment advice, not trading signals, not brokerage, not order execution, and not guaranteed outcomes.

Impact scores, correlations, and sensitivity ratings are based on historical data and statistical models. Past performance does not guarantee future results. Commodity markets are inherently volatile and subject to rapid regime changes that can invalidate historical relationships.

Use CommodityNode outputs as scenario context for further research, operational planning, procurement review, and company-sensitivity analysis. CommodityNode is not a registered investment adviser, broker, or execution venue.

Data may contain errors or delays. We make no warranties about the accuracy, completeness, or timeliness of the information presented.

Stay Informed

Weekly Commodity Signal Digest

Every Monday: the 3 most important commodity risk moves, biggest supply disruptions, and key events to watch. Free, no spam.

No spam. Unsubscribe anytime.

✓ Weekly research notes ✓ Disruption alerts ✓ Key events calendar

Update schedule, Forecast methodology, and Corrections Policy

Update schedule: price data, forecast consensus, and hub freshness labels are reviewed on the daily publishing loop; weak feeds are labelled or suppressed instead of shown as precise live values.

Forecast methodology: CommodityNode compares model-assisted forecast ranges, proxy benchmarks, and editorial context. Forecasts are uncertainty ranges for research and planning, not point promises.

Corrections: send factual corrections, stale source reports, or methodology questions to corrections@commoditynode.com. Material corrections are reviewed by the editorial team and reflected in the relevant hub/report freshness note.