Mind the Shift: Decoding Monetary Policy Stance from FOMC Statements with Large Language Models

LLM-native, label-free hawkish–dovish stance from FOMC statements.
Beats supervised baselines. Tracks inflation and interest rates.

LLM-native, label-free

Frozen LLM representations + no human annotations. Self-supervised from the temporal order of FOMC meetings.

Mind the shift

Markets react to stance changes, not absolutes. DCS models inter-meeting shifts, which prior methods miss.

Beats every baseline

Tops supervised and LLM-judge methods. Spearman 0.62 with CPI. Significant β across 2Y, 10Y, 20Y yields.

Why Relative Shift Matters

A moderately hawkish statement is still a dovish shift if the previous one was more hawkish. Markets price the delta, not the level. DCS is the first method to enforce this—no labels needed.

Empirical Evidence Leaderboard

DCS leads on inflation correlation and yield regression across 1B–14B models. Ranked by average of shown metrics; best per column bolded.

Inflation correlation

Meeting-level stance scores and year-over-year inflation changes.
# Model Method CPI (YoY) PPI (YoY)
Pearson Spearman Pearson Spearman
1DeepSeek-R1-14BDCS (Ours)0.5020.6240.4800.553
2Qwen3-4BDCS (Ours)0.4590.6060.4340.524
3Llama-3.1-8BDCS (Ours)0.4810.5380.4240.455
4Llama-3.1-8BLogit-Based Judge0.5030.4170.3300.292
5Llama-3.2-1BDCS (Ours)0.3450.5400.3880.509
6FOMC-RoBERTaSupervised0.3880.4460.2880.285
7Dictionary0.3140.3840.2890.299
8DeepSeek-R1-14BLLM Judge0.3890.3400.2570.242
9Qwen3-4BLogit-Based Judge0.3800.3410.2530.211
10Llama-3.2-1BLinear Probe0.1960.4040.1550.376
11Llama-3.1-8BLLM Judge0.3540.2800.2170.196
12Llama-3.2-1BLLM Judge0.2380.3190.1520.180
13DeepSeek-R1-14BLogit-Based Judge0.2270.1790.0780.069
14Llama-3.2-1BLogit-Based Judge0.0650.2270.0600.181
15Qwen3-4BLLM Judge0.1700.1410.0800.048
16Qwen3-4BLinear Probe0.1660.2060.0320.009
17Llama-3.1-8BLinear Probe0.0340.1620.0000.187
18DeepSeek-R1-14BLinear Probe0.0550.0720.0610.039

Treasury yield regression

Yield levels on stance scores.
# Model Method 2Y Yield 10Y Yield 20Y Yield
β p β p β p
1DeepSeek-R1-14BDCS (Ours)2.121<.010.782<.010.771<.01
2Llama-3.1-8BDCS (Ours)0.842<.010.796<.010.852<.01
3FOMC-RoBERTaSupervised1.058<.010.536<.010.402<.01
4Llama-3.1-8BLLM Judge0.854<.010.328<.010.202.096
5Llama-3.1-8BLogit-Based Judge0.901<.010.299.0120.158.185
6Llama-3.1-8BLinear Probe0.055.7540.227.0670.326<.01
Label-free — self-supervised from meeting order Up to 71.1% accuracy — beats supervised + LLM-judge Market-reflected — significant across CPI, PPI, yields

Stance Over Time

DCS scores track with Fed rate cycles—and lead them. Hover or drag to explore each meeting.

Stance over time

Higher score = more hawkish. Switch panels to view level (DCS & FFR) or relative stance (delta).
Hover or drag

Which Sentences Drive the Score?

Red = hawkish, blue = dovish. Hover to see each sentence's contribution (Δ) to the final score.

How Teams Use DCS

Macro research

Detect policy regime shifts; use stance as a leading-indicator input for CPI/PPI views.

Rates strategy

A structured language factor for yield-curve forecasting and duration positioning.

Risk monitoring

Transparent central-bank communication signal for scenario analysis and risk reporting.