TADiff | Counterfactual Time Series Forecasting with Textual Conditions

01 · Abstract

Forecasting beyond what actually happens

One observed history can lead to many plausible futures. TADiff makes those futures controllable with natural language.

Time series forecasting is increasingly shaped by forthcoming events rather than historical patterns alone. Traditional models usually rely on historical observations or factual future conditions, making it difficult to reason about counterfactual scenarios such as unexpected traffic, weather, health, or market events.

We introduce counterfactual time series forecasting with textual conditions, where future conditions are expressed as flexible natural-language descriptions. TADiff uses a text-attribution mechanism to separate mutable textual effects from immutable historical factors, then forecasts with a condition-aware diffusion process.

The project also includes an evaluation framework for both factual and counterfactual settings. DTTC-I measures consistency with intrinsic historical factors, while DTTC-E measures consistency with future textual semantics when the true counterfactual future is unavailable.

What-if text

Flexible future conditions

Describe events in natural language instead of fixed intervention categories.

Attribution

Intrinsic history first

Extract condition-independent features before generating the future.

Diffusion

Distributional forecasts

Denoise future sequences under target textual assumptions.

DTTC

No-ground-truth scoring

Evaluate semantic consistency for futures that never happened.

02 · Method

Overview

TADiff first attributes condition-invariant historical signals, then generates factual or counterfactual futures under target textual conditions.

TADiff framework with attribution and forecasting pipelines

Training jointly optimizes attribution and forecasting. Inference first attributes x_h,T from history, then uses it as the initial diffusion state for future forecasting.

Attribute history

Diffuse the historical sequence under historical text to remove extrinsic context and obtain intrinsic features x_h,T.

Forecast with future text

Initialize the future diffusion state with x_h,T, concatenate clear history, and denoise under the future textual condition c_f.

Finetune for what-if shifts

Construct alternative future conditions and optimize semantic alignment even when counterfactual ground-truth futures are absent.

Diffusion

Uncertainty-aware forecasting

DDIM-style denoising models stochastic time-series evolution under given text conditions.

Attribution

Intrinsic initial state

Instead of random Gaussian noise, TADiff starts future generation from attributed historical features.

Evaluation

Factual + counterfactual

MAE/MSE are used with ground truth; DTTC handles unobserved counterfactual futures.

03 · Evaluation

Main results

16 / 20Best / Tied-best

1.50Average Rank ↓

Dataset	Metric	TADiff (Ours)	DLinear	PatchTST	Sundial	VerbalTS	TimeCMA	TimeMMD	CT	IATSF
Synth	MAE ↓	0.54	0.73	0.67	0.76	1.00	0.82	0.66	0.57	0.55
	MSE ↓	0.51	0.87	0.73	1.01	1.53	1.09	0.67	0.34	0.48
	DTTC-I ↑	15.37	13.75	14.65	13.73	12.42	13.64	14.30	13.43	13.08
	DTTC-E ↑	78.40	64.68	67.06	64.25	74.30	64.26	68.87	67.68	63.32
ETTm1	MAE ↓	0.57	0.84	0.82	0.90	0.58	0.88	0.79	0.79	0.82
	MSE ↓	0.76	1.52	1.55	1.90	0.87	1.61	1.37	1.29	1.42
	DTTC-I ↑	10.01	8.99	9.97	9.96	8.16	8.34	9.74	8.44	7.92
	DTTC-E ↑	146.91	86.33	109.60	96.98	141.14	74.17	108.14	88.22	80.65
Traffic	MAE ↓	0.35	0.52	0.42	0.40	0.80	0.93	0.41	0.45	0.91
	MSE ↓	0.27	0.43	0.29	0.34	1.07	1.07	0.28	0.34	1.05
	DTTC-I ↑	17.90	11.27	12.97	16.36	9.73	6.31	12.97	11.12	6.52
	DTTC-E ↑	81.38	48.31	58.32	66.14	53.75	21.71	58.86	54.10	24.21
Exchange	MAE ↓	0.15	0.15	0.14	0.14	0.21	0.15	0.22	0.23	0.12
	MSE ↓	0.04	0.04	0.04	0.04	0.10	0.04	0.08	0.10	0.04
	DTTC-I ↑	17.37	16.62	16.78	17.04	15.62	16.62	16.03	15.71	16.33
	DTTC-E ↑	216.03	204.90	208.68	206.49	212.13	204.64	184.25	199.23	209.12
Weather	MAE ↓	0.18	0.26	0.19	0.24	0.51	0.30	0.27	0.31	0.18
	MSE ↓	0.11	0.15	0.11	0.14	0.46	0.19	0.13	0.17	0.09
	DTTC-I ↑	26.65	26.79	26.83	26.83	24.85	26.84	26.41	26.24	26.54
	DTTC-E ↑	31.55	29.60	30.25	29.44	31.23	29.45	29.25	29.98	29.81

8 / 8Best / Tied-best

1.00Average Rank ↓

Dataset	Metric	TADiff (Ours)	DLinear	PatchTST	Sundial	VerbalTS	TimeCMA	TimeMMD	CT	IATSF
ETTm1	DTTC-I ↑	10.59	8.96	10.51	9.94	8.62	9.47	10.14	8.43	7.93
ETTm1	DTTC-E ↑	148.72	96.45	113.02	107.14	140.95	103.42	113.65	93.24	87.71
Traffic	DTTC-I ↑	18.08	11.33	12.99	16.32	9.44	6.36	12.96	11.15	6.57
Traffic	DTTC-E ↑	78.58	48.41	54.86	64.79	53.69	22.71	56.10	52.67	24.62
Exchange	DTTC-I ↑	17.30	16.61	16.79	17.03	15.29	16.79	16.25	15.67	16.31
Exchange	DTTC-E ↑	213.94	200.45	201.30	200.57	211.37	201.84	196.16	194.49	199.65
Weather	DTTC-I ↑	26.93	26.79	26.83	26.83	24.43	26.84	26.75	26.25	26.53
Weather	DTTC-E ↑	30.99	29.04	29.25	28.91	30.16	29.00	29.30	29.23	29.18

04 · Showcase

Showcase

05 · Citation

BibTeX

@inproceedings{gu2026tadiff,
  title     = {What if Tomorrow is the World Cup Final? Counterfactual Time Series Forecasting with Textual Conditions},
  author    = {Gu, Shuqi and Zhao, Yongxiang and Jing, Baoyu and Ren, Kan},
  booktitle = {International Conference on Machine Learning (ICML)},
  year      = {2026}
}