nep-for New Economics Papers
on Forecasting
Issue of 2026–01–12
eleven papers chosen by
Malte Knüppel, Deutsche Bundesbank


  1. Ultimate Forward Rate Prediction and its Application to Bond Yield Forecasting: A Machine Learning Perspective By Jiawei Du; Yi Hong
  2. A Real-Time Framework for Forecasting Metal Prices By Andrea Bastianin; Luca Rossini; Lorenzo Tonni
  3. Risk-Aware Financial Forecasting Enhanced by Machine Learning and Intuitionistic Fuzzy Multi-Criteria Decision-Making By Safiye Turgay; Serkan Erdo\u{g}an; \v{Z}eljko Stevi\'c; Orhan Emre Elma; Tevfik Eren; Zhiyuan Wang; Mahmut Bayda\c{s}
  4. Quantitative Financial Modeling for Sri Lankan Markets: Approach Combining NLP, Clustering and Time-Series Forecasting By Linuk Perera
  5. Hybrid Quantum-Classical Ensemble Learning for S\&P 500 Directional Prediction By Abraham Itzhak Weinberg
  6. Forward-Oriented Causal Observables for Non-Stationary Financial Markets By Lucas A. Souza
  7. A Test of Lookahead Bias in LLM Forecasts By Zhenyu Gao; Wenxi Jiang; Yutong Yan
  8. Inference for Forecasting Accuracy: Pooled versus Individual Estimators in High-dimensional Panel Data By Tim Kutta; Martin Schumann; Holger Dette
  9. Are the Bank of Korea's Inflation Forecasts Biased Toward the Target? By Eunkyu Seong; Seojeong Lee
  10. Uncertainty-Adjusted Sorting for Asset Pricing with Machine Learning By Yan Liu; Ye Luo; Zigan Wang; Xiaowei Zhang
  11. Overreaction in Expectations under Signal Extraction: Experimental Evidence By John Duffy; Nobuyuki Hanaki; Donghoon Yoo

  1. By: Jiawei Du; Yi Hong
    Abstract: This study focuses on forecasting the ultimate forward rate (UFR) and developing a UFRbased bond yield prediction model using data from Chinese treasury bonds and macroeconomic variables spanning from December 2009 to December 2024. The de Kort-Vellekooptype methodology is applied to estimate the UFR, incorporating the optimal turning parameter determination technique proposed in this study, which helps mitigate anomalous fluctuations. In addition, both linear and nonlinear machine learning techniques are employed to forecast the UFR and ultra-long-term bond yields. The results indicate that nonlinear machine learning models outperform their linear counterparts in forecasting accuracy. Incorporating macroeconomic variables, particularly price index-related variables, significantly improves the accuracy of predictions. Finally, a novel UFR-based bond yield forecasting model is developed, demonstrating superior performance across different bond maturities.
    Date: 2025–12
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2601.00011
  2. By: Andrea Bastianin; Luca Rossini; Lorenzo Tonni
    Abstract: This paper develops a real-time forecasting framework for the monthly real prices of four key industrial metals -- aluminum, copper, nickel, and zinc -- whose demand is rising due to their widespread use in manufacturing and low-carbon technologies. To replicate the information set available to forecasters in real time, we construct a new dataset combining daily financial variables with first-release macroeconomic indicators and use nowcasting techniques to address publication lags. Within this real-time environment, we evaluate the predictive accuracy of a broad set of univariate, multivariate, and factor-augmented models, comparing their performance with two industry benchmarks: survey expectations and futures-spot spread models. Results show that although short-run metal price movements remain difficult to predict, medium-term horizons display substantial forecastability. Indicators of manufacturing activity tied to primary metals -- such as new orders and capacity utilization -- significantly improve forecasting accuracy for aluminum and copper, with more moderate gains for zinc and limited improvements for nickel. Futures and survey forecasts generally underperform the real-time econometric models. These findings highlight the value of incorporating timely macroeconomic information into forecasting frameworks for industrial metal markets.
    Date: 2025–12
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2512.16521
  3. By: Safiye Turgay; Serkan Erdo\u{g}an; \v{Z}eljko Stevi\'c; Orhan Emre Elma; Tevfik Eren; Zhiyuan Wang; Mahmut Bayda\c{s}
    Abstract: In the face of increasing financial uncertainty and market complexity, this study presents a novel risk-aware financial forecasting framework that integrates advanced machine learning techniques with intuitionistic fuzzy multi-criteria decision-making (MCDM). Tailored to the BIST 100 index and validated through a case study of a major defense company in T\"urkiye, the framework fuses structured financial data, unstructured text data, and macroeconomic indicators to enhance predictive accuracy and robustness. It incorporates a hybrid suite of models, including extreme gradient boosting (XGBoost), long short-term memory (LSTM) network, graph neural network (GNN), to deliver probabilistic forecasts with quantified uncertainty. The empirical results demonstrate high forecasting accuracy, with a net profit mean absolute percentage error (MAPE) of 3.03% and narrow 95% confidence intervals for key financial indicators. The risk-aware analysis indicates a favorable risk-return profile, with a Sharpe ratio of 1.25 and a higher Sortino ratio of 1.80, suggesting relatively low downside volatility and robust performance under market fluctuations. Sensitivity analysis shows that the key financial indicator predictions are highly sensitive to variations of inflation, interest rates, sentiment, and exchange rates. Additionally, using an intuitionistic fuzzy MCDM approach, combining entropy weighting, evaluation based on distance from the average solution (EDAS), and the measurement of alternatives and ranking according to compromise solution (MARCOS) methods, the tabular data learning network (TabNet) outperforms the other models and is identified as the most suitable candidate for deployment. Overall, the findings of this work highlight the importance of integrating advanced machine learning, risk quantification, and fuzzy MCDM methodologies in financial forecasting, particularly in emerging markets.
    Date: 2025–12
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2512.17936
  4. By: Linuk Perera
    Abstract: This research introduces a novel quantitative methodology tailored for quantitative finance applications, enabling banks, stockbrokers, and investors to predict economic regimes and market signals in emerging markets, specifically Sri Lankan stock indices (S&P SL20 and ASPI) by integrating Environmental, Social, and Governance (ESG) sentiment analysis with macroeconomic indicators and advanced time-series forecasting. Designed to leverage quantitative techniques for enhanced risk assessment, portfolio optimization, and trading strategies in volatile environments, the architecture employs FinBERT, a transformer-based NLP model, to extract sentiment from ESG texts, followed by unsupervised clustering (UMAP/HDBSCAN) to identify 5 latent ESG regimes, validated via PCA. These regimes are mapped to economic conditions using a dense neural network and gradient boosting classifier, achieving 84.04% training and 82.0% validation accuracy. Concurrently, time-series models (SRNN, MLP, LSTM, GRU) forecast daily closing prices, with GRU attaining an R-squared of 0.801 and LSTM delivering 52.78% directional accuracy on intraday data. A strong correlation between S&P SL20 and S&P 500, observed through moving average and volatility trend plots, further bolsters forecasting precision. A rule-based fusion logic merges ESG and time-series outputs for final market signals. By addressing literature gaps that overlook emerging markets and holistic integration, this quant-driven framework combines global correlations and local sentiment analysis to offer scalable, accurate tools for quantitative finance professionals navigating complex markets like Sri Lanka.
    Date: 2025–12
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2512.20216
  5. By: Abraham Itzhak Weinberg
    Abstract: Financial market prediction is a challenging application of machine learning, where even small improvements in directional accuracy can yield substantial value. Most models struggle to exceed 55--57\% accuracy due to high noise, non-stationarity, and market efficiency. We introduce a hybrid ensemble framework combining quantum sentiment analysis, Decision Transformer architecture, and strategic model selection, achieving 60.14\% directional accuracy on S\&P 500 prediction, a 3.10\% improvement over individual models. Our framework addresses three limitations of prior approaches. First, architecture diversity dominates dataset diversity: combining different learning algorithms (LSTM, Decision Transformer, XGBoost, Random Forest, Logistic Regression) on the same data outperforms training identical architectures on multiple datasets (60.14\% vs.\ 52.80\%), confirmed by correlation analysis ($r>0.6$ among same-architecture models). Second, a 4-qubit variational quantum circuit enhances sentiment analysis, providing +0.8\% to +1.5\% gains per model. Third, smart filtering excludes weak predictors (accuracy $
    Date: 2025–12
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2512.15738
  6. By: Lucas A. Souza
    Abstract: We study short-horizon forecasting in financial time series under strict causal constraints, treating the market as a non-stationary stochastic system in which any predictive observable must be computable online from information available up to the decision time. Rather than proposing a machine-learning predictor or a direct price-forecast model, we focus on \emph{constructing} an interpretable causal signal from heterogeneous micro-features that encode complementary aspects of the dynamics (momentum, volume pressure, trend acceleration, and volatility-normalized price location). The construction combines (i) causal centering, (ii) linear aggregation into a composite observable, (iii) causal stabilization via a one-dimensional Kalman filter, and (iv) an adaptive ``forward-like'' operator that mixes the composite signal with a smoothed causal derivative term. The resulting observable is mapped into a transparent decision functional and evaluated through realized cumulative returns and turnover. An application to high-frequency EURUSDT (1-minute) illustrates that causally constructed observables can exhibit substantial economic relevance in specific regimes, while degrading under subsequent regime shifts, highlighting both the potential and the limitations of causal signal design in non-stationary markets.
    Date: 2025–12
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2512.24621
  7. By: Zhenyu Gao; Wenxi Jiang; Yutong Yan
    Abstract: We develop a statistical test to detect lookahead bias in economic forecasts generated by large language models (LLMs). Using state-of-the-art pre-training data detection techniques, we estimate the likelihood that a given prompt appeared in an LLM's training corpus, a statistic we term Lookahead Propensity (LAP). We formally show that a positive correlation between LAP and forecast accuracy indicates the presence and magnitude of lookahead bias, and apply the test to two forecasting tasks: news headlines predicting stock returns and earnings call transcripts predicting capital expenditures. Our test provides a cost-efficient, diagnostic tool for assessing the validity and reliability of LLM-generated forecasts.
    Date: 2025–12
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2512.23847
  8. By: Tim Kutta; Martin Schumann; Holger Dette
    Abstract: Panels with large time $(T)$ and cross-sectional $(N)$ dimensions are a key data structure in social sciences and other fields. A central question in panel data analysis is whether to pool data across individuals or to estimate separate models. Pooled estimators typically have lower variance but may suffer from bias, creating a fundamental trade-off for optimal estimation. We develop a new inference method to compare the forecasting performance of pooled and individual estimators. Specifically, we propose a confidence interval for the difference between their forecasting errors and establish its asymptotic validity. Our theory allows for complex temporal and cross-sectional dependence in the model errors and covers scenarios where $N$ can be much larger than $T$-including the independent case under the classical condition $N/T^2 \to 0$. The finite-sample properties of the proposed method are examined in an extensive simulation study.
    Date: 2025–12
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2512.15592
  9. By: Eunkyu Seong; Seojeong Lee
    Abstract: The Bank of Korea (BoK) regularly publishes the Economic Outlook, offering forecasts for key macroeconomic variables such as GDP growth, inflation, and unemployment rates. This study examines whether the BoK's inflation forecasts exhibit bias, specifically a tendency to align with its inflation target. We extend the Holden and Peel (1990) test to incorporate state-dependency, defining the state of the economy based on whether realized inflation falls below the target at the time of the forecast. Our analysis reveals that the BoK's inflation forecasts are biased under this state-dependent framework. Furthermore, we examine a range of bias correction strategies based on AR(1) and mean error models, including their state-dependent variants. These strategies generally improve forecast accuracy. Among them, the AR(1)-based correction exhibits relatively stable performance, consistently reducing the root mean square error.
    Date: 2025–12
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2512.16068
  10. By: Yan Liu; Ye Luo; Zigan Wang; Xiaowei Zhang
    Abstract: Machine learning is central to empirical asset pricing, but portfolio construction still relies on point predictions and largely ignores asset-specific estimation uncertainty. We propose a simple change: sort assets using uncertainty-adjusted prediction bounds instead of point predictions alone. Across a broad set of ML models and a U.S. equity panel, this approach improves portfolio performance relative to point-prediction sorting. These gains persist even when bounds are built from partial or misspecified uncertainty information. They arise mainly from reduced volatility and are strongest for flexible machine learning models. Identification and robustness exercises show that these improvements are driven by asset-level rather than time or aggregate predictive uncertainty.
    Date: 2026–01
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2601.00593
  11. By: John Duffy; Nobuyuki Hanaki; Donghoon Yoo
    Abstract: We experimentally evaluate three behavioral models of expectation formation that predict overreaction to new information: overconfidence in private signals, misperceptions about the persistence of the data-generating process (DGP), and diagnostic expectations. In our main experiment, participants repeatedly forecast the contemporaneous and one-step-ahead values of a random variable. They are incentivized for accuracy, informed of the exact DGP and its past history, and provided with noisy signals about the unobserved contemporaneous value. One treatment features a persistent AR(1) process, while another has no persistence. We also report on an experiment with no noisy signals. At the individual level, we find systematic overreaction even when the DGP is not persistent and regardless of whether a signal-extraction problem is present. By contrast, consensus (mean) forecasts exhibit underreaciton, consistent with evidence from other studies. Overall, our results indicate that misperceptions about persistence provide the most compelling explanation for the observed patterns of expectation formation.
    Date: 2025–10
    URL: https://d.repec.org/n?u=RePEc:dpr:wpaper:1293

This nep-for issue is ©2026 by Malte Knüppel. It is provided as is without any express or implied warranty. It may be freely redistributed in whole or in part for any purpose. If distributed in part, please include this notice.
General information on the NEP project can be found at https://nep.repec.org. For comments please write to the director of NEP, Marco Novarese at <director@nep.repec.org>. Put “NEP” in the subject, otherwise your mail may be rejected.
NEP’s infrastructure is sponsored by the School of Economics and Finance of Massey University in New Zealand.