nep-big 2025-03-24 papers

on Big Data

Issue of 2025–03–24
thirty-one papers chosen by
Tom Coupé, University of Canterbury

Multi-Agent Stock Prediction Systems: Machine Learning Models, Simulations, and Real-Time Trading Strategies By Daksh Dave; Gauransh Sawhney; Vikhyat Chauhan
Multimodal Stock Price Prediction By Furkan Karada\c{s}; Bahaeddin Eravc{\i}; Ahmet Murat \"Ozbayo\u{g}lu
From Offer to Close: A Machine Learning Approach to Forecast Real Estate Transaction Outcomes By Zhao, Yu
A Distillation-based Future-aware Graph Neural Network for Stock Trend Prediction By Zhipeng Liu; Peibo Duan; Mingyang Geng; Bin Zhang
Machine Learning for Propensity Score Estimation: A Systematic Review and Reporting Guidelines By Leite, Walter; Zhang, Huibin; collier, zachary; Chawla, Kamal; , l.kong@ufl.edu; Lee, Yongseok; Quan, Jia; Soyoye, Olushola
A Method for Evaluating the Interpretability of Machine Learning Models in Predicting Bond Default Risk Based on LIME and SHAP By Yan Zhang; Lin Chen; Yixiang Tian
Stock Price Prediction Using a Hybrid LSTM-GNN Model: Integrating Time-Series and Graph-Based Analysis By Meet Satishbhai Sonani; Atta Badii; Armin Moin
Robust and Efficient Deep Hedging via Linearized Objective Neural Network By Lei Zhao; Lin Cai
FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading By Guojun Xiong; Zhiyang Deng; Keyi Wang; Yupeng Cao; Haohang Li; Yangyang Yu; Xueqing Peng; Mingquan Lin; Kaleb E Smith; Xiao-Yang Liu; Jimin Huang; Sophia Ananiadou; Qianqian Xie
Recurrent Neural Networks for Dynamic VWAP Execution: Adaptive Trading Strategies with Temporal Kolmogorov-Arnold Networks By Remi Genet
Generalized Factor Neural Network Model for High-dimensional Regression By Zichuan Guo; Mihai Cucuringu; Alexander Y. Shestopaloff
Next-Gen Dynamic and Deal-Based Pricing Strategy in Automotive and Financial Services By Hota, Ashish
FinBloom: Knowledge Grounding Large Language Model with Real-time Financial Data By Ankur Sinha; Chaitanya Agarwal; Pekka Malo
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management By Lei Zhao; Lin Cai; Wu-Sheng Lu
Bankruptcy analysis using images and convolutional neural networks (CNN) By Luiz Tavares; Jose Mazzon; Francisco Paletta; Fabio Barros
A Supervised Screening and Regularized Factor-Based Method for Time Series Forecasting By Sihan Tu; Zhaoxing Gao
Gradients can train reward models: An Empirical Risk Minimization Approach for Offline Inverse RL and Dynamic Discrete Choice Model By Enoch H. Kang; Hema Yoganarasimhan; Lalit Jain
Balancing Flexibility and Interpretability: A Conditional Linear Model Estimation via Random Forest By Ricardo Masini; Marcelo Medeiros
Event-Based Limit Order Book Simulation under a Neural Hawkes Process: Application in Market-Making By Luca Lalor; Anatoliy Swishchuk
Toward Proactive Policy Design: Identifying 'To-Be' Energy-Poor Households Using Shap for Early Intervention By Budría, Santiago; Fermé, Eduardo; Freitas, Diogo Nuno
ChatGPT and Deepseek: Can They Predict the Stock Market and Macroeconomy? By Jian Chen; Guohao Tang; Guofu Zhou; Wu Zhu
Using Machine Learning to Understand the Heterogeneous Earnings Effects of Exports By Muffert, Johanna; Winkler, Erwin
Humans vs GPTs: Bias and validity in hiring decisions By Lippens, Louis
A Comprehensive Approach to Behavioral Data Analysis and Machine Learning within Unified Systems By Hota, Ashish
A Multi-LLM-Agent-Based Framework for Economic and Public Policy Analysis By Yuzhi Hao; Danyang Xie
Economic Causal Inference Based on DML Framework: Python Implementation of Binary and Continuous Treatment Variables By Shunxin Yao
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies By Zheli Xiong
Utilizing Effective Dynamic Graph Learning to Shield Financial Stability from Risk Propagation By Guanyuan Yu; Qing Li; Yu Zhao; Jun Wang; YiJun Chen; Shaolei Chen
Stories that (are) Move(d by) Markets: A Causal Exploration of Market Shocks and Semantic Shifts across Different Partisan Groups By Felix Drinkall; Stefan Zohren; Michael McMahon; Janet B. Pierrehumbert
Grounded Persuasive Language Generation for Automated Marketing By Jibang Wu; Chenghao Yang; Simon Mahns; Chaoqi Wang; Hao Zhu; Fei Fang; Haifeng Xu
HedgeAgents: A Balanced-aware Multi-agent Financial Trading System By Xiangyu Li; Yawen Zeng; Xiaofen Xing; Jin Xu; Xiangmin Xu

Multi-Agent Stock Prediction Systems: Machine Learning Models, Simulations, and Real-Time Trading Strategies

By:	Daksh Dave; Gauransh Sawhney; Vikhyat Chauhan
Abstract:	This paper presents a comprehensive study on stock price prediction, leveragingadvanced machine learning (ML) and deep learning (DL) techniques to improve financial forecasting accuracy. The research evaluates the performance of various recurrent neural network (RNN) architectures, including Long Short-Term Memory (LSTM) networks, Gated Recurrent Units (GRU), and attention-based models. These models are assessed for their ability to capture complex temporal dependencies inherent in stock market data. Our findings show that attention-based models outperform other architectures, achieving the highest accuracy by capturing both short and long-term dependencies. This study contributes valuable insights into AI-driven financial forecasting, offering practical guidance for developing more accurate and efficient trading systems.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.15853

Multimodal Stock Price Prediction

By:	Furkan Karada\c{s}; Bahaeddin Eravc{\i}; Ahmet Murat \"Ozbayo\u{g}lu
Abstract:	In an era where financial markets are heavily influenced by many static and dynamic factors, it has become increasingly critical to carefully integrate diverse data sources with machine learning for accurate stock price prediction. This paper explores a multimodal machine learning approach for stock price prediction by combining data from diverse sources, including traditional financial metrics, tweets, and news articles. We capture real-time market dynamics and investor mood through sentiment analysis on these textual data using both ChatGPT-4o and FinBERT models. We look at how these integrated data streams augment predictions made with a standard Long Short-Term Memory (LSTM model) to illustrate the extent of performance gains. Our study's results indicate that incorporating the mentioned data sources considerably increases the forecast effectiveness of the reference model by up to 5%. We also provide insights into the individual and combined predictive capacities of these modalities, highlighting the substantial impact of incorporating sentiment analysis from tweets and news articles. This research offers a systematic and effective framework for applying multimodal data analytics techniques in financial time series forecasting that provides a new view for investors to leverage data for decision-making.
Date:	2025–01
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.05186

From Offer to Close: A Machine Learning Approach to Forecast Real Estate Transaction Outcomes

By:	Zhao, Yu
Abstract:	Accurately forecasting whether a real estate transaction will close is crucial for agents, lenders, and investors, impacting resource allocation, risk management, and client satisfaction. This task, however, is complex due to a combination of economic, procedural, and behavioral factors that influence transaction outcomes. Traditional machine learning approaches, particularly gradient boosting models like Gradient Boost Decision Tree, have proven effective for tabular data, outperforming deep learning models on structured datasets. However, recent advances in attention-based deep learning models present new opportunities to capture temporal dependencies and complex interactions within transaction data, potentially enhancing prediction accuracy. This article explores the challenges of forecasting real estate transaction closures, compares the performance of machine learning models, and examines how attention-based models can improve predictive insights in this critical area of real estate analytics.
Date:	2024–11–08
URL:	https://d.repec.org/n?u=RePEc:osf:osfxxx:sxmq2_v1

A Distillation-based Future-aware Graph Neural Network for Stock Trend Prediction

By:	Zhipeng Liu; Peibo Duan; Mingyang Geng; Bin Zhang
Abstract:	Stock trend prediction involves forecasting the future price movements by analyzing historical data and various market indicators. With the advancement of machine learning, graph neural networks (GNNs) have been extensively employed in stock prediction due to their powerful capability to capture spatiotemporal dependencies of stocks. However, despite the efforts of various GNN stock predictors to enhance predictive performance, the improvements remain limited, as they focus solely on analyzing historical spatiotemporal dependencies, overlooking the correlation between historical and future patterns. In this study, we propose a novel distillation-based future-aware GNN framework (DishFT-GNN) for stock trend prediction. Specifically, DishFT-GNN trains a teacher model and a student model, iteratively. The teacher model learns to capture the correlation between distribution shifts of historical and future data, which is then utilized as intermediate supervision to guide the student model to learn future-aware spatiotemporal embeddings for accurate prediction. Through extensive experiments on two real-world datasets, we verify the state-of-the-art performance of DishFT-GNN.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.10776

Machine Learning for Propensity Score Estimation: A Systematic Review and Reporting Guidelines

By:	Leite, Walter; Zhang, Huibin; collier, zachary; Chawla, Kamal; , l.kong@ufl.edu; Lee, Yongseok (University of Florida); Quan, Jia; Soyoye, Olushola
Abstract:	Machine learning has become a common approach for estimating propensity scores for quasi-experimental research using matching, weighting, or stratification on the propensity score. This systematic review examined machine learning applications for propensity score estimation across different fields, such as health, education, social sciences, and business over 40 years. The results show that the gradient boosting machine (GBM) is the most frequently used method, followed by random forest. Classification and regression trees (CART), neural networks, and the super learner were also used in more than five percent of studies. The most frequently used packages to estimate propensity scores were twang, gbm and randomforest in the R statistical software. The review identified many hyperparameter configurations used for machine learning methods. However, it also shows that hyperparameters are frequently under-reported, as well as critical steps of the propensity score analysis, such as the covariate balance evaluation. A set of guidelines for reporting the use of machine learning for propensity score estimation is provided.
Date:	2024–10–09
URL:	https://d.repec.org/n?u=RePEc:osf:osfxxx:gmrk7_v1

A Method for Evaluating the Interpretability of Machine Learning Models in Predicting Bond Default Risk Based on LIME and SHAP

By:	Yan Zhang; Lin Chen; Yixiang Tian
Abstract:	Interpretability analysis methods for artificial intelligence models, such as LIME and SHAP, are widely used, though they primarily serve as post-model for analyzing model outputs. While it is commonly believed that the transparency and interpretability of AI models diminish as their complexity increases, currently there is no standardized method for assessing the inherent interpretability of the models themselves. This paper uses bond market default prediction as a case study, applying commonly used machine learning algorithms within AI models. First, the classification performance of these algorithms in default prediction is evaluated. Then, leveraging LIME and SHAP to assess the contribution of sample features to prediction outcomes, the paper proposes a novel method for evaluating the interpretability of the models themselves. The results of this analysis are consistent with the intuitive understanding and logical expectations regarding the interpretability of these models.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.19615

Stock Price Prediction Using a Hybrid LSTM-GNN Model: Integrating Time-Series and Graph-Based Analysis

By:	Meet Satishbhai Sonani; Atta Badii; Armin Moin
Abstract:	This paper presents a novel hybrid model that integrates long-short-term memory (LSTM) networks and Graph Neural Networks (GNNs) to significantly enhance the accuracy of stock market predictions. The LSTM component adeptly captures temporal patterns in stock price data, effectively modeling the time series dynamics of financial markets. Concurrently, the GNN component leverages Pearson correlation and association analysis to model inter-stock relational data, capturing complex nonlinear polyadic dependencies influencing stock prices. The model is trained and evaluated using an expanding window validation approach, enabling continuous learning from increasing amounts of data and adaptation to evolving market conditions. Extensive experiments conducted on historical stock data demonstrate that our hybrid LSTM-GNN model achieves a mean square error (MSE) of 0.00144, representing a substantial reduction of 10.6% compared to the MSE of the standalone LSTM model of 0.00161. Furthermore, the hybrid model outperforms traditional and advanced benchmarks, including linear regression, convolutional neural networks (CNN), and dense networks. These compelling results underscore the significant potential of combining temporal and relational data through a hybrid approach, offering a powerful tool for real-time trading and financial analysis.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.15813

Robust and Efficient Deep Hedging via Linearized Objective Neural Network

By:	Lei Zhao; Lin Cai
Abstract:	Deep hedging represents a cutting-edge approach to risk management for financial derivatives by leveraging the power of deep learning. However, existing methods often face challenges related to computational inefficiency, sensitivity to noisy data, and optimization complexity, limiting their practical applicability in dynamic and volatile markets. To address these limitations, we propose Deep Hedging with Linearized-objective Neural Network (DHLNN), a robust and generalizable framework that enhances the training procedure of deep learning models. By integrating a periodic fixed-gradient optimization method with linearized training dynamics, DHLNN stabilizes the training process, accelerates convergence, and improves robustness to noisy financial data. The framework incorporates trajectory-wide optimization and Black-Scholes Delta anchoring, ensuring alignment with established financial theory while maintaining flexibility to adapt to real-world market conditions. Extensive experiments on synthetic and real market data validate the effectiveness of DHLNN, demonstrating its ability to achieve faster convergence, improved stability, and superior hedging performance across diverse market scenarios.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.17757

FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading

By:	Guojun Xiong; Zhiyang Deng; Keyi Wang; Yupeng Cao; Haohang Li; Yangyang Yu; Xueqing Peng; Mingquan Lin; Kaleb E Smith; Xiao-Yang Liu; Jimin Huang; Sophia Ananiadou; Qianqian Xie
Abstract:	Large language models (LLMs) fine-tuned on multimodal financial data have demonstrated impressive reasoning capabilities in various financial tasks. However, they often struggle with multi-step, goal-oriented scenarios in interactive financial markets, such as trading, where complex agentic approaches are required to improve decision-making. To address this, we propose \textsc{FLAG-Trader}, a unified architecture integrating linguistic processing (via LLMs) with gradient-driven reinforcement learning (RL) policy optimization, in which a partially fine-tuned LLM acts as the policy network, leveraging pre-trained knowledge while adapting to the financial domain through parameter-efficient fine-tuning. Through policy gradient optimization driven by trading rewards, our framework not only enhances LLM performance in trading but also improves results on other financial-domain tasks. We present extensive empirical evidence to validate these enhancements.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.11433

Recurrent Neural Networks for Dynamic VWAP Execution: Adaptive Trading Strategies with Temporal Kolmogorov-Arnold Networks

By:	Remi Genet
Abstract:	The execution of Volume Weighted Average Price (VWAP) orders remains a critical challenge in modern financial markets, particularly as trading volumes and market complexity continue to increase. In my previous work arXiv:2502.13722, I introduced a novel deep learning approach that demonstrated significant improvements over traditional VWAP execution methods by directly optimizing the execution problem rather than relying on volume curve predictions. However, that model was static because it employed the fully linear approach described in arXiv:2410.21448, which is not designed for dynamic adjustment. This paper extends that foundation by developing a dynamic neural VWAP framework that adapts to evolving market conditions in real time. We introduce two key innovations: first, the integration of recurrent neural networks to capture complex temporal dependencies in market dynamics, and second, a sophisticated dynamic adjustment mechanism that continuously optimizes execution decisions based on market feedback. The empirical analysis, conducted across five major cryptocurrency markets, demonstrates that this dynamic approach achieves substantial improvements over both traditional methods and our previous static implementation, with execution performance gains of 10 to 15% in liquid markets and consistent outperformance across varying conditions. These results suggest that adaptive neural architectures can effectively address the challenges of modern VWAP execution while maintaining computational efficiency suitable for practical deployment.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.18177

Generalized Factor Neural Network Model for High-dimensional Regression

By:	Zichuan Guo; Mihai Cucuringu; Alexander Y. Shestopaloff
Abstract:	We tackle the challenges of modeling high-dimensional data sets, particularly those with latent low-dimensional structures hidden within complex, non-linear, and noisy relationships. Our approach enables a seamless integration of concepts from non-parametric regression, factor models, and neural networks for high-dimensional regression. Our approach introduces PCA and Soft PCA layers, which can be embedded at any stage of a neural network architecture, allowing the model to alternate between factor modeling and non-linear transformations. This flexibility makes our method especially effective for processing hierarchical compositional data. We explore ours and other techniques for imposing low-rank structures on neural networks and examine how architectural design impacts model performance. The effectiveness of our method is demonstrated through simulation studies, as well as applications to forecasting future price movements of equity ETF indices and nowcasting with macroeconomic data.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.11310

Next-Gen Dynamic and Deal-Based Pricing Strategy in Automotive and Financial Services

By:	Hota, Ashish
Abstract:	This paper explores the evolution of AI-driven pricing strategies in the automotive and financial services sectors, focusing on dynamic and deal-based pricing models that adapt in real time to shifts in consumer behavior, supply chain limitations, and market fluctuations. We examine how advanced machine learning techniques, including deep learning and reinforcement learning, enable predictive and adaptive pricing solutions that drive customer loyalty, revenue optimization, and transparency. Explainable AI also features prominently, offering transparency to consumers and regulators alike.
Date:	2024–11–25
URL:	https://d.repec.org/n?u=RePEc:osf:osfxxx:emgpv_v1

FinBloom: Knowledge Grounding Large Language Model with Real-time Financial Data

By:	Ankur Sinha; Chaitanya Agarwal; Pekka Malo
Abstract:	Large language models (LLMs) excel at generating human-like responses but often struggle with interactive tasks that require access to real-time information. This limitation poses challenges in finance, where models must access up-to-date information, such as recent news or price movements, to support decision-making. To address this, we introduce Financial Agent, a knowledge-grounding approach for LLMs to handle financial queries using real-time text and tabular data. Our contributions are threefold: First, we develop a Financial Context Dataset of over 50, 000 financial queries paired with the required context. Second, we train FinBloom 7B, a custom 7 billion parameter LLM, on 14 million financial news articles from Reuters and Deutsche Presse-Agentur, alongside 12 million Securities and Exchange Commission (SEC) filings. Third, we fine-tune FinBloom 7B using the Financial Context Dataset to serve as a Financial Agent. This agent generates relevant financial context, enabling efficient real-time data retrieval to answer user queries. By reducing latency and eliminating the need for users to manually provide accurate data, our approach significantly enhances the capability of LLMs to handle dynamic financial tasks. Our proposed approach makes real-time financial decisions, algorithmic trading and other related tasks streamlined, and is valuable in contexts with high-velocity data flows.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.18471

Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management

By:	Lei Zhao; Lin Cai; Wu-Sheng Lu
Abstract:	In the field of financial derivatives trading, managing volatility risk is crucial for protecting investment portfolios from market changes. Traditional Vega hedging strategies, which often rely on basic and rule-based models, are hard to adapt well to rapidly changing market conditions. We introduce a new framework for dynamic Vega hedging, the Adaptive Nesterov Accelerated Distributional Deep Hedging (ANADDH), which combines distributional reinforcement learning with a tailored design based on adaptive Nesterov acceleration. This approach improves the learning process in complex financial environments by modeling the hedging efficiency distribution, providing a more accurate and responsive hedging strategy. The design of adaptive Nesterov acceleration refines gradient momentum adjustments, significantly enhancing the stability and speed of convergence of the model. Through empirical analysis and comparisons, our method demonstrates substantial performance gains over existing hedging techniques. Our results confirm that this innovative combination of distributional reinforcement learning with the proposed optimization techniques improves financial risk management and highlights the practical benefits of implementing advanced neural network architectures in the finance sector.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.17777

Bankruptcy analysis using images and convolutional neural networks (CNN)

By:	Luiz Tavares; Jose Mazzon; Francisco Paletta; Fabio Barros
Abstract:	The marketing departments of financial institutions strive to craft products and services that cater to the diverse needs of businesses of all sizes. However, it is evident upon analysis that larger corporations often receive a more substantial portion of available funds. This disparity arises from the relative ease of assessing the risk of default and bankruptcy in these more prominent companies. Historically, risk analysis studies have focused on data from publicly traded or stock exchange-listed companies, leaving a gap in knowledge about small and medium-sized enterprises (SMEs). Addressing this gap, this study introduces a method for evaluating SMEs by generating images for processing via a convolutional neural network (CNN). To this end, more than 10, 000 images, one for each company in the sample, were created to identify scenarios in which the CNN can operate with higher assertiveness and reduced training error probability. The findings demonstrate a significant predictive capacity, achieving 97.8% accuracy, when a substantial number of images are utilized. Moreover, the image creation method paves the way for potential applications of this technique in various sectors and for different analytical purposes.
Date:	2025–01
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.15726

A Supervised Screening and Regularized Factor-Based Method for Time Series Forecasting

By:	Sihan Tu; Zhaoxing Gao
Abstract:	Factor-based forecasting using Principal Component Analysis (PCA) is an effective machine learning tool for dimension reduction with many applications in statistics, economics, and finance. This paper introduces a Supervised Screening and Regularized Factor-based (SSRF) framework that systematically addresses high-dimensional predictor sets through a structured four-step procedure integrating both static and dynamic forecasting mechanisms. The static approach selects predictors via marginal correlation screening and scales them using univariate predictive slopes, while the dynamic method screens and scales predictors based on time series regression incorporating lagged predictors. PCA then extracts latent factors from the scaled predictors, followed by LASSO regularization to refine predictive accuracy. In the simulation study, we validate the effectiveness of SSRF and identify its parameter adjustment strategies in high-dimensional data settings. An empirical analysis of macroeconomic indices in China demonstrates that the SSRF method generally outperforms several commonly used forecasting techniques in out-of-sample predictions.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.15275

Gradients can train reward models: An Empirical Risk Minimization Approach for Offline Inverse RL and Dynamic Discrete Choice Model

By:	Enoch H. Kang; Hema Yoganarasimhan; Lalit Jain
Abstract:	We study the problem of estimating Dynamic Discrete Choice (DDC) models, also known as offline Maximum Entropy-Regularized Inverse Reinforcement Learning (offline MaxEnt-IRL) in machine learning. The objective is to recover reward or $Q^*$ functions that govern agent behavior from offline behavior data. In this paper, we propose a globally convergent gradient-based method for solving these problems without the restrictive assumption of linearly parameterized rewards. The novelty of our approach lies in introducing the Empirical Risk Minimization (ERM) based IRL/DDC framework, which circumvents the need for explicit state transition probability estimation in the Bellman equation. Furthermore, our method is compatible with non-parametric estimation techniques such as neural networks. Therefore, the proposed method has the potential to be scaled to high-dimensional, infinite state spaces. A key theoretical insight underlying our approach is that the Bellman residual satisfies the Polyak-Lojasiewicz (PL) condition -- a property that, while weaker than strong convexity, is sufficient to ensure fast global convergence guarantees. Through a series of synthetic experiments, we demonstrate that our approach consistently outperforms benchmark methods and state-of-the-art alternatives.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.14131

Balancing Flexibility and Interpretability: A Conditional Linear Model Estimation via Random Forest

By:	Ricardo Masini; Marcelo Medeiros
Abstract:	Traditional parametric econometric models often rely on rigid functional forms, while nonparametric techniques, despite their flexibility, frequently lack interpretability. This paper proposes a parsimonious alternative by modeling the outcome $Y$ as a linear function of a vector of variables of interest $\boldsymbol{X}$, conditional on additional covariates $\boldsymbol{Z}$. Specifically, the conditional expectation is expressed as $\mathbb{E}[Y\|\boldsymbol{X}, \boldsymbol{Z}]=\boldsymbol{X}^{T}\boldsymbol{\beta}(\boldsymbol{Z})$, where $\boldsymbol{\beta}(\cdot)$ is an unknown Lipschitz-continuous function. We introduce an adaptation of the Random Forest (RF) algorithm to estimate this model, balancing the flexibility of machine learning methods with the interpretability of traditional linear models. This approach addresses a key challenge in applied econometrics by accommodating heterogeneity in the relationship between covariates and outcomes. Furthermore, the heterogeneous partial effects of $\boldsymbol{X}$ on $Y$ are represented by $\boldsymbol{\beta}(\cdot)$ and can be directly estimated using our proposed method. Our framework effectively unifies established parametric and nonparametric models, including varying-coefficient, switching regression, and additive models. We provide theoretical guarantees, such as pointwise and $L^p$-norm rates of convergence for the estimator, and establish a pointwise central limit theorem through subsampling, aiding inference on the function $\boldsymbol\beta(\cdot)$. We present Monte Carlo simulation results to assess the finite-sample performance of the method.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.13438

Event-Based Limit Order Book Simulation under a Neural Hawkes Process: Application in Market-Making

By:	Luca Lalor; Anatoliy Swishchuk
Abstract:	In this paper, we propose an event-driven Limit Order Book (LOB) model that captures twelve of the most observed LOB events in exchange-based financial markets. To model these events, we propose using the state-of-the-art Neural Hawkes process, a more robust alternative to traditional Hawkes process models. More specifically, this model captures the dynamic relationships between different event types, particularly their long- and short-term interactions, using a Long Short-Term Memory neural network. Using this framework, we construct a midprice process that captures the event-driven behavior of the LOB by simulating high-frequency dynamics like how they appear in real financial markets. The empirical results show that our model captures many of the broader characteristics of the price fluctuations, particularly in terms of their overall volatility. We apply this LOB simulation model within a Deep Reinforcement Learning Market-Making framework, where the trading agent can now complete trade order fills in a manner that closely resembles real-market trade execution. Here, we also compare the results of the simulated model with those from real data, highlighting how the overall performance and the distribution of trade order fills closely align with the same analysis on real data.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.17417

Toward Proactive Policy Design: Identifying 'To-Be' Energy-Poor Households Using Shap for Early Intervention

By:	Budría, Santiago (Universidad Nebrija); Fermé, Eduardo (University of Madeira); Freitas, Diogo Nuno (University of Madeira)
Abstract:	Identifying at-risk populations is essential for designing effective energy poverty interventions. Using data from the HILDA Survey, a longitudinal dataset representative of the Australian population, and a multidimensional index of energy poverty, we develop a machine learning model combined with SHAP (SHapley Additive exPlanations) values to document the short- and long-term effects of individual and contextual factors—such as income, energy prices, and regional conditions—on future energy poverty outcomes. The findings emphasize the importance of policies focused on income stability and may be used to shift the policy focus from reactive measures, which address existing poverty, to preventive strategies that target households showing early signs of vulnerability.
Keywords:	Energy poverty, panel data, explainable AI, time-series analysis, public policy, temporal dynamics, feature importance
JEL:	I32 D12 C53
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:iza:izadps:dp17669

ChatGPT and Deepseek: Can They Predict the Stock Market and Macroeconomy?

By:	Jian Chen; Guohao Tang; Guofu Zhou; Wu Zhu
Abstract:	We study whether ChatGPT and DeepSeek can extract information from the Wall Street Journal to predict the stock market and the macroeconomy. We find that ChatGPT has predictive power. DeepSeek underperforms ChatGPT, which is trained more extensively in English. Other large language models also underperform. Consistent with financial theories, the predictability is driven by investors' underreaction to positive news, especially during periods of economic downturn and high information uncertainty. Negative news correlates with returns but lacks predictive value. At present, ChatGPT appears to be the only model capable of capturing economic news that links to the market risk premium.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.10008

Using Machine Learning to Understand the Heterogeneous Earnings Effects of Exports

By:	Muffert, Johanna (FAU Erlangen Nuremberg); Winkler, Erwin (University of Erlangen-Nuremberg)
Abstract:	We study how the effects of exports on earnings vary across individual workers, depending on a wide range of worker, firm, and job characteristics. To this end, we combine a generalized random forest with an instrumental variable strategy. Analyzing Germany's exports to China and Eastern Europe, we document sharp disparities: workers in the bottom quartile (ranked by the size of the effect) experience little to no earnings gains due to exports, while those in the top quartile see considerable earnings increases. As expected, the workers who benefit the most on average are employed in larger firms and have higher skill levels. Importantly, however, we also find that workers with the largest earnings gains tend to be male, younger, and more specialized in their industry. These factors have received little attention in the previous literature. Finally, we provide evidence that the contribution to overall earnings inequality is smaller than expected.
Keywords:	machine learning, earnings, inequality, exports, skills, labor market
JEL:	C52 F14 J23 J24 J32
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:iza:izadps:dp17667

Humans vs GPTs: Bias and validity in hiring decisions

By:	Lippens, Louis (Ghent University)
Abstract:	The advent of large language models (LLMs) may reshape hiring in the labour market. This paper investigates how generative pre-trained transformers (GPTs)—i.e. OpenAI’s GPT-3.5, GPT-4, and GPT-4o—can aid hiring decisions. In a direct comparison between humans and GPTs on an identical hiring task, I show that GPTs tend to select candidates more liberally than humans but exhibit less ethnic bias. GPT-4 even slightly favours certain ethnic minorities. While LLMs may complement humans in hiring by making a (relatively extensive) pre-selection of job candidates, the findings suggest that they may miss-select due to a lack of contextual understanding and may reproduce pre-trained human bias at scale.
Date:	2024–07–11
URL:	https://d.repec.org/n?u=RePEc:osf:osfxxx:zxf5y_v1

A Comprehensive Approach to Behavioral Data Analysis and Machine Learning within Unified Systems

By:	Hota, Ashish
Abstract:	The integration of behavioral data analysis and machine learning (ML) within unified systems has become increasingly vital for enhanced decision-making and system optimization across various industries, including healthcare, marketing, and finance. Behavioral data—comprising user actions, preferences, and interactions—provides valuable insights into emerging trends, enabling adaptive and intelligent system functionalities. Coupling this with ML allows systems to continuously learn and improve their performance. This paper presents a comprehensive approach to integrating behavioral data analysis and ML within unified systems, covering key methodologies, technical challenges, applications, and a roadmap for future developments. Additionally, the article includes technical facts, tables, diagrams, and comparisons to aid in understanding the technical aspects and advantages of this integration.
Date:	2024–12–16
URL:	https://d.repec.org/n?u=RePEc:osf:osfxxx:rjpxs_v1

A Multi-LLM-Agent-Based Framework for Economic and Public Policy Analysis

By:	Yuzhi Hao (Department of Economics, The Hong Kong University of Science and Technology); Danyang Xie (Thrust of Innovation, Policy, and Entrepreneurship, the Society Hub, The Hong Kong University of Science and Technology)
Abstract:	This paper pioneers a novel approach to economic and public policy analysis by leveraging multiple Large Language Models (LLMs) as heterogeneous artificial economic agents. We first evaluate five LLMs' economic decision-making capabilities in solving two-period consumption allocation problems under two distinct scenarios: with explicit utility functions and based on intuitive reasoning. While previous research has often simulated heterogeneity by solely varying prompts, our approach harnesses the inherent variations in analytical capabilities across different LLMs to model agents with diverse cognitive traits. Building on these findings, we construct a Multi-LLM-Agent-Based (MLAB) framework by mapping these LLMs to specific educational groups and corresponding income brackets. Using interest-income taxation as a case study, we demonstrate how the MLAB framework can simulate policy impacts across heterogeneous agents, offering a promising new direction for economic and public policy analysis by leveraging LLMs' human-like reasoning capabilities and computational power.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.16879

Economic Causal Inference Based on DML Framework: Python Implementation of Binary and Continuous Treatment Variables

By:	Shunxin Yao
Abstract:	This study utilizes a simulated dataset to establish Python code for Double Machine Learning (DML) using Anaconda's Jupyter Notebook and the DML software package from GitHub. The research focuses on causal inference experiments for both binary and continuous treatment variables. The findings reveal that the DML model demonstrates relatively stable performance in calculating the Average Treatment Effect (ATE) and its robustness metrics. However, the study also highlights that the computation of Conditional Average Treatment Effect (CATE) remains a significant challenge for future DML modeling, particularly in the context of continuous treatment variables. This underscores the need for further research and development in this area to enhance the model's applicability and accuracy.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.19898

Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies

By:	Zheli Xiong
Abstract:	This paper presents a comprehensive study on the use of ensemble Reinforcement Learning (RL) models in financial trading strategies, leveraging classifier models to enhance performance. By combining RL algorithms such as A2C, PPO, and SAC with traditional classifiers like Support Vector Machines (SVM), Decision Trees, and Logistic Regression, we investigate how different classifier groups can be integrated to improve risk-return trade-offs. The study evaluates the effectiveness of various ensemble methods, comparing them with individual RL models across key financial metrics, including Cumulative Returns, Sharpe Ratios (SR), Calmar Ratios, and Maximum Drawdown (MDD). Our results demonstrate that ensemble methods consistently outperform base models in terms of risk-adjusted returns, providing better management of drawdowns and overall stability. However, we identify the sensitivity of ensemble performance to the choice of variance threshold {\tau}, highlighting the importance of dynamic {\tau} adjustment to achieve optimal performance. This study emphasizes the value of combining RL with classifiers for adaptive decision-making, with implications for financial trading, robotics, and other dynamic environments.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.17518

Utilizing Effective Dynamic Graph Learning to Shield Financial Stability from Risk Propagation

By:	Guanyuan Yu; Qing Li; Yu Zhao; Jun Wang; YiJun Chen; Shaolei Chen
Abstract:	Financial risks can propagate across both tightly coupled temporal and spatial dimensions, posing significant threats to financial stability. Moreover, risks embedded in unlabeled data are often difficult to detect. To address these challenges, we introduce GraphShield, a novel approach with three key innovations: Enhanced Cross-Domain Infor mation Learning: We propose a dynamic graph learning module to improve information learning across temporal and spatial domains. Advanced Risk Recognition: By leveraging the clustering characteristics of risks, we construct a risk recognizing module to enhance the identification of hidden threats. Risk Propagation Visualization: We provide a visualization tool for quantifying and validating nodes that trigger widespread cascading risks. Extensive experiments on two real-world and two open-source datasets demonstrate the robust performance of our framework. Our approach represents a significant advancement in leveraging artificial intelligence to enhance financial stability, offering a powerful solution to mitigate the spread of risks within financial networks.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.13979

Stories that (are) Move(d by) Markets: A Causal Exploration of Market Shocks and Semantic Shifts across Different Partisan Groups

By:	Felix Drinkall; Stefan Zohren; Michael McMahon; Janet B. Pierrehumbert
Abstract:	Macroeconomic fluctuations and the narratives that shape them form a mutually reinforcing cycle: public discourse can spur behavioural changes leading to economic shifts, which then result in changes in the stories that propagate. We show that shifts in semantic embedding space can be causally linked to financial market shocks -- deviations from the expected market behaviour. Furthermore, we show how partisanship can influence the predictive power of text for market fluctuations and shape reactions to those same shocks. We also provide some evidence that text-based signals are particularly salient during unexpected events such as COVID-19, highlighting the value of language data as an exogenous variable in economic forecasting. Our findings underscore the bidirectional relationship between news outlets and market shocks, offering a novel empirical approach to studying their effect on each other.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.14497

Grounded Persuasive Language Generation for Automated Marketing

By:	Jibang Wu; Chenghao Yang; Simon Mahns; Chaoqi Wang; Hao Zhu; Fei Fang; Haifeng Xu
Abstract:	This paper develops an agentic framework that employs large language models (LLMs) to automate the generation of persuasive and grounded marketing content, using real estate listing descriptions as our focal application domain. Our method is designed to align the generated content with user preferences while highlighting useful factual attributes. This agent consists of three key modules: (1) Grounding Module, mimicking expert human behavior to predict marketable features; (2) Personalization Module, aligning content with user preferences; (3) Marketing Module, ensuring factual accuracy and the inclusion of localized features. We conduct systematic human-subject experiments in the domain of real estate marketing, with a focus group of potential house buyers. The results demonstrate that marketing descriptions generated by our approach are preferred over those written by human experts by a clear margin. Our findings suggest a promising LLM-based agentic framework to automate large-scale targeted marketing while ensuring responsible generation using only facts.
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.16810

HedgeAgents: A Balanced-aware Multi-agent Financial Trading System

By:	Xiangyu Li; Yawen Zeng; Xiaofen Xing; Jin Xu; Xiangmin Xu
Abstract:	As automated trading gains traction in the financial market, algorithmic investment strategies are increasingly prominent. While Large Language Models (LLMs) and Agent-based models exhibit promising potential in real-time market analysis and trading decisions, they still experience a significant -20% loss when confronted with rapid declines or frequent fluctuations, impeding their practical application. Hence, there is an imperative to explore a more robust and resilient framework. This paper introduces an innovative multi-agent system, HedgeAgents, aimed at bolstering system robustness via ``hedging'' strategies. In this well-balanced system, an array of hedging agents has been tailored, where HedgeAgents consist of a central fund manager and multiple hedging experts specializing in various financial asset classes. These agents leverage LLMs' cognitive capabilities to make decisions and coordinate through three types of conferences. Benefiting from the powerful understanding of LLMs, our HedgeAgents attained a 70% annualized return and a 400% total return over a period of 3 years. Moreover, we have observed with delight that HedgeAgents can even formulate investment experience comparable to those of human experts (https://hedgeagents.github.io/).
Date:	2025–02
URL:	https://d.repec.org/n?u=RePEc:arx:papers:2502.13165

This nep-big issue is ©2025 by Tom Coupé. It is provided as is without any express or implied warranty. It may be freely redistributed in whole or in part for any purpose. If distributed in part, please include this notice.

General information on the NEP project can be found at https://nep.repec.org. For comments please write to the director of NEP, Marco Novarese at <director@nep.repec.org>. Put “NEP” in the subject, otherwise your mail may be rejected.

NEP’s infrastructure is sponsored by the School of Economics and Finance of Massey University in New Zealand.