nep-cmp New Economics Papers
on Computational Economics
Issue of 2026–03–02
25 papers chosen by
Stan Miles, Thompson Rivers University


  1. Measuring economic outlook in the news By Elliot Beck; Franziska Eckert; Linus Kühne; Helge Liebert; Rina Rosenblatt-Wisch
  2. Behavioral Economics of AI: LLM Biases and Corrections By Pietro Bini; Lin William Cong; Xing Huang; Lawrence J. Jin
  3. Behavioral Consistency Validation for LLM Agents: An Analysis of Trading-Style Switching through Stock-Market Simulation By Zeping Li; Guancheng Wan; Keyang Chen; Yu Chen; Yiwen Zhao; Philip Torr; Guangnan Ye; Zhenfei Yin; Hongfeng Chai
  4. Toward Expert Investment Teams:A Multi-Agent LLM System with Fine-Grained Trading Tasks By Kunihiro Miyazaki; Takanobu Kawahara; Stephen Roberts; Stefan Zohren
  5. Beyond the Numbers: Causal Effects of Financial Report Sentiment on Bank Profitability By Krishna Neupane; Prem Sapkota; Ujjwal Prajapati
  6. Evaluating LLMs in Finance Requires Explicit Bias Consideration By Yaxuan Kong; Hoyoung Lee; Yoontae Hwang; Alejandro Lopez-Lira; Bradford Levy; Dhagash Mehta; Qingsong Wen; Chanyeol Choi; Yongjae Lee; Stefan Zohren
  7. Deep Reinforcement Learning for Optimal Portfolio Allocation: A Comparative Study with Mean-Variance Optimization By Srijan Sood; Kassiani Papasotiriou; Marius Vaiciulis; Tucker Balch
  8. Seeing the Goal, Missing the Truth: Human Accountability for AI Bias By Sean Cao; Wei Jiang; Hui Xu
  9. Forecasting Future Language: Context Design for Mention Markets By Sumin Kim; Jihoon Kwon; Yoon Kim; Nicole Kagan; Raffi Khatchadourian; Wonbin Ahn; Alejandro Lopez-Lira; Jaewon Lee; Yoontae Hwang; Oscar Levy; Yongjae Lee; Chanyeol Choi
  10. Causal Effect Estimation with Latent Textual Treatments By Omri Feldman; Amar Venugopal; Jann Spiess; Amir Feder
  11. In-processing of actuarial and equity fairness constraints for Neural networks By Hainaut, Donatien
  12. genriesz: A Python Package for Automatic Debiased Machine Learning with Generalized Riesz Regression By Masahiro Kato
  13. What Do LLMs Want? By Thomas R. Cook; Sophia Kazinnik; Zach Modig; Nathan M. Palmer
  14. Mimicking Finance By Lauren Cohen; Yiwen Lu; Quoc H. Nguyen
  15. Patent Valuation under Fragile Institutional Enforcement: A Continuous-Time Markov Approach By Gautami Parate; Arpita Choudhary
  16. Illusions and Perceived Wealth: an Agent-based model of Madoff's Ponzi scheme By Paolo Pellizzari; Francesca Parpinel
  17. Application of Quasi Monte Carlo and Global Sensitivity Analysis to Option Pricing and Greeks By Stefano Scoleri; Marco Bianchetti; Sergei Kucherenko
  18. Can Interest-Bearing Positions Solve the Long-Horizon Problem in Prediction Markets? By Caleb Maresca
  19. Machine Learning Meets Markowitz By Yijie Wang; Hao Gao; Campbell R. Harvey; Yan Liu; Xinyuan Tao
  20. Earning While Learning: How to Run Batched Bandit Experiments By Kemper, Jan; Rostam-Afschar, Davud
  21. Extracting Consumer Insight from Text: A Large Language Model Approach to Emotion and Evaluation Measurement By Stephan Ludwig; Peter J. Danaher; Xiaohao Yang; Yu-Ting Lin; Ehsan Abedin; Dhruv Grewal; Lan Du
  22. Bitcoin ETFs and structural decoupling in the cryptocurrency market: evidence from altcoin correlation dynamics By Li, Yuxuan; Zhou, Yuqin; Huang, Jun; Xie, Lin; Huang, Hancheng
  23. Metric geometry for ranking-based voting: Tools for learning electoral structure By Moon Duchin; Kristopher Tapp
  24. Money-Back Tontines for Retirement Decumulation: Neural-Network Optimization under Systematic Longevity Risk By German Nova Orozco; Duy-Minh Dang; Peter A. Forsyth
  25. Scaling Reproducibility: An AI-Assisted Workflow for Large-Scale Reanalysis By Yiqing Xu; Leo Yang Yang

  1. By: Elliot Beck; Franziska Eckert; Linus Kühne; Helge Liebert; Rina Rosenblatt-Wisch
    Abstract: We develop a resource-efficient methodology for measuring economic outlook in news text that combines document embeddings with synthetic training data generated by large language models. Applied to 27 million news articles, the resulting indicator significantly improves GDP growth forecast accuracy and captures sentiment shifts weeks before official releases, proving particularly valuable during crises. The indicator outperforms both survey-based benchmarks and traditional dictionary methods and is interpretable, allowing identification of specific drivers of economic sentiment. Our approach addresses key institutional constraints: it performs sentiment classification locally, enabling analyses of proprietary news content without transmission to external services while requiring minimal computational resources compared to direct large language model classification.
    Keywords: Sentiment analysis, Economic outlook, Forecasting, Big data, Large language models, Natural language processing, Neural networks
    JEL: E66 C45 C55
    Date: 2026
    URL: https://d.repec.org/n?u=RePEc:snb:snbwpa:2026-04
  2. By: Pietro Bini; Lin William Cong; Xing Huang; Lawrence J. Jin
    Abstract: Do generative AI models, particularly large language models (LLMs), exhibit systematic behavioral biases in economic and financial decisions? If so, how can these biases be mitigated? Drawing on the cognitive psychology and experimental economics literatures, we conduct the most comprehensive set of experiments to date$-$originally designed to document human biases$-$on prominent LLM families across model versions and scales. We document systematic patterns in LLM behavior. In preference-based tasks, responses become more human-like as models become more advanced or larger, while in belief-based tasks, advanced large-scale models frequently generate rational responses. Prompting LLMs to make rational decisions reduces biases.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.09362
  3. By: Zeping Li; Guancheng Wan; Keyang Chen; Yu Chen; Yiwen Zhao; Philip Torr; Guangnan Ye; Zhenfei Yin; Hongfeng Chai
    Abstract: Recent works have increasingly applied Large Language Models (LLMs) as agents in financial stock market simulations to test if micro-level behaviors aggregate into macro-level phenomena. However, a crucial question arises: Do LLM agents' behaviors align with real market participants? This alignment is key to the validity of simulation results. To explore this, we select a financial stock market scenario to test behavioral consistency. Investors are typically classified as fundamental or technical traders, but most simulations fix strategies at initialization, failing to reflect real-world trading dynamics. In this work, we assess whether agents' strategy switching aligns with financial theory, providing a framework for this evaluation. We operationalize four behavioral-finance drivers-loss aversion, herding, wealth differentiation, and price misalignment-as personality traits set via prompting and stored long-term. In year-long simulations, agents process daily price-volume data, trade under a designated style, and reassess their strategy every 10 trading days. We introduce four alignment metrics and use Mann-Whitney U tests to compare agents' style-switching behavior with financial theory. Our results show that recent LLMs' switching behavior is only partially consistent with behavioral-finance theories, highlighting the need for further refinement in aligning agent behavior with financial theory.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.07023
  4. By: Kunihiro Miyazaki; Takanobu Kawahara; Stephen Roberts; Stefan Zohren
    Abstract: The advancement of large language models (LLMs) has accelerated the development of autonomous financial trading systems. While mainstream approaches deploy multi-agent systems mimicking analyst and manager roles, they often rely on abstract instructions that overlook the intricacies of real-world workflows, which can lead to degraded inference performance and less transparent decision-making. Therefore, we propose a multi-agent LLM trading framework that explicitly decomposes investment analysis into fine-grained tasks, rather than providing coarse-grained instructions. We evaluate the proposed framework using Japanese stock data, including prices, financial statements, news, and macro information, under a leakage-controlled backtesting setting. Experimental results show that fine-grained task decomposition significantly improves risk-adjusted returns compared to conventional coarse-grained designs. Crucially, further analysis of intermediate agent outputs suggests that alignment between analytical outputs and downstream decision preferences is a critical driver of system performance. Moreover, we conduct standard portfolio optimization, exploiting low correlation with the stock index and the variance of each system's output. This approach achieves superior performance. These findings contribute to the design of agent structure and task configuration when applying LLM agents to trading systems in practical settings.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.23330
  5. By: Krishna Neupane; Prem Sapkota; Ujjwal Prajapati
    Abstract: This study establishes the causal effects of market sentiment on firm profitability, moving beyond traditional correlational analyses. It leverages a causal forest machine learning methodology to control for numerous confounding variables, enabling systematic analysis of heterogeneity and non-linearities often overlooked. A key innovation is the use of a pre-trained FinancialBERT to generate sentiment scores from quarterly reports, which are then treated as causal interventions impacting profitability dynamics like returns and volatilities. Utilizing a comprehensive dataset from NEPSE, NRB, and individual financial institutions, the research employs SHAP analysis to identify influential profit predictors. A two-pronged causal analysis further explores how sentiment's impact is conditioned by Loan Portfolio/Asset Composition and Balance Sheet Strength/Leverage. Average Treatment Effect analyses, combined with SHAP insights, reveal statistically significant causal associations between certain balance sheet and expense management variables and profitability. This advanced causal machine learning framework significantly extends existing literature, providing a more robust understanding of how financial sentiment truly impacts firm performance.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.17851
  6. By: Yaxuan Kong; Hoyoung Lee; Yoontae Hwang; Alejandro Lopez-Lira; Bradford Levy; Dhagash Mehta; Qingsong Wen; Chanyeol Choi; Yongjae Lee; Stefan Zohren
    Abstract: Large Language Models (LLMs) are increasingly integrated into financial workflows, but evaluation practice has not kept up. Finance-specific biases can inflate performance, contaminate backtests, and make reported results useless for any deployment claim. We identify five recurring biases in financial LLM applications. They include look-ahead bias, survivorship bias, narrative bias, objective bias, and cost bias. These biases break financial tasks in distinct ways and they often compound to create an illusion of validity. We reviewed 164 papers from 2023 to 2025 and found that no single bias is discussed in more than 28 percent of studies. This position paper argues that bias in financial LLM systems requires explicit attention and that structural validity should be enforced before any result is used to support a deployment claim. We propose a Structural Validity Framework and an evaluation checklist with minimal requirements for bias diagnosis and future system design. The material is available at https://github.com/Eleanorkong/Awesome-F inancial-LLM-Bias-Mitigation.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.14233
  7. By: Srijan Sood; Kassiani Papasotiriou; Marius Vaiciulis; Tucker Balch
    Abstract: Portfolio Management is the process of overseeing a group of investments, referred to as a portfolio, with the objective of achieving predetermined investment goals. Portfolio optimization is a key component that involves allocating the portfolio assets so as to maximize returns while minimizing risk taken. It is typically carried out by financial professionals who use a combination of quantitative techniques and investment expertise to make decisions about the portfolio allocation. Recent applications of Deep Reinforcement Learning (DRL) have shown promising results when used to optimize portfolio allocation by training model-free agents on historical market data. Many of these methods compare their results against basic benchmarks or other state-of-the-art DRL agents but often fail to compare their performance against traditional methods used by financial professionals in practical settings. One of the most commonly used methods for this task is Mean-Variance Portfolio Optimization (MVO), which uses historical time series information to estimate expected asset returns and covariances, which are then used to optimize for an investment objective. Our work is a thorough comparison between model-free DRL and MVO for optimal portfolio allocation. We detail the specifics of how to make DRL for portfolio optimization work in practice, also noting the adjustments needed for MVO. Backtest results demonstrate strong performance of the DRL agent across many metrics, including Sharpe ratio, maximum drawdowns, and absolute returns.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.17098
  8. By: Sean Cao; Wei Jiang; Hui Xu
    Abstract: This research explores how human-defined goals influence the behavior of Large Language Models (LLMs) through purpose-conditioned cognition. Using financial prediction tasks, we show that revealing the downstream use (e.g., predicting stock returns or earnings) of LLM outputs leads the LLM to generate biased sentiment and competition measures, even though these measures are intended to be downstream task-independent. Goal-aware prompting shifts intermediate measures toward the disclosed downstream objective. This purpose leakage improves performance before the LLM's knowledge cutoff, but with no advantage post-cutoff. AI bias due to "seeing the goal" is not an algorithmic flaw, but stems from human accountability in research design to ensure the statistical validity and reliability of AI-generated measurements.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.09504
  9. By: Sumin Kim; Jihoon Kwon; Yoon Kim; Nicole Kagan; Raffi Khatchadourian; Wonbin Ahn; Alejandro Lopez-Lira; Jaewon Lee; Yoontae Hwang; Oscar Levy; Yongjae Lee; Chanyeol Choi
    Abstract: Mention markets, a type of prediction market in which contracts resolve based on whether a specified keyword is mentioned during a future public event, require accurate probabilistic forecasts of keyword-mention outcomes. While recent work shows that large language models (LLMs) can generate forecasts competitive with human forecasters, it remains unclear how input context should be designed to support accurate prediction. In this paper, we study this question through experiments on earnings-call mention markets, which require forecasting whether a company will mention a specified keyword during its upcoming call. We run controlled comparisons varying (i) which contextual information is provided (news and/or prior earnings-call transcripts) and (ii) how \textit{market probability}, (i.e., prediction market contract price) is used. We introduce Market-Conditioned Prompting (MCP), which explicitly treats the market-implied probability as a prior and instructs the LLM to update this prior using textual evidence, rather than re-predicting the base rate from scratch. In our experiments, we find three insights: (1) richer context consistently improves forecasting performance; (2) market-conditioned prompting (MCP), which treats the market probability as a prior and updates it using textual evidence, yields better-calibrated forecasts; and (3) a mixture of the market probability and MCP (MixMCP) outperforms the market baseline. By dampening the LLM's posterior update with the market prior, MixMCP yields more robust predictions than either the market or the LLM alone.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.21229
  10. By: Omri Feldman; Amar Venugopal; Jann Spiess; Amir Feder
    Abstract: Understanding the causal effects of text on downstream outcomes is a central task in many applications. Estimating such effects requires researchers to run controlled experiments that systematically vary textual features. While large language models (LLMs) hold promise for generating text, producing and evaluating controlled variation requires more careful attention. In this paper, we present an end-to-end pipeline for the generation and causal estimation of latent textual interventions. Our work first performs hypothesis generation and steering via sparse autoencoders (SAEs), followed by robust causal estimation. Our pipeline addresses both computational and statistical challenges in text-as-treatment experiments. We demonstrate that naive estimation of causal effects suffers from significant bias as text inherently conflates treatment and covariate information. We describe the estimation bias induced in this setting and propose a solution based on covariate residualization. Our empirical results show that our pipeline effectively induces variation in target features and mitigates estimation error, providing a robust foundation for causal effect estimation in text-as-treatment settings.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.15730
  11. By: Hainaut, Donatien (Université catholique de Louvain, LIDAM/ISBA, Belgium)
    Abstract: This article introduces a novel in-processing method for integrating actuarial and equity fairness into neural networks used for actuarial valuation. We consider one primary network penalized during training to ensure balanced predictions (actuarial fairness) and independence from sensitive features (equity fairness). Global and local actuarial equilibrium is obtained by aligning the inter-quantile averages of predicted and observed responses. Meanwhile, a second auxiliary network penalizes the primary network for discriminatory predictions. The combined training algorithm eectively preserves predictive accuracy while mitigating discrimination. Numerical illustrations on real-world datasets demonstrate the method's ecacy in achieving fair and reliable insurance pricing models.
    Keywords: Neural network ; equity fairness ; actuarial fairness ; non-life pricing
    Date: 2025–05–12
    URL: https://d.repec.org/n?u=RePEc:aiz:louvad:2025011
  12. By: Masahiro Kato
    Abstract: Efficient estimation of causal and structural parameters can be automated using the Riesz representation theorem and debiased machine learning (DML). We present genriesz, an open-source Python package that implements automatic DML and generalized Riesz regression, a unified framework for estimating Riesz representers by minimizing empirical Bregman divergences. This framework includes covariate balancing, nearest-neighbor matching, calibrated estimation, and density ratio estimation as special cases. A key design principle of the package is automatic regressor balancing (ARB): given a Bregman generator $g$ and a representer model class, genriesz} automatically constructs a compatible link function so that the generalized Riesz regression estimator satisfies balancing (moment-matching) optimality conditions in a user-chosen basis. The package provides a modulr interface for specifying (i) the target linear functional via a black-box evaluation oracle, (ii) the representer model via basis functions (polynomial, RKHS approximations, random forest leaf encodings, neural embeddings, and a nearest-neighbor catchment basis), and (iii) the Bregman generator, with optional user-supplied derivatives. It returns regression adjustment (RA), Riesz weighting (RW), augmented Riesz weighting (ARW), and TMLE-style estimators with cross-fitting, confidence intervals, and $p$-values. We highlight representative workflows for estimation problems such as the average treatment effect (ATE), ATE on treated (ATT), and average marginal effect estimation. The Python package is available at https://github.com/MasaKat0/genriesz and on PyPI.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.17543
  13. By: Thomas R. Cook; Sophia Kazinnik; Zach Modig; Nathan M. Palmer
    Abstract: Large language models (LLMs) are now used for economic reasoning, but their implicit "preferences" are poorly understood. We study these preferences by analyzing revealed choices in canonical allocation games and a sequential job-search environment. In dictator-style allocation games, most models favor equal splits, consistent with inequality aversion. Structural estimation of Fehr-Schmidt parameters suggests this aversion exceeds levels typically observed in human experiments. However, LLM preferences prove malleable. Interventions such as prompt framing (e.g., masking social context) and control vectors reliably shift models toward more payoff-maximizing behavior, while persona-based prompting has more limited impact. We then extend our analysis to a sequential decision-making environment based on the McCall job search model. Here, we recover implied discount factors from accept/reject behavior, but find that responses are less consistently rationalizable and preferences more fragile. Our findings highlight two core insights: (i) LLMs exhibit structured, latent preferences that often align with human behavioral norms, and (ii) these preferences can be steered, albeit more effectively in simple settings than in complex, dynamic ones.
    Keywords: Behavioral economics; Game theory; Search and matching models
    JEL: C63 C68 C61 D14 D83 D91 E20 E21
    Date: 2026–01–30
    URL: https://d.repec.org/n?u=RePEc:fip:fedgfe:102439
  14. By: Lauren Cohen; Yiwen Lu; Quoc H. Nguyen
    Abstract: We use frontier advancements in Artificial Intelligence and machine learning to extract and classify the part of key economic agents’ behaviors that are predictable from past behaviors. Even the agents themselves might view these as novel (innovative) decisions; however, we show in strong contrast that a large percentage of these actions and behaviors can be predicted—and thus mimicked—in the absence of these individuals. In particular, we show that 71% of mutual fund managers’ trade directions can be predicted in the absence of the agent making a single trade. For some managers, this increases to nearly all of their trades in a given quarter. Further, we find that manager behavior is more predictable and replicable for managers who have a longer history of trading and are in less competitive categories. The larger the ownership stake of the manager in the fund, the less predictable their behavior. Lastly, we show strong performance implications: less predictable managers strongly outperform their peers, while the most predictable managers significantly underperform. Even within each manager's portfolio, those stock positions that are more difficult to predict strongly outperform those that are easier to predict. Aggregating across the universe of fund managers each quarter, stocks whose position changes are least predictable additionally significantly outperform stocks whose position changes are most predictable across the universe. Our framework allows researchers to delineate and classify the portion of financial agents’ action sets which are predictable from those which are novel responses to stimuli -- open to being evaluated for value creation or destruction.
    JEL: C45 C53 C55 C82 G11 G23
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:nbr:nberwo:34849
  15. By: Gautami Parate (Madras School of Economics, Gandhi Mandapam Road, Behind Government Data Centre, Kotturpuram, Chennai, 600025, India.); Arpita Choudhary ((Corresponding author), Madras School of Economics, Gandhi Mandapam Road, Behind Government Data Centre, Kotturpuram, Chennai, 600025)
    Abstract: Environmental, Social, and Governance (ESG) considerations have become integral to corporate strategy, investor decision-making, and regulatory oversight. ESG violations—such as environmental harm, governance failures, and social misconduct—pose substantial reputational, financial, and legal risks. This study develops a machine learning-based framework for the early detection of ESG policy violations using the World Benchmarking Alliance’s Nature Benchmark dataset (2022–2024), covering 816 firms across more than 20 industries. To address the pronounced class imbalance inherent in ESG violation data, the Synthetic Minority Over-sampling Technique (SMOTE) is applied. Three classification models—Logistic Regression, Decision Tree, and Random Forest—are evaluated. The Random Forest model demonstrates the most robust performance, achieving a superior balance between accuracy and recall. Model interpretability is ensured through feature importance measures and SHAP values, which identify key ESG dimensions and industry-specific drivers associated with violations. Overall, the findings highlight the effectiveness of combining ensemble learning, resampling techniques, and explainable machine learning to support scalable and proactive ESG risk assessment.
    Keywords: ESG, ESG violations, sustainability analytics, machine learning, Random Forest, SMOTE, SHAP
    JEL: C38 C45 G17 Q56
    Date: 2026–01
    URL: https://d.repec.org/n?u=RePEc:mad:wpaper:2026-293
  16. By: Paolo Pellizzari (Ca’ Foscari University of Venice); Francesca Parpinel (Ca’ Foscari University of Venice)
    Abstract: We describe an Agent-Based Model of a Ponzi scheme following the Madoff's case. Agents have an initial propensity to invest in the scam, as the wealth is perceived to grow, whereas it is not invested in any way, and is dissipated by the fraudster. We emphasize that the widening gap between the perceived wealth and the true total money in the hands of the impostor is the key feature of such schemes. If trust evaporates due to the absorption of bad news on the economy, the propensity gradually reverses and an increasing number of agents withdraw their capital (and made up profits). We examine the time needed to reveal the scam and reach a bankruptcy, as a function of the amount of news that hits the market. We also investigate how a special agent named Markopolos (inspired to a real personage) affects the time to bankruptcy, due to his ability to abruptly "convince" to dis-invest the agents he run across. The Markopolos effect appears to be statistically significant, but is quite weak with respect to the outcome generated by a flow of news and the ensuing widespread loss of trust and redemptions.
    Keywords: Agent-Based Model, Ponzi Schemes, NetLogo
    JEL: C63 K42 G11 D83
    Date: 2026
    URL: https://d.repec.org/n?u=RePEc:ven:wpaper:2026:04
  17. By: Stefano Scoleri; Marco Bianchetti; Sergei Kucherenko
    Abstract: Quasi Monte Carlo (QMC) and Global Sensitivity Analysis (GSA) techniques are applied for pricing and hedging representative financial instruments of increasing complexity. We compare standard Monte Carlo (MC) vs QMC results using Sobol' low discrepancy sequences, different sampling strategies, and various analyses of performance. We find that QMC outperforms MC in most cases, including the highest-dimensional simulations, showing faster and more stable convergence. Regarding greeks computation, we compare standard approaches, based on finite differences (FD) approximations, with adjoint methods (AAD) providing evidences that, when the number of greeks is small, the FD approach combined with QMC can lead to the same accuracy as AAD, thanks to increased convergence rate and stability, thus saving a lot of implementation effort while keeping low computational cost. Using GSA, we are able to fully explain our findings in terms of reduced effective dimension of QMC simulation, allowed in most cases, but not always, by Brownian Bridge discretization or PCA construction. We conclude that, beyond pricing, QMC is a very effcient technique also for computing risk measures, greeks in particular, as it allows to reduce the computational effort of high dimensional Monte Carlo simulations typical of modern risk management.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.14354
  18. By: Caleb Maresca
    Abstract: Prediction markets suffer from reduced liquidity and price accuracy for long-horizon events due to the opportunity cost of committed capital. Recently, major platforms have introduced interest-bearing positions to mitigate this "long-horizon problem." I evaluate this policy using agent-based simulations with large language model (LLM) traders in a 2 x 2 factorial design, varying time horizon (4 days vs. 2 years) and the presence of interest. While long horizons degrade accuracy, the observed pricing bias (0.72 percentage points) is significantly smaller than theoretical and prior empirical estimates. Paying interest eliminates approximately 83% of the horizon effect on accuracy and more than triples market participation (from 17% to 62% of wealth). These findings suggest the long-horizon problem may be overstated in existing literature and that interest-bearing positions are a highly effective intervention, primarily by incentivizing participation rather than correcting bias.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.21091
  19. By: Yijie Wang; Hao Gao; Campbell R. Harvey; Yan Liu; Xinyuan Tao
    Abstract: The standard approach to portfolio selection involves two stages: forecast the asset returns and then plug them into an optimizer. We argue that this separation is deeply problematic. The first stage treats cross-sectional prediction errors as equally important across all securities. However, given that final portfolios might differ given distinct risk preferences and investment restrictions, the standard approach fails to recognize that the investor is not just concerned with the average forecast error - but the precision of the forecasts for the specific assets that are most important for their portfolio. Hence, it is crucial to integrate the two stages. We propose a novel implementation utilizing machine learning tools that unifies the expected return generation process and the final optimized portfolio. Our empirical example provides convincing evidence that our end-to-end method outperforms the traditional two-stage approach. In our framework, each investor has their own, endogenously determined, efficient frontier that depends on risk preferences, investor-specific constraints, as well as exposure to market frictions.
    JEL: C45 C55 G11 G12
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:nbr:nberwo:34861
  20. By: Kemper, Jan; Rostam-Afschar, Davud
    Abstract: Researchers typically collect experimental data sequentially, allowing early outcome observations and adaptive treatment assignment to reduce exposure to inferior treatments. This article reviews multi-armed-bandit adaptive experimental designs that balance exploration and exploitation. Because adaptively collected experimental data through bandit algorithms violate standard asymptotics, inference is challenging. We implement an estimator that yields valid heteroskedasticity-robust confidence intervals in batched bandit designs and compare coverage in Monte Carlo simulations. We introduce bbandits for Stata, a tool for designing experiments via simulation, running interactive bandit experiments, and implementing and analyzing adaptively collected data. bbandits includes three common assignment algorithms-e-first, e-greedy, and Thompson sampling-and supports estimation, inference, and visualization.
    Keywords: Randomized controlled trial, causal inference, multi-armed bandits, experimental design, machine learning
    JEL: C1 C11 C12 C13 C15 C18 C8 C87 C88 C9 D83
    Date: 2026
    URL: https://d.repec.org/n?u=RePEc:zbw:glodps:1717
  21. By: Stephan Ludwig; Peter J. Danaher; Xiaohao Yang; Yu-Ting Lin; Ehsan Abedin; Dhruv Grewal; Lan Du
    Abstract: Accurately measuring consumer emotions and evaluations from unstructured text remains a core challenge for marketing research and practice. This study introduces the Linguistic eXtractor (LX), a fine-tuned, large language model trained on consumer-authored text that also has been labeled with consumers' self-reported ratings of 16 consumption-related emotions and four evaluation constructs: trust, commitment, recommendation, and sentiment. LX consistently outperforms leading models, including GPT-4 Turbo, RoBERTa, and DeepSeek, achieving 81% macro-F1 accuracy on open-ended survey responses and greater than 95% accuracy on third-party-annotated Amazon and Yelp reviews. An application of LX to online retail data, using seemingly unrelated regression, affirms that review-expressed emotions predict product ratings, which in turn predict purchase behavior. Most emotional effects are mediated by product ratings, though some emotions, such as discontent and peacefulness, influence purchase directly, indicating that emotional tone provides meaningful signals beyond star ratings. To support its use, a no-code, cost-free, LX web application is available, enabling scalable analyses of consumer-authored text. In establishing a new methodological foundation for consumer perception measurement, this research demonstrates new methods for leveraging large language models to advance marketing research and practice, thereby achieving validated detection of marketing constructs from consumer data.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.15312
  22. By: Li, Yuxuan; Zhou, Yuqin; Huang, Jun; Xie, Lin; Huang, Hancheng
    Abstract: The approval of U.S.-based spot Bitcoin Exchange-Traded Funds (ETFs) in January 2024 marked a key milestone in the institutionalization of digital assets. This study examines how ETF introduction reshaped inter-asset dynamics in the cryptocurrency market. Using daily returns from January 2021 to September 2025 for Bitcoin and 18 major altcoins, we apply a Long Short-Term Memory (LSTM) neural network to capture evolving return correlations. Our analysis reveals a pronounced post-ETF decline in correlations across both short-term (6-month) and long-term (12-month) rolling windows. We interpret this structural decoupling as the effect of ‘independent inflows’, whereby institutional capital enters Bitcoin without proportionate investment in altcoins. The findings suggest that Bitcoin is evolving into a distinct, standalone asset class with weaker integration in the broader cryptocurrency market. Policy and investment implications include reconsidering portfolio diversification strategies, reassessing systemic risk, and designing digital asset financial instruments to account for market segmentation and institutional flows.
    Keywords: Bitcoin ETF; cryptocurrency markets; LSTM modeling; capital flows; asset decoupling; correlation dynamics; deep learning
    JEL: F3 G3
    Date: 2026–02–28
    URL: https://d.repec.org/n?u=RePEc:ehl:lserod:137306
  23. By: Moon Duchin; Kristopher Tapp
    Abstract: In this paper, we develop the metric geometry of ranking statistics, proving that the two major permutation distances in the statistics literature -- Kendall tau and Spearman footrule -- extend naturally to incomplete rankings with both coordinate embeddings and graph realizations. This gives us a unifying framework that allows us to connect popular topics in computational social choice: metric preferences (and metric distortion), polarization, and proportionality. As an important application, the metric structure enables efficient identification of blocs of voters and slates of their preferred candidates. Since the definitions work for partial ballots, we can execute the methods not only on synthetic elections, but on a suite of real-world elections. This gives us robust clustering methods that often produce an identical grouping of voters -- even though one family of methods is based on a Condorcet-consistent ranking rule while the other is not.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.10293
  24. By: German Nova Orozco; Duy-Minh Dang; Peter A. Forsyth
    Abstract: Money-back guarantees (MBGs) are features of pooled retirement income products that address bequest concerns by ensuring the initial premium is returned through lifetime payments or, upon early death, as a death benefit to the estate. This paper studies optimal retirement decumulation in an individual tontine account with an MBG overlay under international diversification and systematic longevity risk. The retiree chooses withdrawals and asset allocation dynamically to trade off expected total withdrawals (EW) against the Conditional Value-at-Risk (CVaR) of terminal wealth, subject to realistic investment constraints. The optimization is solved under a plan-to-live convention, while stochastic mortality affects outcomes through its impact on mortality credits at the pool level. We develop a neural-network based computational approach for the resulting high-dimensional, constrained control problem. The MBG is priced ex post under the induced EW--CVaR optimal policy via a simulation-based actuarial rule that combines expected guarantee costs with a prudential tail buffer. Using long-horizon historical return data expressed in real domestic-currency terms, we find that international diversification and longevity pooling jointly deliver the largest improvements in the EW--CVaR trade-off, while stochastic mortality shifts the frontier modestly in the expected direction. The optimal controls use foreign equity primarily as a state-dependent catch-up instrument, and implied MBG loads are driven mainly by tail outcomes (and the chosen prudential buffer) rather than by mean payouts.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.16212
  25. By: Yiqing Xu; Leo Yang Yang
    Abstract: Reproducibility is central to research credibility, yet large-scale reanalysis of empricial data remains costly because replication packages vary widely in structure, software environment, and documentation. We develop and evaluate an agentic AI workflow that addresses this execution bottleneck while preserving scientific rigor. The system separates scientific reasoning from computational execution: researchers design fixed diagnostic templates, and the workflow automates the acquisition, harmonization, and execution of replication materials using pre-specified, version-controlled code. A structured knowledge layer records resolved failure patterns, enabling adaptation across heterogeneous studies while keeping each pipeline version transparent and stable. We evaluate this workflow on 92 instrumental variable (IV) studies, including 67 with manually verified reproducible 2SLS estimates and 25 newly published IV studies under identical criteria. For each paper, we analyze up to three two-stage least squares (2SLS) specifications, totaling 215. Across the 92 papers, the system achieves 87% end-to-end success overall. Conditional on accessible data and code, reproducibility is 100% at both the paper and specification levels. The framework substantially lowers the cost of executing established empirical protocols and can be adapted in empirical settings where analytic templates and norms of transparency are well established.
    Date: 2026–02
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2602.16733

This nep-cmp issue is ©2026 by Stan Miles. It is provided as is without any express or implied warranty. It may be freely redistributed in whole or in part for any purpose. If distributed in part, please include this notice.
General information on the NEP project can be found at https://nep.repec.org. For comments please write to the director of NEP, Marco Novarese at <director@nep.repec.org>. Put “NEP” in the subject, otherwise your mail may be rejected.
NEP’s infrastructure is sponsored by the Griffith Business School of Griffith University in Australia.