nep-big New Economics Papers
on Big Data
Issue of 2024–11–11
twenty-one papers chosen by
Tom Coupé, University of Canterbury


  1. Machine Learning in Portfolio Decisions By Manuela Pedio; Massimo Guidolin; Giulia Panzeri
  2. Online Investor Sentiment via Machine Learning By Zongwu Cai; Pixiong Chen
  3. Dynamic Portfolio Rebalancing: A Hybrid new Model Using GNNs and Pathfinding for Cost Efficiency By Diego Vallarino
  4. Machine Learning for Propensity Score Estimation: A Systematic Review and Reporting Guidelines By Leite, Walter; Zhang, Huibin; collier, zachary; Chawla, Kamal; , l.kong@ufl.edu; Lee, Yongseok; Quan, Jia; Soyoye, Olushola
  5. Breaking Down the REIT Market: Is Social Media Capable of Predicting a REITs’ Performance? By Sophia Bodensteiner; Lukas Lautenschlaeger; Wolfgang Schäfers
  6. Mending the Crystal Ball: Enhanced Inflation Forecasts with Machine Learning By Yang Liu; Ran Pan; Rui Xu
  7. Financial Sentiment Analysis on News and Reports Using Large Language Models and FinBERT By Yanxin Shen; Pulin Kirin Zhang
  8. Navigating Inflation in Ghana: How Can Machine Learning Enhance Economic Stability and Growth Strategies By Theophilus G. Baidoo; Ashley Obeng
  9. Deep learning for high-dimensional continuous-time stochastic optimal control without explicit solution By Dupret, Jean-Loup; Hainaut, Donatien
  10. Collusion Detection with Graph Neural Networks By Lucas Gomes; Jannis Kueck; Mara Mattes; Martin Spindler; Alexey Zaytsev
  11. Optimizing Performance: How Compact Models Match or Exceed GPT's Classification Capabilities through Fine-Tuning By Baptiste Lefort; Eric Benhamou; Jean-Jacques Ohana; David Saltiel; Beatrice Guez
  12. A Dynamic Approach to Stock Price Prediction: Comparing RNN and Mixture of Experts Models Across Different Volatility Profiles By Diego Vallarino
  13. Fine-Tuning Large Language Models to Simulate German Voting Behaviour (Working Paper) By Holtdirk, Tobias; Assenmacher, Dennis; Bleier, Arnim; Wagner, Claudia
  14. Machine Learning and the Yield Curve:Tree-Based Macroeconomic Regime Switching By Siyu Bie; Francis X. Diebold; Jingyu He; Junye Li
  15. Forecast of residential real estate prices in Slovakia using of neural networks By Miroslav Pánik; Andrej Adamušin
  16. An Anatomy of Firms’ Political Speech By Pablo Ottonello; Wenting Song; Sebastian Sotelo
  17. Information disclosure vs. information learning via Google search By Damian Damianov; Xiangdong Wang; Cheng Yan
  18. Disentangling Demand and Supply of Media Bias: The Case of Newspaper Homepages By Leung, Tin Cheuk; Strumpf, Koleman
  19. Real-GPT: Efficiently Tailoring LLMs for Informed Decision-Making in the Real Estate Industry By Benedikt Gloria; Sven Bienert; Johannes Melsbach; Detlef Schoder
  20. Forecasting short-term inflation in Argentina with Random Forest Models By Federico Daniel Forte
  21. Do Words Match Deeds? Exploring the Link Between ESG Discourse and Performance of Real Estate Companies By Siqi Huang; Anupam Nanda; Eero Valtonen

  1. By: Manuela Pedio; Massimo Guidolin; Giulia Panzeri
    Abstract: Machine learning is significantly shaping the advancement of various fields, and among them, notably, finance, where its range of applications and efficiency impacts are seemingly boundless. Contemporary techniques, particularly in reinforcement learning, have prompted both practitioners and academics to contemplate the potential of an artificial intelligence revolution in portfolio management. In this paper, we provide an overview of the primary methods in machine learning currently utilized in portfolio decision-making. We delve into discussions surrounding the existing limitations of machine learning algorithms and explore prevailing hypotheses regarding their future expansions. Specifically, we categorize and analyze the applications of machine learning in systematic trading strategies, portfolio weight optimization, smart beta and passive investment strategies, textual analysis, and trade execution, each separately surveyed for a comprehensive understanding.
    Keywords: Machine learning; portfolio choice; artificial intelligence; neural language processing; stock return predictions, market timing, mean-variance asset allocation.
    JEL: C45 C61 G10 G11 G17
    Date: 2024
    URL: https://d.repec.org/n?u=RePEc:baf:cbafwp:cbafwp24233
  2. By: Zongwu Cai (Department of Economics, The University of Kansas, Lawrence, KS 66045, USA); Pixiong Chen (Division of Model Risk Management, Wells Fargo Bank, Charlotte, NC 28202, USA)
    Abstract: In this paper, we propose utilizing machine learning methods to determine the expected aggregated stock market risk premium based on online investor sentiment and employing the multifold forward-validation method to select the relevant hyperparameters. Our empirical studies provide strong evidence that some machine learning methods, such as extreme gradient boosting or random forest, show significant predictive ability in terms of their out-of-sample performances with high-dimensional investor sentiment proxies. They also outperform the traditional linear models, which shows a possible unobserved nonlinear relationship between online investor sentiment and risk premium. Moreover, this predictability based on online investor sentiment has a better economic value, so it improves portfolio performance for investors who need to decide the optimal asset allocation in terms of the certainty equivalent return gain and the Sharpe ratio.
    Keywords: Asset return; Machine learning; Nonlinearity; Portfolio allocations; Predictability.
    JEL: C45 C55 C58 G11 G17
    Date: 2024–09
    URL: https://d.repec.org/n?u=RePEc:kan:wpaper:202411
  3. By: Diego Vallarino
    Abstract: This paper introduces a novel approach to optimizing portfolio rebalancing by integrating Graph Neural Networks (GNNs) for predicting transaction costs and Dijkstra's algorithm for identifying cost-efficient rebalancing paths. Using historical stock data from prominent technology firms, the GNN is trained to forecast future transaction costs, which are then applied as edge weights in a financial asset graph. Dijkstra's algorithm is used to find the least costly path for reallocating capital between assets. Empirical results show that this hybrid approach significantly reduces transaction costs, offering a powerful tool for portfolio managers, especially in high-frequency trading environments. This methodology demonstrates the potential of combining advanced machine learning techniques with classical optimization algorithms to improve financial decision-making processes. Future research will explore expanding the asset universe and incorporating reinforcement learning for continuous portfolio optimization.
    Date: 2024–10
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2410.01864
  4. By: Leite, Walter; Zhang, Huibin; collier, zachary; Chawla, Kamal; , l.kong@ufl.edu; Lee, Yongseok (University of Florida); Quan, Jia; Soyoye, Olushola
    Abstract: Machine learning has become a common approach for estimating propensity scores for quasi-experimental research using matching, weighting, or stratification on the propensity score. This systematic review examined machine learning applications for propensity score estimation across different fields, such as health, education, social sciences, and business over 40 years. The results show that the gradient boosting machine (GBM) is the most frequently used method, followed by random forest. Classification and regression trees (CART), neural networks, and the super learner were also used in more than five percent of studies. The most frequently used packages to estimate propensity scores were twang, gbm and randomforest in the R statistical software. The review identified many hyperparameter configurations used for machine learning methods. However, it also shows that hyperparameters are frequently under-reported, as well as critical steps of the propensity score analysis, such as the covariate balance evaluation. A set of guidelines for reporting the use of machine learning for propensity score estimation is provided.
    Date: 2024–10–09
    URL: https://d.repec.org/n?u=RePEc:osf:osfxxx:gmrk7
  5. By: Sophia Bodensteiner; Lukas Lautenschlaeger; Wolfgang Schäfers
    Abstract: Twitter is established as a major platform for sharing information and opinions online. This research explores the impact of the sentiment expressed on Twitter on the indirect U.S. real estate market, particularly focusing on financial metrics such as returns and volatility. It analyzes how Twitter sentiment correlates with the overall indirect market and additionally focuses on the corporate level, investigating if general findings are also applicable on an individual company basis. Given by the nature of Twitter messages, comprehensive natural language processing is applied to clean and identify relevant posts and to provide the foundation for extracting the sentiment. The complex linguistic features of the given informal language are handled by using three different approaches for classification including traditional and advanced machine-learning methods. Preliminary results suggest that social media sentiment holds predictive value for both market trends and corporate-level changes. Moreover, they indicate towards changing dynamics in the impact of market sentiment on performance metrics during a crisis, exemplified by the 2020 COVID-19 pandemic. This research additionally highlights the effectiveness of classical dictionary-based approaches for sentiment analysis but also shows the capabilities of more sophisticated classifiers.
    Keywords: Machine Learning; REITs; Sentiment Analysis; Twitter
    JEL: R3
    Date: 2024–01–01
    URL: https://d.repec.org/n?u=RePEc:arz:wpaper:eres2024-090
  6. By: Yang Liu; Ran Pan; Rui Xu
    Abstract: Forecasting inflation has become a major challenge for central banks since 2020, due to supply chain disruptions and economic uncertainty post-pandemic. Machine learning models can improve forecasting performance by incorporating a wider range of variables, allowing for non-linear relationships, and focusing on out-of-sample performance. In this paper, we apply machine learning (ML) models to forecast near-term core inflation in Japan post-pandemic. Japan is a challenging case, because inflation had been muted until 2022 and has now risen to a level not seen in four decades. Four machine learning models are applied to a large set of predictors alongside two benchmark models. For 2023, the two penalized regression models systematically outperform the benchmark models, with LASSO providing the most accurate forecast. Useful predictors of inflation post-2022 include household inflation expectations, inbound tourism, exchange rates, and the output gap.
    Keywords: Core inflation; forecasting; machine learning models; LASSO; Japan
    Date: 2024–09–27
    URL: https://d.repec.org/n?u=RePEc:imf:imfwpa:2024/206
  7. By: Yanxin Shen; Pulin Kirin Zhang
    Abstract: Financial sentiment analysis (FSA) is crucial for evaluating market sentiment and making well-informed financial decisions. The advent of large language models (LLMs) such as BERT and its financial variant, FinBERT, has notably enhanced sentiment analysis capabilities. This paper investigates the application of LLMs and FinBERT for FSA, comparing their performance on news articles, financial reports and company announcements. The study emphasizes the advantages of prompt engineering with zero-shot and few-shot strategy to improve sentiment classification accuracy. Experimental results indicate that GPT-4o, with few-shot examples of financial texts, can be as competent as a well fine-tuned FinBERT in this specialized field.
    Date: 2024–10
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2410.01987
  8. By: Theophilus G. Baidoo; Ashley Obeng
    Abstract: Inflation remains a persistent challenge for many African countries. This research investigates the critical role of machine learning (ML) in understanding and managing inflation in Ghana, emphasizing its significance for the country's economic stability and growth. Utilizing a comprehensive dataset spanning from 2010 to 2022, the study aims to employ advanced ML models, particularly those adept in time series forecasting, to predict future inflation trends. The methodology is designed to provide accurate and reliable inflation forecasts, offering valuable insights for policymakers and advocating for a shift towards data-driven approaches in economic decision-making. This study aims to significantly advance the academic field of economic analysis by applying machine learning (ML) and offering practical guidance for integrating advanced technological tools into economic governance, ultimately demonstrating ML's potential to enhance Ghana's economic resilience and support sustainable development through effective inflation management.
    Date: 2024–10
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2410.05630
  9. By: Dupret, Jean-Loup (Université catholique de Louvain, LIDAM/ISBA, Belgium); Hainaut, Donatien (Université catholique de Louvain, LIDAM/ISBA, Belgium)
    Abstract: This paper introduces the GPI-PINN algorithm, a novel numerical scheme for solving continuous-time stochastic optimal control problems in high dimensions when the optimal control does not admit an explicit solution. Combining Physics-Informed Neural Networks with an Actor-Critic structure built upon the Generalized Policy Iteration technique, this successive deep learning algorithm employs two separate neural networks to approximate both the value function and the multidimensional optimal control. This way, the GPI-PINN algorithm manages to achieve a global approximation of the optimal solution across all time and space, which can be evaluated online rapidly. The optimality and convergence of the scheme are demonstrated theoretically and its accuracy and efficacy are shown empirically based on two numerical examples. In particular, we generalize the standard Almgren-Chriss model arising from optimal liquidation in finance by allowing for a price impact model with fully nonlinear temporary and permanent impact functions and by considering a multidimensional setting with numerous co-integrated assets.
    Keywords: Machine learning ; Stochastic optimal control ; Deep learning ; Physics-Informed Neural Networks ; Optimal liquidation
    Date: 2024–05–30
    URL: https://d.repec.org/n?u=RePEc:aiz:louvad:2024016
  10. By: Lucas Gomes; Jannis Kueck; Mara Mattes; Martin Spindler; Alexey Zaytsev
    Abstract: Collusion is a complex phenomenon in which companies secretly collaborate to engage in fraudulent practices. This paper presents an innovative methodology for detecting and predicting collusion patterns in different national markets using neural networks (NNs) and graph neural networks (GNNs). GNNs are particularly well suited to this task because they can exploit the inherent network structures present in collusion and many other economic problems. Our approach consists of two phases: In Phase I, we develop and train models on individual market datasets from Japan, the United States, two regions in Switzerland, Italy, and Brazil, focusing on predicting collusion in single markets. In Phase II, we extend the models' applicability through zero-shot learning, employing a transfer learning approach that can detect collusion in markets in which training data is unavailable. This phase also incorporates out-of-distribution (OOD) generalization to evaluate the models' performance on unseen datasets from other countries and regions. In our empirical study, we show that GNNs outperform NNs in detecting complex collusive patterns. This research contributes to the ongoing discourse on preventing collusion and optimizing detection methodologies, providing valuable guidance on the use of NNs and GNNs in economic applications to enhance market fairness and economic welfare.
    Date: 2024–10
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2410.07091
  11. By: Baptiste Lefort; Eric Benhamou; Jean-Jacques Ohana; David Saltiel; Beatrice Guez
    Abstract: In this paper, we demonstrate that non-generative, small-sized models such as FinBERT and FinDRoBERTa, when fine-tuned, can outperform GPT-3.5 and GPT-4 models in zero-shot learning settings in sentiment analysis for financial news. These fine-tuned models show comparable results to GPT-3.5 when it is fine-tuned on the task of determining market sentiment from daily financial news summaries sourced from Bloomberg. To fine-tune and compare these models, we created a novel database, which assigns a market score to each piece of news without human interpretation bias, systematically identifying the mentioned companies and analyzing whether their stocks have gone up, down, or remained neutral. Furthermore, the paper shows that the assumptions of Condorcet's Jury Theorem do not hold suggesting that fine-tuned small models are not independent of the fine-tuned GPT models, indicating behavioural similarities. Lastly, the resulted fine-tuned models are made publicly available on HuggingFace, providing a resource for further research in financial sentiment analysis and text classification.
    Date: 2024–08
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2409.11408
  12. By: Diego Vallarino
    Abstract: This study evaluates the effectiveness of a Mixture of Experts (MoE) model for stock price prediction by comparing it to a Recurrent Neural Network (RNN) and a linear regression model. The MoE framework combines an RNN for volatile stocks and a linear model for stable stocks, dynamically adjusting the weight of each model through a gating network. Results indicate that the MoE approach significantly improves predictive accuracy across different volatility profiles. The RNN effectively captures non-linear patterns for volatile companies but tends to overfit stable data, whereas the linear model performs well for predictable trends. The MoE model's adaptability allows it to outperform each individual model, reducing errors such as Mean Squared Error (MSE) and Mean Absolute Error (MAE). Future work should focus on enhancing the gating mechanism and validating the model with real-world datasets to optimize its practical applicability.
    Date: 2024–10
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2410.07234
  13. By: Holtdirk, Tobias; Assenmacher, Dennis; Bleier, Arnim; Wagner, Claudia
    Abstract: Surveys are a cornerstone of empirical social science research, providing invaluable insights into the opinions, beliefs, behaviours, and characteristics of people. However, issues such as refusal to participate, skipping questions, sampling bias, and attrition significantly impact the quality and reliability of survey data. Recently, researchers have started investigating the potential of Large Language Models (LLMs) to role-play a pre-defined set of "characters" and simulate their survey responses with little or no additional training data and costs. While previous research on forecasting, imputing, and simulating survey answers with LLMs has focused on zero-shot and few-shot approaches, this study investigates the viability of fine-tuning LLMs to simulate responses of survey participants. We fine-tune Large Language Models (LLMs) on subsets of the data from the German Longitudinal Election Study (GLES) and evaluate their predictive performance on the "vote choice" for a random set of held-out participants. We compare the LLMs' performance against various baseline methods. Our findings show that small, fine-tuned open-source LLMs can outperform zero-shot predictions of larger LLMs. They are able to match the performance of established tabular data classifiers, are more sample efficient, and outperform them in cases with systematic non-responses. This study contributes to the growing body of research on LLMs for simulating survey data by demonstrating the effectiveness of fine-tuning approaches.
    Date: 2024–10–07
    URL: https://d.repec.org/n?u=RePEc:osf:osfxxx:udz28
  14. By: Siyu Bie (City University of Hong Kong); Francis X. Diebold (University of Pennsylvania); Jingyu He (City University of Hong Kong); Junye Li (Fudan University)
    Abstract: We explore tree-based macroeconomic regime-switching in the context of the dynamic Nelson-Siegel (DNS) yield-curve model. In particular, we customize the treegrowing algorithm to partition macroeconomic variables based on the DNS model’s marginal likelihood, thereby identifying regime-shifting patterns in the yield curve. Compared to traditional Markov-switching models, our model offers clear economic interpretation via macroeconomic linkages and ensures computational simplicity. In an empirical application to U.S. Treasury bond yields, we find (1) important yield curve regime switching, and (2) evidence that macroeconomic variables have predictive power for the yield curve when the short rate is high, but not in other regimes, thereby refining the notion of yield curve “macro-spanning”.
    Keywords: Decision Tree; Macro-Finance; Term Structure; Regime Switching; Dynamic Nelson-Siegel Model; Bayesian Estimation
    JEL: C11 E43 G12
    Date: 2024–10–08
    URL: https://d.repec.org/n?u=RePEc:pen:papers:24-028
  15. By: Miroslav Pánik; Andrej Adamušin
    Abstract: The current turbulent development in the real estate market, not only in Slovakia but throughout Europe, is caused by several crises that have a global impact and partially resemble the situation in the market 15 years ago. One factor that negatively affects the real estate market in Slovakia is the lack of residential real estate. In the long term, the demand for housing significantly exceeds the supply of apartments.The article analyzes the prices of real estate intended for housing in Slovakia. It defines economic, demographic and social factors that have a significant impact on the development of real estate prices. The methodology of statistical data collection differs considerably in the developed countries of the EU. Národná banka Slovenska has been publishing quarterly data on real estate prices since 2005. Development modeling is possible, for example, using correlation and regression analysis, which is well known. In the presented article, the forecast of housing prices will be realized with the help of artificial intelligence - neural networks.
    Keywords: Artificial Intelligence; Forecasting; Neural Networks; residential real estate prices
    JEL: R3
    Date: 2024–01–01
    URL: https://d.repec.org/n?u=RePEc:arz:wpaper:eres2024-091
  16. By: Pablo Ottonello; Wenting Song; Sebastian Sotelo
    Abstract: We study the distribution of political speech across U.S. firms. We develop a measure of political engagement based on firms’ communications (earning calls, regulatory filings, and social media) by training a large language model to identify statements that contain political opinions. Using these data, we document five facts about firms’ political engagement: (1) Political engagement is rare among firms; (2) Political engagement is concentrated among large firms; (3) Firms tend to specialize in specific topics and outlets; (4) Large firms tend to engage in a wider set of topics and outlets; (5) The 2020 surge in firms’ political engagement was associated with an increase in the engagement of medium-sized firms and a change in the mix of political topics.
    Keywords: Firm dynamics; Market structure and pricing; Recent economic and financial developments
    JEL: D22 D63 G41 L11 L20
    Date: 2024–10
    URL: https://d.repec.org/n?u=RePEc:bca:bocawp:24-37
  17. By: Damian Damianov; Xiangdong Wang; Cheng Yan
    Abstract: We decompose the Google Trends Search Volume Index into naïve and sophisticated searches and examine their impacts on mortgage default, respectively. Using U.S. data from 2006 to 2018, we find that the sophisticated search activity has a positive and robust relationship with the change in the percentage of mortgages in 90+ days of delinquency. However, foreclosure starts are positively related to naïve search activity in the short term, but negatively related to sophisticated search activity in the long term. Borrowers are more likely learn from sophisticated online searches than from naïve online searches, and they can use the information to avoid foreclosure starts and keep their houses. The relationship between Google search activity and mortgage default outcomes are significantly stronger in states that experienced substantial house price drops in the recent year. Our findings are robust to a battery of alternative settings.
    Keywords: Google search; Information transmission; Mortgage Default Risk
    JEL: R3
    Date: 2024–01–01
    URL: https://d.repec.org/n?u=RePEc:arz:wpaper:eres2024-056
  18. By: Leung, Tin Cheuk (Wake Forest University, Economics Department); Strumpf, Koleman (Wake Forest University, Economics Department)
    Abstract: We introduce a novel method for detecting supply-side media bias without needing information on the ideological leanings of media owners or editors. Focusing on digital homepages, we analyzed over 100, 000 articles from The New York Times (NYT) and The Wall Street Journal (WSJ) and 24 million tweets. By controlling for demand-side factors, we attribute extended homepage visibility of ideologically slanted articles to supply-side biases. Liberal articles stay longer on the NYT homepage, while conservative ones persist on WSJ. Further analysis into articles’ transition to print and podcasts suggest that competition may reduce media bias, supporting the presence of supply-side biases.
    Keywords: Media Bias; Media Economics; Social Media; Machine Learning
    JEL: D22 D72 D83 L82
    Date: 2024–10–17
    URL: https://d.repec.org/n?u=RePEc:ris:wfuewp:0116
  19. By: Benedikt Gloria; Sven Bienert; Johannes Melsbach; Detlef Schoder
    Abstract: In recent times, large language models (LLMs) such as ChatGPT and LLaMA have gained significant attention. These models demonstrate remarkable capability in solving complex tasks, drawing knowledge primarily from a generalized database rather than niche subject areas. Consequently, there has been a growing demand for domain-specific LLMs tailored to social and natural sciences, such as BioGPT or BloombergGPT. In this study, we present our own domain-specific LLM focused on real estate, based on the parameter-efficient finetuning technique known as Low-rank adaptation (LoRA) applied to the Mistral 7B model. To create a comprehensive finetuning dataset, we compiled a curated 21k self-instruction dataset sourced from 670 scientific papers, market research, scholarly articles and real estate books. To assess the efficacy of Real-GPT, we devised a set of ca. 5, 000 multiple-choice questions to gauge the real estate knowledge of the models. Despite its notably compact size, our model outperforms other cutting-edge models. Consequently, our developed model not only showcases superior performance but also illustrates its capacity to facilitate investment decisions, interpret current market data, and potentially simplify property valuation processes. This development showcases the potential of LLMs to revolutionize the field of real estate analysis and decision-making.
    Keywords: Digitalisation; LLMs; NLP; real estate
    JEL: R3
    Date: 2024–01–01
    URL: https://d.repec.org/n?u=RePEc:arz:wpaper:eres2024-036
  20. By: Federico Daniel Forte
    Abstract: This paper examines the performance of Random Forest models in forecasting short-term monthly inflation in Argentina, based on a database of monthly indicators since 1962. It is found that these models achieve forecast accuracy that is statistically comparable to the consensus of market analysts' expectations surveyed by the Central Bank of Argentina (BCRA) and to traditional econometric models. One advantage of Random Forest models is that, as they are non-parametric, they allow for the exploration of nonlinear effects in the predictive power of certain macroeconomic variables on inflation. Among other findings, the relative importance of the exchange rate gap in forecasting inflation increases when the gap between the parallel and official exchange rates exceeds 60%. The predictive power of the exchange rate on inflation rises when the BCRA's net international reserves are negative or close to zero (specifically, below USD 2 billion). The relative importance of inflation inertia and the nominal interest rate in forecasting the following month's inflation increases when the nominal levels of inflation and/or interest rates rise.
    Date: 2024–10
    URL: https://d.repec.org/n?u=RePEc:arx:papers:2410.01175
  21. By: Siqi Huang; Anupam Nanda; Eero Valtonen
    Abstract: Research on ESG issues in real estate has indicated a complex link between sustainable real estate and financial success. A particular challenge in exploring the link comes from the lack of robust measures of various aspects of real estate sustainability, partly due to the limited availability of granular data. Presently, the assessment of firm-level real estate sustainability mainly uses ESG or CSR ratings from rating agencies, which rely heavily on self-disclosed ESG data from real estate companies. These third-party ESG/CSR ratings come with several limitations, including overlooking the potential impact of disclosure language patterns and sentiment bias on ESG ratings. In this study, we focus on the words around ESG actions and promises from official disclosures and credible news sources and offer a comprehensive evaluation of real estate companies' sustainability performance. We focus on 65 US REITs over a 15-year time period. Our preliminary analysis, based on computational linguistic techniques, highlights various salient features of what companies disclose in the public domain.
    Keywords: ESG Discourse; Real Estate Sustainability
    JEL: R3
    Date: 2024–01–01
    URL: https://d.repec.org/n?u=RePEc:arz:wpaper:eres2024-076

This nep-big issue is ©2024 by Tom Coupé. It is provided as is without any express or implied warranty. It may be freely redistributed in whole or in part for any purpose. If distributed in part, please include this notice.
General information on the NEP project can be found at https://nep.repec.org. For comments please write to the director of NEP, Marco Novarese at <director@nep.repec.org>. Put “NEP” in the subject, otherwise your mail may be rejected.
NEP’s infrastructure is sponsored by the School of Economics and Finance of Massey University in New Zealand.