nep-cul New Economics Papers
on Cultural Economics
Issue of 2023‒09‒25
two papers chosen by
Roberto Zanola, Università degli Studi del Piemonte Orientale


  1. Not as good as it used to be: Do streaming platforms penalize quality? By Gambato, Jacopo; Sandrini, Luca
  2. American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers By Melissa Dell; Jacob Carlson; Tom Bryan; Emily Silcock; Abhishek Arora; Zejiang Shen; Luca D'Amico-Wong; Quan Le; Pablo Querubin; Leander Heldring

  1. By: Gambato, Jacopo; Sandrini, Luca
    Abstract: We study the incentives of a streaming platform to bias consumption when products are vertically differentiated. The platform offers mixed bundles of content to monetize consumers' interest in variety and pays royalties to sellers based on the effective consumption of the content they produce. When products are not vertically differentiated, the platform has no incentive to bias consumption in equilibrium: the platform being active represents a Pareto-improvement compared to the case in which she is not. With vertical differentiation, royalties can differ; the platform always biases recommendations in favor of the cheapest content, which hurts consumers and the high-quality seller. Biased recommendation always diminishes the incentives of a seller to increase the quality of her content for a given demand. If a significant share of the users is ex-ante unaware of the existence of the sellers the platform can bias recommendations more freely, but joining the platform encourages investment in quality. The bias, however, can lead to inefficient allocation of R&D efforts. From a policy perspective, we propose this as a novel rationale for regulating algorithmic recommendations in streaming platforms.
    Keywords: platform economics, media economics, recommendation bias, innovation
    JEL: D4 L1 L5
    Date: 2023
    URL: http://d.repec.org/n?u=RePEc:zbw:zewdip:23032&r=cul
  2. By: Melissa Dell; Jacob Carlson; Tom Bryan; Emily Silcock; Abhishek Arora; Zejiang Shen; Luca D'Amico-Wong; Quan Le; Pablo Querubin; Leander Heldring
    Abstract: Existing full text datasets of U.S. public domain newspapers do not recognize the often complex layouts of newspaper scans, and as a result the digitized content scrambles texts from articles, headlines, captions, advertisements, and other layout regions. OCR quality can also be low. This study develops a novel, deep learning pipeline for extracting full article texts from newspaper images and applies it to the nearly 20 million scans in Library of Congress's public domain Chronicling America collection. The pipeline includes layout detection, legibility classification, custom OCR, and association of article texts spanning multiple bounding boxes. To achieve high scalability, it is built with efficient architectures designed for mobile phones. The resulting American Stories dataset provides high quality data that could be used for pre-training a large language model to achieve better understanding of historical English and historical world knowledge. The dataset could also be added to the external database of a retrieval-augmented language model to make historical information - ranging from interpretations of political events to minutiae about the lives of people's ancestors - more widely accessible. Furthermore, structured article texts facilitate using transformer-based methods for popular social science applications like topic classification, detection of reproduced content, and news story clustering. Finally, American Stories provides a massive silver quality dataset for innovating multimodal layout analysis models and other multimodal applications.
    Date: 2023–08
    URL: http://d.repec.org/n?u=RePEc:arx:papers:2308.12477&r=cul

This nep-cul issue is ©2023 by Roberto Zanola. It is provided as is without any express or implied warranty. It may be freely redistributed in whole or in part for any purpose. If distributed in part, please include this notice.
General information on the NEP project can be found at http://nep.repec.org. For comments please write to the director of NEP, Marco Novarese at <director@nep.repec.org>. Put “NEP” in the subject, otherwise your mail may be rejected.
NEP’s infrastructure is sponsored by the School of Economics and Finance of Massey University in New Zealand.