on Intellectual Property Rights |
Issue of 2014‒10‒17
Two papers chosen by Giovanni Ramello, Università degli Studi del Piemonte Orientale “Amedeo Avogadro” |
By: | Bruno VAN POTTELSBERGHE; Nicolas VAN ZEEBROECK |
URL: | http://d.repec.org/n?u=RePEc:ekd:000238:23800148&r=ipr |
By: | Mark Huberty; Amma Serwaah; Georg Zachmann |
Abstract: | The inventors in PATSTAT are often duplicates: the same person or company may be split into multiple entries in PATSTAT, each associated with different patents. In this paper, we address this problem with an algorithm that efficiently de-duplicates the data. It needs minimal manual input and works well even on consumer-grade computers. Comparisons between entries are not limited to their names, so this algorithm improves on earlier ones that required extensive manual work or overly cautious clean-up of the names. The source code is available on GitHub, and the data are available for download. |
Date: | 2014–09 |
URL: | http://d.repec.org/n?u=RePEc:bre:wpaper:850&r=ipr |