Title: Survey vs Scraped Data: Comparing Time Series Properties of Web and Survey Vacancy Data
Authors: DE PEDRAZA GARCIA PABLOVISINTIN STEFANOKEA TIJDENSKISMIHOK GABOR
Citation: IZA Journal of Labor Economics vol. 8 no. 1 p. 1-23
Publisher: Springer Open
Publication Year: 2019
JRC N°: JRC113401
ISSN: 2193-8997 (online)
URI: https://content.sciendo.com/view/journals/izajole/8/1/article-20190004.xml
https://publications.jrc.ec.europa.eu/repository/handle/JRC113401
DOI: https://doi.org/10.2478/izajole-2019-0004 (online)
Type: Articles in periodicals and books
Abstract: This paper studies the relationship between a vacancy population obtained from web crawling and vacancies in the economy inferred by a National Statistics Office (NSO) using a traditional method. We compare the time series properties of samples obtained between 2007 and 2014 by Statistics Netherlands and by a web scraping company. We find that the web and NSO vacancy data present similar time series properties, suggesting that both time series are generated by the same underlying phenomenon: the real number of new vacancies in the economy. We conclude that, in our case study, web-sourced data are able to capture aggregate economic activity in the labor market.
JRC Directorate:Joint Research Centre Corporate Activities

Files in This Item:
There are no files associated with this item.


Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.