
Web Scraping: State-of-the-Art and Areas of Application

Abstract : The main objective of Web Scraping is to extract information from one or more websites and process it into simple structures such as spreadsheets, databases, or CSV files. However, in addition to being a very complicated task, Web Scraping is resource- and time-consuming, especially when it is carried out manually. Previous studies have developed several automated solutions. The purpose of this article is to review the existing Web Scraping approaches, categories, and tools, as well as its areas of application.
Document type :
Conference papers

https://hal-utt.archives-ouvertes.fr/hal-02492481
Contributor : Jean-Baptiste Vu Van
Submitted on : Thursday, February 27, 2020 - 9:03:26 AM
Last modification on : Friday, February 28, 2020 - 1:31:19 AM

Collections

ROSAS | UTT | CNRS

Citation

Rabiyatou Diouf, Edouard Ngor Sarr, Ousmane Sall, Babiga Birregah, Mamadou Bousso, et al. Web Scraping: State-of-the-Art and Areas of Application. 2019 IEEE International Conference on Big Data (Big Data), Dec 2019, Los Angeles, United States. pp. 6040-6042, ⟨10.1109/BigData47090.2019.9005594⟩. ⟨hal-02492481⟩
