Getting structured data from the internet running web crawlers/scrapers on a Big Data production scale
Utilize web scraping at scale to quickly get unlimited amounts of free data available on the web into a structured format. This book teaches you to use Python scripts to crawl through websites at scale and scrape data from HTML and JavaScript-enabled pages and convert it into structured data formats...
Otros Autores: | |
---|---|
Formato: | Libro electrónico |
Idioma: | Inglés |
Publicado: |
[Place of publication not identified] :
Apress
[2020]
|
Edición: | 1st ed. 2020. |
Materias: | |
Ver en Biblioteca Universitat Ramon Llull: | https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009631199106719 |
Tabla de Contenidos:
- Chapter 1: Introduction to Web Scraping
- Chapter 2: Web Scraping in Python Using Beautiful Soup Library
- Chapter 3: Introduction to Cloud Computing and Amazon Web Services (AWS)
- Chapter 4: Natural Language Processing (NLP) and Text Analytics
- Chapter 5: Relational Databases and SQL Language
- Chapter 6: Introduction to Common Crawl Datasets
- Chapter 7: Web Crawl Processing on Big Data Scale
- Chapter 8: Advanced Web Crawlers
- .