Getting structured data from the internet: running web crawlers/scrapers on a Big Data production scale

Utilize web scraping at scale to quickly get unlimited amounts of free data available on the web into a structured format. This book teaches you to use Python scripts to crawl through websites at scale and scrape data from HTML and JavaScript-enabled pages and convert it into structured data formats...


Bibliographic Details
Other Authors: Patel, Jay M., author
Format: E-book
Language: English
Published: [Place of publication not identified] : Apress [2020]
Edition: 1st ed. 2020.
Subjects:
View at Biblioteca Universitat Ramon Llull: https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009631199106719
Table of Contents:
  • Chapter 1: Introduction to Web Scraping
  • Chapter 2: Web Scraping in Python Using Beautiful Soup Library
  • Chapter 3: Introduction to Cloud Computing and Amazon Web Services (AWS)
  • Chapter 4: Natural Language Processing (NLP) and Text Analytics
  • Chapter 5: Relational Databases and SQL Language
  • Chapter 6: Introduction to Common Crawl Datasets
  • Chapter 7: Web Crawl Processing on Big Data Scale
  • Chapter 8: Advanced Web Crawlers