Beginning data analysis with python and Jupyter use powerful industry-standard tools to unlock new, actionable insight from your existing data

Getting started with data science doesn't have to be an uphill battle. This step-by-step guide is ideal for beginners who know a little Python and are looking for a quick, fast-paced introduction. About This Book Get up and running with the Jupyter ecosystem and some example datasets Learn abou...

Descripción completa

Detalles Bibliográficos
Otros Autores: Galea, Alex, author (author)
Formato: Libro electrónico
Idioma:Inglés
Publicado: Birmingham : Packt Publishing [2018]
Edición:1st edition
Materias:
Ver en Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009630737006719
Tabla de Contenidos:
  • Intro
  • Preface
  • Jupyter Fundamentals
  • Basic Functionality and Features
  • Subtopic A: What is a Jupyter Notebook and Why is it Useful?
  • Subtopic B: Navigating the Platform
  • Introducing Jupyter Notebooks
  • Subtopic C: Jupyter Features
  • Explore some of Jupyter's most useful features
  • Converting a Jupyter Notebook to a Python Script
  • Subtopic D: Python Libraries
  • Import the external libraries and set up the plotting environment
  • Our First Analysis - The Boston Housing Dataset
  • Subtopic A: Loading the Data into Jupyter Using a Pandas DataFrame
  • Load the Boston housing dataset
  • Subtopic B: Data Exploration
  • Explore the Boston housing dataset
  • Subtopic C: Introduction to Predictive Analytics with Jupyter Notebooks
  • Linear models with Seaborn and scikit-learn
  • Activity B: Building a Third-Order Polynomial Model
  • Subtopic D: Using Categorical Features for Segmentation Analysis
  • Create categorical fields from continuous variables and make segmented visualizations
  • Summary
  • Data Cleaning and Advanced Machine Learning
  • Preparing to Train a Predictive Model
  • Subtopic A: Determining a Plan for Predictive Analytics
  • Subtopic B: Preprocessing Data for Machine Learning
  • Explore data preprocessing tools and methods
  • Activity A: Preparing to Train a Predictive Model for the Employee-Retention Problem
  • Training Classification Models
  • Subtopic A: Introduction to Classification Algorithms
  • Training two-feature classification models with scikit-learn
  • The plot_decision_regions Function
  • Training k-nearest neighbors for our model
  • Training a Random Forest
  • Subtopic B: Assessing Models with k-Fold Cross-Validation and Validation Curves
  • Using k-fold cross validation and validation curves in Python with scikit-learn
  • Subtopic C: Dimensionality Reduction Techniques.
  • Training a predictive model for the employee retention problem
  • Summary
  • Web Scraping and Interactive Visualizations
  • Scraping Web Page Data
  • Subtopic A: Introduction to HTTP Requests
  • Subtopic B: Making HTTP Requests in the Jupyter Notebook
  • Handling HTTP requests with Python in a Jupyter Notebook
  • Subtopic C: Parsing HTML in the Jupyter Notebook
  • Parsing HTML with Python in a Jupyter Notebook
  • Activity A: Web Scraping with Jupyter Notebooks
  • Interactive Visualizations
  • Subtopic A: Building a DataFrame to Store and Organize Data
  • Building and merging Pandas DataFrames
  • Subtopic B: Introduction to Bokeh
  • Introduction to interactive visualizations with Bokeh
  • Activity B: Exploring Data with Interactive Visualizations
  • Summary
  • Index.