Data science for dummies

Monetize your company's data and data science expertise without spending a fortune on hiring independent strategy consultants to help What if there was one simple, clear process for ensuring that all your company's data science projects achieve a high a return on investment? What if you co...

Full description

Bibliographic Details
Other Authors: Pierson, Lillian, author (author)
Format: eBook
Language:Inglés
Published: Hoboken, New Jersey : John Wiley & Sons, Inc [2021]
Edition:Third edition
Series:--For dummies
Subjects:
See on Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009645677406719
Table of Contents:
  • Intro
  • Title Page
  • Copyright Page
  • Table of Contents
  • Introduction
  • About This Book
  • Foolish Assumptions
  • Icons Used in This Book
  • Beyond the Book
  • Where to Go from Here
  • Part 1 Getting Started with Data Science
  • Chapter 1 Wrapping Your Head Around Data Science
  • Seeing Who Can Make Use of Data Science
  • Inspecting the Pieces of the Data Science Puzzle
  • Collecting, querying, and consuming data
  • Applying mathematical modeling to data science tasks
  • Deriving insights from statistical methods
  • Coding, coding, coding - it's just part of the game
  • Applying data science to a subject area
  • Communicating data insights
  • Exploring Career Alternatives That Involve Data Science
  • The data implementer
  • The data leader
  • The data entrepreneur
  • Chapter 2 Tapping into Critical Aspects of Data Engineering
  • Defining Big Data and the Three Vs
  • Grappling with data volume
  • Handling data velocity
  • Dealing with data variety
  • Identifying Important Data Sources
  • Grasping the Differences among Data Approaches
  • Defining data science
  • Defining machine learning engineering
  • Defining data engineering
  • Comparing machine learning engineers, data scientists, and data engineers
  • Storing and Processing Data for Data Science
  • Storing data and doing data science directly in the cloud
  • Storing big data on-premise
  • Processing big data in real-time
  • Part 2 Using Data Science to Extract Meaning from Your Data
  • Chapter 3 Machine Learning Means . . . Using a Machine to Learn from Data
  • Defining Machine Learning and Its Processes
  • Walking through the steps of the machine learning process
  • Becoming familiar with machine learning terms
  • Considering Learning Styles
  • Learning with supervised algorithms
  • Learning with unsupervised algorithms
  • Learning with reinforcement
  • Seeing What You Can Do.
  • Selecting algorithms based on function
  • Using Spark to generate real-time big data analytics
  • Chapter 4 Math, Probability, and Statistical Modeling
  • Exploring Probability and Inferential Statistics
  • Probability distributions
  • Conditional probability with Naïve Bayes
  • Quantifying Correlation
  • Calculating correlation with Pearson's r
  • Ranking variable-pairs using Spearman's rank correlation
  • Reducing Data Dimensionality with Linear Algebra
  • Decomposing data to reduce dimensionality
  • Reducing dimensionality with factor analysis
  • Decreasing dimensionality and removing outliers with PCA
  • Modeling Decisions with Multiple Criteria Decision-Making
  • Turning to traditional MCDM
  • Focusing on fuzzy MCDM
  • Introducing Regression Methods
  • Linear regression
  • Logistic regression
  • Ordinary least squares (OLS) regression methods
  • Detecting Outliers
  • Analyzing extreme values
  • Detecting outliers with univariate analysis
  • Detecting outliers with multivariate analysis
  • Introducing Time Series Analysis
  • Identifying patterns in time series
  • Modeling univariate time series data
  • Chapter 5 Grouping Your Way into Accurate Predictions
  • Starting with Clustering Basics
  • Getting to know clustering algorithms
  • Examining clustering similarity metrics
  • Identifying Clusters in Your Data
  • Clustering with the k-means algorithm
  • Estimating clusters with kernel density estimation (KDE)
  • Clustering with hierarchical algorithms
  • Dabbling in the DBScan neighborhood
  • Categorizing Data with Decision Tree and Random Forest Algorithms
  • Drawing a Line between Clustering and Classification
  • Introducing instance-based learning classifiers
  • Getting to know classification algorithms
  • Making Sense of Data with Nearest Neighbor Analysis
  • Classifying Data with Average Nearest Neighbor Algorithms.
  • Classifying with K-Nearest Neighbor Algorithms
  • Understanding how the k-nearest neighbor algorithm works
  • Knowing when to use the k-nearest neighbor algorithm
  • Exploring common applications of k-nearest neighbor algorithms
  • Solving Real-World Problems with Nearest Neighbor Algorithms
  • Seeing k-nearest neighbor algorithms in action
  • Seeing average nearest neighbor algorithms in action
  • Chapter 6 Coding Up Data Insights and Decision Engines
  • Seeing Where Python and R Fit into Your Data Science Strategy
  • Using Python for Data Science
  • Sorting out the various Python data types
  • Putting loops to good use in Python
  • Having fun with functions
  • Keeping cool with classes
  • Checking out some useful Python libraries
  • Using Open Source R for Data Science
  • Comprehending R's basic vocabulary
  • Delving into functions and operators
  • Iterating in R
  • Observing how objects work
  • Sorting out R's popular statistical analysis packages
  • Examining packages for visualizing, mapping, and graphing in R
  • Chapter 7 Generating Insights with Software Applications
  • Choosing the Best Tools for Your Data Science Strategy
  • Getting a Handle on SQL and Relational Databases
  • Investing Some Effort into Database Design
  • Defining data types
  • Designing constraints properly
  • Normalizing your database
  • Narrowing the Focus with SQL Functions
  • Making Life Easier with Excel
  • Using Excel to quickly get to know your data
  • Reformatting and summarizing with PivotTables
  • Automating Excel tasks with macros
  • Chapter 8 Telling Powerful Stories with Data
  • Data Visualizations: The Big Three
  • Data storytelling for decision makers
  • Data showcasing for analysts
  • Designing data art for activists
  • Designing to Meet the Needs of Your Target Audience
  • Step 1: Brainstorm (All about Eve)
  • Step 2: Define the purpose.
  • Step 3: Choose the most functional visualization type for your purpose
  • Picking the Most Appropriate Design Style
  • Inducing a calculating, exacting response
  • Eliciting a strong emotional response
  • Selecting the Appropriate Data Graphic Type
  • Standard chart graphics
  • Comparative graphics
  • Statistical plots
  • Topology structures
  • Spatial plots and maps
  • Testing Data Graphics
  • Adding Context
  • Creating context with data
  • Creating context with annotations
  • Creating context with graphical elements
  • Part 3 Taking Stock of Your Data Science Capabilities
  • Chapter 9 Developing Your Business Acumen
  • Bridging the Business Gap
  • Contrasting business acumen with subject matter expertise
  • Defining business acumen
  • Traversing the Business Landscape
  • Seeing how data roles support the business in making money
  • Leveling up your business acumen
  • Fortifying your leadership skills
  • Surveying Use Cases and Case Studies
  • Documentation for data leaders
  • Documentation for data implementers
  • Chapter 10 Improving Operations
  • Establishing Essential Context for Operational Improvements Use Cases
  • Exploring Ways That Data Science Is Used to Improve Operations
  • Making major improvements to traditional manufacturing operations
  • Optimizing business operations with data science
  • An AI case study: Automated, personalized, and effective debt collection processes
  • Gaining logistical efficiencies with better use of real-time data
  • Another AI case study: Real-time optimized logistics routing
  • Modernizing media and the press with data science and AI
  • Generating content with the click of a button
  • Yet another case study: Increasing content generation rates
  • Chapter 11 Making Marketing Improvements
  • Exploring Popular Use Cases for Data Science in Marketing
  • Turning Web Analytics into Dollars and Sense.
  • Getting acquainted with omnichannel analytics
  • Mapping your channels
  • Building analytics around channel performance
  • Scoring your company's channels
  • Building Data Products That Increase Sales-and-Marketing ROI
  • Increasing Profit Margins with Marketing Mix Modeling
  • Collecting data on the four Ps
  • Implementing marketing mix modeling
  • Increasing profitability with MMM
  • Chapter 12 Enabling Improved Decision-Making
  • Improving Decision-Making
  • Barking Up the Business Intelligence Tree
  • Using Data Analytics to Support Decision-Making
  • Types of analytics
  • Common challenges in analytics
  • Data wrangling
  • Increasing Profit Margins with Data Science
  • Seeing which kinds of data are useful when using data science for decision support
  • Directing improved decision-making for call center agents
  • Discovering the tipping point where the old way stops working
  • Chapter 13 Decreasing Lending Risk and Fighting Financial Crimes
  • Decreasing Lending Risk with Clustering and Classification
  • Preventing Fraud Via Natural Language Processing (NLP)
  • Chapter 14 Monetizing Data and Data Science Expertise
  • Setting the Tone for Data Monetization
  • Monetizing Data Science Skills as a Service
  • Data preparation services
  • Model building services
  • Selling Data Products
  • Direct Monetization of Data Resources
  • Coupling data resources with a service and selling it
  • Making money with data partnerships
  • Pricing Out Data Privacy
  • Part 4 Assessing Your Data Science Options
  • Chapter 15 Gathering Important Information about Your Company
  • Unifying Your Data Science Team Under a Single Business Vision
  • Framing Data Science around the Company's Vision, Mission, and Values
  • Taking Stock of Data Technologies
  • Inventorying Your Company's Data Resources
  • Requesting your data dictionary and inventory.
  • Confirming what's officially on file.