Limit search to available items
Book Cover
E-book
Author Zhang, Nailong

Title A Tour of Data Science : Learn R and Python in Parallel
Edition First edition
Published [Place of publication not identified] : Chapman and Hall/CRC, 2020

Copies

Description 1 online resource (x, 206 pages)
Series Chapman & Hall/CRC Data Science Series
Chapman & Hall/CRC data science series.
Contents Cover -- Half Title -- Title Page -- Copyright Page -- Dedication -- Table of Contents -- Preface -- 0.1 Assumptions About the Reader's Background -- 0.2 Book Overview -- Chapter 1 Introduction to R/Python Programming -- 1.1 Calculator -- 1.2 Variable and Type -- 1.3 Functions -- 1.4 Control Flows -- 1.5 Some Built-In Data Structures -- 1.6 Revisit of Variables -- 1.7 Object-Oriented Programming (OOP) In R/Python -- 1.8 Miscellaneous -- Chapter 2 More on R/Python Programming -- 2.1 Work with R/Python Scripts -- 2.2 Debugging in R/Python -- 2.3 Benchmarking -- 2.4 Vectorization
2.5 Embarrassingly Parallelism in R/Python -- 2.6 Evaluation Strategy -- 2.7 Speed Up With C/C++ in R/Python -- 2.8 A First Impression of Functional Programming -- 2.9 Miscellaneous -- Chapter 3 data. table and pandas -- 3.1 SQL -- 3.2 Get Started with data.table and Pandas -- 3.3 Indexing & Selecting Data -- 3.4 Add/Remove/Update -- 3.5 Group by -- 3.6 Join -- Chapter 4 Random Variables, Distributions & Linear Regression -- 4.1 A Refresher on Distributions -- 4.2 Inversion Sampling & Rejection Sampling -- 4.3 Joint Distribution & Copula -- 4.4 Fit a Distribution -- 4.5 Confidence Interval
4.6 Hypothesis Testing -- 4.7 Basics of Linear Regression -- 4.8 Ridge Regression -- Chapter 5 Optimization in Practice -- 5.1 Convexity -- 5.2 Gradient Descent -- 5.3 Root-Finding -- 5.4 General Purpose Minimization Tools in R/Python -- 5.5 Linear Programming -- 5.6 Miscellaneous -- Chapter 6 Machine Learning -- A gentle introduction -- 6.1 Supervised Learning -- 6.2 Gradient Boosting Machine -- 6.3 Unsupervised Learning -- 6.4 Reinforcement Learning -- 6.5 Deep Q-Networks -- 6.6 Computational Differentiation -- 6.7 Miscellaneous -- Bibliography -- Index
Summary A Tour of Data Science: Learn R and Python in Parallel covers the fundamentals of data science, including programming, statistics, optimization, and machine learning in a single short book. It does not cover everything, but rather, teaches the key concepts and topics in Data Science. It also covers two of the most popular programming languages used in Data Science, R and Python, in one source. Key features: Allows you to learn R and Python in parallel Cover statistics, programming, optimization and predictive modelling, and the popular data manipulation tools data.table and pandas Provides a concise and accessible presentation Includes machine learning algorithms implemented from scratch, linear regression, lasso, ridge, logistic regression, gradient boosting trees, etc. Appealing to data scientists, statisticians, quantitative analysts, and others who want to learn programming with R and Python from a data science perspective
Notes Nailong Zhang is lead Data Scientist at Mass Mutual Life Insurance Company
Vendor-supplied metadata
Subject Data mining.
R (Computer program language)
Python (Computer program language)
Data mining
Python (Computer program language)
R (Computer program language)
Form Electronic book
ISBN 9781003020646
100302064X
9781000215274
100021527X
9781000215236
1000215237
9781000215199
1000215199