Data Analysis, Data Science, Data Understanding
Course Notes for uOttawa’s Professional Data Science Certificate
2021-12-16
Welcome
This website contains draft chapters for Data Analysis, Data Science, Data Understanding, the course notes used for uOttawa’s Professional Data Science Certificate.
Colophone
This website is built using bookdown
package
and hosted with Netlify.
For more information on how to use bookdown
see bookdown.org.
The complete source of this website is available on
You are encouraged to report any errors or to make suggestions
via the issues
function on the github page.
This online version of the book was built with:
License
This work by Patrick Boily and Chunyun Ma is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Below is a human-readable summary of (and not a substitute for) the license. Please see https://creativecommons.org/licenses/by-sa/4.0/legalcode for the full legal text.
You are free to:
Share—copy and redistribute the material in any medium or format
Remix—remix, transform, and build upon the material for any purpose, even commercially.
The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
Attribution—You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
ShareAlike—If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
No additional restrictions—You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation.
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.
Contributors
A reference manual of this size could not have been compiled without the help of a multitude of individuals over the years:
Oliver Benning contributed to Reinforcement Learning;
Kevin Cheung contributed to A Survey of Optimization, An Introduction to Deep Learning, and Data Science with Streams;
Youssouph Cissokho contributed to Anomaly Detection and Outlier Analysis;
Stephen Davies influenced Data Visualization;
Soufiane Fadel contributed to Anomaly Detection and Outlier Analysis and Reinforcement Learning;
Patrick Farrell influenced Survey Sampling Methods;
Ehssan Ghashim contributed to Programming Fundamentals, Data Visualization, and Bayesian Data Analysis;
Shintaro Hagiwara contributed to Introductory Statistical Analysis;
Lani Haque contributed to Web Scraping and Automated Data Collection and Network Data Analysis;
Rafal Kulik influenced Overview of Probability Theory, Introductory Statistical Analysis, A Linear Regression Cheatsheet, and A Primer of Times Series and Forecasting;
Gilles Lamothe influenced Overview of Probability Theory, Introductory Statistical Analysis, and A Linear Regression Cheatsheet;
Oliver Leduc contributed to Reression and Value Estimation, Spotlight on Clustering, and Feature Selection and Dimension Reduction;
Dong (Elle) Liu contributed to A Primer of Times Series and Forecasting;
Andrew Macfie contributed to Web Scraping and Automated Data Collection, Feature Selection and Dimension Reduction, Natural Language Processing, and Big Data and Parallel Computing;
Aditya Maheshwari contributed to Spotlight on Clustering and Feature Selection and Dimension Reduction;
Richard Millson contributed to Anomaly Detection and Outlier Analysis and Network Data Analysis;
Rachel Ostic contributed to Reinforcement Learning;
Kate Park contributed to Network Data Analysis;
Smit Patel contributed to An Introduction to Deep Learning;
Maia Pelletier contributed to Data Visualization and Feature Selection and Dimension Reduction;
Razieh Pourhasan contributed to Anomaly Detection and Outlier Analysis and An Introduction to Deep Learning;
Bronwyn Rayfield contributed to Spotlight on Classification;
Mohsen Rezapour Tougari contributed to Recommender Engines;
Jen Schellinck contributed to Programming Fundamentals, Introduction to Simulations, Non-Technical Aspects of Quantitative Work, Data Science Basics, Machine Learning 101, Spotlight on Clustering;
Bing Wang contributed to Data Science with Streams.
A hearty “thank you” to everyone, and to all others who have crossed our paths on this data adventure!