Sitemap

A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.

Pages

Posts

Data Chats Podcast

2 minute read

A while back, I did a podcast with Chris Richardson of Pragmatic Institute, a California-based data science education institute; it is out now. Data Science ...

Selection bias – a war story

2 minute read

During the Second World War, in a secret Manhattan building, statisticians and mathematicians were recruited from across the U.S.A. to carry out data analysi...

Some history of sampling

4 minute read

Statistical enumerations of land, people, and property have taken place in many of the better organized empires and states since Babylonian times. These all ...

Variants of random sampling

1 minute read

There are many variations on random sampling that aim to further improve representation or reduce the costs of data collection.

casestudies

Ch02

Finding a good deal among hotels: data preparation

chapters

Part I: DATA EXPLORATION

Chapter 01: Origins of Data This chapter is about data collection and data quality. The chapter starts by introducing key concepts of data. It then describes...

Part II: REGRESSION ANALYSIS

Chapter 07: Simple Regression In this chapter, we introduce simple nonparametric regression and simple linear regression. We discuss nonparametric regressio...

Part III: PREDICTION

Chapter 13: A Framework for Prediction This chapter introduces a framework for prediction. We discuss the distinction between various types of prediction, s...

Part IV: CAUSAL ANALYSIS

Chapter 19: A Framework for Causal Analysis This chapter introduces a framework for causal analysis. The chapter starts by introducing the potential outcomes...

content

datasets

README: cps-earnings dataset

This is a README file for the cps-earnings dataset. Used in the case studies 9A Estimating gender and age differences in earnings and 10A Understanding the...

README: hotels-europe dataset

This is a README file for the hotels-europe dataset that includes information on price and features of hotels in 46 European cities and for 10 different dat...

README: hotels-vienna dataset

This is a README file for the hotels-vienna dataset that includes information on price and features of hotels in Vienna for one date. Used in case studies ...