AnswerBun.com

public Normally-distributed data for teaching intro Stats

Open Data Asked by philophilosophia on September 29, 2021

Background:
I’m teaching an intro course to stats, and this term, I have decided to use real-world public data sets to demonstrate the methods on, instead of synthetic data. I was surprised that I wouldn’t find basic data such as height/weight/IQ of men and women (which are famously well-approximated by Gaussian). I do find parameters (mean/variance of weight of Americans, for example), but I don’t want to synthesize a Gaussian based on parameters. Rather, I’m looking for actual data, so the students experience the noisy-ness of real data, and how approximations work. I have the same problem for finding non-Normal data, e.g., wealth distribution and other heavy-tailed ones. Parameters exist but I cannot find actual data sets.

TLDR:
For an introductory Stats course, I’m looking for publicly available data sets with medium-size sample sizes, i.e., $N=O(10^3)$ or $O(10^4)$. Preferably, with close-to-Gaussian distributions, but anything is useful.

One Answer

You can find the best publicly available datasets on Kaggle with kernels/notebooks for references. This is the best place to find the relevant data for your teaching. Need to signup to download the datasets

Answered by Pluviophile on September 29, 2021

Add your own answers!

Related Questions

Dataset of international football games

1  Asked on September 29, 2021 by ziil

   

Order receipt of bank transactions

1  Asked on September 29, 2021 by tlatwork

     

Mobile Home Parks

1  Asked on September 29, 2021 by user23476

     

public Normally-distributed data for teaching intro Stats

1  Asked on September 29, 2021 by philophilosophia

   

Looking for weekly death data by State/County/City

1  Asked on September 29, 2021 by skyler

       

Telecom Services Database

0  Asked on September 29, 2021 by nikitha-reddy

     

Number of Hospitals in the US with emergency departments

1  Asked on September 29, 2021 by terry-zeigler

     

Twitter dataset to train word embeddings

1  Asked on September 29, 2021 by mugdha-pandya

   

Where can i find project management system data?

1  Asked on September 29, 2021 by jess-jess

   

Text Dataset for Entity Recognition of personal data

1  Asked on September 29, 2021 by j-ruthwik-reddy

       

UN Comtrade Database: Trade value when no quantity reported

0  Asked on September 29, 2021 by quicks

   

Shapefiles for Paraguay

1  Asked on September 29, 2021 by jack-zaki-zakiul-fahmi-jailani

   

COVID-19 Case Line Data Sources for US States

0  Asked on September 29, 2021 by zviovich

   

Ask a Question

Get help from others!

© 2022 AnswerBun.com. All rights reserved. Sites we Love: PCI Database, MenuIva, UKBizDB, Menu Kuliner, Sharing RPP