Excel File Setup for Analysis

Excel gives you a lot of flexibility when creating files and starting projects, and we’re often asked what the “best” solution is for keeping things organized. This post will review what we recommend. As with everything in life, there may be a few exceptions where you’ll want to set up your file differently. However, this […]

100th Anniversary of Variance

This year (2018) is the 100th anniversary of a paper by R. A. Fisher, which introduced the statistical term “variance”. Variance is one of the toughest concepts in statistics, but it’s crucially important. Variance tells you how spread out your data are (yep, “are”; the word “data” is plural!). First, let’s get some terminology out of […]
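To make "how spread out your data are" concrete, here is a minimal sketch in Python. It assumes the population form of variance (the average squared distance from the mean). The function name `population_variance` and the two sample lists are made up for illustration and are not taken from the post.

```python
import statistics


def population_variance(values):
    """Average squared distance of each value from the mean."""
    mean = sum(values) / len(values)
    return sum((x - mean) ** 2 for x in values) / len(values)


# Two hypothetical datasets with the same mean (10) but different spread.
tight = [9, 10, 11]
wide = [0, 10, 20]

print(population_variance(tight))
print(population_variance(wide))

# The standard library's statistics.pvariance computes the same quantity.
print(statistics.pvariance(wide))
```

Both lists have the same mean, but the second has a much larger variance because its values sit farther from that mean.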

Free Datasets

A list of freely available data on the web. The first list is sites we think are the best for accessing quality datasets. Below that are additional sources by category.
Best Sources
Kaggle: By far our personal favorite! There are dozens if not hundreds of quality datasets available here.
ICPSR: You have to create an account, […]

Installing & Running Jupyter Notebook

Step 1: Install Anaconda
Go to this download webpage on Anaconda’s site. Choose the correct link for your operating system, and then go through the installation process.
Step 2: Prepare a folder for notebooks
Choose or create a folder on your computer where you will store all Jupyter notebook files. Make sure you choose a place […]

Git Resources

Resources for Learning Git
Atlassian’s GitFlow Page: This is a short tutorial article focused on typical Git sequences.
try.github.io: A list of Git resources, broken down by type.

Python Resources

A hand-curated list of resources for learning Python, practicing Python, and everything else Python!

Common Python Errors

This post lists common Python errors and how to handle them, including errors encountered during installation with Homebrew.

Formatting Charts in Excel

Formatting charts in Excel is no easy task. It’s time-consuming, and Excel is pretty fussy, which doesn’t make things easier. In this post I’ll give general tips for formatting charts, and also go over a few common scenarios.
Understand the Parts of a Chart
First things first: it’s important to get the syntax down pat […]

Grouping in PivotTables

Grouping in PivotTables is a way of combining data to perform analyses without having to use functions. You can group numeric columns to turn them into categories, you can group date columns by date ranges to get even intervals, and you can group text columns to put together similar values. We’ll go through all three […]

Tables & Linking Data Structures in Excel

Tables are one of the most important features of Excel, but they are often overlooked. Using tables and keeping your analyses connected will drastically increase your efficiency in Excel. Let’s start by understanding how they work with PivotTables. We’re going to use an R Dataset called DoctorContacts. Download the .csv file using this link (and save […]

