With the initial dataset now complete , the team commenced data exploration, looking to understand how the story came together. To inform perspective, the team reviewed the distribution of all numeric variables, as an aggregate and overtime, breaking the review into 3 distinct populations, the entire dataset, when arbitrage was present and when arbitrage was not present. This approach was taken as it helped to provide insights both into how the variable was distributed (A significant number of histograms provided minimal insight as a result of outliers, which prompted consideration, development and review of extension to differing representations, such as Log, Change, Percentages. While some of these items have been explored, many remain outstanding and provide an opportunity for continued discussion and consideration, we welcome thoughts and encourage questions (Link to Github).