Time Series Analysis and Forecast of SRKWs Sightings
Report
The goal of this report was to compile sighting data of the Southern Resident Killer Whales in the Salish Sea, and to forecast future sightings. Raw data was compiled from the Whale Museums’ Hotline. A total of 16,722 southern resident observations were grouped by month from year 2008 to 2015. After appropriate transformation and differencing was applied, multiple SARIMA models were fit and diagnostics preformed to choose the best. This model was then used to forecast the next 6 months of sightings.
Analysis of Shark Tank Data
Report
This report uses 10 seasons of “Shark Tank” to examine the breakdown of pitches, how these changed over time,and how the famous millionaire Sharks spend their money. Data manipulation such as grouping and aggregating was done with Pandas and Numpy while visuals and regression was done with Altair.
Regression Analysis of Sleep in Mammalian Species
Report
This project investigates the link between brain/body weight, life span, gestation time, time spent sleeping/dreaming, and predation/danger indices for 62 species of mammals. The raw data was cleaned and relationships were explored in a scatterplot matrix. While attempting to determine which variables were significant in predicting time spent dreaming, high influence and leverage points were found and multiple models were fit before selecting the best with AIC. To predict total time spent dreaming step-wise regression was preformed.
Exploratory Analysis of San Jose Traffic Accidents
Report
This project explores traffic data analysis to investigate how external variables interact with traffic accidents and to identify a pattern or hotspot, within traffic accident distributions. This was done by examining traffic accident data from 2020 reported in San Jose, California.