Fuel Economy Data
Fuel economy data is collected from vehicle testing by the United States Environmental Protection Agency (EPA). Each year the EPA provides this data to the US government, and the Department of Energy (DOE) uses it to publish its annual fuel economy guide. The data is publicly available at the link in the References section.
The Project
Data for 2008 and 2018 are downloaded and analysed in a Jupyter notebook running on a Python kernel.
What We Learned
Using the drop method to keep columns of relevance
Using the rename method to give columns suitable names
Using strip, lower, and replace methods to manipulate string objects
Using regular expressions to search for and extract integers from string objects
Programmatically checking for consistency across dataframes
Searching for missing values and duplicated data
Using the astype method to convert data types
Applying a function to each column of a dataframe using the apply method with a lambda function
Appending dataframes together
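The steps listed above can be sketched on a toy example. This is a minimal illustration, not the notebook's actual code: the column names (such as "Sales Area", "Cert Region" and "Notes") and every value in the two small tables are invented stand-ins, and the real EPA files may be laid out differently. It also uses pd.concat to stack the tables, since DataFrame.append was removed in pandas 2.0.

```python
import pandas as pd

# Hypothetical miniature stand-ins for the 2008 and 2018 tables.
# All column names and values here are invented for illustration.
df_08 = pd.DataFrame({
    "Model": ["Model A", "Model B", "Model B", "Model C"],
    "Cyl": ["(6 cyl)", "(4 cyl)", "(4 cyl)", None],
    "Sales Area": ["CA", "FA", "FA", "CA"],
    "City MPG": ["15", "17", "17", "21"],
    "Cmb MPG": ["17", "19", "19", "24"],
    "Notes": ["", "", "", ""],
})
df_18 = pd.DataFrame({
    "Model": ["Model B", "Model D"],
    "Cyl": [4.0, 4.0],
    "Cert Region": ["CA", "FA"],
    "City MPG": ["19", "24"],
    "Cmb MPG": ["22", "27"],
    "Notes": ["", ""],
})

# drop: keep only the columns of relevance.
df_08 = df_08.drop(columns=["Notes"])
df_18 = df_18.drop(columns=["Notes"])

# rename: give the 2008 column the same name as its 2018 counterpart.
df_08 = df_08.rename(columns={"Sales Area": "Cert Region"})


def clean_label(label):
    """Normalise a column label with the str methods strip, lower and replace."""
    return label.strip().lower().replace(" ", "_")


df_08 = df_08.rename(columns=clean_label)
df_18 = df_18.rename(columns=clean_label)

# Programmatic consistency check: both tables must share the same columns.
columns_match = list(df_08.columns) == list(df_18.columns)

# Count missing values and duplicated rows, then remove them.
n_missing = int(df_08.isnull().sum().sum())
n_duplicates = int(df_08.duplicated().sum())
df_08 = df_08.dropna().drop_duplicates()

# Regular expression: pull the integer out of strings such as "(6 cyl)".
df_08["cyl"] = df_08["cyl"].str.extract(r"(\d+)", expand=False).astype(int)

# astype: the 2018 cylinder counts are floats, so convert them to integers.
df_18["cyl"] = df_18["cyl"].astype(int)

# apply + lambda: convert each MPG column from string to float.
mpg_cols = ["city_mpg", "cmb_mpg"]
df_08[mpg_cols] = df_08[mpg_cols].apply(lambda col: col.astype(float))
df_18[mpg_cols] = df_18[mpg_cols].apply(lambda col: col.astype(float))

# Tag each table with its model year and stack them into one dataframe.
df_08["year"] = 2008
df_18["year"] = 2018
combined = pd.concat([df_08, df_18], ignore_index=True)
```

Running the consistency check before combining means a mismatch in column names surfaces early, rather than showing up later as unexpected NaN-filled columns in the stacked result.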
The Code and the Report
GitHub repository for the data
The report, written as a Jupyter notebook, can be found here
References
EPA Fuel Economy Data Set: https://www.epa.gov/compliance-and-fuel-economy-data/data-cars-used-testing-fuel-economy