Fuel Economy Data

Fuel Economy data is collected from vehicle testing by the United States Environmental Protection Agency (EPA). The EPA provides this data to the US government each year, which the Department of Energy (DOE) uses to publish their yearly fuel economy guide. This data is publicly available and can be found in the link below.

The Project

Data for 2008 and 2018 are downloaded and analysed in a Jupyter notebook running on a Python kernel.

What We Learned

  • Using the drop method to keep columns of relevance

  • Using the rename method to give columns suitable names

  • Using strip, lower, and replace methods to manipulate string objects

  • Using regular expressions to search for and extract integers from string objects

  • Programmatically checking for consistency across dataframes

  • Searching for missing values and duplicated data

  • Practice using the astype method to convert data types

  • Applying a function over each column in a data frame using the apply method in conjunction with a lambda function

  • Appending data frames together

The Code and the Report

References

Previous
Previous

Wine Quality Dataset

Next
Next

Gapminder World Data