Skip to the content.

Carbon emission Analysis & Prediction

A project by the AIGLass Team - Explore Data Science Academy Internship Project

Project Developers

Project Description

Since the discoveries around the effects of increased carbon emissions on the climate, multiple initiatives have been developed to attempt to curb the issue.

This machine learning project attempts to offer a data-driven solution for the same agenda. Moreover, the project intends to offer predictive capabilities utilizing data analytics & machine learning models.

This is only intended as an overview of the work that was carried out. Find project links at the bottom for the accompanying Research paper, Jupyter Notebooks, Presentation Slides & Application Deployment

Expectations

The project specifically aims at complementing the implementation of carbon credits in reducing carbon emissions.

The general idea of carbon credits is that organizations, institutions & countries are able to purchase in carbon credits as a form of offsetting their overall carbon emissions. These credits are tied to projects that actively remove carbon from the environment, such as building renewable energy plants, planting trees, protecting forest reserves.

Where does this project lie in the carbon credit transactions? The solution enables carbon credit purchasers to develop emission predictions. This offers a baseline insight to complement long-term planning of the purchaser’s expenses & ton organizing prospective projects. The predictions may have a range in error, but this is only due to improve with time & reconsderation of affecting factors.

Data sources

The data utilized in the project are certified under GPL.

Data description/breakdown


Project Strategy

Data analysis

The bulk of the work entailed deriving analytical benefits from the data that was sourced, & understanding how the final model would be affected.

Some of the features collected showed little benefit to the,

Machine Learning model

The final model that was utilized in the web application was the XGBoost, with an RMSE of 15.4. This model performed best in comparison to linear & tree based algorithms that were trained: (decision-trees, RandomForest, Ridge & Lasso Regression).

Future considerations shall be placed on neural networks. However, it is anticipated that the con of runtime for the model will outweigh the benefit of improved model predictions.

Conclusion

The project successfully highlighted the prominent features that are causal to carbon emission according to the dataset that was utilised. The model’s error range is feasible for the purpose of the solution, but there are opportunities to improve it.

Heightened carbon emissions on earth is asuredly a future generation’s problem, and some of the effects already being experienced currently via climate change. The best that our society can offer is by utilising the tools that we have to progress forward; hoping that we shall finally find the silver bullet that will end the worry. Until then, we continue the toil.

Web Application hosted on Heroku Platform.

Note: The application may take some time to load if at all it has not been accessed for some time. Give it some time to ‘wake’, else reload the browser tab

Final Presentation to the EDSA Internship supervisors

Research Paper **to be availed soon

Jupyter Notebooks

Book1: Data Cleaning & Preparation

Book2: Data Analysis & Feature Engineering

Book3: Machine Learning Model development

References & supporting material