Data Science

MOSAIKS for creating Climate Change Models

In this post, you will learn about the framework, MOSAIKS (Multi-Task Observation using Satellite Imagery & Kitchen Sinks) which can be used to create machine learning linear regression models for climate change. Here is the list of few prediction use cases which has already been tested with MOSAIKS and found to have high model performance:

  • Forest cover
  • Elevation
  • Population density
  • Nighttime lights
  • Income
  • Road length
  • Housing price
  • Crop yields
  • Poverty mapping

What is MOSAIKS?

MOSAIKS provides a set of features created from Satellite imagery dataset. We are talking about 90TB of data gathered per day from 700+ satellites. These features can be combined with machine learning algorithms to address global challenges by remotely estimating socioeconomic and environmental conditions in data-poor regions. Combining satellite imagery with machine learning is also termed as SIML approach. 

The set of features generated using MOSAIKS can be merged spatially with the labels. Thereafter, you can run a linear regression of your labels on the MOSAIKS features, measure performance and use the model for making predictions in your area of interest.

The algorithmic component of the MOSAIKS system is built upon the random convolutional features (RCF) algorithm. 

The MOSAIKS features facilitates a generalizable and accessible approach to machine learning with global satellite imagery.  The picture below represents how MOSAIKS can be used to perform different predictions tasks related to solving problems in the areas of socioeconomic and environmental issues.

Fig 1. MOSAIKS used to solve different socioeconomic and environmental problems

Pay attention to some of the following in above picture:

  • K-dimensional features set is created by drawing a fixed sample of K patches from the satellite imagery set, convolving the patches across each image and passing the resultant data set to nonlinear activation function. The output from K nonlinear activation maps is K-dimensional features set.
  • The features set is merged with users provided labels. For different problem statements, you can come up with different labels. The picture below represents aspect of merging unsupervised features with labels and applying regression algorithm.

    Fig 2. MOSAIKS – Merging unsupervised features with labels

  • Create training and test split of labelled data set.
  • Training regression models such as ridge regression, evaluate the performance. The manner it is different from training a convolutional neural network (CNN) is that CNN learn domain image specific features.
  • The trained model can be used for making predictions.

MOSAIKS is tested to achieve comparable performance with respect to a fine-tuned ResNet-18 at a fraction of the computational cost. The picture below shows the comparison of MOSAIKS trained regression models against ResNet-18 and pre-trained CNN.

Fig 3. MOSAIKS models performance and computation comparison with Resnet-18 and pre-trained CNN models

Why MOSAIKS?

There is a limited access to skills, data, compute and resources in relation to understanding and processing data gathered from satellites in form satellite imagery. Transforming the satellite imagery data into relevant statistics is costly and requires skills which may not be available with many. This is where MOSAIKS come as a boon. It converts the satellite imagery data to K-dimensional features set which can be merged with user-chosen labels and used to train the regression models for making predictions. 

How to use MOSAIKS for solving problems related to Climate Change?

As mentioned above, MOSAIKS provides the K-dimensional features set which can be used to solve training regression models related to different problems related to social-economic and environmental factors. Here are the steps which can be used to solve problems related to climate change:

  • First and foremost, identify the problem. The best way to identify problem is to ask questions related with current climate change problems. For example, how can I reverse the deforestation?
  • Download MOSAIKS features from our API for the areas where you have labels

  • Merge the features spatially with your labels.

  • Run a regression of your labels on the MOSAIKS features

  • Evaluate performance

  • Make predictions  

Resources

Here are some important pages which can help you get started:

Here is a great youtube video on MOSAIKS

Ajitesh Kumar

I have been recently working in the area of Data analytics including Data Science and Machine Learning / Deep Learning. I am also passionate about different technologies including programming languages such as Java/JEE, Javascript, Python, R, Julia, etc, and technologies such as Blockchain, mobile computing, cloud-native technologies, application security, cloud computing platforms, big data, etc. I would love to connect with you on Linkedin. Check out my latest book titled as First Principles Thinking: Building winning products using first principles thinking.

Recent Posts

Agentic Reasoning Design Patterns in AI: Examples

In recent years, artificial intelligence (AI) has evolved to include more sophisticated and capable agents,…

1 month ago

LLMs for Adaptive Learning & Personalized Education

Adaptive learning helps in tailoring learning experiences to fit the unique needs of each student.…

1 month ago

Sparse Mixture of Experts (MoE) Models: Examples

With the increasing demand for more powerful machine learning (ML) systems that can handle diverse…

2 months ago

Anxiety Disorder Detection & Machine Learning Techniques

Anxiety is a common mental health condition that affects millions of people around the world.…

2 months ago

Confounder Features & Machine Learning Models: Examples

In machine learning, confounder features or variables can significantly affect the accuracy and validity of…

2 months ago

Credit Card Fraud Detection & Machine Learning

Last updated: 26 Sept, 2024 Credit card fraud detection is a major concern for credit…

2 months ago