ibm-nasa-geospatial
/

Prithvi-100M-multi-temporal-crop-classification

Crop Classification

Foundation model

Model card Files Files and versions Community

Prithvi-100M-multi-temporal-crop-classification / README.md

hanxLi's picture

Update README.md

8594822 over 1 year ago

|

3.07 kB

	---
	license: apache-2.0
	language:
	- en
	tags:
	- Pytorch
	- mmsegmentation
	- segmentation
	- Crop Classification
	- Multi Temporal
	- Geospatial
	- Foundation model
	datasets:
	- ibm-nasa-geospatial/multi-temporal-crop-classification
	metrics:
	- accuracy
	- IoU
	---
	### Model and Inputs
	The pretrained [Prithvi-100m](https://huggingface.co/ibm-nasa-geospatial/Prithvi-100M/blob/main/README.md) parameter model is finetuned to classify crop and other land cover types based off HLS data and CDL labels from the [multi_temporal_crop_classification dataset](https://huggingface.co/datasets/ibm-nasa-geospatial/multi-temporal-crop-classification).

	This dataset includes input chips of 224x224x18, where 224 is the height and width and 18 is combined with 6 bands of 3 time-steps. The bands are:

	1. Blue
	2. Green
	3. Red
	4. Narrow NIR
	5. SWIR 1
	6. SWIR 2

	Labels are from CDL(Crop Data Layer) and classified into 13 classes.

	![](multi_temporal_crop_classification.png)

	The Prithvi-100m model was initially pretrained using a sequence length of 3 timesteps. For this task, we leverage the capacity for multi-temporal data input, which has been integrated from the foundational pretrained model. This adaptation allows us to achieve more generalized finetuning outcomes.

	### Code
	Code for Finetuning is available through [github](https://github.com/NASA-IMPACT/hls-foundation-os/tree/main/fine-tuning-examples)

	Configuration used for finetuning is available through [config](https://github.com/NASA-IMPACT/hls-foundation-os/blob/main/fine-tuning-examples/configs/multi_temporal_crop_classification.py).

	### Results
	The experiment by running the mmseg stack for 80 epochs using the above config led to the following result:

	\| Classes \| IoU\| Acc\|
	\|:------------------:\|:------:\|:------:\|
	\| Natural Vegetation \| 0.3362 \| 39.06% \|
	\| Forest \| 0.4362 \| 65.88% \|
	\| Corn \| 0.4574 \| 54.53% \|
	\| Soybeans \| 0.4682 \| 62.25% \|
	\| Wetlands \| 0.3246 \| 45.62% \|
	\| Developed/Barren \| 0.3077 \| 49.1% \|
	\| Open Water \| 0.6181 \| 90.04% \|
	\| Winter Wheat \| 0.4497 \| 66.75% \|
	\| Alfalfa \| 0.2518 \| 63.97% \|
	\|Fallow/Idle Cropland\| 0.328 \| 54.46% \|
	\| Cotton \| 0.2679 \| 66.37% \|
	\| Sorghum \| 0.2741 \| 75.91% \|
	\| Other \| 0.2803 \| 39.76% \|

	\|aAcc\|mIoU\|mAcc\|
	\|:------:\|:------:\|:------:\|
	\| 54.32% \| 0.3692 \| 59.51% \|

	It is important to acknowledge that the CDL (Crop Data Layer) labels employed in this process are known to contain noise and are not entirely precise, thereby influencing the model's performance. Fine-tuning the model with more accurate labels is expected to further enhance its overall effectiveness, leading to improved results.


	### Inference
	There is an inference script that allows to run the hls-cdl crop classification model for inference on HLS images. These input have to be geotiff format, including 18 bands for 3 time-step, and each time-step includes the channels described above (Blue, Green, Red, Narrow NIR, SWIR, SWIR 2) in order.