郝鹏宇等:Feature Selection of Time Series MODIS Data for Early Crop Classification Using Random Forest: A Case Study in Kansas, USA
被阅读 1347 次
2015-10-27
Feature Selection of Time Series MODIS Data for Early Crop Classification Using Random Forest: A Case Study in Kansas, USA
作者:Hao, PY (Hao, Pengyu)[ 1,2 ] ; Zhan, YL (Zhan, Yulin)[ 1 ] ; Wang, L (Wang, Li)[ 1 ] ; Niu, Z (Niu, Zheng)[ 1 ] ; Shakir, M (Shakir, Muhammad)[ 1 ]
REMOTE SENSING
卷: 7  期: 5  页: 5347-5369
DOI: 10.3390/rs70505347
出版年: MAY 2015
 
摘要
Currently, accurate information on crop area coverage is vital for food security and industry, and there is strong demand for timely crop mapping. In this study, we used MODIS time series data to investigate the effect of the time series length on crop mapping. Eight time series with different lengths (ranging from one month to eight months) were tested. For each time series, we first used the Random Forest (RF) algorithm to calculate the importance score for all features (including multi-spectral data, Normalized Difference Vegetation Index (NDVI), Normalized Difference Water Index (NDWI), and phenological metrics). Subsequently, an extension of the Jeffries-Matusita (JM) distance was used to measure class separability for each time series. Finally, the RF algorithm was used to classify crop types, and the classification accuracy and certainty were used to analyze the influence of the time series length and the number of features on classification performance; the features were added one by one based on their importance scores. Results indicated that when the time series was longer than five months, the top ten features remained stable. These features were mainly in July and August. In addition, the NDVI features contributed the majority of the most significant features for crop mapping. The NDWI and data from multi-spectral bands also contributed to improving crop mapping. On the other hand, separability, classification accuracy, and certainty increased with the number of features used and the time series length, although these values quickly reached saturation. Five months was the optimal time series length, as longer time series provided no further improvement in the classification performance. This result shows that relatively short time series have the potential to identify crops accurately, which allows for early crop mapping over large areas.
 
通讯作者地址: Zhan, YL (通讯作者)
Chinese Acad Sci, Inst Remote Sensing & Digital Earth, State Key Lab Remote Sensing Sci, Beijing 100101, Peoples R China.
地址:
[ 1 ] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, State Key Lab Remote Sensing Sci, Beijing 100101, Peoples R China
[ 2 ] Univ Chinese Acad Sci, Beijing 100049, Peoples R China