Objectives To explore the relationship between urban ecological environment and human activities is an important research content in the current urbanization process. With the in-depth development of the era of big data, multi-source data ubiquitous in the Internet has been fully excavated, which has played an important role in promoting the research of urban ecological environment.
Methods Based on multisource data, we propose to construct human activity indicators (residential area walkability index, street vitality index, urban function mixing index) using point of interest(POI), OpenStreetMap (OSM) and residential area data, and urban ecological environment indicator (remote sensing ecological index)using remote sensing images. Combing machine learning models such as polynomial regression(PLR), random forest regression(RFR), extreme gradient boosting regression(XGB) and support vector regression machine(SVR), it is effective to make regression analysis on urban ecological environment and human activity indicators. By comparing the performance of different models in this dataset, the relationship between the urban ecological environment and human activities is revealed.
Results We demonstrate the application of our method using a case study of Nanchang city. The results show that: (1) The three indexes of human activities all present a central high and gradually decrease to the surroundings, while the urban ecological environment indicators show an opposite trend. (2) From the analysis of the performance results of each model on the dataset, XGB has the best regression effect, followed by PLR.(3)There is a strong negative correlation between the urban ecological environment and human activities, and the street vitality index, the urban function mixing index are more relevant to the urban ecological environment, and the walkability index of the residential area is less relevant to the urban ecological environment. (4) In areas where human activities have less impact, the urban ecological environment will be disturbed by other factors, resulting in the low prediction accuracy, while the prediction accuracy in areas with strong human activities is high.
Conclusions Using multisource data and machine learning models can provide an important reference for exploring the relationship between urban ecological environment and human activities.