自适应动态演化粒子群算法在Web主题信息搜索中的应用
Application of Focused Crawler Using Adaptive Dynamical Evolutional Particle Swarm Optimization
-
摘要: 针对传统的基于单一价值评价的网络爬虫搜索策略存在的不足,提出了一种基于自适应动态演化粒子群(adaptive dynamical evolutional particle swarm optimization,ADEPSO)的启发式网络爬虫搜索算法。本算法综合立即价值和未来价值两种链接评价方法,并依据链接价值所反映的Web实际搜索情况动态调整两种价值的关系,使网络爬虫能更准确地预测页面的重要性。实验表明,该算法具有较高的搜索效率。Abstract: Aiming at the disadvatages of traditional topic crawler which uses monistic searching strategy,a new heuristic searching algorithm based on adaptive dynamical evolutionary PSO is proposed,which combines the advantage of linkage's immediate rewards and future rewards to valuate linkages together.The author utilizes the changes of rewards to speculate about how relevant the candidate page-set is to topics based on which the crawler can dynamically adjust the relationship between these two rewards.The experimental results show that this algorithm has better performance compared with traditional algorithms.