陈江平, 傅仲良, 徐志红. 一种Apriori的改进算法[J]. 武汉大学学报 ( 信息科学版), 2003, 28(1): 94-99.
引用本文: 陈江平, 傅仲良, 徐志红. 一种Apriori的改进算法[J]. 武汉大学学报 ( 信息科学版), 2003, 28(1): 94-99.
CHEN Jiangping, FU Zhongliang, XU Zhihong. An Improved Algorithm of Apriori[J]. Geomatics and Information Science of Wuhan University, 2003, 28(1): 94-99.
Citation: CHEN Jiangping, FU Zhongliang, XU Zhihong. An Improved Algorithm of Apriori[J]. Geomatics and Information Science of Wuhan University, 2003, 28(1): 94-99.

一种Apriori的改进算法

An Improved Algorithm of Apriori

  • 摘要: 提出了一种Apriori算法的改进算法,讨论了通过使用概率的方法估算任意数据项集同时出现的概率来求候选频繁项集的方法,并给出了相关算法。将本算法与Apriori算法产生的候选项集大小和扫描数据库次数进行了比较,该算法将关联规则挖掘的运行速度提高了一个数量级,非常适合挖掘数据库、长模式的关联规则。

     

    Abstract: Based on Apriori algorithm,an improved algorithm of Apriori is presented in this paper.The main ideas of the algorithm are:1) Count the probability of each item(I1,I2,…,Im) by scanning the DB for the first time;2) Item Ik and Im appear in one record is Pkm.min(Pk,Pm) ≤PkmPk*Pm,3) Count the support count of the frequence itemsets by scanning the DB another time;4) Output the association rule from the frequence itemsets.The detailed algorithm and its sample are described in this paper.Lastly we compare it with Apriori algorithm.The best quality of the new algorithm is that it reduce the times of scanning DB.

     

/

返回文章
返回