FP-Tree算法的实现（Java版）

在关联规则挖掘领域最经典的算法法是Apriori，其致命的缺点是需要多次扫描事务数据库。于是人们提出了各种裁剪（prune）数据集的方法以减少I/O开支，韩嘉炜老师的FP-Tree算法就是其中非常高效的一种。

我们举个例子来详细讲解FP-Tree算法的完整实现。

事务数据库如下，一行表示一条购物记录：

牛奶，鸡蛋，面包，薯片

鸡蛋，爆米花，薯片，啤酒

鸡蛋，面包，薯片

牛奶，鸡蛋，面包，爆米花，薯片，啤酒

牛奶，面包，啤酒

鸡蛋，面包，啤酒

牛奶，面包，薯片

牛奶，鸡蛋，面包，黄油，薯片

牛奶，鸡蛋，黄油，薯片

我们的目的是要找出哪些商品总是相伴出现的，比如人们买薯片的时候通常也会买鸡蛋，则[薯片，鸡蛋]就是一条频繁模式（frequent pattern）。

FP-Tree算法第一步：扫描事务数据库，每项商品按频数递减排序，并删除频数小于最小支持度MinSup的商品。（第一次扫描数据库）

薯片:7 鸡蛋:7 面包:7 牛奶:6 啤酒:4 （这里我们令MinSup=3）

以上结果就是频繁1项集，记为F1。

第二步：对于每一条购买记录，按照F1中的顺序重新排序。（第二次也是最后一次扫描数据库）

薯片,鸡蛋,面包,牛奶

薯片,鸡蛋,啤酒

薯片,鸡蛋,面包

薯片,鸡蛋,面包,牛奶,啤酒

面包,牛奶,啤酒

鸡蛋,面包,啤酒

薯片,面包,牛奶

薯片,鸡蛋,面包,牛奶

薯片,鸡蛋,牛奶

第三步：把第二步得到的各条记录插入到FP-Tree中。

插入每一条（薯片,鸡蛋,面包,牛奶）之后

FP-Tree算法的实现（Java版）

插入第二条记录（薯片,鸡蛋,啤酒）

FP-Tree算法的实现（Java版）

插入第三条记录（面包,牛奶,啤酒）

FP-Tree算法的实现（Java版）

估计你也知道怎么插了，最终生成的FP-Tree是：

FP-Tree算法的实现（Java版）

树中相同名称的节点要链接起来，后面的算法要用到。

第四步：从FP-Tree中找出频繁项。

遍历F1中的每一项（我们拿“牛奶：6”为例），对于各项都执行以下（1）到（5）的操作：

（1）从FP-Tree中找到所有的“牛奶”节点，向上遍历它的祖先节点，得到4条路径：

薯片：7，鸡蛋：6，牛奶：1

薯片：7，鸡蛋：6，面包：4，牛奶：3

薯片：7，面包：1，牛奶：1

面包：1，牛奶：1

对于每一条路径上的节点，其count都设置为牛奶的count

薯片：1，鸡蛋：1，牛奶：1

薯片：3，鸡蛋：3，面包：3，牛奶：3

薯片：1，面包：1，牛奶：1

面包：1，牛奶：1

因为每一项末尾都是牛奶，可以把牛奶去掉，得到条件模式基（Conditional Pattern Base,CPB）

薯片：1，鸡蛋：1

薯片：3，鸡蛋：3，面包：3

薯片：1，面包：1

面包：1

（2）我们把上面的结果当作原始的事务数据库，对于进行每一步和第二步的处理，得到条件FP-Tree

FP-Tree算法的实现（Java版）

（3）从树中找到所有的长路径

（薯片：4，面包：4，鸡蛋：3）

（薯片：1，鸡蛋：1）

（面包：1）

（4）对于（3）中的每一条路径找出所有的组合方式

第一条：（薯片：4）（面包：4）（鸡蛋：3）（薯片：3，鸡蛋：3）（面包：3，鸡蛋：3）（薯片：4，面包：4）（薯片：3，面包：3，鸡蛋：3）

第二条：（薯片：1）（鸡蛋：1）（薯片：1，鸡蛋：1）

第三条：（面包：1）

每一个组合中的count要一致，都取最小的那一项。

然后把三条得到的组合合并到一起，合并的方法是：对于序列相同的组合，其count相加。比如第一条中的（面包：4）和第三条中的（面包：1）合并后成为（面包：5）,而第一条中的（薯片：3，鸡蛋：3）和第二条中的（薯片：1，鸡蛋：1）合并后成为（薯片：4，鸡蛋：4）。最后删除count小于MinSup的组合。只剩下：

面包:4 薯片:4

薯片:5

鸡蛋: 4

鸡蛋: 3 面包:3

鸡蛋: 4 薯片:4

鸡蛋: 3 面包:3 薯片: 3

面包: 5

（5）与“牛奶”合并，得到频繁项集

面包薯片牛奶 4

薯片牛奶 5

鸡蛋牛奶 4

鸡蛋面包牛奶 3

鸡蛋薯片牛奶 4

鸡蛋面包薯片牛奶 3

面包牛奶 5

源代码实现：

FP树节点定义

001

package fptree;

002

003 import java.util.ArrayList;

004 import java.util.List;

005

006 public class TreeNode implements Comparable<TreeNode> {

007

008 private String name; // 节点名称

009 private int count; // 计数

010 private TreeNode parent; // 父节点

011 private List<TreeNode> children; // 子节点

012 private TreeNode nextHomonym; // 下一个同名节点

013

014 public TreeNode() {

015

016 }

017

018 public TreeNode(String name) {

019 this.name = name;

020 }

021

022 public String getName() {

023 return name;

024 }

025

026 public void setName(String name) {

027 this.name = name;

028 }

029

030 public int getCount() {

031 return count;

032 }

033

034 public void setCount(int count) {

035 this.count = count;

036 }

037

038 public TreeNode getParent() {

039 return parent;

040 }

041

042 public void setParent(TreeNode parent) {

043 this.parent = parent;

044 }

045

046 public List<TreeNode> getChildren() {

047 return children;

048 }

049

050 public void addChild(TreeNode child) {

051 if (this.getChildren() == null) {

052 List<TreeNode> list = new ArrayList<TreeNode>();

053 list.add(child);

054 this.setChildren(list);

055 } else {

056 this.getChildren().add(child);

057 }

058 }

059

060 public TreeNode findChild(String name) {

061 List<TreeNode> children = this.getChildren();

062 if (children != null) {

063 for (TreeNode child : children) {

064 if (child.getName().equals(name)) {

065 return child;

066 }

067 }

068 }

069 return null;

070 }

071

072 public void setChildren(List<TreeNode> children) {

073 this.children = children;

074 }

075

076 public void printChildrenName() {

077 List<TreeNode> children = this.getChildren();

078 if (children != null) {

079 for (TreeNode child : children) {

080 System.out.print(child.getName() + " ");

081 }

082 } else {

083 System.out.print("null");

084 }

085 }

086

087 public TreeNode getNextHomonym() {

088 return nextHomonym;

089 }

090

091 public void setNextHomonym(TreeNode nextHomonym) {

092 this.nextHomonym = nextHomonym;

093 }

094

095 public void countIncrement(int n) {

096 this.count += n;

097 }

098

099 @Override

100 public int compareTo(TreeNode arg0) {

101 // TODO Auto-generated method stub

102 int count0 = arg0.getCount();

103 // 跟默认的比较大小相反，导致调用Arrays.sort()时是按降序排列

104 return count0 - this.count;

105 }

106 }

挖掘频繁模式


package fptree; 
 

   

import java.io.BufferedReader; 
 

import java.io.File; 
 

import java.io.FileReader; 
 

import java.io.IOException; 
 

import java.util.ArrayList; 
 

import java.util.Collections; 
 

import java.util.Comparator; 
 

import java.util.HashMap; 
 

import java.util.Iterator; 
 

import java.util.LinkedList; 
 

import java.util.List; 
 

import java.util.Map; 
 

import java.util.Map.Entry; 
 

import java.util.Set; 
 

   

public class FPTree { 
 

   

    private int minSup; // 最小支持度 
 

   

    public int getMinSup() { 
 

        return minSup; 
 
    }  


   

    public void setMinSup(int minSup) { 
 
        this.minSup = minSup;  

    }  


   
    /**  

     * 1.读入事务记录  

     *   

     * @param filenames  

     * @return  

     */ 


    public List<List<String>> readTransData(String... filenames) { 
 

        List<List<String>> records = new LinkedList<List<String>>(); 
 
        List<String> record;  

        // 从文件读入  


        if (filenames.length > 0) { 
 

            for (String filename : filenames) { 
 

                try { 
 

                    FileReader fr = new FileReader(new File(filename)); 
 

                    BufferedReader br = new BufferedReader(fr); 
 
                    String line = null;  


                    while ((line = br.readLine()) != null) { 
 

                        if (line.trim() != "") { 
 

                            record = new LinkedList<String>(); 
 
                            String[] items = line.split("[，|,]");  


                            for (String item : items) { 
 
                                record.add(item);  

                            }  

                            records.add(record);  

                        }  

                    }  


                } catch (IOException e) { 
 
                    System.out.println("读取事务数据库失败。");  

                    System.exit(-2);  

                }  

            }  

        }  

        // 直接在代码里指定  


        else { 
 

            record = new LinkedList<String>(); 
 

            String[] trans = new String[] { "f", "a", "c", "d", "g", "i", "m", 
 

                    "p" }; 
 

            for (String t : trans) 
 
                record.add(t);  

            records.add(record);  


            record = new LinkedList<String>(); 
 

            trans = new String[] { "a", "b", "c", "f", "l", "m", "o" }; 
 

            for (String t : trans) 
 
                record.add(t);  

            records.add(record);  


            record = new LinkedList<String>(); 
 

            trans = new String[] { "b", "f", "h", "j", "o" }; 
 

            for (String t : trans) 
 
                record.add(t);  

            records.add(record);  


            record = new LinkedList<String>(); 
 

            trans = new String[] { "b", "c", "k", "s", "p" }; 
 

            for (String t : trans) 
 
                record.add(t);  

            records.add(record);  


            record = new LinkedList<String>(); 
 

            trans = new String[] { "a", "f", "c", "e", "l", "p", "m", "n" }; 
 

            for (String t : trans) 
 
                record.add(t);  

            records.add(record);  

        }  


        return records; 
 
    }  


   
    /**  

     * 2.构造频繁1项集  

     *   

     * @param transRecords  

     * @return  

     */ 


    public ArrayList<TreeNode> buildF1Items(List<List<String>> transRecords) { 
 
        ArrayList<TreeNode> F1 = null;  


        if (transRecords.size() > 0) { 
 

            F1 = new ArrayList<TreeNode>(); 
 

            Map<String, TreeNode> map = new HashMap<String, TreeNode>(); 
 
            // 计算事务数据库中各项的支持度  


            for (List<String> record : transRecords) { 
 

                for (String item : record) { 
 

                    if (!map.keySet().contains(item)) { 
 

                        TreeNode node = new TreeNode(item); 
 
                        node.setCount(1);  

                        map.put(item, node);  


                    } else { 
 
                        map.get(item).countIncrement(1);  

                    }  

                }  

            }  

            // 把支持度大于（或等于）minSup的项加入到F1中  

            Set<String> names = map.keySet();  


            for (String name : names) { 
 
                TreeNode tnode = map.get(name);  


                if (tnode.getCount() >= minSup) { 
 
                    F1.add(tnode);  

                }  

            }  

            Collections.sort(F1);  


            return F1; 
 

        } else { 
 

            return null; 
 
        }  

    }  


   
    /**  

     * 3.建立FP-Tree  

     *   

     * @param transRecords  

     * @param F1  

     * @return  

     */ 


    public TreeNode buildFPTree(List<List<String>> transRecords, 
 
            ArrayList<TreeNode> F1) {  


        TreeNode root = new TreeNode(); // 创建树的根节点 
 

        for (List<String> transRecord : transRecords) { 
 
            LinkedList<String> record = sortByF1(transRecord, F1);  

            TreeNode subTreeRoot = root;  

            TreeNode tmpRoot = null;  


            if (root.getChildren() != null) { 
 

                while (!record.isEmpty() 
 
                        && (tmpRoot = subTreeRoot.findChild(record.peek())) != null) {  

                    tmpRoot.countIncrement(1);  

                    subTreeRoot = tmpRoot;  

                    record.poll();  

                }  

            }  

            addNodes(subTreeRoot, record, F1);  

        }  


        return root; 
 
    }  


   
    /**  

     * 3.1把事务数据库中的一条记录按照F1（频繁1项集）中的顺序排序  

     *   

     * @param transRecord  

     * @param F1  

     * @return  

     */ 


    public LinkedList<String> sortByF1(List<String> transRecord, 
 
            ArrayList<TreeNode> F1) {  


        Map<String, Integer> map = new HashMap<String, Integer>(); 
 

        for (String item : transRecord) { 
 
            // 由于F1已经是按降序排列的，  


            for (int i = 0; i < F1.size(); i++) { 
 
                TreeNode tnode = F1.get(i);  


                if (tnode.getName().equals(item)) { 
 
                    map.put(item, i);  

                }  

            }  

        }  


        ArrayList<Entry<String, Integer>> al = new ArrayList<Entry<String, Integer>>( 
 
                map.entrySet());  


        Collections.sort(al, new Comparator<Map.Entry<String, Integer>>() { 
 
            @Override 


            public int compare(Entry<String, Integer> arg0, 
 
                    Entry<String, Integer> arg1) {  

                // 降序排列  


                return arg0.getValue() - arg1.getValue(); 
 
            }  

        });  


        LinkedList<String> rest = new LinkedList<String>(); 
 

        for (Entry<String, Integer> entry : al) { 
 
            rest.add(entry.getKey());  

        }  


        return rest; 
 
    }  


   
    /**  

     * 3.2 把若干个节点作为指定指定节点的后代插入树中  

     *   

     * @param ancestor  

     * @param record  

     * @param F1  

     */ 


    public void addNodes(TreeNode ancestor, LinkedList<String> record, 
 
            ArrayList<TreeNode> F1) {  


        if (record.size() > 0) { 
 

            while (record.size() > 0) { 
 
                String item = record.poll();  


                TreeNode leafnode = new TreeNode(item); 
 
                leafnode.setCount(1);  

                leafnode.setParent(ancestor);  

                ancestor.addChild(leafnode);  


   

                for (TreeNode f1 : F1) { 
 

                    if (f1.getName().equals(item)) { 
 

                        while (f1.getNextHomonym() != null) { 
 
                            f1 = f1.getNextHomonym();  

                        }  

                        f1.setNextHomonym(leafnode);  

                        break;  

                    }  

                }  


   
                addNodes(leafnode, record, F1);  

            }  

        }  

    }  


   
    /**  

     * 4. 从FPTree中找到所有的频繁模式  

     *   

     * @param root  

     * @param F1  

     * @return  

     */ 


    public Map<List<String>, Integer> findFP(TreeNode root, 
 
            ArrayList<TreeNode> F1) {  


        Map<List<String>, Integer> fp = new HashMap<List<String>, Integer>(); 
 

   
        Iterator<TreeNode> iter = F1.iterator();  


        while (iter.hasNext()) { 
 
            TreeNode curr = iter.next();  

            // 寻找cur的条件模式基CPB，放入transRecords中  


            List<List<String>> transRecords = new LinkedList<List<String>>(); 
 
            TreeNode backnode = curr.getNextHomonym();  


            while (backnode != null) { 
 

                int counter = backnode.getCount(); 
 

                List<String> prenodes = new ArrayList<String>(); 
 
                TreeNode parent = backnode;  

                // 遍历backnode的祖先节点，放到prenodes中  


                while ((parent = parent.getParent()).getName() != null) { 
 
                    prenodes.add(parent.getName());  

                }  


                while (counter-- > 0) { 
 
                    transRecords.add(prenodes);  

                }  

                backnode = backnode.getNextHomonym();  

            }  


   
            // 生成条件频繁1项集  

            ArrayList<TreeNode> subF1 = buildF1Items(transRecords);  

            // 建立条件模式基的局部FP-tree  

            TreeNode subRoot = buildFPTree(transRecords, subF1);  


   
            // 从条件FP-Tree中寻找频繁模式  


            if (subRoot != null) { 
 
                Map<List<String>, Integer> prePatterns = findPrePattern(subRoot);  


                if (prePatterns != null) { 
 
                    Set<Entry<List<String>, Integer>> ss = prePatterns  

                            .entrySet();  


                    for (Entry<List<String>, Integer> entry : ss) { 
 
                        entry.getKey().add(curr.getName());  

                        fp.put(entry.getKey(), entry.getValue());  

                    }  

                }  

            }  

        }  


   

        return fp; 
 
    }  


   
    /**  

     * 4.1 从一棵FP-Tree上找到所有的前缀模式  

     *   

     * @param root  

     * @return  

     */ 


    public Map<List<String>, Integer> findPrePattern(TreeNode root) { 
 
        Map<List<String>, Integer> patterns = null;  

        List<TreeNode> children = root.getChildren();  


        if (children != null) { 
 

            patterns = new HashMap<List<String>, Integer>(); 
 

            for (TreeNode child : children) { 
 
                // 找到以child为根节点的子树中的所有长路径（所谓长路径指它不是其他任何路径的子路径）  

                LinkedList<LinkedList<TreeNode>> paths = buildPaths(child);  


                if (paths != null) { 
 

                    for (List<TreeNode> path : paths) { 
 
                        Map<List<String>, Integer> backPatterns = combination(path);  

                        Set<Entry<List<String>, Integer>> entryset = backPatterns  

                                .entrySet();  


                        for (Entry<List<String>, Integer> entry : entryset) { 
 
                            List<String> key = entry.getKey();  


                            int c1 = entry.getValue(); 
 

                            int c0 = 0; 
 

                            if (patterns.containsKey(key)) { 
 
                                c0 = patterns.get(key).byteValue();  

                            }  

                            patterns.put(key, c0 + c1);  

                        }  

                    }  

                }  

            }  

        }  


   
        // 过滤掉那些小于MinSup的模式  

        Map<List<String>, Integer> rect = null;  


        if (patterns != null) { 
 

            rect = new HashMap<List<String>, Integer>(); 
 
            Set<Entry<List<String>, Integer>> ss = patterns.entrySet();  


            for (Entry<List<String>, Integer> entry : ss) { 
 

                if (entry.getValue() >= minSup) { 
 
                    rect.put(entry.getKey(), entry.getValue());  

                }  

            }  

        }  


        return rect; 
 
    }  


   
    /**  

     * 4.1.1 找到从指定节点（root）到所有可达叶子节点的路径  

     *   

     * @param stack  

     * @param root  

     */ 


    public LinkedList<LinkedList<TreeNode>> buildPaths(TreeNode root) { 
 
        LinkedList<LinkedList<TreeNode>> paths = null;  


        if (root != null) { 
 

            paths = new LinkedList<LinkedList<TreeNode>>(); 
 
            List<TreeNode> children = root.getChildren();  


            if (children != null) { 
 
                //在从树上分离单条路径时，对分叉口的节点，其count也要分到各条路径上去  

                //条件FP-Tree是多枝的情况  


                if (children.size() > 1) { 
 

                    for (TreeNode child : children) { 
 

                        int count = child.getCount(); 
 
                        LinkedList<LinkedList<TreeNode>> ll = buildPaths(child);  


                        for (LinkedList<TreeNode> lp : ll) { 
 

                                TreeNode prenode = new TreeNode(root.getName()); 
 
                                prenode.setCount(count);  

                                lp.addFirst(prenode);  

                            paths.add(lp);  

                        }  

                    }  

                }  

                //条件FP-Tree是单枝的情况  

                else{  


                    for (TreeNode child : children) { 
 
                        LinkedList<LinkedList<TreeNode>> ll = buildPaths(child);  


                        for (LinkedList<TreeNode> lp : ll) { 
 
                            lp.addFirst(root);  

                            paths.add(lp);  

                        }  

                    }  

                }  


            } else { 
 

                LinkedList<TreeNode> lp = new LinkedList<TreeNode>(); 
 
                lp.add(root);  

                paths.add(lp);  

            }  

        }  


        return paths; 
 
    }  


   
    /**  

     * 4.1.2  

     * 生成路径path中所有元素的任意组合，并记下每一种组合的count--其实就是组合中最后一个元素的count，因为我们的组合算法保证了树中  

     * （或path中)和组合中元素出现的相对顺序不变  

     *   

     * @param path  

     * @return  

     */ 


    public Map<List<String>, Integer> combination(List<TreeNode> path) { 
 

        if (path.size() > 0) { 
 
            // 从path中移除首节点  

            TreeNode start = path.remove(0);  

            // 首节点自己可以成为一个组合，放入rect中  


            Map<List<String>, Integer> rect = new HashMap<List<String>, Integer>(); 
 

            List<String> li = new ArrayList<String>(); 
 
            li.add(start.getName());  

            rect.put(li, start.getCount());  


   
            Map<List<String>, Integer> postCombination = combination(path);  


            if (postCombination != null) { 
 
                Set<Entry<List<String>, Integer>> set = postCombination  

                        .entrySet();   


                for (Entry<List<String>, Integer> entry : set) { 
 
                    // 把首节点之后元素的所有组合放入rect中  

                    rect.put(entry.getKey(), entry.getValue());  

                    // 首节点并上其后元素的各种组合放入rect中  


                    List<String> ll = new ArrayList<String>(); 
 
                    ll.addAll(entry.getKey());  

                    ll.add(start.getName());  

                    rect.put(ll, entry.getValue());  

                }  

            }  


   

            return rect; 
 

        } else { 
 

            return null; 
 
        }  

    }  


   
    /**  

     * 输出频繁1项集  

     *   

     * @param F1  

     */ 


    public void printF1(List<TreeNode> F1) { 
 
        System.out.println("F-1 set: ");  


        for (TreeNode item : F1) { 
 

            System.out.print(item.getName() + ":" + item.getCount() + "\t"); 
 
        }  

        System.out.println();  

        System.out.println();  

    }  


   
    /**  

     * 打印FP-Tree  

     *   

     * @param root  

     */ 


    public void printFPTree(TreeNode root) { 
 
        printNode(root);  

        List<TreeNode> children = root.getChildren();  


        if (children != null && children.size() > 0) { 
 

            for (TreeNode child : children) { 
 
                printFPTree(child);  

            }  

        }  

    }  


   
    /**  

     * 打印树上单个节点的信息  

     *   

     * @param node  

     */ 


    public void printNode(TreeNode node) { 
 

        if (node.getName() != null) { 
 

            System.out.print("Name:" + node.getName() + "\tCount:"
 
                    + node.getCount() + "\tParent:" 

                    + node.getParent().getName());  


            if (node.getNextHomonym() != null) 
 
                System.out.print("\tNextHomonym:" 

                        + node.getNextHomonym().getName());  

            System.out.print("\tChildren:");  

            node.printChildrenName();  

            System.out.println();  


        } else { 
 
            System.out.println("FPTreeRoot");  

        }  

    }  


   
    /**  

     * 打印最终找到的所有频繁模式集  

     *   

     * @param patterns  

     */ 


    public void printFreqPatterns(Map<List<String>, Integer> patterns) { 
 
        System.out.println();  


        System.out.println("MinSupport=" + this.getMinSup()); 
 
        System.out.println("Frequent Patterns and their Support");  

        Set<Entry<List<String>, Integer>> ss = patterns.entrySet();  


        for (Entry<List<String>, Integer> entry : ss) { 
 
            List<String> list = entry.getKey();  


            for (String item : list) { 
 
                System.out.print(item + " ");  

            }  

            System.out.print("\t"+entry.getValue());  

            System.out.println();  

        }  

    }  


   

    public static void main(String[] args) { 
 

        FPTree fptree = new FPTree(); 
 
        fptree.setMinSup(3);  

        List<List<String>> transRecords = fptree.readTransData("/home/orisun/test/market"); //第一组测试  

        //List<List<String>> transRecords = fptree.readTransData();         //第二组测试  

        ArrayList<TreeNode> F1 = fptree.buildF1Items(transRecords);  

        fptree.printF1(F1);  

        TreeNode treeroot = fptree.buildFPTree(transRecords, F1);  

        fptree.printFPTree(treeroot);  


   
        Map<List<String>, Integer> patterns = fptree.findFP(treeroot, F1);  

        fptree.printFreqPatterns(patterns);  

    }  

}

输出：

F-1 set:

薯片:7 鸡蛋:7 面包:7 牛奶:6 啤酒:4

FPTreeRoot

Name:薯片 Count:7 Parent:null Children:鸡蛋面包

Name:鸡蛋 Count:6 Parent:薯片 NextHomonym:鸡蛋 Children:面包啤酒牛奶

Name:面包 Count:4 Parent:鸡蛋 NextHomonym:面包 Children:牛奶

Name:牛奶 Count:3 Parent:面包 NextHomonym:牛奶 Children:啤酒

Name:啤酒 Count:1 Parent:牛奶 NextHomonym:啤酒 Children:null

Name:啤酒 Count:1 Parent:鸡蛋 NextHomonym:啤酒 Children:null

Name:牛奶 Count:1 Parent:鸡蛋 Children:null

Name:面包 Count:1 Parent:薯片 Children:牛奶

Name:牛奶 Count:1 Parent:面包 NextHomonym:牛奶 Children:null

Name:面包 Count:1 Parent:null NextHomonym:面包 Children:牛奶

Name:牛奶 Count:1 Parent:面包 NextHomonym:牛奶 Children:啤酒

Name:啤酒 Count:1 Parent:牛奶 NextHomonym:啤酒 Children:null

Name:鸡蛋 Count:1 Parent:null Children:面包

Name:面包 Count:1 Parent:鸡蛋 NextHomonym:面包 Children:啤酒

Name:啤酒 Count:1 Parent:面包 Children:null

MinSupport=3

Frequent Patterns and their Support

面包薯片牛奶 4

薯片牛奶 5

鸡蛋薯片面包 4

鸡蛋牛奶 4

鸡蛋面包牛奶 3

薯片面包 5

鸡蛋薯片牛奶 4

面包啤酒 3

鸡蛋面包薯片牛奶 3

面包牛奶 5

鸡蛋啤酒 3

薯片鸡蛋 6

鸡蛋面包 5

转载至：http://www.kuqin.com/algorithm/20111004/312340.html