skip to main content
article
Free Access

Dynamic itemset counting and implication rules for market basket data

Authors Info & Claims
Published:01 June 1997Publication History
Skip Abstract Section

Abstract

We consider the problem of analyzing market-basket data and present several important contributions. First, we present a new algorithm for finding large itemsets which uses fewer passes over the data than classic algorithms, and yet uses fewer candidate itemsets than methods based on sampling. We investigate the idea of item reordering, which can improve the low-level efficiency of the algorithm. Second, we present a new way of generating “implication rules,” which are normalized based on both the antecedent and the consequent and are truly implications (not simply a measure of co-occurrence), and we show how they produce more intuitive results than other methods. Finally, we show how different characteristics of real data, as opposed by synthetic data, can dramatically affect the performance of the system and the form of the results.

References

  1. AIS93a R. Agrawal, T. Imilienski, and A. Swami. Database Mining: A Performance Perspective. IEEE Transactions on Knowledge and Data Engineerin9, 5(6):914-925, December 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. AIS93b R. Agrawal, T. Imilienski, and A. Swami. Mining Association Rules between Sets of Items in Large Databases. Proc. of the A CM SIGMOD Int'l Conf. on Management of Data, pages 207- 216, May 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. ALSS95 R. Agrawal, K. Lin, S. Sawhney, and K. Shim. Fast similarity search in the presence of noise, scaling and translation in time-series databases. in Proc. of the Int'l Conf. on Very Large Data Bases (VLDB), 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. AS94 R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proceedings of the 20th VLDB Conference, Santiago, Chile, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. AS95 R. Agrawal and R. Srikant. Mining sequential patterns. In Proceedings of the l lth International Conm}erence on Data Engineering, Taipei, Taiwan, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. MAR96 M. Mehta, R. Agrawal, and J. Rissanen. Sliq: A fast scalable classifier for data mining. March 1996.Google ScholarGoogle Scholar
  7. SA95 R. Srikant and R. Agrawal. Mining generalized association rules. 1995.Google ScholarGoogle Scholar
  8. Toi96 H. Toivonen. Sampling large databases for association rules. Proc. of the Int'l Conf. on Very Large Data Bases (VLDB), 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Dynamic itemset counting and implication rules for market basket data

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM SIGMOD Record
        ACM SIGMOD Record  Volume 26, Issue 2
        June 1997
        583 pages
        ISSN:0163-5808
        DOI:10.1145/253262
        Issue’s Table of Contents
        • cover image ACM Conferences
          SIGMOD '97: Proceedings of the 1997 ACM SIGMOD international conference on Management of data
          June 1997
          594 pages
          ISBN:0897919114
          DOI:10.1145/253260

        Copyright © 1997 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 June 1997

        Check for updates

        Qualifiers

        • article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader