| United States Patent | 7,636,698 |
| Crivat , et al. | December 22, 2009 |
Architecture for analyzing shifts in the data patterns of data mining models and outputting the results. The architecture allows comparing and describing differences between two semantically similar sets of patterns (or mining models), and analyzing historical changes in versions of the same model or differences in patterns found by two or more different algorithms applied to the same data. The architecture can also facilitate explaining data patterns that shift over time and over different data populations, and between versions of the same model that use different algorithms. A model component is employed for storing data mining models that have respective sets of data patterns obtained from a dataset, and an analysis component analyzes the sets of data patterns for difference data therebetween. The dataset can be a subsample of a larger set of data and can be analyzed by the analysis component over a time period.
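The comparison described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names `MiningModel`, `PatternDiff`, `diff_models`, the representation of a pattern as a rule string mapped to a support value, and the `tolerance` threshold are all assumptions introduced here for illustration only.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class MiningModel:
    """Hypothetical stand-in for a stored mining model: rule text -> support."""
    name: str
    patterns: Dict[str, float] = field(default_factory=dict)


@dataclass
class PatternDiff:
    """Hypothetical 'difference data' between two sets of patterns."""
    added: List[str]                         # patterns only in the newer model
    removed: List[str]                       # patterns only in the older model
    shifted: List[Tuple[str, float, float]]  # (pattern, old support, new support)


def diff_models(old: MiningModel, new: MiningModel,
                tolerance: float = 0.05) -> PatternDiff:
    """Compare two pattern sets; report a shift when support moves by more
    than `tolerance` (an assumed threshold, not taken from the source)."""
    old_keys, new_keys = set(old.patterns), set(new.patterns)
    added = sorted(new_keys - old_keys)
    removed = sorted(old_keys - new_keys)
    shifted = sorted(
        (key, old.patterns[key], new.patterns[key])
        for key in old_keys & new_keys
        if abs(new.patterns[key] - old.patterns[key]) > tolerance
    )
    return PatternDiff(added, removed, shifted)


# Two versions of the same model, trained on data from different periods.
q1 = MiningModel("q1", {"bread -> butter": 0.40, "beer -> chips": 0.25})
q2 = MiningModel("q2", {"bread -> butter": 0.55, "tea -> milk": 0.10})
result = diff_models(q1, q2)
```

The same function could, under these assumptions, compare models produced by two different algorithms on the same data, since it only relies on both models exposing comparable pattern keys.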
| Inventors: | Crivat; Ioan Bogdan (Redmond, WA), Cristofor; Elena D. (Redmond, WA), MacLennan; C. James (Redmond, WA) |
| Assignee: | Microsoft Corporation (Redmond, WA) |
| Appl. No.: | 11/376,993 |
| Filed: | March 16, 2006 |
| Current U.S. Class: | 706/21 |
| Current International Class: | G06E 1/00 (20060101); G06E 3/00 (20060101); G06G 7/00 (20060101) |
| Field of Search: | 706/21 |
| 5748852 | May 1998 | Mahler |
| 6311173 | October 2001 | Levin et al. |
| 6697998 | February 2004 | Damerau et al. |
| 6711585 | March 2004 | Copperman et al. |
| 6931418 | August 2005 | Barnes |
| 6950755 | September 2005 | Stahl |
| 7103222 | September 2006 | Peker |
| 2003/0130996 | July 2003 | Bayerl et al. |
| 2004/0042665 | March 2004 | Il et al. |
| 2004/0215599 | October 2004 | Apps et al. |
| 2005/0108254 | May 2005 | Zhang |
| 2005/0114360 | May 2005 | Russell et al. |
| 2005/0177393 | August 2005 | Sacco et al. |
| 2005/0177414 | August 2005 | Priogin et al. |
| WO2005010727 | February 2005 | WO |
Palace, Bill. "What is Data Mining?" 1998, date confirmed by Wayback Machine. cited by examiner.
Eamonn Keogh, Selina Chu, David Hart and Michael Pazzani. "An Online Algorithm for Segmenting Time Series." First IEEE International Conference on Data Mining (ICDM'01), p. 289, 2001. cited by examiner.
Venkatesh Ganti, Johannes Gehrke, and Raghu Ramakrishnan. "Mining data streams under block evolution." ACM SIGKDD Explorations Newsletter, vol. 3, issue 2, Jan. 2002. cited by examiner.
Xia, et al.; Indexing and Querying Constantly Evolving Data Using Time Series Analysis; 12 pages. cited by other.
Bounsaytip, et al.; Overview of Data Mining for Customer Behaviour Modeling; 2001; 59 pages. cited by other.
Jian-Cheng, et al.; Towards the Foundation of Data Mining vol. 2: Intelligent Multi-Objective Evolutionary Algorithm for Editing Minimum Reference Set; 2002; 78 pages. cited by other.