|
|
|
|
|
|
|
|
| ( 1 of 1 ) |
| United States Patent | 8,589,399 |
| Lee , et al. | November 19, 2013 |
The subject matter of this specification can be embodied in, among other things, a method that includes identifying resources relating to an entity, where each resource includes multiple terms and is included in a corpus of resources relating to multiple entities. Candidate terms from the resources for potentially associating with the entity and a category associated with the entity are identified. A relative frequency of the candidate terms in the identified resources is compared to a frequency of the candidate terms associated with other entities. Each of the candidate terms are weighted, for example, based on a source of the candidate term and the relative frequency of the candidate term. A weighted frequency of each candidate term is calculated based on the weights, and candidate terms are selected as representative terms for the entity based on the weighted frequency.
| Inventors: | Lee; Jason (Forest Hills, NY), Stern; Tamara I. (New York, NY), Donaker; Gregory J. (Brooklyn, NY), Blair-Goldensohn; Sasha J. (New York, NY) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Applicant: |
| ||||||||||
| Assignee: |
Google Inc.
(Mountain View,
CA)
|
||||||||||
| Family ID: | 49555875 | ||||||||||
| Appl. No.: | 13/430,624 | ||||||||||
| Filed: | March 26, 2012 |
| Application Number | Filing Date | Patent Number | Issue Date | ||
|---|---|---|---|---|---|
| 61467911 | Mar 25, 2011 | ||||
| Current U.S. Class: | 707/737 |
| Current CPC Class: | G06F 17/30616 (20130101) |
| Current International Class: | G06F 17/30 (20060101) |
| Field of Search: | ;707/E17.083-84,730,741,737 |
| 5930474 | July 1999 | Dunworth et al. |
| 2007/0203816 | August 2007 | Costache et al. |
| 2009/0193328 | July 2009 | Reis et al. |
| 2010/0306203 | December 2010 | Rozok et al. |
| 2011/0302162 | December 2011 | Xiao et al. |
Effective Product Recommendation Using the Real-Time Web by Sandra Garcia Esparza (hereafter Esparza) Michael P. O'Mahony and Barry Smith, Research and Development in Intelligence System XXVII, Springer, Dec. 3, 2010. cited by examiner . Statistical Identification of Key Phrases for Text Classification by Frans Coenen (hereafter Coenen), Paul Leng, Robert Sanderson and Yanbo J. Wang, P. Perner (ed.): MLDM 2007, LNAI 4751, pp. 838-853, 2007, Spriner-Verlag Berlin Heidelberg 2007. cited by examiner. |
|
|