| United States Patent | 7,430,503 |
| Walker | September 30, 2008 |
The present invention is a method of combining corpora to achieve consistency in phonetic labeling. Corpora are received. A first corpus is selected from the corpora, and a phonetic transcript is generated for it if it does not include one. A second corpus is selected from the corpora, and a phonetic transcript is generated for it if it does not include one. Each allophone in the second corpus is identified, with at least one allophone identified for each phone in the second corpus. For each phone in the second corpus, the allophone that it most closely matches is identified. Each phone symbol in the phonetic transcript of the second corpus is replaced with the symbol for the corresponding identified allophone. The first corpus and second corpus are combined, including their phonetic transcripts, and the result is designated as the first corpus. If another corpus in the corpora remains to be processed, the method returns to the step of selecting a second corpus.
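The loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not say how transcripts are generated, how allophones are identified, or how closeness is measured, so everything below is an assumption made for illustration: the `Phone` and `Corpus` types, the caller-supplied `transcribe` and `identify_allophones` functions, the representation of each phone and allophone as a numeric feature vector, and the use of Euclidean distance as the matching criterion.

```python
from dataclasses import dataclass
from math import dist
from typing import Callable, Optional

Features = tuple[float, ...]  # hypothetical acoustic feature vector


@dataclass
class Phone:
    symbol: str
    features: Features


@dataclass
class Corpus:
    name: str
    transcript: Optional[list[Phone]] = None  # None means no transcript yet


def closest_allophone(phone: Phone, allophones: dict[str, Features]) -> str:
    """Return the allophone symbol nearest to the phone (assumed Euclidean)."""
    return min(allophones, key=lambda sym: dist(phone.features, allophones[sym]))


def combine_corpora(
    corpora: list[Corpus],
    transcribe: Callable[[Corpus], list[Phone]],
    identify_allophones: Callable[[Corpus], dict[str, Features]],
) -> Corpus:
    """Fold all corpora into one, relabeling each later corpus before merging."""
    if not corpora:
        raise ValueError("at least one corpus is required")

    # Select the first corpus; generate a transcript if it lacks one.
    first = corpora[0]
    merged = list(first.transcript if first.transcript is not None
                  else transcribe(first))
    name = first.name

    # Each remaining corpus takes its turn as the "second corpus".
    for second in corpora[1:]:
        transcript = (second.transcript if second.transcript is not None
                      else transcribe(second))
        # Allophone inventory for the second corpus (assumed caller-supplied).
        allophones = identify_allophones(Corpus(second.name, transcript))
        # Replace each phone symbol with its closest allophone symbol.
        relabeled = [Phone(closest_allophone(p, allophones), p.features)
                     for p in transcript]
        # Combine; the result is treated as the first corpus on the next pass.
        merged.extend(relabeled)
        name = f"{name}+{second.name}"

    return Corpus(name, merged)
```

Passing the two unspecified steps in as functions keeps the sketch limited to the control flow the abstract actually states: select, transcribe if needed, relabel, combine, repeat.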
| Inventors: | Walker; Brenton D. (College Park, MD) |
| Assignee: | The United States of America as represented by the Director, National Security Agency (Washington, DC) |
| Appl. No.: | 10/928,879 |
| Filed: | August 24, 2004 |
| Current U.S. Class: | 704/8 ; 704/10; 704/277; 704/E15.007 |
| Current International Class: | G10L 15/06 (20060101); G06F 17/20 (20060101) |
| Field of Search: | 704/8,9,10,243,244,255,277 |
| 4979216 | December 1990 | Malsheen et al. |
| 5758023 | May 1998 | Bordeaux |
| 5815639 | September 1998 | Bennett et al. |
| 5926787 | July 1999 | Bennett et al. |
| 5950159 | September 1999 | Knill |
| 6002998 | December 1999 | Martino et al. |
| 6023670 | February 2000 | Martino et al. |
| 6073095 | June 2000 | Dharanipragada et al. |
| 6085160 | July 2000 | D'hoore et al. |
| 6178397 | January 2001 | Fredenburg |
| 6385579 | May 2002 | Padmanabhan et al. |
| 7107215 | September 2006 | Ghali |
| 7149688 | December 2006 | Schalkwyk |
| 7191116 | March 2007 | Alpha |
| 7277851 | October 2007 | Henton |
| 2002/0173945 | November 2002 | Fabiani et al. |
| 2003/0135356 | July 2003 | Ying et al. |
| 2005/0033575 | February 2005 | Schneider |
| 2005/0165602 | July 2005 | Cote et al. |
| 2005/0197837 | September 2005 | Suontausta et al. |
Definition of "Allophone", Encyclopedia Britannica Online, One Page. cited by examiner.
Francoise Beaufays et al., "Learning Name Pronunciations in Automatic Speech Recognition Systems," undated. cited by other.