| United States Patent | 5,737,485 |
| Flanagan , et al. | April 7, 1998 |
A neural network is trained to transform distant-talking cepstrum coefficients, derived from a microphone array receiving speech from a speaker distant therefrom, into a form substantially similar to close-talking cepstrum coefficients that would be derived from a microphone close to the speaker, for providing robust hands-free speech and speaker recognition in adverse practical environments with existing speech and speaker recognition systems which have been trained on close-talking speech.
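The mapping described in the abstract can be illustrated with a toy sketch. This is not the patented system: the vector size `DIM`, the hidden-layer size `HIDDEN`, the synthetic distortion in `make_pair`, and the learning rate are all assumptions chosen only so the example runs quickly. A small network with one hidden layer is trained on paired frames so that its output for a "distant-talking" vector approaches the matching "close-talking" vector.

```python
import math
import random

DIM = 3       # cepstral coefficients per frame (toy value, an assumption)
HIDDEN = 8    # hidden units (an assumption)

def make_pair(rng):
    """Return (distant, close) toy cepstral vectors for one frame."""
    close = [rng.uniform(-1.0, 1.0) for _ in range(DIM)]
    # Hypothetical distortion standing in for room and channel effects.
    distant = [0.6 * close[j] + 0.3 * close[(j + 1) % DIM] + 0.4
               for j in range(DIM)]
    return distant, close

def init_net(rng):
    """Small random weights; the last entry of each row is the bias."""
    w1 = [[rng.uniform(-0.5, 0.5) for _ in range(DIM + 1)] for _ in range(HIDDEN)]
    w2 = [[rng.uniform(-0.5, 0.5) for _ in range(HIDDEN + 1)] for _ in range(DIM)]
    return w1, w2

def forward(net, x):
    """Tanh hidden layer followed by a linear output layer."""
    w1, w2 = net
    h = [math.tanh(sum(w * v for w, v in zip(row, x + [1.0]))) for row in w1]
    y = [sum(w * v for w, v in zip(row, h + [1.0])) for row in w2]
    return h, y

def train_step(net, x, target, lr):
    """One stochastic gradient step on the squared error."""
    w1, w2 = net
    h, y = forward(net, x)
    err = [yk - tk for yk, tk in zip(y, target)]
    # Back-propagate through the linear output layer into the tanh layer.
    delta_h = [(1.0 - h[i] ** 2) * sum(err[k] * w2[k][i] for k in range(DIM))
               for i in range(HIDDEN)]
    for k in range(DIM):
        for i, v in enumerate(h + [1.0]):
            w2[k][i] -= lr * err[k] * v
    for i in range(HIDDEN):
        for j, v in enumerate(x + [1.0]):
            w1[i][j] -= lr * delta_h[i] * v

def mse(pairs, transform):
    """Mean squared error between transform(distant) and close."""
    total = sum((a - b) ** 2 for x, t in pairs for a, b in zip(transform(x), t))
    return total / (len(pairs) * DIM)

rng = random.Random(0)
train = [make_pair(rng) for _ in range(64)]
held_out = [make_pair(rng) for _ in range(32)]
net = init_net(rng)
for _ in range(300):
    for x, t in train:
        train_step(net, x, t, lr=0.05)

mse_before = mse(held_out, lambda x: x)                  # untransformed distant vectors
mse_after = mse(held_out, lambda x: forward(net, x)[1])  # after the network
print(f"MSE before: {mse_before:.4f}, after: {mse_after:.4f}")
```

In a real system the pairs would come from simultaneous recordings through the microphone array and a close-talking microphone, and the transformed coefficients would be passed to a recognizer trained on close-talking speech.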
| Inventors: | Flanagan; James L. (Warren, NJ), Lin; Qiguang (Highland Park, NJ), Rahim; Mazin (Manalapan, NJ), Che; Chiwei (Edison, NJ) |
|---|---|
| Assignee: | Rutgers The State University of New Jersey (New Brunswick, NJ) |
| Family ID: | 23579539 |
| Appl. No.: | 08/399,445 |
| Filed: | March 7, 1995 |
| Current U.S. Class: | 704/232; 704/234; 704/241; 704/244; 704/E15.017 |
| Current CPC Class: | G10L 15/16 (20130101); G10L 25/24 (20130101) |
| Current International Class: | G10L 15/00 (20060101); G10L 15/16 (20060101); G10L 009/00 (); G10L 005/06 () |
| Field of Search: | 395/2.09, 2.1, 2.11, 2.4, 2.41, 2.35, 2.36, 2.37, 2.42, 2.5, 2.56, 2.6, 2.61, 2.67, 2.68, 21 |
| 3287649 | November 1966 | Rosenblatt |
| 5003490 | March 1991 | Castelaz et al. |
| 5040215 | August 1991 | Amano et al. |
| 5150323 | September 1992 | Castelaz |
| 5179624 | January 1993 | Amano et al. |
| 5185848 | February 1993 | Aritsuka et al. |
| 5212764 | May 1993 | Ariyoshi |
| 5307444 | April 1994 | Tsuboka |
| 5315704 | May 1994 | Shinta et al. |
| 5353376 | October 1994 | Oh et al. |
Che, Lin, Pearson, de Vries, and Flanagan, "Microphone Arrays and Neural Networks for Robust Speech Recognition," Proceedings of the ARPA Human Language Technology Workshop, pp. 321-326, Mar. 1994, Princeton, NJ.

Lin, Che, and Flanagan, "Microphone Array and Neural Network System for Speaker Identification," Proceedings of the ARPA Spoken Language Systems Technology Workshop, pp. 321-326, Mar. 1994, Princeton, NJ.

Qiguang Lin, Ea-Ee Jan, ChiWei Che, and James Flanagan, "Microphone Array and Neural Network System For Speaker Identification," Rutgers University, sent to Carnegie Mellon May 1994 for consideration.

C. Che, Q. Lin, J. Pearson, B. de Vries, and J. Flanagan, "Microphone Arrays and Neural Networks for Robust Speech Recognition," Rutgers University, Mar. 10, 1994.

Qiguang Lin, Ea-Ee Jan, James Flanagan, "Microphone Arrays and Speaker Identification," IEEE Transactions on Speech and Audio Processing, vol. 2, No. 4, pp. 622-629, Oct. 1994.

Q. Lin, E. Jan, C. Che, and J. Flanagan, "Speaker Identification in Teleconferencing Environments Using Microphone Arrays and Neural Networks," ESCA Workshop on Automatic Speaker Recognition, Identification and Verification, pp. 235-238, Apr. 1994.

Flanagan, Berkley, and Shipley, "A Digital Teleconferencing System with Integrated Modalities for Human/Machine Communication; HuMaNet," ICASSP-91, Apr. 14-17, 1991.

Kobatake et al., "Super Directive Sensor Array with Neural Network Structure," ICASSP-92, Mar. 23-26, 1992.

Colnet et al., "Far Field Array Processing with Neural Networks," Apr. 19-22, 1994.

Farrell, Mammone, and Flanagan, "Beamforming Microphone Arrays for Speech Enhancement," ICASSP-92, Mar. 23-26, 1992.

Thomas W. Parsons, "Voice and Speech Processing," McGraw-Hill, 1987, pp. 209, 372-374.