WO1996023298A3 - System amd method for generating and using context dependent sub-syllable models to recognize a tonal language - Google Patents
System amd method for generating and using context dependent sub-syllable models to recognize a tonal language Download PDFInfo
- Publication number
- WO1996023298A3 WO1996023298A3 PCT/US1996/001002 US9601002W WO9623298A3 WO 1996023298 A3 WO1996023298 A3 WO 1996023298A3 US 9601002 W US9601002 W US 9601002W WO 9623298 A3 WO9623298 A3 WO 9623298A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- models
- speech
- initials
- finals
- syllable
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/027—Syllables being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Probability & Statistics with Applications (AREA)
- Document Processing Apparatus (AREA)
- Machine Translation (AREA)
Abstract
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9712532A GB2311640B (en) | 1995-01-26 | 1996-01-23 | System and method for generating and using context dependent sub-syllable models to recognize a tonal language |
AU47057/96A AU4705796A (en) | 1995-01-26 | 1996-01-23 | System amd method for generating and using context dependent sub-syllable models to recognize a tonal language |
HK98105291A HK1006093A1 (en) | 1995-01-26 | 1998-06-15 | A system and method for recognizing a tonal language. |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/378,963 | 1995-01-26 | ||
US08/378,963 US5680510A (en) | 1995-01-26 | 1995-01-26 | System and method for generating and using context dependent sub-syllable models to recognize a tonal language |
Publications (2)
Publication Number | Publication Date |
---|---|
WO1996023298A2 WO1996023298A2 (en) | 1996-08-01 |
WO1996023298A3 true WO1996023298A3 (en) | 1996-12-19 |
Family
ID=23495258
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1996/001002 WO1996023298A2 (en) | 1995-01-26 | 1996-01-23 | System amd method for generating and using context dependent sub-syllable models to recognize a tonal language |
Country Status (7)
Country | Link |
---|---|
US (1) | US5680510A (en) |
KR (1) | KR100391243B1 (en) |
CN (2) | CN1143263C (en) |
AU (1) | AU4705796A (en) |
GB (1) | GB2311640B (en) |
HK (4) | HK1006093A1 (en) |
WO (1) | WO1996023298A2 (en) |
Families Citing this family (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6067520A (en) * | 1995-12-29 | 2000-05-23 | Lee And Li | System and method of recognizing continuous mandarin speech utilizing chinese hidden markou models |
CN1120436C (en) * | 1997-09-19 | 2003-09-03 | 国际商业机器公司 | Speech recognition method and system for identifying isolated non-relative Chinese character |
AU5237398A (en) * | 1997-11-25 | 1999-06-15 | Apple Computer, Inc. | A method of continuous language recognition |
US5995932A (en) * | 1997-12-31 | 1999-11-30 | Scientific Learning Corporation | Feedback modification for accent reduction |
US6256410B1 (en) | 1998-07-30 | 2001-07-03 | International Business Machines Corp. | Methods and apparatus for customizing handwriting models to individual writers |
US6320985B1 (en) | 1998-07-31 | 2001-11-20 | International Business Machines Corporation | Apparatus and method for augmenting data in handwriting recognition system |
JP2001166789A (en) * | 1999-12-10 | 2001-06-22 | Matsushita Electric Ind Co Ltd | Method and device for voice recognition of chinese using phoneme similarity vector at beginning or end |
US6553342B1 (en) | 2000-02-02 | 2003-04-22 | Motorola, Inc. | Tone based speech recognition |
EP1298644B1 (en) * | 2000-06-26 | 2008-05-28 | Mitsubishi Denki Kabushiki Kaisha | Equipment operation system |
US6510410B1 (en) * | 2000-07-28 | 2003-01-21 | International Business Machines Corporation | Method and apparatus for recognizing tone languages using pitch information |
AU2000276402A1 (en) * | 2000-09-30 | 2002-04-15 | Intel Corporation | Method, apparatus, and system for bottom-up tone integration to chinese continuous speech recognition system |
US7353173B2 (en) * | 2002-07-11 | 2008-04-01 | Sony Corporation | System and method for Mandarin Chinese speech recognition using an optimized phone set |
AU2003252144A1 (en) | 2002-07-31 | 2004-02-16 | Washington State University Research Foundation | Geranyl diphosphate synthase molecules, and nucleic acid molecules encoding same |
US7353172B2 (en) * | 2003-03-24 | 2008-04-01 | Sony Corporation | System and method for cantonese speech recognition using an optimized phone set |
US7353174B2 (en) * | 2003-03-31 | 2008-04-01 | Sony Corporation | System and method for effectively implementing a Mandarin Chinese speech recognition dictionary |
US7684987B2 (en) * | 2004-01-21 | 2010-03-23 | Microsoft Corporation | Segmental tonal modeling for tonal languages |
CN1655232B (en) * | 2004-02-13 | 2010-04-21 | 松下电器产业株式会社 | Context-sensitive Chinese speech recognition modeling method |
CN1674092B (en) * | 2004-03-26 | 2010-06-09 | 松下电器产业株式会社 | Acoustic vowel trans-word modeling and decoding method and system for continuous digital recognition |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US7778831B2 (en) | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
US8010358B2 (en) * | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
DE602006013969D1 (en) * | 2006-08-11 | 2010-06-10 | Harman Becker Automotive Sys | Speech recognition using a statistical language model using square root smoothing |
US20080120108A1 (en) * | 2006-11-16 | 2008-05-22 | Frank Kao-Ping Soong | Multi-space distribution for pattern recognition based on mixed continuous and discrete observations |
US8244534B2 (en) * | 2007-08-20 | 2012-08-14 | Microsoft Corporation | HMM-based bilingual (Mandarin-English) TTS techniques |
US8442829B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
US8442833B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US8788256B2 (en) | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
CN103730128A (en) * | 2012-10-13 | 2014-04-16 | 复旦大学 | Audio clip authentication method based on frequency spectrum SIFT feature descriptor |
US20140214401A1 (en) | 2013-01-29 | 2014-07-31 | Tencent Technology (Shenzhen) Company Limited | Method and device for error correction model training and text error correction |
CN103970765B (en) * | 2013-01-29 | 2016-03-09 | 腾讯科技(深圳)有限公司 | Correct mistakes model training method, device and text of one is corrected mistakes method, device |
US9626354B2 (en) * | 2014-01-21 | 2017-04-18 | Lenovo (Singapore) Pte. Ltd. | Systems and methods for using tone indicator in text recognition |
US9946704B2 (en) | 2014-07-18 | 2018-04-17 | Lenovo (Singapore) Pte. Ltd. | Tone mark based text suggestions for chinese or japanese characters or words |
RU2632137C2 (en) * | 2015-06-30 | 2017-10-02 | Общество С Ограниченной Ответственностью "Яндекс" | Method and server of transcription of lexical unit from first alphabet in second alphabet |
CN109410918B (en) * | 2018-10-15 | 2020-01-24 | 百度在线网络技术(北京)有限公司 | Method and device for acquiring information |
US11554322B2 (en) | 2019-04-26 | 2023-01-17 | Sony Interactive Entertainment LLC | Game controller with touchpad input |
CN111046220A (en) * | 2019-04-29 | 2020-04-21 | 广东小天才科技有限公司 | Method for replaying reading voice in dictation process and electronic equipment |
US11048356B2 (en) * | 2019-07-31 | 2021-06-29 | Sony Interactive Entertainment LLC | Microphone on controller with touchpad to take in audio swipe feature data |
CN113096650B (en) * | 2021-03-03 | 2023-12-08 | 河海大学 | Acoustic decoding method based on prior probability |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5220639A (en) * | 1989-12-01 | 1993-06-15 | National Science Council | Mandarin speech input method for Chinese computers and a mandarin speech recognition machine |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4831551A (en) * | 1983-01-28 | 1989-05-16 | Texas Instruments Incorporated | Speaker-dependent connected speech word recognizer |
US5164900A (en) * | 1983-11-14 | 1992-11-17 | Colman Bernath | Method and device for phonetically encoding Chinese textual data for data processing entry |
US5212638A (en) * | 1983-11-14 | 1993-05-18 | Colman Bernath | Alphabetic keyboard arrangement for typing Mandarin Chinese phonetic data |
JPS62235998A (en) * | 1986-04-05 | 1987-10-16 | シャープ株式会社 | Syllable identification system |
US4803729A (en) * | 1987-04-03 | 1989-02-07 | Dragon Systems, Inc. | Speech recognition method |
US5027408A (en) * | 1987-04-09 | 1991-06-25 | Kroeker John P | Speech-recognition circuitry employing phoneme estimation |
JP2739945B2 (en) * | 1987-12-24 | 1998-04-15 | 株式会社東芝 | Voice recognition method |
EP0438662A2 (en) * | 1990-01-23 | 1991-07-31 | International Business Machines Corporation | Apparatus and method of grouping utterances of a phoneme into context-de-pendent categories based on sound-similarity for automatic speech recognition |
US5450523A (en) * | 1990-11-15 | 1995-09-12 | Matsushita Electric Industrial Co., Ltd. | Training module for estimating mixture Gaussian densities for speech unit models in speech recognition systems |
CA2088080C (en) * | 1992-04-02 | 1997-10-07 | Enrico Luigi Bocchieri | Automatic speech recognizer |
US5473728A (en) * | 1993-02-24 | 1995-12-05 | The United States Of America As Represented By The Secretary Of The Navy | Training of homoscedastic hidden Markov models for automatic speech recognition |
-
1995
- 1995-01-26 US US08/378,963 patent/US5680510A/en not_active Expired - Lifetime
-
1996
- 1996-01-23 WO PCT/US1996/001002 patent/WO1996023298A2/en active IP Right Grant
- 1996-01-23 CN CNB961915978A patent/CN1143263C/en not_active Expired - Lifetime
- 1996-01-23 AU AU47057/96A patent/AU4705796A/en not_active Abandoned
- 1996-01-23 KR KR1019970705072A patent/KR100391243B1/en not_active IP Right Cessation
- 1996-01-23 GB GB9712532A patent/GB2311640B/en not_active Expired - Lifetime
- 1996-01-23 CN CNB2004100040683A patent/CN1277248C/en not_active Expired - Lifetime
-
1998
- 1998-06-15 HK HK98105291A patent/HK1006093A1/en not_active IP Right Cessation
-
1999
- 1999-08-03 HK HK99103353A patent/HK1019258A1/en not_active IP Right Cessation
- 1999-08-03 HK HK99103354A patent/HK1019259A1/en not_active IP Right Cessation
-
2005
- 2005-05-03 HK HK05103746A patent/HK1070973A1/en not_active IP Right Cessation
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5220639A (en) * | 1989-12-01 | 1993-06-15 | National Science Council | Mandarin speech input method for Chinese computers and a mandarin speech recognition machine |
Non-Patent Citations (4)
Title |
---|
GIACHIN ET AL.: "WORD JUNCTURE MODELING USING INTER-WORD CONTEXT-DEPENDENT PHONE-LIKE UNITS", CSELT TECHNICAL REPORT ON EUROSPEECH 1991, vol. 20, no. 1, 1 March 1992 (1992-03-01), IT, pages 43 - 47, XP000314309 * |
HON ET AL.: "Towards large vocabulary Mandarin Chinese speech recognition", INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING 1994, vol. 1, 19 April 1994 (1994-04-19) - 22 April 1994 (1994-04-22), ADELAIDE, SA, AU, pages 545 - 548, XP002006388 * |
LIN ET AL.: "A new framework for recognition of Mandarin syllables with tones using sub-syllabic units", INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING 1993, vol. 2, 27 April 1993 (1993-04-27) - 30 April 1993 (1993-04-30), MINNEAPOLIS, MN, US, pages 227 - 230, XP000427767 * |
WANG ET AL.: "An intinial study on large-vocabulary contiuous Mandarin speech recognition with limited training data based on sub-syllabic models", PROCEEDINGS OF INTERNATIONAL COMPUTER SYMPOSIUM 1994, vol. 2, 12 December 1994 (1994-12-12) - 15 December 1994 (1994-12-15), HSINCHU, TW, pages 1140 - 1145, XP000573580 * |
Also Published As
Publication number | Publication date |
---|---|
KR19980701676A (en) | 1998-06-25 |
AU4705796A (en) | 1996-08-14 |
HK1019259A1 (en) | 2000-01-28 |
CN1542735A (en) | 2004-11-03 |
GB2311640B (en) | 1999-04-21 |
HK1019258A1 (en) | 2000-01-28 |
GB9712532D0 (en) | 1997-08-20 |
US5680510A (en) | 1997-10-21 |
CN1143263C (en) | 2004-03-24 |
CN1277248C (en) | 2006-09-27 |
KR100391243B1 (en) | 2003-10-17 |
CN1169199A (en) | 1997-12-31 |
HK1070973A1 (en) | 2005-06-30 |
GB2311640A (en) | 1997-10-01 |
WO1996023298A2 (en) | 1996-08-01 |
HK1006093A1 (en) | 1999-02-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO1996023298A3 (en) | System amd method for generating and using context dependent sub-syllable models to recognize a tonal language | |
Huang et al. | Whistler: A trainable text-to-speech system | |
US5668926A (en) | Method and apparatus for converting text into audible signals using a neural network | |
US6085160A (en) | Language independent speech recognition | |
KR100815115B1 (en) | An Acoustic Model Adaptation Method Based on Pronunciation Variability Analysis for Foreign Speech Recognition and apparatus thereof | |
US20020173962A1 (en) | Method for generating pesonalized speech from text | |
Chen et al. | Tone recognition of continuous Mandarin speech based on neural networks | |
JPS6413595A (en) | Voice recognition circuit using estimate of phoneme | |
AU640164B2 (en) | Method of speech recognition | |
WO2004090866A3 (en) | Phonetically based speech recognition system and method | |
WO2003019528A1 (en) | Intonation generating method, speech synthesizing device by the method, and voice server | |
MX9505299A (en) | Systems, methods and articles of manufacture for performing high resolution n-best string hypothesization. | |
EP1349145A3 (en) | System and method for providing information using spoken dialogue interface | |
EP0749109A3 (en) | Speech recognition for tonal languages | |
US20070294082A1 (en) | Voice Recognition Method and System Adapted to the Characteristics of Non-Native Speakers | |
EP0852374A3 (en) | Method and system for speaker-independent recognition of user-defined phrases | |
EP0767950B1 (en) | Method and device for adapting a speech recognition equipment for dialectal variations in a language | |
EP0949606A3 (en) | Method and system for speech recognition based on phonetic transcriptions | |
EP1005019A3 (en) | Segment-based similarity measurement method for speech recognition | |
Hon et al. | Towards large vocabulary Mandarin Chinese speech recognition | |
KR20060066483A (en) | Method for extracting feature vectors for voice recognition | |
US11276389B1 (en) | Personalizing a DNN-based text-to-speech system using small target speech corpus | |
Malfrère et al. | Phonetic alignment: speech synthesis based vs. hybrid HMM/ANN. | |
Li et al. | Acoustical F0 analysis of continuous Cantonese speech | |
Pols | Flexible, robust, and efficient human speech processing versus present-day speech technology |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 96191597.8 Country of ref document: CN |
|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AL AM AT AU AZ BB BG BR BY CA CH CN CZ DE DK EE ES FI GB GE HU IS JP KE KG KP KR KZ LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK TJ TM TR TT UA UG UZ VN AZ BY KG KZ RU TJ TM |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): KE LS MW SD SZ UG AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 9712532.2 Country of ref document: GB |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1019970705072 Country of ref document: KR |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase | ||
WWP | Wipo information: published in national office |
Ref document number: 1019970705072 Country of ref document: KR |
|
WWG | Wipo information: grant in national office |
Ref document number: 1019970705072 Country of ref document: KR |