CA2174258A1 - Method and System for Automatic Transcription Correction - Google Patents

Method and System for Automatic Transcription Correction

Info

Publication number
CA2174258A1
CA2174258A1 CA2174258A CA2174258A CA2174258A1 CA 2174258 A1 CA2174258 A1 CA 2174258A1 CA 2174258 A CA2174258 A CA 2174258A CA 2174258 A CA2174258 A CA 2174258A CA 2174258 A1 CA2174258 A1 CA 2174258A1
Authority
CA
Canada
Prior art keywords
transcription
image
input text
character
modified
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA2174258A
Other languages
French (fr)
Other versions
CA2174258C (en
Inventor
Gary E. Kopec
Philip A. Chou
Leslie T. Niles
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xerox Corp
Original Assignee
Xerox Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xerox Corp filed Critical Xerox Corp
Publication of CA2174258A1 publication Critical patent/CA2174258A1/en
Application granted granted Critical
Publication of CA2174258C publication Critical patent/CA2174258C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Abstract

A method and system for automatically modifying an original transcription produced as the output of a recognition operation produces a second, modified transcription, such as, for example, automatically correcting an errorful transcription produced by an OCR operation. The invention uses information in an input text image of character images and in an original transcription associated with the input text image to modify aspects of a formal image source model that models as a grammar the spatial image structure of a set of text images. A recognition operation is then performed on the input text image using the modified formal image source model to produce a second, modified transcription. When the original transcription is errorful, the second transcription is a corrected transcription. Several aspectsof the formal image source model may be modified; in particular, character templates to be used in the recognition operation are trained in the font of the glyphs occurring in the input text image. When errors in the original transcription are caused by matching glyphs against templates that are inadequately specified for the given input text image, the subsequently performed recognition operation on the text image using the trained, font-specific character templates produces a more accurate transcription. Another image source model modification includes constructing a formal language model, based on a character n-gram technique, that models sequences of character n-grams occurring in the original transcription. In one implementation, the 2D image model is represented as a regular grammar in the form of a finite state transition network.
CA002174258A 1995-06-02 1996-04-16 Method and system for automatic transcription correction Expired - Fee Related CA2174258C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US460,454 1990-01-03
US08/460,454 US5883986A (en) 1995-06-02 1995-06-02 Method and system for automatic transcription correction

Publications (2)

Publication Number Publication Date
CA2174258A1 true CA2174258A1 (en) 1996-12-03
CA2174258C CA2174258C (en) 2000-06-06

Family

ID=23828779

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002174258A Expired - Fee Related CA2174258C (en) 1995-06-02 1996-04-16 Method and system for automatic transcription correction

Country Status (4)

Country Link
US (1) US5883986A (en)
EP (1) EP0745952B1 (en)
CA (1) CA2174258C (en)
DE (1) DE69616153T2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113239776A (en) * 2021-05-10 2021-08-10 北方工业大学 Pedestrian re-identification method based on energy model

Families Citing this family (136)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5836771A (en) * 1996-12-02 1998-11-17 Ho; Chi Fai Learning method and system based on questioning
US6498921B1 (en) * 1999-09-01 2002-12-24 Chi Fai Ho Method and system to answer a natural-language question
US6687404B1 (en) * 1997-06-20 2004-02-03 Xerox Corporation Automatic training of layout parameters in a 2D image model
AU728961B2 (en) * 1997-09-15 2001-01-25 Canon Kabushiki Kaisha A font architecture and creation tool for producing richer text
EP0902378A3 (en) 1997-09-15 2003-07-16 Canon Kabushiki Kaisha A font architecture and creation tool for producing richer text
US6154754A (en) * 1997-09-25 2000-11-28 Siemens Corporate Research, Inc. Automatic synthesis of semantic information from multimedia documents
AUPO951397A0 (en) 1997-09-29 1997-10-23 Canon Information Systems Research Australia Pty Ltd A method for digital data compression
US6332040B1 (en) * 1997-11-04 2001-12-18 J. Howard Jones Method and apparatus for sorting and comparing linear configurations
US6157905A (en) * 1997-12-11 2000-12-05 Microsoft Corporation Identifying language and character set of data representing text
JP3099797B2 (en) * 1998-03-19 2000-10-16 日本電気株式会社 Character recognition device
US5970451A (en) * 1998-04-14 1999-10-19 International Business Machines Corporation Method for correcting frequently misrecognized words or command in speech application
US6741743B2 (en) * 1998-07-31 2004-05-25 Prc. Inc. Imaged document optical correlation and conversion system
US6157910A (en) * 1998-08-31 2000-12-05 International Business Machines Corporation Deferred correction file transfer for updating a speech file by creating a file log of corrections
US6754631B1 (en) * 1998-11-04 2004-06-22 Gateway, Inc. Recording meeting minutes based upon speech recognition
US6333999B1 (en) * 1998-11-06 2001-12-25 International Business Machines Corporation Systematic enumerating of strings using patterns and rules
AU5451800A (en) 1999-05-28 2000-12-18 Sehda, Inc. Phrase-based dialogue modeling with particular application to creating recognition grammars for voice-controlled user interfaces
US20020032564A1 (en) 2000-04-19 2002-03-14 Farzad Ehsani Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface
US20020023123A1 (en) * 1999-07-26 2002-02-21 Justin P. Madison Geographic data locator
CN1207664C (en) * 1999-07-27 2005-06-22 国际商业机器公司 Error correcting method for voice identification result and voice identification system
EP1509902A4 (en) * 1999-07-28 2005-08-17 Custom Speech Usa Inc System and method for improving the accuracy of a speech recognition program
US7454509B2 (en) * 1999-11-10 2008-11-18 Yahoo! Inc. Online playback system with community bias
KR100530475B1 (en) 1999-11-10 2006-01-09 론치 미디어, 인크. Internet radio and broadcast method
US6977665B2 (en) * 1999-11-29 2005-12-20 Fuji Photo Film Co., Ltd. Method, apparatus and recording medium for generating composite image
US6389467B1 (en) 2000-01-24 2002-05-14 Friskit, Inc. Streaming media search and continuous playback system of media resources located by multiple network addresses
US7266236B2 (en) * 2000-05-03 2007-09-04 California Institute Of Technology Accelerated handwritten symbol recognition in a pen based tablet computer
US7251665B1 (en) 2000-05-03 2007-07-31 Yahoo! Inc. Determining a known character string equivalent to a query string
US8352331B2 (en) 2000-05-03 2013-01-08 Yahoo! Inc. Relationship discovery engine
US7024485B2 (en) * 2000-05-03 2006-04-04 Yahoo! Inc. System for controlling and enforcing playback restrictions for a media file by splitting the media file into usable and unusable portions for playback
US7162482B1 (en) * 2000-05-03 2007-01-09 Musicmatch, Inc. Information retrieval engine
US6678415B1 (en) 2000-05-12 2004-01-13 Xerox Corporation Document image decoding using an integrated stochastic language model
US6594393B1 (en) 2000-05-12 2003-07-15 Thomas P. Minka Dynamic programming operation with skip mode for text line image decoding
US6738518B1 (en) 2000-05-12 2004-05-18 Xerox Corporation Document image decoding using text line column-based heuristic scoring
US7110621B1 (en) * 2000-05-19 2006-09-19 Xerox Corporation Assist channel coding using a rewrite model
US7389234B2 (en) * 2000-07-20 2008-06-17 Microsoft Corporation Method and apparatus utilizing speech grammar rules written in a markup language
US7236932B1 (en) * 2000-09-12 2007-06-26 Avaya Technology Corp. Method of and apparatus for improving productivity of human reviewers of automatically transcribed documents generated by media conversion systems
JP3494292B2 (en) * 2000-09-27 2004-02-09 インターナショナル・ビジネス・マシーンズ・コーポレーション Error correction support method for application data, computer device, application data providing system, and storage medium
US8271333B1 (en) 2000-11-02 2012-09-18 Yahoo! Inc. Content-related wallpaper
US7406529B2 (en) * 2001-02-09 2008-07-29 Yahoo! Inc. System and method for detecting and verifying digitized content over a computer network
US7120900B2 (en) 2001-04-19 2006-10-10 International Business Machines Bi-directional display
US7574513B2 (en) 2001-04-30 2009-08-11 Yahoo! Inc. Controllable track-skipping
JP2002335041A (en) * 2001-05-07 2002-11-22 Sony Corp Laser driver and laser driving method
US7996207B2 (en) 2001-06-26 2011-08-09 International Business Machines Corporation Bidirectional domain names
US6883007B2 (en) * 2001-08-16 2005-04-19 International Business Machines Meta normalization for text
US7173914B2 (en) * 2001-08-30 2007-02-06 Northrop Grumman Corporation Communication node receipt of node-output information from processorless central equipment
US20070265834A1 (en) * 2001-09-06 2007-11-15 Einat Melnick In-context analysis
DE10147734A1 (en) * 2001-09-27 2003-04-10 Bosch Gmbh Robert Method for setting a data structure, in particular phonetic transcriptions for a voice-operated navigation system
US7707221B1 (en) 2002-04-03 2010-04-27 Yahoo! Inc. Associating and linking compact disc metadata
US7263227B2 (en) * 2002-04-25 2007-08-28 Microsoft Corporation Activity detector
US7305483B2 (en) 2002-04-25 2007-12-04 Yahoo! Inc. Method for the real-time distribution of streaming data on a network
US7024039B2 (en) * 2002-04-25 2006-04-04 Microsoft Corporation Block retouching
US7043079B2 (en) * 2002-04-25 2006-05-09 Microsoft Corporation “Don't care” pixel interpolation
US7392472B2 (en) * 2002-04-25 2008-06-24 Microsoft Corporation Layout analysis
US7120297B2 (en) * 2002-04-25 2006-10-10 Microsoft Corporation Segmented layered image system
US7164797B2 (en) * 2002-04-25 2007-01-16 Microsoft Corporation Clustering
US7110596B2 (en) * 2002-04-25 2006-09-19 Microsoft Corporation System and method facilitating document image compression utilizing a mask
US6986106B2 (en) 2002-05-13 2006-01-10 Microsoft Corporation Correction widget
US7137076B2 (en) * 2002-07-30 2006-11-14 Microsoft Corporation Correcting recognition results associated with user input
US20040057064A1 (en) * 2002-09-20 2004-03-25 Stringham Gary Glen Method to edit a document on a peripheral device
US7539086B2 (en) * 2002-10-23 2009-05-26 J2 Global Communications, Inc. System and method for the secure, real-time, high accuracy conversion of general-quality speech into text
US7958443B2 (en) 2003-02-28 2011-06-07 Dictaphone Corporation System and method for structuring speech recognized text into a pre-selected document format
US7310769B1 (en) * 2003-03-12 2007-12-18 Adobe Systems Incorporated Text encoding using dummy font
US7246311B2 (en) * 2003-07-17 2007-07-17 Microsoft Corporation System and methods for facilitating adaptive grid-based document layout
WO2005026916A2 (en) * 2003-09-10 2005-03-24 Musicmatch, Inc. Music purchasing and playing system and method
US7283685B2 (en) * 2003-09-23 2007-10-16 Microtek International Inc. Device that appends a recognition point for image joining to the extracted image and a recognition element thereof
US7870504B1 (en) * 2003-10-01 2011-01-11 TestPlant Inc. Method for monitoring a graphical user interface on a second computer display from a first computer
WO2005050474A2 (en) 2003-11-21 2005-06-02 Philips Intellectual Property & Standards Gmbh Text segmentation and label assignment with user interaction by means of topic specific language models and topic-specific label statistics
US7848573B2 (en) * 2003-12-03 2010-12-07 Microsoft Corporation Scaled text replacement of ink
US7506271B2 (en) * 2003-12-15 2009-03-17 Microsoft Corporation Multi-modal handwriting recognition correction
JP2005301664A (en) * 2004-04-12 2005-10-27 Fuji Xerox Co Ltd Image dictionary forming device, encoding device, data file, image dictionary forming method, and program thereof
GB2432704B (en) * 2004-07-30 2009-12-09 Dictaphone Corp A system and method for report level confidence
TWI405135B (en) * 2005-05-17 2013-08-11 Ibm System, method and recording medium
JP2006350867A (en) * 2005-06-17 2006-12-28 Ricoh Co Ltd Document processing device, method, program, and information storage medium
US8032372B1 (en) * 2005-09-13 2011-10-04 Escription, Inc. Dictation selection
US8170289B1 (en) * 2005-09-21 2012-05-01 Google Inc. Hierarchical alignment of character sequences representing text of same source
US20070078806A1 (en) * 2005-10-05 2007-04-05 Hinickle Judith A Method and apparatus for evaluating the accuracy of transcribed documents and other documents
US7751592B1 (en) * 2006-01-13 2010-07-06 Google Inc. Scoring items
US7702182B2 (en) 2006-02-16 2010-04-20 Adobe Systems, Incorporated Method and apparatus for creating a high-fidelity glyph prototype from low-resolution glyph images
US20150255067A1 (en) * 2006-04-05 2015-09-10 Canyon IP Holding LLC Filtering transcriptions of utterances using received information to correct transcription errors
PL2115732T3 (en) 2007-02-01 2015-08-31 Museami Inc Music transcription
US8126262B2 (en) * 2007-06-18 2012-02-28 International Business Machines Corporation Annotating video segments using feature rhythm models
US9277090B2 (en) 2007-10-01 2016-03-01 Hewlett-Packard Development Company, L.P. System and method of document reproduction
JP5557419B2 (en) * 2007-10-17 2014-07-23 スパンション エルエルシー Semiconductor device
EP2223265A1 (en) * 2007-11-20 2010-09-01 Lumex As A method for resolving contradicting output data from an optical character recognition (ocr) system, wherein the output data comprises more than one recognition alternative for an image of a character
US8442959B2 (en) * 2007-12-19 2013-05-14 Verizon Patent And Licensing Inc. Methods and systems for automated processing of fallout orders
US8494257B2 (en) * 2008-02-13 2013-07-23 Museami, Inc. Music score deconstruction
US7991153B1 (en) 2008-08-26 2011-08-02 Nanoglyph, LLC Glyph encryption system and related methods
US20100145677A1 (en) * 2008-12-04 2010-06-10 Adacel Systems, Inc. System and Method for Making a User Dependent Language Model
US8306327B2 (en) * 2008-12-30 2012-11-06 International Business Machines Corporation Adaptive partial character recognition
CN102439607B (en) * 2009-05-21 2015-05-20 惠普开发有限公司 Generation of an individual glyph, and system and method for inspecting individual glyphs
US20110060650A1 (en) * 2009-09-09 2011-03-10 Yefim Maranets Method of facilitating the vendor advertisement to local consumers via decentralized self-adjustable internet based apparatus
US9424242B2 (en) * 2010-04-14 2016-08-23 International Business Machines Corporation Data capture and analysis
US8682075B2 (en) * 2010-12-28 2014-03-25 Hewlett-Packard Development Company, L.P. Removing character from text in non-image form where location of character in image of text falls outside of valid content boundary
US8626681B1 (en) * 2011-01-04 2014-01-07 Google Inc. Training a probabilistic spelling checker from structured data
US9787725B2 (en) 2011-01-21 2017-10-10 Qualcomm Incorporated User input back channel for wireless displays
US8416243B2 (en) * 2011-03-10 2013-04-09 Konica Minolta Laboratory U.S.A., Inc. Approximating font metrics for a missing font when substituting an available replacement
US8688688B1 (en) 2011-07-14 2014-04-01 Google Inc. Automatic derivation of synonym entity names
US8953885B1 (en) * 2011-09-16 2015-02-10 Google Inc. Optical character recognition
KR20130128681A (en) * 2012-05-17 2013-11-27 삼성전자주식회사 Method for correcting character style and an electronic device thereof
US9218546B2 (en) * 2012-06-01 2015-12-22 Google Inc. Choosing image labels
US9047540B2 (en) * 2012-07-19 2015-06-02 Qualcomm Incorporated Trellis based word decoder with reverse pass
US8713433B1 (en) * 2012-10-16 2014-04-29 Google Inc. Feature-based autocorrection
KR101446468B1 (en) * 2012-11-28 2014-10-06 (주)이스트소프트 System and method for prividing automatically completed query
WO2014205632A1 (en) * 2013-06-24 2014-12-31 Adobe Systems Incorporated Gravity point drawing method
CN104252446A (en) * 2013-06-27 2014-12-31 鸿富锦精密工业(深圳)有限公司 Computing device, and verification system and method for consistency of contents of files
US9087272B2 (en) * 2013-07-17 2015-07-21 International Business Machines Corporation Optical match character classification
US20180270350A1 (en) 2014-02-28 2018-09-20 Ultratec, Inc. Semiautomated relay method and apparatus
US10748523B2 (en) 2014-02-28 2020-08-18 Ultratec, Inc. Semiautomated relay method and apparatus
US20180034961A1 (en) 2014-02-28 2018-02-01 Ultratec, Inc. Semiautomated Relay Method and Apparatus
US10878721B2 (en) 2014-02-28 2020-12-29 Ultratec, Inc. Semiautomated relay method and apparatus
US10389876B2 (en) 2014-02-28 2019-08-20 Ultratec, Inc. Semiautomated relay method and apparatus
US9753915B2 (en) 2015-08-06 2017-09-05 Disney Enterprises, Inc. Linguistic analysis and correction
US9787819B2 (en) * 2015-09-18 2017-10-10 Microsoft Technology Licensing, Llc Transcription of spoken communications
US9760786B2 (en) * 2015-10-20 2017-09-12 Kyocera Document Solutions Inc. Method and device for revising OCR data by indexing and displaying potential error locations
US10902284B2 (en) 2017-05-31 2021-01-26 Hcl Technologies Limited Identifying optimum pre-process techniques for text extraction
US10878186B1 (en) * 2017-09-18 2020-12-29 University Of South Florida Content masking attacks against information-based services and defenses thereto
US11222162B2 (en) 2017-09-29 2022-01-11 Dropbox, Inc. Managing content item collections
US10592595B2 (en) 2017-09-29 2020-03-17 Dropbox, Inc. Maintaining multiple versions of a collection of content items
US10922426B2 (en) 2017-09-29 2021-02-16 Dropbox, Inc. Managing content item collections
US11038973B2 (en) 2017-10-19 2021-06-15 Dropbox, Inc. Contact event feeds and activity updates
JP2019105957A (en) * 2017-12-12 2019-06-27 コニカミノルタ株式会社 Document structure analysis system, document structure analysis method, and program
US10783400B2 (en) 2018-04-06 2020-09-22 Dropbox, Inc. Generating searchable text for documents portrayed in a repository of digital images utilizing orientation and text prediction neural networks
US11017778B1 (en) 2018-12-04 2021-05-25 Sorenson Ip Holdings, Llc Switching between speech recognition systems
US11170761B2 (en) 2018-12-04 2021-11-09 Sorenson Ip Holdings, Llc Training of speech recognition systems
US10388272B1 (en) 2018-12-04 2019-08-20 Sorenson Ip Holdings, Llc Training speech recognition systems using word sequences
US10573312B1 (en) 2018-12-04 2020-02-25 Sorenson Ip Holdings, Llc Transcription generation from multiple speech recognition systems
CN111325194B (en) * 2018-12-13 2023-12-29 杭州海康威视数字技术股份有限公司 Character recognition method, device and equipment and storage medium
US11093690B1 (en) * 2019-07-22 2021-08-17 Palantir Technologies Inc. Synchronization and tagging of image and text data
US20210056220A1 (en) * 2019-08-22 2021-02-25 Mediatek Inc. Method for improving confidentiality protection of neural network model
US10614810B1 (en) * 2019-09-06 2020-04-07 Verbit Software Ltd. Early selection of operating parameters for automatic speech recognition based on manually validated transcriptions
CN110956133A (en) * 2019-11-29 2020-04-03 上海眼控科技股份有限公司 Training method of single character text normalization model, text recognition method and device
CN111241365B (en) * 2019-12-23 2023-06-30 望海康信(北京)科技股份公司 Table picture analysis method and system
US11539900B2 (en) 2020-02-21 2022-12-27 Ultratec, Inc. Caption modification and augmentation systems and methods for use by hearing assisted user
US11488604B2 (en) 2020-08-19 2022-11-01 Sorenson Ip Holdings, Llc Transcription of audio
CN112417851A (en) * 2020-11-26 2021-02-26 新智认知数据服务有限公司 Text error correction word segmentation method and system and electronic equipment
US20220198127A1 (en) * 2020-12-21 2022-06-23 International Business Machines Corporation Enhancement aware text transition
CN114639107B (en) * 2022-04-21 2023-03-24 北京百度网讯科技有限公司 Table image processing method, apparatus and storage medium

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3167746A (en) * 1962-09-20 1965-01-26 Ibm Specimen identification methods and apparatus
US3548202A (en) * 1968-11-29 1970-12-15 Ibm Adaptive logic system for unsupervised learning
US3969700A (en) * 1974-04-10 1976-07-13 International Business Machines Corporation Regional context maximum likelihood error correction for OCR, keyboard, and the like
US4654875A (en) * 1983-05-23 1987-03-31 The Research Foundation Of State University Of New York System to achieve automatic recognition of linguistic strings
US4599692A (en) * 1984-01-16 1986-07-08 Itt Corporation Probabilistic learning element employing context drive searching
US4769716A (en) * 1986-10-17 1988-09-06 International Business Machines Corporation Facsimile transmission using enhanced symbol prototypes with precalculated front and back white spaces
EP0312905B1 (en) * 1987-10-16 1992-04-29 Computer Gesellschaft Konstanz Mbh Method for automatic character recognition
US5048113A (en) * 1989-02-23 1991-09-10 Ricoh Company, Ltd. Character recognition post-processing method
US5020112A (en) * 1989-10-31 1991-05-28 At&T Bell Laboratories Image recognition method using two-dimensional stochastic grammars
US5048097A (en) * 1990-02-02 1991-09-10 Eastman Kodak Company Optical character recognition neural network system for machine-printed characters
JPH05346970A (en) * 1991-04-04 1993-12-27 Fuji Xerox Co Ltd Document recognizing device
US5526444A (en) * 1991-12-10 1996-06-11 Xerox Corporation Document image decoding using modified branch-and-bound methods
US5321773A (en) * 1991-12-10 1994-06-14 Xerox Corporation Image recognition method using finite state networks
US5303313A (en) * 1991-12-16 1994-04-12 Cartesian Products, Inc. Method and apparatus for compression of images
US5438630A (en) * 1992-12-17 1995-08-01 Xerox Corporation Word spotting in bitmap images using word bounding boxes and hidden Markov models
US5544260A (en) * 1994-07-12 1996-08-06 International Business Machines Corporation Silent training by error correction for on-line handwritting recognition systems

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113239776A (en) * 2021-05-10 2021-08-10 北方工业大学 Pedestrian re-identification method based on energy model

Also Published As

Publication number Publication date
EP0745952A3 (en) 1997-09-10
US5883986A (en) 1999-03-16
EP0745952B1 (en) 2001-10-24
DE69616153D1 (en) 2001-11-29
EP0745952A2 (en) 1996-12-04
CA2174258C (en) 2000-06-06
DE69616153T2 (en) 2002-03-07

Similar Documents

Publication Publication Date Title
CA2174258A1 (en) Method and System for Automatic Transcription Correction
CN108287820B (en) Text representation generation method and device
CN109583952B (en) Advertisement case processing method, device, equipment and computer readable storage medium
CA2171773A1 (en) Automatic Training of Character Templates Using a Transcription and a Two-Dimensional Image Source Model
US20030130847A1 (en) Method of training a computer system via human voice input
WO1994029782A3 (en) Method and system for creating, specifying, and generating parametric fonts
WO2006002219A3 (en) Systems and methods for spell correction of non-roman characters and words
EP0768612A3 (en) Method and apparatus for generating structured document
CN112329447B (en) Training method of Chinese error correction model, chinese error correction method and device
CN110211562B (en) Voice synthesis method, electronic equipment and readable storage medium
CN110929094A (en) Video title processing method and device
US5101375A (en) Method and apparatus for providing binding and capitalization in structured report generation
US6701023B1 (en) Reducing appearance differences between coded and noncoded units of text
US20050033566A1 (en) Natural language processing method
KR102430918B1 (en) Device and method for correcting Korean spelling
CN112528680A (en) Corpus expansion method and system
JPH11344998A (en) Method, device for setting reading and metrical information and storage medium in which reading and metrical information setting program is stored
CN113435426B (en) Data augmentation method, device and equipment for OCR recognition and storage medium
JP2794919B2 (en) Machine translation equipment
CN117539771A (en) Test case automatic generation method based on pre-training language model
TR2023017484A2 (en) A SYSTEM THAT PROVIDES THE REMOVAL OF WRITTEN ERRORS IN TEXT DATA
CN111241845A (en) Automatic financial subject identification method and device based on semantic matching method
CN117933234A (en) Method for controlling model to output structured data
CN117764148A (en) Training method and device for pre-training language model
JPH05188998A (en) Speech recognizing method

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed