WO1996039675A1 - Method and microprocessor for preprocessing handwriting having characters composed of a preponderance of straight line segments - Google Patents

Method and microprocessor for preprocessing handwriting having characters composed of a preponderance of straight line segments Download PDF

Info

Publication number
WO1996039675A1
WO1996039675A1 PCT/US1996/005882 US9605882W WO9639675A1 WO 1996039675 A1 WO1996039675 A1 WO 1996039675A1 US 9605882 W US9605882 W US 9605882W WO 9639675 A1 WO9639675 A1 WO 9639675A1
Authority
WO
WIPO (PCT)
Prior art keywords
points
angle
change
point
segmentation
Prior art date
Application number
PCT/US1996/005882
Other languages
French (fr)
Inventor
Kannan Parthasarathy
John L. C. Seybold
Original Assignee
Motorola Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc. filed Critical Motorola Inc.
Priority to KR1019970708942A priority Critical patent/KR100297481B1/en
Priority to AU57884/96A priority patent/AU5788496A/en
Publication of WO1996039675A1 publication Critical patent/WO1996039675A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/32Digital ink
    • G06V30/333Preprocessing; Feature extraction
    • G06V30/347Sampling; Contour coding; Stroke extraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition

Definitions

  • This invention relates generally to handwriting recognition, and more particularly to a method of stroke segmentation of handwritten input.
  • a key requirement in a stroke-based recognizer is that multiple instances of the same character class (e.g. the letter "A" written at different times and by different writers) should be divided into a similar set of strokes each time. This helps insure that recognition is not too difficult, because the description of the character instances will "look" similar to the character recognizer itself. In the ideal case, all instances of a given character would always contain the same number of strokes, the strokes would all be in the same relative locations, and the featural descriptions of the strokes would all be very similar across the instances. This ideal is not achievable in practice, but to the extent it can be approached, recognition accuracy can be improved.
  • stroke boundaries are set at points where pen velocity in the vertical (or "y") direction is zero — that is, at points where the writing begins moving upward, or begins moving downward.
  • the resulting set of strokes may then be called “upstrokes” and “downstrokes”.
  • This method is discussed in Mermelstein & Eden, "Experiments on Computer Recognition of Connected Handwritten Words", in Information And Control vol. 7, pp. 255-270, 1964.
  • One problem with this method is that it is overly sensitive to changes in the vertical direction, and not sensitive at all to changes in the horizontal direction.
  • Yet many characters are composed of horizontal pieces — for example, the cross of a "t” and the three prongs of an "E” are normally much more horizontal than vertical, even in sloppy writing.
  • a y-velocity based stroke segmenter will sometimes break a horizontal piece into one stroke, but often it will be broken into two, three, or even many more, simply because of small jitters of the pen in the vertical direction. This leads to poor recognition accuracy, because multiple instances of the same character will often be segmented into different-looking sets of strokes.
  • FIG. 1 illustrates a flow diagram of operation for identifying stroke boundaries in accordance with a preferred embodiment of the present invention.
  • FIG 2 illustrates segmentation points for handwritten input produced by a preferred embodiment of the present invention after step 110 of FIG. 1.
  • FIG 3 illustrates segmentation points for handwritten input produced by a preferred embodiment of the present invention after step 112 of FIG. 1.
  • FIG. 4 illustrates a sequence of straight line segments (strokes) representing the handwritten input of FIG. 2.
  • FIG. 5 illustrates determining a sequence of straight line segments (strokes) representing handwritten input wherein an input pen is lifted (dashed line).
  • FIG. 6 is an exploded view illustrating a change in angle determination.
  • FIG. 7 illustrates graphically the angle change values calculated for each point in FIG. 6.
  • FIG. 8 is a block diagram of a device/handwriting recognition microprocessor for recognizing handwritten characters which have a preponderance of straight line segments in accordance with the present invention.
  • handwritten character input is collected from the user in the form of discrete continuous segments.
  • a discrete continuous segment consists of one or more pen segments, where a pen segment is the mark left by a pen during its period of contact with an input device such as a digitizing tablet or paper.
  • one or more discrete continuous segments are the units of handwritten input being segmented into strokes.
  • a stroke is a straight line approximation of the actual handwritten input.
  • Handwritten input is input which is captured electronically that includes but is not limited to the following: handwritten input; electronic input; input captured through pressure, such as stamped input; input that is received electronically, such as via facsimile, pager, or other device.
  • a segment is represented as a sequence of points sampled at approximately regular intervals by the input device. Each point is described at minimum by an X coordinate, a Y coordinate, and a pen state. A pen state indicates whether the pen is in contact with the tablet and can take only one of two values (pen up and pen down). Segments may be captured electronically using a digitizing tablet, or alternatively may be derived from a scanned or faxed image through a process of line detection in the image; such methods of capturing electronically are understood in the art. In a preferred method handwritten input is accepted by a device, such as a personal digital assistant (PDA) or other device.
  • PDA personal digital assistant
  • handwritten inputs include, but are not limited to, the following: computers, modems, pagers, telephones, digital televisions, interactive televisions, devices having a digitizing tablet, facsimile devices, scanning devices, and other devices with the ability to capture handwritten input.
  • computers modems, pagers, telephones, digital televisions, interactive televisions, devices having a digitizing tablet, facsimile devices, scanning devices, and other devices with the ability to capture handwritten input.
  • a digitizing tablet devices having a digitizing tablet
  • facsimile devices scanning devices
  • other devices with the ability to capture handwritten input are captured electronically, each point is represented by a pixel, such that a segment is represented by a series of pixels on the device.
  • handwritten input may be in the form of alphanumeric characters, ideographic characters or other forms of characters or symbols of written communication.
  • stroke boundaries are obtained as points where the angle change is a local maximum and exceeds a predetermined threshold value that corresponds to sharp bends in the writing. Since a sharp bend can occur regardless of the direction the pen is moving, this method is not sensitive to the orientation of the various parts of handwritten input, such as words or characters.
  • FIG. 1 illustrates a flowchart of a preferred method in accordance with the teachings of the present invention.
  • Handwritten input from the digitizer or other device is received in the form of x, y coordinates and associated penup or pendown states (102).
  • these points are represented by pixels.
  • the x, y inputs are scaled (104) so the resulting coordinate values are within prespecified limits.
  • the scaled x, y coordinates are smoothed (106) to remove high-frequency noise arising from the digitizing tablet. Smoothing is done by averaging the coordinates (x,y) of a point with its neighbors (weighting the point itself and the nearest points higher), and replacing the (x,y) coordinate values of the point in question with the computed average.
  • the smoothing window size is fixed. Smoothing is not done across two separate pendown segments.
  • the angle between the first vector and the second vector which represents the angle change at the current point (see FIG. 6 ), is then determined (108).
  • Potential segmentation points are identified as pendown points at local maxima of the angle changes that exceed a preset threshold (110) (see FIG. 7).
  • the preset threshold is selected to maximize true segmentation points and to minimize false segmentation points (e.g., a segmentation point caused by jitter).
  • spurious segmentation points are removed (112). For example, 'hooks', i.e., short pen movements at beginning/end of pen down segments, are removed, and where an angle change at a first PC ⁇ YUS96/05882
  • segmentation point is less than the angle change at a second segmentation point that lies within a preset distance from the first segmentation point, the first segmentation point is deleted. Then, a sequence of straight line strokes are determined from the useful segmentation points (114). Straight line segments are also introduced between two segmentation points that have a penup between them.
  • Each straight line stroke is typically described by four parameters: x coordinate of a midpoint of the straight line stroke, y coordinate of the midpoint of the straight line stroke, a length of the straight line stroke, and an angle of the straight line stroke with respect to a predetermined reference.
  • FIG 2 illustrates segmentation points for handwritten input produced by a preferred embodiment of the present invention after step 110 of FIG. 1.
  • the segmentation points are obtained from a local maximum of the angle change that exceeds a preset threshold and from the beginnings and ends of pen down segments.
  • segments (201, 202, 204, 206, 208, 210, 212, 214, 216, 218) are determined.
  • Each of the above segments are bounded by two segmentation points.
  • FIG 3 illustrates segmentation points (302) for handwritten input produced by a preferred embodiment of the present invention after step 112 of FIG. 1.
  • segmentation point 220 in FIG. 2 is eliminated in FIG. 3 since it represents a spurious segmentation point. That is, segment 201 represents a 'hook', i.e., an artifact, so segmentation point 220 is removed.
  • the rule for determining that the segmentation point 220 is spurious is that segmentation 220 lies less than a prespecified distance from the segmentation point 222, which has a larger angle change when compared with segmentation point 220.
  • FIG. 4 illustrates a sequence (402) of straight line segments (strokes) representing the handwritten input of FIG. 2. The sequence of points between any two consecutive segmentation points shown in FIG. 3 is replaced by a straight line in FIG. 4.
  • a pen is raised from the input device (penup segment) and then is placed at a new position to continue the handwritten input.
  • An example of this is shown in FIG. 5.
  • the dotted line (502) represents the segment where the pen was lifted.
  • the penup stroke is indistinguishable from pendown strokes.
  • FIG. 6 is an exploded view illustrating a change in angle determination.
  • a first vector (602) joins previous raw point k- 1 (from original input data) to the current point k.
  • a second vector (604) joins the current point k to the next point k+1.
  • the angle a between the two vectors is the angle change at point k.
  • the angle change is set to zero.
  • FIG. 7 illustrates graphically the angle change values calculated for each point in FIG. 6 plotted against the index k of the original points.
  • the figure shows three local maxima (702, 704, 706) for the angle changes.
  • the maxima 702 and 704 exceed the preset threshold 708 and hence are selected as potential segmentation points.
  • maximum 706 is less than the preset threshold 708, and thus is not selected as a segmentation point.
  • the spatial distance between the points represented by 702 and 704 is less than a predetermined distance threshold, and hence only one of the two points represented by the maxima is a useful segmentation point and the other is a spurious segmentation point. Since the maximum 702 is greater than the maximum 704, the point represented by maximum 702 is the useful segmentation point, and the point represented by the maximum 704 is a spurious segmentation point.
  • the first threshold 708 is used to compare the angle changes at the local maxima of the angle change curve (FIG. 7) in order to select potential segmentation points.
  • the second threshold is used to compare the distance between two segmentation points to separate useful and spurious segmentation points.
  • the set of segmentation points in accordance with the present invention defines a set of corresponding straight line strokes. These straight line strokes can then be forwarded to a stroke-based character recognizer for recognition of the handwritten input.
  • the present invention may also be implemented by a device/handwriting recognition microprocessor for, upon receiving handwritten character input as a sequence of (x,y, pen) points where x and y are coordinates in a two dimensional coordinate system and pen is a binary value indicating an associated penup/pendown state, recognizing a handwritten character composed of a plurality of (x,y,pen) values.
  • the device/handwriting recognition microprocessor includes an angle change determiner (806), a potential segmentation point identifier (808), an artifact remover (810), and a stroke determiner (812). Where selected, said device/microprocessor includes at least one of: a sealer (802) and a smoother (804).
  • the sealer (802) is used for scaling the x,y coordinates so the resulting coordinate values are within prespecified limits.
  • the smoother (804) may immediately precede the angle change determiner (806) or may be operably coupled to the sealer (802) (where a sealer is utilized), and is used for smoothing the scaled x, y coordinates to remove high-frequency noise.
  • the angle change determiner (806) is operably coupled to receive the sequence of (x,y,pen) points and is used for determining a change in angle at each (x,y,pen) point. Where selected, the angle change determiner
  • the potential segmentation point identifier (808) is operably coupled to the angle change determiner (806) and is used for identifying potential segmentation points as pendown points whose angle change exceeds a preset threshold.
  • the artifact remover (810) is operably coupled to the potential segmentation point identifier (808) and is used for removing spurious segmentation points to provide useful segmentation points.
  • the stroke determiner (812) is operably coupled to the artifact remover (810) and is used for determining a sequence of straight line strokes to represent the handwritten input from the useful segmentation points.
  • the device/handwriting recognition microprocessor may include: A) an angle change determiner (806), operably coupled to receive the (x,y, pen) points, for determining a change in angle at each (x,y,pen) point; and B) a point to stroke delineator (816), operably coupled to the angle change determiner, for determining useful segmentation points and generating a sequence of straight line strokes to represent the handwritten input from the useful segmentation points.
  • a value adjuster may be operably coupled immediately prior to the angle change determiner (806) to receive the sequence of (x,y,pen) values, for modifying said values to facilitate processing and for providing the modified values to the angle change determiner (806).
  • the value adjuster (814) may be selected to include: A) a sealer (802), for scaling the x,y coordinates so the resulting coordinate values are within prespecified limits; and B) a smoother (804), operably coupled to the sealer, for smoothing the scaled x, y coordinates to remove high- frequency noise.
  • the point to stroke delineator (816) may include: A) a potential segmentation point identifier (808), operably coupled to the angle change determiner (806), for identifying potential segmentation points as pendown points whose angle change exceeds a preset threshold; B) an artifact remover (810), operably coupled to the potential segmentation point identifier (808), for removing spurious segmentation points to provide useful segmentation points; and C) a stroke determiner (812), operably coupled to the artifact remover (810), for determining a sequence of straight line strokes to represent the handwritten input from the useful segmentation points.
  • the method of the present invention may be embodied in a tangible medium of/for a computer.
  • the tangible medium may be a computer diskette or a memory unit of the computer.
  • the present invention and its preferred embodiments relate to novel, more accurate methods of stroke segmentation for characters composed of a preponderance of straight line segments (for example, Chinese characters).
  • the input is repeatedly divided into a similar set of straight line strokes each time.
  • stroke segmentation aids in providing more accurate interpretation by a stroke-based character recognizer.

Abstract

The method and microprocessor of the present invention includes a step of calculating the angle changes, in the handwritten input (108). The method then selects potential segmentation points in the input whose angle change exceeds a predetermined threshold (110). The segmentation points are used to determine a sequence of straight line strokes that may be used by a recognizer for interpretation of the handwritten input (114).

Description

METHOD AND MICROPROCESSOR FOR PREPROCESSING
HANDWRITING HAVING CHARACTERS COMPOSED OF A
PREPONDERANCE OF STRAIGHT LINE SEGMENTS
Field of the Invention
This invention relates generally to handwriting recognition, and more particularly to a method of stroke segmentation of handwritten input.
Background of the Invention
Machine recognition of human handwriting is a very difficult problem, and with the recent explosion of pen-based computing devices, has become an important problem to be addressed. There exist many very different approaches to this problem, but one useful approach has been to divide the writing into a sequence of fundamental movements, or "strokes", and use the strokes (which are parametrized in some- way) as inputs to a stroke-based character recognizer.
A key requirement in a stroke-based recognizer is that multiple instances of the same character class (e.g. the letter "A" written at different times and by different writers) should be divided into a similar set of strokes each time. This helps insure that recognition is not too difficult, because the description of the character instances will "look" similar to the character recognizer itself. In the ideal case, all instances of a given character would always contain the same number of strokes, the strokes would all be in the same relative locations, and the featural descriptions of the strokes would all be very similar across the instances. This ideal is not achievable in practice, but to the extent it can be approached, recognition accuracy can be improved. In one technique in the prior art, stroke boundaries are set at points where pen velocity in the vertical (or "y") direction is zero — that is, at points where the writing begins moving upward, or begins moving downward. The resulting set of strokes may then be called "upstrokes" and "downstrokes". This method is discussed in Mermelstein & Eden, "Experiments on Computer Recognition of Connected Handwritten Words", in Information And Control vol. 7, pp. 255-270, 1964. One problem with this method is that it is overly sensitive to changes in the vertical direction, and not sensitive at all to changes in the horizontal direction. Yet many characters are composed of horizontal pieces — for example, the cross of a "t" and the three prongs of an "E" are normally much more horizontal than vertical, even in sloppy writing. A y-velocity based stroke segmenter will sometimes break a horizontal piece into one stroke, but often it will be broken into two, three, or even many more, simply because of small jitters of the pen in the vertical direction. This leads to poor recognition accuracy, because multiple instances of the same character will often be segmented into different-looking sets of strokes.
Attempts to correct the inaccuracies of this method, including requiring a minimum vertical direction change before creating a new stroke have had only limited success, and many of the same basic problems still remain.
Accordingly a need exists for a stroke segmentation technique that is more accurate and not subject to the problems in the methods discussed, such as the y-velocity method.
Brief Description of the Drawings
FIG. 1 illustrates a flow diagram of operation for identifying stroke boundaries in accordance with a preferred embodiment of the present invention. FIG 2 illustrates segmentation points for handwritten input produced by a preferred embodiment of the present invention after step 110 of FIG. 1.
FIG 3 illustrates segmentation points for handwritten input produced by a preferred embodiment of the present invention after step 112 of FIG. 1.
FIG. 4 illustrates a sequence of straight line segments (strokes) representing the handwritten input of FIG. 2.
FIG. 5 illustrates determining a sequence of straight line segments (strokes) representing handwritten input wherein an input pen is lifted (dashed line).
FIG. 6 is an exploded view illustrating a change in angle determination.
FIG. 7 illustrates graphically the angle change values calculated for each point in FIG. 6.
FIG. 8 is a block diagram of a device/handwriting recognition microprocessor for recognizing handwritten characters which have a preponderance of straight line segments in accordance with the present invention.
Detailed Description of the Preferred Embodiments
Typically, handwritten character input is collected from the user in the form of discrete continuous segments. A discrete continuous segment consists of one or more pen segments, where a pen segment is the mark left by a pen during its period of contact with an input device such as a digitizing tablet or paper. In the present invention one or more discrete continuous segments are the units of handwritten input being segmented into strokes. A stroke is a straight line approximation of the actual handwritten input. Handwritten input is input which is captured electronically that includes but is not limited to the following: handwritten input; electronic input; input captured through pressure, such as stamped input; input that is received electronically, such as via facsimile, pager, or other device.
A segment is represented as a sequence of points sampled at approximately regular intervals by the input device. Each point is described at minimum by an X coordinate, a Y coordinate, and a pen state. A pen state indicates whether the pen is in contact with the tablet and can take only one of two values (pen up and pen down). Segments may be captured electronically using a digitizing tablet, or alternatively may be derived from a scanned or faxed image through a process of line detection in the image; such methods of capturing electronically are understood in the art. In a preferred method handwritten input is accepted by a device, such as a personal digital assistant (PDA) or other device. Other devices that function to receive handwritten input include, but are not limited to, the following: computers, modems, pagers, telephones, digital televisions, interactive televisions, devices having a digitizing tablet, facsimile devices, scanning devices, and other devices with the ability to capture handwritten input. Generally, when handwritten inputs are captured electronically, each point is represented by a pixel, such that a segment is represented by a series of pixels on the device.
In accordance with the present invention, handwritten input may be in the form of alphanumeric characters, ideographic characters or other forms of characters or symbols of written communication. In the present technique, stroke boundaries are obtained as points where the angle change is a local maximum and exceeds a predetermined threshold value that corresponds to sharp bends in the writing. Since a sharp bend can occur regardless of the direction the pen is moving, this method is not sensitive to the orientation of the various parts of handwritten input, such as words or characters. Referring now to FIG. 1, FIG. 1 illustrates a flowchart of a preferred method in accordance with the teachings of the present invention. Handwritten input from the digitizer or other device is received in the form of x, y coordinates and associated penup or pendown states (102). Typically these points are represented by pixels. Where selected, the x, y inputs are scaled (104) so the resulting coordinate values are within prespecified limits. Where selected, the scaled x, y coordinates are smoothed (106) to remove high-frequency noise arising from the digitizing tablet. Smoothing is done by averaging the coordinates (x,y) of a point with its neighbors (weighting the point itself and the nearest points higher), and replacing the (x,y) coordinate values of the point in question with the computed average. Preferably, the smoothing window size is fixed. Smoothing is not done across two separate pendown segments. Upon joining the previous point to the current point, a first vector is formed. A second vector is formed by joining the current point to the next point. The angle between the first vector and the second vector, which represents the angle change at the current point (see FIG. 6 ), is then determined (108). Potential segmentation points are identified as pendown points at local maxima of the angle changes that exceed a preset threshold (110) (see FIG. 7). The preset threshold is selected to maximize true segmentation points and to minimize false segmentation points (e.g., a segmentation point caused by jitter). Then, spurious segmentation points are removed (112). For example, 'hooks', i.e., short pen movements at beginning/end of pen down segments, are removed, and where an angle change at a first PCΪYUS96/05882
segmentation point is less than the angle change at a second segmentation point that lies within a preset distance from the first segmentation point, the first segmentation point is deleted. Then, a sequence of straight line strokes are determined from the useful segmentation points (114). Straight line segments are also introduced between two segmentation points that have a penup between them.
Each straight line stroke is typically described by four parameters: x coordinate of a midpoint of the straight line stroke, y coordinate of the midpoint of the straight line stroke, a length of the straight line stroke, and an angle of the straight line stroke with respect to a predetermined reference.
Referring now to FIGs. 2 and FIG. 3, FIG 2 illustrates segmentation points for handwritten input produced by a preferred embodiment of the present invention after step 110 of FIG. 1. The segmentation points are obtained from a local maximum of the angle change that exceeds a preset threshold and from the beginnings and ends of pen down segments.
Thus, in FIG. 2, segments (201, 202, 204, 206, 208, 210, 212, 214, 216, 218) are determined. Each of the above segments are bounded by two segmentation points.
FIG 3 illustrates segmentation points (302) for handwritten input produced by a preferred embodiment of the present invention after step 112 of FIG. 1. Note that the segmentation point 220 in FIG. 2 is eliminated in FIG. 3 since it represents a spurious segmentation point. That is, segment 201 represents a 'hook', i.e., an artifact, so segmentation point 220 is removed. The rule for determining that the segmentation point 220 is spurious is that segmentation 220 lies less than a prespecified distance from the segmentation point 222, which has a larger angle change when compared with segmentation point 220. FIG. 4 illustrates a sequence (402) of straight line segments (strokes) representing the handwritten input of FIG. 2. The sequence of points between any two consecutive segmentation points shown in FIG. 3 is replaced by a straight line in FIG. 4.
Sometimes a pen is raised from the input device (penup segment) and then is placed at a new position to continue the handwritten input. An example of this is shown in FIG. 5. The dotted line (502) represents the segment where the pen was lifted. At the end preprocessing, the penup stroke is indistinguishable from pendown strokes.
FIG. 6 is an exploded view illustrating a change in angle determination. A first vector (602) joins previous raw point k- 1 (from original input data) to the current point k. A second vector (604) joins the current point k to the next point k+1. The angle a between the two vectors is the angle change at point k. At a beginning point of a pendown segment and also at the end point of a pendown segment, the angle change is set to zero.
FIG. 7 illustrates graphically the angle change values calculated for each point in FIG. 6 plotted against the index k of the original points. The figure shows three local maxima (702, 704, 706) for the angle changes. The maxima 702 and 704 exceed the preset threshold 708 and hence are selected as potential segmentation points. However, maximum 706 is less than the preset threshold 708, and thus is not selected as a segmentation point. The spatial distance between the points represented by 702 and 704 is less than a predetermined distance threshold, and hence only one of the two points represented by the maxima is a useful segmentation point and the other is a spurious segmentation point. Since the maximum 702 is greater than the maximum 704, the point represented by maximum 702 is the useful segmentation point, and the point represented by the maximum 704 is a spurious segmentation point.
Two thresholds must be estimated. The first threshold 708 is used to compare the angle changes at the local maxima of the angle change curve (FIG. 7) in order to select potential segmentation points. The second threshold is used to compare the distance between two segmentation points to separate useful and spurious segmentation points. When the preprocessing method of the present invention is used with a handwriting character recognizer, the exact values for the two thresholds are not critical as long as an error-tolerant character recognizer is used. In performing any experimental tuning of these or any other parameters for creating a specific embodiment of the invention, the desired goal is to attain segmentation which is as consistent as possible across multiple instances of particular character classes. This should be done empirically by examining how the procedure segments vary according to actual samples of the writing to be recognized.
The set of segmentation points in accordance with the present invention defines a set of corresponding straight line strokes. These straight line strokes can then be forwarded to a stroke-based character recognizer for recognition of the handwritten input.
As shown in FIG. 8, the present invention may also be implemented by a device/handwriting recognition microprocessor for, upon receiving handwritten character input as a sequence of (x,y, pen) points where x and y are coordinates in a two dimensional coordinate system and pen is a binary value indicating an associated penup/pendown state, recognizing a handwritten character composed of a plurality of (x,y,pen) values. The device/handwriting recognition microprocessor includes an angle change determiner (806), a potential segmentation point identifier (808), an artifact remover (810), and a stroke determiner (812). Where selected, said device/microprocessor includes at least one of: a sealer (802) and a smoother (804). The sealer (802) is used for scaling the x,y coordinates so the resulting coordinate values are within prespecified limits. The smoother (804) may immediately precede the angle change determiner (806) or may be operably coupled to the sealer (802) (where a sealer is utilized), and is used for smoothing the scaled x, y coordinates to remove high-frequency noise. The angle change determiner (806) is operably coupled to receive the sequence of (x,y,pen) points and is used for determining a change in angle at each (x,y,pen) point. Where selected, the angle change determiner
(806) may be coupled to receive the output of one of: the sealer (802) and the smoother (804). The potential segmentation point identifier (808) is operably coupled to the angle change determiner (806) and is used for identifying potential segmentation points as pendown points whose angle change exceeds a preset threshold. The artifact remover (810) is operably coupled to the potential segmentation point identifier (808) and is used for removing spurious segmentation points to provide useful segmentation points. The stroke determiner (812) is operably coupled to the artifact remover (810) and is used for determining a sequence of straight line strokes to represent the handwritten input from the useful segmentation points.
In another embodiment, the device/handwriting recognition microprocessor may include: A) an angle change determiner (806), operably coupled to receive the (x,y, pen) points, for determining a change in angle at each (x,y,pen) point; and B) a point to stroke delineator (816), operably coupled to the angle change determiner, for determining useful segmentation points and generating a sequence of straight line strokes to represent the handwritten input from the useful segmentation points. Where selected, a value adjuster (814) may be operably coupled immediately prior to the angle change determiner (806) to receive the sequence of (x,y,pen) values, for modifying said values to facilitate processing and for providing the modified values to the angle change determiner (806).
For example, the value adjuster (814) may be selected to include: A) a sealer (802), for scaling the x,y coordinates so the resulting coordinate values are within prespecified limits; and B) a smoother (804), operably coupled to the sealer, for smoothing the scaled x, y coordinates to remove high- frequency noise.
In one implementation, the point to stroke delineator (816) may include: A) a potential segmentation point identifier (808), operably coupled to the angle change determiner (806), for identifying potential segmentation points as pendown points whose angle change exceeds a preset threshold; B) an artifact remover (810), operably coupled to the potential segmentation point identifier (808), for removing spurious segmentation points to provide useful segmentation points; and C) a stroke determiner (812), operably coupled to the artifact remover (810), for determining a sequence of straight line strokes to represent the handwritten input from the useful segmentation points.
Clearly, the method of the present invention may be embodied in a tangible medium of/for a computer. For example, the tangible medium may be a computer diskette or a memory unit of the computer. The present invention and its preferred embodiments relate to novel, more accurate methods of stroke segmentation for characters composed of a preponderance of straight line segments (for example, Chinese characters). In accordance with the present invention, in multiple instances of handwritten input the input is repeatedly divided into a similar set of straight line strokes each time. Such stroke segmentation aids in providing more accurate interpretation by a stroke-based character recognizer.
Those skilled in the art will find many embodiments of the present invention to be useful. One obvious advantage is that printed and cursive styles of writing look more similar to one another and thus makes the recognition task easier. Another embodiment would be the segmentation of scanned, or "off-line" writing, into strokes. A straightforward way of applying the present invention to such a task would be to perform thinning of the writing to obtain a constant-width ink curve. Stroke boundaries could then be set at both angle change maxima points and at intersection points, since the lack of temporal information makes intersections and touching bends look similar.
It will be apparent to those skilled in the art that the disclosed invention may be modified in numerous ways and may assume many embodiments other than the preferred forms particularly set out and described above. Accordingly, it is intended by the appended claims to cover all modifications of the invention that fall within the true spirit and scope of the invention and its equivalents.
What is claimed is:

Claims

1. A method comprising the steps of: sampling points of handwriting input to a handwriting recognizer wherein vectors are formed between the points; determining for each one of the points a change of angle between two of the vectors joining at the one of the points; comparing the change of angle for each one of the points with a preset threshold and identifying the change of angle for each one of the points as potential segmentation points if the change of angle for each one of the points is greater than the preset threshold; removing spurious segmentation points where spurious segmentation points are each of the potential segmentation points having a change of angle greater than the preset threshold but is less than a preset distance from another of the potential segmentation points having a change of angle greater than the preset threshold; and determining a sequence of straight line strokes between the potential segmentation point for each one of the points.
2. A method according to claim 1 wherein the method further includes smoothing the points before determining for each one of the points a change of angle.
3. A method according to claim 2 wherein the step of smoothing the points includes weighting a first point with a point nearest to the first point to compute an average, and replacing the first point with the average.
4. A method according to claim 1 wherein the step of determining for each one of the points a change of angle includes: joining a first point with a previous point by a first vector; and joining the first point with a next point by a second vector, such that the angle between the first vector and second vector is the change of angle.
5. A handwriting recognizer comprising: an angle change determiner wherein for each of a set of points sampled along handwriting input to a handwriting recognizer wherein vectors are formed between each of the set of points, the angle change determiner determines a change of angle between two of the vectors joining at each one of the set of points; a potential segmentation point identifier coupled to the angle change determiner which compares the change of angle for each one of the set of points with a preset threshold and identifies the change of angle for each one of the set of points as potential segmentation points if the change of angle for each one of the set of points is greater than the preset threshold; an artifact remover coupled to the potential segmentation point identifier which removes spurious segmentation points where spurious segmentation points are each of the potential segmentation points having a change of angle greater than the preset threshold but is less than a preset distance from another of the potential segmentation points having a change of angle greater than the preset threshold; and a stroke determiner coupled to the artifact remover which determines a sequence of straight line strokes between the potential segmentation point for each one of the set of points.
6. A handwriting recognizer according to claim 5 wherein the handwriting recognizer further comprises a smoother coupled to the angle change determiner to smooth the set of points before the angle change determiner determines a change i of angle for each one of the set of points.
PCT/US1996/005882 1995-06-05 1996-04-26 Method and microprocessor for preprocessing handwriting having characters composed of a preponderance of straight line segments WO1996039675A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
KR1019970708942A KR100297481B1 (en) 1995-06-05 1996-04-26 Method and microprocessor for preprocessing handwriting having characters composed of a preponderance of straight line segments
AU57884/96A AU5788496A (en) 1995-06-05 1996-04-26 Method and microprocessor for preprocessing handwriting havi ng characters composed of a preponderance of straight line s egments

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US46336695A 1995-06-05 1995-06-05
US08/463,366 1995-06-05

Publications (1)

Publication Number Publication Date
WO1996039675A1 true WO1996039675A1 (en) 1996-12-12

Family

ID=23839842

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1996/005882 WO1996039675A1 (en) 1995-06-05 1996-04-26 Method and microprocessor for preprocessing handwriting having characters composed of a preponderance of straight line segments

Country Status (6)

Country Link
US (1) US5740273A (en)
KR (1) KR100297481B1 (en)
CN (1) CN1121662C (en)
AU (1) AU5788496A (en)
TW (1) TW397951B (en)
WO (1) WO1996039675A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6647145B1 (en) 1997-01-29 2003-11-11 Co-Operwrite Limited Means for inputting characters or commands into a computer
WO2008025019A1 (en) * 2006-08-25 2008-02-28 Dynamic Representation Systems, Llc., Part I Methods and systems for generating a symbol identification challenge for an automated agent
WO2009148873A2 (en) * 2008-05-30 2009-12-10 Dynamic Representation Systems, Llc - Part V Method and system for generating a representation symbol as an identification challenge
CN102135820A (en) * 2011-01-18 2011-07-27 浙江大学 Planarization pre-processing method

Families Citing this family (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7401299B2 (en) 2001-09-05 2008-07-15 Autodesk, Inc. Method and apparatus for providing a presumptive drafting solution
US6229920B1 (en) * 1996-04-25 2001-05-08 Delphi Systemsimulation Gmbh Method of classifying and recognizing patterns by additive contour extraction and averaging
US6044174A (en) * 1996-10-11 2000-03-28 Lucent Technologies Inc. Method and apparatus for parametric representation of handwritten symbols
US6275611B1 (en) * 1996-10-17 2001-08-14 Motorola, Inc. Handwriting recognition device, method and alphabet, with strokes grouped into stroke sub-structures
JP3006545B2 (en) * 1997-06-09 2000-02-07 日本電気株式会社 Online character recognition device
US7006685B2 (en) * 1998-06-29 2006-02-28 Lumeniq, Inc. Method for conducting analysis of two-dimensional images
US6757424B2 (en) 1998-06-29 2004-06-29 Lumeniq, Inc. Method for conducting analysis of two-dimensional images
US20020176619A1 (en) * 1998-06-29 2002-11-28 Love Patrick B. Systems and methods for analyzing two-dimensional images
US6445820B1 (en) * 1998-06-29 2002-09-03 Limbic Systems, Inc. Method for conducting analysis of handwriting
US7091959B1 (en) * 1999-03-31 2006-08-15 Advanced Digital Systems, Inc. System, computer program product, computing device, and associated methods for form identification and information manipulation
US7068829B1 (en) 1999-06-22 2006-06-27 Lumeniq, Inc. Method and apparatus for imaging samples
KR100315428B1 (en) * 1999-12-24 2001-11-26 오길록 Character Feature Extraction Apparatus And Method For Recognition Of Multi-Language Printed Document
US6826551B1 (en) * 2000-05-10 2004-11-30 Advanced Digital Systems, Inc. System, computer software program product, and method for producing a contextual electronic message from an input to a pen-enabled computing system
US7006711B2 (en) * 2000-06-21 2006-02-28 Microsoft Corporation Transform table for ink sizing and compression
CA2423114C (en) 2000-08-25 2009-11-17 Limbic Systems, Inc. Method for conducting analysis of two-dimensional images
US6798907B1 (en) * 2001-01-24 2004-09-28 Advanced Digital Systems, Inc. System, computer software product and method for transmitting and processing handwritten data
US20020107885A1 (en) * 2001-02-01 2002-08-08 Advanced Digital Systems, Inc. System, computer program product, and method for capturing and processing form data
US7346229B2 (en) * 2001-06-27 2008-03-18 Microsoft Corporation Transform table for ink sizing and compression
US20030123745A1 (en) * 2001-12-28 2003-07-03 Mattias Bryborn Method and apparatus for compression and reconstruction of electronic handwriting
US20040109608A1 (en) * 2002-07-12 2004-06-10 Love Patrick B. Systems and methods for analyzing two-dimensional images
AU2002952483A0 (en) * 2002-11-05 2002-11-21 Silverbrook Research Pty Ltd Methods and Systems (NPW009)
AU2003900865A0 (en) * 2003-02-26 2003-03-13 Silverbrook Research Pty Ltd Methods, systems and apparatus (NPW010)
US7116806B2 (en) * 2003-10-23 2006-10-03 Lumeniq, Inc. Systems and methods relating to AFIS recognition, extraction, and 3-D analysis strategies
US7529410B2 (en) 2004-01-07 2009-05-05 Microsoft Corporation Local localization using fast image match
JP4213052B2 (en) * 2004-01-28 2009-01-21 任天堂株式会社 Game system using touch panel input
JP2005346467A (en) * 2004-06-03 2005-12-15 Nintendo Co Ltd Graphic recognition program
US7515752B2 (en) * 2004-08-27 2009-04-07 Corel Corporation Sketch recognition and enhancement
US7639876B2 (en) * 2005-01-14 2009-12-29 Advanced Digital Systems, Inc. System and method for associating handwritten information with one or more objects
US7720286B2 (en) * 2005-05-25 2010-05-18 Advanced Digital Systems, Inc. System and method for associating handwritten information with one or more objects via discontinuous regions of a printed pattern
JP2007272473A (en) * 2006-03-30 2007-10-18 Toshiba Corp Character recognition device, method and program
US8306327B2 (en) * 2008-12-30 2012-11-06 International Business Machines Corporation Adaptive partial character recognition
CN102129560B (en) * 2010-01-18 2013-11-06 富士通株式会社 Method and device for identifying characters
US9285983B2 (en) * 2010-06-14 2016-03-15 Amx Llc Gesture recognition using neural networks
TW201222416A (en) * 2010-11-22 2012-06-01 Acer Inc Method for comparing a handwriting path with a template path
CN103299254B (en) * 2010-12-02 2016-09-14 诺基亚技术有限公司 For the method and apparatus that overlap is hand-written
US9274607B2 (en) 2013-03-15 2016-03-01 Bruno Delean Authenticating a user using hand gesture
CN104951811B (en) * 2015-07-09 2018-05-29 北京盛世宣合信息科技有限公司 Row style line recognition methods and device applied to brush writing
KR102149105B1 (en) * 2019-09-18 2020-08-27 세종대학교산학협력단 Mixed reality based 3D sketching device and method
CN111198562A (en) * 2019-12-31 2020-05-26 武汉中海庭数据技术有限公司 Preprocessing optimization method for space line characteristics of crowdsourcing fragment map

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4653107A (en) * 1983-12-26 1987-03-24 Hitachi, Ltd. On-line recognition method and apparatus for a handwritten pattern
US5038382A (en) * 1989-10-13 1991-08-06 International Business Machines Corporation Multi-scale recognizer for hand drawn strokes

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6079485A (en) * 1983-10-06 1985-05-07 Sharp Corp Handwriting character recognition processing device
US5023918A (en) * 1989-10-03 1991-06-11 International Business Machines Corporation Cross-product filter
US5313527A (en) * 1991-06-07 1994-05-17 Paragraph International Method and apparatus for recognizing cursive writing from sequential input information
US5442715A (en) * 1992-04-06 1995-08-15 Eastman Kodak Company Method and apparatus for cursive script recognition

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4653107A (en) * 1983-12-26 1987-03-24 Hitachi, Ltd. On-line recognition method and apparatus for a handwritten pattern
US5038382A (en) * 1989-10-13 1991-08-06 International Business Machines Corporation Multi-scale recognizer for hand drawn strokes

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6647145B1 (en) 1997-01-29 2003-11-11 Co-Operwrite Limited Means for inputting characters or commands into a computer
WO2008025019A1 (en) * 2006-08-25 2008-02-28 Dynamic Representation Systems, Llc., Part I Methods and systems for generating a symbol identification challenge for an automated agent
WO2009148873A2 (en) * 2008-05-30 2009-12-10 Dynamic Representation Systems, Llc - Part V Method and system for generating a representation symbol as an identification challenge
WO2009148873A3 (en) * 2008-05-30 2010-03-04 Dynamic Representation Systems, Llc - Part V Method and system for generating a representation symbol as an identification challenge
CN102135820A (en) * 2011-01-18 2011-07-27 浙江大学 Planarization pre-processing method

Also Published As

Publication number Publication date
CN1121662C (en) 2003-09-17
TW397951B (en) 2000-07-11
US5740273A (en) 1998-04-14
KR19990022461A (en) 1999-03-25
AU5788496A (en) 1996-12-24
CN1187256A (en) 1998-07-08
KR100297481B1 (en) 2001-08-07

Similar Documents

Publication Publication Date Title
US5740273A (en) Method and microprocessor for preprocessing handwriting having characters composed of a preponderance of straight line segments
EP0708944B1 (en) Method for segmenting handwritten input
AU690781B2 (en) Method of splitting handwritten input
US5737443A (en) Method of joining handwritten input
Dave Segmentation methods for hand written character recognition
EP2038813A2 (en) A method and apparatus for recognition of handwritten symbols
US5818963A (en) Method and system for recognizing a boundary between characters in handwritten text
WO1991017519A1 (en) Row-by-row segmentation and thresholding for optical character recognition
WO1995032485A1 (en) Method of stroke segmentation for handwritten input
KR19980058361A (en) Korean Character Recognition Method and System
Tang et al. Video-based handwritten Chinese character recognition
Tolentino et al. Recognition of air-drawn alphanumeric characters by applying slope orientation sequence matching algorithm using leap motion controller
Bawa et al. A preprocessing technique for recognition of online handwritten Gurmukhi numerals
JPH09102013A (en) Handwriting character/graphic reader
MXPA96000189A (en) Method of segmentation of traces for the entry of characters manuscri
KR100313993B1 (en) Method for recognizing probable stroke by area partition
JP3662967B2 (en) Character loop candidate extraction device and character completion device
JPH0934998A (en) On-line handwritten character recognizing method and device for the same
AU2004214901A1 (en) Line extraction in digital ink
JPH0821064B2 (en) Character writing input device
JPH05189616A (en) On-line character recognizing device

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 96194490.0

Country of ref document: CN

AK Designated states

Kind code of ref document: A1

Designated state(s): AU CA CN JP KR SG

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 1019970708942

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 1996914560

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 1996914560

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: CA

WWP Wipo information: published in national office

Ref document number: 1019970708942

Country of ref document: KR

122 Ep: pct application non-entry in european phase
WWG Wipo information: grant in national office

Ref document number: 1019970708942

Country of ref document: KR