All Packages  Class Hierarchy  This Package  Previous  Next  Index

Class COM.Lexical.LatinMap

java.lang.Object
    |
    +----COM.Lexical.LatinMap

public abstract class LatinMap
extends java.lang.Object
Utility class for mapping ISO Latin-1 (8859-1) accented letters to corresponding unaccented ASCII. Maps all of \\u00C0-\\u00FF to ASCII, excepting the following (which are left unchanged):

CharHexUnicode name
Ð00D0LATIN CAPITAL LETTER ETH
×00D7MULTIPLICATION SIGN
Þ00DELATIN CAPITAL LETTER THORN
ß00DFLATIN SMALL LETTER SHARP S = ess-zed
ð00F0LATIN SMALL LETTER ETH
÷00F7DIVISION SIGN
þ00FELATIN SMALL LETTER THORN

The AE ligatures, 'Æ' and 'æ', are replaced with 'E' and 'e', respectively.

See http://www.unicode.org/ for more information on Unicode (which subsumes ISO Latin-1).


Method Index

 o main(String[])
Allows use as a program.
 o replaceLatin1(char[])
Replace Latin-1 letters in array with ASCII letters.
 o replaceLatin1(String)
Replace Latin-1 letters in String with ASCII letters.

Method Detail

 o replaceLatin1
public static final java.lang.String replaceLatin1(java.lang.String string)
          Replace Latin-1 letters in String with ASCII letters. Returns the original string whenever no substitution is necessary.
 o replaceLatin1
public static final void replaceLatin1(char array[])
          Replace Latin-1 letters in array with ASCII letters.
 o main
public static void main(java.lang.String args[])
          Allows use as a program. Run `java COM.Lexical.LatinMap -?` for usage details.

All Packages  Class Hierarchy  This Package  Previous  Next  Index