Text files produced by Microsoft programs such as Word frequently end up containing characters with values above 0x7F from Microsoft's Winlatin 1 (Codepage 1252) and Winlatin 2 (Codepage 1250) encodings, even when the user intends to produce "plain text". These do not show up properly in contexts in which plain ASCII is expected. This script replaces the non-ASCII punctuation characters from these two codepages with ASCII equivalents and removes any other non-ASCII characters, guaranteeing that the result is pure ASCII.
Note: this program was formerly called "fix1250".
Language | Python |
Environment | OS Independent |
License | GPL version 2 |
Current version | 1.1 |
Last modified | 2007-01-28 |