![]() ![]() ![]() It does this is by using an encoding standard.Īn encoding standard is a numbering scheme that assigns each text character in a character set to a numeric value. ![]() Your computer translates the numeric values into visible characters. What appears to you as text on the screen is actually stored as numeric values in the text file. Look up encoding standards that are available in Word What do you want to do?Ĭhoose an encoding standard when you open a fileĬhoose an encoding standard when you save a file When you or someone else opens a text file in Microsoft Word or in another program - perhaps on a computer that has system software in a language that is different from the language that was used to create the file - the encoding standard helps that program determine how to represent the text so that it is readable. However, if you share text files with people who work in other languages, download text files across the Internet, or share text files with other computer systems, you may need to choose an encoding standard when you open or save a file. Prefer native tools since other users of my script won't necessarily have the proper toolset if it's not built-in.Typically, you can share text files without worrying about the underlying details of how the text is stored.I will have already saved the temp txt file, as MacRoman, to disk using the built-in AppleScript routines. However, I cannot see how "sed" can easily convert the text encoding. I am thinking that forcing a text encoding conversion may help to eliminate all non-utf8 characters in the file. However, there are still many elements of the file that need to be cleaned up, characters that appear as garbage if the file is opened as utf-8 (e.g. I have found a tool called, "sed", which allows me to do the text parsing. As such, I want to do my text parsing and conversion using the OS X command line. AppleScript is extremely slow when working with very large text blocks. I will be calling the utility from an AppleScript that I have created. I would like to call a command line utility in Mac OS X 10.8 that gives me the ability to convert a text file saved in standard Western Mac OS Roman encoding to the more generic UTF-8. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. Archives
December 2022
Categories |