Charset module

The charset module contains the list of all the encoding schemes supported by LocalSolver and the methods required to query that list.

A charset is not useful on its own, but it can be used in conjunction with the I/O module to read or write files with the appropriate encoding.

Complete list of supported encodings

The following table lists all the encodings supported by the modeler of LocalSolver with their common aliases and the languages for which the encoding is usually employed.

Encoding name Aliases Languages
ascii us-ascii English
cp437 pc-8, ibm437 English
cp1250 windows-1250 Central and Eastern Europe
cp1251 windows-1251 Bulgarian, Serbian and Macedonian
cp1252 windows-1252 Western Europe
cp1253 windows-1253 Modern greek
cp1254 windows-1254 Turkish
cp1255 windows-1255 Hebrew
cp1256 windows-1256 Arabic
cp1257 windows-1257 Estonian, Latvian and Lithuanian
cp1258 windows-1258 Vietnamese
iso-8859-1 latin, latin1 Western Europe
iso-8859-2 latin2 Central and Eastern Europe
iso-8859-3 latin3 Esperanto, Maltese, Turkish
iso-8859-4 latin4 Estonian, Latvian, Lithuanian, Sami
iso-8859-5 cyrillic Bulgarian, Serbian, Russian
iso-8859-6 arabic Arabic
iso-8859-7 greek Greek
iso-8859-8 hebrew Hebrew
iso-8859-9 latin5 Turkish
iso-8859-10 latin6 Nordic languages
iso-8859-11 thai Thai
iso-8859-13 latin7 Baltic languages
iso-8859-14 latin8 Celtic languages
iso-8859-15 latin9 Western Europe
iso-8859-16 latin10 South-eastern Europe
utf-8   All languages
utf-16   All languages
utf-16be   All languages
utf-16le   All languages
utf-32be   All languages
utf-32le   All languages

Note

To use the features of this module, you have to put a special import statement at the begining of your LSP file: use charset;

Constants

This module instantiate 2 encodings by default: utf-8 and iso-8859-1. You can use them directly through these built-in variables without using the charset.lookup() method.

charset.ISO_8859_1

Latin-1 encoding. This encoding is the most widely used for western europe. This is also the default encoding of LocalSolver.

charset.UTF_8

UTF-8 encoding. This encoding should be the default one for your new projects with LocalSolver. The parser of LocalSolver will use it automatically if a Byte-Order-Mark (BOM) is detected at the begining of your files.

Functions

charset.default()

Returns the default encoding used by LocalSolver. For the current version of LocalSolver, this method returns charset.ISO_8859_1.

charset.lookup(name)

Returns the encoding corresponding to the given name. If no encoding exists with the given name, an exception is thrown. You can use the name of the encoding or one of its aliases. The lookup is not case sensitive.