/******************************************************************************
* This file is part of The Unicode Tools Of Rexx (TUTOR) *
* See https://rexx.epbcn.com/tutor/ *
* and /~https://github.com/JosepMariaBlasco/TUTOR *
* Copyright © 2023-2025 Josep Maria Blasco <josep.maria.blasco@epbcn.com> *
* License: Apache License 2.0 (https://www.apache.org/licenses/LICENSE-2.0) *
******************************************************************************/
This release, apart from a large number of documentation improvements and some small bug fixes, contains two main improvements:
- A new UTF8 built-in function has been defined. It contains the main part of the UTF-8 decoder, previously found in the
encodings/Encoding.cls
package. The new routine has been generalized so that it can manage strings in UTF-8, UTF-8Z, CESU-8, MUTF-8 and WTF-8 formats. See the code and the UTF8 section of this helpfile for documentation details. - Documentation has been migrated to markdown format. Most classes and routines are fully documented, and some are partially documented. Documentation includes some internal details, which in most cases are labeled as "implementation notes".
There are currently two components of TUTOR which can be used independently of TUTOR, since they have no absolute dependencies on other TUTOR components.
- The Rexx Tokenizer can be used independently of TUTOR, but you will need TUTOR when you use one of the Unicode subclasses.
- The UTF8 routine can be used independently of TUTOR. UTF8 detects whether Unicode.cls has been loaded (by looking for the existence of a .Bytes class that subclasses .String), and returns .Bytes strings or standard ooRexx strings as appropriate.