"Fossies" - the Fresh Open Source Software Archive

Member "recode-3.7.12/NEWS" (17 Feb 2022, 17409 Bytes) of package /linux/misc/recode-3.7.12.tar.gz:


As a special service "Fossies" has tried to format the requested text file into HTML format (style: standard) with prefixed line numbers. Alternatively you can here view or download the uninterpreted source code file. See also the latest Fossies "Diffs" side-by-side code changes report for "NEWS": 3.7.11_vs_3.7.12.

    1 ==================================
    2 Recode NEWS - User visible changes
    3 ==================================
    4 
    5 .. contents::
    6 .. sectnum::
    7 
    8 :Copyright: © 1993-2022 Free Software Foundation, Inc.
    9 
   10 Version 3.7.12
   11 ==============
   12 
   13 :Author: Reuben Thomas, 2022-02.
   14 
   15 + Simplify support for ignoring invalid input with iconv, integrated with
   16   the --strict and --force mechanisms.
   17 + Various documentation improvements.
   18 
   19 
   20 Version 3.7.11
   21 ==============
   22 
   23 :Author: Reuben Thomas, 2022-01.
   24 
   25 + Re-add support for transliteration with iconv (removed in 3.7).
   26 + Add support for ignoring invalid input with iconv.
   27 + Fix a bug introduced in 3.7.10 that prevented building the code.
   28 
   29 
   30 Version 3.7.10
   31 ==============
   32 
   33 :Author: Reuben Thomas, 2022-01.
   34 
   35 + In recode program, only use iconv if needed; add --prefer-iconv option to
   36   allow its use in all cases.
   37 + Remove support for pre-3.5 request syntax (colon as charset separator).
   38 + PO files have been updated from the Translation Project.
   39 + Build system fixes and improvements.
   40 + Update gnulib to fix a problem building on Android.
   41 + Fix confusingly wrong NEWS entry for 3.7.4.
   42 
   43 
   44 Version 3.7.9
   45 =============
   46 
   47 :Author: Reuben Thomas, 2021-06.
   48 
   49 + A fix to the CP1252 encoding: U+017E LATIN SMALL LETTER Z WITH CARON is at
   50   byte 0x9e, not byte 0x8f.
   51 + Minor documentation fixes.
   52 
   53 
   54 Version 3.7.8
   55 =============
   56 
   57 :Author: Reuben Thomas, 2020-09.
   58 
   59 + Improvements to the build system.
   60 + Fix man page generation, and document that help2man must be built with
   61   gettext.
   62 + Updates to nl, pt, sv translations (thanks, translators!).
   63 
   64 
   65 Version 3.7.7
   66 =============
   67 
   68 :Author: Reuben Thomas, 2020-07.
   69 
   70 + Improvements to the build system.
   71 + Updates to nl, pt, sv translations (thanks, translators!).
   72 
   73 
   74 Version 3.7.6
   75 =============
   76 
   77 :Author: Reuben Thomas, 2019-09.
   78 
   79 + Improvements to the build system.
   80 
   81 
   82 Version 3.7.5
   83 =============
   84 
   85 :Author: Reuben Thomas, 2019-09.
   86 
   87 + Port tests to Python 3.
   88 
   89 
   90 Version 3.7.4
   91 =============
   92 
   93 :Author: Reuben Thomas, 2019-09.
   94 
   95 + Fixes to file handling in recode program.
   96 + Fix tests on Windows.
   97 
   98 
   99 Version 3.7.3
  100 =============
  101 
  102 :Author: Reuben Thomas, 2019-08.
  103 
  104 + No code changes to recode itself; this release features a properly
  105 versioned shared library.
  106 
  107 
  108 Version 3.7.2
  109 =============
  110 
  111 :Author: Reuben Thomas, 2019-08.
  112 
  113 + No code changes to recode itself; this release includes updates to license
  114 headers to guide users to the GPL online, corrects the version of
  115 COPYING-LIB shipped with the sources, and updates the message files for
  116 various languages.
  117 
  118 
  119 Version 3.7.1
  120 =============
  121 
  122 :Author: Reuben Thomas, 2018-09.
  123 
  124 + No code changes to recode itself; this release just updates the version of
  125 gnulib to fix a bug in glibc 2.28: (GitHub issue #11
  126 https://github.com/rrthomas/recode/issues/11
  127 
  128 
  129 Version 3.7
  130 ===========
  131 
  132 :Author: François Pinard, 2008-03; Reuben Thomas, 2018-01.
  133 
  134 + Converters for BibTeX (from Vincent Danjean) and the ANSEL and ISO 5426
  135   character sets (from Wolfram Schneider) have been added.
  136 + The conversion strategies (whether to use pipes, memory or files) are
  137   no longer available.  Now it is reasonable to assume virtual memory, so
  138   files and memory have similar performance characteristics (in
  139   particular, the memory method is not limited by physical memory.)
  140   Further, tests showed that even for runs on little data, the pipes
  141   method has minimal performance impact (none was measured).  This is
  142   not a surprise, as for one-step recodings, the commonest case, no
  143   forking is needed.  The command-line options -i, -p and
  144   --sequence=STRATEGY are ignored for backwards compatibility.
  145 + Recode does not include libiconv anymore, but uses an external iconv
  146   library if one was available at installation time.  The -x: option
  147   to the program, or a new flag to the library recode_new_outer
  148   function, inhibits the initialisation and usage of iconv.
  149 + The experimental ``tree`` surface is removed.  Structured data
  150   needs a proper parser, and that doesn't fit the framework of Recode.
  151 + Many bug fixes.
  152 + Long ago, I renamed GNU recode to Free recode: the permission for
  153   using the GNU prefix mandated a level of obedience to the FSF that
  154   once went overboard, in my opinion.  After that change, I realized
  155   that some people read Free as a four letter word!  To be peaceful,
  156   this version changes the name again, to merely Recode.  recode
  157   (no capital) still names the executable program specifically, or
  158   the distribution archive itself.
  159 + make check accepts a LIMIT= option, for limiting tests to one or a
  160   few cases.  See tests/Makefile.am.
  161 + PO files have been updated from the Translation Project.
  162 + The test system has been overhauled.  Tests now run much faster, and
  163   require Python and Cython.
  164 + Overhauled build system, now using gnulib for portability.  This reduces
  165   the amount of code in the Recode tree considerably.
  166 
  167 
  168 Version 3.6
  169 ===========
  170 
  171 :Author: François Pinard, Bruno Haible, 2001-01.
  172 
  173 General changes
  174 ---------------
  175 
  176 + The recode manual is now indexed, by charset, by concept, etc.
  177 + Program messages are also available in Greek, Gallicean and Italian.
  178 + Bruno Haible's nice portable iconv library has been integrated.
  179 + RFC 1345 tables and French character names have been updated.
  180 + The Texinfo charset has been refreshed, and made reversible.
  181 
  182 New charsets
  183 ------------
  184 
  185 (most from libiconv)
  186 
  187 + Japanese
  188 
  189   + EUC-JP (csEUCPkdFmtJapanese, EUC_JP,
  190     Extended_UNIX_Code_Packed_Format_for_Japanese);
  191   + ISO-2022-JP (csISO2022JP); ISO-2022-JP-1; ISO-2022-JP-2 (csISO2022JP2);
  192   + JIS_C6220-1969-ro (csISO14JISC6220ro, ISO646-JP, iso-ir-14, jp);
  193   + JIS_X0201 (csHalfWidthKatakana, JIS0201, JISX0201-1976, JISX0201.1976-0,
  194     X0201);
  195   + JIS_X0208 (csISO87JISX0208, ISO-IR-87, JIS0208, JIS_X0208.1983-0,
  196     JIS_X0208.1983-1, JIS_X0208-1990-0, JIS_X0208.1983-1, X0208);
  197   + JIS_X0212 (csISO159JISX02121990, ISO-IR-159, JIS0212, JIS_X0212.1990-0,
  198     JIS_X0212-1990, X0212);
  199   + SJIS (csShiftJIS, MS_KANJI, SHIFT-JIS).
  200 
  201 + Chinese
  202 
  203   + BIG5 (BIG-5, BIG-FIVE, BIGFIVE, CN-BIG5 csBig5); BIG5HKSCS;
  204   + EUC-CN (CN-GB, csGB2312, EUC_CN, GB2312); EUC-TW (csEUCTW, EUC_TW);
  205   + GB18030; HZ (HZ-GB-2312); ISO-2022-CN (csISO2022CN); ISO-2022-CN-EXT;
  206   + GB_1988-80 (cn, csISO57GB1988, ISO646-CN, iso-ir-57);
  207   + GB_2312-80 (CHINESE, csISO58GB231280, GB2312.1980-0, ISO-IR-58);
  208   + ISO-IR-165 (CN-GB-ISOIR165).
  209 
  210 + Korean
  211 
  212   + JOHAB (CP1361); EUC-KR (csEUCKR, EUC_KR); GBK (CP936);
  213   + ISO-2022-KR (csISO2022KR);
  214   + KSC_5601 (CP949, csKSC56011987, ISO-IR-149, KOREAN, KSC5601.1987-0,
  215     KS_C_5601-1987, KS_C_5601-1989, KSX1001:1992).
  216 
  217 + Vietnamese (independently of libiconv)
  218 
  219   + TCVN; VIQR; VISCII; VNI; VPS.
  220 
  221 + Other languages
  222 
  223   + ARMSCII-8; Georgian-Academy; Georgian-PS; WINDOWS-874 (CP874);
  224   + MuleLao-1; CP1133 (IBM-CP1133); CP1258 (WINDOWS-1258);
  225   + TIS-620 (ISO-IR-166, TIS620, TIS620.2529-1, TIS620-0, TIS620.2533-0,
  226     TIS620.2533-1).
  227 
  228 + Apple specifics
  229 
  230   + MacArabic; MacCentralEurope; MacCroatian; MacCyrillic; MacGreek;
  231   + MacHebrew; MacIceland; MacRomania; MacThai; MacTurkish; MacUkraine
  232 
  233 + Unicode
  234 
  235   + JAVA; UCS-2-INTERNAL; UCS-2LE (UnicodeLITTLE); UCS-2-SWAPPED; UCS-4BE;
  236   + UCS-4-INTERNAL; UCS-4LE; UCS-4-SWAPPED; UTF-16BE; UTF-16LE.
  237 
  238 + Others
  239 
  240   + CP932; CP949 (UHC); CP950; CP866 (866, csIBM866, IBM866).
  241   + ISO-8859-16 (ISO-IR-226, ISO_8859-16:2000).
  242 
  243 + Recode internal
  244 
  245   + :libiconv: (:)   [so option -x: avoids going through libiconv]
  246 
  247 New aliases
  248 -----------
  249 
  250 (from libiconv) [list to be revised]
  251 
  252 + csASCII (for ANSI_X3.4-1968); csHPRoman8 (for hp-roman8);
  253 + csISOLatin1 (for ISO-8859-1); csISOLatin2 (for ISO-8859-2);
  254 + csISOLatin3 (for ISO-8859-3); csISOLatin4 (for ISO-8859-4);
  255 + csISOLatin5 (for ISO-8859-9);
  256 + csISOLatin6 and ISO_8859-10:1992 (for ISO-8859-10);
  257 + csISOLatinArabic (for ISO-8859-6); csISOLatinCyrillic (for ISO-8859-5);
  258 + csISOLatinGreek (for ISO-8859-7); csISOLatinHebrew (for ISO-8859-8);
  259 + csKOI8R (for KOI8-R); csPC850Multilingual (for IBM850);
  260 + csUCS4 (for ISO-10646-UCS-4);
  261 + csUnicode, csUnicode11, UCS-2BE, UnicodeBIG (for ISO-10646-UCS-2);
  262 + csUnicode11UTF7 (for UNICODE-1-1-UTF-7);
  263 + csVISCII and VISCII1.1-1 (for VISCII);
  264 + ISO-IR-179 (for ISO-8859-13); csMacintosh and MacRoman (for macintosh);
  265 + TCVN5712-1, TCVN5712-1:1993 and TCVN-5712 (for TCVN).
  266 
  267 New surfaces
  268 ------------
  269 
  270 + tree (experimental).
  271 
  272 Version 3.5
  273 ===========
  274 
  275 :Author: François Pinard, 1999-05.
  276 
  277 Incompatible changes
  278 --------------------
  279 
  280 + A double dot ``..`` should now be used instead of a colon ``:``.
  281 + Option --force (-f) is needed to pursue recoding despite errors.
  282 + There is no more quoting for special characters within charsets names.
  283 + Auto check (``-a``) and popen (``-o``) options have been withdrawn.
  284 + Some charsets and aliases were deleted, see `Charsets & aliases`_ below.
  285 
  286 Extended features
  287 -----------------
  288 
  289 + Program messages are available in localised form for many languages.
  290 + Long character names are available in French, if LANGUAGE is set to ``fr``.
  291 + A new request syntax allows for recode chaining, and for surfaces.
  292 + Option --header-file (-h) accepts a language parameter, and Perl is new.
  293 + Full charset listings now show the UCS-2 value for characters.
  294 + Option --known=PAIRS (-k) also accepts octal and hexadecimal numbers.
  295 + Option --list (-l) better sorts charsets and aliases, also fully written.
  296 + Charset ``RFC1345`` implements mnemonic+ascii+38, and is now reversible.
  297 + HTML is not limited anymore to Latin-1, HTML 4.0 entities are supported.
  298 
  299 New features
  300 ------------
  301 
  302 + Euro support.
  303 + Updated RFC 1345 set of tables, from Keld Simonsen.
  304 + Some African charsets and transliterated forms.
  305 + Conversions for ISO 10646 and Unicode.
  306 + Combining or explosion of UCS-2 diacriticized characters and ligatures.
  307 + Implementation of surfaces, see `Surfaces & aliases`_ below.
  308 + Mixed mode for recoding only comments and strings in C sources or PO files.
  309 + A stand-alone recoding library gets installed, often as a shared library.
  310 + Option --find-subsets (-T) lists charsets which are subsets of another.
  311 + The library may generate testing data, and study character frequencies.
  312 
  313 Charsets & aliases
  314 ------------------
  315 
  316 + New ISO 10646 and Unicode charsets
  317 
  318   + combined-UCS-2: pseudo-charset.
  319   + count-characters: pseudo-charset.
  320   + dump-with-names: pseudo-charset.
  321   + ISO-10646-UCS-2 (UNICODE-1-1, BMP, rune, u2).
  322   + ISO-10646-UCS-4 (10646, ISO-10646, UCS-4, u4).
  323   + UNICODE-1-1-UTF-7 (TF-7, u7).
  324   + UTF-8 (UTF-2, UTF-FSS, FSS_UTF, TF-8, u8).
  325   + UTF-16 (Unicode, TF-16, u6).
  326 
  327 + RFC 1345.bis matters
  328 
  329   + Deleted charsets
  330 
  331     + dk-us, us-dk (because of &duplicate which Recode does not handle yet).
  332 
  333   + New charsets
  334 
  335     + baltic (alias is iso-ir-179); CP1250 (1250, ms-ee, windows-1250);
  336     + CP1251 (1251, ms-cyrl, windows-1251);
  337     + CP1252 (1252, ms-ansi, windows-1252);
  338     + CP1253 (1253, ms-greek, windows-1253);
  339     + CP1254 (1254, ms-turk, windows-1254);
  340     + CP1255 (1255, ms-hebr, windows-1255);
  341     + CP1256 (1256, ms-arab, windows-1256);
  342     + CP1257 (1257, WinBaltRim, windows-1257);
  343     + CWI (CWI-2, cp-hu); EBCDIC-IS-FRISS (friss);
  344     + GOST_19768-87 with aliases of previous GOST_19768-74;
  345     + IBM256 (256, CP256, EBCDIC-INT1); IBM875 (875, CP875, EBCDIC-Greek);
  346     + IBM1004 (1004, CP1004, os2latin1); IBM1047 (1047, CP1047);
  347     + ISO-8859-13 (ISO_8859-13:1998, iso-baltic, iso-ir-179a, l7, latin7);
  348     + ISO-8859-14 (ISO_8859-14:1998, iso-celtic, iso-ir-199, l8, latin8);
  349     + ISO-8859-15 (ISO_8859-15:1998, iso-ir-203, l9, latin9);
  350     + KOI-7; KOI-8 (GOST_19768-74); KOI8-R; KOI8-RU; KOI8-U;
  351     + macintosh_ce (macce); mac-is;
  352     + NeXTSTEP (next) yet previous Recode had it outside RFC 1345.
  353 
  354   + Alias promoted to charset (with previous charset becoming alias)
  355 
  356     + ISO-646.basic (with ISO-646.basic:1983); ISO-646.irv (ISO-646.irv:1983);
  357     + ISO_5427-ext (ISO_5427:1981); ISO_5428 (ISO_5428:1980);
  358     + ISO-8859-1 (ISO_8859-1:1987); ISO-8859-2 (ISO_8859-2:1987);
  359     + ISO-8859-3 (ISO_8859-3:1988); ISO-8859-4 (ISO_8859-4:1988);
  360     + ISO-8859-5 (ISO_8859-5:1988); ISO-8859-6 (ISO_8859-6:1987);
  361     + ISO-8859-7 (ISO_8859-7:1987); ISO-8859-8 (ISO_8859-8:1988);
  362     + ISO-8859-9 (ISO_8859-9:1989); ISO-8859-10 (latin6);
  363     + NC_NC00-10 (NC_NC00-10:81); sami (latin-lap).
  364 
  365   + New aliases
  366 
  367     + 037 (for charset IBM037); 038 (IBM038); 273 (IBM273); 274 (IBM274);
  368     + 275 (IBM275); 278 (IBM278); 280 (IBM280); 281 (IBM281); 284 (IBM284);
  369     + 285 (IBM285); 290 (IBM290); 297 (IBM297); 367 (ANSI_X3.4-1968);
  370     + 420 (IBM420); 423 (IBM423); 424 (IBM424); 500, 500V1 (IBM500);
  371     + 819 (ISO-8859-1); 864 (IBM864); 868 (IBM868); 870 (IBM870);
  372     + 871 (IBM871); 880 (IBM880); 891 (IBM891); 903 (IBM903); 905 (IBM905);
  373     + 912, CP912, IBM912 (ISO-8859-2); 918 (IBM918); 1026 (IBM1026);
  374     + ECMA-113, ECMA-113:1986 (ECMA-Cyrillic); GOST_19768-74 (KOI8);
  375     + ISO_8859-N (ISO-8859-N) for N = 1 through 10 and 13 through 15;
  376     + ISO_8859-10:1993 (ISO-8869-10); iso-ir-170 (INVARIANT);
  377     + KOI8_L2 (CSN_369103); pclatin2, pcl2 (IBM852); SS636127 (SEN_850200_B).
  378 
  379 + New African charsets
  380 
  381   + AFRL1-101-BPI_OCIL (t-francais, t-fra);
  382   + AFRFUL-102-BPI_OCIL (bambara, bra, ewondo, fulfulde);
  383   + AFRFUL-103-BPI_OCIL (t-bambara, t-bra, t-ewondo, t-fulfulde);
  384   + AFRLIN-104-BPI_OCIL (lingala, lin, sango, wolof);
  385   + AFRLIN-105-BPI_OCIL (t-lingala, t-lin, t-sango, t-wolof).
  386 
  387 + Extra miscellaneous charsets
  388 
  389   + KEYBCS2 (Kamenicky); CORK (T1); KOI-8_CS2.
  390 
  391 + New HTML pseudo-charsets
  392 
  393   + HTML_1.1 (h1);  HTML_2.0 (RFC 1866, 1866, h2); HTML-i18n (RFC 2070);
  394   + HTML_3.2 (h3) reimplemented; HTML_4.0 (h4, HTML, h);
  395   + deleted aliases HTF, 8859, ISO 8859, Entities, SGML, WWW, w3.
  396 
  397 Surfaces & aliases
  398 ------------------
  399 
  400 + Base64 (64, b64); Quoted-Printable (qp, Quote-Printable);
  401 + 21-Permutation (swabytes); 4321-Permutation; CR; CR-LF (cl);
  402 + Decimal-1 (d, d1); Decimal-2 (d2), Decimal-4 (d4);
  403 + Hexadecimal-1 (x, x1); Hexadecimal-2 (x2); Hexadecimal-4 (x4);
  404 + Octal-1 (o, o1); Octal-2 (o2); Octal-4 (o4).
  405 + data; test7; test8; test15; test16.
  406 
  407 Version 3.4
  408 ===========
  409 
  410 :Author: François Pinard, 1994-11.
  411 
  412 + Charset HTML is new, it handles ``&...;`` sequences for Latin-1.
  413 + Charset AtariST handling is more general, --list may be used with it.
  414 + Charset ASCII-BS overstriking has been extended, mainly for German.
  415 + Charset RFC1345 may be a goal, to debug or study RFC 1345 short names.
  416 + Charset names have been revised.  Note that nextstep is now NeXT.
  417 + Option --force (-f) is accepted, but does not yet protect reversibility.
  418 + Option --quiet or --silent (-q) silences irreversible recoding messages.
  419 + Option --known=PAIRS (-k) helps searching through recodings.
  420 + Option --sequence=pipe (-p) does not fall back on -o anymore.
  421 + Option --auto-check may narrow its study around one particular charset.
  422 + An MSDOS port is available, check ftp.iro.umontreal.ca in pub/gnuish.
  423 + Compilation should now succeed on OS/2 EMX.  Thanks to Kai Uwe Rommel.
  424 + Program initialization is almost three times faster on average.
  425 + Corrected reported bugs, added small improvements, some aesthetic.
  426 
  427 Version 3.3
  428 ===========
  429 
  430 :Author: François Pinard, 1993-12.
  431 
  432 + Charsets atarist, ebcdic-ccc, ebcdic-ibm and nextstep have been added.
  433 + Also, most RFC 1345 charsets and aliases are handled.  That's a bunch!
  434 + Old ascii disappears because of RFC 1345's ascii, use ascii-bs instead.
  435 + Old maci disappears because of RFC 1345's macintosh, use applemac instead.
  436 + Charsets cccascii and cdcascii disappear, use ebcdic-ccc and ebcdic instead.
  437 + Recoding between latin1, ibmpc and applemac is (almost) reversible.
  438 + The texinfo documentation has been reorganized, this to be continued.
  439 + Long options are accepted, charset names may be abbreviated.
  440 + Option --list (-l) displays charsets, aliases and contents in many formats.
  441 + Option --strict (-s) asks for stricter, non-reversible recodings.
  442 + Option --graphics (-g) approximates ibmpc rulers with ASCII graphics.
  443 + Option --header (-h) produces C source for many recoding tables.
  444 + Option --auto-check (-a) reports about all possible recodings.
  445 + Option --ignore (-x) prevents a charset from being selected.
  446 + Execution has been sped up through step merging, hashing for charset names.
  447 + Many various buglets have been eradicated, portability increased.
  448 + Charsets may be edited out by modifying the Makefile only.
  449 + Configuration is made through the use of an external config.h file.
  450 + New -d ``diacritics_only`` option for LaTeX.
  451 + A few bugs have been corrected.
  452 + Documentation reorganization and improvements.
  453 + Increased portability, now uses Autoconf.
  454 + A few bugs solved.
  455 
  456 Version 3.2
  457 ===========
  458 
  459 :Author: François Pinard, 1991-10.
  460 
  461 + MSDOS port redone.
  462 + New check goal at installation time.
  463 + Add -v option for verbose processing, remove old -q.
  464 + Add -i, -o and -p for letting the user control the strategy.
  465 + A few bugs corrected.
  466 + Embedded NULs should now be transmitted.
  467 
  468 Version 3.1
  469 ===========
  470 
  471 :Author: François Pinard, 1990-03.
  472 
  473 + Rename -V to -C for showing Copyright.
  474 + Calling sequence changed, said files now recoded on themselves.
  475 + Add -t option for touching files.
  476 + Better on-line help.
  477 + Add -q option for quiet processing.
  478 + Executable file now considerably smaller, also speedier.
  479 + A few bugs corrected.
  480 
  481 Version 3.0
  482 ===========
  483 
  484 :Author: François Pinard, 1989-10.
  485 
  486 + New Text to Latin1 processing, should be faster.
  487 + A few bugs corrected.
  488 
  489 For prior history down to 1980, see at the end of the ChangeLog.