"Fossies" - the Fresh Open Source Software Archive

Member "pcre-8.42/NEWS" (20 Mar 2018, 29139 Bytes) of package /linux/misc/pcre-8.42.tar.bz2:


As a special service "Fossies" has tried to format the requested text file into HTML format (style: standard) with prefixed line numbers. Alternatively you can here view or download the uninterpreted source code file. See also the latest Fossies "Diffs" side-by-side code changes report for "NEWS": 8.41_vs_8.42.

    1 News about PCRE releases
    2 ------------------------
    3 
    4 Release 8.42 20-March-2018
    5 --------------------------
    6 
    7 This is a bug-fix release.
    8 
    9 
   10 Release 8.41 13-June-2017
   11 -------------------------
   12 
   13 This is a bug-fix release.
   14 
   15 
   16 Release 8.40 11-January-2017
   17 ----------------------------
   18 
   19 This is a bug-fix release.
   20 
   21 
   22 Release 8.39 14-June-2016
   23 -------------------------
   24 
   25 Some appropriate PCRE2 JIT improvements have been retro-fitted to PCRE1. Apart
   26 from that, this is another bug-fix release. Note that this library (now called
   27 PCRE1) is now being maintained for bug fixes only. New projects are advised to
   28 use the new PCRE2 libraries.
   29 
   30 
   31 Release 8.38 23-November-2015
   32 -----------------------------
   33 
   34 This is bug-fix release. Note that this library (now called PCRE1) is now being
   35 maintained for bug fixes only. New projects are advised to use the new PCRE2
   36 libraries.
   37 
   38 
   39 Release 8.37 28-April-2015
   40 --------------------------
   41 
   42 This is bug-fix release. Note that this library (now called PCRE1) is now being
   43 maintained for bug fixes only. New projects are advised to use the new PCRE2
   44 libraries.
   45 
   46 
   47 Release 8.36 26-September-2014
   48 ------------------------------
   49 
   50 This is primarily a bug-fix release. However, in addition, the Unicode data
   51 tables have been updated to Unicode 7.0.0.
   52 
   53 
   54 Release 8.35 04-April-2014
   55 --------------------------
   56 
   57 There have been performance improvements for classes containing non-ASCII
   58 characters and the "auto-possessification" feature has been extended. Other
   59 minor improvements have been implemented and bugs fixed. There is a new callout
   60 feature to enable applications to do detailed stack checks at compile time, to
   61 avoid running out of stack for deeply nested parentheses. The JIT compiler has
   62 been extended with experimental support for ARM-64, MIPS-64, and PPC-LE.
   63 
   64 
   65 Release 8.34 15-December-2013
   66 -----------------------------
   67 
   68 As well as fixing the inevitable bugs, performance has been improved by
   69 refactoring and extending the amount of "auto-possessification" that PCRE does.
   70 Other notable changes:
   71 
   72 .  Implemented PCRE_INFO_MATCH_EMPTY, which yields 1 if the pattern can match
   73    an empty string. If it can, pcretest shows this in its information output.
   74 
   75 .  A back reference to a named subpattern when there is more than one of the
   76    same name now checks them in the order in which they appear in the pattern.
   77    The first one that is set is used for the reference. Previously only the
   78    first one was inspected. This change makes PCRE more compatible with Perl.
   79 
   80 .  Unicode character properties were updated from Unicode 6.3.0.
   81 
   82 .  The character VT has been added to the set of characters that match \s and
   83    are generally treated as white space, following this same change in Perl
   84    5.18. There is now no difference between "Perl space" and "POSIX space".
   85 
   86 .  Perl has changed its handling of \8 and \9. If there is no previously
   87    encountered capturing group of those numbers, they are treated as the
   88    literal characters 8 and 9 instead of a binary zero followed by the
   89    literals. PCRE now does the same.
   90 
   91 .  Following Perl, added \o{} to specify codepoints in octal, making it
   92    possible to specify values greater than 0777 and also making them
   93    unambiguous.
   94 
   95 .  In UCP mode, \s was not matching two of the characters that Perl matches,
   96    namely NEL (U+0085) and MONGOLIAN VOWEL SEPARATOR (U+180E), though they
   97    were matched by \h.
   98 
   99 .  Add JIT support for the 64 bit TileGX architecture.
  100 
  101 .  Upgraded the handling of the POSIX classes [:graph:], [:print:], and
  102    [:punct:] when PCRE_UCP is set so as to include the same characters as Perl
  103    does in Unicode mode.
  104 
  105 .  Perl no longer allows group names to start with digits, so I have made this
  106    change also in PCRE.
  107 
  108 .  Added support for [[:<:]] and [[:>:]] as used in the BSD POSIX library to
  109    mean "start of word" and "end of word", respectively, as a transition aid.
  110 
  111 
  112 Release 8.33 28-May-2013
  113 --------------------------
  114 
  115 A number of bugs are fixed, and some performance improvements have been made.
  116 There are also some new features, of which these are the most important:
  117 
  118 .  The behaviour of the backtracking verbs has been rationalized and
  119    documented in more detail.
  120 
  121 .  JIT now supports callouts and all of the backtracking verbs.
  122 
  123 .  Unicode validation has been updated in the light of Unicode Corrigendum #9,
  124    which points out that "non characters" are not "characters that may not
  125    appear in Unicode strings" but rather "characters that are reserved for
  126    internal use and have only local meaning".
  127 
  128 .  (*LIMIT_MATCH=d) and (*LIMIT_RECURSION=d) have been added so that the
  129    creator of a pattern can specify lower (but not higher) limits for the
  130    matching process.
  131 
  132 .  The PCRE_NEVER_UTF option is available to prevent pattern-writers from using
  133    the (*UTF) feature, as this could be a security issue.
  134 
  135 
  136 Release 8.32 30-November-2012
  137 -----------------------------
  138 
  139 This release fixes a number of bugs, but also has some new features. These are
  140 the highlights:
  141 
  142 .  There is now support for 32-bit character strings and UTF-32. Like the
  143    16-bit support, this is done by compiling a separate 32-bit library.
  144 
  145 .  \X now matches a Unicode extended grapheme cluster.
  146 
  147 .  Case-independent matching of Unicode characters that have more than one
  148    "other case" now makes all three (or more) characters equivalent. This
  149    applies, for example, to Greek Sigma, which has two lowercase versions.
  150 
  151 .  Unicode character properties are updated to Unicode 6.2.0.
  152 
  153 .  The EBCDIC support, which had decayed, has had a spring clean.
  154 
  155 .  A number of JIT optimizations have been added, which give faster JIT
  156    execution speed. In addition, a new direct interface to JIT execution is
  157    available. This bypasses some of the sanity checks of pcre_exec() to give a
  158    noticeable speed-up.
  159 
  160 .  A number of issues in pcregrep have been fixed, making it more compatible
  161    with GNU grep. In particular, --exclude and --include (and variants) apply
  162    to all files now, not just those obtained from scanning a directory
  163    recursively. In Windows environments, the default action for directories is
  164    now "skip" instead of "read" (which provokes an error).
  165 
  166 .  If the --only-matching (-o) option in pcregrep is specified multiple
  167    times, each one causes appropriate output. For example, -o1 -o2 outputs the
  168    substrings matched by the 1st and 2nd capturing parentheses. A separating
  169    string can be specified by --om-separator (default empty).
  170 
  171 .  When PCRE is built via Autotools using a version of gcc that has the
  172    "visibility" feature, it is used to hide internal library functions that are
  173    not part of the public API.
  174 
  175 
  176 Release 8.31 06-July-2012
  177 -------------------------
  178 
  179 This is mainly a bug-fixing release, with a small number of developments:
  180 
  181 . The JIT compiler now supports partial matching and the (*MARK) and
  182   (*COMMIT) verbs.
  183 
  184 . PCRE_INFO_MAXLOOKBEHIND can be used to find the longest lookbehind in a
  185   pattern.
  186 
  187 . There should be a performance improvement when using the heap instead of the
  188   stack for recursion.
  189 
  190 . pcregrep can now be linked with libedit as an alternative to libreadline.
  191 
  192 . pcregrep now has a --file-list option where the list of files to scan is
  193   given as a file.
  194 
  195 . pcregrep now recognizes binary files and there are related options.
  196 
  197 . The Unicode tables have been updated to 6.1.0.
  198 
  199 As always, the full list of changes is in the ChangeLog file.
  200 
  201 
  202 Release 8.30 04-February-2012
  203 -----------------------------
  204 
  205 Release 8.30 introduces a major new feature: support for 16-bit character
  206 strings, compiled as a separate library. There are a few changes to the
  207 8-bit library, in addition to some bug fixes.
  208 
  209 . The pcre_info() function, which has been obsolete for over 10 years, has
  210   been removed.
  211 
  212 . When a compiled pattern was saved to a file and later reloaded on a host
  213   with different endianness, PCRE used automatically to swap the bytes in some
  214   of the data fields. With the advent of the 16-bit library, where more of this
  215   swapping is needed, it is no longer done automatically. Instead, the bad
  216   endianness is detected and a specific error is given. The user can then call
  217   a new function called pcre_pattern_to_host_byte_order() (or an equivalent
  218   16-bit function) to do the swap.
  219 
  220 . In UTF-8 mode, the values 0xd800 to 0xdfff are not legal Unicode
  221   code points and are now faulted. (They are the so-called "surrogates"
  222   that are reserved for coding high values in UTF-16.)
  223 
  224 
  225 Release 8.21 12-Dec-2011
  226 ------------------------
  227 
  228 This is almost entirely a bug-fix release. The only new feature is the ability
  229 to obtain the size of the memory used by the JIT compiler.
  230 
  231 
  232 Release 8.20 21-Oct-2011
  233 ------------------------
  234 
  235 The main change in this release is the inclusion of Zoltan Herczeg's
  236 just-in-time compiler support, which can be accessed by building PCRE with
  237 --enable-jit. Large performance benefits can be had in many situations. 8.20
  238 also fixes an unfortunate bug that was introduced in 8.13 as well as tidying up
  239 a number of infelicities and differences from Perl.
  240 
  241 
  242 Release 8.13 16-Aug-2011
  243 ------------------------
  244 
  245 This is mainly a bug-fix release. There has been a lot of internal refactoring.
  246 The Unicode tables have been updated. The only new feature in the library is
  247 the passing of *MARK information to callouts. Some additions have been made to
  248 pcretest to make testing easier and more comprehensive. There is a new option
  249 for pcregrep to adjust its internal buffer size.
  250 
  251 
  252 Release 8.12 15-Jan-2011
  253 ------------------------
  254 
  255 This release fixes some bugs in pcregrep, one of which caused the tests to fail
  256 on 64-bit big-endian systems. There are no changes to the code of the library.
  257 
  258 
  259 Release 8.11 10-Dec-2010
  260 ------------------------
  261 
  262 A number of bugs in the library and in pcregrep have been fixed. As always, see
  263 ChangeLog for details. The following are the non-bug-fix changes:
  264 
  265 . Added --match-limit and --recursion-limit to pcregrep.
  266 
  267 . Added an optional parentheses number to the -o and --only-matching options
  268   of pcregrep.
  269 
  270 . Changed the way PCRE_PARTIAL_HARD affects the matching of $, \z, \Z, \b, and
  271   \B.
  272 
  273 . Added PCRE_ERROR_SHORTUTF8 to make it possible to distinguish between a
  274   bad UTF-8 sequence and one that is incomplete when using PCRE_PARTIAL_HARD.
  275 
  276 . Recognize (*NO_START_OPT) at the start of a pattern to set the PCRE_NO_
  277   START_OPTIMIZE option, which is now allowed at compile time
  278 
  279 
  280 Release 8.10 25-Jun-2010
  281 ------------------------
  282 
  283 There are two major additions: support for (*MARK) and friends, and the option
  284 PCRE_UCP, which changes the behaviour of \b, \d, \s, and \w (and their
  285 opposites) so that they make use of Unicode properties. There are also a number
  286 of lesser new features, and several bugs have been fixed. A new option,
  287 --line-buffered, has been added to pcregrep, for use when it is connected to
  288 pipes.
  289 
  290 
  291 Release 8.02 19-Mar-2010
  292 ------------------------
  293 
  294 Another bug-fix release.
  295 
  296 
  297 Release 8.01 19-Jan-2010
  298 ------------------------
  299 
  300 This is a bug-fix release. Several bugs in the code itself and some bugs and
  301 infelicities in the build system have been fixed.
  302 
  303 
  304 Release 8.00 19-Oct-09
  305 ----------------------
  306 
  307 Bugs have been fixed in the library and in pcregrep. There are also some
  308 enhancements. Restrictions on patterns used for partial matching have been
  309 removed, extra information is given for partial matches, the partial matching
  310 process has been improved, and an option to make a partial match override a
  311 full match is available. The "study" process has been enhanced by finding a
  312 lower bound matching length. Groups with duplicate numbers may now have
  313 duplicated names without the use of PCRE_DUPNAMES. However, they may not have
  314 different names. The documentation has been revised to reflect these changes.
  315 The version number has been expanded to 3 digits as it is clear that the rate
  316 of change is not slowing down.
  317 
  318 
  319 Release 7.9 11-Apr-09
  320 ---------------------
  321 
  322 Mostly bugfixes and tidies with just a couple of minor functional additions.
  323 
  324 
  325 Release 7.8 05-Sep-08
  326 ---------------------
  327 
  328 More bug fixes, plus a performance improvement in Unicode character property
  329 lookup.
  330 
  331 
  332 Release 7.7 07-May-08
  333 ---------------------
  334 
  335 This is once again mainly a bug-fix release, but there are a couple of new
  336 features.
  337 
  338 
  339 Release 7.6 28-Jan-08
  340 ---------------------
  341 
  342 The main reason for having this release so soon after 7.5 is because it fixes a
  343 potential buffer overflow problem in pcre_compile() when run in UTF-8 mode. In
  344 addition, the CMake configuration files have been brought up to date.
  345 
  346 
  347 Release 7.5 10-Jan-08
  348 ---------------------
  349 
  350 This is mainly a bug-fix release. However the ability to link pcregrep with
  351 libz or libbz2 and the ability to link pcretest with libreadline have been
  352 added. Also the --line-offsets and --file-offsets options were added to
  353 pcregrep.
  354 
  355 
  356 Release 7.4 21-Sep-07
  357 ---------------------
  358 
  359 The only change of specification is the addition of options to control whether
  360 \R matches any Unicode line ending (the default) or just CR, LF, and CRLF.
  361 Otherwise, the changes are bug fixes and a refactoring to reduce the number of
  362 relocations needed in a shared library. There have also been some documentation
  363 updates, in particular, some more information about using CMake to build PCRE
  364 has been added to the NON-UNIX-USE file.
  365 
  366 
  367 Release 7.3 28-Aug-07
  368 ---------------------
  369 
  370 Most changes are bug fixes. Some that are not:
  371 
  372 1. There is some support for Perl 5.10's experimental "backtracking control
  373    verbs" such as (*PRUNE).
  374 
  375 2. UTF-8 checking is now as per RFC 3629 instead of RFC 2279; this is more
  376    restrictive in the strings it accepts.
  377 
  378 3. Checking for potential integer overflow has been made more dynamic, and as a
  379    consequence there is no longer a hard limit on the size of a subpattern that
  380    has a limited repeat count.
  381 
  382 4. When CRLF is a valid line-ending sequence, pcre_exec() and pcre_dfa_exec()
  383    no longer advance by two characters instead of one when an unanchored match
  384    fails at CRLF if there are explicit CR or LF matches within the pattern.
  385    This gets rid of some anomalous effects that previously occurred.
  386 
  387 5. Some PCRE-specific settings for varying the newline options at the start of
  388    a pattern have been added.
  389 
  390 
  391 Release 7.2 19-Jun-07
  392 ---------------------
  393 
  394 WARNING: saved patterns that were compiled by earlier versions of PCRE must be
  395 recompiled for use with 7.2 (necessitated by the addition of \K, \h, \H, \v,
  396 and \V).
  397 
  398 Correction to the notes for 7.1: the note about shared libraries for Windows is
  399 wrong. Previously, three libraries were built, but each could function
  400 independently. For example, the pcreposix library also included all the
  401 functions from the basic pcre library. The change is that the three libraries
  402 are no longer independent. They are like the Unix libraries. To use the
  403 pcreposix functions, for example, you need to link with both the pcreposix and
  404 the basic pcre library.
  405 
  406 Some more features from Perl 5.10 have been added:
  407 
  408   (?-n) and (?+n) relative references for recursion and subroutines.
  409 
  410   (?(-n) and (?(+n) relative references as conditions.
  411 
  412   \k{name} and \g{name} are synonyms for \k<name>.
  413 
  414   \K to reset the start of the matched string; for example, (foo)\Kbar
  415   matches bar preceded by foo, but only sets bar as the matched string.
  416 
  417   (?| introduces a group where the capturing parentheses in each alternative
  418   start from the same number; for example, (?|(abc)|(xyz)) sets capturing
  419   parentheses number 1 in both cases.
  420 
  421   \h, \H, \v, \V match horizontal and vertical whitespace, respectively.
  422 
  423 
  424 Release 7.1 24-Apr-07
  425 ---------------------
  426 
  427 There is only one new feature in this release: a linebreak setting of
  428 PCRE_NEWLINE_ANYCRLF. It is a cut-down version of PCRE_NEWLINE_ANY, which
  429 recognizes only CRLF, CR, and LF as linebreaks.
  430 
  431 A few bugs are fixed (see ChangeLog for details), but the major change is a
  432 complete re-implementation of the build system. This now has full Autotools
  433 support and so is now "standard" in some sense. It should help with compiling
  434 PCRE in a wide variety of environments.
  435 
  436 NOTE: when building shared libraries for Windows, three dlls are now built,
  437 called libpcre, libpcreposix, and libpcrecpp. Previously, everything was
  438 included in a single dll.
  439 
  440 Another important change is that the dftables auxiliary program is no longer
  441 compiled and run at "make" time by default. Instead, a default set of character
  442 tables (assuming ASCII coding) is used. If you want to use dftables to generate
  443 the character tables as previously, add --enable-rebuild-chartables to the
  444 "configure" command. You must do this if you are compiling PCRE to run on a
  445 system that uses EBCDIC code.
  446 
  447 There is a discussion about character tables in the README file. The default is
  448 not to use dftables so that that there is no problem when cross-compiling.
  449 
  450 
  451 Release 7.0 19-Dec-06
  452 ---------------------
  453 
  454 This release has a new major number because there have been some internal
  455 upheavals to facilitate the addition of new optimizations and other facilities,
  456 and to make subsequent maintenance and extension easier. Compilation is likely
  457 to be a bit slower, but there should be no major effect on runtime performance.
  458 Previously compiled patterns are NOT upwards compatible with this release. If
  459 you have saved compiled patterns from a previous release, you will have to
  460 re-compile them. Important changes that are visible to users are:
  461 
  462 1. The Unicode property tables have been updated to Unicode 5.0.0, which adds
  463    some more scripts.
  464 
  465 2. The option PCRE_NEWLINE_ANY causes PCRE to recognize any Unicode newline
  466    sequence as a newline.
  467 
  468 3. The \R escape matches a single Unicode newline sequence as a single unit.
  469 
  470 4. New features that will appear in Perl 5.10 are now in PCRE. These include
  471    alternative Perl syntax for named parentheses, and Perl syntax for
  472    recursion.
  473 
  474 5. The C++ wrapper interface has been extended by the addition of a
  475    QuoteMeta function and the ability to allow copy construction and
  476    assignment.
  477 
  478 For a complete list of changes, see the ChangeLog file.
  479 
  480 
  481 Release 6.7 04-Jul-06
  482 ---------------------
  483 
  484 The main additions to this release are the ability to use the same name for
  485 multiple sets of parentheses, and support for CRLF line endings in both the
  486 library and pcregrep (and in pcretest for testing).
  487 
  488 Thanks to Ian Taylor, the stack usage for many kinds of pattern has been
  489 significantly reduced for certain subject strings.
  490 
  491 
  492 Release 6.5 01-Feb-06
  493 ---------------------
  494 
  495 Important changes in this release:
  496 
  497 1. A number of new features have been added to pcregrep.
  498 
  499 2. The Unicode property tables have been updated to Unicode 4.1.0, and the
  500    supported properties have been extended with script names such as "Arabic",
  501    and the derived properties "Any" and "L&". This has necessitated a change to
  502    the interal format of compiled patterns. Any saved compiled patterns that
  503    use \p or \P must be recompiled.
  504 
  505 3. The specification of recursion in patterns has been changed so that all
  506    recursive subpatterns are automatically treated as atomic groups. Thus, for
  507    example, (?R) is treated as if it were (?>(?R)). This is necessary because
  508    otherwise there are situations where recursion does not work.
  509 
  510 See the ChangeLog for a complete list of changes, which include a number of bug
  511 fixes and tidies.
  512 
  513 
  514 Release 6.0 07-Jun-05
  515 ---------------------
  516 
  517 The release number has been increased to 6.0 because of the addition of several
  518 major new pieces of functionality.
  519 
  520 A new function, pcre_dfa_exec(), which implements pattern matching using a DFA
  521 algorithm, has been added. This has a number of advantages for certain cases,
  522 though it does run more slowly, and lacks the ability to capture substrings. On
  523 the other hand, it does find all matches, not just the first, and it works
  524 better for partial matching. The pcrematching man page discusses the
  525 differences.
  526 
  527 The pcretest program has been enhanced so that it can make use of the new
  528 pcre_dfa_exec() matching function and the extra features it provides.
  529 
  530 The distribution now includes a C++ wrapper library. This is built
  531 automatically if a C++ compiler is found. The pcrecpp man page discusses this
  532 interface.
  533 
  534 The code itself has been re-organized into many more files, one for each
  535 function, so it no longer requires everything to be linked in when static
  536 linkage is used. As a consequence, some internal functions have had to have
  537 their names exposed. These functions all have names starting with _pcre_. They
  538 are undocumented, and are not intended for use by outside callers.
  539 
  540 The pcregrep program has been enhanced with new functionality such as
  541 multiline-matching and options for output more matching context. See the
  542 ChangeLog for a complete list of changes to the library and the utility
  543 programs.
  544 
  545 
  546 Release 5.0 13-Sep-04
  547 ---------------------
  548 
  549 The licence under which PCRE is released has been changed to the more
  550 conventional "BSD" licence.
  551 
  552 In the code, some bugs have been fixed, and there are also some major changes
  553 in this release (which is why I've increased the number to 5.0). Some changes
  554 are internal rearrangements, and some provide a number of new facilities. The
  555 new features are:
  556 
  557 1. There's an "automatic callout" feature that inserts callouts before every
  558    item in the regex, and there's a new callout field that gives the position
  559    in the pattern - useful for debugging and tracing.
  560 
  561 2. The extra_data structure can now be used to pass in a set of character
  562    tables at exec time. This is useful if compiled regex are saved and re-used
  563    at a later time when the tables may not be at the same address. If the
  564    default internal tables are used, the pointer saved with the compiled
  565    pattern is now set to NULL, which means that you don't need to do anything
  566    special unless you are using custom tables.
  567 
  568 3. It is possible, with some restrictions on the content of the regex, to
  569    request "partial" matching. A special return code is given if all of the
  570    subject string matched part of the regex. This could be useful for testing
  571    an input field as it is being typed.
  572 
  573 4. There is now some optional support for Unicode character properties, which
  574    means that the patterns items such as \p{Lu} and \X can now be used. Only
  575    the general category properties are supported. If PCRE is compiled with this
  576    support, an additional 90K data structure is include, which increases the
  577    size of the library dramatically.
  578 
  579 5. There is support for saving compiled patterns and re-using them later.
  580 
  581 6. There is support for running regular expressions that were compiled on a
  582    different host with the opposite endianness.
  583 
  584 7. The pcretest program has been extended to accommodate the new features.
  585 
  586 The main internal rearrangement is that sequences of literal characters are no
  587 longer handled as strings. Instead, each character is handled on its own. This
  588 makes some UTF-8 handling easier, and makes the support of partial matching
  589 possible. Compiled patterns containing long literal strings will be larger as a
  590 result of this change; I hope that performance will not be much affected.
  591 
  592 
  593 Release 4.5 01-Dec-03
  594 ---------------------
  595 
  596 Again mainly a bug-fix and tidying release, with only a couple of new features:
  597 
  598 1. It's possible now to compile PCRE so that it does not use recursive
  599 function calls when matching. Instead it gets memory from the heap. This slows
  600 things down, but may be necessary on systems with limited stacks.
  601 
  602 2. UTF-8 string checking has been tightened to reject overlong sequences and to
  603 check that a starting offset points to the start of a character. Failure of the
  604 latter returns a new error code: PCRE_ERROR_BADUTF8_OFFSET.
  605 
  606 3. PCRE can now be compiled for systems that use EBCDIC code.
  607 
  608 
  609 Release 4.4 21-Aug-03
  610 ---------------------
  611 
  612 This is mainly a bug-fix and tidying release. The only new feature is that PCRE
  613 checks UTF-8 strings for validity by default. There is an option to suppress
  614 this, just in case anybody wants that teeny extra bit of performance.
  615 
  616 
  617 Releases 4.1 - 4.3
  618 ------------------
  619 
  620 Sorry, I forgot about updating the NEWS file for these releases. Please take a
  621 look at ChangeLog.
  622 
  623 
  624 Release 4.0 17-Feb-03
  625 ---------------------
  626 
  627 There have been a lot of changes for the 4.0 release, adding additional
  628 functionality and mending bugs. Below is a list of the highlights of the new
  629 functionality. For full details of these features, please consult the
  630 documentation. For a complete list of changes, see the ChangeLog file.
  631 
  632 1. Support for Perl's \Q...\E escapes.
  633 
  634 2. "Possessive quantifiers" ?+, *+, ++, and {,}+ which come from Sun's Java
  635 package. They provide some syntactic sugar for simple cases of "atomic
  636 grouping".
  637 
  638 3. Support for the \G assertion. It is true when the current matching position
  639 is at the start point of the match.
  640 
  641 4. A new feature that provides some of the functionality that Perl provides
  642 with (?{...}). The facility is termed a "callout". The way it is done in PCRE
  643 is for the caller to provide an optional function, by setting pcre_callout to
  644 its entry point. To get the function called, the regex must include (?C) at
  645 appropriate points.
  646 
  647 5. Support for recursive calls to individual subpatterns. This makes it really
  648 easy to get totally confused.
  649 
  650 6. Support for named subpatterns. The Python syntax (?P<name>...) is used to
  651 name a group.
  652 
  653 7. Several extensions to UTF-8 support; it is now fairly complete. There is an
  654 option for pcregrep to make it operate in UTF-8 mode.
  655 
  656 8. The single man page has been split into a number of separate man pages.
  657 These also give rise to individual HTML pages which are put in a separate
  658 directory. There is an index.html page that lists them all. Some hyperlinking
  659 between the pages has been installed.
  660 
  661 
  662 Release 3.5 15-Aug-01
  663 ---------------------
  664 
  665 1. The configuring system has been upgraded to use later versions of autoconf
  666 and libtool. By default it builds both a shared and a static library if the OS
  667 supports it. You can use --disable-shared or --disable-static on the configure
  668 command if you want only one of them.
  669 
  670 2. The pcretest utility is now installed along with pcregrep because it is
  671 useful for users (to test regexs) and by doing this, it automatically gets
  672 relinked by libtool. The documentation has been turned into a man page, so
  673 there are now .1, .txt, and .html versions in /doc.
  674 
  675 3. Upgrades to pcregrep:
  676    (i)   Added long-form option names like gnu grep.
  677    (ii)  Added --help to list all options with an explanatory phrase.
  678    (iii) Added -r, --recursive to recurse into sub-directories.
  679    (iv)  Added -f, --file to read patterns from a file.
  680 
  681 4. Added --enable-newline-is-cr and --enable-newline-is-lf to the configure
  682 script, to force use of CR or LF instead of \n in the source. On non-Unix
  683 systems, the value can be set in config.h.
  684 
  685 5. The limit of 200 on non-capturing parentheses is a _nesting_ limit, not an
  686 absolute limit. Changed the text of the error message to make this clear, and
  687 likewise updated the man page.
  688 
  689 6. The limit of 99 on the number of capturing subpatterns has been removed.
  690 The new limit is 65535, which I hope will not be a "real" limit.
  691 
  692 
  693 Release 3.3 01-Aug-00
  694 ---------------------
  695 
  696 There is some support for UTF-8 character strings. This is incomplete and
  697 experimental. The documentation describes what is and what is not implemented.
  698 Otherwise, this is just a bug-fixing release.
  699 
  700 
  701 Release 3.0 01-Feb-00
  702 ---------------------
  703 
  704 1. A "configure" script is now used to configure PCRE for Unix systems. It
  705 builds a Makefile, a config.h file, and the pcre-config script.
  706 
  707 2. PCRE is built as a shared library by default.
  708 
  709 3. There is support for POSIX classes such as [:alpha:].
  710 
  711 5. There is an experimental recursion feature.
  712 
  713 ----------------------------------------------------------------------------
  714           IMPORTANT FOR THOSE UPGRADING FROM VERSIONS BEFORE 2.00
  715 
  716 Please note that there has been a change in the API such that a larger
  717 ovector is required at matching time, to provide some additional workspace.
  718 The new man page has details. This change was necessary in order to support
  719 some of the new functionality in Perl 5.005.
  720 
  721           IMPORTANT FOR THOSE UPGRADING FROM VERSION 2.00
  722 
  723 Another (I hope this is the last!) change has been made to the API for the
  724 pcre_compile() function. An additional argument has been added to make it
  725 possible to pass over a pointer to character tables built in the current
  726 locale by pcre_maketables(). To use the default tables, this new argument
  727 should be passed as NULL.
  728 
  729           IMPORTANT FOR THOSE UPGRADING FROM VERSION 2.05
  730 
  731 Yet another (and again I hope this really is the last) change has been made
  732 to the API for the pcre_exec() function. An additional argument has been
  733 added to make it possible to start the match other than at the start of the
  734 subject string. This is important if there are lookbehinds. The new man
  735 page has the details, but you just want to convert existing programs, all
  736 you need to do is to stick in a new fifth argument to pcre_exec(), with a
  737 value of zero. For example, change
  738 
  739   pcre_exec(pattern, extra, subject, length, options, ovec, ovecsize)
  740 to
  741   pcre_exec(pattern, extra, subject, length, 0, options, ovec, ovecsize)
  742 
  743 ****