diff options
author | thegeorg <thegeorg@yandex-team.ru> | 2022-06-17 10:09:37 +0300 |
---|---|---|
committer | thegeorg <thegeorg@yandex-team.ru> | 2022-06-17 10:09:37 +0300 |
commit | e91c5c4c2b4265bec65f9042c60ba56c60fa4e3f (patch) | |
tree | 770a3d0552d1d642ce48908f3af7de2e474c48c8 | |
parent | 216b853064f62638fa6625c77e1112dda0415c06 (diff) | |
download | ydb-e91c5c4c2b4265bec65f9042c60ba56c60fa4e3f.tar.gz |
Fix reimport of contrib/libs/utf8proc
ref:597c236ee2b5815e783abc9f828c5001cbed79e6
-rw-r--r-- | contrib/libs/utf8proc/.yandex_meta/devtools.licenses.report | 31 | ||||
-rw-r--r-- | contrib/libs/utf8proc/.yandex_meta/licenses.list.txt | 109 | ||||
-rw-r--r-- | contrib/libs/utf8proc/CMakeLists.txt | 6 | ||||
-rw-r--r-- | contrib/libs/utf8proc/NEWS.md | 316 | ||||
-rw-r--r-- | contrib/libs/utf8proc/README.md | 69 |
5 files changed, 525 insertions, 6 deletions
diff --git a/contrib/libs/utf8proc/.yandex_meta/devtools.licenses.report b/contrib/libs/utf8proc/.yandex_meta/devtools.licenses.report index f45aebbb75..80d2041e00 100644 --- a/contrib/libs/utf8proc/.yandex_meta/devtools.licenses.report +++ b/contrib/libs/utf8proc/.yandex_meta/devtools.licenses.report @@ -74,6 +74,18 @@ BELONGS ya.make Files with this license: LICENSE.md [59:93] +KEEP MIT 74ab01fc06feca8221855c04a783e6ad +BELONGS ya.make + License text: + - Changed license from BSD to MIT style. + Scancode info: + Original SPDX id: MIT + Score : 95.00 + Match type : REFERENCE + Links : http://opensource.org/licenses/mit-license.php, https://spdx.org/licenses/MIT + Files with this license: + NEWS.md [211:211] + KEEP MIT b9f647ef7e29973cc8b999af88006590 BELONGS ya.make License text: @@ -98,3 +110,22 @@ BELONGS ya.make Links : https://github.com/nexB/scancode-toolkit/tree/develop/src/licensedcode/data/licenses/free-unknown.LICENSE Files with this license: utf8proc.h [28:28] + +KEEP MIT AND Unicode f1f18913b6e9a1006771a56a06d41a82 +BELONGS ya.make +FILE_INCLUDE LICENSE.md found in files: README.md at line 31 + Note: matched license text is too long. Read it in the source files. + Scancode info: + Original SPDX id: MIT + Score : 100.00 + Match type : NOTICE + Links : http://opensource.org/licenses/mit-license.php, https://spdx.org/licenses/MIT + Files with this license: + README.md [26:31] + Scancode info: + Original SPDX id: LicenseRef-scancode-unicode + Score : 100.00 + Match type : NOTICE + Links : http://unicode.org/, http://unicode.org/copyright.html, https://github.com/nexB/scancode-toolkit/tree/develop/src/licensedcode/data/licenses/unicode.LICENSE + Files with this license: + README.md [26:31] diff --git a/contrib/libs/utf8proc/.yandex_meta/licenses.list.txt b/contrib/libs/utf8proc/.yandex_meta/licenses.list.txt index f2277aac7e..4ffd995ebc 100644 --- a/contrib/libs/utf8proc/.yandex_meta/licenses.list.txt +++ b/contrib/libs/utf8proc/.yandex_meta/licenses.list.txt @@ -16,6 +16,102 @@ under the Terms of Use in http://www.unicode.org/copyright.html.* *Copyright (c) 2009, 2013 Public Software Group e. V., Berlin, Germany* +====================File: LICENSE.md==================== +## utf8proc license ## + +**utf8proc** is a software package originally developed +by Jan Behrens and the rest of the Public Software Group, who +deserve nearly all of the credit for this library, that is now maintained by the Julia-language developers. Like the original utf8proc, +whose copyright and license statements are reproduced below, all new +work on the utf8proc library is licensed under the [MIT "expat" +license](http://opensource.org/licenses/MIT): + +*Copyright © 2014-2015 by Steven G. Johnson, Jiahao Chen, Tony Kelman, Jonas Fonseca, and other contributors listed in the git history.* + +Permission is hereby granted, free of charge, to any person obtaining a +copy of this software and associated documentation files (the "Software"), +to deal in the Software without restriction, including without limitation +the rights to use, copy, modify, merge, publish, distribute, sublicense, +and/or sell copies of the Software, and to permit persons to whom the +Software is furnished to do so, subject to the following conditions: + +The above copyright notice and this permission notice shall be included in +all copies or substantial portions of the Software. + +THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR +IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, +FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE +AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER +LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING +FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER +DEALINGS IN THE SOFTWARE. + +## Original utf8proc license ## + +*Copyright (c) 2009, 2013 Public Software Group e. V., Berlin, Germany* + +Permission is hereby granted, free of charge, to any person obtaining a +copy of this software and associated documentation files (the "Software"), +to deal in the Software without restriction, including without limitation +the rights to use, copy, modify, merge, publish, distribute, sublicense, +and/or sell copies of the Software, and to permit persons to whom the +Software is furnished to do so, subject to the following conditions: + +The above copyright notice and this permission notice shall be included in +all copies or substantial portions of the Software. + +THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR +IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, +FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE +AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER +LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING +FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER +DEALINGS IN THE SOFTWARE. + +## Unicode data license ## + +This software distribution contains derived data from a modified version of +the Unicode data files. The following license applies to that data: + +**COPYRIGHT AND PERMISSION NOTICE** + +*Copyright (c) 1991-2007 Unicode, Inc. All rights reserved. Distributed +under the Terms of Use in http://www.unicode.org/copyright.html.* + +Permission is hereby granted, free of charge, to any person obtaining a +copy of the Unicode data files and any associated documentation (the "Data +Files") or Unicode software and any associated documentation (the +"Software") to deal in the Data Files or Software without restriction, +including without limitation the rights to use, copy, modify, merge, +publish, distribute, and/or sell copies of the Data Files or Software, and +to permit persons to whom the Data Files or Software are furnished to do +so, provided that (a) the above copyright notice(s) and this permission +notice appear with all copies of the Data Files or Software, (b) both the +above copyright notice(s) and this permission notice appear in associated +documentation, and (c) there is clear notice in each modified Data File or +in the Software as well as in the documentation associated with the Data +File(s) or Software that the data or software has been modified. + +THE DATA FILES AND SOFTWARE ARE PROVIDED "AS IS", WITHOUT WARRANTY OF ANY +KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF +MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT OF +THIRD PARTY RIGHTS. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR HOLDERS +INCLUDED IN THIS NOTICE BE LIABLE FOR ANY CLAIM, OR ANY SPECIAL INDIRECT OR +CONSEQUENTIAL DAMAGES, OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF +USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER +TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR +PERFORMANCE OF THE DATA FILES OR SOFTWARE. + +Except as contained in this notice, the name of a copyright holder shall +not be used in advertising or otherwise to promote the sale, use or other +dealings in these Data Files or Software without prior written +authorization of the copyright holder. + +Unicode and the Unicode logo are trademarks of Unicode, Inc., and may be +registered in some jurisdictions. All other trademarks and registered +trademarks mentioned herein are the property of their respective owners. + + ====================MIT==================== * Permission is hereby granted, free of charge, to any person obtaining a * copy of this software and associated documentation files (the "Software"), @@ -61,6 +157,10 @@ under the Terms of Use in http://www.unicode.org/copyright.html.* ====================MIT==================== +- Changed license from BSD to MIT style. + + +====================MIT==================== Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation @@ -85,6 +185,15 @@ work on the utf8proc library is licensed under the [MIT "expat" license](http://opensource.org/licenses/MIT): +====================MIT AND Unicode==================== +The utf8proc package is licensed under the +free/open-source [MIT "expat" +license](http://opensource.org/licenses/MIT) (plus certain Unicode +data governed by the similarly permissive [Unicode data +license](http://www.unicode.org/copyright.html#Exhibit1)); please see +the included `LICENSE.md` file for more detailed information. + + ====================Unicode==================== *Copyright (c) 1991-2007 Unicode, Inc. All rights reserved. Distributed under the Terms of Use in http://www.unicode.org/copyright.html.* diff --git a/contrib/libs/utf8proc/CMakeLists.txt b/contrib/libs/utf8proc/CMakeLists.txt index 11ff6a416f..b7b86cf259 100644 --- a/contrib/libs/utf8proc/CMakeLists.txt +++ b/contrib/libs/utf8proc/CMakeLists.txt @@ -11,12 +11,6 @@ add_library(contrib-libs-utf8proc) target_compile_options(contrib-libs-utf8proc PUBLIC -DUTF8PROC_STATIC ) -target_compile_options(contrib-libs-utf8proc PRIVATE - -std=c99 -) -target_link_libraries(contrib-libs-utf8proc PUBLIC - contrib-libs-cxxsupp -) target_sources(contrib-libs-utf8proc PRIVATE ${CMAKE_SOURCE_DIR}/contrib/libs/utf8proc/utf8proc.c ) diff --git a/contrib/libs/utf8proc/NEWS.md b/contrib/libs/utf8proc/NEWS.md new file mode 100644 index 0000000000..6e6d1ab0bb --- /dev/null +++ b/contrib/libs/utf8proc/NEWS.md @@ -0,0 +1,316 @@ +# utf8proc release history # + +## Version 2.1.1 ## + +2018-04-27 + +- Fixed composition bug ([#128]). + +- Minor build fixes ([#94], [#99], [#113], [#125]). + +## Version 2.1 ## + +2016-12-26: + +- New functions `utf8proc_map_custom` and `utf8proc_decompose_custom` + to allow user-supplied transformations of codepoints, in conjunction + with other transformations ([#89]). + +- New function `utf8proc_normalize_utf32` to apply normalizations + directly to UTF-32 data (not just UTF-8) ([#88]). + +- Fixed stack overflow that could occur due to incorrect definition + of `UINT16_MAX` with some compilers ([#84]). + +- Fixed conflict with `stdbool.h` in Visual Studio ([#90]). + +- Updated font metrics to use Unifont 9.0.04. + +## Version 2.0.2 ## + +2016-07-27: + +- Move `-Wmissing-prototypes` warning flag from `Makefile` to `.travis.yml` + since MSVC does not understand this flag and it is occasionally useful to + build using MSVC through the `Makefile` ([#79]). + +- Use a different variable name for a nested loop in `bench/bench.c`, and + declare it in a C89 way rather than inside the `for` to avoid "error: + 'for' loop initial declarations are only allowed in C99 mode" ([#80]). + +## Version 2.0.1 ## + +2016-07-13: + +- Bug fix in `utf8proc_grapheme_break_stateful` ([#77]). + +- Tests now use versioned Unicode files, so they will no longer + break when a new version of Unicode is released ([#78]). + +## Version 2.0 ## + +2016-07-13: + +- Updated for Unicode 9.0 ([#70]). + +- New `utf8proc_grapheme_break_stateful` to handle the complicated + grapheme-breaking rules in Unicode 9. The old `utf8proc_grapheme_break` + is still provided, but may incorrectly identify grapheme breaks + in some Unicode-9 sequences. + +- Smaller Unicode tables ([#62], [#68]). This required changes + in the `utf8proc_property_t` structure, which breaks backward + compatibility if you access this `struct` directly. The + functions in the API remain backward-compatible, however. + +- Buffer overrun fix ([#66]). + +## Version 1.3.1 ## + +2015-11-02: + +- Do not export symbol for internal function `unsafe_encode_char()` ([#55]). + +- Install relative symbolic links for shared libraries ([#58]). + +- Enable and fix compiler warnings ([#55], [#58]). + +- Add missing files to `make clean` ([#58]). + +## Version 1.3 ## + +2015-07-06: + +- Updated for Unicode 8.0 ([#45]). + +- New `utf8proc_tolower` and `utf8proc_toupper` functions, portable + replacements for `towlower` and `towupper` in the C library ([#40]). + +- Don't treat Unicode "non-characters" as invalid, and improved + validity checking in general ([#35]). + +- Prefix all typedefs with `utf8proc_`, e.g. `utf8proc_int32_t`, + to avoid collisions with other libraries ([#32]). + +- Rename `DLLEXPORT` to `UTF8PROC_DLLEXPORT` to prevent collisions. + +- Fix build breakage in the benchmark routines. + +- More fine-grained Makefile variables (`PICFLAG` etcetera), so that + compilation flags can be selectively overridden, and in particular + so that `CFLAGS` can be changed without accidentally eliminating + necessary flags like `-fPIC` and `-std=c99` ([#43]). + +- Updated character-width tables based on Unifont 8.0.01 ([#51]) and + the Unicode 8 character categories ([#47]). + +## Version 1.2 ## + +2015-03-28: + +- Updated for Unicode 7.0 ([#6]). + +- New function `utf8proc_grapheme_break(c1,c2)` that returns whether + there is a grapheme break between `c1` and `c2` ([#20]). + +- New function `utf8proc_charwidth(c)` that returns the number of + column-positions that should be required for `c`; essentially a + portable replacment for `wcwidth(c)` ([#27]). + +- New function `utf8proc_category(c)` that returns the Unicode + category of `c` (as one of the constants `UTF8PROC_CATEGORY_xx`). + Also, a function `utf8proc_category_string(c)` that returns the Unicode + category of `c` as a two-character string. + +- `cmake` script `CMakeLists.txt`, in addition to `Makefile`, for + easier compilation on Windows ([#28]). + +- Various `Makefile` improvements: a `make check` target to perform + tests ([#13]), `make install`, a rule to automate updating the Unicode + tables, etcetera. + +- The shared library is now versioned (e.g. has a soname on GNU/Linux) ([#24]). + +- C++/MSVC compatibility ([#17]). + +- Most `#defined` constants are now `enums` ([#29]). + +- New preprocessor constants `UTF8PROC_VERSION_MAJOR`, + `UTF8PROC_VERSION_MINOR`, and `UTF8PROC_VERSION_PATCH` for compile-time + detection of the API version. + +- Doxygen-formatted documentation ([#29]). + +- The Ruby and PostgreSQL plugins have been removed due to lack of testing ([#22]). + +## Version 1.1.6 ## + +2013-11-27: + +- PostgreSQL 9.2 and 9.3 compatibility (lowercase `c` language name) + +## Version 1.1.5 ## + +2009-08-20: + +- Use `RSTRING_PTR()` and `RSTRING_LEN()` instead of `RSTRING()->ptr` and + `RSTRING()->len` for ruby1.9 compatibility (and `#define` them, if not + existent) + +2009-10-02: + +- Patches for compatibility with Microsoft Visual Studio + +2009-10-08: + +- Fixes to make utf8proc usable in C++ programs + +2009-10-16: + +## Version 1.1.4 ## + +2009-06-14: + +- replaced C++ style comments for compatibility reasons +- added typecasts to suppress compiler warnings +- removed redundant source files for ruby-gemfile generation + +2009-08-19: + +- Changed copyright notice for Public Software Group e. V. +- Minor changes in the `README` file + +## Version 1.1.3 ## + +2008-10-04: + +- Added a function `utf8proc_version` returning a string containing the version + number of the library. +- Included a target `libutf8proc.dylib` for MacOSX. + +2009-05-01: +- PostgreSQL 8.3 compatibility (use of `SET_VARSIZE` macro) + +## Version 1.1.2 ## + +2007-07-25: + +- Fixed a serious bug in the data file generator, which caused characters + being treated incorrectly, when stripping default ignorable characters or + calculating grapheme cluster boundaries. + +## Version 1.1.1 ## + +2007-06-25: + +- Added a new PostgreSQL function `unistrip`, which behaves like `unifold`, + but also removes all character marks (e.g. accents). + +2007-07-22: + +- Changed license from BSD to MIT style. +- Added a new function `utf8proc_codepoint_valid` to the C library. +- Changed compiler flags in `Makefile` from `-g -O0` to `-O2` +- The ruby script, which was used to build the `utf8proc_data.c` file, is now + included in the distribution. + +## Version 1.0.3 ## + +2007-03-16: + +- Fixed a bug in the ruby library, which caused an error, when splitting an + empty string at grapheme cluster boundaries (method `String#utf8chars`). + +## Version 1.0.2 ## + +2006-09-21: + +- included a check in `Integer#utf8`, which raises an exception, if the given + code-point is invalid because of being too high (this was missing yet) + +2006-12-26: + +- added support for PostgreSQL version 8.2 + +## Version 1.0.1 ## + +2006-09-20: + +- included a gem file for the ruby version of the library + +Release of version 1.0.1 + +## Version 1.0 ## + +2006-09-17: + +- added the `LUMP` option, which lumps certain characters together (see `lump.md`) (also used for the PostgreSQL `unifold` function) +- added the `STRIPMARK` option, which strips marking characters (or marks of composed characters) +- deprecated ruby method `String#char_ary` in favour of `String#utf8chars` + +## Version 0.3 ## + +2006-07-18: + +- changed normalization from NFC to NFKC for postgresql unifold function + +2006-08-04: + +- added support to mark the beginning of a grapheme cluster with 0xFF (option: `CHARBOUND`) +- added the ruby method `String#chars`, which is returning an array of UTF-8 encoded grapheme clusters +- added `NLF2LF` transformation in postgresql `unifold` function +- added the `DECOMPOSE` option, if you neither use `COMPOSE` or `DECOMPOSE`, no normalization will be performed (different from previous versions) +- using integer constants rather than C-strings for character properties +- fixed (hopefully) a problem with the ruby library on Mac OS X, which occurred when compiler optimization was switched on + +## Version 0.2 ## + +2006-06-05: + +- changed behaviour of PostgreSQL function to return NULL in case of invalid input, rather than raising an exceptional condition +- improved efficiency of PostgreSQL function (no transformation to C string is done) + +2006-06-20: + +- added -fpic compiler flag in Makefile +- fixed bug in the C code for the ruby library (usage of non-existent function) + +## Version 0.1 ## + +2006-06-02: initial release of version 0.1 + +[#6]: https://github.com/JuliaLang/utf8proc/issues/6 +[#13]: https://github.com/JuliaLang/utf8proc/issues/13 +[#17]: https://github.com/JuliaLang/utf8proc/issues/17 +[#20]: https://github.com/JuliaLang/utf8proc/issues/20 +[#22]: https://github.com/JuliaLang/utf8proc/issues/22 +[#24]: https://github.com/JuliaLang/utf8proc/issues/24 +[#27]: https://github.com/JuliaLang/utf8proc/issues/27 +[#28]: https://github.com/JuliaLang/utf8proc/issues/28 +[#29]: https://github.com/JuliaLang/utf8proc/issues/29 +[#32]: https://github.com/JuliaLang/utf8proc/issues/32 +[#35]: https://github.com/JuliaLang/utf8proc/issues/35 +[#40]: https://github.com/JuliaLang/utf8proc/issues/40 +[#43]: https://github.com/JuliaLang/utf8proc/issues/43 +[#45]: https://github.com/JuliaLang/utf8proc/issues/45 +[#47]: https://github.com/JuliaLang/utf8proc/issues/47 +[#51]: https://github.com/JuliaLang/utf8proc/issues/51 +[#55]: https://github.com/JuliaLang/utf8proc/issues/55 +[#58]: https://github.com/JuliaLang/utf8proc/issues/58 +[#62]: https://github.com/JuliaLang/utf8proc/issues/62 +[#66]: https://github.com/JuliaLang/utf8proc/issues/66 +[#68]: https://github.com/JuliaLang/utf8proc/issues/68 +[#70]: https://github.com/JuliaLang/utf8proc/issues/70 +[#77]: https://github.com/JuliaLang/utf8proc/issues/77 +[#78]: https://github.com/JuliaLang/utf8proc/issues/78 +[#79]: https://github.com/JuliaLang/utf8proc/issues/79 +[#80]: https://github.com/JuliaLang/utf8proc/issues/80 +[#84]: https://github.com/JuliaLang/utf8proc/issues/84 +[#88]: https://github.com/JuliaLang/utf8proc/issues/88 +[#89]: https://github.com/JuliaLang/utf8proc/issues/89 +[#90]: https://github.com/JuliaLang/utf8proc/issues/90 +[#94]: https://github.com/JuliaLang/utf8proc/issues/94 +[#99]: https://github.com/JuliaLang/utf8proc/issues/99 +[#113]: https://github.com/JuliaLang/utf8proc/issues/113 +[#125]: https://github.com/JuliaLang/utf8proc/issues/125 +[#128]: https://github.com/JuliaLang/utf8proc/issues/128 diff --git a/contrib/libs/utf8proc/README.md b/contrib/libs/utf8proc/README.md new file mode 100644 index 0000000000..cb9f69478d --- /dev/null +++ b/contrib/libs/utf8proc/README.md @@ -0,0 +1,69 @@ +# utf8proc +[![Travis CI Status](https://travis-ci.org/JuliaLang/utf8proc.png)](https://travis-ci.org/JuliaLang/utf8proc) +[![AppVeyor Status](https://ci.appveyor.com/api/projects/status/aou20lfkyhj8xbwq/branch/master?svg=true)](https://ci.appveyor.com/project/tkelman/utf8proc/branch/master) + + +[utf8proc](http://julialang.org/utf8proc/) is a small, clean C +library that provides Unicode normalization, case-folding, and other +operations for data in the [UTF-8 +encoding](http://en.wikipedia.org/wiki/UTF-8). It was [initially +developed](http://www.public-software-group.org/utf8proc) by Jan +Behrens and the rest of the [Public Software +Group](http://www.public-software-group.org/), who deserve *nearly all +of the credit* for this package. With the blessing of the Public +Software Group, the [Julia developers](http://julialang.org/) have +taken over development of utf8proc, since the original developers have +moved to other projects. + +(utf8proc is used for basic Unicode +support in the [Julia language](http://julialang.org/), and the Julia +developers became involved because they wanted to add Unicode 7 support and other features.) + +(The original utf8proc package also includes Ruby and PostgreSQL plug-ins. +We removed those from utf8proc in order to focus exclusively on the C +library for the time being, but plan to add them back in or release them as separate packages.) + +The utf8proc package is licensed under the +free/open-source [MIT "expat" +license](http://opensource.org/licenses/MIT) (plus certain Unicode +data governed by the similarly permissive [Unicode data +license](http://www.unicode.org/copyright.html#Exhibit1)); please see +the included `LICENSE.md` file for more detailed information. + +## Quick Start + +For compilation of the C library run `make`. + +## General Information + +The C library is found in this directory after successful compilation +and is named `libutf8proc.a` (for the static library) and +`libutf8proc.so` (for the dynamic library). + +The Unicode version supported is 9.0.0. + +For Unicode normalizations, the following options are used: + +* Normalization Form C: `STABLE`, `COMPOSE` +* Normalization Form D: `STABLE`, `DECOMPOSE` +* Normalization Form KC: `STABLE`, `COMPOSE`, `COMPAT` +* Normalization Form KD: `STABLE`, `DECOMPOSE`, `COMPAT` + +## C Library + +The documentation for the C library is found in the `utf8proc.h` header file. +`utf8proc_map` is function you will most likely be using for mapping UTF-8 +strings, unless you want to allocate memory yourself. + +## To Do + +See the Github [issues list](https://github.com/JuliaLang/utf8proc/issues). + +## Contact + +Bug reports, feature requests, and other queries can be filed at +the [utf8proc issues page on Github](https://github.com/JuliaLang/utf8proc/issues). + +## See also + +An independent Lua translation of this library, [lua-mojibake](https://github.com/differentprogramming/lua-mojibake), is also available. |