path: root/yql/essentials/sql/v1/lexer/regex
Commit message | Author | Age | Files | Lines
* YQL-20496: Improve YQL highlighting | vitya-smirnov | 11 days | 1 | -9/+8
  commit_hash:edf72fd8386c41f858f677152612bd58680506a6
* YQL-20086 sql/v1 | vitya-smirnov | 14 days | 8 | -684/+684
  commit_hash:55bc611cdaa0d8a0fc3c4c7708ed9f17cc4976cf
* YQL-19616: Optimize regexes for Monarch | vitya-smirnov | 2025-09-29 | 2 | -84/+86
  The Monarch regex engine is very slow on regexes like `([A-Z]|[a-z]|_|[0-9])*`, but works well on `([A-Za-z_0-9])*`, so this optimization merges such alternations into a single character class.
  commit_hash:b749e59818a20d7fd1d1ba21b233e1467e9d6ced
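The rewrite described in this commit message can be illustrated with a minimal sketch. It is not the code from the commit: the helper name `merge_single_char_alternatives` is hypothetical, and it assumes every alternative is either a plain character class or a single word character. Python's `re` is used only to check that the two forms accept the same strings; it says nothing about Monarch's performance.

```python
import re


def merge_single_char_alternatives(alternatives):
    """Merge alternatives that each match one character into a single class.

    Hypothetical helper. Handles only plain classes such as '[A-Z]' and
    single literal word characters such as '_'; anything else is rejected.
    """
    body = []
    for alt in alternatives:
        if len(alt) > 2 and alt[0] == "[" and alt[-1] == "]":
            inner = alt[1:-1]
            # Refuse negated classes, escapes, nested brackets, and edge
            # hyphens: splicing them into a larger class could change
            # what the class matches.
            if (inner[0] in "^-" or inner[-1] == "-"
                    or any(ch in inner for ch in "\\[]")):
                raise ValueError(f"unsupported class: {alt}")
            body.append(inner)
        elif len(alt) == 1 and (alt.isalnum() or alt == "_"):
            body.append(alt)
        else:
            raise ValueError(f"unsupported alternative: {alt}")
    return "[" + "".join(body) + "]"


# The two patterns quoted in the commit message.
slow = re.compile(r"([A-Z]|[a-z]|_|[0-9])*")
merged_class = merge_single_char_alternatives(["[A-Z]", "[a-z]", "_", "[0-9]"])
fast = re.compile("(" + merged_class + ")*")
```

Rejecting anything outside the two simple cases keeps the sketch safe: a merge is only performed when concatenating the class bodies cannot alter the accepted character set.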
* YQL-20297: Make generated regexes more stable | vitya-smirnov | 2025-09-05 | 1 | -1/+5
  commit_hash:b3270397329599800d4e7b1f92b8e0f18e94cfd6
* YQL-19616: Generate YQLs syntax highlighting | vitya-smirnov | 2025-08-01 | 2 | -11/+27
  - Support `Before` at core `TRegexPattern` and `IGenericLexer`.
  - Added `Name` and `Extension` to core `THighlighting`.
  - Added a `THighlighting` factory method for `YQLs`.
  - Added `--language` option to `yql_highlight`.
  - Added `artifact` targets for `YQLs`.
  Using `NSQLTranslation::THighlighting` for `YQLs` is not strictly correct, but it is much simpler than generalizing this infrastructure just for `YQLs`. This is a trade-off between development time and clean code.
  Results:
  - JetBrains: https://nda.ya.ru/t/PXkZVE8m7H5wHS.
  - Vim: https://nda.ya.ru/t/Am-6ZHQa7H5wJi.
  - TextMate: https://nda.ya.ru/t/wH0YggAf7H5wKw.
  - yql_highlight: https://nda.ya.ru/t/3FaCm57q7H7QSF.
  commit_hash:f0e1abb8e7f1b083df531d761b357330bd514cb0
* YQL-19616: Generate Vim SQL syntax highlighting | vitya-smirnov | 2025-07-15 | 1 | -1/+6
  Introduced Vim syntax highlighting for YQL. It replaces the existing, almost entirely hand-written configuration and uses regexes generated from the original ANTLR4 grammar. Currently only the Default lexer mode is supported.
  commit_hash:85fa094593bd9d80373754a492b46ede1a50148d
* YQL-19616: Fix TRegexLexer performance | vityaman | 2025-05-19 | 5 | -45/+120
  Fix `TRegexLexer` performance. It is now only 2 times slower than the reference ANTLR implementation in Release mode, so merged regexes are 3 times faster than scan&compare.
  ![image](https://github.com/user-attachments/assets/4e0cb27a-491d-4dbd-b10a-5725ffa6d902)
  ---
  - Related to `YQL-19616`
  - Related to https://github.com/ydb-platform/ydb/issues/15129
  - Related to https://github.com/vityaman/ydb/issues/42
  ---
  Pull Request resolved: https://github.com/ytsaurus/ytsaurus/pull/1278
  commit_hash:1529f641172fea13f0d33fbfd06a4827c6efde01
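The difference between "scan&compare" and "merged regexes" mentioned in this commit message can be sketched as follows. This is an illustration under assumptions, not the `TRegexLexer` implementation: the token rules in `RULES` are invented, and Python's `re` picks the first matching alternative rather than the longest one, so the merged variant here relies on the rules being ordered by priority and starting with disjoint characters.

```python
import re

# Hypothetical token rules in priority order; not the real YQL grammar.
RULES = [
    ("WS", r"\s+"),
    ("NUMBER", r"[0-9]+"),
    ("IDENT", r"[A-Za-z_][A-Za-z_0-9]*"),
    ("PUNCT", r"[(),;*=]"),
]

COMPILED = [(name, re.compile(pattern)) for name, pattern in RULES]

# One alternation with a named group per rule: a single match call per token.
MERGED = re.compile("|".join(f"(?P<{name}>{pattern})" for name, pattern in RULES))


def lex_scan_and_compare(text):
    """Try every rule at each position and keep the longest match."""
    pos, tokens = 0, []
    while pos < len(text):
        best_name, best_match = None, None
        for name, regex in COMPILED:
            m = regex.match(text, pos)
            if m and (best_match is None or m.end() > best_match.end()):
                best_name, best_match = name, m
        if best_match is None:
            raise ValueError(f"unexpected character at offset {pos}")
        tokens.append((best_name, best_match.group()))
        pos = best_match.end()
    return tokens


def lex_merged(text):
    """Match the merged regex once per token; the named group identifies the rule."""
    pos, tokens = 0, []
    while pos < len(text):
        m = MERGED.match(text, pos)
        if m is None:
            raise ValueError(f"unexpected character at offset {pos}")
        tokens.append((m.lastgroup, m.group()))
        pos = m.end()
    return tokens
```

For an input such as `"SELECT a1, 42;"` both functions produce the same token stream; the merged variant simply does one regex match per token instead of one per rule.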
* Intermediate changes | robot-piglet | 2025-05-12 | 7 | -192/+325
  commit_hash:3a624a323006078de71f50747f7b2e8cadba7ccd
* Intermediate changes | robot-piglet | 2025-04-14 | 3 | -3/+11
  commit_hash:b6187f8eba6e8debc23f1928b2e44a396f3511ad
* YQL-19616 Fix lexer/regex STRING_VALUE and TSKIP recognition | vityaman | 2025-04-11 | 1 | -7/+12
  - Related to https://github.com/ydb-platform/ydb/issues/15129
  - Related to https://github.com/vityaman/ydb/issues/11
  ---
  Pull Request resolved: https://github.com/ytsaurus/ytsaurus/pull/1201
  commit_hash:53ef677a35649a6dc77d8c4269a8aceefcd15026
* YQL-19616 Fix regex lexer | vityaman | 2025-04-09 | 3 | -5/+8
  Fixed regex lexer issues:
  - `TSKIP` token recognition
  - `HEXDIGITS` number recognition
  - `EOF` token content
  ---
  - Related to https://github.com/ydb-platform/ydb/issues/15129
  - Related to https://github.com/vityaman/ydb/issues/11
  ---
  Pull Request resolved: https://github.com/ytsaurus/ytsaurus/pull/1190
  commit_hash:497c39efcbbe4e387da523b5e2c8abaa6485d93b
* Intermediate changes | robot-piglet | 2025-04-07 | 3 | -11/+17
  commit_hash:6768768ea3a3962231d3fabdffb2ce0db44e9347
* Intermediate changes | robot-piglet | 2025-04-02 | 1 | -2/+3
  commit_hash:28e9e4bd6b02e8914d82b2aafe9f341b5492421f
* Intermediate changes | robot-piglet | 2025-04-01 | 3 | -23/+24
  commit_hash:e57b3e95787cc8037f200f1b6b6073e35403b27e
* YQL-19616 Convert YQL lexer grammar to regexes | vityaman | 2025-03-28 | 8 | -0/+876
  - [x] Parse YQL grammar to extract lexer grammar into `TLexerGrammar`.
  - [x] Translate `TLexerGrammar` into regexes.
  - [x] Implement a lexer via regexes `TRegexLexer` to test generated regexes validity.
  - [x] Test on `Default` syntax mode.
  - [x] Test on `ANSI` syntax mode.
  ---
  - Related to https://github.com/ydb-platform/ydb/issues/15129
  - Requirement for https://github.com/ytsaurus/ytsaurus/pull/1112
  ---
  Pull Request resolved: https://github.com/ytsaurus/ytsaurus/pull/1127
  commit_hash:03ffffe81cdafe7f93a4d3fd9a3212fe67f1c72d
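The "translate a lexer grammar into regexes" step listed in this commit message can be sketched as recursive inlining of rule references. This is only an illustration: the `GRAMMAR` dictionary, its rule names, and `to_regex` are assumptions made for the example and do not reflect the actual layout of `TLexerGrammar`.

```python
import re

# Hypothetical mini-grammar. Each rule body is a list of pieces: either a raw
# regex fragment (str) or a reference to another rule, written ("ref", name).
GRAMMAR = {
    "DIGIT": ["[0-9]"],
    "HEXDIGIT": ["[0-9A-Fa-f]"],
    "DIGITS": [("ref", "DIGIT"), "+"],
    "HEXDIGITS": ["0[xX]", ("ref", "HEXDIGIT"), "+"],
    "INTEGER": ["(?:", ("ref", "HEXDIGITS"), "|", ("ref", "DIGITS"), ")"],
}


def to_regex(name, grammar):
    """Inline every rule reference so the result is one standalone regex.

    Assumes the grammar has no recursive rules; a recursive reference
    would make this function loop forever.
    """
    parts = []
    for piece in grammar[name]:
        if isinstance(piece, tuple):
            # Wrap the referenced rule in a non-capturing group so that a
            # following quantifier applies to the whole rule.
            parts.append("(?:" + to_regex(piece[1], grammar) + ")")
        else:
            parts.append(piece)
    return "".join(parts)


INTEGER = re.compile(to_regex("INTEGER", GRAMMAR))
```

With this grammar, `INTEGER.fullmatch("0x1F")` and `INTEGER.fullmatch("123")` both succeed, while `INTEGER.fullmatch("12g")` returns `None`. Wrapping each inlined rule in a non-capturing group costs some readability in the generated pattern but keeps operator precedence correct without analyzing the rule body.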