summaryrefslogtreecommitdiffstats
path: root/yt/cpp/mapreduce/interface
Commit message (Collapse)AuthorAgeFilesLines
...
* Revert "YT-21253 Include HyperLogLog in YT table columnar statistics"orlovorlov2024-07-032-4/+0
| | | | | | This reverts commit b5399faf1a9757b07a2d2ee25bd16b8a27be7939, reversing changes made to d7e3e35dd1a856c587d7a9eb2e0dd180d3cf39ed. 82c6dea5d3958fc85ee39e7bcc23c6ec24d6aee9
* [yt/cpp/mapreduce] YT-21595: Use gtest instead of ytest in all mapreduce testsnadya732024-07-0231-2098/+2111
| | | | 85671f0cf4f45b4f015fa2cc0d195b81c16c6e8a
* YT-21253 Include HyperLogLog in YT table columnar statisticsorlovorlov2024-07-022-0/+4
| | | | | | тестирование HLL на случайно сгенерированных данных: p=10 показывает худшую погрешность в 9.9% (равномерное распределение на отрезке [0, 10^6), 10 HLL-групп, 1М значений, 631К уникальных b5399faf1a9757b07a2d2ee25bd16b8a27be7939
* [yt/cpp/mapreduce] Update misleading commenteak1mov2024-06-051-1/+2
| | | | | | Похоже в rXXXXXX по ошибке перенесли комментарий из `Abort()` в `Finish()`: https://a.yandex-team.ru/arcadia/commit/rXXXXXX#file-mapreduce/yt/interface/io.h:L208 c182c2732c309d8c5371e3ef8071ecd07aa54928
* YT-21308: Add redirect_stdout_to_stderr flag for C++ clientapachee2024-05-245-1/+36
| | | | | Adds redirect_stdout_to_stderr spec option for operations that allows writing to stdout as if it was stderr. 6a8ac5f21955a79848d86f72715628c7b8bb65c4
* Fix typo: comitted, commited -> committedEgor Chunaev2024-05-031-1/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | I left only these typos: ```cpp # Build. build/scripts/link_sbom.py 61: res["properties"].append({'name': 'has_uncommited_changes', 'value': True}) # Contrib. contrib/clickhouse/src/Coordination/KeeperLogStore.h 19: /// Read log storage from filesystem starting from last_commited_log_index 20: void init(uint64_t last_commited_log_index, uint64_t logs_to_keep); contrib/clickhouse/src/Coordination/KeeperStateManager.h 36: void loadLogStore(uint64_t last_commited_index, uint64_t logs_to_keep); contrib/clickhouse/src/Coordination/Changelog.h 100: void readChangelogAndInitWriter(uint64_t last_commited_log_index, uint64_t logs_to_keep); contrib/clickhouse/src/Databases/DatabaseReplicatedSettings.h 13: M(UInt64, wait_entry_commited_timeout_sec, 3600, "Replicas will try to cancel query if timeout exceed, but initiator host has not executed it yet", 0) \ contrib/clickhouse/src/Databases/DatabaseReplicatedWorker.cpp 337: size_t max_iterations = database->db_settings.wait_entry_commited_timeout_sec; contrib/python/pytest-benchmark/pytest_benchmark/utils.py 77: parts.append("uncommited-changes") contrib/libs/poco/Data/include/Poco/Data/Transaction.h 57: /// commited automatically. If no error occurs, rollback is disabled and does 85: /// Rolls back the current database transaction if it has not been commited contrib/clickhouse/src/Storages/StorageMergeTree.cpp 2061: /// and we should be able to rollback already added (Precomitted) parts # Kinda contrib. yt/spark/spark/sql/core/src/test/scala/org/apache/spark/sql/execution/streaming/state/StateStoreSuite.scala 1048: // Reload the store from the commited version and repeat the above test. # Metrics tag :( yt/yt/server/master/security_server/security_manager.cpp 595: TWithTagGuard guard(&buffer, "status", "commited"); 599: TWithTagGuard guard(&buffer, "status", "uncommited"); ``` The only backwards incompatible place is: https://github.com/ytsaurus/ytsaurus/blob/091bcf82fe4fc8d9a967eb36beddf5767e92e659/yt/python/yt/wrapper/dynamic_table_commands.py#L26-L27 --- 9f6a944af13ef3fbf3f25c15b2c2d3982ed7e39b Pull Request resolved: https://github.com/ytsaurus/ytsaurus/pull/566 Co-authored-by: ignat <[email protected]>
* [yt/cpp/mapreduce] YT-21465: Pass table index via SkiffRowHintsnadya732024-04-251-1/+8
| | | | | Pass table index via SkiffRowHints 73ade54789f2bab159368dfcc876b0a6121b4e7a
* [yt/cpp/mapreduce] YT-21405: Don't ignore backoff and pass actual exception ↵nadya732024-04-181-1/+2
| | | | | | | in Retry() Don't ignore backoff and pass actual exception in Retry() b821c02fd21c9f8115cd2a4896372a9fda69e5f6
* Fix more problems with RetryfulWriterV2ermolovd2024-04-171-7/+0
| | | | 404e999bcffb20d5497161a98f48f566b5245704
* YT-18503: Mirror Cypress Tx to Sequoia Groundkvk19202024-04-071-0/+13
| | | | e6d585180289325f8082f42f85a60478194ba266
* Use async tx pinger by default once againermolovd2024-04-011-1/+1
| | | | 5c990fdee5899ef1cfcc5429f3631998277cd218
* Fix commentermolovd2024-03-241-0/+1
| | | | d547e94dc63865b96a5cdfbe9866d87b11a57193
* YT-18458: Introduce wide types into mapreduce interfacewhatsername2024-03-214-0/+51
| | | | 7ae047ef618cc44d7dd3e817dc27f2336d9e38c3
* Support building yt/cpp and yt/yt/core with vanilla protobufGrigory Reznikov2024-03-193-15/+17
| | | | | | | | | | | | | | After this PR yt/cpp and yt/yt/core are possible to be built both with Arcadia protobuf (that uses TString as a string) and vanilla protobuf (that uses std::string as a string). To achieve so, a couple of interoperability primitives are introduced. * `TProtobufString` is an alias to protobuf string type, i.e. it can be `TString` or `std::string` depending on the protobuf implementation. * `IsVanillaProtobuf` and `IsArcadiaProtobuf` are the constexpr boolean values that allow to check protobuf implementation both in the compile time and runtime. The most challenging interoperability issue solved here is a string copy between protobuf message and C++ code that has a form of `TString str = msg.str()`. This code works perfect with Arcadia protobuf but does not work with vanilla protobuf. To solve it, a previously introduced primitive `FromProto<TString>` is used. This expression makes the most efficient cast possible between protobuf string and C++ string. Internally, it is just a copy in both cases. Since TString is CoW by default, this expression is almost zero-cost (actually it's just one atomic operation), so no degradation is expected for YTsaurus server builds. The most hot code is handled differently to avoid even atomic operations (see `GetRequestTargetYPath`). In case of vanilla protobuf string is copied, however there are no places in C++ SDK where it might be a problem. If such issues would appear, performance-critial code can be rewritten in `GetRequestTargetYPath`-style. --- 1a6f3e02cb6e83915102c24b73bc8734f6a48e74 Pull Request resolved: https://github.com/ytsaurus/ytsaurus/pull/466
* YT-21141 Avoid content deduplication for files under 10MBorlovorlov2024-03-182-0/+4
| | | | febae4e49cd0f600bf21616025f210e99235cfdc
* Intermediate changesrobot-piglet2024-03-131-0/+4
|
* Intermediate changesrobot-piglet2024-03-101-5/+5
|
* Intermediate changesrobot-piglet2024-02-151-1/+1
|
* Intermediate changesrobot-piglet2024-02-121-1/+1
|
* Intermediate changesrobot-piglet2024-01-301-6/+0
|
* Intermediate changesrobot-piglet2024-01-251-0/+13
|
* erm: Add new version for `@yatool/prebuilder`: `0.5.1` and set `0.5.1` as ↵robot-erm2024-01-253-2/+26
| | | | | | default Executed command: `./erm --verbose --profile update @yatool/prebuilder`
* feat contrib: aiogram 3armenqa2024-01-1912-921/+0
| | | | Relates: https://st.yandex-team.ru/, https://st.yandex-team.ru/
* Library import 7 (#937)AlexSm2024-01-116-90/+90
|
* Library import 5, delete go dependencies (#832)AlexSm2024-01-041-2/+2
| | | | | * Library import 5, delete go dependencies * Fix yt client
* Library import 2 (#639)AlexSm2023-12-221-5/+1
|
* External build system generator release 65robot-ya-builder2023-12-052-6/+6
| | | | Update tools: yexport, os-yexport
* YT-19269: table writer implementation that doesn't wait for complete buffer ↵ermolovd2023-11-281-0/+18
| | | | before sending to network
* YT-20315: Support retries of cross cell copyingnadya022023-11-242-0/+2
| | | | | | add options YT-20315: Support retries of cross cell copying
* Fix serialization of decimal type in TTableSchemaEgor Chunaev2023-11-232-2/+52
| | | | | | | | | | | | I hereby agree to the terms of the CLA available at: https://yandex.ru/legal/cla/?lang=en Fix for https://github.com/ytsaurus/ytsaurus/issues/173 --- Pull Request resolved: https://github.com/ytsaurus/ytsaurus/pull/174 Co-authored-by: ermolovd <[email protected]>
* Revert "YT-20315: Support retries of cross cell copying"ermolovd2023-11-232-2/+0
| | | | | This reverts commit 9b45f88f366c2a170ab826922dd6eeaa64ea4192, reversing changes made to d6dc5a658da5b61fd71e72f1a60479989c5c64c5.
* YT-18863 Support 'deleted' field in NYT::TTableSchemaorlovorlov2023-11-222-0/+14
|
* YT-20315: Support retries of cross cell copyingnadya022023-11-222-0/+2
|
* add darwin-arm64 CMakeListsdcherednik2023-11-204-0/+181
|
* Move MaxFailedJobCount to TOperationSpecBaseermolovd2023-11-161-3/+3
|
* YT-20029: Support url schema for YT_PROXYwhatsername2023-11-151-1/+1
| | | | | Example YT_PROXY=https://freud.yt.yandex.net
* Add possibility to create groups in C++ clientermolovd2023-11-081-0/+1
|
* add acquire buffer size to parallel file writeralxmopo3ov2023-10-311-1/+7
| | | | | | | | test with acquire ram buffers Add test on write with acquiring hard limit on file writer Implement acquire ram buffers setting for parallel file writer
* YT-18571: Fix myriads of typosbabenko2023-10-237-14/+14
|
* add using http-proxy for reading table from YTannashest182023-10-221-0/+3
| | | | | | add using http-proxy for reading table from YT Нам нужна возможность ходить в YT через HTTP proxy для чтения таблиц, используя С++ клиент не из контура Яндекса, к сожалению, сейчас такой возможности нет. В этом ПР черновик изменения, которого нам достаточно https://a.yandex-team.ru/review/4676436/details - тут это же изменение в YT + коммит с тем, как мы планируем использовать
* Y_FAIL->Y_ABORT at '^yt'ilnurkh2023-10-177-34/+34
| | | | https://clubs.at.yandex-team.ru/arcadia/29404
* Possibility to get operation by its aliasermolovd2023-10-162-0/+10
|
* Y_VERIFY->Y_ABORT_UNLESS at ^ytilnurkh2023-10-0910-29/+29
| | | | https://clubs.at.yandex-team.ru/arcadia/29404
* Cosmeticspogorelov2023-10-041-1/+1
|
* support disk_request in user job specermolovd2023-09-281-0/+33
|
* Debug printing for NYT::TTableSchemaermolovd2023-09-192-1/+8
|
* [yt/cpp/mapreduce] YT-19268: Lock memory for parallel writernadya732023-09-142-0/+27
|
* Add `layer_paths` user job spec option to the C++ interfacegaltsev2023-09-061-0/+3
|
* [yt/cpp/mapreduce] Fix linksnadya732023-09-061-2/+2
|
* [yt/cpp/mapreduce] Fix documentation linksnadya732023-09-068-209/+209
|