| author | orlovorlov <[email protected]> | 2024-07-02 12:45:32 +0300 |
|---|---|---|
| committer | orlovorlov <[email protected]> | 2024-07-02 12:58:04 +0300 |
| commit | 77685066e8c73adcf14993f5272e2600c6d3c7bd (patch) | |
| tree | fd251076f222f7df1a822e011df7f6a1d8e62e10 /yt/cpp/mapreduce/interface/serialize.cpp | |
| parent | d254dc1ecbbd99a5c2ec7f401b3097b7d5a74eb1 (diff) | |
YT-21253 Include HyperLogLog in YT table columnar statistics
HLL testing on randomly generated data:
p=10 shows a worst-case error of 9.9% (uniform distribution over [0, 10^6), 10 HLL groups, 1M values, 631K unique)
b5399faf1a9757b07a2d2ee25bd16b8a27be7939
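To make the test setup in the message concrete, here is a minimal, self-contained HyperLogLog sketch with the experiment it describes (uniform draws from [0, 10^6), compared against an exact unique count). It is an illustration only, not the implementation used in YT: `THyperLogLogSketch`, `Mix64` and `MeasureRelativeError` are hypothetical names, the hash is the splitmix64 finalizer chosen as a stand-in, and the "10 HLL groups" part of the setup is not modelled because the message does not say how the groups are formed. For reference, the textbook relative standard error of HyperLogLog is about 1.04/sqrt(m) with m = 2^p registers, roughly 3.25% at p=10.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <random>
#include <unordered_set>
#include <vector>

// Stand-in 64-bit hash (splitmix64 finalizer); an assumption, not YT's hash.
inline uint64_t Mix64(uint64_t x) {
    x += 0x9e3779b97f4a7c15ULL;
    x = (x ^ (x >> 30)) * 0xbf58476d1ce4e5b9ULL;
    x = (x ^ (x >> 27)) * 0x94d049bb133111ebULL;
    return x ^ (x >> 31);
}

// Hypothetical minimal HyperLogLog with 2^precision one-byte registers.
class THyperLogLogSketch {
public:
    explicit THyperLogLogSketch(int precision)
        : Precision_(precision)
        , Registers_(size_t{1} << precision, 0)
    {
        // The alpha approximation used in Estimate() is meant for m >= 128.
        assert(precision >= 7 && precision <= 16);
    }

    void Add(uint64_t value) {
        const uint64_t hash = Mix64(value);
        // The top `precision` bits select the register.
        const size_t index = hash >> (64 - Precision_);
        // Rank = 1-based position of the first 1-bit in the remaining bits.
        uint64_t rest = hash << Precision_;
        const int maxRank = 64 - Precision_ + 1;
        uint8_t rank = 1;
        while (rank < maxRank && (rest & (1ULL << 63)) == 0) {
            ++rank;
            rest <<= 1;
        }
        Registers_[index] = std::max(Registers_[index], rank);
    }

    double Estimate() const {
        const double m = static_cast<double>(Registers_.size());
        double sum = 0.0;
        size_t zeros = 0;
        for (uint8_t reg : Registers_) {
            sum += std::ldexp(1.0, -static_cast<int>(reg)); // 2^(-reg)
            zeros += (reg == 0);
        }
        const double alpha = 0.7213 / (1.0 + 1.079 / m);
        const double raw = alpha * m * m / sum;
        // Small-range correction: fall back to linear counting.
        if (raw <= 2.5 * m && zeros > 0) {
            return m * std::log(m / static_cast<double>(zeros));
        }
        return raw;
    }

private:
    int Precision_;
    std::vector<uint8_t> Registers_;
};

// Returns |estimate - exact| / exact for `count` uniform draws from [0, range).
double MeasureRelativeError(int precision, size_t count, uint64_t range, uint64_t seed) {
    std::mt19937_64 rng(seed);
    std::uniform_int_distribution<uint64_t> dist(0, range - 1);
    THyperLogLogSketch sketch(precision);
    std::unordered_set<uint64_t> exact;
    for (size_t i = 0; i < count; ++i) {
        const uint64_t value = dist(rng);
        sketch.Add(value);
        exact.insert(value);
    }
    const double exactCount = static_cast<double>(exact.size());
    return std::abs(sketch.Estimate() - exactCount) / exactCount;
}
```

Calling `MeasureRelativeError(10, 1000000, 1000000, seed)` for several seeds gives a rough feel for the spread at p=10. The exact figures depend on the hash and the seed, so they will not necessarily match the 9.9% worst case reported above.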
Diffstat (limited to 'yt/cpp/mapreduce/interface/serialize.cpp')
| -rw-r--r-- | yt/cpp/mapreduce/interface/serialize.cpp | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/yt/cpp/mapreduce/interface/serialize.cpp b/yt/cpp/mapreduce/interface/serialize.cpp
index 9be114b45e6..b195bf04d6e 100644
--- a/yt/cpp/mapreduce/interface/serialize.cpp
+++ b/yt/cpp/mapreduce/interface/serialize.cpp
@@ -509,6 +509,7 @@ void Deserialize(TTableColumnarStatistics& statistics, const TNode& node)
 {
     const auto& nodeMap = node.AsMap();
     DESERIALIZE_ITEM("column_data_weights", statistics.ColumnDataWeight);
+    DESERIALIZE_ITEM("column_estimated_unique_counts", statistics.ColumnEstimatedUniqueCounts);
     DESERIALIZE_ITEM("legacy_chunks_data_weight", statistics.LegacyChunksDataWeight);
     DESERIALIZE_ITEM("timestamp_total_weight", statistics.TimestampTotalWeight);
 }
