apache / incubator-pegasus

Apache Pegasus - A horizontally scalable, strongly consistent and high-performance key-value store
https://pegasus.apache.org/
Apache License 2.0

The amount of data increased after backing up and restoring a partition-split table #1964

Open acelyc111 opened 3 months ago

acelyc111 commented 3 months ago

Bug Report

Please answer these questions before submitting your issue. Thanks!

  1. What did you do?

    • create a table A
    • write some data to the table A
    • count the total data on table A as number_a1
    • execute a partition-split on the table A
    • count the total data on table A as number_a2
    • back up the table A to HDFS
    • restore the data to a new table B from the backed up data from HDFS
    • count the total data on table B as number_b
  2. What did you expect to see? number_a1 == number_a2 == number_b

  3. What did you see instead? number_a1 == number_a2 != number_b (number_b is larger)

  4. What version of Pegasus are you using? 2.4

acelyc111 commented 3 months ago

This happens because the replica.split.validate_partition_hash environment variable is lost on table B after the restore.
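
To illustrate how a lost replica.split.validate_partition_hash setting could inflate the count, here is a toy model in Python. It is a sketch under stated assumptions, not Pegasus code: it assumes that each child partition starts as a full copy of its parent after a split, and that the setting makes reads skip keys whose hash does not map to the partition being read. The helper names (`partition_of`, `build_table`, `split`, `count_data`) and the use of CRC32 as the hash are hypothetical.

```python
import zlib


def partition_of(key: str, partition_count: int) -> int:
    # Toy hash partitioning; CRC32 stands in for the real hash function.
    return zlib.crc32(key.encode()) % partition_count


def build_table(keys, partition_count):
    # Place every key in the partition its hash maps to.
    parts = [set() for _ in range(partition_count)]
    for k in keys:
        parts[partition_of(k, partition_count)].add(k)
    return parts


def split(parts):
    # Assumption: partition i + n is created as a full copy of partition i,
    # so both halves still hold keys that now belong to the other one.
    return [set(p) for p in parts] + [set(p) for p in parts]


def count_data(parts, validate_partition_hash: bool) -> int:
    # With validation on, keys that hash to a different partition are skipped.
    n = len(parts)
    total = 0
    for idx, part in enumerate(parts):
        for k in part:
            if validate_partition_hash and partition_of(k, n) != idx:
                continue
            total += 1
    return total


keys = [f"key{i}" for i in range(1000)]

table_a = build_table(keys, 4)
number_a1 = count_data(table_a, validate_partition_hash=True)

table_a = split(table_a)
number_a2 = count_data(table_a, validate_partition_hash=True)

# The restore copies the stored data, but in this model the setting is lost.
table_b = [set(p) for p in table_a]
number_b = count_data(table_b, validate_partition_hash=False)

print(number_a1, number_a2, number_b)
```

In this model the stale copies are only hidden by the validation check, so a table that carries the data but not the setting reports every key twice.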