flow-php / flow

Flow PHP - data processing framework
https://flow-php.com
MIT License
404 stars 23 forks source link

Flow filesystem introduction #1104

Closed norberttech closed 2 days ago

norberttech commented 2 days ago

Change Log

Added

  • Flow Filesystem Library
  • Flow Azure SDK Library
  • Flow Monolog HTTP Bridge
  • Flow Filesystem Azure Bridge

Fixed

Changed

  • Remote and Local files extractor are now merged into single FilesExtractor

Removed

  • Parquet append to existing files

Deprecated

Security


Description

github-actions[bot] commented 2 days ago

Flow PHP - Benchmarks

Results of the benchmarks from this PR are compared with the results from 1.x branch.

Extractors ```shell +-----------------------+-------------------+------+-----+------------------+-------------------+----------------+ | benchmark | subject | revs | its | mem_peak | mode | rstdev | +-----------------------+-------------------+------+-----+------------------+-------------------+----------------+ | CSVExtractorBench | bench_extract_10k | 1 | 3 | 5.175mb -24.44% | 506.404ms +50.73% | ±0.19% +34.73% | | JsonExtractorBench | bench_extract_10k | 1 | 3 | 5.208mb -24.29% | 1.060s -0.71% | ±0.77% -65.98% | | ParquetExtractorBench | bench_extract_10k | 1 | 3 | 135.881mb -0.37% | 738.818ms -1.98% | ±0.15% -91.04% | | TextExtractorBench | bench_extract_10k | 1 | 3 | 4.965mb -26.09% | 34.019ms -5.17% | ±0.89% +39.21% | | XmlExtractorBench | bench_extract_10k | 1 | 3 | 4.971mb -27.25% | 432.851ms -0.11% | ±0.26% -59.21% | +-----------------------+-------------------+------+-----+------------------+-------------------+----------------+ ```
Transformers ```shell +-----------------------------+--------------------------+------+-----+------------------+-----------------+----------------+ | benchmark | subject | revs | its | mem_peak | mode | rstdev | +-----------------------------+--------------------------+------+-----+------------------+-----------------+----------------+ | RenameEntryTransformerBench | bench_transform_10k_rows | 1 | 3 | 116.272mb -0.27% | 57.839ms -2.77% | ±0.17% -87.20% | +-----------------------------+--------------------------+------+-----+------------------+-----------------+----------------+ ```
Loaders ```shell +--------------------+----------------+------+-----+------------------+------------------+-----------------+ | benchmark | subject | revs | its | mem_peak | mode | rstdev | +--------------------+----------------+------+-----+------------------+------------------+-----------------+ | CSVLoaderBench | bench_load_10k | 1 | 3 | 55.254mb -2.15% | 86.471ms +27.39% | ±1.60% +404.99% | | JsonLoaderBench | bench_load_10k | 1 | 3 | 107.628mb -1.05% | 53.281ms +3.77% | ±1.41% +127.22% | | ParquetLoaderBench | bench_load_10k | 1 | 3 | 227.051mb -0.54% | 1.406s -0.02% | ±0.98% +294.86% | | TextLoaderBench | bench_load_10k | 1 | 3 | 18.009mb -6.39% | 43.133ms +10.95% | ±1.36% +398.74% | +--------------------+----------------+------+-----+------------------+------------------+-----------------+ ```
Building Blocks ```shell +-------------------------+----------------------------+------+-----+------------------+------------------+-----------------+ | benchmark | subject | revs | its | mem_peak | mode | rstdev | +-------------------------+----------------------------+------+-----+------------------+------------------+-----------------+ | NativeEntryFactoryBench | bench_entry_factory | 1 | 3 | 116.821mb -0.26% | 501.117ms -1.26% | ±1.49% -40.10% | | NativeEntryFactoryBench | bench_entry_factory | 1 | 3 | 60.299mb -0.51% | 249.807ms -0.88% | ±0.25% -79.19% | | NativeEntryFactoryBench | bench_entry_factory | 1 | 3 | 15.233mb -2.02% | 54.932ms +1.21% | ±1.98% +8.33% | | TypeDetectorBench | bench_type_detector | 1 | 3 | 60.000mb -0.51% | 435.420ms +0.75% | ±3.03% +350.09% | | TypeDetectorBench | bench_type_detector | 1 | 3 | 14.539mb -2.12% | 85.049ms -2.00% | ±0.74% +98.56% | | RowsBench | bench_chunk_10_on_10k | 2 | 3 | 87.091mb -0.35% | 3.381ms -6.52% | ±1.44% -51.19% | | RowsBench | bench_diff_left_1k_on_10k | 2 | 3 | 102.689mb -0.30% | 187.121ms -0.07% | ±0.64% -20.44% | | RowsBench | bench_diff_right_1k_on_10k | 2 | 3 | 85.409mb -0.36% | 18.822ms -0.68% | ±0.63% -39.48% | | RowsBench | bench_drop_1k_on_10k | 2 | 3 | 88.331mb -0.35% | 1.786ms -5.57% | ±2.55% -5.78% | | RowsBench | bench_drop_right_1k_on_10k | 2 | 3 | 88.331mb -0.35% | 1.727ms -13.67% | ±0.85% -63.78% | | RowsBench | bench_entries_on_10k | 2 | 3 | 85.443mb -0.36% | 2.571ms -16.14% | ±0.98% -62.04% | | RowsBench | bench_filter_on_10k | 2 | 3 | 85.972mb -0.36% | 17.518ms +6.96% | ±0.64% -59.18% | | RowsBench | bench_find_on_10k | 2 | 3 | 85.972mb -0.36% | 16.991ms -0.87% | ±0.82% -30.37% | | RowsBench | bench_find_one_on_10k | 10 | 3 | 83.876mb -0.37% | 1.606μs -10.47% | ±2.89% +8.16% | | RowsBench | bench_first_on_10k | 10 | 3 | 83.876mb -0.37% | 0.300μs 0.00% | ±0.00% 0.00% | | RowsBench | bench_flat_map_on_1k | 2 | 3 | 93.226mb -0.33% | 13.109ms +6.36% | ±3.14% +468.00% | | RowsBench | bench_map_on_10k | 2 | 3 | 122.597mb -0.25% | 60.561ms -1.89% | ±1.52% +190.57% | | RowsBench | bench_merge_1k_on_10k | 2 | 3 | 86.492mb -0.36% | 1.297ms -17.92% | ±2.96% -14.30% | | RowsBench | bench_partition_by_on_10k | 2 | 3 | 89.838mb -0.34% | 62.800ms -2.90% | ±0.88% +5.64% | | RowsBench | bench_remove_on_10k | 2 | 3 | 88.593mb -0.35% | 4.108ms -6.43% | ±1.11% -40.97% | | RowsBench | bench_sort_asc_on_1k | 2 | 3 | 84.019mb -0.44% | 38.874ms -2.94% | ±1.06% +29.01% | | RowsBench | bench_sort_by_on_1k | 2 | 3 | 84.020mb -0.44% | 39.467ms -2.13% | ±0.37% -30.17% | | RowsBench | bench_sort_desc_on_1k | 2 | 3 | 84.019mb -0.44% | 39.557ms -1.29% | ±1.07% -63.22% | | RowsBench | bench_sort_entries_on_1k | 2 | 3 | 86.317mb -0.36% | 7.324ms -1.08% | ±0.77% +23.32% | | RowsBench | bench_sort_on_1k | 2 | 3 | 83.876mb -0.37% | 28.362ms -2.33% | ±0.21% -84.02% | | RowsBench | bench_take_1k_on_10k | 10 | 3 | 83.876mb -0.37% | 13.900μs +2.25% | ±0.00% -100.00% | | RowsBench | bench_take_right_1k_on_10k | 10 | 3 | 83.876mb -0.37% | 16.483μs +0.43% | ±1.30% +127.29% | | RowsBench | bench_unique_on_1k | 2 | 3 | 102.690mb -0.30% | 189.816ms -1.39% | ±0.42% -76.47% | +-------------------------+----------------------------+------+-----+------------------+------------------+-----------------+ ```