PyDataBlog / ParallelKMeans.jl

Parallel & lightning fast implementation of available classic and contemporary variants of the KMeans clustering algorithm
MIT License

Public release v1.0.0 #112

Closed PyDataBlog closed 3 years ago

PyDataBlog commented 3 years ago

Public release of package 🎉

github-actions[bot] commented 3 years ago
Benchmark result

# Judge result

# Benchmark Report for */home/runner/work/ParallelKMeans.jl/ParallelKMeans.jl*

## Job Properties

* Time of benchmarks:
    - Target: 24 May 2021 - 22:10
    - Baseline: 24 May 2021 - 22:11
* Package commits:
    - Target: 583d1a
    - Baseline: fb11db
* Julia commits:
    - Target: 44fa15
    - Baseline: 44fa15
* Julia command flags:
    - Target: None
    - Baseline: None
* Environment variables:
    - Target: None
    - Baseline: None

## Results

A ratio greater than `1.0` denotes a possible regression (marked with :x:), while a ratio less than `1.0` denotes a possible improvement (marked with :white_check_mark:). Only significant results - results that indicate possible regressions or improvements - are shown below (thus, an empty table means that all benchmark results remained invariant between builds).

| ID                                       | time ratio | memory ratio |
|------------------------------------------|------------|--------------|

## Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

- `["distance"]`
- `["kmeans"]`

## Julia versioninfo

### Target

```
Julia Version 1.4.2
Commit 44fa15b150* (2020-05-23 18:35 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 20.04.2 LTS
  uname: Linux 5.4.0-1047-azure #49-Ubuntu SMP Thu Apr 22 14:30:37 UTC 2021 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz:
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      20733 s         19 s       1172 s       7310 s          0 s
       #2  2394 MHz       5603 s          6 s       1187 s      22342 s          0 s
  Memory: 6.791339874267578 GB (3624.0625 MB free)
  Uptime: 297.0 sec
  Load Avg:  1.0  0.72509765625  0.3369140625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, haswell)
```

### Baseline

```
Julia Version 1.4.2
Commit 44fa15b150* (2020-05-23 18:35 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 20.04.2 LTS
  uname: Linux 5.4.0-1047-azure #49-Ubuntu SMP Thu Apr 22 14:30:37 UTC 2021 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz:
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      21596 s         19 s       1191 s      15771 s          0 s
       #2  2394 MHz      14101 s          6 s       1222 s      23164 s          0 s
  Memory: 6.791339874267578 GB (3697.8984375 MB free)
  Uptime: 390.0 sec
  Load Avg:  1.04296875  0.81494140625  0.4072265625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, haswell)
```

---

# Target result

# Benchmark Report for */home/runner/work/ParallelKMeans.jl/ParallelKMeans.jl*

## Job Properties

* Time of benchmark: 24 May 2021 - 22:10
* Package commit: 583d1a
* Julia commit: 44fa15
* Julia command flags: None
* Environment variables: None

## Results

Below is a table of this job's results, obtained by running the benchmarks. The values listed in the `ID` column have the structure `[parent_group, child_group, ..., key]`, and can be used to index into the BaseBenchmarks suite to retrieve the corresponding benchmarks. The percentages accompanying time and memory values in the below table are noise tolerances. The "true" time/memory value for a given benchmark is expected to fall within this percentage of the reported value. An empty cell means that the value was zero.

| ID                                       | time            | GC time | memory           | allocations |
|------------------------------------------|----------------:|--------:|-----------------:|------------:|
| `["distance", "100kx10"]`                | 716.400 μs (5%) |         |                  |             |
| `["distance", "100kx3"]`                 | 259.400 μs (5%) |         |                  |             |
| `["kmeans", "10x100_000x10x1 Lloyd"]`    |    2.054 s (5%) |         |  785.67 KiB (1%) |          20 |
| `["kmeans", "10x100_000x10x1 Hammerly"]` |    1.623 s (5%) |         |    2.29 MiB (1%) |          23 |
| `["kmeans", "10x100_000x10x2 Lloyd"]`    |    2.032 s (5%) |         | 1011.20 KiB (1%) |        2545 |
| `["kmeans", "10x100_000x10x2 Hammerly"]` |    1.637 s (5%) |         |    3.47 MiB (1%) |       12554 |
| `["kmeans", "10x100_000x3x1 Lloyd"]`     | 153.176 ms (5%) |         |  783.84 KiB (1%) |          20 |
| `["kmeans", "10x100_000x3x1 Hammerly"]`  | 207.334 ms (5%) |         |    2.29 MiB (1%) |          23 |
| `["kmeans", "10x100_000x3x2 Lloyd"]`     | 152.390 ms (5%) |         |  824.89 KiB (1%) |         478 |
| `["kmeans", "10x100_000x3x2 Hammerly"]`  | 206.656 ms (5%) |         |    2.47 MiB (1%) |        1946 |

## Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

- `["distance"]`
- `["kmeans"]`

## Julia versioninfo

```
Julia Version 1.4.2
Commit 44fa15b150* (2020-05-23 18:35 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 20.04.2 LTS
  uname: Linux 5.4.0-1047-azure #49-Ubuntu SMP Thu Apr 22 14:30:37 UTC 2021 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz:
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      20733 s         19 s       1172 s       7310 s          0 s
       #2  2394 MHz       5603 s          6 s       1187 s      22342 s          0 s
  Memory: 6.791339874267578 GB (3624.0625 MB free)
  Uptime: 297.0 sec
  Load Avg:  1.0  0.72509765625  0.3369140625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, haswell)
```

---

# Baseline result

# Benchmark Report for */home/runner/work/ParallelKMeans.jl/ParallelKMeans.jl*

## Job Properties

* Time of benchmark: 24 May 2021 - 22:11
* Package commit: fb11db
* Julia commit: 44fa15
* Julia command flags: None
* Environment variables: None

## Results

Below is a table of this job's results, obtained by running the benchmarks. The values listed in the `ID` column have the structure `[parent_group, child_group, ..., key]`, and can be used to index into the BaseBenchmarks suite to retrieve the corresponding benchmarks. The percentages accompanying time and memory values in the below table are noise tolerances. The "true" time/memory value for a given benchmark is expected to fall within this percentage of the reported value. An empty cell means that the value was zero.

| ID                                       | time            | GC time | memory           | allocations |
|------------------------------------------|----------------:|--------:|-----------------:|------------:|
| `["distance", "100kx10"]`                | 716.300 μs (5%) |         |                  |             |
| `["distance", "100kx3"]`                 | 259.300 μs (5%) |         |                  |             |
| `["kmeans", "10x100_000x10x1 Lloyd"]`    |    2.055 s (5%) |         |  785.67 KiB (1%) |          20 |
| `["kmeans", "10x100_000x10x1 Hammerly"]` |    1.670 s (5%) |         |    2.29 MiB (1%) |          23 |
| `["kmeans", "10x100_000x10x2 Lloyd"]`    |    2.104 s (5%) |         | 1011.20 KiB (1%) |        2545 |
| `["kmeans", "10x100_000x10x2 Hammerly"]` |    1.653 s (5%) |         |    3.47 MiB (1%) |       12554 |
| `["kmeans", "10x100_000x3x1 Lloyd"]`     | 150.088 ms (5%) |         |  783.84 KiB (1%) |          20 |
| `["kmeans", "10x100_000x3x1 Hammerly"]`  | 204.813 ms (5%) |         |    2.29 MiB (1%) |          23 |
| `["kmeans", "10x100_000x3x2 Lloyd"]`     | 152.480 ms (5%) |         |  824.89 KiB (1%) |         478 |
| `["kmeans", "10x100_000x3x2 Hammerly"]`  | 207.618 ms (5%) |         |    2.47 MiB (1%) |        1946 |

## Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

- `["distance"]`
- `["kmeans"]`

## Julia versioninfo

```
Julia Version 1.4.2
Commit 44fa15b150* (2020-05-23 18:35 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 20.04.2 LTS
  uname: Linux 5.4.0-1047-azure #49-Ubuntu SMP Thu Apr 22 14:30:37 UTC 2021 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz:
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      21596 s         19 s       1191 s      15771 s          0 s
       #2  2394 MHz      14101 s          6 s       1222 s      23164 s          0 s
  Memory: 6.791339874267578 GB (3697.8984375 MB free)
  Uptime: 390.0 sec
  Load Avg:  1.04296875  0.81494140625  0.4072265625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, haswell)
```

---

# Runtime information

| Runtime Info | |
|:--|:--|
| BLAS #threads | 2 |
| `BLAS.vendor()` | `openblas64` |
| `Sys.CPU_THREADS` | 2 |

`lscpu` output:

    Architecture:                    x86_64
    CPU op-mode(s):                  32-bit, 64-bit
    Byte Order:                      Little Endian
    Address sizes:                   46 bits physical, 48 bits virtual
    CPU(s):                          2
    On-line CPU(s) list:             0,1
    Thread(s) per core:              1
    Core(s) per socket:              2
    Socket(s):                       1
    NUMA node(s):                    1
    Vendor ID:                       GenuineIntel
    CPU family:                      6
    Model:                           63
    Model name:                      Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
    Stepping:                        2
    CPU MHz:                         2394.457
    BogoMIPS:                        4788.91
    Hypervisor vendor:               Microsoft
    Virtualization type:             full
    L1d cache:                       64 KiB
    L1i cache:                       64 KiB
    L2 cache:                        512 KiB
    L3 cache:                        30 MiB
    NUMA node0 CPU(s):               0,1
    Vulnerability Itlb multihit:     KVM: Vulnerable
    Vulnerability L1tf:              Mitigation; PTE Inversion
    Vulnerability Mds:               Mitigation; Clear CPU buffers; SMT Host state unknown
    Vulnerability Meltdown:          Mitigation; PTI
    Vulnerability Spec store bypass: Vulnerable
    Vulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization
    Vulnerability Spectre v2:        Mitigation; Full generic retpoline, STIBP disabled, RSB filling
    Vulnerability Srbds:             Not affected
    Vulnerability Tsx async abort:   Not affected
    Flags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm invpcid_single pti fsgsbase bmi1 avx2 smep bmi2 erms invpcid xsaveopt md_clear

| Cpu Property       | Value                                                   |
|:------------------ |:------------------------------------------------------- |
| Brand              | Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz               |
| Vendor             | :Intel                                                  |
| Architecture       | :Haswell                                                |
| Model              | Family: 0x06, Model: 0x3f, Stepping: 0x02, Type: 0x00   |
| Cores              | 2 physical cores, 2 logical cores (on executing CPU)    |
|                    | No Hyperthreading detected                              |
| Clock Frequencies  | Not supported by CPU                                    |
| Data Cache         | Level 1:3 : (32, 256, 30720) kbytes                     |
|                    | 64 byte cache line size                                 |
| Address Size       | 48 bits virtual, 46 bits physical                       |
| SIMD               | 256 bit = 32 byte max. SIMD vector size                 |
| Time Stamp Counter | TSC is accessible via `rdtsc`                           |
|                    | TSC increased at every clock cycle (non-invariant TSC)  |
| Perf. Monitoring   | Performance Monitoring Counters (PMC) are not supported |
| Hypervisor         | Yes, Microsoft                                          |
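
The judge table in the report is empty, which the report says means every benchmark stayed invariant between the two commits. A minimal Python sketch of that check follows. It assumes the rule is "regression if the target/baseline time ratio exceeds 1 + tolerance, improvement if it falls below 1 - tolerance", with the 5% time tolerance shown in the Target and Baseline tables. The names `judge` and `judge_all` are illustrative only and are not part of PkgBenchmark or BenchmarkTools; the times are copied from the two result tables.

```python
# Times in seconds, copied from the Target (583d1a) and Baseline (fb11db) tables.
TARGET = {
    "distance 100kx10": 716.400e-6,
    "distance 100kx3": 259.400e-6,
    "kmeans 10x100_000x10x1 Lloyd": 2.054,
    "kmeans 10x100_000x10x1 Hammerly": 1.623,
    "kmeans 10x100_000x10x2 Lloyd": 2.032,
    "kmeans 10x100_000x10x2 Hammerly": 1.637,
    "kmeans 10x100_000x3x1 Lloyd": 153.176e-3,
    "kmeans 10x100_000x3x1 Hammerly": 207.334e-3,
    "kmeans 10x100_000x3x2 Lloyd": 152.390e-3,
    "kmeans 10x100_000x3x2 Hammerly": 206.656e-3,
}
BASELINE = {
    "distance 100kx10": 716.300e-6,
    "distance 100kx3": 259.300e-6,
    "kmeans 10x100_000x10x1 Lloyd": 2.055,
    "kmeans 10x100_000x10x1 Hammerly": 1.670,
    "kmeans 10x100_000x10x2 Lloyd": 2.104,
    "kmeans 10x100_000x10x2 Hammerly": 1.653,
    "kmeans 10x100_000x3x1 Lloyd": 150.088e-3,
    "kmeans 10x100_000x3x1 Hammerly": 204.813e-3,
    "kmeans 10x100_000x3x2 Lloyd": 152.480e-3,
    "kmeans 10x100_000x3x2 Hammerly": 207.618e-3,
}


def judge(target, baseline, tolerance=0.05):
    """Return (ratio, verdict) for one benchmark (assumed classification rule)."""
    ratio = target / baseline
    if ratio - tolerance > 1.0:
        return ratio, "regression"
    if ratio + tolerance < 1.0:
        return ratio, "improvement"
    return ratio, "invariant"


def judge_all(target, baseline, tolerance=0.05):
    """Judge every benchmark ID that appears in both result sets."""
    return {
        key: judge(target[key], baseline[key], tolerance)
        for key in target
        if key in baseline
    }


if __name__ == "__main__":
    for key, (ratio, verdict) in judge_all(TARGET, BASELINE).items():
        print(f"{key:34s} ratio={ratio:.3f}  {verdict}")
```

Under these assumptions every ratio stays inside the 5% band, which is consistent with the empty judge table. The largest deviation is `10x100_000x10x2 Lloyd`, where the target is roughly 3.4% faster than the baseline.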