Open fzyzcjy opened 2 years ago
@fzyzcjy Thanks for the benchmarking. The latency in engine to replicas results in the slow performance. We are still working on performance improvement. Currently, Longhorn is suitable for the SSD storage.
@derekbit Hi thanks for the reply. So may I know when will it be faster? It seems to be the most lightweight storage provisioner and is quite promising - except the speed.
Argh. It drives me insane how slow Longhorn is on HDD's. It makes my £5,000 setup feel like a tiny investment, and then told I need to fork out several hundred for a few SSD's... I feel like x3 1TB 7200 RPM disks in a RAID 0 doesn't help much at all. Simply the way Longhorn is designed by COMPELTETLY rebuilding the data any time there's a problem makes me long for Ceph, why can't it checksum/reuse actual data bits!? But because of Monitor instability and needing 3/5 monitors and the fact that it blew up on me and has only mirroring to external Ceph cluster made me settle for Longhorn, begrudgingly
LD;TR Ceph works fine on HDD's and 1GB networking, no problems, except for the 3 monitor issue making me lose data when they lost quorum.
@voarsh2 Ah... So how did you solve it?
@voarsh2 Ah... So how did you solve it?
I wish I did...... The only thing that has made me stick with Longhorn is the S3 compatible backups.... and there's no quorum.... otherwise I'd be rooting for Ceph.
I will review SSD's in the future, but it simply isn't in my budget, especially since I have several TB's of data. Even 2-10GB volumes take forever to rebuild. The only solution I have at the moment is to split up your volume data into "sub" volumes and mount volumes within volumes so that the data per volume is smaller which can help with restoring and rebuilding. You'd just need to remove the bad volume if you're completely restoring and the workload can (depends on your app) continue with that data gone.... Databases (especially Maria Galera) is especially painful as Longhorn doesn't support TRIM, so when it drops and recreates table the size just grows and grows. I find myself frequently deleting volumes and recreating them to reclaim space and rebuild times....
That sounds like a painful experience... :(
@fzyzcjy Thanks for the benchmarking. The latency in engine to replicas results in the slow performance. We are still working on performance improvement. Currently, Longhorn is suitable for the SSD storage.
No, it's not. It's insanely slow on SSD as well. I just benchmarked. Rsync results.
sent 57,683 bytes received 337 bytes 1,172.12 bytes/sec
sent 53,839 bytes received 337 bytes 976.14 bytes/sec
No, it is not the network speed. No, it is not SSD problem
Without Longhorn I get +200MB/sec transfer rates
This pretty much sums up my issue here, getting some strange results when I run tests on host disk and the LVM for Longhorn (which is also on the same physical disk): https://github.com/longhorn/longhorn/discussions/4186
Hi thanks for the storage solution! However, it seems very very slow. For example, I run sysbench on a folder that is mounted via
hostPath
:Then, I run the same thing, this time on a PersistentVolume, whose PVC has storage class handled by longhorn:
As you can see, it is much much slower.
I wonder what am I doing wrong?