adamant-kite-43734
07/11/2023, 5:27 PMicy-agency-38675
07/12/2023, 12:46 AMfull-train-34126
07/13/2023, 6:02 AMicy-agency-38675
07/13/2023, 7:34 AMRAID particularly feels wrong.Why do you think it is wrong?
Longhorn conceptually to me looks a bit like a mix between Ceph and GlusterFS.Can you elaborate more on why you think LH is a mix Ceph and GlusterFS. I would recommend reading the https://longhorn.io/docs/1.5.0/concepts/ first and see how the data path work.
icy-agency-38675
07/13/2023, 7:35 AMis it more performant to have many smaller disks,Using raid can definitely increase the throughput. Or, do you mean latency or IOPS?
high-butcher-71851
07/13/2023, 3:06 PMWhen it comes to the performance of Longhorn storage algorithms, the number and size of disks can have an impact, but it ultimately depends on several factors. Longhorn is a distributed block storage solution, and its performance is influenced by various elements, such as the workload characteristics, I/O patterns, disk types, network infrastructure, and hardware capabilities.
In Longhorn, data is striped across multiple disks to improve performance and redundancy. When determining whether it is more performant to have many smaller disks or fewer larger disks, you should consider the following points:
Disk I/O Parallelism: With many smaller disks, Longhorn can distribute I/O operations across a larger number of disks, potentially allowing for better parallelism and higher overall throughput. This can be advantageous for workloads that benefit from parallelism, such as scenarios with high concurrency or random I/O patterns.
Disk Seek Time: Smaller disks typically have shorter seek times compared to larger disks. In scenarios where latency is critical, such as workloads with frequent small random reads, many smaller disks might offer better performance due to reduced seek times.
Aggregated Bandwidth: Fewer larger disks can potentially provide higher aggregated bandwidth compared to many smaller disks. This can be beneficial for workloads that are more sequential in nature or require sustained high throughput. However, keep in mind that other factors like network bandwidth can also affect the overall performance.
Resource Utilization: Having many smaller disks might lead to improved resource utilization since you can distribute the data and I/O load across multiple drives. It can also offer better fault tolerance, as the loss of a single small disk would impact a smaller portion of the data compared to a larger disk.
It's important to note that the performance difference between many smaller disks and fewer larger disks might not be substantial in all cases. It is recommended to benchmark and test your specific workload using different disk configurations to determine the optimal setup for your needs.
Additionally, Longhorn provides various storage optimizations, such as caching, snapshotting, and replication, which can impact performance as well. Considering the overall system architecture and workload characteristics will help in making an informed decision regarding the number and size of disks to achieve the desired performance goals.
icy-agency-38675
07/13/2023, 3:16 PMThe statement from ChatGPT is not correct.Copy codeIn Longhorn, data is striped across multiple disks to improve performance and redundancy.
high-butcher-71851
07/13/2023, 3:22 PMLonghorn uses a distributed block storage approach, but it does not employ data striping techniques. Instead, it creates replicas of the data across multiple disks to ensure redundancy and data availability.
Each replica of a Longhorn volume is stored on a separate disk, and the data is synchronously replicated to ensure data durability and fault tolerance. This means that the number of disks in Longhorn primarily affects the level of redundancy and fault tolerance rather than performance.
To improve performance in Longhorn, you can consider factors such as the disk type (SSD vs. HDD), network infrastructure, and workload characteristics. Longhorn provides options for configuring disk-specific parameters, such as disk scheduler and caching mechanisms, to optimize performance based on your specific requirements.
full-train-34126
07/13/2023, 4:44 PMicy-agency-38675
07/14/2023, 7:16 AMWe’re trying out longhorn with Harvester, so was trying to get my head around the best disk setup for a VM based workload.@salmon-city-57654 Do you has best practice for harvester with LH?
salmon-city-57654
07/14/2023, 4:39 PMfull-train-34126
07/14/2023, 5:10 PMfull-train-34126
07/14/2023, 8:08 PMsalmon-city-57654
07/17/2023, 2:57 PMfull-train-34126
07/19/2023, 5:30 PMsalmon-city-57654
07/20/2023, 1:53 AM