Improve performance of plotly dendrogram implementation

Here is the essence of the code to draw a dendrogram efficiently with plotly:

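import sys

import numpy as np
import pandas as pd
import plotly.express as px
import scipy.cluster.hierarchy

# This is needed to avoid a RecursionError on some haplotype clustering analyses
# with larger numbers of haplotypes.
sys.setrecursionlimit(10000)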

# Compute pairwise distances.
dist, phased_samples, n_snps = ag3.haplotype_pairwise_distances(...)

# Perform hierarchical clustering.
Z = scipy.cluster.hierarchy.linkage(dist, method=linkage_method)

# Get scipy to build a dendrogram but not plot it.
dend = scipy.cluster.hierarchy.dendrogram(Z, count_sort=True, no_plot=True)

# Compile the line coordinates into a single dataframe.
px_segments_x = []
px_segments_y = []
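for ik, dk in zip(dend["icoord"], dend["dcoord"]):
    # Appending None after each segment leaves a gap, so the segments are
    # drawn as separate lines even though they share a single trace.
    px_segments_x += ik + [None]
    px_segments_y += dk + [None]
df_px_segments = pd.DataFrame({"x": px_segments_x, "y": px_segments_y})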

# Convert X coordinates to haplotype indices. Scipy places the leaves at
# x = 5, 15, 25, ..., so this maps them to 0, 1, 2, ...
df_px_segments["x"] = (df_px_segments["x"] - 5) / 10

# Plot the lines.
fig = px.line(df_px_segments, x="x", y="y")

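# Optionally, add a scatter trace for the leaves too. Setting mode="markers"
# stops plotly from joining the leaves with a line.
nl = len(dend["leaves"])
fig.add_scatter(x=np.arange(nl), y=[-1] * nl, mode="markers")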
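
The segment-flattening step can be tried on its own, without scipy or plotly installed. The following is a minimal sketch under a few assumptions: the helper name `flatten_segments` and its `leaf_offset` and `leaf_spacing` parameters are hypothetical and not part of any library, and the coordinates are hand-written in the layout that scipy's dendrogram uses (leaves at x = 5, 15, 25, ...) rather than computed from real haplotype data.

```python
def flatten_segments(icoord, dcoord, leaf_offset=5.0, leaf_spacing=10.0):
    """Flatten per-link dendrogram coordinates into two parallel lists.

    Hypothetical helper for illustration. Each link contributes its four
    points followed by a None, which acts as a break between segments.
    X values are rescaled so that leaves fall on 0, 1, 2, ...
    """
    xs = []
    ys = []
    for ik, dk in zip(icoord, dcoord):
        # Rescale x from scipy's 5, 15, 25, ... layout to leaf indices.
        xs.extend((x - leaf_offset) / leaf_spacing for x in ik)
        xs.append(None)
        ys.extend(dk)
        ys.append(None)
    return xs, ys


# Hand-written coordinates for a three-leaf tree (assumed example data):
# leaves 0 and 1 merge at height 1, then that cluster merges with leaf 2
# at height 2.
icoord = [[5.0, 5.0, 15.0, 15.0], [10.0, 10.0, 25.0, 25.0]]
dcoord = [[0.0, 1.0, 1.0, 0.0], [1.0, 2.0, 2.0, 0.0]]

xs, ys = flatten_segments(icoord, dcoord)
```

The two lists stay the same length, with a None at the same position in each, so they can be passed as the x and y columns of a single line trace, as in the code above.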

malariagen/malariagen-data-python#456