pulp / pulpcore

Pulp 3 pulpcore package https://pypi.org/project/pulpcore/
GNU General Public License v2.0

As a user, I can sync file repos that don't have a PULP_MANIFEST #4700

Open pulpbot opened 2 years ago

pulpbot commented 2 years ago

Author: @bmbouter (bmbouter)

Bugzilla: https://bugzilla.redhat.com/buglist.cgi?quicksearch=1779869
Redmine Issue: 8413, https://pulp.plan.io/issues/8413


Motivation

It would be great to have the ability to sync file repositories that support directory listing but do not have a PULP_MANIFEST file.

For example: https://releases.hashicorp.com/vault/1.3.0/

pulpbot commented 2 years ago

From: daviddavis (daviddavis) Date: 2021-03-17T14:29:46Z


This would be a great feature to have. How can we identify which files to download since directory listing pages vary? For example, compare: https://fixtures.pulpproject.org/file/ to https://releases.hashicorp.com/vault/1.3.0/.

pulpbot commented 2 years ago

From: @bmbouter (bmbouter) Date: 2021-03-17T14:49:06Z


I was thinking we'd use a parser that lets us iterate through the href items and ignore ./ and ../. I believe all of these types of directory listings use <a href> tags to link to their content.
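
A rough sketch of that idea, using only the Python standard library. This is not Pulp's implementation: `LinkCollector` and `list_files` are hypothetical names, and the filtering rules (skip `./` and `../`, skip sort-query and fragment links, skip subdirectories, keep only links on the same host under the listing's own path) are assumptions about what a reasonable first pass might look like.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in an HTML page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                self.hrefs.append(value)


def list_files(base_url, html):
    """Return absolute URLs of the files linked directly under base_url.

    Assumption: base_url ends with "/" and points at the directory listing.
    """
    collector = LinkCollector()
    collector.feed(html)

    base = urlparse(base_url)
    files = []
    for href in collector.hrefs:
        # Ignore self/parent links and column-sorting or fragment links.
        if href in (".", "..", "./", "../") or href.startswith(("?", "#")):
            continue
        url = urljoin(base_url, href)  # handles relative and absolute hrefs
        parsed = urlparse(url)
        if parsed.netloc != base.netloc:
            continue  # link to another host
        if not parsed.path.startswith(base.path):
            continue  # link outside this directory
        if parsed.path.endswith("/"):
            continue  # subdirectory (assumption: no recursion)
        if url not in files:
            files.append(url)
    return files
```

Resolving each href with `urljoin` before filtering means listings that use relative hrefs (`1.iso`) and listings that use absolute paths (`/file/1.iso`) go through the same checks, which is one way to cope with the variation between listing pages raised above.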