Open ddelange opened 1 year ago
This is nice! Hope this will be merged soon!
Just ran into issues with my library being not usable by Mac and Windows users because I rely on python-magic
. If there are wheels, I don't need to find a workaround or replace the library :)
python-magic-bin
did not work for some of them, by the way.
This is huge, thank you! Apology for the delay I thought I'd commented earlier but guess not. I'll look this over soon; I didn't quite understand how bad the binary dep situation was expecially on windows.
@ahupp @stumpylog could we have this one merged (and released) by the end of the year please ?
I don't have anything to do with the project, so I can't help. I just dropped by to look at something else and happened to notice some things
One month has passed since last activity, I think this PR is badly needed to fix the issues with windows. Would it be possible to move things forward or at least give an estimated schedule?
Wheels now get installed and tested inside the cibuildwheel docker environment. I also tested the latest wheels on google colab:
# pip install python-magic --force-reinstall --find-links https://github.com/ddelange/python-magic/releases/expanded_assets/0.4.28
>>> import magic
>>> magic.Magic(mime=True).from_buffer(b'\x00\x00\x00\x1cftypisom\x00\x00\x02\x00isomiso2mp41\x00')
'video/mp4'
- Are uploaded as GitHub Release assets, as well as upload to PyPI, whenever a GitHub Release is (pre)released.
- For this, you still need to add this repo as trusted publisher to https://pypi.org/manage/project/python-magic/settings/publishing/
@ahupp one last TODO for you before/after merging ^
@ahupp Any update?
Thank you @ddelange
I confirm that pip install python-magic --force-reinstall --find-links https://github.com/ddelange/python-magic/releases/expanded_assets/0.4.28
works for me on MSW.
On Linux, I feel that libmagic
is easy enough to install from your package manager (if it is not already installed in your base installation) so I am not certain we should include a copy of the binary libraries by default in all cases.
Perhaps to get the best of both worlds we should create an optional dependency on binaries for libmagic and then clearly specify how to install:
pip install 'python-magic[libmagic]'
to install with bundled libmagic (recommended method on MSW) pip install python-magic
to install using a pre-existing libmagic
installation (recommended method on Linux). I have not enough experience on MAC to know which to recommend as the default.
What do you think @ddelange and @ahupp ?
hi @jmoraleda :wave: please see the discussion here
if a package has reasons to refrain from binary distributions (bdist_wheel, .whl) e.g. for linux, it's as simple as not publishing wheels for linux. then, on linux systems, pip will fall back to the source distribution (sdist, tar.gz). i personally dont see any reason to refrain from publishing linux wheels.
if you want to circumvent the binary distribution because you want to use your own libmagic, you can add --no-binary python-magic
to your pip install command to force a fallback to source distribution
Thank you again @ddelange. Definitely having MSW wheels available is a huge plus. With respect to Linux, as long as there is a mechanism for installing using the system libraries and is prominently mentioned in the installation page, I agree there is no reason from not publishing linux wheels as well.
Also, in debian like distributions python-magic is actually packaged by the system maintainers to already refer to the installed libraries, so if one installs from there with the distribution package manager, then avoiding duplicate binaries is already achieved, so even more of a reason to include wheels for everything in the pypi distribution.
found out that the manylinux2014
images are pretty old and hence were getting a pretty old libmagic:
Package file-libs-5.11-37.el7.x86_64 already installed and latest version
upgraded now to manylinux_2_28
(rhel/almalinux 8):
Package file-libs-5.33-25.el8.x86_64 is already installed.
the musllinux wheels are getting libmagic 5.45 ref https://pkgs.alpinelinux.org/package/edge/main/x86/libmagic
could consider building from source in the manylinux images, to get 5.45 instead of 5.33...
@ahupp building from source done in fe62a26
- Are uploaded as GitHub Release assets, as well as upload to PyPI, whenever a GitHub Release is (pre)released.
- For this, you still need to add this repo as trusted publisher to https://pypi.org/manage/project/python-magic/settings/publishing/
one last TODO for you before/after merging ^
Was looking for exactly something to simplify the binary installations for python-magic and came across this PR.
Looks like it is very close to the finish line ! Anything I could do to help ? If there is a place I can fetch the wheels and test, I am happy to try them out :)
Hi @AbdealiLoKo :wave: to fetch the wheels, see the pip install
command in the PR description. From my side this is PR ready for review.
So, I tried this out:
$ venv/bin/pip install python-magic
Successfully installed python-magic-0.4.27
$ venv/bin/ipython
In [1]: import magic
In [2]: magic.detect_from_content(open('./README.md', 'rb').read(2048))
Out[2]: FileMagic(mime_type='text/plain', encoding='us-ascii', name='ASCII text')
$ venv/bin/pip uninstall python-magic
Successfully uninstalled python-magic-0.4.27
$ venv/bin/pip install python-magic --force-reinstall --find-links https://github.com/ddelange/python-magic/releases/expanded_assets/0.4.28.post6
Successfully installed python-magic-0.4.28
$ venv/bin/ipython
In [2]: magic.detect_from_content(open('./README.md', 'rb').read(2048))
---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
Cell In[2], line 1
----> 1 magic.detect_from_content(open('./README.md', 'rb').read(2048))
File ~/corridor-platforms/venv/lib/python3.10/site-packages/magic/__init__.py:479, in _add_compat.<locals>.deprecation_wrapper.<locals>._(*args, **kwargs)
473 def _(*args, **kwargs):
474 warnings.warn(
475 "Using compatibility mode with libmagic's python binding. "
476 "See https://github.com/ahupp/python-magic/blob/master/COMPAT.md for details.",
477 PendingDeprecationWarning)
--> 479 return fn(*args, **kwargs)
File ~/corridor-platforms/venv/lib/python3.10/site-packages/magic/compat.py:286, in detect_from_content(byte_content)
280 def detect_from_content(byte_content):
281 '''Detect mime type, encoding and file type from bytes
282
283 Returns a `FileMagic` namedtuple.
284 '''
--> 286 return _create_filemagic(mime_magic.buffer(byte_content),
287 none_magic.buffer(byte_content))
File ~/corridor-platforms/venv/lib/python3.10/site-packages/magic/compat.py:248, in _create_filemagic(mime_detected, type_detected)
247 def _create_filemagic(mime_detected, type_detected):
--> 248 splat = mime_detected.split('; ')
249 mime_type = splat[0]
250 if len(splat) == 2:
AttributeError: 'NoneType' object has no attribute 'split'
This error comes for all the files I have tried so far:
can you try from_buffer
instead of detect_from_content
?
see also https://github.com/ahupp/python-magic/blob/0.4.27/magic/__init__.py#L427-L430 and https://github.com/ahupp/python-magic/blob/0.4.27/COMPAT.md?plain=1#L13
@AbdealiLoKo I've released 0.4.28.post7 containing eba05b6, and verified that the compat module now correctly loads the magic.mgc
file bundled in the wheel.
In compat.py, mime_magic.load()
and none_magic.load()
were silently returning a -1, because load_lib()
now prefers the (recent) libmagic bundled in the wheel over the (older) system libmagic, which then caused magic.mgc
not to be found in the standard paths. The fix now points to the bundled magic.mgc
in this case.
On Google Colab:
# pip install python-magic --force-reinstall --find-links https://github.com/ddelange/python-magic/releases/expanded_assets/0.4.28.post7
>>> import magic
>>> magic.detect_from_content(b'\x00\x00\x00\x1cftypiso5\x00\x00\x00\x01isomiso5hlsf\x00\x00')
FileMagic(mime_type='video/mp4', encoding='binary', name='ISO Media, MP4 Base Media v5 ')
>>> magic.from_buffer(b'\x00\x00\x00\x1cftypiso5\x00\x00\x00\x01isomiso5hlsf\x00\x00')
'ISO Media, MP4 Base Media v5 '
Thanks for catching this!
What are the remaining TODOs on this pull request?
- Are uploaded as GitHub Release assets, as well as upload to PyPI, whenever a GitHub Release is (pre)released.
- For this, you still need to add this repo as trusted publisher to https://pypi.org/manage/project/python-magic/settings/publishing/
The compat part (which is what I generally use to avoid confusion with the non-pypi magic - works with post7 :) Thanks ! This works well for me.
+1 to merging and releasing it !
Many people have commented on this pull request but there are very few reviews.
When you think a pull request is useful and is ready to be merged, please consider giving it a positive review.
Every check mark โ๏ธ at the top right of this page gives project maintainers confidence that the proposed changes have been read through and deemed both useful and safe to merge into the codebase. Lots of ๐ and "what is the ETA?" comments are easier for maintainers to ignore than โ๏ธโ๏ธโ๏ธโ๏ธโ๏ธ from several different reviewers.
Anyone can review a pull request on GitHub. To do so here:
Files changed
tab and read through each file carefully looking for potential issues.Review changes
button.Approve
(or one of the other options) and add comments only if they have not already been stated in the PR.Submit review
so that your โ๏ธ will be added to the list. thanks for the reviews!
There is wonderful work in this pull request and it has four positive reviews. Unfortunately, it has been open for ~9 months without landing. Perhaps it would be useful to break it into three separate PRs that are easier to review and land.
One PR that deals only with macOS and another that deals only with Windows might be easier to land. Once that is done then this PR could be rebased to deal only with Linux and friends. I know it is extra work but I sense that new momentum is needed.
Hi! Great PR, is there any real contention about it beyond the scope of OS/distro support in add_libmagic.sh
and README.md
?
Most users will only ever want the wheels from this repo. In particular, this looks like it will solidly cover usage in Docker images. Anyone who wants to use the source version and provide libmagic themselves, probably knows best how to do the latter in their environment. given info on where this package will look for the library. (Those who package python-magic for their Linux distro of choice will already have a preferred way of ensuring libmagic presence. This will probably not exactly match anything suggested in python-magic docs.)
Even for those particularly invested in having a range of setup instructions, the PR in its current state should look like a clear improvement on master, and further improvements in that are will come more easily as separate PRs (because they wonโt be tied to CI scripts).
So: how about merging this without completing libmagic setup instructions for every possible platform? Seems like it already does what the PR title says.
So: how about merging this without completing libmagic setup instructions for every possible platform? Seems like it already does what the PR title says.
Totally for it. My suggestions just show the way to improve it, but I would merge it "as is" since it already provides a huge value. "Perfect is the enemy of good".
@ahupp hopefully you can find some time to review this most discussed PR in the python-magic
's history :)
@ahupp this also fixes failing CI on master (looks like the github actions linux runner image no longer ships libmagic by default)
@ahupp @ddelange Any update on this PR and the release? ๐ Is there anything that needs to be tested or are we waiting on anything other than a review?
@ahupp please review, merge, add trusted publisher (see PR description) and release :pray:
Also would love to see this merged, it's causing some issues for windows users of our package (which depends on mocket which depends on python-magic). Thank you! ๐
@ashtonpaul @ddelange @wilhelmklopp @ferdnyc https://github.com/ahupp/python-magic/pull/294#issuecomment-2129493694
@ahupp Speaking personally, this is a big, complex PR that I don't really feel qualified to approve given the scope and nature of all the changes here. I guess I'm in agreement with your https://github.com/ahupp/python-magic/pull/294#issuecomment-2191597272
Perhaps it would be useful to break it into three separate PRs that are easier to review and land.
One PR that deals only with macOS and another that deals only with Windows might be easier to land. Once that is done then this PR could be rebased to deal only with Linux and friends. I know it is extra work but I sense that new momentum is needed.
OTOH, the PR has received a number of approvals as-is, so there's ample indication that others are satisfied with it in its current form. I left some comments, which were addressed, so my input certainly shouldn't be viewed as disapproval by any means.
Hi @ahupp ๐
This PR builds self-contained wheels as discussed in #233. For Windows users, this renders
python-magic-bin
from @julian-r obsolete.pip install these wheels
pip can install from GitHub Release assets from my fork:
The wheels:
.dylib
on mac,.so
on nix,.ddl
on win) - no additional user action neededwheels.yml
as trusted publisher hereCI/CD
dists build with official
cibuildwheel
on GitHub Actions, and they build in parallel:fix #137, fix #288, fix #225, fix #276, fix #248, fix #87, fix #139, fix #233, fix #73, fix #60, fix #34, fix #293, fix #233, fix #278, fix #262, fix #248, fix #238, fix #145, fix #61, fix #12, fix #295, fix #311, fix #312, fix #313, fix #321, fix #332, fix #249, fix #333