Open razvanm opened 7 years ago
I'd be quite happy if Bazel adds official support for Python Zip Applications that renders subpar obsolete, but so far the .zip support is incomplete, only available to Windows by default, and makes some interesting design choices.
I'm using --build_python_zip
in Linux and I didn't notice any issues yet. Can you please elaborate in which way the .zip support is incomplete?
I actually find it useful to be able to specify the .par
file as a dependency in other rules, such as docker_build
. IIUC, that's not possible with the --build_python_zip
flag.
As per Bazel devs:
--build_python_zip
is ongoing.@duggelz Regarding (4), do you have a link to where that discussion is ongoing?
To the point of @hwright, we have been using the zipped binaries as file dependencies in docker_build
rules without any problems.
After spending some time reviewing subpar vs --build_python_zip
, here's a summary of the main differences I've found:
The standard rules don't give you a way to specify whether or not you want a zip on a per-target basis (bazelbuild/bazel#3530). This shouldn't be hard to fix.
The standard rules' zip file is self-extracting, so it presents the same view of runfiles as any other standard ruleset. In contrast, subpar is only self-extracting when zip_safe
is false, so by default you can't use the standard runfiles libraries with subpar and have to use pkg_resources
instead.
The standard rules use a stub file, executed by #!/usr/bin/env python, which unpacks the runfiles, locates the second-stage interpreter, and uses it to execute the payload. Subpar bypasses the stub file and uses the same interpreter for both its own initialization and the payload. This means that subpar can't be used with in-build runtimes (#98). This might be fixed by adding a bootstrapping stage to subpar's initialization or unifying its init logic with the standard rules' stub script.
From my point of view, the most compelling path forward is to add zips as implicit outputs in the standard rules and deprecate subpar. I do wonder if there's anyone really depending on features in subpar they can't otherwise get.
@siddharthab Do you have an example where you use rules_docker and the output of --build_python_zip
?
@brandjon I agree, I feel like if py_binary had zipped output, then subpar would no longer need to be supported.
@groodt Not sure how detailed an example you want because it's a simple point. But something like,
py_binary(
name = "bin",
...
)
container_image(
name = "img",
...
files = [":bin"],
)
works just fine for us.
Just the above is enough to package python binaries with their runfiles and our python toolchain (wrappers to ensure correct minor versions of python, etc.) into the docker image as a single zip file.
Repeated invocations of the self-extracting zip however is slow (e.g. calling the binary in a loop), and I wish that there was some documentation that gave instructions or guidance on how to extract once and then call the stub multiple times.
Thanks @siddharthab
So to make sure I understand, do you push the image with:
bazel run //example:img --build_python_zip
?
Yes. We have the option in our repo bazelrc.
Yes. We have the option in our repo bazelrc.
Interesting. Thanks! I'll have to try that. The only reason we're currently using subpar is because I couldn't get py_binary working reliably when added as files with rules_docker.
The problem I have with this argument is that your comparing subpar to zip when subpar is shell of what it was suppose to be. Subpar does not support the features that are needed. Anything not full python breaks it. (psutil, netaddress). Its unfortunate that it looks like python is going to get the shaft on this one which of course will make bazel fall out of favor for serious python.
Bazel has a
--build_python_zip
flag for build that also produces a self-contained executable binary. Could someone update theREADME.md
to clearly articulate the advantages the.par
files provide over that?Reference: https://bazel.build/designs/2016/09/05/build-python-on-windows.html