Open OrielResearchCure opened 6 years ago
The issue was with the pip version: --download was deprecated. I don't know where this needs to be mentioned or fixed.
running:
pip install pip==9.0.3
solved the issue. thanks, eila
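For anyone checking whether their environment is affected, here is a minimal sketch of a version test. It assumes that the `--download` option of `pip install` stopped working with pip 10, which matches the observation above that pinning 9.0.3 helps but is not confirmed anywhere in this thread. The helper names `pip_major` and `supports_install_download` are hypothetical.

```python
import re

# Assumption (not confirmed in this thread): `pip install --download`
# stopped working starting with pip 10.
FIRST_PIP_MAJOR_WITHOUT_DOWNLOAD = 10


def pip_major(version):
    """Return the leading integer of a pip version string such as '9.0.3'."""
    match = re.match(r"\s*(\d+)", version)
    if match is None:
        raise ValueError("unrecognized pip version: %r" % (version,))
    return int(match.group(1))


def supports_install_download(version):
    """True if this pip version is older than the assumed cutoff."""
    return pip_major(version) < FIRST_PIP_MAJOR_WITHOUT_DOWNLOAD
```

In a notebook, the installed version can be passed in with `import pip` followed by `supports_install_download(pip.__version__)`.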
@OrielResearchCure It looks like the Apache Beam Dataflow runner is trying to install its dependencies under the hood and relying on a very old version of pip to do so.
That simply will not work in Datalab as we use a newer version of pip, and not all of our packages are installed via pip anyway: most are installed via Conda.
I would classify it as a bug in the Apache Beam library, but it looks like you can work around it.
Specifically, it seems like you can circumvent that bug by manually installing the dependencies yourself.
Run the following in a Code cell, and then restart your notebook's kernel:
%%bash
source activate py2env
conda install pytz==2018.4
pip install apache-beam google-cloud-dataflow
I had the same error with DataflowRunner (DirectRunner worked normally). pip install pip==9.0.3 solved the problem for me as well!
I have the same error.
CalledProcessError: Command '['/usr/bin/python3', '-m', 'pip', 'download', '--dest', '/tmp/dataflow-requirements-cache', '-r', 'requirements.txt', '--exists-action', 'i', '--no-binary', ':all:']' returned non-zero exit status 1
This is my environment: apache-beam==2.16.0 tensorflow==2.1.0 tensorflow-metadata==0.15.2 tensorflow-transform==0.15.0 Python 2.7.13 pip 20.0.2
I think my pip is already updated. What am I missing?
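The CalledProcessError above only reports the exit status, not pip's own message. One way to see the underlying failure is to run the same command by hand and capture its output. The sketch below rebuilds the command from the traceback. The helper names `build_pip_download_cmd` and `run_and_capture` are hypothetical and are not part of Apache Beam. Because `--no-binary :all:` makes pip fetch source distributions only, the `no_binary` switch lets you compare runs with and without it.

```python
import subprocess
import sys


def build_pip_download_cmd(requirements_file, cache_dir,
                           no_binary=True, python=sys.executable):
    """Rebuild the `pip download` command shown in the traceback."""
    cmd = [python, "-m", "pip", "download",
           "--dest", cache_dir,
           "-r", requirements_file,
           "--exists-action", "i"]
    if no_binary:
        # Restrict pip to source distributions, as in the failing command.
        cmd += ["--no-binary", ":all:"]
    return cmd


def run_and_capture(cmd):
    """Run cmd and return (exit_code, combined stdout and stderr text)."""
    proc = subprocess.Popen(cmd, stdout=subprocess.PIPE,
                            stderr=subprocess.STDOUT,
                            universal_newlines=True)
    output, _ = proc.communicate()
    return proc.returncode, output
```

Calling `run_and_capture(build_pip_download_cmd("requirements.txt", "/tmp/dataflow-requirements-cache"))` and printing the returned text should show which requirement pip could not download. Repeating the call with `no_binary=False` indicates whether that flag is involved.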
--no-binary
Might be causing problems.
@Ark-kun , what is your suggestion to fix it?
I have moved to python 3 and would like to share the installation to save others the time:
!pip install --upgrade --force-reinstall pip==9.0.3
!pip install --upgrade virtualenv --disable-pip-version-check
!pip install apache-beam --disable-pip-version-check
!pip install apache-beam[gcp] --disable-pip-version-check
!pip install apache-beam[test] --disable-pip-version-check
!pip install apache-beam[interactive] --disable-pip-version-check
!pip install --upgrade pip
!pip install tensorflow=='2.0.0b1' # added tensorflow for machine and other nice units methods
I hope that this is helpful,
Best, eilalan
pip install pip==9.0.3
@OrielResearchCure Where do you put this?
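One possible placement, going by the earlier advice in this thread to run installs in a Code cell and then restart the kernel, is the first cell of the notebook, before apache_beam is imported. The sketch below is an assumption rather than a confirmed fix. It uses `sys.executable` so that the pip being pinned belongs to the interpreter running the notebook. The helper names `pin_pip_cmd` and `pin_pip` are hypothetical.

```python
import subprocess
import sys


def pin_pip_cmd(version="9.0.3", python=sys.executable):
    """Build the command that pins pip for the given interpreter."""
    return [python, "-m", "pip", "install", "pip==" + version]


def pin_pip(version="9.0.3"):
    """Run the pin command; raises CalledProcessError if pip fails."""
    subprocess.check_call(pin_pip_cmd(version))
```

Calling `pin_pip()` once and then restarting the kernel mirrors the `!pip install pip==9.0.3` cell. If the runner invokes a different interpreter (the traceback in this thread shows `/usr/bin/python3`), pass that path as `python` to `pin_pip_cmd` and run the resulting command instead.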
Hi all,
A pipeline that was running perfectly now fires an error when run on Dataflow, so I tried a simple pipeline and got the same error. Please let me know if there is anything I need to change or update in my environment, or if you have any other advice.
Many thanks, Eila
will fire the following error:
What am I missing? Has something changed?
Thanks, Eila