mind / wheels

Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)

Windows wheel for Tensorflow 1.4 CUDA9 + CUDNN7 #6

Closed 0xDaksh closed 6 years ago

danqing commented 6 years ago

Unfortunately we don’t have Windows machines ourselves, so we won’t be able to provide that. Sorry.

0xDaksh commented 6 years ago

@danqing Can you provide instructions so that I can build it?

danqing commented 6 years ago

The installation guide is here. I don't have any experience with Windows, but it looks like you need to install Bazel and use the CMake build. It also likely works for Python 3.5 only.

0xDaksh commented 6 years ago

@danqing I'm building it today. If it works, I'll provide it here.

danqing commented 6 years ago

Sounds good thanks! Lemme know if you run into issues and I'll see if I can help too.

0xDaksh commented 6 years ago

@danqing The build just ran but failed. Do you use any explicit commands or instructions for CUDA 9 and cuDNN 7?

danqing commented 6 years ago

What's the error?

0xDaksh commented 6 years ago

@danqing I followed this, except with CUDA 9 and cuDNN 7: https://github.com/tensorflow/tensorflow/blob/master/tensorflow/contrib/cmake/README.md

danqing commented 6 years ago

What’s the error you saw?

0xDaksh commented 6 years ago

I don't know. I compiled it a while ago, and that's what happened.

danqing commented 6 years ago

Hmm yeah without specific errors I don't think I can tell what went wrong. If you are gonna give it another try, lemme know what problem you run into and I can see if I spot anything suspicious.

0xDaksh commented 6 years ago

Okay, I'll recompile!

0xDaksh commented 6 years ago

atm it is stuck at:

 Performing download step (download, verify and extract) for 'png'
  -- Downloading...
     dst='C:/Code/builds/tensorflow/tensorflow/contrib/cmake/build/downloads/libpng-1.2.53.tar.gz'
     timeout='none'
  -- Using src='https://storage.googleapis.com/libpng-public-archive/libpng-1.2.53.tar.gz'
  -- [download 0% complete]
  -- [download 1% complete]
  -- [download 2% complete]

danqing commented 6 years ago

Hmm stuck at download? Does the [download x% complete] thing not progress?

0xDaksh commented 6 years ago

Nope, it's just stuck there.

0xDaksh commented 6 years ago

What I did: I manually downloaded the file from SourceForge instead of storage.googleapis.com, because that URL was dead for me, and used it in place of the stalled download. I've now restarted the build, since it was stuck there.

danqing commented 6 years ago

Interesting... If you are talking about https://storage.googleapis.com/libpng-public-archive/libpng-1.2.53.tar.gz, it works for me..

0xDaksh commented 6 years ago

then it's the routing which is dead xD

0xDaksh commented 6 years ago

anyway, I downloaded that file from sourceforge and it worked. Let's see where the next error comes in!
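The log above shows timeout='none', which is consistent with the download hanging indefinitely rather than failing. A minimal sketch of the manual workaround, assuming you have a list of candidate URLs for the same archive: try each with a timeout and keep the first one that completes. The function name and the mirror list are hypothetical, and whether the CMake build reuses a pre-placed archive depends on how its download step verifies the file.

```python
import shutil
import urllib.request
from pathlib import Path


def fetch_with_fallback(urls, dst, timeout=30):
    """Try each URL in order; save the first one that downloads fully to dst.

    Hypothetical helper for illustration, not part of the TensorFlow build.
    """
    dst = Path(dst)
    dst.parent.mkdir(parents=True, exist_ok=True)
    tmp = dst.with_name(dst.name + ".part")
    errors = []
    for url in urls:
        try:
            # The timeout covers connecting and each blocking read, so a
            # stalled transfer raises instead of hanging forever.
            with urllib.request.urlopen(url, timeout=timeout) as resp, open(tmp, "wb") as out:
                shutil.copyfileobj(resp, out)
            tmp.replace(dst)  # expose the file only once it is complete
            return url
        except OSError as exc:  # URLError and socket timeouts are OSError subclasses
            errors.append(f"{url}: {exc}")
            tmp.unlink(missing_ok=True)  # requires Python 3.8+
    raise RuntimeError("all sources failed:\n" + "\n".join(errors))
```

The destination would be the dst path printed in the log; the second URL in the list would be whatever mirror you trust for the same libpng release.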

0xDaksh commented 6 years ago

These are the new errors:

c:\code\builds\tensorflow\tensorflow\contrib\cmake\build\external\eigen_archive\eigen\src\core\products\generalblockpanelkernel.h(2006): fatal error C1002: compiler is out of heap space in pass 2 [C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_kernels.vcxproj]
  example_parsing_ops.cc
c:\code\builds\tensorflow\tensorflow\contrib\cmake\build\external\eigen_archive\eigen\src\core\products\generalblockpanelkernel.h(1902): fatal error C1002: compiler is out of heap space in pass 2 [C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_kernels.vcxproj]
cl : Command line error D8040: error creating or communicating with child process [C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_kernels.vcxproj]

0xDaksh commented 6 years ago

@danqing The build finished, but with errors:

"C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_tutorials_example_trainer.vcxproj" (default target) (1) ->
"C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_gpu_kernels.vcxproj" (default target) (117) ->
(CustomBuild target) ->
  C:\Program Files (x86)\Microsoft Visual Studio\2017\BuildTools\Common7\IDE\VC\VCTargets\Microsoft.CppCommon.targets(171,5): error MSB6006: "cmd.exe" exited with code 1. [C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_gpu_kernels.vcxproj]

"C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_tutorials_example_trainer.vcxproj" (default target) (1) ->
"C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_kernels.vcxproj" (default target) (118) ->
(ClCompile target) ->
  c:\code\builds\tensorflow\tensorflow\contrib\cmake\build\external\eigen_archive\eigen\src\core\products\generalblockpanelkernel.h(2006): fatal error C1002: compiler is out of heap space in pass 2 [C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_kernels.vcxproj]
  c:\code\builds\tensorflow\tensorflow\contrib\cmake\build\external\eigen_archive\eigen\src\core\products\generalblockpanelkernel.h(1902): fatal error C1002: compiler is out of heap space in pass 2 [C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_kernels.vcxproj]
  cl : Command line error D8040: error creating or communicating with child process [C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_kernels.vcxproj]

    149 Warning(s)
    4 Error(s)

Time Elapsed 00:18:03.99

0xDaksh commented 6 years ago

@danqing Trying this solution: https://github.com/tensorflow/tensorflow/issues/10799

But instead of MSBuild /m:4 /p:Configuration=Release tf_python_build_pip_package.vcxproj, I used MSBuild /m:6 /p:Configuration=Release tf_python_build_pip_package.vcxproj, since I have more cores 🗡
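For reference, /m sets the maximum number of concurrent MSBuild processes, so /m:6 allows up to six project builds at once. A minimal sketch of assembling that command line follows; the function name and the host_x64 option are illustrative and not from the thread. PreferredToolArchitecture=x64 is a real MSBuild property that selects the 64-bit-hosted compiler, but whether it resolves the C1002 heap error in this particular build is an assumption.

```python
def msbuild_command(project, jobs=4, configuration="Release", host_x64=False):
    """Return the MSBuild argument list for one project file.

    Hypothetical helper for illustration only.
    """
    if jobs < 1:
        raise ValueError("jobs must be at least 1")
    cmd = ["MSBuild", f"/m:{jobs}", f"/p:Configuration={configuration}"]
    if host_x64:
        # Assumption: the 64-bit-hosted compiler has more address space,
        # which may help with "out of heap space" failures.
        cmd.append("/p:PreferredToolArchitecture=x64")
    cmd.append(project)
    return cmd


if __name__ == "__main__":
    # Reproduces the command quoted in the comment above.
    print(" ".join(msbuild_command("tf_python_build_pip_package.vcxproj", jobs=6)))
```

Each parallel compiler process needs its own memory, so raising /m may add memory pressure rather than relieve it; treat the job count as something to tune, not a fix in itself.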

0xDaksh commented 6 years ago

@danqing After doing that, those 4 errors collapsed into 1:

       "C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_tutorials_example_trainer.vcxproj" (default target) (1) ->
       "C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_cc_ops.vcxproj" (default target) (4) ->
       "C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\array_ops_gen_cc.vcxproj" (default target) (9) ->
       "C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\tf_core_lib.vcxproj" (default target) (16) ->
       "C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\snappy.vcxproj" (default target) (48) ->
       (CustomBuild target) ->
         C:\Program Files (x86)\Microsoft Visual Studio\2017\BuildTools\Common7\IDE\VC\VCTargets\Microsoft.CppCommon.targets(171,5): error MSB6006: "cmd.exe" exited with code 1. [C:\Code\builds\tensorflow\tensorflow\contrib\cmake\build\snappy.vcxproj]

    235 Warning(s)
    1 Error(s)

danqing commented 6 years ago

Seems to be a VS issue. Have you tried something like https://stackoverflow.com/questions/44024268/msbuild-error-msb6006-cmd-exe-exited-with-code-1?
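MSB6006 only says that a command run by a custom build step returned a non-zero exit code, so the underlying cause is usually somewhere earlier in the log. A minimal sketch for pulling the distinct error codes and their projects out of saved MSBuild output, which can make a long log easier to share in a thread like this one. The function name is hypothetical, and it assumes each error sits on a single line; console-wrapped output would need to be rejoined first.

```python
import re

# Matches "error MSB6006: ...", "fatal error C1002: ...",
# "Command line error D8040: ...", with an optional trailing [project path].
ERROR_RE = re.compile(r"\berror\s+([A-Z]+\d+)\s*:\s*(.*?)\s*(?:\[(.*?)\])?\s*$")


def summarize_errors(log_text):
    """Return distinct (code, message, project_file) tuples in first-seen order."""
    seen = []
    for line in log_text.splitlines():
        match = ERROR_RE.search(line)
        if not match:
            continue
        code, message, project = match.groups()
        # Keep only the project file name; the source location is dropped,
        # so the same error at two line numbers counts once.
        project_file = project.rsplit("\\", 1)[-1] if project else None
        entry = (code, message, project_file)
        if entry not in seen:
            seen.append(entry)
    return seen


if __name__ == "__main__":
    import sys

    # Usage: python summarize.py build.log
    with open(sys.argv[1], encoding="utf-8", errors="replace") as fh:
        for code, message, project_file in summarize_errors(fh.read()):
            print(f"{code}\t{project_file}\t{message}")
```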