intel / xFasterTransformer

Apache License 2.0
344 stars 60 forks source link

The current commit(d666741) of xFT make failed. #345

Closed xiuying1 closed 4 months ago

xiuying1 commented 4 months ago

The current commit(d666741) of xft make failed, this issue blocked xFT daily run.

When run cmd:

RUN source activate llm && \
    source ./3rdparty/oneccl/build/_install/env/setvars.sh && \
    mkdir -p build && \
    cd build && \
    cmake -DXFT_BUILD_EVALUATION=ON .. && \
    make -j && \
    cd .. && \
    python setup.py install

make error:

15:18:09  #12 ERROR: process "/bin/bash -c source activate llm &&     source ./3rdparty/oneccl/build/_install/env/setvars.sh &&     mkdir -p build &&     cd build &&     cmake -DXFT_BUILD_EVALUATION=ON .. &&     make -j &&     cd .. &&     python setup.py install" did not complete successfully: exit code: 2
15:18:09  ------
15:18:09   > [stage-0  6/12] RUN source activate llm &&     source ./3rdparty/oneccl/build/_install/env/setvars.sh &&     mkdir -p build &&     cd build &&     cmake -DXFT_BUILD_EVALUATION=ON .. &&     make -j &&     cd .. &&     python setup.py install:
15:18:09  285.6 /home/xFasterTransformer/src/models/hybrid_model.h:25:7: note: provided for 'template<template<class, class> class Model, class FirstTokenDtype, class NextTokenDtype, class KVCacheDtype> class HybridModel'
15:18:09  285.6    25 | class HybridModel : public AbstractDecoder {
15:18:09  285.6       |       ^~~~~~~~~~~
15:18:09  289.0 [ 98%] Linking CXX shared library /home/xFasterTransformer/src/xfastertransformer/libxfastertransformer_pt.so
15:18:09  290.8 make[2]: *** [evaluation/CMakeFiles/evaluation.dir/build.make:90: evaluation/CMakeFiles/evaluation.dir/eval.cpp.o] Error 1
15:18:09  290.8 make[1]: *** [CMakeFiles/Makefile2:709: evaluation/CMakeFiles/evaluation.dir/all] Error 2
15:18:09  290.8 make[1]: *** Waiting for unfinished jobs....
15:18:09  293.1 Copying libxft_comm_helper.so to src/xfastertransformer directory
15:18:09  293.1 [ 98%] Built target xfastertransformer_pt
15:18:09  293.1 make: *** [Makefile:91: all] Error 2
15:18:09  ------
15:18:09  Dockerfile.1.daily:27

Detail logs:https://cje-ir-prod01.devtools.intel.com/nex-nswe-wsf/job/ServicesFramework/job/post-silicon-validation/job/image/9014/console

xiuying1 commented 4 months ago

@shanzhou2186 would you please help to assign to the Dev. Thanks.

pujiang2018 commented 4 months ago

could you pls try again?

xiuying1 commented 4 months ago

Verified commit 27e6154 and closed. image