PaddlePaddle / Paddle-Lite

PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)
https://www.paddlepaddle.org.cn/lite
Apache License 2.0
6.97k stars 1.61k forks source link

x86编译问题 error: ‘__m256’ was not declared in this scope ‘w_content’ ‘_mm256_loadu_ps’ ‘x_content’ ‘alpha_content’ #5869

Closed mayouyuan closed 9 months ago

mayouyuan commented 3 years ago

我编译环境是: Linux R710-226 5.4.0-70-generic #78~18.04.1-Ubuntu SMP Sat Mar 20 14:10:07 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux

Paddle-Lite版本是: commit 6d6a6c74968bcbda33581466bcabf6ee17761dd3 (HEAD -> v2.8, origin/release/v2.8) Author: hong19860320 9973393+hong19860320@users.noreply.github.com Date: Thu Apr 8 14:18:12 2021 +0800

编译命令是: ./lite/tools/build.sh --build_extra=ON x86

报错的信息: /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc: In function ‘void paddle::lite::jit::more::intrinsic::CRFDecoding(int, const float, const float, float, int, int)’: /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:57:5: error: ‘m256’ was not declared in this scope m256 w_content, x_content, alpha_content; ^~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:57:5: note: suggested alternative: ‘._56’ m256 w_content, x_content, alpha_content; ^~ ._56 /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:59:5: error: ‘w_content’ was not declared in this scope w_content = _mm256_loadu_ps(w + i_offset); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:59:5: note: suggested alternative: ‘wcsnlen’ w_content = _mm256_loadu_ps(w + i_offset); ^~~~~ wcsnlen /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:59:17: error: ‘_mm256_loadu_ps’ was not declared in this scope w_content = _mm256_loadu_ps(w + i_offset); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:60:5: error: ‘x_content’ was not declared in this scope x_content = _mm256_loadu_ps(x + i_offset); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:61:5: error: ‘alpha_content’ was not declared in this scope alpha_content = _mm256_add_ps(w_content, x_content); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:61:21: error: ‘_mm256_add_ps’ was not declared in this scope alpha_content = _mm256_add_ps(w_content, x_content); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:62:5: error: ‘_mm256_storeu_ps’ was not declared in this scope _mm256_storeu_ps(alpha + i_offset, alpha_content); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:84:7: error: ‘m256’ was not declared in this scope m256 max_score = _mm256_set1_ps(-std::numeric_limits::max()); ^~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:84:7: note: suggested alternative: ‘._56’ m256 max_score = _mm256_set1_ps(-std::numeric_limits::max()); ^~ ._56 /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:85:7: error: ‘m256i’ was not declared in this scope m256i max_j = _mm256_set1_epi32(0); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:103:16: error: expected ‘;’ before ‘alpha_content’ m256 alpha_content = _mm256_broadcast_ss(alpha + seq_offset + i); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:105:16: error: expected ‘;’ before ‘w_content’ __m256 w_content = _mm256_loadu_ps(w + trans_offset); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:106:16: error: expected ‘;’ before ‘score_v’ m256 score_v = _mm256_add_ps(alpha_content, w_content); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:107:17: error: expected ‘;’ before ‘mask’ m256i mask = ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:114:9: error: ‘m128i’ was not declared in this scope m128i lo_max_j = _mm256_extractf128_si256(max_j, 0); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:114:9: note: suggested alternative: ‘f128’ m128i lo_max_j = _mm256_extractf128_si256(max_j, 0); ^~~ f128 /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:115:17: error: expected ‘;’ before ‘hi_max_j’ m128i hi_max_j = _mm256_extractf128_si256(max_j, 1); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:116:17: error: expected ‘;’ before ‘lo_mask’ m128i lo_mask = _mm256_extractf128_si256(mask, 0); // NOLINT ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:117:17: error: expected ‘;’ before ‘hi_mask’ m128i hi_mask = _mm256_extractf128_si256(mask, 1); // NOLINT ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:118:9: error: ‘lo_max_j’ was not declared in this scope lo_max_j = _mm_andnot_si128(lo_mask, lo_max_j); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:118:9: note: suggested alternative: ‘intmax_t’ lo_max_j = _mm_andnot_si128(lo_mask, lo_max_j); ^~~~ intmax_t /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:118:37: error: ‘lo_mask’ was not declared in this scope lo_max_j = _mm_andnot_si128(lo_mask, lo_max_j); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:118:37: note: suggested alternative: ‘fd_mask’ lo_max_j = _mm_andnot_si128(lo_mask, lo_max_j); ^~~ fd_mask /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:118:20: error: ‘_mm_andnot_si128’ was not declared in this scope lo_max_j = _mm_andnot_si128(lo_mask, lo_max_j); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:119:9: error: ‘hi_max_j’ was not declared in this scope hi_max_j = _mm_andnot_si128(hi_mask, hi_max_j); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:119:9: note: suggested alternative: ‘uintmax_t’ hi_max_j = _mm_andnot_si128(hi_mask, hi_max_j); ^~~~ uintmax_t /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:119:37: error: ‘hi_mask’ was not declared in this scope hi_max_j = _mm_andnot_si128(hi_mask, hi_max_j); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:119:37: note: suggested alternative: ‘fd_mask’ hi_max_j = _mm_andnot_si128(hi_mask, hi_max_j); ^~~ fd_mask /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:120:42: error: ‘_mm_set1_epi32’ was not declared in this scope lo_mask = _mm_and_si128(lo_mask, _mm_set1_epi32(i)); ^~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:120:19: error: ‘_mm_and_si128’ was not declared in this scope lo_mask = _mm_and_si128(lo_mask, _mm_set1_epi32(i)); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:122:20: error: ‘_mm_or_si128’ was not declared in this scope lo_max_j = _mm_or_si128(lo_mask, lo_max_j); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:124:9: error: ‘max_j’ was not declared in this scope max_j = _mm256_insertf128_si256(max_j, lo_max_j, 0); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:124:17: error: ‘_mm256_insertf128_si256’ was not declared in this scope max_j = _mm256_insertf128_si256(max_j, lo_max_j, 0); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:128:9: error: ‘max_score’ was not declared in this scope max_score = _mm256_max_ps(max_score, score_v); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:128:46: error: ‘score_v’ was not declared in this scope max_score = _mm256_max_ps(max_score, score_v); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:128:21: error: ‘_mm256_max_ps’ was not declared in this scope max_score = _mm256_max_ps(max_score, score_v); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:144:14: error: expected ‘;’ before ‘x_content’ m256 x_content = _mm256_loadu_ps(x + seq_offset + tag_num + j_offset); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:145:7: error: ‘max_score’ was not declared in this scope max_score = _mm256_add_ps(max_score, x_content); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc: In function ‘void paddle::lite::jit::more::intrinsic::LayerNorm(float, float, float, float, const float, const float, int, float, int)’: /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:34:3: error: ‘m256’ was not declared in this scope m256 sum; ^~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:34:3: note: suggested alternative: ‘._56’ m256 sum; ^~ ._56 /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:35:10: error: expected ‘;’ before ‘mean_vec’ __m256 mean_vec, var_vec; ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:36:3: error: ‘m128’ was not declared in this scope m128 hi, lo; ^~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:145:44: error: ‘x_content’ was not declared in this scope max_score = _mm256_add_ps(max_score, x_content); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:36:3: note: suggested alternative: ‘f128’ m128 hi, lo; ^~ __f128 /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:37:10: error: expected ‘;’ before ‘tmp’ m256 tmp; ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:44:10: error: expected ‘;’ before ‘reverse_num_vec’ m256 reverse_num_vec = ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:46:10: error: expected ‘;’ before ‘epsilon_vec’ m256 epsilon_vec = _mm256_set1_ps(epsilon); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:49:3: error: ‘m256i’ was not declared in this scope m256i mask_vec = _mm256_set_epi32(rest_mask & 0x80 ? 0xffffffff : 0, ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:62:5: error: ‘sum’ was not declared in this scope sum = _mm256_setzero_ps(); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:145:19: error: ‘_mm256_add_ps’ was not declared in this scope max_score = _mm256_add_ps(max_score, x_content); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:62:11: error: ‘_mm256_setzero_ps’ was not declared in this scope sum = _mm256_setzero_ps(); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:146:7: error: ‘_mm256_storeu_ps’ was not declared in this scope _mm256_storeu_ps(alpha + seq_offset + tag_num + j_offset, max_score); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:64:32: error: ‘_mm256_loadu_ps’ was not declared in this scope sum = _mm256_add_ps(sum, _mm256_loadu_ps((const float)x + j)); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:64:13: error: ‘_mm256_add_ps’ was not declared in this scope sum = _mm256_add_ps(sum, _mm256_loadu_ps((const float)x + j)); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:148:28: error: ‘m256i’ does not name a type reinterpret_cast<__m256i>(track + seq_offset + tag_num + j_offset), ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:148:35: error: expected ‘>’ before ‘’ token reinterpret_cast<m256i>(track + seq_offset + tag_num + j_offset), ^ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:148:35: error: expected ‘(’ before ‘’ token /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:148:36: error: expected primary-expression before ‘>’ token reinterpret_cast<m256i>(track + seq_offset + tag_num + j_offset), ^ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/crf_decoding.cc:149:11: error: ‘max_j’ was not declared in this scope max_j); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:68:7: error: ‘tmp’ was not declared in this scope tmp = _mm256_loadu_ps((const float)x + j); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:68:7: note: suggested alternative: ‘tm’ tmp = _mm256_loadu_ps((const float)x + j); ^~~ tm /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:68:13: error: ‘_mm256_loadu_ps’ was not declared in this scope tmp = _mm256_loadu_ps((const float)x + j); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:71:39: error: expected primary-expression before ‘)’ token *(m256)&mask_vec); // NOLINT ^ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:71:41: error: ‘mask_vec’ was not declared in this scope (m256)&mask_vec); // NOLINT ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:71:41: note: suggested alternative: ‘makedev’ (m256)&mask_vec); // NOLINT ^~~~ makedev /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:69:13: error: ‘_mm256_blendv_ps’ was not declared in this scope tmp = _mm256_blendv_ps(_mm256_setzero_ps(), ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:72:13: error: ‘_mm256_add_ps’ was not declared in this scope sum = _mm256_add_ps(sum, tmp); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:74:5: error: ‘hi’ was not declared in this scope hi = _mm256_extractf128_ps(sum, 1); ^~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:74:5: note: suggested alternative: ‘i’ hi = _mm256_extractf128_ps(sum, 1); ^~ i /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:74:10: error: ‘_mm256_extractf128_ps’ was not declared in this scope hi = _mm256_extractf128_ps(sum, 1); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:75:5: error: ‘lo’ was not declared in this scope lo = _mm256_extractf128_ps(sum, 0); ^~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:79:13: error: ‘_mm256_insertf128_ps’ was not declared in this scope _mm256_insertf128_ps(_mm256_setzero_ps(), hi, 0), lo, 1)); ^~~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:78:9: error: ‘_mm256_insertf128_ps’ was not declared in this scope _mm256_insertf128_ps( ^~~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:76:11: error: ‘_mm256_add_ps’ was not declared in this scope sum = _mm256_add_ps( ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:80:11: error: ‘_mm256_hadd_ps’ was not declared in this scope sum = _mm256_hadd_ps(sum, sum); ^~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:82:5: error: ‘mean_vec’ was not declared in this scope mean_vec = _mm256_mul_ps(sum, reverse_num_vec); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:82:5: note: suggested alternative: ‘mean’ mean_vec = _mm256_mul_ps(sum, reverse_num_vec); ^~~~ mean /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:82:35: error: ‘reverse_num_vec’ was not declared in this scope mean_vec = _mm256_mul_ps(sum, reverse_num_vec); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:82:16: error: ‘_mm256_mul_ps’ was not declared in this scope mean_vec = _mm256_mul_ps(sum, reverse_num_vec); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:88:7: error: ‘tmp’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:88:7: note: suggested alternative: ‘tm’ tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~ tm /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:88:27: error: ‘_mm256_loadu_ps’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:88:13: error: ‘_mm256_sub_ps’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:94:7: error: ‘tmp’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:94:7: note: suggested alternative: ‘tm’ tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~ tm /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:94:27: error: ‘_mm256_loadu_ps’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:94:13: error: ‘_mm256_sub_ps’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:98:39: error: expected primary-expression before ‘)’ token (__m256)&mask_vec); // NOLINT ^ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:98:41: error: ‘mask_vec’ was not declared in this scope (m256)&mask_vec); // NOLINT ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:98:41: note: suggested alternative: ‘makedev’ (m256)&mask_vec); // NOLINT ^~~~ makedev /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:96:13: error: ‘_mm256_blendv_ps’ was not declared in this scope tmp = _mm256_blendv_ps(_mm256_setzero_ps(), ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:109:5: error: ‘var_vec’ was not declared in this scope var_vec = _mm256_mul_ps(sum, reverse_num_vec); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:114:7: error: ‘tmp’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:114:7: note: suggested alternative: ‘tm’ tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~ tm /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:114:27: error: ‘_mm256_loadu_ps’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:114:13: error: ‘_mm256_sub_ps’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:116:65: error: ‘epsilon_vec’ was not declared in this scope _mm256_sqrt_ps(_mm256_add_ps(var_vec, epsilon_vec))); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:116:65: note: suggested alternative: ‘epsilon’ _mm256_sqrt_ps(_mm256_add_ps(var_vec, epsilon_vec))); ^~~ epsilon /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:116:27: error: ‘_mm256_sqrt_ps’ was not declared in this scope _mm256_sqrt_ps(_mm256_add_ps(var_vec, epsilon_vec))); ^~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:115:13: error: ‘_mm256_div_ps’ was not declared in this scope tmp = _mm256_div_ps(tmp, ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:117:7: error: ‘_mm256_storeu_ps’ was not declared in this scope _mm256_storeu_ps(reinterpret_cast<float>(out) + j, tmp); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:121:7: error: ‘tmp’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:121:7: note: suggested alternative: ‘tm’ tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~ tm /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:121:27: error: ‘_mm256_loadu_ps’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:121:13: error: ‘_mm256_sub_ps’ was not declared in this scope tmp = _mm256_sub_ps(_mm256_loadu_ps((const float)x + j), mean_vec); ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:123:65: error: ‘epsilon_vec’ was not declared in this scope _mm256_sqrt_ps(_mm256_add_ps(var_vec, epsilon_vec))); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:123:65: note: suggested alternative: ‘epsilon’ _mm256_sqrt_ps(_mm256_add_ps(var_vec, epsilon_vec))); ^~~ epsilon /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:123:27: error: ‘_mm256_sqrt_ps’ was not declared in this scope _mm256_sqrt_ps(_mm256_add_ps(var_vec, epsilon_vec))); ^~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:122:13: error: ‘_mm256_div_ps’ was not declared in this scope tmp = _mm256_div_ps(tmp, ^~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:124:7: error: ‘_mm256_storeu_ps’ was not declared in this scope _mm256_storeu_ps(reinterpret_cast<float>(out) + j, tmp); ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:130:9: error: ‘tmp’ was not declared in this scope tmp = _mm256_loadu_ps((const float)out + j); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:130:9: note: suggested alternative: ‘tm’ tmp = _mm256_loadu_ps((const float)out + j); ^~~ tm /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:130:15: error: ‘_mm256_loadu_ps’ was not declared in this scope tmp = _mm256_loadu_ps((const float)out + j); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:135:27: error: ‘_mm256_loadu_ps’ was not declared in this scope _mm256_mul_ps(_mm256_loadu_ps((const float)out + j), ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:133:9: error: ‘_mm256_storeu_ps’ was not declared in this scope _mm256_storeu_ps( ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:142:27: error: ‘tmp’ was not declared in this scope _mm256_mul_ps(tmp, ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:142:27: note: suggested alternative: ‘tm’ _mm256_mul_ps(tmp, ^~~ tm /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:143:27: error: ‘_mm256_loadu_ps’ was not declared in this scope _mm256_loadu_ps((const float)scale + j - offset))); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:140:9: error: ‘_mm256_storeu_ps’ was not declared in this scope _mm256_storeu_ps( ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:150:9: error: ‘tmp’ was not declared in this scope tmp = _mm256_loadu_ps((const float)out + j); ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:150:9: note: suggested alternative: ‘tm’ tmp = _mm256_loadu_ps((const float)out + j); ^~~ tm /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:150:15: error: ‘_mm256_loadu_ps’ was not declared in this scope tmp = _mm256_loadu_ps((const float)out + j); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:155:27: error: ‘_mm256_loadu_ps’ was not declared in this scope _mm256_add_ps(_mm256_loadu_ps((const float)out + j), ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:153:9: error: ‘_mm256_storeu_ps’ was not declared in this scope _mm256_storeu_ps( ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:162:27: error: ‘tmp’ was not declared in this scope _mm256_add_ps(tmp, ^~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:162:27: note: suggested alternative: ‘tm’ _mm256_add_ps(tmp, ^~~ tm /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:163:27: error: ‘_mm256_loadu_ps’ was not declared in this scope _mm256_loadu_ps((const float*)bias + j - offset))); ^~~~~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:160:9: error: ‘_mm256_storeu_ps’ was not declared in this scope _mm256_storeu_ps( ^~~~ /home/xxx/paddle/Paddle-Lite/lite/backends/x86/jit/more/intrinsic/layer_norm.cc:47:7: error: unused variable ‘rest_mask’ [-Werror=unused-variable] int rest_mask = ^~~~~ make[3]: [lite/backends/x86/jit/more/intrinsic/CMakeFiles/jit_kernel_intrinsic.dir/crf_decoding.cc.o] Error 1 make[3]: 正在等待未完成的任务.... cc1plus: all warnings being treated as errors make[3]: [lite/backends/x86/jit/more/intrinsic/CMakeFiles/jit_kernel_intrinsic.dir/layer_norm.cc.o] Error 1 make[2]: [lite/backends/x86/jit/more/intrinsic/CMakeFiles/jit_kernel_intrinsic.dir/all] Error 2 make[2]: 正在等待未完成的任务.... make[1]: [lite/CMakeFiles/publish_inference.dir/rule] Error 2 make: *** [publish_inference] Error 2.

麻烦看一下,这个问题困扰我好几天了,谢谢!

paddle-bot-old[bot] commented 3 years ago

您好,我们已经收到了您的问题,会安排技术人员尽快解答您的问题,请耐心等待。请您再次检查是否提供了清晰的问题描述、复现代码、环境&版本、报错信息等。同时,您也可以通过查看官网文档常见问题历史Issue来寻求解答。祝您生活愉快~

Hi! We've received your issue and please be patient to get responded. We will arrange technicians to answer your questions as soon as possible. Please make sure that you have posted enough message to demo your request. You may also check out the APIFAQ and Github Issue to get the answer.Have a nice day!

zhangjun commented 3 years ago

@mayouyuan 方便列下环境信息吗 包括cpu、gcc,以及gcc -mavx2 -dM -E - < /dev/null | egrep "SSE|AVX" | sort输出信息

mayouyuan commented 3 years ago

-- Found Paddle host system: ubuntu, version: 18.04.5 -- Found Paddle host system's CPU: 8 cores -- The CXX compiler identification is GNU 7.5.0 -- The C compiler identification is GNU 7.5.0 -- Check for working CXX compiler: /usr/bin/c++ -- Check for working CXX compiler: /usr/bin/c++ -- works -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Detecting CXX compile features -- Detecting CXX compile features - done -- Check for working C compiler: /usr/bin/cc -- Check for working C compiler: /usr/bin/cc -- works -- Detecting C compiler ABI info -- Detecting C compiler ABI info - done -- Detecting C compile features -- Detecting C compile features - done -- CXX compiler: /usr/bin/c++, version: GNU 7.5.0 -- C compiler: /usr/bin/cc, version: GNU 7.5.0 -- AR tools: /usr/bin/ar -- Found Git: /usr/bin/git (found version "2.17.1") -- Looking for pthread.h -- Looking for pthread.h - found -- Looking for pthread_create -- Looking for pthread_create - not found -- Looking for pthread_create in pthreads -- Looking for pthread_create in pthreads - not found -- Looking for pthread_create in pthread -- Looking for pthread_create in pthread - found -- Found Threads: TRUE
-- Performing Test FMA_FOUND -- Performing Test FMA_FOUND - Failed -- Performing Test MMX_FOUND -- Performing Test MMX_FOUND - Success -- Performing Test SSE2_FOUND -- Performing Test SSE2_FOUND - Success -- Performing Test SSE3_FOUND -- Performing Test SSE3_FOUND - Success -- Performing Test AVX_FOUND -- Performing Test AVX_FOUND - Failed -- Performing Test AVX2_FOUND -- Performing Test AVX2_FOUND - Failed -- Performing Test AVX512F_FOUND -- Performing Test AVX512F_FOUND - Failed -- CMAKE_BUILD_TYPE: Release -- MKLML_VER: Glibc225_vsErf_mklml_lnx_2019.0.1.20181227, MKLML_URL: http://paddlepaddledeps.bj.bcebos.com/Glibc225_vsErf_mklml_lnx_2019.0.1.20181227.tgz -- Protobuf protoc executable: /home/xxx/paddle/Paddle-Lite/build.lite.x86/third_party/install/protobuf/bin/protoc -- Protobuf-lite library: /home/xxx/paddle/Paddle-Lite/build.lite.x86/third_party/install/protobuf/lib/libprotobuf-lite.a -- Protobuf library: /home/xxx/paddle/Paddle-Lite/build.lite.x86/third_party/install/protobuf/lib/libprotobuf.a -- Protoc library: /home/xxx/paddle/Paddle-Lite/build.lite.x86/third_party/install/protobuf/lib/libprotoc.a -- Protobuf version: 3.1.0 -- Found cblas and lapack in MKLML (include: /home/xxx/paddle/Paddle-Lite/build.lite.x86/third_party/install/mklml/include, library: mklml) -- BLAS library: mklml -- BLAS Include: /home/xxx/paddle/Paddle-Lite/build.lite.x86/third_party/install/mklml/include -- Enable Intel OpenMP with /home/xxx/paddle/Paddle-Lite/build.lite.x86/third_party/install/mklml/lib/libiomp5.so -- Paddle version is 0.0.0 -- Looking for UINT64_MAX -- Looking for UINT64_MAX - found -- Looking for sys/types.h -- Looking for sys/types.h - found -- Looking for stdint.h -- Looking for stdint.h - found -- Looking for stddef.h -- Looking for stddef.h - found -- Check size of pthread_spinlock_t -- Check size of pthread_spinlock_t - done -- Check size of pthread_barrier_t -- Check size of pthread_barrier_t - done -- Performing Test C_COMPILER_SUPPORT_FLAGfPIC -- Performing Test C_COMPILER_SUPPORT_FLAG__fPIC - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGfPIC -- Performing Test CXX_COMPILER_SUPPORT_FLAGfPIC - Success -- Performing Test C_COMPILER_SUPPORT_FLAGfno_omit_frame_pointer -- Performing Test C_COMPILER_SUPPORT_FLAGfno_omit_frame_pointer - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGfno_omit_frame_pointer -- Performing Test CXX_COMPILER_SUPPORT_FLAGfno_omit_frame_pointer - Success -- Performing Test C_COMPILER_SUPPORT_FLAG__Werror -- Performing Test C_COMPILER_SUPPORT_FLAGWerror - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWerror -- Performing Test CXX_COMPILER_SUPPORT_FLAG__Werror - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWall -- Performing Test C_COMPILER_SUPPORT_FLAGWall - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAG__Wall -- Performing Test CXX_COMPILER_SUPPORT_FLAGWall - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWextra -- Performing Test C_COMPILER_SUPPORT_FLAG__Wextra - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWextra -- Performing Test CXX_COMPILER_SUPPORT_FLAGWextra - Success -- Performing Test C_COMPILER_SUPPORT_FLAG__Wnon_virtual_dtor -- Performing Test C_COMPILER_SUPPORT_FLAGWnon_virtual_dtor - Failed -- Performing Test CXX_COMPILER_SUPPORT_FLAGWnon_virtual_dtor -- Performing Test CXX_COMPILER_SUPPORT_FLAG__Wnon_virtual_dtor - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWdelete_non_virtual_dtor -- Performing Test C_COMPILER_SUPPORT_FLAGWdelete_non_virtual_dtor - Failed -- Performing Test CXX_COMPILER_SUPPORT_FLAGWdelete_non_virtual_dtor -- Performing Test CXX_COMPILER_SUPPORT_FLAGWdelete_non_virtual_dtor - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_unused_but_set_variable -- Performing Test C_COMPILER_SUPPORT_FLAGWno_unused_but_set_variable - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAG__Wno_unused_but_set_variable -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_unused_but_set_variable - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_unknown_pragmas -- Performing Test C_COMPILER_SUPPORT_FLAG__Wno_unknown_pragmas - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_unknown_pragmas -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_unknown_pragmas - Success -- Performing Test C_COMPILER_SUPPORT_FLAG__Wno_strict_aliasing -- Performing Test C_COMPILER_SUPPORT_FLAGWno_strict_aliasing - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_strict_aliasing -- Performing Test CXX_COMPILER_SUPPORT_FLAG__Wno_strict_aliasing - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_unused_parameter -- Performing Test C_COMPILER_SUPPORT_FLAGWno_unused_parameter - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAG__Wno_unused_parameter -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_unused_parameter - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_unused_function -- Performing Test C_COMPILER_SUPPORT_FLAG__Wno_unused_function - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_unused_function -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_unused_function - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_literal_suffix -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_literal_suffix - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_literal_suffix -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_literal_suffix - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_sign_compare -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_sign_compare - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_sign_compare -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_sign_compare - Success -- Performing Test C_COMPILER_SUPPORT_FLAG__Wno_error_unused_local_typedefs -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_unused_local_typedefs - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_unused_local_typedefs -- Performing Test CXX_COMPILER_SUPPORT_FLAG__Wno_error_unused_local_typedefs - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_parentheses_equality -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_parentheses_equality - Failed -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_parentheses_equality -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_parentheses_equality - Failed -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_ignored_attributes -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_ignored_attributes - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_ignored_attributes -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_ignored_attributes - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_terminate -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_terminate - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAG__Wno_error_terminate -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_terminate - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_int_in_bool_context -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_int_in_bool_context - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_int_in_bool_context -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_int_in_bool_context - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWimplicit_fallthrough_0 -- Performing Test C_COMPILER_SUPPORT_FLAG__Wimplicit_fallthrough_0 - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWimplicit_fallthrough_0 -- Performing Test CXX_COMPILER_SUPPORT_FLAGWimplicit_fallthrough_0 - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_maybe_uninitialized -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_maybe_uninitialized - Success -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_maybe_uninitialized -- Performing Test CXX_COMPILER_SUPPORT_FLAGWno_error_maybe_uninitialized - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_unused_function -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_unused_function - Success -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_array_bounds -- Performing Test C_COMPILER_SUPPORT_FLAGWno_error_array_bounds - Success -- Performing Test C_COMPILER_SUPPORT_FLAG__gencode -- Performing Test C_COMPILER_SUPPORT_FLAGgencode - Failed -- Performing Test C_COMPILER_SUPPORT_FLAG_arch_compute_62,code_sm_62 -- Performing Test C_COMPILER_SUPPORT_FLAG_arch_compute_62,code_sm_62 - Failed -- LIGHT_FRAMEWORK: OFF -- LITE_WITH_CUDA: OFF -- LITE_WITH_X86: ON -- LITE_WITH_ARM: OFF -- LITE_WITH_OPENCL: OFF -- LITE_WITH_NPU: OFF -- LITE_WITH_RKNPU: OFF -- LITE_WITH_XPU: OFF -- LITE_WITH_APU: OFF -- LITE_WITH_XTCL:
-- LITE_WITH_FPGA: OFF -- LITE_WITH_MLU: OFF -- LITE_WITH_HUAWEI_ASCEND_NPU: OFF -- LITE_WITH_BM: OFF -- LITE_WITH_IMAGINATION_NNA: OFF -- LITE_WITH_PROFILE: OFF -- LITE_WITH_CV: OFF -- add lite kernels -- compile with lite host kernels fatal: 没有标签准确匹配 '6d6a6c74968bcbda33581466bcabf6ee17761dd3' -- tag: -- branch: heads/v2.8 -- commit: 6d6a6c74 -- ----> mir_subgraphs: subgraph_pass -- Collected 194 ops -- Collected 44 X86 kernels -- Collected 0 CUDA kernels -- Collected 61 Host kernels -- Collected 0 ARM kernels -- Collected 0 OpenCL kernels -- Collected 0 NPU kernels -- Collected 0 APU kernels -- Collected 0 XPU kernels -- Collected 0 RKNPU kernels -- Collected 0 FPGA kernels -- Collected 0 BM kernels -- Collected 0 MLU kernels -- Collected 0 HUAWEI_ASCEND_NPU kernels -- Collected 0 IMAGINATION_NNA kernels -- bundled_tgt_full_name: /home/xxx/paddle/Paddle-Lite/build.lite.x86/libpaddle_api_full_bundled.a -- bundled_tgt_full_name: /home/xxx/paddle/Paddle-Lite/build.lite.x86/libpaddle_api_light_bundled.a -- publish inference lib to /home/xxx/paddle/Paddle-Lite/build.lite.x86/inference_lite_lib -- Configuring done -- Generating done CMake Warning: Manually-specified variables were not used by the project:

HUAWEI_ASCEND_NPU_DDK_ROOT
LITE_WITH_LTO
PY_VERSION
WITH_GPU
XPU_SDK_ROOT
mayouyuan commented 3 years ago

架构: x86_64 CPU 运行模式: 32-bit, 64-bit 字节序: Little Endian CPU: 8 在线 CPU 列表: 0-7 每个核的线程数: 2 每个座的核数: 4 座: 1 NUMA 节点: 1 厂商 ID: GenuineIntel CPU 系列: 6 型号: 44 型号名称: Intel(R) Xeon(R) CPU E5620 @ 2.40GHz 步进: 2 CPU MHz: 1596.389 BogoMIPS: 4788.06 虚拟化: VT-x L1d 缓存: 32K L1i 缓存: 32K L2 缓存: 256K L3 缓存: 12288K NUMA 节点0 CPU: 0-7 标记: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 cx16 xtpr pdcm pcid dca sse4_1 sse4_2 popcnt aes lahf_lm pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid dtherm ida arat flush_l1d

mayouyuan commented 3 years ago

gcc -mavx2 -dM -E - < /dev/null | egrep "SSE|AVX" | sort

define AVX 1

define AVX2 1

define SSE 1

define SSE2 1

define __SSE2_MATH__ 1

define SSE3 1

define __SSE4_1__ 1

define __SSE4_2__ 1

define __SSE_MATH__ 1

define SSSE3 1

zhangjun commented 3 years ago

架构: x86_64 CPU 运行模式: 32-bit, 64-bit 字节序: Little Endian CPU: 8 在线 CPU 列表: 0-7 每个核的线程数: 2 每个座的核数: 4 座: 1 NUMA 节点: 1 厂商 ID: GenuineIntel CPU 系列: 6 型号: 44 型号名称: Intel(R) Xeon(R) CPU E5620 @ 2.40GHz 步进: 2 CPU MHz: 1596.389 BogoMIPS: 4788.06 虚拟化: VT-x L1d 缓存: 32K L1i 缓存: 32K L2 缓存: 256K L3 缓存: 12288K NUMA 节点0 CPU: 0-7 标记: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 cx16 xtpr pdcm pcid dca sse4_1 sse4_2 popcnt aes lahf_lm pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid dtherm ida arat flush_l1d

这里是开启了vt-x虚拟化吗 虚拟化环境中avx指令集没支持