kozlov-alexey closed 3 years ago
Hello @kozlov-alexey! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:
There are currently no PEP 8 issues detected in this Pull Request. Cheers! :beers:
The read_csv test failures with:
Failed in nopython mode pipeline (step: nopython rewrites) module 'sdc.hiframes.pd_dataframe_ext' has no attribute 'init_dataframe'
are expected, because this PR depends on changes from #918, which was recently rolled back. This PR is therefore blocked until #918 is restored.
@kozlov-alexey @xaleryb win 3.6 build fails with svml error again:
test_series_apply_np (sdc.tests.test_series.TestSeries) ... LLVM ERROR: Symbol not found: __svml_log4_ha
I think something's wrong with the packages being used (note that mkl and many others are installed from public channels rather than built by us). Could this be the reason?
Motivation: init_dataframe was implemented as a Numba intrinsic taking `*args`, which seems to generate redundant extractvalue/insertvalue LLVM instructions. This produces IR that grows quadratically with the number of DF columns and increases the total compilation time of functions that create large DFs. This PR replaces the single init_dataframe with multiple functions, one per number of columns in a DF, which are generated at compile time, thus avoiding the use of `*args`.
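To illustrate the idea, here is a minimal pure-Python sketch of generating one fixed-arity initializer per column count instead of a single variadic function. It is not the code from this PR: `make_init_dataframe`, the `init_dataframe_<n>` naming, and the dict return value are all hypothetical, and the real implementation would generate Numba intrinsics rather than plain functions.

```python
# Hypothetical sketch: every name here is illustrative, not SDC's actual API.
_init_df_cache = {}


def make_init_dataframe(n_cols):
    """Return an initializer that takes exactly n_cols data arguments,
    followed by an index and a tuple of column names."""
    if n_cols in _init_df_cache:
        return _init_df_cache[n_cols]

    data_args = [f"data_{i}" for i in range(n_cols)]
    params = ", ".join(data_args + ["index", "columns"])
    func_name = f"init_dataframe_{n_cols}"
    # A trailing comma after each name keeps a single column a 1-tuple.
    data_tuple = "(" + "".join(f"{a}, " for a in data_args) + ")"

    # Build the source text of a function with an explicit signature,
    # so no packing or unpacking of a variadic tuple is involved.
    src = (
        f"def {func_name}({params}):\n"
        f"    return {{'data': {data_tuple}, "
        f"'index': index, 'columns': columns}}\n"
    )
    namespace = {}
    exec(src, namespace)

    func = namespace[func_name]
    _init_df_cache[n_cols] = func  # generate each arity only once
    return func
```

A call site that knows the column count statically would then pick the matching function, e.g. `make_init_dataframe(2)(col_a, col_b, index, ("A", "B"))`. Caching by column count means each specialized function is generated once, however many DFs with that many columns are created.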