Closed tashrifbillah closed 1 month ago
A rows=[]
should be defined in the main()
function underneath. The populate()
function will append to that rows
. Thus, in the current design, argument passing between functions will still not be required.
Hi @dheshanm , I realized that appending rows a DataFrame is slow. Instead, all rows should be appended outside first. The resultant list should be transformed to a DataFrame. I have done it here: https://github.com/AMP-SCZ/utility/commit/3ffad0ace559c2dbbf686da70e68ee2b3e288e0f
Currently, @kcho 's big NDA manifest takes >3 hours to process. This is the line that needs to be modified as above:
https://github.com/AMP-SCZ/utility/blob/3ffad0ace559c2dbbf686da70e68ee2b3e288e0f/nda-transform/image03.py#L111