Hi there,
I have a large dataset of about 20 million samples. The task is simple: I want to get the sum of a column.
The weird thing is this: when I run the code below, the estimated time is 19 s,
but when I run the sum command on its own, it takes about 1 s.
Yet the steps before the sum calculation take only 0.3 s, so I don't know what is happening or what causes the difference in speed between the two figures.
Thoughts? Solutions?