Closed marklit closed 8 years ago
I never seen this error message before. Can I get that csv file somewhere to test the load on my machine ? Also, do you really need to load all the fields ? It seems that you need just a few for the queries. You can try loading just those and see if you still get the error.
I'll email you a link to the file.
I'll play around with loading in a reduced set of data in the mean time.
I loaded the data successfully. The only issue I had was 'out of memory' error, so I had to reduce the segment size to 500 MB : ./alenka -l 500 load_trips.sql Can you try it ?
Certainly, I'll try that and report back.
With 500 as a parameter I got that exception again but with 200 it loaded just fine.
~/Alenka_master/alenka -l 500 load.sql
terminate called after throwing an instance of 'thrust::system::system_error'
what(): failed synchronize in thrust::system::cuda::detail::trivial_copy_n: an illegal memory access was encountered
~/Alenka_master/alenka -l 200 load.sql
~/Alenka_master/alenka query.sql
...
mRecCount=1 mcount = 1 term 1 limit=0 print_all=1
|20000046 |
Thanks for your help on that one.
While running a query on your data I found and fixed a bug in Alenka. I updated the master branch, so please update if you have any issues.
Good stuff, I'll re-compile Alenka before I start the 1.1B record import. I've earmarked Saturday to get started on this.
Don't forget to use APPEND when loading consecutive files !
On Wed, Oct 19, 2016 at 10:43 AM, Mark Litwintschik < notifications@github.com> wrote:
Good stuff, I'll re-compile Alenka before I start the 1.1B record import. I've earmarked Saturday to get started on this.
— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/antonmks/Alenka/issues/103#issuecomment-254737134, or mute the thread https://github.com/notifications/unsubscribe-auth/ABhkFC_YbTkmPt86Yn-_feyoUdr2uxwvks5q1coKgaJpZM4KX_Xx .
I made a few changes to alenka including addition of CAST operator necessary for your queries. Also, please notice that in a load script the types should be specified in lower caps, like "decimals", not "DECIMALS", otherwise it is not going to work, alenka is case sensitive. I tested your queries and a new load script, if you need them I attached them all to this message.
Best regards,
Anton
On Wed, Oct 19, 2016 at 11:42 AM, mks antonmks@gmail.com wrote:
Don't forget to use APPEND when loading consecutive files !
On Wed, Oct 19, 2016 at 10:43 AM, Mark Litwintschik < notifications@github.com> wrote:
Good stuff, I'll re-compile Alenka before I start the 1.1B record import. I've earmarked Saturday to get started on this.
— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/antonmks/Alenka/issues/103#issuecomment-254737134, or mute the thread https://github.com/notifications/unsubscribe-auth/ABhkFC_YbTkmPt86Yn-_feyoUdr2uxwvks5q1coKgaJpZM4KX_Xx .
Thanks Anton.
I'm not seeing the attachments here, could you send them over again please?
That was an old message, I remember that after that I sent all the queries as text to you in an email. It might take for me a few days to add what I need to add to make queries 3 and 4 run, I'll try to it this weekend.
Anton
On Wed, Oct 26, 2016 at 9:11 AM, Mark Litwintschik <notifications@github.com
wrote:
Thanks Anton.
I'm not seeing the attachments here, could you send them over again please?
— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/antonmks/Alenka/issues/103#issuecomment-256258285, or mute the thread https://github.com/notifications/unsubscribe-auth/ABhkFCppfjsHbRZvFde_MpYHhZqW6pfaks5q3u77gaJpZM4KX_Xx .
Cool. I'll earmark Sunday evening again to have another go with all this.
I fixed an issue with APPEND and groupby operators, so Q1 should work. Unfortunately you have to reload the data. I'll start working on the rest of the queries.
Great, I'll recompile and import the data again on Sunday and report back.
Hi,
I'm loading in a 8.7 GB, 20 million line CSV file into Alenka. The import starts out well and a number of the
.hash
files growing to 50 MB+ but around 10 minutes into the load command I get anillegal memory access was encountered
error message.I've compiled the master branch of Alenka (commit 59022b5) on Ubuntu 16.04 64-bit with CUDA 8 and I'm running it with an Nvidia GTX 1080 and the 367.48 driver.
Here are the steps I took that led up to the issue:
A few minutes before the exception
nvidia-smi
was showing the following:These were the last files to be modified before the exception:
Here is the last few lines of strace:
Any idea what might have caused this issue or what I can do to work around it? I'm happy to provide more telemetry if needed.
Cheers, Mark