RUSH-LAB / LSHUniqueEntityEstimator


malloc error in minhashing run #2

Open vm-wylbur opened 6 years ago

vm-wylbur commented 6 years ago

I compiled the minhashing program to a.out using gcc v8.2.0, and a.out runs without error for the config_restaurant.txt and config_cd.txt examples.

However, when I point it at our new matching data, I get the following error:

┌─( pball@petunia:~/projects/hrdag/GT/match/ciidh+ceh+remhi+pnr+fafg/lsh-blocking/hand )────(  λ:master | ◼ )─┐
└─❱❱❱ mlsh Config.txt 1 10                                                                          ❰─┘
Config file is Config.txt
K = 1
L = 10
a.out(10438,0x7fff9df32380) malloc: *** error for object 0x7fd515d16c78: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
[1]    10438 abort      /Users/pball/src/LSHUniqueEntityEstimator/C++Codes/a.out Config.txt 1 10

What should I test, or what information can I provide for debugging? Tagging @mgargiulo who will be giving followup.
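
For context on the message itself: "incorrect checksum for freed object" is the macOS allocator reporting that heap memory was modified after it was released, which usually points to an out-of-bounds write or a use-after-free somewhere earlier in the run. Where the toolchain supports it, rebuilding with `-g -fsanitize=address` tends to report the offending write directly. The sketch below is not taken from the repository; `SignatureTable` and `write_is_rejected` are hypothetical names. It only illustrates how a bounds-checked accessor turns a silent overflow, for example a buffer sized for the small sample datasets, into an exception that names the problem.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical fixed-capacity table, standing in for any buffer whose size
// was chosen with small inputs in mind. Not code from the repository.
class SignatureTable {
public:
    explicit SignatureTable(std::size_t capacity) : slots_(capacity, 0u) {
        assert(capacity > 0 && "table must hold at least one slot");
    }

    // Bounds-checked write: std::vector::at throws std::out_of_range for a
    // bad index, where operator[] would write past the allocation.
    void store(std::size_t index, unsigned value) { slots_.at(index) = value; }

    // Bounds-checked read, same behaviour as store for a bad index.
    unsigned load(std::size_t index) const { return slots_.at(index); }

private:
    std::vector<unsigned> slots_;
};

// Returns true if the write at `index` was rejected as out of range.
bool write_is_rejected(SignatureTable& table, std::size_t index) {
    try {
        table.store(index, 1u);
        return false;
    } catch (const std::out_of_range&) {
        return true;
    }
}
```

If the crash only appears on larger inputs, swapping `operator[]` for `at()` on the suspect containers is a cheap first experiment before reaching for a debugger.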

thegargiulian commented 6 years ago

I got a.out to compile on my machine using gcc v8.1.0 and run on our new matching data, so we're set to continue substantive testing of the package, but we would still like to fix the bug described above.

keroro824 commented 6 years ago

@mgargiulo What is the data format? Can you provide the data dimension and number of data points please?

thegargiulian commented 6 years ago

The data is a .csv file with 56895 records, each containing information on 37 variables.
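
A quick way to confirm those dimensions, and to spot rows with a different number of fields than the rest, is a small scan like the one below. This is a minimal sketch with hypothetical names (`CsvShape`, `csv_shape`, `csv_shape_of_text`); it splits on every comma, so quoted fields that contain commas are over-counted, and a header line counts as a row.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <istream>
#include <sstream>
#include <string>

// Row count plus the smallest and largest field count seen on any row.
struct CsvShape {
    std::size_t rows = 0;
    std::size_t min_fields = 0;
    std::size_t max_fields = 0;
};

// Naive scan: one row per non-empty line, one field per comma plus one.
CsvShape csv_shape(std::istream& in) {
    CsvShape shape;
    std::string line;
    while (std::getline(in, line)) {
        if (!line.empty() && line.back() == '\r') line.pop_back();  // tolerate CRLF
        if (line.empty()) continue;                                 // skip blank lines
        const std::size_t fields =
            static_cast<std::size_t>(std::count(line.begin(), line.end(), ',')) + 1;
        shape.min_fields = (shape.rows == 0) ? fields : std::min(shape.min_fields, fields);
        shape.max_fields = std::max(shape.max_fields, fields);
        ++shape.rows;
    }
    assert(shape.min_fields <= shape.max_fields);  // sanity check on the bookkeeping
    return shape;
}

// Convenience wrapper for in-memory text, handy for quick checks.
CsvShape csv_shape_of_text(const std::string& text) {
    std::istringstream in(text);
    return csv_shape(in);
}
```

To scan a file, open it with `std::ifstream` and pass the stream to `csv_shape`. If `min_fields` and `max_fields` differ, some rows are ragged (or contain quoted commas), which is worth knowing before feeding the file to any parser that assumes a fixed width.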

resteorts commented 6 years ago

Were you able to run the other two config files and the Syrian dataset up through 2014? This might be a good starting point, just to make sure we’re all on the same page.

resteorts commented 6 years ago

You’ll most likely need to change K and L depending on the dataset you are using. Perhaps an email might be the most helpful moving forward if someone can describe the dataset, etc. I’m traveling this week but back next week. Happy to help.
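
As a rough guide to how K and L interact, the sketch below assumes they have their usual LSH meaning, which this thread does not confirm for this program: each of L hash tables keys a record on K concatenated minhash values, and two records become a candidate pair if they share a bucket in at least one table. Under that assumption a pair with Jaccard similarity s collides with probability 1 - (1 - s^K)^L. The function name `collision_probability` is made up for illustration.

```cpp
#include <cassert>
#include <cmath>

// Probability that two records with similarity s share a bucket in at least
// one of L tables, when each table's key concatenates K hash values and each
// hash value matches with probability s (true of MinHash for Jaccard similarity).
double collision_probability(double s, int K, int L) {
    assert(s >= 0.0 && s <= 1.0);
    assert(K >= 1 && L >= 1);
    const double one_table = std::pow(s, K);    // all K hash values agree
    return 1.0 - std::pow(1.0 - one_table, L);  // at least one table matches
}
```

With the K = 1, L = 10 setting from the failing run, a pair with similarity 0.5 collides with probability of about 0.999, so nearly everything moderately similar lands in a shared bucket. Raising K to 4 at the same L brings that down to about 0.48.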

vm-wylbur commented 6 years ago

@resteorts I am able to run a.out on the config_restaurant.txt and config_cd.txt examples. There are no other test files in the data/ directory.

We know about K and L, but we're trying to get it running consistently before we tune the parameters.

iitkgpanshu commented 6 years ago

Great! Keep us posted.

Anshumali
