CogComp / saul

Saul : Declarative Learning-Based Programming
Other
64 stars 18 forks source link

ER Classifiers switched to SparseNetworkLBP #340

Closed bhargav closed 8 years ago

bhargav commented 8 years ago

JointTraining was not updating weights when SparseNetworkLBP was used due to a stale reference to network. Updated the SparseNetworkLBP class to not store any values and interact with the actual class instead.

Changes in saul-core:

danyaljj commented 8 years ago

About the changes in results of L+I: I think the existing evaluation is not quite correct.

Two scenarios:

I don't remember how we reported the previous results in the readme file, but I think it was using the latter. I think we should update the results after dropping pathToHead from testing.

bhargav commented 8 years ago

@danyaljj In the PR, deriveTestInstances method uses pathToHead to get the incident Node object. Once we have the node, I am using the getTestingInstance on the node to retrieve test candidates for the default test() method. Can you have a look at the latest iteration? I had already updated results after making those changes.

danyaljj commented 8 years ago

Oops. All looks good. Implementation makes sense.

bhargav commented 8 years ago

Updated LBJava Version and removed the SparseNetworkLBP file. Updated er-models package. But I see that some tests in POS and SRL are failing. Will investigate later.

danyaljj commented 8 years ago

Shoot this is a little sad ... the pre-trained models for SRL were trained on SparseNetworkLBP and need to be retrained; @kordjamshidi how hard is it to re-training predicateSenseClassifier and argumentTypeLearner and argumentXuIdentifierGivenApredicate?

bhargav commented 8 years ago

Yeah. This might have to wait till next week then. I updated the ER Models but other examples need to be re-trained.

@danyaljj Not sure why POS test is failing though.

kordjamshidi commented 8 years ago

Yes, this is what I mentioned before, we need to remove this class after the deadline.

danyaljj commented 8 years ago

@bhargav I think somehow the new lbjava version brings new version of Edison (which has had some changes) and causes some of the feature-extraction related stuff to fail (related to #339).

bhargav commented 8 years ago

@danyaljj @kordjamshidi Updated this PR. I'll have to train the SRL models again.

kordjamshidi commented 8 years ago

yes, it would be great to merge this one asap. @bhargav

bhargav commented 8 years ago

I tried running SRL training a couple of time and end up with Heap Size limit exceeded error while parsing data model. By the time 1k sentences were parsed, the process had used ~35Gb. Training examples have 37k sentences. I think we'll have to look at memory usage and fix issues there first.

kordjamshidi commented 8 years ago

You can look at it but I could run it by increasing the memory, using the instructions that I put here: https://github.com/IllinoisCogComp/saul/wiki/Common-Erros, maybe try this and see if it works for you. Are you sure, you really used that 35G? I think I could run the first part on my laptop also (16G).

bhargav commented 8 years ago

@danyaljj @kordjamshidi This is ready. The performance on SRL task was similar. It was less ( -0.5 %) on aTr but equal or better bTr, cTr, dTr, fTr.

kordjamshidi commented 8 years ago

ok, great.

danyaljj commented 8 years ago

Just a few comments. Feel free to merge after addressing them.

danyaljj commented 8 years ago

Small comment on the test error margin. Otherwise good to merge.

bhargav commented 8 years ago

Updated test margins. :)

danyaljj commented 8 years ago

Looks good. Merging!