Closed zhangir-azerbayev closed 3 years ago
Hi Ansong, I fixed the issues raised in your comments with a new commit. Let me know what you think. I've also added a new class called SummEvalMetric, which inherits from SummMetric and is specifically for metrics that use SummEval as a backend (as of right now, this is all of them) .
@zhangir-azerbayev Did you forget to check the summeval_metric.py
into git? It seems to be missing from my end.
Also, there are some relative imports, which we should try to avoid.
Other than those, I think the class design makes sense to me and is much better than last time :)
@niansong1996
Sorry, I forgot to check the summeval_metric.py
, this is fixed now. I also got rid of the relative imports.
@zhangir-azerbayev I think one other thing that is missing is testing.
Can you follow what we have in tests
and add a eval_test.py
and add some testing, to make sure it works as expected?
The commits looks good so far, let me know when you resolved all my previous comments by requesting a review, then I can see if this can be merged into main, thanks!
@zhangir-azerbayev Where are we on this thread?
@zhangir-azerbayev Where are we on this thread?
Hi @niansong1996. I added a commit with unit testing.
@zhangir-azerbayev Please resolve the comments I made above, also there is currently a conflict on demo.ipynb
, have you made any important changes to that file?
Should be ready to merge.
@zhangir-azerbayev Why is RougeWE still removed? I thought we fixed the loading issue?
@niansong1996 good catch, I fixed it.
@zhangir-azerbayev LGTM, are all of the evaluation metrics passing the test? If so, I think this is ready to merge
@niansong1996 Found a minor bug in the testing script. Tests now pass.
Modified evaluation library to better align with style conventions.
One thing I can't figure out how to do is import SummModel into base_metric.py for type annotation purposes. Any help with this is appreciated.