Open JoakimEdin opened 1 year ago
Also, would it be possible to get more data statistics on the datasets you are using? Here are some examples of statistics I would find interesting:
@JoakimEdin Thank you very much for your helpful information. We had grand plans to test various input types, tuned parameters, and models for this paper, but ran out of time.
The focus of the paper was more of a methods/process instruction, rather than a fair comparison, but your point is taken. We have several projects in this space and will review your suggestions and see how we might improve future works.
We cut out a good bit of data to make the page limit, including stats on the dataset. @amullen34 could you post the stats requested in the previous post.
I would also like to see where we could perhaps collaborate on medical NLP.
Thanks again. Cody
Sure, here are those stats.
I don't have the exact number of words per document, but I can say that the average number of characters per document was 636. The average number of codes per case was 1.14. The majority of cases only had a single code attributed to them. Here are some overall stats on the distribution of codes (in the largest dataset):
Thank you, Aaron
Thank you very much for your response and the data statistics!
It is understandable there wasn't time to do everything that was planned.
I'm always up for collaboration! You can contact me at je@corti.ai.
Looking forward to hear from you!
I enjoyed your paper a lot! However, I have one big criticism: I think that the implemented BERT models are too weak. Your results are worse than those I've seen in the literature. Either your dataset is more challenging than MIMIC-III and MIMIC-IV, or you have not implemented the BERT models properly. I have some suggestions for how to get stronger baseline models and make the comparison with the LLMs more fair:
I hope these comments are not interpreted as self-promotion! I am genuinely interested in how LLMs perform on automated medical coding compared to other models! Also, cool to see a comparison on a new dataset!