Legal Element Extraction

FudanDISC / DISC-LawLLM

[Chinese legal large language model] DISC-LawLLM: an intelligent legal system powered by large language models (LLMs) to provide a wide range of legal services.

Apache License 2.0

Could you provide some more details on how to construct the legal element extraction dataset?

There are several open-source datasets you can refer to, for instance LEVEN and JointExtraction. Building on these, we use ChatGPT to generate chain-of-thought reasoning for the samples.
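
To make the idea concrete, here is a minimal sketch of how an annotated record might be turned into an instruction-tuning sample with a chain-of-thought rationale. It is not the authors' actual pipeline. The record layout (`text`, `elements`, `type`, `span`), the instruction wording, and the `build_sample` and `dummy_explain` helpers are all hypothetical, and `dummy_explain` only stands in for whatever LLM call you would use to write the rationale.

```python
import json
from typing import Callable

# Hypothetical record layout; real datasets such as LEVEN use their own schema.
record = {
    "text": "The defendant stole a mobile phone from the victim.",
    "elements": [{"type": "theft", "span": "stole"}],
}


def dummy_explain(text: str, elements: list[dict]) -> str:
    """Stand-in for an LLM call that writes a chain-of-thought rationale."""
    steps = [f"'{e['span']}' indicates {e['type']}." for e in elements]
    return "Reasoning: " + " ".join(steps)


def build_sample(
    text: str,
    elements: list[dict],
    explain: Callable[[str, list[dict]], str],
) -> dict:
    """Turn one annotated text into an instruction/input/output record."""
    answer = "; ".join(f"{e['type']}: {e['span']}" for e in elements)
    rationale = explain(text, elements)
    return {
        "instruction": (
            "Extract the legal elements from the following text "
            "and explain your reasoning."
        ),
        "input": text,
        # Rationale first, then the gold annotation as the final answer.
        "output": f"{rationale}\nElements: {answer}",
    }


sample = build_sample(record["text"], record["elements"], dummy_explain)
print(json.dumps(sample, ensure_ascii=False, indent=2))
```

Keeping the gold annotation as the final answer and using the model only for the rationale means the labels still come from the human-annotated dataset.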

By the way, did you see a drop in general ability while fine-tuning?

Yes, but I cannot offer a reliable method to prevent it. Two suggestions: increase the diversity of your prompts, and mix other types of data into each fine-tuning run. Beyond that, it is largely trial and error.
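
As a rough illustration of the data-mixing suggestion, the sketch below blends general-purpose samples into a domain-specific training set at a chosen proportion. The `mix_datasets` helper, the `src` field, and the 0.3 ratio are assumptions made for the example, not values from the DISC-LawLLM authors; the right ratio would have to be found by experiment.

```python
import random


def mix_datasets(domain, general, general_ratio=0.3, seed=0):
    """Return a shuffled list in which about `general_ratio` of the items
    come from `general` and the rest from `domain`."""
    if not 0 <= general_ratio < 1:
        raise ValueError("general_ratio must be in [0, 1)")
    rng = random.Random(seed)
    # Number of general samples needed so they make up the requested share.
    n_general = round(len(domain) * general_ratio / (1 - general_ratio))
    # Never ask for more general samples than are available.
    n_general = min(n_general, len(general))
    mixed = list(domain) + rng.sample(list(general), n_general)
    rng.shuffle(mixed)
    return mixed


# Toy data: 70 legal samples and a pool of 100 general samples.
domain = [{"src": "legal", "id": i} for i in range(70)]
general = [{"src": "general", "id": i} for i in range(100)]

mixed = mix_datasets(domain, general, general_ratio=0.3)
n_general = sum(1 for m in mixed if m["src"] == "general")
print(len(mixed), n_general)
```

Fixing the seed keeps the mix reproducible, which helps when comparing runs at different ratios.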

Legal Element Extraction #30