abhisheks008 / DL-Simplified

Deep Learning Simplified is an Open-source repository, containing beginner to advance level deep learning projects for the contributors, who are willing to start their journey in Deep Learning. Devfolio URL, https://devfolio.co/projects/deep-learning-simplified-f013
https://quine.sh/repo/abhisheks008-DL-Simplified-499023976
MIT License
390 stars 340 forks source link

Automated Legal Document Summarizer #858

Open anushkasaxena07 opened 4 months ago

anushkasaxena07 commented 4 months ago

Deep Learning Simplified Repository (Proposing new issue)

:red_circle: Project Title : Automated Legal Document Summarizer
:red_circle: Aim : Create a model that can read and summarize lengthy legal documents, preserving the key legal points and clauses.
:red_circle: Dataset : collected from diverse sources to ensure a variety and contents for comprehensive testing.
:red_circle: Approach : Approach:

Use a pre-trained transformer model fine-tuned on a legal text dataset. Incorporate Named Entity Recognition (NER) to identify and highlight important entities (e.g., names, dates, legal terms). Evaluate the summaries for accuracy and completeness by comparing them to human-generated summaries.


📍 Follow the Guidelines to Contribute in the Project :


:red_circle::yellow_circle: Points to Note :


:white_check_mark: To be Mentioned while taking the issue :


Happy Contributing 🚀

All the best. Enjoy your open source journey ahead. 😎

github-actions[bot] commented 4 months ago

Thank you for creating this issue! We'll look into it as soon as possible. Your contributions are highly appreciated! 😊

anushkasaxena07 commented 4 months ago

@abhisheks008 plz assign me this issue

abhisheks008 commented 4 months ago

Hi @anushkasaxena07 it'll be better to focus on one issue at a time.

Also in the approach part name of the proposed models/architectures are missing. Can you put some clarification on the same?

anushkasaxena07 commented 4 months ago

Proposed Models/Architectures: BERTSUM: A variant of BERT (Bidirectional Encoder Representations from Transformers) specifically designed for extractive summarization. T5 (Text-To-Text Transfer Transformer): A versatile model that can be fine-tuned for summarization tasks by framing them as text-to-text problems. PEGASUS: A model designed for abstractive summarization with a focus on generating high-quality summaries. Longformer: A transformer model designed to handle long documents, making it suitable for summarizing lengthy legal texts. Tools: Hugging Face Transformers, TensorFlow, PyTorch