parth126 / IT550

Project Proposals for the IT-550 Course (Autumn 2024)
0 stars 0 forks source link

Document Ranking with a Pretrained Sequence-to-Sequence Model #4

Open pratham4521 opened 2 months ago

pratham4521 commented 2 months ago

Title

Document Ranking with a Pretrained Sequence-to-Sequence Model

Team Name

Team DSSM

Email

202311022@daiict.ac.in

Team Member 1 Name

Pratham Patel

Team Member 1 Id

202311022

Team Member 2 Name

Nishit Munjani

Team Member 2 Id

202311026

Team Member 3 Name

Rohit Rathod

Team Member 3 Id

202311039

Team Member 4 Name

Ayushi Mehta

Team Member 4 Id

202311008

Category

Reproducibility

Problem Statement

The project is aimed to re-rank the documents using pre-trained models like T5.

Evaluation Strategy

AP, Precision, NDCG

Dataset

Robust - https://trec.nist.gov/data/robust/04.guidelines.html

Resources

Document Ranking with a Pretrained Sequence-to-Sequence Model Arxiv : https://arxiv.org/abs/2003.06713

parth126 commented 2 months ago
pratham4521 commented 2 months ago

Respected sir, due to the large size of standard datasets we did not have enough time to train pre-implemented models on these datasets. Therefore we decided to change the topic for our project.

parth126 commented 2 months ago

Include the reference paper in the proposal. Task is reranking using T5 on Trec robust and MS Marco datasets.