Open arjunjauhari opened 6 years ago
Metric:
Summary: Total number of question pairs for training: 404290 Duplicate pairs: 36.92% Total number of questions in the training data: 537933 Number of questions that appear multiple times: 111780 Total number of question pairs for testing: 2345796
Metrics to evaluate:
Step 1: Extract a feature Step 2: Apply logistic regression.
1) Split data (train/dev/test) 2) Define a metric (code it)