The conversationId column in the unannotated dataset does not match the conversationIds that appear in the paper_splits dataset.
It looks like the conversationIds always end with a suffix of -1 or -2 in the unannotated dataset. It looks like conversations that end in -2 are duplicates of those that end in -1.
Ideally the conversation ID would match between the unannotated and paper_splits datsets.
The conversationId column in the unannotated dataset does not match the conversationIds that appear in the paper_splits dataset.
It looks like the conversationIds always end with a suffix of -1 or -2 in the unannotated dataset. It looks like conversations that end in -2 are duplicates of those that end in -1.
Ideally the conversation ID would match between the unannotated and paper_splits datsets.