Closed cryingjin closed 10 hours ago
There are cases where the entity_id in ro_linkings does not actually exist.
ro_linkings
should be between segments but not entities, so the index within ro_linkings
should be segment id
but not entity_id
. We have correct it in the README file.
I have been closely following your related research and appreciate your work. However, I have some questions regarding the data construction. Was the dataset built automatically?
I expected that by using the
ro_linkings
, I would be able to logically connect the text within the document in a coherent flow. However, when I preprocess and concatenate the text according to thero_linkings
order, the result is quite disjointed and lacks coherence. Do you have any insights into what might be causing this issue?e.g.
Ordered Text: LTS. 100'S S. J. Farnham NO. STORES SUBMISSION DATE: EFFECTIVENESS OF PRE- SELL (REPORT ON OCT 3 ONLY). JAN 23, 1995 DISTRIBUTION: R. B. SPELL PROMOTIONAL IMPACT: 9 0 % CLASSIFIED CALLS 2 % ANNUAL CALLS 100'S % OF DISTRIBUTION ACHIEVED IN RETAIL OUTLETS: Excellent. Continues to drive all carton business SALES FORCE 20'S $ 50 OFF PACK: DIRECT ACCOUNT AND CHAIN VOIDS (USE X TO INDICATE A VOID). SUBJECT: DEC 26 NONE OCT 31 ACCOUNT OCT 3 FROM: HARLEY DAVIDSON 100'S CIGARETTES PROGRESS REPORT TO: $ 5.00 OFF CARTON: Excellent but quickly depleted. Excellent movement when couponed. Without coupons, movement slows dramatically!