mayhugotong / GenTKG

This is the official implementation repository of the NAACL Findings paper GenTKG: Generative Forecasting on Temporal Knowledge Graph with Large Language Models (https://arxiv.org/abs/2310.07793).

In-domain Generalization #3

Open binchen4110 opened 5 months ago

binchen4110 commented 5 months ago

In 4.3 "In-domain Generalization", How do you design your various partitions of training data? Are this step in phrase of TLR or FIT or both them?

mayhugotong commented 5 months ago

Hi, the training data is first temporally ordered, i.e., from the earliest to the latest timestamp. Then we select the first 5%, 10%, ..., 75%, etc. In the setting presented in our paper, the partitioning is done before TLR and FIT. Note that in the TLR phase, for each partition we build prompts with TLR and uniformly sample 1024 prompts from this x% of the original training set for FIT training in GenTKG. The other baseline models are trained on the same x% of the original training set.
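
For a concrete picture of the ordering, partitioning, and sampling described above, here is a minimal sketch. It is not the repository's actual code: the quadruple layout, the `build_tlr_prompt` stub, and the function names are assumptions for illustration only; the real TLR retrieval and prompt construction live in this repo.

```python
import random
from typing import List, Tuple

Quad = Tuple[str, str, str, int]  # (subject, relation, object, timestamp)

def build_tlr_prompt(quad: Quad, history: List[Quad]) -> str:
    # Placeholder for the rule-based TLR prompt construction in the repo.
    s, r, o, t = quad
    return f"{t}: [{s}, {r}, ?]  answer: {o}"

def partition_and_sample(train: List[Quad], fraction: float,
                         n_fit: int = 1024, seed: int = 0) -> List[str]:
    # 1. Temporal ordering: earliest timestamp first.
    ordered = sorted(train, key=lambda q: q[3])
    # 2. Keep the earliest x% of the ordered training set.
    partition = ordered[:int(len(ordered) * fraction)]
    # 3. Build TLR prompts on this partition only.
    prompts = [build_tlr_prompt(q, partition) for q in partition]
    # 4. Uniformly sample up to 1024 prompts for FIT training.
    random.seed(seed)
    return random.sample(prompts, min(n_fit, len(prompts)))
```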