Closed rhamnett closed 4 years ago
Hm this was removed as part of 1cea2b0fe88b888ae8bbbb4cbe2743c1a6087552 last year, was it a mistake or on purpose? I don't remember cc @reuben
It's literally not in the code base. It's up to you. It's easy to create a new, limited CSV file but it seems better just have a flag so I was prepared to re-implement it.
It was an oversight on my part when porting the feeding code to tf.data and then I never got around to fixing it. Should be simple enough to add back the limits.
On 20 Feb 2020, at 20:28, Richard Hamnett notifications@github.com wrote:
It's literally not in the code base. It's up to you. It's easy to create a new, limited CSV file but it seems better just have a flag so I was prepared to re-implement it.
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe.
@rhamnett Should be a fairly easy PR, do you want to try ?
I'd put it on hold till #2723 lands, as it'd have to be re-implemented completely.
I will sort it yeah but I'll take @tilmankamp's advice
I'm wondering since when this is broken. Checking v0.4.1
, there's limit
as an argument to DataSet
, but it's never used anywhere.
Ok, actual removal of the feature seems to have been in 44e502e236d676dfcdb3068f6a6d9d1a9d644dd1
How about something like
--train_files some/data/set.csv[10:-100],some/other/data.sdb[:100]
?
Should be straight-forward to implement through extended generator functions in util.sample_collections.SDB
and util.sample_collections.CSV
.
How about something like
--train_files some/data/set.csv[10:-100],some/other/data.sdb[:100]
? Should be straight-forward to implement through extended generator functions inutil.sample_collections.SDB
andutil.sample_collections.CSV
.
I was looking into create_dataset
and re-vive the --limit
flags. I worry that the proposed syntax might be unobvious to people and error prone from shell point of view
Closing in favor of #1565.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
Add code to ensure the flags below work.
f.DEFINE_integer('limit_train', 0, 'maximum number of elements to use from train set - 0 means no limit') f.DEFINE_integer('limit_dev', 0, 'maximum number of elements to use from validation set- 0 means no limit') f.DEFINE_integer('limit_test', 0, 'maximum number of elements to use from test set- 0 means no limit')