pajachiet / pymongo-schema

A schema analyser for MongoDB, written in Python.
GNU Lesser General Public License v3.0
74 stars 13 forks source link

speed up schema extraction with field trade-off #32

Closed iamsk closed 4 years ago

iamsk commented 4 years ago

It's too slow for big collection, and timeout always happen. So, replace scan all rows with only 1000 random samples to speed up extraction.

pajachiet commented 4 years ago

Thanks a lot for your contribution ! However, sometimes we do want to scan the whole database if possible.

Could you please add an argument to the extract function, to keep the default behavior ? And update also the README command line usage ?

iamsk commented 4 years ago

Thanks a lot for your contribution ! However, sometimes we do want to scan the whole database if possible.

Could you please add an argument to the extract function, to keep the default behavior ? And update also the README command line usage ?

I'm agree with you, I will try to update this.

iamsk commented 4 years ago

@pajachiet updated, please have a review