JetBrains-Research / astminer

A library for mining of path-based representations of code (and more)
MIT License
282 stars 80 forks source link

Manage the number of output path contexts #216

Open ngohlong opened 2 years ago

ngohlong commented 2 years ago

Hello,

I am using astminer to extract path contexts into code2vec format. By modifying the filter in the config (.yaml file), I can remove some long paths. But somehow, when using a big size for the filter, astminer produces more paths than the number of input code snippets.

Could you please explain this problem to me? And it would be great if you can show me the way to manage exactly the number of output path contexts as desired.

Thank you so much in advance.

Best regards, Long Ngo