ipinfo / mmdbctl

mmdbctl is an MMDB file management CLI supporting various operations on MMDB database files.
Apache License 2.0

It does not reutilize data appropriately #42

Open MarianoFacundoArch opened 4 days ago

MarianoFacundoArch commented 4 days ago

The result of exporting a CSV (such as locations) and reimporting it is impossible to manage. MaxMind original MMDBs are more optimized to reutilize already present data, further reducing the size. What would be a good approach to improve this?

abdullahdevrel commented 1 day ago

Thank you for your question! I'd appreciate a bit more context to better understand your issue.

It does not reutilize data appropriately

From what I know, the data in MMDBs is compressed and compiled into a structure optimized for reducing file size. Could you clarify what specific issue you're encountering that makes it seem otherwise?

MaxMind original MMDBs are more optimized to reutilize already present data

Could you elaborate on why you believe that’s the case? Since mmdbctl is a deterministic system, the process for creating MMDB files should reliably optimize and reuse data where possible. If you're noticing a discrepancy, it might be helpful to dig into how the data is being prepared or structured before using mmdbctl.
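One way to look at how the data is prepared is to count how many rows of the exported CSV carry identical values once the network column is set aside. The snippet below is only a rough sketch: the column names (`network`, `city`, `country`) and the sample rows are made up for illustration and are not taken from any real export.

```python
import csv
import io


def redundancy_report(csv_text, key_column="network"):
    """Return (total rows, distinct records ignoring the key column)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    total = 0
    distinct = set()
    for row in reader:
        total += 1
        # Everything except the network column is the record payload.
        payload = tuple(sorted((k, v) for k, v in row.items() if k != key_column))
        distinct.add(payload)
    return total, len(distinct)


# Hypothetical sample data; a real export would be read from a file instead.
sample = """network,city,country
192.0.2.0/25,Buenos Aires,AR
192.0.2.128/25,Buenos Aires,AR
198.51.100.0/24,Madrid,ES
"""

total_rows, distinct_records = redundancy_report(sample)
print(f"{total_rows} rows, {distinct_records} distinct records")
```

A large gap between the two numbers suggests that many networks share the same record, which is where reuse inside the database file would matter most.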

Thanks.

Best Regards, Abdullah (DevRel @ IPinfo.io)

MarianoFacundoArch commented 19 hours ago

The MaxMind GeoIP database uses a particular structure: it links each IP to a location object that contains the rest of the info, so that the same object can be reutilized. I did not find any way in mmdbctl to take a similar approach.
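To make the idea concrete, here is a minimal sketch of that kind of reuse: every distinct record is stored once, and each network only keeps the offset of its record. This is not the MMDB binary format and not how mmdbctl works internally; the JSON encoding, the 4-byte length prefix, and the function names `build_data_section` and `lookup` are assumptions chosen for illustration.

```python
import json


def build_data_section(records_by_network):
    """Store each distinct record once and map every network to its offset."""
    data_section = bytearray()
    offset_of = {}          # serialized record -> byte offset in data_section
    network_to_offset = {}  # network -> offset of its (possibly shared) record
    for network, record in records_by_network.items():
        blob = json.dumps(record, sort_keys=True).encode("utf-8")
        if blob not in offset_of:
            # First time this record is seen: append it with a length prefix.
            offset_of[blob] = len(data_section)
            data_section += len(blob).to_bytes(4, "big") + blob
        network_to_offset[network] = offset_of[blob]
    return bytes(data_section), network_to_offset


def lookup(data_section, network_to_offset, network):
    """Follow a network's offset and decode the record stored there."""
    start = network_to_offset[network]
    size = int.from_bytes(data_section[start:start + 4], "big")
    return json.loads(data_section[start + 4:start + 4 + size])


# Hypothetical data: two networks share one location object.
location = {"city": "Buenos Aires", "country": "AR"}
records = {
    "192.0.2.0/25": dict(location),
    "192.0.2.128/25": dict(location),
    "198.51.100.0/24": {"city": "Madrid", "country": "ES"},
}

section, index = build_data_section(records)

# Size if every network carried its own copy of the record.
naive_size = sum(
    len(json.dumps(r, sort_keys=True).encode("utf-8")) + 4 for r in records.values()
)
print(len(section), "bytes deduplicated vs", naive_size, "bytes naive")
```

In this sketch the two Buenos Aires networks resolve to the same offset, so the shared location is written only once.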

abdullahdevrel commented 18 hours ago

Thanks. Paging @max-ipinfo, who might have some insight into this issue.