Open neu5ron opened 1 year ago
The response of ES is right. It's not just simplely calculate the MD5 of 'a', but concatenate all values of 'fileds' with a delimeter of byte '0'. So its result is more like this function = Base64(MD5(join(0, value of fields)
ok,is there a possibility to add an option to change this or. Have years of data with fingerprints/hashes and moving everything to ingest pipeline the fingerprinting does not match with logstash or previous ETL provided by Elastic.
Pinging @elastic/es-data-management (Team:Data Management)
it would be great to have consistent hashes over the years. thank you!
or at least make it not add a null byte if hashing a single field.
Bump 🥶
hi I was wanting to follow up on this issue. I know this may be expected results as it was built for elasticsearch fingerprint process. However, this is not how it works for logstash or filebeat. Also, it makes it difficult for a field like cyber security where it is necessary to share hashes across communities and environments of all sorts of technology - and if those of us using Elastic are sharing inconsistent hashes with the community then it puts us in a difficult position. I continue to see the fingerprint processor be used (as recent as 2 days ago) in Elastic ingest pipelines for ECS - and I know this issue will only continue to grow in the future.
Personally myself, I have solved this - I have found an undocumented hashing technique outside of a processor by using painless. However, I don't want the majority of the community using Elastic to continue to be in this realm of separation of sharing incorrect intel..
Elasticsearch Version
8.9.0, tested also on 8.5 and 8.6
Installed Plugins
No response
Java Version
bundled
OS Version
N/A
Problem Description
When using the fingerprint processor there are unexpected results with showing the actual method's hex representation. For example using the method
MD5
and the valuea
.Expected: hex:
0cc175b9c0f1b6a831c399e269772661
base64:DMF1ucDxtqgxw5niaXcmYQ==
Fingerprint Processor: hex:7687355dbc955b0074758acb4d5f9a
base64:dg91NXbylVsAdHWKy01fpg==
Steps to Reproduce
Logs (if relevant)