nv-morpheus / Morpheus

Morpheus SDK
Apache License 2.0
309 stars 119 forks source link

[FEA]: log parsing output should include a unique column #1789

Open dagardner-nv opened 3 days ago

dagardner-nv commented 3 days ago

Is this a new feature, an improvement, or a change to existing functionality?

Improvement

How would you describe the priority of this feature request

Medium

Please provide a clear description of problem this feature solves

Currently the output data includes these fields:

{
  "remote_host": "5.113.18.208",
  "time_received": "[28/May/2018:08:28:30+0200]",
  "request_method": "GET",
  "request_url": "/apache-log/access.log",
  "request_http_ver": "1.1",
  "status": "206",
  "response_bytes_clf": "43192",
  "request_header_referer": "http:/www.almhuette-raith.at/apache-log/",
  "request_header_user_agent": "Mozilla/5.0 (Windows NT 6.1; Trident/7.0; rv:11.0) like Gecko",
  "error_level": null,
  "error_message": null,
  "request_header_user_agent__os__version_string": null
}

The problem is that none of these fields uniquely identify the record, making it difficult to compare/validate output with the scripts/compare_data_files.py script.

Describe your ideal solution

Include a unique index column in the output

Additional context

No response

Code of Conduct