Previously, if we couldn't detect a charset for a file, we would still download it into the Unified Platform, which caused files like PDFs to be present there.
Now, if charset detection for a file returns 'None', we log a download error in the Unified Platform DB and exclude the file from further checks (e.g. validation).
Note that we are separately pursuing an enhancement to the Registry that would only allow XML files to be included in it.
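
As a rough illustration of the new behaviour, here is a minimal sketch and not the actual implementation. The names `detect_charset`, `process_download` and the `errors` list are hypothetical: `detect_charset` is a simplified stand-in for whatever charset detector the platform uses, and `errors` stands in for the download-error records in the Unified Platform DB.

```python
from typing import Callable, Dict, List, Optional


def detect_charset(content: bytes) -> Optional[str]:
    """Hypothetical stand-in for the real detector.

    Returns an encoding name if the bytes decode cleanly, otherwise None.
    Only UTF-8 is tried here to keep the sketch short.
    """
    try:
        content.decode("utf-8")
    except UnicodeDecodeError:
        return None
    return "utf-8"


def process_download(
    url: str,
    content: bytes,
    errors: List[Dict[str, str]],
    detect: Callable[[bytes], Optional[str]] = detect_charset,
) -> Optional[str]:
    """Return the decoded text, or None if the file should be excluded."""
    charset = detect(content)
    if charset is None:
        # Assumed shape of a download-error record; the real DB schema may differ.
        errors.append({"url": url, "error": "charset could not be detected"})
        return None  # The file is skipped by later steps such as validation.
    return content.decode(charset)


errors: List[Dict[str, str]] = []
xml_text = process_download("https://example.org/a.xml", b"<activities/>", errors)
pdf_text = process_download("https://example.org/b.pdf", b"%PDF-1.7\n\xe2\xe3\xcf\xd3", errors)
```

In this sketch the XML bytes are decoded and returned, while the PDF-like bytes produce a single error record and a `None` result, so the caller can skip validation for that file.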