scholl-lab / vcf-filtering

A collection of scripts for filtering annotated variant call format files
MIT License
2 stars 1 forks source link

Bug: Fields with Commas Converted to Numbers in Excel #53

Open berntpopp opened 1 month ago

berntpopp commented 1 month ago

Description

Some fields, such as AD or others containing commas, are being converted to numbers in Excel. This results in incorrect data representation. The expected behavior is for these fields to be treated as text to preserve their original format.

Steps to Reproduce

  1. Run the script with a VCF file containing fields like AD that have comma-separated values.
  2. Convert the output to Excel format.
  3. Open the Excel file and observe that fields with commas are converted to numbers.

Expected Behavior

Fields with commas should be treated as text in the Excel output to ensure the original format is preserved.

Actual Behavior

Fields with commas are converted to numbers, resulting in incorrect data representation.

Proposed Solution

Fix the type of such fields as text when converting to Excel format.

Impact

Ensuring fields with commas are treated as text will preserve the integrity of the data and avoid misrepresentation in the Excel output.

Additional Notes