However, the autoSql still claims it's a bed3. Does autoSql have a way to declare at least something like "bed3+"?
Beyond that, it would be nice to be able to store a custom autoSql definition, which we can use in bioframe.
Since in bioframe we can already infer common bed schemas from a dataframe and map numpy/pandas dtypes to autoSql types, we should be able to generate valid autoSql schemas from an input dataframe. Being able to store those upon writing a bigBed would keep that information in-band (and we could use it to automatically parse fields on read).
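
To make the idea concrete, here is a rough sketch of the dtype-to-autoSql step. Everything in it is an assumption for illustration: `AUTOSQL_TYPES` and `make_autosql` are hypothetical names (not bioframe or bigtools API), the dtype mapping is one plausible choice, and the field comments are just the column names.

```python
# Hypothetical mapping from numpy/pandas dtype names to autoSql field types.
# This is an illustrative choice, not what bioframe actually uses.
AUTOSQL_TYPES = {
    "int8": "byte",
    "uint8": "ubyte",
    "int16": "short",
    "uint16": "ushort",
    "int32": "int",
    "uint32": "uint",
    "int64": "bigint",
    "float32": "float",
    "float64": "double",
    "object": "string",
}


def make_autosql(dtypes, name="customBed", description="Custom BED-like schema"):
    """Build an autoSql table definition from a {column: dtype name} mapping.

    Unknown dtypes fall back to "string" (an assumption made for this sketch).
    """
    lines = [f"table {name}", f'"{description}"', "("]
    for col, dtype in dtypes.items():
        as_type = AUTOSQL_TYPES.get(str(dtype), "string")
        # Each autoSql field is: <type> <name>; "<comment>"
        lines.append(f'    {as_type} {col}; "{col}"')
    lines.append(")")
    return "\n".join(lines)


schema = make_autosql(
    {
        "chrom": "object",
        "chromStart": "uint32",
        "chromEnd": "uint32",
        "score": "float64",
    }
)
print(schema)
```

With a real dataframe, the input mapping could come from `df.dtypes.astype(str).to_dict()`. The resulting string is what would need to be stored in the bigBed on write.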
Originally posted by @nvictus in https://github.com/jackh726/bigtools/issues/54#issuecomment-2454495341