microbiomedata / sheets_and_friends

Enhance a LinkML model with imported and optionally modified slots
0 stars 0 forks source link

several pipe sep sheets #93

Closed turbomam closed 2 years ago

turbomam commented 2 years ago

92

In an effort to limit redundant typing and super long sheets_and_friends inputs (like nmdc-dh-sheets-...), at least one pipe separated column is now supported on several sheets.

Look for this pattern in the code:

    df = pd.read_csv(config_tsv, sep="\t")
    df[pipe_sel_col] = df[pipe_sel_col].str.split("|")
    df = df.explode(pipe_sel_col)

I guess explode may be a good search term with high S/N