Closed julianharty closed 2 months ago
Added `--input-file` and `--output-file` to src/github_repo_request_local.py:

    parser.add_argument(
        '--input-file',
        type=str,
        default='./data/original_github_df.csv',
        help='Path to the input CSV file containing repository URLs '
             '(default: ./data/original_github_df.csv)'
    )
    parser.add_argument(
        '--output-file',
        type=str,
        default='./data/updated_local_github_df_test_count.csv',
        help='Path to output CSV file to store updated repository data '
             '(default: ./data/updated_local_github_df_test_count.csv)'
    )

Changed the flags:

    parser.add_argument(
        "--input-file",
        type=str,
        default=str(Path("data/original_github_df.csv")),
        help="Path to the input CSV file."
    )
    parser.add_argument(
        "--output-file",
        type=str,
        default=str(Path("data/updated_local_github_df_test_count1.csv")),
        help="Path to the output CSV file."
    )

--input-file data/test/original_github_df.csv --output-file data/test/updated_local_github_df_test_count.csv
I have added 2 flags for the script src/github_repo_request_local.py:
- `--ttl-file`
- `--test-file-list`
I have also added argparse flags to the script utils/initial_data_preparation.py. The flags are:
- `--input-file`
- `--output-folder`
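
For reference, here is a rough sketch of how an `--input-file` / `--output-folder` pair might be handled. It is not the actual code in utils/initial_data_preparation.py: the default paths, the `parse_args` and `output_path_for` helpers, and the `prepared.csv` filename are all hypothetical and only illustrate the pattern.

```python
import argparse
import tempfile
from pathlib import Path


def parse_args(argv=None):
    """Parse the two flags; both default values are hypothetical placeholders."""
    parser = argparse.ArgumentParser(
        description="Sketch of an input-file/output-folder command line."
    )
    parser.add_argument(
        "--input-file",
        type=Path,
        default=Path("data/input.csv"),  # hypothetical default
        help="Path to the input file (default: %(default)s)",
    )
    parser.add_argument(
        "--output-folder",
        type=Path,
        default=Path("data"),  # hypothetical default
        help="Folder to write output files into (default: %(default)s)",
    )
    return parser.parse_args(argv)


def output_path_for(args, filename):
    """Create the output folder if needed and return the path of a file inside it."""
    args.output_folder.mkdir(parents=True, exist_ok=True)
    return args.output_folder / filename


# Demonstrate with a temporary directory so nothing is written into the repo.
with tempfile.TemporaryDirectory() as tmp:
    demo_args = parse_args(["--output-folder", str(Path(tmp) / "out")])
    demo_path = output_path_for(demo_args, "prepared.csv")
    print(demo_path.name)
    print(demo_path.parent.is_dir())
```

Creating the folder up front means a user can point `--output-folder` at a directory that does not exist yet.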
Context
We now have at least 3 scripts that read content from input files and write updated and additional content to output files. Currently these filenames are often hardcoded in the scripts, which makes the scripts less flexible for users who may wish to use different inputs or outputs.
Let's add command-line options, with the current filenames as default values, to the processing scripts in the ./src folder. Let's also update the repo's README to document the new capabilities.
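
As a minimal sketch of the idea (not necessarily how any of the scripts ends up being written), the previously hardcoded filenames can become argparse defaults, so running a script with no arguments behaves as before. The `build_parser` helper, the constant names, and the description text are illustrative assumptions. The default paths are the ones quoted in this issue.

```python
import argparse

# Defaults taken from the filenames quoted in this issue.
DEFAULT_INPUT = "./data/original_github_df.csv"
DEFAULT_OUTPUT = "./data/updated_local_github_df_test_count.csv"


def build_parser() -> argparse.ArgumentParser:
    """Return a parser whose defaults match the formerly hardcoded paths."""
    parser = argparse.ArgumentParser(
        description="Read repository data from a CSV file and write an updated CSV."
    )
    parser.add_argument(
        "--input-file",
        type=str,
        default=DEFAULT_INPUT,
        help="Path to the input CSV file (default: %(default)s)",
    )
    parser.add_argument(
        "--output-file",
        type=str,
        default=DEFAULT_OUTPUT,
        help="Path to the output CSV file (default: %(default)s)",
    )
    return parser


# Parse an explicit argument list so the example runs anywhere;
# a real script would call parse_args() with no arguments to read sys.argv.
args = build_parser().parse_args(
    ["--input-file", "data/test/original_github_df.csv"]
)
print(args.input_file)   # overridden on the command line
print(args.output_file)  # falls back to the default
```

Using `%(default)s` in the help text keeps the documented default in sync with the actual one, which avoids repeating the path in two places.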