blobcity / autoai

Python based framework for Automatic AI for Regression and Classification over numerical data. Performs model search, hyper-parameter tuning, and high-quality Jupyter Notebook code generation.
Apache License 2.0
175 stars 43 forks source link

Pandas DataFrame Support #9

Closed Thilakraj1998 closed 3 years ago

Thilakraj1998 commented 3 years ago

files to refer:

      https://github.com/blobcity/autoai/blob/main/blobcity/blobcity.py  
      https://github.com/blobcity/autoai/blob/main/blobcity/utils/FileType.py

Currently, the main driver function train accepts file path as an argument to fetch dataset from user-specified location and identifies file type associated with the file.

Enhancement: provide user a flexibility by providing support to accept pandas.Dataframe object has an argument to train function and must support other follow up functions inside driver function.

balamurugan1603 commented 3 years ago

can i work on this issue?

balamurugan1603 commented 3 years ago

Files to refer are these right?

https://github.com/blobcity/autoai/blob/main/blobcity/main/driver.py https://github.com/blobcity/autoai/blob/main/blobcity/utils/FileType.py

sanketsarang commented 3 years ago

@balamurugan1603 done, the task is assigned to you. Yes, the files you mentioned are correct.

Currently the program creates a Pandas DataFrame based on a file that is passed as a parameter to the train function. However, users may want to create the DataFrame from say a Database and not from a file. We thereby need to extend functionality to allow a DataFrame to be passed directly into the blobcity.train(df, target) function.

balamurugan1603 commented 3 years ago

sure I will do it and another silly off-topic doubt. If I submit a pull request by now, will it count towards hacktoberfest? Sorry to ask this.

sanketsarang commented 3 years ago

No, it won't count. You have to generate the PR only after 01 Oct, once it is live. Would suggest waiting till then.

balamurugan1603 commented 3 years ago

Fine. Thanks.