Technocolabs100 / Stack-Overflow-Tag-Predictions

Tag Prediction from Stack Overflow Questions

Data loading Using SQLite DB #13

Closed Technocolabs100 closed 3 years ago

Technocolabs100 commented 3 years ago

To process the data further, we use SQLite, one of the simplest and most widely used databases. It is a powerful, lightweight, open-source database, and the best part is that Python ships with built-in support for it through the sqlite3 module. More about SQLite: https://www.sqlite.org/index.html. Instead of working from the CSV file, we load all of the data into a SQLite database and use SQL queries for further processing. We take all the data in the CSV file and append it to the database ‘train.db’. The resulting SQLite database can then be used for the remaining steps.
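A minimal sketch of the CSV-to-SQLite step described above, using only the standard library. The function name `load_csv_into_sqlite`, the table name `data`, and the batch size are assumptions for illustration and are not taken from the repository; the column names are read from the CSV header rather than hard-coded.

```python
import csv
import sqlite3


def load_csv_into_sqlite(csv_path, db_path="train.db", table="data", batch_size=10_000):
    """Append every row of a CSV file to a SQLite table and return the row count.

    The table name and batch size are hypothetical defaults, not the project's.
    """
    conn = sqlite3.connect(db_path)
    try:
        with open(csv_path, newline="", encoding="utf-8") as f:
            reader = csv.reader(f)
            header = next(reader)  # first row holds the column names
            columns = ", ".join(f'"{name}"' for name in header)
            placeholders = ", ".join("?" for _ in header)
            # Create the table on first use; later runs append to it.
            conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({columns})')
            insert_sql = f'INSERT INTO "{table}" VALUES ({placeholders})'

            total = 0
            batch = []
            for row in reader:
                batch.append(row)
                if len(batch) >= batch_size:
                    # Insert in batches so the whole file never sits in memory.
                    conn.executemany(insert_sql, batch)
                    total += len(batch)
                    batch = []
            if batch:
                conn.executemany(insert_sql, batch)
                total += len(batch)
        conn.commit()
        return total
    finally:
        conn.close()
```

Once the data is loaded, it can be queried with plain SQL, for example `conn.execute('SELECT COUNT(*) FROM data')` on a connection opened with `sqlite3.connect("train.db")`. Because the columns are created without declared types, values are stored as text exactly as they appear in the CSV.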

dethebera commented 3 years ago

I would like to work on this. I will provide a PR in the form of an .ipynb notebook for loading the data with SQLite.

Technocolabs100 commented 3 years ago

Thanks, Utkarsh! I will be assigning this issue to you.

dethebera commented 3 years ago

Hi @Technocolabs100! I just made the PR! Let me know if any changes are needed. 😊

20

dethebera commented 3 years ago

Hi @Technocolabs100, if there's an issue with GitHub loading the notebook, you can try this link to view the progress made.

Sometimes GitHub fails to load the files, and I wasn't able to open the file either unless I opened it directly from my repo. Issue details: https://github.com/iurisegtovich/PyTherm-applied-thermodynamics/issues/11

Technocolabs100 commented 3 years ago

This issue was completed by Utkarsh.

dethebera commented 3 years ago

@Technocolabs100 Hi, I guess you'll have to update the labels for the GSSOC'21 leaderboard: level0, level1, level2 ... etc.