Open VikVal123 opened 4 years ago
To extract all of the existing cards I will try using bs4 and scrape the information from https://www.hearthpwn.com/cards?display=2
Scraping the website is the right choice if you want to practice web scraping. However, if your goal is just to get the data, search around first, as structured data sources are already available (I already have them).
I would need to see the data. I want to make sure all of the cards are up to date. I have found multiple sources, but some of them have outdated information or have simply not been updated recently.
List all the sources that you found and note the state of each. Also, if your goal is to learn data analysis/machine learning, then it is not important if the data is outdated.
https://docs.google.com/spreadsheets/d/1IrptmK0KWlZi1PDU6MC2adIwLAc-exCrF7-bNExZ_6s/edit#gid=191287612 - only lists the cards by name; no other information is available. This spreadsheet is intended for collection tracking.
https://github.com/kevin-internet/HearthstoneCardDataToolAndListerConsoleWPF/blob/master/CSV_AndExcelFilesPregenerated/HearthstoneAllCard_8.0.0.18336.csv - contains a lot of information. The only setback is that the information was last updated in 2017, but as mentioned above, the date is not important.
Located a web page that mentions "HearthstoneJSON includes a full image API for all the cards in the game." I am currently reading up on what JSON and an API are. Direct link to the page: https://hearthstonejson.com/docs/images.html
All of the Hearthstone cards have been extracted and normalized with pandas into a table: cards_json_utf8.py
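For anyone following along, here is a rough sketch of what normalizing card records into a pandas table can look like. It is not the contents of cards_json_utf8.py; the `sample_cards` records and their field names (`id`, `name`, `cost`, `attack`, `health`) are made up for illustration, and the real JSON may use different keys.

```python
import pandas as pd

# Hypothetical card records standing in for the parsed JSON file.
# The field names are assumptions, not confirmed against the real data.
sample_cards = [
    {"id": "X1", "name": "Sample Minion", "cost": 2, "attack": 3, "health": 2},
    {"id": "X2", "name": "Sample Spell", "cost": 1},
]

# json_normalize turns a list of dicts into a DataFrame, one row per card.
# Keys missing from a record (e.g. attack on a spell) become NaN.
cards_df = pd.json_normalize(sample_cards)

print(cards_df)
```

Cards that lack a field simply get an empty (NaN) cell, so minions and spells can share one table.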
I need to extract all of the standard cards that are mentioned in another issue.
Now that the JSON file has been updated, I will need to extract the standard cards.
Before I can start working, I need to extract all existing Hearthstone cards and sort them by format (Standard or Wild).
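One possible way to do the split, as a minimal sketch: it assumes every card record carries a `set` key and that the list of sets currently in Standard is supplied by hand. The function name `split_by_format`, the sample cards, and the set codes `SET_A`/`SET_B` are placeholders, not real data.

```python
def split_by_format(cards, standard_sets):
    """Split card dicts into (standard, wild) lists by their "set" value.

    Assumes each card has a "set" key; cards without one are treated as Wild.
    """
    standard, wild = [], []
    for card in cards:
        if card.get("set") in standard_sets:
            standard.append(card)
        else:
            wild.append(card)
    return standard, wild


# Placeholder records and set codes, for illustration only.
sample_cards = [
    {"name": "Sample Card A", "set": "SET_A"},
    {"name": "Sample Card B", "set": "SET_B"},
    {"name": "Sample Card C", "set": "SET_A"},
]

standard, wild = split_by_format(sample_cards, standard_sets={"SET_A"})
print([c["name"] for c in standard])
print([c["name"] for c in wild])
```

Keeping the Standard set list as a parameter means only that one argument has to change when the rotation changes.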