Open whoislocalhost opened 3 years ago
Kosta,
See what problem comes up when testing the Starter Kit.
Apparently the German colleagues have started testing it.
Regards, Galya
On 22 Jun 2021 1:26 pm, Anssi Lintulampi @.***> wrote:
KeyError appears when I'm trying to execute cell 3.4 finding URLs to scrape from Jupyter Notebook file OBEC_Starter_Kit_URLs_Finder.ipynb under URLsFinder directory. The following method is called:
dfnt = uf.start_ws( timeout=20, sleep=0.5, urlsatstart=10, urlsatend=10, slice=0, url='Suggested URL', what='find_urls_to_scrape_from_suggested_url' )
And after the method has loaded the file black_list_urls.csv the following error message appears:
KeyError Traceback (most recent call last)
Dear Anssi,
My colleague Kostadin tried to resolve the problem, but it is not clear how the errors were raised. Could you please send us the input files you were using? Thanks.
Best regards, Galya
From: Anssi Lintulampi @.>
Sent: Tuesday, June 22, 2021 1:26 PM
To: EnterpriseCharacteristicsESSnetBigData/StarterKit @.>
Cc: Subscribed @.***>
Subject: [EnterpriseCharacteristicsESSnetBigData/StarterKit] URLsFinder find_urls_to_scrape_from_suggested_url results to KeyError (#1)
KeyError appears when I'm trying to execute cell 3.4 finding URLs to scrape from Jupyter Notebook file OBEC_Starter_Kit_URLs_Finder.ipynb under URLsFinder directory. The following method is called:
dfnt = uf.start_ws(
timeout=20,
sleep=0.5,
urlsatstart=10,
urlsatend=10,
slice=0,
url='Suggested URL',
what='find_urls_to_scrape_from_suggested_url'
)
And after the method has loaded the file black_list_urls.csv the following error message appears:
KeyError Traceback (most recent call last)
Dear Anssi,
I made changes to the code and hope they fix the bugs. I tested the code with your data and it worked. Please keep in mind that the field names in your input data have to match the words in URLs_words.txt without the "Has " prefix; otherwise you will get an error in step 3.5 (upper and lower case matter).
URLs_words.txt: Has Address, Has Email, Has ID, Has Name, Has Phone, Has Populated place
Your file fields: ID, Name, phone, address, URL, Populated place, Email
Sorry for the delayed answer. Please contact me if there are other problems.
Best regards, Kostadin
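The naming rule described above can be checked before running step 3.5. The following is only a minimal sketch, not code from the Starter Kit: the helper names `expected_fields` and `missing_fields` are hypothetical, and the two lists are copied from the message above rather than read from the actual files.

```python
def expected_fields(words):
    """Strip the leading 'Has ' from each entry taken from URLs_words.txt."""
    prefix = "Has "
    return [w[len(prefix):] if w.startswith(prefix) else w for w in words]


def missing_fields(input_columns, words):
    """Return the expected field names absent from the input columns.

    The comparison is case-sensitive, so 'phone' does not satisfy 'Phone'.
    """
    present = set(input_columns)
    return [f for f in expected_fields(words) if f not in present]


# Values copied from the message above (hypothetical stand-ins for the real files).
words = ["Has Address", "Has Email", "Has ID", "Has Name", "Has Phone", "Has Populated place"]
columns = ["ID", "Name", "phone", "address", "URL", "Populated place", "Email"]

print(missing_fields(columns, words))  # ['Address', 'Phone']
```

With the fields listed above, the check reports `Address` and `Phone`, which are the two names that differ only in case.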
A KeyError appears when I try to execute cell 3.4 (finding URLs to scrape) in the Jupyter Notebook file OBEC_Starter_Kit_URLs_Finder.ipynb under the URLsFinder directory. The following method is called:
dfnt = uf.start_ws( timeout=20, sleep=0.5, urlsatstart=10, urlsatend=10, slice=0, url='Suggested URL', what='find_urls_to_scrape_from_suggested_url' )
After the method has loaded the file black_list_urls.csv, the following error message appears:
KeyError Traceback (most recent call last)
The error indicates that the pandas data frame is empty when it is not supposed to be, and this causes an exception that is not handled.
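One way to avoid this kind of unhandled exception is to test the data frame before indexing it. The sketch below is an illustration only and does not reproduce the URLsFinder code: the helper `first_value` is hypothetical, and treating 'Suggested URL' as a column name is an assumption based on the `url` argument in the call above.

```python
import pandas as pd


def first_value(df, column):
    """Return the first value in `column`, or None when the frame is
    empty or does not contain that column (instead of raising KeyError)."""
    if df.empty or column not in df.columns:
        return None
    return df[column].iloc[0]


# Hypothetical frames: one empty, one with a single suggested URL.
empty = pd.DataFrame()
filled = pd.DataFrame({"Suggested URL": ["http://example.com"]})

print(first_value(empty, "Suggested URL"))   # None
print(first_value(filled, "Suggested URL"))  # http://example.com
```

Returning None is just one possible choice; raising a descriptive error that names the missing input would also make the cause easier to find.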