NDelventhal / InsolvencyAnnouncementsGer

InsolvencyAnnouncementsGer is a Python library for searching, viewing and scraping public announcements of German bankruptcy courts.
MIT License
9 stars 0 forks source link

Possibly outdated #2

Open FelixWeichselgartner opened 1 year ago

FelixWeichselgartner commented 1 year ago
import InsolvencyAnnouncementsGer as ia

ia.insol_proc_scr(reg = ["HRA", "HRB"], state = "Berlin",date_from = "30.08.2022", date_to = "", name = "",
                  domicile = "", department_number = "", register_reference = "", seq_number  = "", year = "",
                  reg_court = "", reg_number = "", subject = "", search_type = "unlimited", ins_court = "",
                  scrape_html = True)

returns

  File "InsolvencyAnnouncementsGer.py", line 208, in insol_proc_scr
    text = soup.find_all('table')[1].find_all('p')[0].find_all('b')[0].get_text()
IndexError: list index out of range

with the variable soup being

Register type: HRA
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">

<html><head>
<title>403 Forbidden</title>
</head><body>
<h1>Forbidden</h1>
<p>You don't have permission to access this resource.</p>
</body></html>

My guess is that something is outdated. Going to dig into it and post here...

FelixWeichselgartner commented 1 year ago

seems like alt.insolvenzbekanntmachungen.de does not exist anymore and neu.insolvenzbekanntmachungen.de should be used.

jurekvisionneo commented 9 months ago

Hi @FelixWeichselgartner how did you solve the issue? I am encountering the same problem even though I have changed the URL to "neu.insolvenzbekanntmachungen.de". Is there any other change I have to make? Thanks in advance

FelixWeichselgartner commented 9 months ago

@jurekvisionneo changing the url did not solve the problem for me either I did not further investigate this problem Keep me posted if you solve the problem