The Artemis Scraper retrieves key data points such as spread, expected loss, maturity, and attachment point from Artemis, the main catastrophe bond directory. It works by opening each deal article on the Artemis platform and analyzing the content using regex patterns and other language-processing methods. The extracted data is then saved to an Excel sheet named `Transactions_Chart.xlsx`.

If `Transactions_Chart.xlsx` already exists, the scraper will only add new transactions to it by fetching data from deals that haven't been scraped yet. If the file doesn't exist, the scraper will create it and populate it with the latest 1000 transactions from the Artemis directory. The scraper also generates a `Pricing_Chart.xlsx` file that shows regressions of spread on expected loss based on a set number of parameters.

Before running the scraper, ensure that the necessary Python modules are installed. The required modules are listed in the `requirements.txt` file.
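To make the extraction step concrete, here is a minimal sketch of regex-based parsing; the sample text, patterns, and field names are hypothetical illustrations, not the patterns the script actually ships with:

```python
import re

# Hypothetical snippet of a deal article; the real text is fetched from Artemis.
article_text = "The notes priced with a spread of 5.25% and an expected loss of 1.10%."

# Illustrative patterns; the production regexes in artemis_scraper.py differ.
patterns = {
    "Spread": r"spread of\s*([\d.]+)%",
    "Expected Loss": r"expected loss of\s*([\d.]+)%",
}

record = {}
for field, pattern in patterns.items():
    match = re.search(pattern, article_text, re.IGNORECASE)
    record[field] = float(match.group(1)) if match else None

print(record)  # {'Spread': 5.25, 'Expected Loss': 1.1}
```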
Clone the Repository:

```bash
git clone https://github.com/yourusername/artemis-scraper.git
cd artemis-scraper
```
Install Dependencies

Next, install the necessary Python packages using pip:

```bash
pip install -r requirements.txt
```

This will install all the required libraries listed in the `requirements.txt` file.
Set Up the Working Directory
The script uses a hybrid approach to determine the working directory:
Default Directory: You can specify a default working directory in a `config.ini` file. Create a `config.ini` file in the root of the project directory with the following structure:

```ini
[Settings]
working_directory = /path/to/your/default/directory
```

Replace `/path/to/your/default/directory` with the actual path to your desired default directory.
User Prompt: If the `config.ini` file is not found or if you prefer to use a different directory, the script will prompt you to enter the working directory when it runs.

The script will change the working directory based on your input or the default provided in the `config.ini` file.
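A minimal sketch of this hybrid lookup, assuming the standard-library configparser and the [Settings] / working_directory layout shown above (the helper name is hypothetical):

```python
import configparser
import os

def resolve_working_directory(config_path: str = "config.ini") -> str:
    """Return the directory from config.ini, falling back to a user prompt."""
    parser = configparser.ConfigParser()
    if parser.read(config_path) and parser.has_option("Settings", "working_directory"):
        return parser.get("Settings", "working_directory")
    # No config found (or key missing): ask the user interactively.
    return input("Enter the working directory: ").strip()

os.chdir(resolve_working_directory())
```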
To run the scraper, simply execute the `artemis_scraper.py` script:

```bash
python artemis_scraper.py
```
The script will begin scraping the Artemis Deal Directory, extracting the specified data points from each deal article.
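For example, collecting the deal-article links could look roughly like the sketch below; the directory URL is the public Artemis Deal Directory, but the link filter is an assumption for illustration rather than the script's actual logic:

```python
import requests
from bs4 import BeautifulSoup

# Public directory page; the href filter below is a guess for illustration.
directory_url = "https://www.artemis.bm/deal-directory/"
response = requests.get(directory_url, timeout=30)
soup = BeautifulSoup(response.text, "html.parser")

# Collect candidate deal-article links (assumed to live under /deal-directory/).
deal_links = {
    a["href"] for a in soup.find_all("a", href=True)
    if "/deal-directory/" in a["href"]
}
print(f"Found {len(deal_links)} candidate deal pages")
```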
The scraped data will be automatically saved to an Excel file named `Transactions_Chart.xlsx`. The scraper behaves as follows:

- If the `Transactions_Chart.xlsx` file already exists, the code will only open new deals and update the Excel sheet by adding new rows for transactions that have not been scraped yet (if any).
- If the `Transactions_Chart.xlsx` file does not exist, the code will create the file and scrape the last 1000 transactions from the Artemis directory.
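A hedged sketch of that create-or-append logic using pandas, assuming each transaction carries a unique "Deal" identifier column (the actual column names may differ):

```python
import os
import pandas as pd

EXCEL_PATH = "Transactions_Chart.xlsx"

def save_transactions(new_rows: pd.DataFrame) -> None:
    """Append only not-yet-scraped deals, or create the sheet from scratch."""
    if os.path.exists(EXCEL_PATH):
        existing = pd.read_excel(EXCEL_PATH)
        # Keep only deals whose identifier is not already in the sheet.
        fresh = new_rows[~new_rows["Deal"].isin(existing["Deal"])]
        combined = pd.concat([existing, fresh], ignore_index=True)
    else:
        combined = new_rows  # e.g. the latest 1000 transactions on a first run
    combined.to_excel(EXCEL_PATH, index=False)
```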
Additionally, the `Pricing_Chart.xlsx` file shows regressions of spread on expected loss based on a set number of parameters.
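A minimal sketch of one such regression using statsmodels (an assumption; the script may use a different library, and the column names are illustrative):

```python
import pandas as pd
import statsmodels.api as sm

df = pd.read_excel("Transactions_Chart.xlsx")

# Ordinary least squares: spread regressed on expected loss.
X = sm.add_constant(df["Expected Loss"])
model = sm.OLS(df["Spread"], X, missing="drop").fit()
print(model.summary())
```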
If you want to customize which data points are extracted, you can modify the `artemis_scraper.py` script. Locate the section where data is parsed and add or remove fields according to your needs.
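For instance, if the parsing section is organized as a field-to-pattern mapping (a hypothetical structure; the real layout of artemis_scraper.py may differ), adding a field is one extra entry:

```python
# Hypothetical mapping of output columns to regex patterns.
patterns = {
    "Spread": r"spread of\s*([\d.]+)%",
    "Expected Loss": r"expected loss of\s*([\d.]+)%",
    "Attachment Point": r"attachment point of\s*([\d.]+)%",  # newly added field
}
# Removing an entry drops that column from Transactions_Chart.xlsx.
```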
You can look at a customizable example under the `examples` folder.
This project is licensed under the MIT License. See the LICENSE file for more details.