openego / powerd-data

GNU Affero General Public License v3.0
1 stars 0 forks source link

Find out if status2019 and eGon100RE can be created in one pipeline #342

Open ClaraBuettner opened 5 days ago

ClaraBuettner commented 5 days ago

Since the results from osmTGmod are not deterministic, it would help to be able to create both scenarios in one pipeline.

I will test with a Schleswig-Holstein run if that works out.

ClaraBuettner commented 5 days ago

@CarlosEpia Is it fine to use the current dev for this check or should I use another branch?

ClaraBuettner commented 5 days ago

There seems to be only one task that is only in the status quo pipeline: https://github.com/openego/powerd-data/blob/bb84edca4e99ec00cf150d8014b95d4fc03fd4a6/src/egon/data/airflow/dags/pipeline_status_quo.py#L47 I guess we can simply add this one.

ClaraBuettner commented 5 days ago

Some tasks run into errors (electricity_demand.temporal.insert-cts-load, industry.temporal.insert-osm-ind-load) because the weather year for status2019 is another one then for eGon100RE. I think that could be a problem for other tasks as well. Fixing that would require some large changes.

@CarlosEpia and @ulfmueller: Would you think that it is fine to create a status2019-scenario with the general weather year we have for the other scenarios? At least for the path it would help I guess.

CarlosEpia commented 4 days ago

Some tasks run into errors (electricity_demand.temporal.insert-cts-load, industry.temporal.insert-osm-ind-load) because the weather year for status2019 is another one then for eGon100RE. I think that could be a problem for other tasks as well. Fixing that would require some large changes.

@CarlosEpia and @ulfmueller: Would you think that it is fine to create a status2019-scenario with the general weather year we have for the other scenarios? At least for the path it would help I guess.

I think it is okay to use the same weather data for both scenarios, but I do not know how to justify it, especially because we have weather data for 2019. What do you think if we create the scenarios using 2011 weather data but update it for 2019 once the pipeline is ready?

ClaraBuettner commented 4 days ago

I think it is okay to use the same weather data for both scenarios, but I do not know how to justify it, especially because we have weather data for 2019. What do you think if we create the scenarios using 2011 weather data but update it for 2019 once the pipeline is ready?

I think we can justify it by saying that we need one consistent weather year for the scenario path. That also makes sense for the scaling, demand profiles often depend on the calendar year, so it would be better to not mix up different years.

For the validation, 2019 makes sense, but we would not do that again.

CarlosEpia commented 4 days ago

Sounds good to me.

ClaraBuettner commented 4 days ago

I tried it out but run into problems with demandregio:

  File "/home/clara/powerd-data-36/venv/lib/python3.8/site-packages/requests/sessions.py", line 703, in send
    r = adapter.send(request, **kwargs)
  File "/home/clara/powerd-data-36/venv/lib/python3.8/site-packages/requests/adapters.py", line 700, in send
    raise ConnectionError(e, request=request)
requests.exceptions.ConnectionError: HTTPConnectionPool(host='open***.ffe.de', port=4000): Max retries exceeded with url: /demandregio_spatial?id_spatial=eq.71&&year=eq.2011&&value=gt.0.0 (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x78dfdb453850>: Failed to establish a new connection: [Errno 110] Connection timed out'))

The cached data is there. Could it be the case that it was not used when the year changed?

ClaraBuettner commented 20 hours ago

The tasks electrical_neighbours.insert-generators-sq, electrical_neighbours.insert-storage-units-sq and electrical_neighbours.insert-loads-sq are not working as expected. The data in the backups includes data for 2019 and the entsoe-API can be used for years after 2015. The year 2011 is not working.