Closed MuffinLinwist closed 9 months ago
Just to quickly add my first thoughts from when I looked at this a bit earlier:
This line from the stacktrace has an open() call that doesn't specify any character encoding:
And the exception itself seems to come from the cp1252 decoder. So maybe we're being bitten by Python not using UTF-8 by default on Windows?
Looks like that, @johenglisch.
Just to quickly add my first thoughts from when I looked at this a bit earlier:
This line from the stacktrace has an open() call that doesn't specify any character encoding:
And the exception itself seems to come from the cp1252 decoder. So maybe we're being bitten by Python not using UTF-8 by default on Windows?
Yep, we've had that issue before in a different context: https://github.com/concepticon/pyconcepticon/issues/10
Can it be patched quickly?
Should be doable, yes. I'll set up a test environment and have a look at that.
Meanwhile, @MuffinLinwist would you mind giving the fix outlined in https://github.com/concepticon/pyconcepticon/issues/10 a try? I.e. in the Windows command prompt, before running cldfbench
, set a temporary default encoding for Python with:
set PYTHONIOENCODING=utf-8
(Please note that this is a temporary environmental variable, i.e. this is not persisted after closing the command prompt.)
Thanks @everyone for your comments. Now the error message does not appear (and it takes also way less time to run the conversion) but the requirements file still gets empty. Do you have an idea why could this be happening?
Thanks @everyone for your comments. Now the error message does not appear (and it takes also way less time to run the conversion) but the requirements file still gets empty. Do you have an idea why could this be happening?
I've tried addressing both issues in https://github.com/lexibank/pylexibank/pull/273 and https://github.com/cldf/cldfbench/pull/92, respectively. The PRs might not be final. I'll let you know once the issues are fixed completely.
@MuffinLinwist Both PRs have been merged, if you install both packages from source you can check whether this fixed the issue for you (in a venv
):
$ (my_venv) pip install git+https://github.com/cldf/cldfbench.git@9ff9fe91331030309c53d55efc36f76deb516e0f
$ (my_venv) pip install git+https://github.com/lexibank/pylexibank.git@1a52624a97f371fb6e61cc44014b1dbf0a03c142
@MuffinLinwist Have you had a chance to test the fixes?
@MuffinLinwist Have you had a chance to test the fixes?
I couldn't because my laptop broke down and I had to get a new one. So, I'm closing this issue, thanking @all of you for your assistance on it :)
I've been having problems while running the CLDF conversion on different datasets. The content on
cldf/requirements.txt
file is erased. I know it's normal behaviour that the conversion automatically overwrites all the files whenever i run makecldf. However, the file appears empty when it shouldn't. Also, I get this error message but the conversion still runs and all the other files are okay. This seems to be happening only on my end. I tried deleting everything and redownloading it again but it does the same. I'm on Windows. Here is the command I run:(venv) PS C:\Users\user\Documents\datasets\seifartecheverriboran> cldfbench lexibank.makecldf lexibank_seifartecheverriboran.py --concepticon-version=v3.1.0 --glottolog-version=v4.8 --clts-version=v2.2.0
And here is the output I get:
Here is a the dataset example. Can I kindly ask you to give me some clarity on why this is happening?