neurosynth / ACE

Tools for automatic extraction of activation coordinates from published neuroimaging articles.
MIT License

[FIX] column parsing #51

Closed by jdkent 2 months ago

jdkent commented 2 months ago
  File "ace_create_db_and_add_articles.py", line 32, in <module>
    missing_sources = add_articles(
  File "/home/james/ACE/ace/ingest.py", line 67, in add_articles
    article = source.parse_article(html, pmid, metadata_dir=metadata_dir, **kwargs)
  File "/home/james/ACE/ace/sources.py", line 468, in parse_article
    t = self.parse_table(table_html)
  File "/home/james/ACE/ace/sources.py", line 490, in parse_table
    return super(ScienceDirectSource, self).parse_table(table)
  File "/home/james/ACE/ace/sources.py", line 215, in parse_table
    n_cols = max([n_cols_in_row(
  File "/home/james/ACE/ace/sources.py", line 215, in <listcomp>
    n_cols = max([n_cols_in_row(
  File "/home/james/ACE/ace/sources.py", line 208, in n_cols_in_row
    return sum([int(td['colspan']) if td.has_attr('colspan') else 1 for td in row.find_all(['th', 'td'])])
  File "/home/james/ACE/ace/sources.py", line 208, in <listcomp>
    return sum([int(td['colspan']) if td.has_attr('colspan') else 1 for td in row.find_all(['th', 'td'])])
ValueError: invalid literal for int() with base 10: 'NaN'
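
The traceback shows `int(td['colspan'])` failing on a table cell whose `colspan` attribute is the literal string `'NaN'`. One possible way to guard against that is sketched below. This is not the patch from this PR. The helper name `safe_colspan` and the fall-back-to-1 behavior are assumptions for illustration. `n_cols_in_row` is rewritten here to accept any iterable of objects with a dict-style `.get()`, which BeautifulSoup `Tag` objects provide.

```python
def safe_colspan(raw, default=1):
    """Return a usable column span, falling back to `default` for bad values.

    Hypothetical helper: treats missing, non-numeric (e.g. 'NaN'), or
    non-positive colspan values as a single column.
    """
    try:
        span = int(str(raw).strip())
    except (TypeError, ValueError):
        return default
    return span if span >= 1 else default


def n_cols_in_row(cells):
    # `cells` is any iterable of objects supporting .get(key, default),
    # such as the result of row.find_all(['th', 'td']) in BeautifulSoup.
    return sum(safe_colspan(cell.get("colspan", 1)) for cell in cells)


# Plain dicts stand in for table cells in this example.
row = [{"colspan": "NaN"}, {"colspan": "2"}, {}]
print(n_cols_in_row(row))
```

Falling back to 1 assumes that a malformed `colspan` should count as an ordinary cell. If the column count must be exact, logging or skipping the table may be a better choice.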