issues
search
HazyResearch
/
fonduer
A knowledge base construction engine for richly formatted data
https://fonduer.readthedocs.io/
MIT License
407
stars
77
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
CandidateExtractor doesn't scale for larger relations
#546
robbieculkin
opened
3 years ago
1
Resolve a memory leak by large data on out_queue (related to #494)
#545
YasushiMiyata
closed
3 years ago
2
Resolve memory leaks caused by add and commit to postgres (related to #494, redo #541)
#544
YasushiMiyata
closed
3 years ago
0
docs: pin sphinx version to <4.0.0
#543
lukehsiao
closed
3 years ago
1
Add multiline Japanese strings support to HocrVisualParser() to fix #534 and redo #537
#542
YasushiMiyata
closed
3 years ago
2
Resolve memory leaks caused by adding and commiting to postgres (related to #494)
#541
YasushiMiyata
closed
3 years ago
2
Add multiline Japanese strings support to HocrVisualParser() to fix #534 and redo #537
#540
YasushiMiyata
closed
3 years ago
15
Fix sqlalchemy query error of test_postgres.py (Fix #538)
#539
YasushiMiyata
closed
3 years ago
5
Test code "test_postgres.py" failes with sqlalchemy delete method
#538
YasushiMiyata
closed
3 years ago
0
Add multiline Japanese strings support to HocrVisualParser() to fix #534
#537
YasushiMiyata
closed
3 years ago
2
Tables aren't redefined for re-runs of UDF apply
#536
robbieculkin
opened
3 years ago
5
UDF hangs with no exception / warning
#535
robbieculkin
closed
3 years ago
5
HOCRParser fails to multiline Japanese strings
#534
YasushiMiyata
closed
3 years ago
2
Its dead slow with Win10 + PY 3.6
#533
nageshsvs
closed
3 years ago
2
Parser can't handle big tables?
#532
linM24
closed
3 years ago
3
Update setup-miniconda to avoid the use of add-path and set-env
#531
HiromuHota
closed
3 years ago
6
Update visual.py
#530
annelhote
closed
3 years ago
4
hOCR preprocessor not available in latest release despite documentation suggesting othwerwise
#529
AmitPoonia
closed
3 years ago
2
Use spaCy v2.3.0 or later to use HocrVisualParser
#528
HiromuHota
closed
3 years ago
1
Tokens not aligned error when spacy < 2.3.0
#527
HiromuHota
closed
3 years ago
3
Unwrap "ocrx_line" as well as "ocr_line" as Fonduer has no data model
#526
HiromuHota
closed
3 years ago
2
unable to read images in the pdf file
#525
ashleo25
closed
3 years ago
8
Parser is not splitting multiple lines sentences properly
#524
eng-khaled1
opened
3 years ago
3
Suggestion required: Getting error while applying Featurizer
#523
AshutoshUpadhya
opened
3 years ago
3
How can i extract a paragraph and all associated sentences in document
#522
ashleo25
opened
3 years ago
1
HTMLDocPreprocessor for PDF documents is it always required
#521
ashleo25
closed
3 years ago
3
Process the tail text only after child elements (#333)
#520
HiromuHota
closed
3 years ago
2
Add HOCRDocProprocessor and HocrVisualParser
#519
HiromuHota
closed
3 years ago
9
Rename "VisualLinker" to "PdfVisualParser" to welcome "HocrVisualParser"
#518
HiromuHota
closed
3 years ago
2
docs: fix epub warning by adding version to conf.py
#517
lukehsiao
closed
3 years ago
0
docs: configure RTD using config file
#516
lukehsiao
closed
3 years ago
2
Add a missing requirement for ReadTheDocs (#512)
#515
HiromuHota
closed
3 years ago
5
Featurizer.get_keys() does not honor candidate classes in context
#514
HiromuHota
opened
3 years ago
0
CORE_XX was renamed to BASIC_XX at #283
#513
HiromuHota
closed
3 years ago
1
ReadTheDocs error
#512
HiromuHota
closed
3 years ago
4
Is this the right way to test the saved emmental models?
#511
saikalyan9981
opened
3 years ago
5
Improve an error message
#510
HiromuHota
closed
3 years ago
1
Native support for hOCR
#509
HiromuHota
closed
3 years ago
2
Use "--use-feature=2020-resolver" to fix #390
#508
HiromuHota
closed
3 years ago
2
BBox value errors
#507
saikalyan9981
closed
3 years ago
3
Support v2.3.X of spaCy, which includes pretrained models for Chinese and Japanese
#506
HiromuHota
closed
3 years ago
1
Move textual functions in data_model_utils.tabular to data_model_utils.textual
#505
HiromuHota
closed
3 years ago
1
get_cell_ngrams and get_neighbor_cell_ngrams yield nothing when the mention is not tabular (fix #471)
#504
HiromuHota
closed
3 years ago
1
get_sentence_ngrams, get_neighbor_sentence_ngrams, same_sentence should be fonduer.utils.data_model_utils.textual?
#503
HiromuHota
closed
3 years ago
0
Extracting Information from tables without Borders
#502
saikalyan9981
closed
4 years ago
4
Duplicate key error while adding two mentions which are same
#501
saikalyan9981
closed
3 years ago
9
Use miniconda to consolidate GitHub Actions workflow
#500
HiromuHota
closed
4 years ago
1
Adopt to black 20.8b
#499
HiromuHota
closed
4 years ago
1
Setup/teardown a database every unit test for better isolation
#498
HiromuHota
closed
4 years ago
1
Add `nullables` to candidate_subclass()
#497
HiromuHota
closed
4 years ago
2
Next