issues
search
Unstructured-IO
/
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
https://www.unstructured.io/
Apache License 2.0
9.21k
stars
764
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
rfctr(part): remove double-decoration 5 <- Ingest test fixtures update
#3693
ryannikolaidis
closed
1 month ago
0
rfctr(part): remove double-decoration 5
#3692
scanny
closed
1 month ago
0
rfctr(part): remove double-decoration 4 <- Ingest test fixtures update
#3691
ryannikolaidis
closed
1 month ago
0
rfctr(part): remove double-decoration 4
#3690
scanny
closed
1 month ago
0
bug/Titles not included in chunks by-title
#3688
dividor
opened
1 month ago
1
rfctr(part): remove double-decoration 3
#3687
scanny
closed
1 month ago
0
rfctr(part): remove double-decoration 2
#3686
scanny
closed
1 month ago
0
rfctr(part): remove double-decoration 1
#3685
scanny
closed
1 month ago
0
feat/numpy_2
#3684
mgraczyk
opened
1 month ago
1
build(deps): bump ruff from 0.4.10 to 0.6.8 in /requirements
#3683
dependabot[bot]
closed
1 month ago
1
build(deps): bump peter-evans/create-pull-request from 5 to 7
#3682
dependabot[bot]
opened
1 month ago
0
build(deps): bump ammaraskar/sphinx-action from e781e9af3e80bfe0ea539e4ea46858d51e027214 to c61ac11d9ee097caf8983c10c8b5af5861b32b54
#3681
dependabot[bot]
opened
1 month ago
0
build(deps): bump langchain-community from 0.3.0 to 0.3.1 in /requirements
#3680
dependabot[bot]
closed
1 month ago
1
build(deps): bump astrapy from 1.4.2 to 1.5.0 in /requirements
#3679
dependabot[bot]
closed
1 month ago
1
build(deps): bump openai from 1.46.1 to 1.50.2 in /requirements
#3678
dependabot[bot]
closed
1 month ago
1
bug/application/octet-stream not supported
#3677
jeremydiba
opened
1 month ago
1
Test Ingest CI [DO NOT MERGE]
#3676
cragwolfe
closed
1 month ago
0
Fix bug - Auto partition fails on text files which are empty or contain only whitespaces
#3675
tc360950
closed
1 month ago
2
bug/Auto partition fails on text files which are empty or contain only whitespaces
#3674
tc360950
opened
1 month ago
1
bug/API Fails out of the box ingesting a PDF - "File type application/octet-stream is not supported"
#3673
ReMuSoMeGA93
opened
1 month ago
5
bug/partition_msg halts for attachmentes with UNK type
#3671
S1M0N38
closed
1 month ago
7
bug/Extensions .mdx and .markdown not supported
#3670
butasebi
opened
1 month ago
0
DO NOT MERGE: CI test run only <- Ingest test fixtures update
#3669
ryannikolaidis
closed
1 month ago
0
null <- Ingest test fixtures update
#3668
ryannikolaidis
closed
1 month ago
0
rfctr(meta): refine @apply_metadata() decorator
#3667
scanny
closed
1 month ago
4
bug/html parsing incorrectly categorizing text
#3666
bhoppeadoy
opened
1 month ago
0
bug/<TypeError: unstructured.partition.common.add_element_metadata() got multiple values for keyword argument 'coordinates'>
#3665
MrForExample
opened
1 month ago
2
Fix bug causing partition_xlsx to raise error
#3663
bawgz
opened
1 month ago
0
bug/`partition_xlsx` function raises TypeError with `infer_table_structure = False` and `find_subtable = False`
#3662
bawgz
opened
1 month ago
0
rfctr(part): prepare for pluggable auto-partitioners 3
#3661
scanny
closed
1 month ago
1
build(deps): bump ruff from 0.4.10 to 0.6.7 in /requirements
#3660
dependabot[bot]
closed
1 month ago
1
bug/OCRAgentGoogleVision takes 1 positional argument but 2 were given
#3659
pprados
opened
1 month ago
2
fix: fix occasional key error when mapping parent id
#3658
badGarnet
closed
1 month ago
1
rfctr(part): prepare for pluggable auto-partitioners 2
#3657
scanny
closed
1 month ago
0
fix: update python SDK syntax for forward compatibility
#3656
awalker4
closed
1 month ago
0
rfctr(part): prepare for pluggable auto-partitioners 1
#3655
scanny
closed
1 month ago
0
bug/<502 bad gatway Error>
#3654
shriharshan
opened
2 months ago
1
Not extracting data using api_url in aws marketplace
#3653
shriharshan
closed
2 months ago
0
bug/Cannot partition doc files with multi-byte names
#3652
Snowman-s
opened
2 months ago
6
build(deps): bump ruff from 0.4.10 to 0.6.6 in /requirements
#3651
dependabot[bot]
closed
1 month ago
1
rfctr(part): add new decorator to replace four
#3650
scanny
closed
1 month ago
1
rfctr(part): extract partition.common submodules
#3649
scanny
closed
2 months ago
0
refactor: pdfminer image cleanup
#3648
christinestraub
closed
2 months ago
0
fix: correctly install mesa-gl for arm
#3647
MthwRobinson
closed
2 months ago
0
bug/error while loading unstructured.partition.pdf import partition_pdf
#3646
dtruong46me
opened
2 months ago
4
chore(file): remove dead code
#3645
scanny
closed
2 months ago
0
bug/AttributeError: 'lxml.etree._ProcessingInstruction' object has no attribute 'is_phrasing'
#3642
skehlet
closed
2 months ago
1
add requirements files to ingest cache hash key
#3641
badGarnet
closed
2 months ago
0
error in reading and parsing elements from file
#3640
prashanthin
opened
2 months ago
0
Test/full float or without plus1 <- Ingest test fixtures update
#3639
ryannikolaidis
closed
2 months ago
0
Previous
Next