-
```
[](https://localhost:8080/#) in extract_data_from_pdf(pdf_path)
57 # Function to extract text using the unstructured library
58 def extract_data_from_pdf(pdf_path):
---> 59 eleme…
-
I would like to add custom metadata to chunks when saved to pinecone with Pipeline.from_configs.
Following the 'Custom meta data extraction ...' notebook on [this page](https://docs.unstructured.io…
-
I have logs that are mostly json, but some logs come from system calls that can't be structured. It would be nice if I could set a rule that allowed me to capture the whole log and put it for example …
-
My custom image works as expected when ran locally against a `test.docx` from an s3 path.
But when I upload the image to lambda, I get the error `BadZipFile: Bad magic number for central directory`…
-
While reading html files we encountered the problem that we end up with an empty list.
Here is a small example:
```python
from unstructured.partition.html import partition_html
html_content="…
-
Hi,
I am trying to prune Mistral 7B (https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) and while I was able to successfully run the commands for magnitude pruning, I was facing issues with…
-
These contain valuable data nuggets among an ocean of junk and we need to be able to find the good things there.
Some sources are:
- mailing lists such as:
- https://github.com/nexB/vulnera…
-
**Topic**
> The signup options at the home page are too unstructured and doesn't look much attractive.
**Details**
Using CSS we can make a box around it and wrap it all in a single and then sty…
-
WDYT? Is this publication in scope?
```
@inbook{Maher_1997,
author = {Maher, David P.},
booktitle = {Financial Cryptography},
doi = {10.1007/3-540-63594-7_71},
isbn = {9783540696070},
issn = {161…
aewag updated
3 weeks ago
-
## ✨ Feature Request
Firstly, thanks very much for creating this library, I'm excited to use it in my research!
Secondly, would it be possible to generalise `geovista.Transform.from_unstructur…