-
One big problem when scraping content, is that the fulltext is often not trivially accessible.
Instead, we have the whole DOM HTML document. One approach would be to use the html roles: https://wi…
-
### Self Checks
- [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
-
Seeing the following deprecation warnings in Zeek stderr.log:
```
warning in /opt/zeek/share/zeek/policy/securityonion/file-extraction/./extract.zeek, lines 63-65: "when" statement referring to loca…
-
Good day.
I have finished developing my add-on for the game. I would like my addon to be included in your patch.
My supplement includes:
1. Division into AI levels (4 levels with 4 computer oppon…
-
Description
Hi Team ,
We have identified a critical issue with the Properties API data extraction.
whenever any update happens on a Property in Reapit ; Over the callback API the property descr…
-
Hello there Mikf, hope you are doing well my man! It's been a while since i've posted here, been super busy with school and stuff. Anyways was hoping if I could get this site https://archive.4plebs.or…
-
_migrated from Trac, where originally posted by **kohlhase** on 3-Mar-2009 6:03pm_
We need to think about a NarCon structure for OMDoc documents.
With the OMDoc1.6 language design I would like to m…
-
### Context
(reported by @bonjarlow) Many of our training sources don't have h1–6 or meta tags; not every page follows those conventions.
example: https://www.longbeach.gov/police/press-release…
-
Dear Sami Pietilä,
Thanks for excellent software for metaproteomic DIA data! I met a problem when running glaDIAtor on Windows 10 platform from web browser. My input rawdata was one of your exampl…
-
Windows 11 x64, Python 3.10.11 + torch 2.0.2 + cu11.8.
Running on local URL: http://127.0.0.1:8888
To create a public link, set `share=True` in `launch()`.
IMPORTANT: You are using gradio versio…