-
- Harmonizer files / records use the schema from #45
- Index function on a Ring can ingest arbitrary new Vocabularies
- When indexing the namespace defined in the Vocabulary, it matches elements to…
-
Adding sitemap.xml and robots.txt files helps optimize a website for search engines.
Sitemap.xml provides a list of important URLs, helping search engines discover, crawl, and index new and updated…
-
Might make more sense to integrate it into py-wacz, which has cdxj-indexer as a dependency.
e.g. follow how py-wacz validation works to go through the indexes (https://specs.webrecorder.net/wacz/1.1.…
-
### Before submitting your bug report
- [X] I believe this is a bug. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions
- [X] I'm not able to find an [open issue]…
-
**Is your feature request related to a problem? Please describe.**
Now, omnisearch is working with all plaintext files, can it work with HTML file? I always save html by using save page WE .
**Descr…
-
On version `v3.0.75` :
While doing slack indexation with the slack connector, there is the following error being raised :
```
2024-06-10 15:36:50,541: INFO/MainProcess] Task check_for_document…
-
Can't share the repo, unfortunately. I've just installed the extension, it did the indexing, indexed everything except a single file. The only difference between this and other files that I could thin…
-
Issues in clangd:
- https://github.com/clangd/clangd/issues/587
- https://github.com/clangd/clangd/discussions/1206
- https://github.com/clangd/clangd/issues/1340
Some requests, e.g. `go to defi…
-
It seems that the dataset-viewer search function returns no hits if one searches for terms such as “what”, “can”, “which”, and so on. Has the indexing function removed stopwords like this? The rows ar…
-
### To reproduce
Without pre-creating the table, add rows to the table with the python client:
```python
with Sender.from_conf(conf) as sender:
sender.dataframe(
df,
…