-
Hi again!
As far as I know, ntopng currently _can't_ deduplicate NetFlow data like in this example here:
```mermaid
flowchart LR
nfd[NetFlow device] -->|NetFlow| np1
nfd[NetFlow device]…
-
- [ ] Support other behaviours for deduplication
- Currently, we support REPLACE i.e `last writer wins`. This replaces the whole row on conflict.
- Add MERGE functionality to dedup, which …
-
Hi authors, I'm using `dedup/bff` to run deduplication on my data. I split my data into 512 jsonl files, each containing ~170000 docs. The size of my data is about ~500G. I ran the following command:
…
-
### Suggested new feature or improvement
Enable ReFS data deduplication feature for DevDrive on Windows 11.
### Scenario
As a Developer
I want to have an ability to configure data deduplication on…
-
### Is your proposal related to a problem?
In a deployment based only on receivers (no sidecars or HA Prometheus instances), each replicated timeseries holds the same data or parts of it (for examp…
-
Hello,
I didnt see anything in the docs on deduplication, so thought I'd ask.
I do see that it has a (resume) type feature, so that is in one sense deduplication.
But what if I start running it …
-
Following up on https://github.com/superfly/litefs/issues/105, would it be possible to eventually add some kind of data deduplication if a lot of the tables are same across multiple DBs with only user…
-
In current Microsoft.ML developers may need to reduce size of huge datasets (#6679) or at least it might be advisable to do so: For many problems and algorithms hyperparameters tuning is important and…
-
Editions, authors, and places need to be deduped. This can be solved by taking Levenshtein distance across different node properties and marking the node as duplicate if it fails to meet a certain thr…
-
Steps for POSTing a new resource:
1. Validation, return SHACL report, e.g. missing title
2. Deduplication test, return with existing URI if applicable
3. Store data with new URIs
4. POST returns…