GDD-Nantes / FedShop

Code for FedShop: The Federated Shop Benchmark
GNU General Public License v3.0
8 stars 0 forks source link

Value selection using Random Sampling #70

Open mhoangvslev opened 2 months ago

mhoangvslev commented 2 months ago

This could be an use-case for RAW-JENA where it can help reduce (massively) the workload generation time.

mhoangvslev commented 2 months ago

Problem:

Steps to reproduce results

SELECT DISTINCT ?ProductXYZ WHERE { GRAPH ?g1 { ?localProduct rdfs:label ?localProductLabel; bsbm:productFeature ?localProdFeature; bsbm:productPropertyNumeric1 ?simProperty1; bsbm:productPropertyNumeric2 ?simProperty2; owl:sameAs ?product. ?localProdFeature owl:sameAs ?prodFeature. } GRAPH ?g2 { ?localProductXYZ bsbm:productFeature ?localProdFeatureXYZ; bsbm:productPropertyNumeric1 ?origProperty1; bsbm:productPropertyNumeric2 ?origProperty2; owl:sameAs ?ProductXYZ. ?localProdFeatureXYZ owl:sameAs ?prodFeature. } FILTER((?simProperty1 < (?origProperty1 + 20 )) && (?simProperty1 > (?origProperty1 - 20 ))) FILTER((?simProperty2 < (?origProperty2 + 70 )) && (?simProperty2 > (?origProperty2 - 70 ))) }



# Other 
Chat-Wane commented 2 months ago

This could be an use-case for RAW-JENA where it can help reduce (massively) the workload generation time.

To provide more context: to instanciate a templated query, you sometimes need actual values from the dataset. I assume that it does not take so long after the dataset generation (since you may still have all information in memory?). But then, assuming you want more, and the dataset is already ingested, you may use random walks (and why not web preemption?) to provide random values.

Query 5 doesn't give results.

This would need some testing (with smaller queries). Possible culprits: