softsys4ai / unicorn

A Framework for Reasoning about System Performance using Causal AI
MIT License

Run Scalability experiments with Facebook DLRM systems. #17

Closed iqbal128855 closed 2 years ago

iqbal128855 commented 3 years ago

- Analyze the performance of the Facebook DLRM systems under different configurations. Show how difficult it is to debug misconfigurations in real-world production systems and discuss the challenges. Discuss the richness of the performance landscape (more complex behavior).
- Run CAUPER, BugDoc, SMAC, DeltaDebugging, Encore, and CBI on the DLRM fault dataset and evaluate them against the ground truth dataset for both single-objective and multi-objective performance faults.
- Show proof of scalability of CAUPER on the Facebook DLRM system with a high number of allowable values for the configuration options.
- Write up the evaluation of the Facebook DLRM systems. Analyze along 3 slices: latency, energy, and heat.
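
The ground-truth evaluation mentioned above could be scored roughly as follows. This is only a minimal sketch: the issue does not say which metrics to use, so precision and recall over the diagnosed configuration options are an assumption, and the function name `diagnosis_scores` and the option names are hypothetical placeholders, not taken from the DLRM fault dataset.

```python
from typing import Dict, Iterable


def diagnosis_scores(predicted: Iterable[str], ground_truth: Iterable[str]) -> Dict[str, float]:
    """Compare the options a debugging method blames against the true root causes."""
    pred, truth = set(predicted), set(ground_truth)
    hits = pred & truth  # options that were correctly identified
    # Guard against empty sets so the ratios are always defined.
    precision = len(hits) / len(pred) if pred else 0.0
    recall = len(hits) / len(truth) if truth else 0.0
    return {"precision": precision, "recall": recall}


# Hypothetical option names, for illustration only.
scores = diagnosis_scores(
    predicted=["batch_size", "num_workers", "embedding_dim"],
    ground_truth=["batch_size", "embedding_dim", "cpu_frequency", "swappiness"],
)
print(scores)
```

Running the same function once per method (CAUPER, BugDoc, SMAC, and so on) and once per fault would give comparable numbers for both the single-objective and multi-objective cases.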