First all pass version got.

HigashikataZhangsuke commented 3 months ago

Now try to add more profiling data, as well as the part data. We may get results by Monday!!!!

HigashikataZhangsuke commented 3 months ago

Tomorrow working on three thing: 1.PPFaaS Slide -> OK, slightly tune the version. For testing results may not need too many slides so put them at the end, as the extra parts. 2.Small bug fix, and Profiling data get and MBA last experiment result. -> Waiting for collection of Profiling data, and do a "second time test" for MBA. -> Re did MBA, nothing changed. Now working on get Profiling data.

Try to run some results for our system. And also finish MXFaaS's code modification. -> Modification finished, and find out they add the resource usage at the log of their nodecontroller.py . now do and modify our code, to enable multiple functions could run at the same time. Also, try to get some results of MXFaaS, to figure out their log usage. LBNL, align their and our test method script. I think it's better for us to do a at least 2/3 sec test. -> Already know what should do: change the nc part to add a configuration record; and then for the running test script, add the trace execution record. -> OK Nearly everything corrected and Ready for get final results.

Then, the day after tomorrow: Get all the results we want.

HigashikataZhangsuke commented 3 months ago

Trace Selection: Redo, since we also maybe have M-M function. 2Function Co-Run: ['che', 'mls'] ['omp', 'mls'] ['rot', 'res'] ['mlt', 'omp'] ['pyae', 'res'] ['rot', 'omp'] ['alu', 'che'] ['pyae', 'omp'] ['mlt', 'vid'] ['web', 'mls'] 4Func Co-run ['alu', 'mlt', 'mls', 'che'] ['alu', 'pyae', 'web', 'mls'] ['web', 'omp', 'che', 'res'] ['alu', 'pyae', 'mlt', 'vid'] ['alu', 'web', 'omp', 'mls']

HigashikataZhangsuke commented 3 months ago

For Profiling Data, Record these: 1、Standalone Latency Latest 2、Peak Throughput-CPU curve when Running Inside our system 3、Intel MBA memory profiler's MemBW usage. 4、Docker Stats for Memory usage. -> Svc is not like this, maybe this is the only way... Yes you can but the name resolution is a problem may leave it later, it does not matter currently. 5、Per-Func CPU usage is one, which is determined, at least you need one CPU 6、Cache Usage: also one. The minimum cache way allocated is 1 7、Other resource-peakTP curve, for global co-placement The Bold metrics are currently not important for single-node tests, and could be left until multiple-node tests.

HigashikataZhangsuke commented 3 months ago

Dockerfile need To change from gunicorn to single thread. I guess it's becaues multiple threads running together, therefore caused bad results. Try to find out why have this bug, since ultra load may need multiple threads?

Cannot find out which part caused this. Just skip this, leave it to later if we do find out sth wrong happened, or single thread is not enough for routing all requests.

HigashikataZhangsuke commented 3 months ago

Note that need to modify imgres and web code. Toolong. PF： CAlu: 0.0111907+4.2MB/s MChe:5.42909+Max2600,Avg1000 MImgRes: 0.8005+Avg 900 CImgRot: 0.7840 + Avg 500 MMLS:0.437876+AVg 3500,Peak 4000 CMLT:7.0979 + Avg 5.2 Momp:4.141488 +6000MB/s,MAX around 6200 Cpyae: 0.28206 + 11MB/s Mvid:1.348898+2000MB/s(Peak,Avg ~ 1500) Cweb: 0.30621425+ 30MB/s

HigashikataZhangsuke commented 3 months ago

Please Tune your function co-placement test trace selection, based on the new profiling data. -> I think maybe just 25,50, 100,200,400,800 as the 6 baselines, to figure out the results here. For single node test, the pktp should not exceed the maximum workload you could done. Also, need to know that use proportional method to send? but not Round Robin? Double check here, could think this problem when having dinner. Here, use MBA BW monitor to get the results, so it shall be accurate.