wineslab / colosseum-oran-coloran-dataset

Dataset for the paper M. Polese, L. Bonati, S. D'Oro, S. Basagni, T. Melodia, "ColO-RAN: Developing Machine Learning-based xApps for Open RAN Closed-loop Control on Programmable Experimental Platforms," IEEE Transactions on Mobile Computing, pp. 1-14, July 2022.
GNU General Public License v3.0

How do the files correspond with each other? #6

Closed: maticpokorn closed this issue 1 year ago

maticpokorn commented 1 year ago

Hi!

I can't seem to understand how the .csv files are connected with each other. I presume the BS and UE data are measured on the same time interval, but they all have slightly different lengths, for example a .csv for UE1 could have 50 lines more than a .csv for UE2. Is there any offset involved or do they all start at line 1 and the measurements just stop earlier for some network elements?

Also, how do these files correspond to the files in the directory 'slices_bs{i}'? It looks to me that there are always 6 files in there, 2 for each slice, but how do they connect to the BS and UE files?

In the README.md, under 'UEs belong to different traffic classes' there are UEs with numbers going up to 44, but when I counted them, there are only 40 listed.

leonardobonati commented 1 year ago

> I can't seem to understand how the .csv files are connected with each other. I presume the BS and UE data are measured on the same time interval, but they all have slightly different lengths, for example a .csv for UE1 could have 50 lines more than a .csv for UE2. Is there any offset involved or do they all start at line 1 and the measurements just stop earlier for some network elements?

Measurements should start when the UE connects and stop when it disconnects. If UEs connect or disconnect at different times, their files are likely to differ in length.
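A minimal sketch of how traces of different lengths could be compared, assuming each metrics file carries a timestamp column. The column name `Timestamp`, the helper names, and the sample rows below are all assumptions for illustration and are not confirmed in this thread. The idea is to align rows by time rather than by line number, keeping only the interval during which every UE was connected.

```python
import csv
import io


def read_rows(handle, ts_key="Timestamp"):
    """Read a metrics CSV into a list of dicts, parsing the timestamp as int.

    `ts_key` is an assumed column name; adjust it to the real header.
    """
    rows = []
    for row in csv.DictReader(handle):
        row[ts_key] = int(row[ts_key])
        rows.append(row)
    return rows


def common_window(traces, ts_key="Timestamp"):
    """Return (start, end) of the interval covered by every trace, or None."""
    start = max(min(r[ts_key] for r in rows) for rows in traces)
    end = min(max(r[ts_key] for r in rows) for rows in traces)
    return (start, end) if start <= end else None


def clip(rows, window, ts_key="Timestamp"):
    """Keep only the rows whose timestamp falls inside the window."""
    start, end = window
    return [r for r in rows if start <= r[ts_key] <= end]


# Synthetic data: the first UE connects earlier and disconnects later.
ue_a = read_rows(io.StringIO("Timestamp,value\n100,1\n200,2\n300,3\n400,4\n"))
ue_b = read_rows(io.StringIO("Timestamp,value\n200,5\n300,6\n"))

window = common_window([ue_a, ue_b])
aligned_a = clip(ue_a, window)
aligned_b = clip(ue_b, window)
```

With real files, `io.StringIO(...)` would be replaced by `open(path, newline="")`.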

> Also, how do these files correspond to the files in the directory 'slices_bs{i}'? It looks to me that there are always 6 files in there, 2 for each slice, but how do they connect to the BS and UE files?

There are 2 files for each slice since the dataset was collected with 2 UEs per slice.

> In the README.md, under 'UEs belong to different traffic classes' there are UEs with numbers going up to 44, but when I counted them, there are only 40 listed.

Sorry, I'm not sure I get what you mean here.

maticpokorn commented 1 year ago

Thank you for the clarification!

In the last point I was referring to this part of the README file:

[Image: screenshot of the UE list from README.md]

Here there are 40 listed UEs with labels going up to 44 (I guess there are 4 missing), but above you state that there were 42 UEs used when acquiring the dataset. That was not entirely clear to me.

leonardobonati commented 1 year ago

This is a typo in the README.md, thank you for spotting it. The correct configuration should be the following. You can also verify which user is associated with which slice by looking at the slice_id column in the .csv files (e.g., file 1010123456002_metrics.csv would be UE 2, 1010123456003_metrics.csv UE 3, and so on).

There should be 7 missing nodes that correspond to the base stations.

I will update the README.md accordingly.
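The check described above can be sketched roughly as follows. This is an illustration only: it assumes the UE number is given by the last three digits of the numeric file name prefix (inferred from the two examples `1010123456002_metrics.csv` and `1010123456003_metrics.csv`), and the helper names and sample rows are hypothetical. Only the `slice_id` column name comes from the thread.

```python
import csv
import io
import re
from collections import Counter


def ue_number(filename):
    """Extract the UE number from a name like '1010123456002_metrics.csv'.

    Assumption: the last three digits of the numeric prefix are the UE number.
    """
    match = re.fullmatch(r"(\d+)_metrics\.csv", filename)
    if match is None:
        raise ValueError(f"unexpected file name: {filename}")
    return int(match.group(1)[-3:])


def slice_of(handle, column="slice_id"):
    """Return the most frequent slice_id value found in a metrics CSV."""
    counts = Counter(row[column] for row in csv.DictReader(handle))
    if not counts:
        raise ValueError("no data rows found")
    return counts.most_common(1)[0][0]


# Synthetic sample standing in for the contents of one metrics file.
sample = io.StringIO("Timestamp,slice_id\n1,0\n2,0\n3,0\n")

ue = ue_number("1010123456002_metrics.csv")
slice_id = slice_of(sample)
print(f"UE {ue} is in slice {slice_id}")
```

Running this over every `*_metrics.csv` file in a directory would give a UE-to-slice table that can be compared against the list in the README.md.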

maticpokorn commented 1 year ago

Ok it makes sense to me now, thanks a lot for all the clarifications!