Closed NiwanDao closed 2 years ago
@xingjitansuo: Are you asking for extension into phase 2.6 (which concluded recently) or phase 2.7?
@orvn, for phase 2.7.
Thanks! cc @dkkapur to evaluate
@dkkapur is there any update here?
@xingjitansuo - approving this one.
@orvn can we enable this dataset for project team "XingjiTanSuo-smartcity"?
@xingjitansuo is your project name XingjiTanSuo-smartcity? We couldn't find that, but did find: Smartcity- Sensor-based network and data analysis system (link to project). Is that the right one?
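@xingjitansuo - I just looked through the past conversations on this dataset, and it looks like we did not get the full context. Can you help me with:
- who is generating the dataset?
- who is paying for the data to be collected today?
- is it available for a free download somewhere on the web today?
- what city or cities is the data being captured in?
Pending response to the above questions before approving.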
@xingjitansuo is your project name XingjiTanSuo-smartcity? We couldn't find that, but did find: Smartcity- Sensor-based network and data analysis system (link to project). Is that the right one?
Correct
@dkkapur @orvn
- who is generating the dataset? The datasets are generated by a research lab at the University of Electronic Science and Technology of China.
- who is paying for the data to be collected today? A university research fund.
- is it available for a free download somewhere on the web today? Yes, it can be accessed at http://api.sr2.glm2m.com/index.php?r=smartcity-dataset%2Fdataset
- what city or cities is the data being captured in? Most of the data is captured in Sichuan, China.
Thanks for the reply @xingjitansuo.
I have some more questions and concerns about issues I've been having with your app UI.
Did something change with the app? Last week I was able to download files successfully (.wav files). However, now I get .db files. Is it just a MIME type issue or something else?
I'm also occasionally getting 503 responses when making requests.
In order to work for Slingshot, these datasets need to be easily accessible by other participants. Is there a way for a user to download the full dataset from your app (either as one download or as just a few chunks)?
That is a good callout. The server was down and it has now been fixed. Please check one more time. The datasets are open to the public and are accessible by clicking the download button shown on the website. @orvn
@xingjitansuo: it seems to mostly work now. Two issues I'm finding:
- I am unable to download the dataset: 数据集下载 (Dataset Download) does not trigger a download.
- Some files still don't seem to download in the correct format: I'm downloading 920.wav from the 校园 (Campus) tab here, but it still returns a .db file. Many other files do work, however.
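One way to tell a mislabeled download from a genuinely different file is to look at its first bytes: a WAV file is a RIFF container, so it starts with the ASCII bytes RIFF and has WAVE at byte offset 8. A minimal sketch, assuming the file is already on local disk; `is_wav` is a hypothetical helper name, not something the app provides:

```shell
# is_wav FILE: succeed only if FILE carries the RIFF/WAVE magic bytes.
# Hypothetical helper for illustration; it checks the header only, not the audio data.
is_wav() {
  # Bytes 0-3 of a WAV file are "RIFF".
  [ "$(head -c 4 "$1" 2>/dev/null)" = "RIFF" ] &&
  # Bytes 8-11 are "WAVE" (bytes 4-7 hold the chunk size, which is skipped here).
  [ "$(dd if="$1" bs=1 skip=8 count=4 2>/dev/null)" = "WAVE" ]
}
```

For example, `is_wav 920.wav || echo "920.wav is not a WAV file"` would flag a download whose content does not match its extension.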
@orvn,
This page does not support batch download
Does this mean that other participants can't download the full dataset (without running a script that scrapes your app UI)? @xingjitansuo
Because that would be a blocker to other participants using this dataset, since Slingshot users will normally try to fill 32 GiB sectors.
@orvn I talked to the team, and they provided the batch download features specifically for onboarding to Filecoin.
I tested the download from the list and it works, but the list needs a slight modification.
All files have an extra / character in the URI constructor. Please fix this @xingjitansuo.
With the extra / removed, I ran a test on all 24k of your records and they download successfully.
...
http://117.175.0.137/nfs/179//B3/data2/883.wav
http://117.175.0.137/nfs/179//B3/data2/898.wav
http://117.175.0.137/nfs/179//B3/data2/949.wav
http://117.175.0.137/nfs/179//B3/data2/932.wav
...
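Until the list itself is fixed, the doubled slash can also be collapsed on the client side. A minimal sketch, assuming one URL per line as in the excerpt above; `normalize_url` is a hypothetical helper name:

```shell
# normalize_url URL: collapse runs of "/" in the path into a single "/",
# leaving the "//" that follows the scheme (e.g. "http://") untouched.
# Hypothetical helper for illustration only.
normalize_url() {
  # A slash run is only rewritten when preceded by a character that is
  # neither ":" nor "/", which excludes the "://" after the scheme.
  printf '%s\n' "$1" | sed -E 's#([^:/])/+#\1/#g'
}
```

For example, `normalize_url 'http://117.175.0.137/nfs/179//B3/data2/883.wav'` prints the same URL with `179/B3` in place of `179//B3`.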
To make it easier for other Slingshot users, I also think there should be a command that helps them download all files. @dkkapur, does that make sense to you?
A simple *nix-friendly version with no dependencies would look something like:
curl -s http://117.175.0.137/down.downlist\?id\=1 | xargs -L1 -I {} curl -O {}
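Given the occasional 503 responses reported earlier in the thread, a slightly more defensive variant of the one-liner may be useful. This is a sketch, not a tested replacement: it assumes the download list is plain text with one URL per line, and `download_list`, `FETCH`, and the destination directory are hypothetical names introduced here.

```shell
# FETCH is the command used for each URL; it is overridable (e.g. FETCH=echo)
# so the loop can be dry-run without network access.
# curl flags: -f fail on HTTP errors, -sS quiet but still show errors,
# --retry 3 retry transient failures (including HTTP 503), -O keep the remote file name.
FETCH="${FETCH:-curl -fsS --retry 3 -O}"

# download_list DEST_DIR: read URLs from stdin, one per line, and fetch each into DEST_DIR.
# Returns non-zero if any URL failed, after attempting all of them.
download_list() {
  dest_dir="$1"
  mkdir -p "$dest_dir" || return 1
  failed=0
  while IFS= read -r url; do
    # Skip blank lines in the list.
    [ -z "$url" ] && continue
    # $FETCH is intentionally unquoted so its flags split into separate words.
    ( cd "$dest_dir" && $FETCH "$url" </dev/null ) || {
      echo "failed: $url" >&2
      failed=$((failed + 1))
    }
  done
  [ "$failed" -eq 0 ]
}
```

Usage would look like `curl -s 'http://117.175.0.137/down.downlist?id=1' | download_list ./smartcity`, where `./smartcity` is an arbitrary target directory. Failed URLs are reported on stderr instead of stopping the whole run, so they can be retried separately.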
@orvn Please check again. :)
The URLs in the download list are fixed. @dkkapur to review.
@xingjitansuo - let's proceed with approving this for current and future phases for now. Thanks for your hard work in making it available easily to others as well!
Thanks for all your effort! @dkkapur @orvn
@xingjitansuo, you will find that your project is able to select the Smart City dataset.
@dkkapur, for now, I added it to the temporarily disallowed dataset, but if needed, you can move it up as a dataset available for everyone on the next phase at your discretion.
@xingjitansuo, the app URL appears to be down? Is this just temporary?
@orvn I synced up with the team, and it should be back up next week.
Sounds good, thanks!
You can request to continue uploading an incompletely onboarded dataset to Slingshot if it previously qualified for rewards but no longer does per the list of curated datasets for Slingshot. Please note that these requests will be reviewed on a case by case basis and approvals are only for specific project teams to continue onboarding the specific dataset.
Slingshot participation information
Dataset onboarding progress