Australian-Text-Analytics-Platform / cooee

a sample notebook working with A COrpus of Oz Early English (COOEE) data
1 stars 0 forks source link

Notebook doesn't get any texts #4

Open stevecassidy opened 2 weeks ago

stevecassidy commented 2 weeks ago

The step that looks for annotation files via the DerivedMaterial type fails to find anything. The resulting list of texts is empty. Also based on the example I have no idea how to approach fixing this.

There are many files in the previous step, all of type 'File' and all seemingly plain text files. The example doesn't give me any clue about how I would download them.

iamyifan commented 1 week ago

Hi Steve, I'm not from their dev team.

I created a new notebook based on their code to download Braided Channels dataset.

Simply replacing COLLECTION_ID with the new COOEE ID (from your last issue) works perfectly for me:

# COLLECTION_ID = 'arcp://name,hdl10.4225~01~4F8E1281B8E2A' # Change to the collection you want to download
COLLECTION_ID = 'arcp://name,hdl10.26180~23961609'

Let me know if you have any problems : )

moisbo commented 1 week ago

Thanks Yifao that is great! You could contribute at https://github.com/Australian-Text-Analytics-Platform with this new notebook if you like. We can add it to our repository of notebooks with some metadata.

Hi Steve -- We (LDaCA team) are working on improving cooee notebook with more interesting analysis and will be updated soon (with the correct ID!)

iamyifan commented 1 week ago

Thanks Yifao that is great! You could contribute at https://github.com/Australian-Text-Analytics-Platform with this new notebook if you like. We can add it to our repository of notebooks with some metadata.

Hi Steve -- We (LDaCA team) are working on improving cooee notebook with more interesting analysis and will be updated soon (with the correct ID!)

Hi @moisbo, I'm definitely happy to contribute this notebook. I made a few adjustments here and there yesterday.

Just wondering how I'm going to upload this notebook to the ATAP GitHub.