alan-turing-institute / TuringDataStories

TuringDataStories: An open community creating “Data Stories”: A mix of open data, code, narrative 💬, visuals 📊📈 and knowledge 🧠 to help understand the world around us.
Other
40 stars 12 forks source link

[Turing Data Story] UK PhD thesis metadata analysis #171

Open mhauru opened 2 years ago

mhauru commented 2 years ago

Story description

The British Library publishes a data set with almost all PhD theses ever written in the UK, called EThOS: https://doi.org/10.23636/ybpt-nh33. It’s got publication year, author name, title, and university/institution for all of them, I think some theses may have some more metadata too. The data goes back more than a hundred years. You can observe some cool trends in academic fields and institutions using it.

I have in the past done an exploratory analysis of the data (http://nbviewer.org/github/mhauru/EThOS-analysis/blob/master/analysis.ipynb). We could base the story on that, but I appreciate that it might be more fun for others to take some fresh angle on the data, and I'm very much open to suggestions there.

Ethical guideline

Ideally a Turing Data Story has these properties and follows the 5 safes framework.

Current status

Updates

KatrionaGoldmann commented 1 year ago

Ideas and meeting minutes: https://hackmd.io/wwP0w2sESDisXn2fQmah_Q