This transcription data can be used since it is symptomatic for Chinese tradition in general (although this is changing): The IPA version is outdated, some special characters are being used, and the aspiration marker is the marker for ejectives.
In general, I think that with a growing number of transcription datasets (also smaller sources for regional varieites) we can populate the database, learn about new variants, etc., and in this way, we can stick to our broader definition of transcription data: a use case where people have intended to show how languages sound by providing transcriptions. With Hou's data, we will also add another continent.
This transcription data can be used since it is symptomatic for Chinese tradition in general (although this is changing): The IPA version is outdated, some special characters are being used, and the aspiration marker is the marker for ejectives.
In general, I think that with a growing number of transcription datasets (also smaller sources for regional varieites) we can populate the database, learn about new variants, etc., and in this way, we can stick to our broader definition of transcription data: a use case where people have intended to show how languages sound by providing transcriptions. With Hou's data, we will also add another continent.