Closed by yanj-github 2 years ago
Can I also suggest we choose 2 PN seeds and document them in the annotations spec? Then use "PN1" or "PN2" in the filename?
We don't have a specific requirement for the seed; as long as we can read it from the file name, that is fine. What does PN stand for, please? Would you also like to suggest seeds for the splicing main and ad content?
@cta-source @yanj-github Please explicitly agree what you want & don't expect @nicholas-fr to do anything unless/until you have reached agreement.
@jpiesing , totally agree--I'm hoping to intercept any "production" path mezzanine audio files while we get documented agreement. @nicholas-fr has put together a spec for these kinds of details in the mezz annotations. I'll discuss here and if we are OK, I'll edit his draft with the results.
Regarding "PN", sorry, that's an abbreviation for Pseudo-Noise, which is what we're getting with the Python numpy.random library methods. The term PN is commonly used in signal processing to indicate not-actually-random-but-random-like sequences.
I think we already discussed that the Python libraries for PN sequences aren't long-term reproducible. Changes in the library may result in equally valid PN sequences that don't match what we're using now. So we need to work from static copies of our defined PN sequences. In other words, the test code must draw the PN sequence from a WAVE-validated file, not from a library call. That's easy, we just need to archive the PN sequences.
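As a minimal sketch of "draw the PN sequence from a validated file, not from a library call", the helper below reads an archived file and refuses to return it unless its MD5 matches a recorded value. The function name `read_verified_pn_file` and the idea of checking against a recorded MD5 are assumptions for illustration; the thread does not specify the archive format, so the sketch returns raw bytes and leaves decoding into samples to the caller.

```python
import hashlib
from pathlib import Path


def read_verified_pn_file(path, expected_md5):
    """Return the raw bytes of an archived PN file after checking its MD5.

    Hypothetical helper: the name and the byte-level return type are
    assumptions, not part of audiomezz.py or the annotations spec.
    """
    data = Path(path).read_bytes()
    actual = hashlib.md5(data).hexdigest()
    # Compare case-insensitively so upper-case hex digests are accepted too.
    if actual.lower() != expected_md5.lower():
        raise ValueError(
            f"{path}: MD5 {actual} does not match expected {expected_md5}"
        )
    return data
```

Test code built this way fails loudly if the archived sequence is ever replaced or regenerated with a different library version, rather than silently comparing against a different sequence.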
Another dev note, when converting ASCII to decimal to create the entropy seed, we need to make sure we use leading zeros. E.g., for ascii("P"), use "080" rather than "80". Using the shorter form results in less than 32 bits of entropy (about 26 bits). This should actually still work fine, either way, but we need to have a consistent approach. Note that the current audiomezz.py does NOT add leading zeros, so if we make this change we need to be consistent.
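To make the leading-zero point concrete, here is a small sketch of the string-to-seed conversion with and without padding. The function name `seed_from_string` and the `pad` flag are hypothetical, and this is not the code in audiomezz.py; it only illustrates the arithmetic described above.

```python
def seed_from_string(source, pad=True):
    """Concatenate the decimal ASCII codes of `source` into one integer.

    With pad=True each code is written as three digits ("P" -> "080").
    With pad=False the codes are written as-is ("P" -> "80").
    Hypothetical helper for illustration only.
    """
    digits = "".join(
        f"{ord(ch):03d}" if pad else str(ord(ch)) for ch in source
    )
    return int(digits)


padded = seed_from_string("PN01")             # 80078048049
unpadded = seed_from_string("PN01", pad=False)  # 80784849

# bit_length() gives the number of bits needed to represent each seed.
print(padded, padded.bit_length())
print(unpadded, unpadded.bit_length())
```

Note that the very first leading zero disappears when the digit string is converted to an integer, so the padded seed for "PN01" is 80078048049. Padding only changes the result when a character other than the first has a code below 100.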
I recommend we set up some kind of structure like the following:

| Seed Source String | Seed | Use | MD5 Hash |
| -- | -- | -- | -- |
| test | 116101115116 | Development purposes | A18D8B6BEF5A37C2018EA02E7A0B3F5A |
| PN01 | 80078048049 | Primary content, production code | 3A2E4B3B41FED078241A02DE6848ACA9 |
| PN02 | 80078048050 | Ad (spliced) content, production code | 144A3E6C0D4D4F4EC42D35C9BD2E00ED |
| PN03 | 80078048051 | Spare No. 1 | 72F92675E263E5AFD0F34FA5EE70DCE0 |
| PN04 | 80078048052 | Spare No. 2 | 7120086841ECB45721D9CE89A7E5D732 |
Could you kindly create another 60-second mezzanine, the same as the one that is there now but with a different seed, please? We would like the seed to be noted in the file name if possible.
We also require white noise mezzanines (main and ad) for the splicing test, if you could help create those as well, please.