Closed ttoivone closed 1 year ago
Raised an issue on Mozilla's standards positions
+@eehakkin +@riju
Thanks! Responding having seen you post https://github.com/w3c/webcodecs/issues/607, and speaking as a BBC contributor, with my Media WG chair hat removed.
We need to be sure that there are no ethical issues with exposing this to the web. I raised these concerns at the TPAC 2021 breakout meeting: https://www.w3.org/2021/10/20-webrtc-ic-minutes.html
It's good that detecting facial expressions is stated as a non-goal, but I'd recommend going further to say it "must not" rather than "does not need to". Misdetection is a concern, as mentioned in the explainer, but also, there are privacy implications of exposing inferred emotions, at least without strong user consent.
As such I'd want to see this proposal go through wide review, including Privacy and TAG.
> I'd recommend going further to say it "must not" rather than "does not need to". Misdetection is a concern, as mentioned in the explainer, but also, there are privacy implications of exposing inferred emotions, at least without strong user consent.

I changed the wording in the explainer as you suggested, and it will be updated in the next PR. However, while I have no problem updating the wording, I don't personally see this as an issue with the proposed API. Misdetection is an issue, but by not offering the detection in the Web API we just make people run their own custom detection algorithms, which hardly improves the situation. I don't see any privacy issues here: the metadata is inferred from the same frame to which it is attached, so it does not give whoever gets the frame any new information that the original frame alone wouldn't already provide. Privacy issues would exist only if the metadata were delivered to the user without the related video frame, but neither the proposed API nor other Web APIs do that.
Changes in 5f8b11b:
id
more accurately
@youennf @jan-ivar Requesting review. I couldn't add reviewers myself for some reason.
Is the PR ready for CfC?
Thanks @ttoivone for updating the explainer, looks OK from my point of view.
> Is the PR ready for CfC?

We are still waiting for review comments from the WebCodecs team (Dan Sanders/Dale Curtis).
Changes in e2ec3d6:
Feedback from WebCodecs (Dale Curtis) was positive: "Structure looks good to me for VideoFrameMetadata. I defer to @youennf around correctness issues for what metadata should be there." @jan-ivar was removed inadvertently from the reviewer list, and I still can't add reviewers myself, sorry.
@youennf @jan-ivar: Please let us know if further updates to the PR are needed before CfC, thanks.
@dontcallmedom
Updated the PR. All previous comments should now have been addressed, either by changing the PR or otherwise. Asking reviewers to check whether this version can be merged or whether more changes are needed.
@jan-ivar @alvestrand @chrisn @martinthomson @youennf
In particular, after the CfC, three objections were made:
Segmentation metadata #79
Scope of Applicability #84
Variance of Results #85
As per the Feb 21 meeting, the proposal was to mark issues 84 and 85 as non-blocking. This updated PR should now address issue 79, which was a blocker.
Asking @adoba to mark issues 84 and 85 as non-blockers and checking if this PR now unblocks issue 79.
Given @ttoivone's comment, I think we should review the PR at the next editors' meeting.
Changes in the latest update of the PR:
id
Editors agreed to merge with the change above.
This PR supersedes the previous PRs related to face detection (#57, #48). It adds the constraints (and related settings and capabilities) and extends the recently introduced VideoFrameMetadata to have descriptions for faces in the frames.
The feedback has been taken into consideration: the API has been simplified by removing most of the previously proposed constraints, leaving only those judged to be essential for good performance. The mesh-based facial description has also been removed. An exception is face landmarks, which are already supported by some platforms and could therefore be immediately useful. Furthermore, the term HumanFace is used instead of the more generic DetectedFace to anticipate future extensions of VideoFrameMetadata.
The PR consists of two commits. The first updates the explainer and the second updates the spec.
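As a rough illustration of how a page might consume per-frame face descriptions of the kind described above, here is a minimal sketch. Everything in it is an assumption made for illustration: the interface names (`FrameMetadataSketch`, `HumanFaceSketch`), the `humanFaces`, `contour` and `landmarks` fields, and the use of normalized coordinates are hypothetical and are not taken from the spec text in this PR. The sketch works on plain objects, so it does not depend on any browser support.

```typescript
// Hypothetical shapes for illustration only. The field names below are
// assumptions, not the dictionaries defined by the PR.
interface PointSketch { x: number; y: number; }

interface HumanFaceSketch {
  contour: PointSketch[];                                        // assumed: outline of the face
  landmarks?: { type: string; locations: PointSketch[] }[];      // assumed: optional landmarks
}

interface FrameMetadataSketch {
  humanFaces?: HumanFaceSketch[];                                // assumed: absent when detection is off
}

interface BoxSketch { left: number; top: number; width: number; height: number; }

// Compute the axis-aligned bounding box of a contour, or null if it is empty.
function contourBounds(contour: PointSketch[]): BoxSketch | null {
  if (contour.length === 0) return null;
  const xs = contour.map((p) => p.x);
  const ys = contour.map((p) => p.y);
  const left = Math.min(...xs);
  const top = Math.min(...ys);
  return {
    left,
    top,
    width: Math.max(...xs) - left,
    height: Math.max(...ys) - top,
  };
}

// Collect one bounding box per described face; frames without face
// descriptions yield an empty list.
function faceBounds(metadata: FrameMetadataSketch): BoxSketch[] {
  const faces = metadata.humanFaces ?? [];
  return faces
    .map((face) => contourBounds(face.contour))
    .filter((box): box is BoxSketch => box !== null);
}

// Usage with a hand-written metadata object standing in for a real frame.
const exampleMetadata: FrameMetadataSketch = {
  humanFaces: [
    {
      contour: [
        { x: 0.25, y: 0.25 },
        { x: 0.75, y: 0.25 },
        { x: 0.75, y: 0.5 },
        { x: 0.25, y: 0.5 },
      ],
    },
  ],
};
const exampleBoxes = faceBounds(exampleMetadata);
```

Because the descriptions travel with the frame they were inferred from, a consumer like this only needs the frame's metadata and no separate detection pass of its own.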