It's hard to tell I can imagine some motivated individuals could utilize all sorts of packaging systems and embed them in third-party applications and so on, and extract pertinent information using this type of surveillance, and then sell this data to data brokers which would sell it to the large ad networks. I mean there's lots of ways to transcribe even most of the whisper models can run all the way down to 150 megabyte file not to mention the quantization versions of these models. I have something that I run on my computer for my server not throughout my house that does real Time transcription and whatnot but I use it for my own purposes, so you know someone who makes money off advertising or even selling insights about people, would certainly find ways to do this. I mean it's simply not regulated is it?
https://huggingface.co/spaces/jilangdi/whisper-web