hoherd 3 days ago

Benn Jordan of The Flashbulb fame (who showed up here on HN recently for saving a PNG to a bird and then retrieving it) has a YT video where he shows off several interesting techniques for capturing audio using only video capture, which seem similar to this technique. https://youtu.be/mEC6PM97IRI

Eisenstein 3 days ago

You can do something similar with a laser. The laser is reflected off of an object and into an receiver where it is converted into audio. The two techniques differ in methods, but they both rely on measuring the movement of secondary materials affected by sound waves which can then be used to infer speech.

* https://hackaday.com/2010/09/25/laser-mic-makes-eavesdroppin...

lolc 3 days ago

It's interesting that measuring speech vibration is possible, but on the whole I don't see this as viable attack. One would need to get real close to a target, the cited 60% accuracy is at 50 cm distance. At 300 cm they report 2% which seems on the level of guessing words randomly.

The "remotely" in the title is no warranted in my view. Of course there may be ways to improve on their results. But so far it seems to me that a highly directional microphone would outperform the radar at any distance. One interesting aspect here is how sensitive the attack is to ambient noise. So maybe sitting close to a target with a radar hidden in headphones, one could glean some words even in a noisy environment like a subway. But I'd still put my money on a microphone performing better in that environment.

Now I wonder whether the attack would work better on headphones. They are smaller and should vibrate more. On the other hand the surface is smaller, so on the whole they might not provide more signal to a radar.

  • bcrl 2 days ago

    Most humans would assume that giving an app permission to use the accelerometer is not the same as giving the app permission to use the microphone. It's yet another form of permission confusion that allows for unintended privacy violations.

    • 4gotunameagain a day ago

      MEMS accelerometers found in phones don't have nearly enough sampling frequency to encode meaningful audio

      • kibwen 21 hours ago

        Researchers have managed to produce photos from the theoretically 1-bit photosensor on your phone (which is similarly permissionless), so don't make that proclamation so hastily. Even if it can't decipher a conversation, it may be able to detect things about your environment that leak bits of entropy about you.

        https://news.mit.edu/2024/study-smart-devices-ambient-light-...

        • 4gotunameagain 21 hours ago

          Single pixel imaging is a well known method that depends on injecting known information (light patterns) and capturing the same scene multiple times in order to reconstruct.

          Audio is by definition a time varying signal. If you cannot sample it fast enough, the information is gone. The fundamental Nyquist-Shannon sampling theorem has proven the concept of the critical (Nyquist) frequency. There are ways around this in special cases: for band limited signals undersampling is a method to reconstruct them using sampling lower than the critical frequency, and for sparse signals compressed sensing can be used. Real life audio is neither band limited (as sampled by any device on the phone), nor sparse. I think it is a physical impossibility with the current sensors.

          https://en.wikipedia.org/wiki/Nyquist%E2%80%93Shannon_sampli...

          • kibwen 20 hours ago

            To reiterate, I'm not suggesting it should be able to listen in on conversations, but rather that it may plausibly be able to detect recurring environmental vibrations. For example, let's say your phone is left on a table unnattended, then if your house is near a train then it may be able to sense the regularity of the train's passing (whose schedule itself varies throughout the day, providing a unique fingerprint for both determining which train it might be and your position along that train's route).

            • 4gotunameagain 20 hours ago

              Yes, for sure you can sample and detect lower frequency vibrations. That's the whole point of an accelerometer :)

  • nine_k 3 days ago

    A microphone could work together with a radar; correlating the two signals may help tell the sound from the noise.

    I suppose that at certain radio frequencies human bodies are mostly translucent, while the small metal membrane is still resolvable. When a head of a person covers well the acoustic signal (the phone is at the opposite ear), a microphone is helpless, but a radar us still fairly usable.

    • lolc 3 days ago

      Note that in this study the radar targeted the whole phone, not the membrane.

  • cess11 a day ago

    As I understand it, this technology would pick up the remote end of the conversation, are you sure your directed microphone could accomplish that?

rurban a day ago

Of course Penn State. See Geschickter Fund (an CIA cover for MKULTRA research) and the School of International Affairs (SIA) (another CIA recruitment front).

thrown-0825 a day ago

i remember a poc that showed using the phones accelerometer as a microphone and bypassing app permissions.

if i remember correctly it was a defcon demo and the guy ended up getting hired by facebook.

  • ethan_smith a day ago

    That was likely Gyrophone (2014) from Stanford/Rafael researchers who demonstrated accelerometers and gyroscopes could capture speech frequencies at 80-250Hz - sufficient for speaker identification and some speech recognition without requiring microphone permissions.

    • thrown-0825 4 hours ago

      Thanks, do you know if the claim about members of that team winding up at metabook is true?

BobbyTables2 a day ago

Feels more like an advertisement for the radar than a practical application.

We already have highly accurate sensors for detecting vibrations a few feet away — EARs!

Seems like laser bounce on the phone’s glassy screen/back would be more effective…