Back on our hand-decoding USB from oscilloscope traces schtick.
@huffeec pointed out that they're Start-of-Frame packets rather than NAKs. (People who build protocols sending packets least-significant-bit first are even worse than those who use little-endian!)
At least now I (think) we're getting somewhere. Looks like the ESP32C3 is ignoring the computer for some reason.