-
Notifications
You must be signed in to change notification settings - Fork 573
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how do I query output latency for echo cancellation #80
Comments
Hey,
About 1) (5*128)/24kHz = 26ms Also you can query the maximum buffer-latency by looking at the buffer capacity with getBufferCapacityInFrames() hopefully helpful sources Greetings Niklas |
Thank you so much!谢谢你! |
If I want get the right delay for all android devices,I shoud measure the delay for every devices and save to a device_delay_list,is it right? |
Great answers @niklasdahlheimer. Just a couple of things to add:
This is almost correct. The buffer size determines the latency between your app and the Android audio framework. It (unfortunately) does not account for latency from the Android audio framework to the audio device. There are, however, a few methods of obtaining the "app to audio device rendering" latency: From API 19: Use AudioTrack.getTimestamp (in Java only) There are 2 caveats with this approach to calculating latency:
This is a common source of confusion. "sampleRate" nearly always refers to "sampleRate per channel" so even if you have 10 channels your frameRate will still be 48kHz, with each frame containing 10 samples. With this in mind if you had a buffer of 5 "bursts" and each burst is 128 frames the latency would be: (5 * 128) / 48000 = 0.01333s = 13.33ms To answer @yangwuan55
I would try the "get timestamp" approach above first. If that doesn't work out then try one or all of the following. You could measure delay manually on different devices, and that is indeed what some developers do. It definitely gives the most accurate results. Although the process of acquiring and testing Android devices (and testing them on different O/S versions) can be expensive and time consuming, depending on the number of models you test. If you go down this road I can recommend getting a WALT device: /~https://github.com/google/walt which will allow you to test the output latency in isolation (and is what we use to test new Pixel devices). Another approach is to include a "loopback" test in your app which runs when the app is first run. The app plays a simple sound (like a sine wave) and measures the time taken for that sound to reach the built-in microphone. This measurement gives you the "round-trip latency" (ouput+input+app). If you just want the output latency then a rough estimate is to subtract about 3ms off that figure as input latency is typically much lower than output latency (source: /~https://github.com/google/walt/blob/master/docs/AudioLatency.md). If you're writing an app which requires the use of headphones (meaning the sound played cannot be heard at the built-in microphone), such as a karaoke app, you can implement a "clap track". The app plays a clap sound at regular intervals and the user claps along whilst the app records the clap sounds. Assuming the user is able to clap exactly when they hear the clap the app can measure the difference between the clap played and the clap recorded in order to obtain the round-trip latency figure. |
How about add a api to the android framework in the futrue?Let the oem offer the delay of the device. |
Please see this related bug: Note that knowing the reported latency will not be precise enough for echo cancellation. You will still need to run an adaptive filter that sync up with the actual echo. The timestamps are only accurate enough for AV sync (lip sync). |
I added calculateLatencyMillis() in this Pull Request. Can we close this? |
@philburk from which API level is the calculateLatencyMillis() method available? (I'm looking for a solution that can work from level 19) |
It requires AAudio, which was added in API 26, Oreo Cookie. I am closing this because we added calculateLatencyMillis(). |
I do not want to reopen this issue, but maybe @dturner, @philburk or someone else could specifiy the term "audio device", the point where the presentation timestamp is coming from. So is it:
? I see that the term is coming from other audio-frameworks, but I can not find a precise defintion. |
Referring to this issue and the |
Yes, sure it would. In fact, we have been discussing a way of passing the Java environment to Oboe so that Oboe can query the relevant audio APIs itself (e.g. for getting the native sample rate for built-in audio devices). Is it easy? No, not really. It's a bit of a pain to pass the JNIEnv to C++ elegantly (should it be cached between calls? should the client have to pass it every time?). I'm planning on hacking on some ideas for it next week. I'll update progress on #116 |
yangwuan55, I did some preliminary practical tests for latency using the AAudio echo demo. What I found was that the latency varied i.e. there was considerable jitter. You can try to reproduce the experiment. Plug a headset with built-in mic into the earphone jack of your android device. Run the AAudio echo sample app. Place one of the headset's earphones near a second android device's built in microphone. Run an audio recorder app on the second android device. Now use the second android device to record some environmental sound (stand some feet away from the devices and snap your fingers to generate pulses). The second android device will hear (and be recording to a file) - your finger snap sound. Meanwhile that same sound will go through the first android device (AAudio echo sample app) and be output on the first android device's earphones. So the second android device will hear your finger snap over the air, and also hear it from the earphones (coming from the first android device). After you have snapped your fingers a few times, take the audio recording that was made by the second android device, and view it in Audacity or some audio editor app. You will be able to see the finger snap pulses, and notice that it occurs in pairs. Measure the time-difference between these pulses. You will notice that sometimes it is as low as 40 milliseconds, but sometimes it can be as large as 100 milliseconds. This difference is also audible - as you use the AAudio echo sample app, you will be able to hear that when it starts off it is quite low latency, but then later as it runs the latency can increase. |
@stereomatch Yes it was,I can only do this work between differents devices and write the result to a property file.At the beginning I thought there has some way(invoke some method) to get the latency,anyway I thanks every body who care about this question. |
@niklasdahlheimer The presentation timestamp is supposed to be the time that the audio signal leaves the android device. So it might be when the sound comes out the speaker or is sent to the headphones. Additional latency may be added by external devices such as USB converters or televisions. Also note that approximately 1 msec is added per foot of air. Also note that timestamps vary in accuracy. We are working to improve the latency. @stereomatch When reporting latency, please include information about the device and Android version. Note that latency can vary from device to device. But once a measurement has stabilized then the latency should not increase. It should stabilize within a second. If the latency continues to increase then that might be a bug, which should be filed separately. This bug is closed. |
yangwuan55 and philburk:
Results were for:
The test setup is as described above - one device is running our app which is using AAudio to echo the microphone to the earphones. A headset (with built-in mic and earphones) is plugged in. When this android device hears a click on the headset mic, that emanates 40ms to 100ms later on the headset earphones. Sometimes that is also picked up by the headset mic again - and thus sometimes there will be a series of clicks emanating from the earphones, diminishing as they repeat. Now we place a 2nd android device that is running an audio recorder app - right next to one of the headset earphones mentioned above. So the 2nd android device is merely recording what a human would hear in the earphones. So you now have:
Now when you make a click sound, that sound is heard by the 1st device headset microphone and emanates on headset earphones 40ms to 100ms later. The second android device records the first click directly (over the air). 40ms to 100ms later when that click emanates from 1st device earphones, that is also heard by the second device audio recorder app. Occasionally the feedback from headset earphone to headset mic will also be significant - and you will have a train of pulses emanating from headset earphones - each pulse diminishing in strength (feedback happening on 1st android headset earphones to headset mic). This means that the 2nd android device audio recorder app will now hear one strong click (over the air), then 40ms to 100ms later it will hear a click from headset earphones, and sometimes it will hear echoes of that (feedback mentioned above). For our purposes, we will take the recording from the 2nd android device, and analyze it in Audacity on a computer. We will measure the time interval between the first click (over the air) and the second click (from earphone). Latency Test Data Here is an example of the click separation as viewed in Audacity for the OnePlus 5T running Oreo 8.1 - you can observe there is some variation in latency from 0.34ms to 0.42ms and one with 104ms: NOTE: each horizontal timeline is one experiment i.e. the impact of a single click and what was heard by 2nd audio device audio recorder app. Each image below has multiple rows - each row is a separate click. Thus each row gives an estimate for latency by measuring interval between first two pulses, and as you compare between rows, you can see that the latency seems to be different each time. Here is the same for the Nexus 4 running Oreo 8.1 - using headset mic and headset - giving about 80ms: But then there were other examples where this varied - from 75ms to 85ms for the Nexus 4: And examples where it varied much more - for Nexus 4 device mic, and headset earphones - about 110ms: The OnePlus 5T also varied a bit - for device mic and headset earphones, it varies from 100ms to 155ms: I recognize these experiments are not conclusive - but the methodology is correct (please correct me if wrong), since the experiment is measuring the end-result (how click separation is heard by end-user on earphones) - the 2nd device audio recorder is standing in for the human observer. The only weakness I can identify in the above method is if the 2nd android device audio recorder app itself has jitter or is not recording all audio frames (which can happen with android audio recorder app). So an improvement would be to use a certified digital audio recorder as the 2nd device (which is known to not lose any audio frames). However, the results of the above experiments are consistent with user experience - I have heard noticeable variation in the latency (the space between click sounds as heard on earphones will often be very close together initially but then a few seconds later it will be much larger i.e. sound like there is now larger latency). This is something that has been reported to me by users as well. So it is a very noticeable effect. Thanks for all the feedback. |
I initially posted a series of incrementally developed explanations for why latency maybe increasing - which is now better explained in the new issue thread below (hopefully it is more understandable): #165 |
@philburk Do you find the above explanation of latency increasing and the experimental data plausible ? If so, is there a way for developers to flush/clear the playback pipeline ? |
To reproduce the latency variation, run AAudio echo sample - choose: Now listen to the earphone speaker with your ear, as you snap your fingers. Each snap will create a pulse train (echoes due to feedback). Initially the pulse train will be very close together, but then 20 seconds later the pulse train period will noticeably seem to be longer. However, a bit later the pulse train period will again become smaller. This suggests the audio latency doesn't just increase monotonically and then become stable. Instead it increases, then is reset to the minimum, then a bit later it is more. This suggests there is resetting going on of the playback pipeline (or something like that) - which is resetting the latency back to low levels. However there is a great deal of variation in it (which means there is probably data loss on playback pipeline as well). I urge AAudio engineers to do this test on their android devices (Oreo 8.1) to see if they can reproduce this issue. |
I want to know how to get the delay for output(write data until it real rendered),or is there any way to get the delay?Can I use the buffersize as the play delay?
I want do echo cancel use the webrtc-apm,but the AudioTrack can't get the right delay(I'm not sure if the delay is too long).
So I want to try oboe.
Thanks for help!!
The text was updated successfully, but these errors were encountered: