time delay about xCORE VOCAL FUSION SPEAKER algorithm Topic is solved

Voice related projects and technical discussions
Post Reply
andy wong
Member
Posts: 12
Joined: Tue Jan 02, 2018 12:04 pm

time delay about xCORE VOCAL FUSION SPEAKER algorithm

Post by andy wong »

Hi,
I just Using xmos.I had a problem with the 3100 that when I used the 1i0o0_I2S_ONLY_48KHz_Cir43 to build the configuration, the delay of the sound from the microphone board input to the horn output was more than 50ms. I realized that this could be a delay for far-end stuff (beamform AEC AGC etc), is my guess correct? If right, I can turn off some functions such as AEC, because I am on-site equipment.
If anyone is willing to answer my question I will be very happy
Wish you a Happy New Year!


View Solution
User avatar
infiniteimprobability
XCore Legend
Posts: 1126
Joined: Thu May 27, 2010 10:08 am
Contact:

Post by infiniteimprobability »

Hi,
the total delay for all processing blocks is around 170 milliseconds or 2750ish samples. The block size if 256 and there are 4 such buffers internally which is 64ms before we add processing time which accounts for the remainder. There are a lot of processing stages as well as time to frequency domain conversions and inherently this adds latency.

The AEC and BAP (BAP contains beaformer, ns, agc, other filtering etc.) blocks are designed to work hand in hand. It is not easy to separate them (it's not a simple case of forwarding echo removed samples) and although I am sure it is possible with enough work, is not supported/recommended.

Changing the block size is also not supported and has not been tested - all AVS and other ASR tests have been done in the current configuration. It may not even work due to the block size assumptions.

I assume this also addresses your other post:
http://www.xcore.com/viewtopic.php?f=3&t=6297
andy wong
Member
Posts: 12
Joined: Tue Jan 02, 2018 12:04 pm

Post by andy wong »

My friend, thank you very much~

Because you say that separating AEC and BAP is possible, so I'm going to give it a try. Can you give me some suggestions like how to get started? My current practice is to change SMARTHOME as a try.

you can ignore my other post,I'm sorry to bother you。
User avatar
infiniteimprobability
XCore Legend
Posts: 1126
Joined: Thu May 27, 2010 10:08 am
Contact:

Post by infiniteimprobability »

CORRECTION

The latency of the Vocalfusion processing is actually in the order of 50ms. Apologies to anyone who took the incorrect number above - my mistake.
Even adding lib_mic_array (1.125ms) and some upsampling to 48k + USB full speed delays, mic to host it will be comfortably below 60ms.
Post Reply