Hi,
I just Using xmos.I had a problem with the 3100 that when I used the 1i0o0_I2S_ONLY_48KHz_Cir43 to build the configuration, the delay of the sound from the microphone board input to the horn output was more than 50ms. I realized that this could be a delay for far-end stuff (beamform AEC AGC etc), is my guess correct? If right, I can turn off some functions such as AEC, because I am on-site equipment.
If anyone is willing to answer my question I will be very happy
Wish you a Happy New Year!
			
			
									
							
		time delay about xCORE VOCAL FUSION SPEAKER algorithm Topic is solved
- 
				andy wong
- Member
- Posts: 12
- Joined: Tue Jan 02, 2018 12:04 pm
- 
				infiniteimprobability Verified Verified
- XCore Legend
- Posts: 1177
- Joined: Thu May 27, 2010 10:08 am
Hi,
the total delay for all processing blocks is around 170 milliseconds or 2750ish samples. The block size if 256 and there are 4 such buffers internally which is 64ms before we add processing time which accounts for the remainder. There are a lot of processing stages as well as time to frequency domain conversions and inherently this adds latency.
The AEC and BAP (BAP contains beaformer, ns, agc, other filtering etc.) blocks are designed to work hand in hand. It is not easy to separate them (it's not a simple case of forwarding echo removed samples) and although I am sure it is possible with enough work, is not supported/recommended.
Changing the block size is also not supported and has not been tested - all AVS and other ASR tests have been done in the current configuration. It may not even work due to the block size assumptions.
I assume this also addresses your other post:
http://www.xcore.com/viewtopic.php?f=3&t=6297
			
			
									
										the total delay for all processing blocks is around 170 milliseconds or 2750ish samples. The block size if 256 and there are 4 such buffers internally which is 64ms before we add processing time which accounts for the remainder. There are a lot of processing stages as well as time to frequency domain conversions and inherently this adds latency.
The AEC and BAP (BAP contains beaformer, ns, agc, other filtering etc.) blocks are designed to work hand in hand. It is not easy to separate them (it's not a simple case of forwarding echo removed samples) and although I am sure it is possible with enough work, is not supported/recommended.
Changing the block size is also not supported and has not been tested - all AVS and other ASR tests have been done in the current configuration. It may not even work due to the block size assumptions.
I assume this also addresses your other post:
http://www.xcore.com/viewtopic.php?f=3&t=6297
Engineer at XMOS
						- 
				andy wong
- Member
- Posts: 12
- Joined: Tue Jan 02, 2018 12:04 pm
My friend, thank you very much~
Because you say that separating AEC and BAP is possible, so I'm going to give it a try. Can you give me some suggestions like how to get started? My current practice is to change SMARTHOME as a try.
you can ignore my other post,I'm sorry to bother you。
			
			
									
										
						Because you say that separating AEC and BAP is possible, so I'm going to give it a try. Can you give me some suggestions like how to get started? My current practice is to change SMARTHOME as a try.
you can ignore my other post,I'm sorry to bother you。
- 
				infiniteimprobability Verified Verified
- XCore Legend
- Posts: 1177
- Joined: Thu May 27, 2010 10:08 am
CORRECTION
The latency of the Vocalfusion processing is actually in the order of 50ms. Apologies to anyone who took the incorrect number above - my mistake.
Even adding lib_mic_array (1.125ms) and some upsampling to 48k + USB full speed delays, mic to host it will be comfortably below 60ms.
			
			
									
										The latency of the Vocalfusion processing is actually in the order of 50ms. Apologies to anyone who took the incorrect number above - my mistake.
Even adding lib_mic_array (1.125ms) and some upsampling to 48k + USB full speed delays, mic to host it will be comfortably below 60ms.
Engineer at XMOS
						