Hmm this might be slightly off topic but,
I agree with Lilltroll about not needing as much memory because of the way Xmos XS1 is architected, particularly for multicore chips. But there are a few simple things that could help:
1) Ability to be able to dynamically
load modules, rather than having to statically load everything into the image. This could make an enormous difference in some case where only some of the functions are needed at any time.
2) Adding mobile features to XC as described by the
XOSIG thread, would also go a long way and would likely eradicate the need for any OS. It also lowers the inter core boundaries for memory, and thus can make more effective use of XS1 muticore distributed design.
3) Generous internal fast read memory mapped Flash for storing code. That would mean code run directly from flash, freeing valuable SRAM for other data intensive functions and applications.
4) Larger SRAM options, the L1's are a little tight given their performance compared to high end MCUs.
5) All I/O pins should have any function (1,4/4l,8,16,32bit), much like an FPGA including conversion to memory mapped address buses.
Obviously it gets tougher as one moves down that wish list!!
regards
Al